Literature DB >> 35656596

Differential expression of single-cell RNA-seq data using Tweedie models.

Himel Mallick1, Suvo Chatterjee2, Shrabanti Chowdhury3, Saptarshi Chatterjee4, Ali Rahnavard5, Stephanie C Hicks6.   

Abstract

The performance of computational methods and software to identify differentially expressed features in single-cell RNA-sequencing (scRNA-seq) has been shown to be influenced by several factors, including the choice of the normalization method used and the choice of the experimental platform (or library preparation protocol) to profile gene expression in individual cells. Currently, it is up to the practitioner to choose the most appropriate differential expression (DE) method out of over 100 DE tools available to date, each relying on their own assumptions to model scRNA-seq expression features. To model the technological variability in cross-platform scRNA-seq data, here we propose to use Tweedie generalized linear models that can flexibly capture a large dynamic range of observed scRNA-seq expression profiles across experimental platforms induced by platform- and gene-specific statistical properties such as heavy tails, sparsity, and gene expression distributions. We also propose a zero-inflated Tweedie model that allows zero probability mass to exceed a traditional Tweedie distribution to model zero-inflated scRNA-seq data with excessive zero counts. Using both synthetic and published plate- and droplet-based scRNA-seq datasets, we perform a systematic benchmark evaluation of more than 10 representative DE methods and demonstrate that our method (Tweedieverse) outperforms the state-of-the-art DE approaches across experimental platforms in terms of statistical power and false discovery rate control. Our open-source software (R/Bioconductor package) is available at https://github.com/himelmallick/Tweedieverse.
© 2022 John Wiley & Sons Ltd. This article has been contributed to by U.S. Government employees and their work is in the public domain in the USA.

Entities:  

Keywords:  Tweedie distribution; differential expression; exponential dispersion model; generalized linear model; single-cell RNA-sequencing; zero-inflation

Mesh:

Year:  2022        PMID: 35656596      PMCID: PMC9288986          DOI: 10.1002/sim.9430

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.497


  70 in total

1.  Full-length RNA-seq from single cells using Smart-seq2.

Authors:  Simone Picelli; Omid R Faridani; Asa K Björklund; Gösta Winberg; Sven Sagasser; Rickard Sandberg
Journal:  Nat Protoc       Date:  2014-01-02       Impact factor: 13.491

2.  Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq.

Authors:  Saiful Islam; Una Kjällquist; Annalena Moliner; Pawel Zajac; Jian-Bing Fan; Peter Lönnerberg; Sten Linnarsson
Journal:  Genome Res       Date:  2011-05-04       Impact factor: 9.043

3.  Differential Expression Analysis in Single-Cell Transcriptomics.

Authors:  Luca Alessandrì; Maddalena Arigoni; Raffaele Calogero
Journal:  Methods Mol Biol       Date:  2019

4.  UMI or not UMI, that is the question for scRNA-seq zero-inflation.

Authors:  Yingying Cao; Simo Kitanovski; Ralf Küppers; Daniel Hoffmann
Journal:  Nat Biotechnol       Date:  2021-02-01       Impact factor: 54.908

5.  Bias, robustness and scalability in single-cell differential expression analysis.

Authors:  Charlotte Soneson; Mark D Robinson
Journal:  Nat Methods       Date:  2018-02-26       Impact factor: 28.547

6.  Marginalized zero-inflated negative binomial regression with application to dental caries.

Authors:  John S Preisser; Kalyan Das; D Leann Long; Kimon Divaris
Journal:  Stat Med       Date:  2015-11-15       Impact factor: 2.373

7.  TWO-SIGMA: A novel two-component single cell model-based association method for single-cell RNA-seq data.

Authors:  Eric Van Buren; Ming Hu; Chen Weng; Fulai Jin; Yan Li; Di Wu; Yun Li
Journal:  Genet Epidemiol       Date:  2020-09-29       Impact factor: 2.135

8.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

9.  Single-Cell RNA-Seq Reveals Lineage and X Chromosome Dynamics in Human Preimplantation Embryos.

Authors:  Sophie Petropoulos; Daniel Edsgärd; Björn Reinius; Qiaolin Deng; Sarita Pauliina Panula; Simone Codeluppi; Alvaro Plaza Reyes; Sten Linnarsson; Rickard Sandberg; Fredrik Lanner
Journal:  Cell       Date:  2016-04-07       Impact factor: 41.582

10.  Splatter: simulation of single-cell RNA sequencing data.

Authors:  Luke Zappia; Belinda Phipson; Alicia Oshlack
Journal:  Genome Biol       Date:  2017-09-12       Impact factor: 13.583

View more
  2 in total

1.  Metabolite, protein, and tissue dysfunction associated with COVID-19 disease severity.

Authors:  Ali Rahnavard; Brendan Mann; Abhigya Giri; Ranojoy Chatterjee; Keith A Crandall
Journal:  Sci Rep       Date:  2022-07-16       Impact factor: 4.996

Review 2.  Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.

Authors:  Samarendra Das; Anil Rai; Shesh N Rai
Journal:  Entropy (Basel)       Date:  2022-07-18       Impact factor: 2.738

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.