Literature DB >> 32989764

TWO-SIGMA: A novel two-component single cell model-based association method for single-cell RNA-seq data.

Eric Van Buren1, Ming Hu2, Chen Weng3, Fulai Jin3, Yan Li3, Di Wu1,4, Yun Li1,5,6.   

Abstract

In this paper, we develop TWO-SIGMA, a TWO-component SInGle cell Model-based Association method for differential expression (DE) analyses in single-cell RNA-seq (scRNA-seq) data. The first component models the probability of "drop-out" with a mixed-effects logistic regression model and the second component models the (conditional) mean expression with a mixed-effects negative binomial regression model. TWO-SIGMA is extremely flexible in that it: (i) does not require a log-transformation of the outcome, (ii) allows for overdispersed and zero-inflated counts, (iii) accommodates a correlation structure between cells from the same individual via random effect terms, (iv) can analyze unbalanced designs (in which the number of cells does not need to be identical for all samples), (v) can control for additional sample-level and cell-level covariates including batch effects, (vi) provides interpretable effect size estimates, and (vii) enables general tests of DE beyond two-group comparisons. To our knowledge, TWO-SIGMA is the only method for analyzing scRNA-seq data that can simultaneously accomplish each of these features. Simulations studies show that TWO-SIGMA outperforms alternative regression-based approaches in both type-I error control and power enhancement when the data contains even moderate within-sample correlation. A real data analysis using pancreas islet single-cells exhibits the flexibility of TWO-SIGMA and demonstrates that incorrectly failing to include random effect terms can have dramatic impacts on scientific conclusions. TWO-SIGMA is implemented in the R package twosigma available at https://github.com/edvanburen/twosigma.
© 2020 Wiley Periodicals LLC.

Entities:  

Keywords:  differential expression; random effects model; single-cell RNA sequencing; zero-inflated model

Mesh:

Year:  2020        PMID: 32989764      PMCID: PMC8570615          DOI: 10.1002/gepi.22361

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  21 in total

1.  Zero-inflated Poisson and binomial regression with random effects: a case study.

Authors:  D B Hall
Journal:  Biometrics       Date:  2000-12       Impact factor: 2.571

2.  Bias, robustness and scalability in single-cell differential expression analysis.

Authors:  Charlotte Soneson; Mark D Robinson
Journal:  Nat Methods       Date:  2018-02-26       Impact factor: 28.547

3.  Marginal mean models for zero-inflated count data.

Authors:  David Todem; KyungMann Kim; Wei-Wen Hsu
Journal:  Biometrics       Date:  2016-02-17       Impact factor: 2.571

Review 4.  Review and recommendations for zero-inflated count regression modeling of dental caries indices in epidemiological studies.

Authors:  J S Preisser; J W Stamm; D L Long; M E Kincade
Journal:  Caries Res       Date:  2012-06-15       Impact factor: 4.056

5.  Single-cell gene expression analysis reveals genetic associations masked in whole-tissue experiments.

Authors:  Quin F Wills; Kenneth J Livak; Alex J Tipping; Tariq Enver; Andrew J Goldson; Darren W Sexton; Chris Holmes
Journal:  Nat Biotechnol       Date:  2013-07-21       Impact factor: 54.908

6.  Bayesian approach to single-cell differential expression analysis.

Authors:  Peter V Kharchenko; Lev Silberstein; David T Scadden
Journal:  Nat Methods       Date:  2014-05-18       Impact factor: 28.547

7.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

8.  Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model.

Authors:  F William Townes; Stephanie C Hicks; Martin J Aryee; Rafael A Irizarry
Journal:  Genome Biol       Date:  2019-12-23       Impact factor: 13.583

9.  ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis.

Authors:  Emma Pierson; Christopher Yau
Journal:  Genome Biol       Date:  2015-11-02       Impact factor: 13.583

10.  Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.

Authors:  Koen Van den Berge; Fanny Perraudeau; Charlotte Soneson; Michael I Love; Davide Risso; Jean-Philippe Vert; Mark D Robinson; Sandrine Dudoit; Lieven Clement
Journal:  Genome Biol       Date:  2018-02-26       Impact factor: 13.583

View more
  3 in total

1.  Differential expression of single-cell RNA-seq data using Tweedie models.

Authors:  Himel Mallick; Suvo Chatterjee; Shrabanti Chowdhury; Saptarshi Chatterjee; Ali Rahnavard; Stephanie C Hicks
Journal:  Stat Med       Date:  2022-06-02       Impact factor: 2.497

2.  TWO-SIGMA-G: a new competitive gene set testing framework for scRNA-seq data accounting for inter-gene and cell-cell correlation.

Authors:  Eric Van Buren; Ming Hu; Liang Cheng; John Wrobel; Kirk Wilhelmsen; Lishan Su; Yun Li; Di Wu
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

Review 3.  Understanding the function of regulatory DNA interactions in the interpretation of non-coding GWAS variants.

Authors:  Wujuan Zhong; Weifang Liu; Jiawen Chen; Quan Sun; Ming Hu; Yun Li
Journal:  Front Cell Dev Biol       Date:  2022-08-19
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.