Literature DB >> 31009065

Detection of differentially expressed genes in discrete single-cell RNA sequencing data using a hurdle model with correlated random effects.

Michael Sekula1, Jeremy Gaskins1, Susmita Datta2.   

Abstract

Single-cell RNA sequencing (scRNA-seq) technologies are revolutionary tools allowing researchers to examine gene expression at the level of a single cell. Traditionally, transcriptomic data have been analyzed from bulk samples, masking the heterogeneity now seen across individual cells. Even within the same cellular population, genes can be highly expressed in some cells but not expressed (or lowly expressed) in others. Therefore, the computational approaches used to analyze bulk RNA sequencing data are not appropriate for the analysis of scRNA-seq data. Here, we present a novel statistical model for high dimensional and zero-inflated scRNA-seq count data to identify differentially expressed (DE) genes across cell types. Correlated random effects are employed based on an initial clustering of cells to capture the cell-to-cell variability within treatment groups. Moreover, this model is flexible and can be easily adapted to an independent random effect structure if needed. We apply our proposed methodology to both simulated and real data and compare results to other popular methods designed for detecting DE genes. Due to the hurdle model's ability to detect differences in the proportion of cells expressed and the average expression level (among the expressed cells), our methods naturally identify some genes as DE that other methods do not, and we demonstrate with real data that these uniquely detected genes are associated with similar biological processes and functions.
© 2019 The International Biometric Society.

Keywords:  RNA sequencing; differential expression; single-cell; variational inference

Year:  2019        PMID: 31009065     DOI: 10.1111/biom.13074

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  6 in total

1.  Differential expression of single-cell RNA-seq data using Tweedie models.

Authors:  Himel Mallick; Suvo Chatterjee; Shrabanti Chowdhury; Saptarshi Chatterjee; Ali Rahnavard; Stephanie C Hicks
Journal:  Stat Med       Date:  2022-06-02       Impact factor: 2.497

2.  SAREV: A review on statistical analytics of single-cell RNA sequencing data.

Authors:  Dorothy Ellis; Dongyuan Wu; Susmita Datta
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2021-05-20

3.  Single-Cell Differential Network Analysis with Sparse Bayesian Factor Models.

Authors:  Michael Sekula; Jeremy Gaskins; Susmita Datta
Journal:  Front Genet       Date:  2022-02-04       Impact factor: 4.599

Review 4.  Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.

Authors:  Samarendra Das; Anil Rai; Shesh N Rai
Journal:  Entropy (Basel)       Date:  2022-07-18       Impact factor: 2.738

5.  Combinatorial and statistical prediction of gene expression from haplotype sequence.

Authors:  Berk A Alpay; Pinar Demetci; Sorin Istrail; Derek Aguiar
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

6.  A Comprehensive Survey of Statistical Approaches for Differential Expression Analysis in Single-Cell RNA Sequencing Studies.

Authors:  Samarendra Das; Anil Rai; Michael L Merchant; Matthew C Cave; Shesh N Rai
Journal:  Genes (Basel)       Date:  2021-12-02       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.