| Literature DB >> 30674886 |
Gökcen Eraslan1,2, Lukas M Simon1, Maria Mircea1, Nikola S Mueller1, Fabian J Theis3,4,5.
Abstract
Single-cell RNA sequencing (scRNA-seq) has enabled researchers to study gene expression at a cellular resolution. However, noise due to amplification and dropout may obstruct analyses, so scalable denoising methods for increasingly large but sparse scRNA-seq data are needed. We propose a deep count autoencoder network (DCA) to denoise scRNA-seq datasets. DCA takes the count distribution, overdispersion and sparsity of the data into account using a negative binomial noise model with or without zero-inflation, and nonlinear gene-gene dependencies are captured. Our method scales linearly with the number of cells and can, therefore, be applied to datasets of millions of cells. We demonstrate that DCA denoising improves a diverse set of typical scRNA-seq data analyses using simulated and real datasets. DCA outperforms existing methods for data imputation in quality and speed, enhancing biological discovery.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30674886 PMCID: PMC6344535 DOI: 10.1038/s41467-018-07931-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919