Literature DB >> 35325552

Correlation Imputation for Single-Cell RNA-seq.

Luqin Gan1, Giuseppe Vinci2, Genevera I Allen1,3,4,5.   

Abstract

Recent advances in single-cell RNA sequencing (scRNA-seq) technologies have yielded a powerful tool to measure gene expression of individual cells. One major challenge of the scRNA-seq data is that it usually contains a large amount of zero expression values, which often impairs the effectiveness of downstream analyses. Numerous data imputation methods have been proposed to deal with these "dropout" events, but this is a difficult task for such high-dimensional and sparse data. Furthermore, there have been debates on the nature of the sparsity, about whether the zeros are due to technological limitations or represent actual biology. To address these challenges, we propose Single-cell RNA-seq Correlation completion by ENsemble learning and Auxiliary information (SCENA), a novel approach that imputes the correlation matrix of the data of interest instead of the data itself. SCENA obtains a gene-by-gene correlation estimate by ensembling various individual estimates, some of which are based on known auxiliary information about gene expression networks. Our approach is a reliable method that makes no assumptions on the nature of sparsity in scRNA-seq data or the data distribution. By extensive simulation studies and real data applications, we demonstrate that SCENA is not only superior in gene correlation estimation, but also improves the accuracy and reliability of downstream analyses, including cell clustering, dimension reduction, and graphical model estimation to learn the gene expression network.

Entities:  

Keywords:  auxiliary information; clustering; correlation completion; dimension reduction; ensemble learning; graphical modeling; imputation; single-cell RNA-sequencing

Mesh:

Year:  2022        PMID: 35325552      PMCID: PMC9125575          DOI: 10.1089/cmb.2021.0403

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.549


  35 in total

1.  Matrix Completion and Low-Rank SVD via Fast Alternating Least Squares.

Authors:  Trevor Hastie; Rahul Mazumder; Jason D Lee; Reza Zadeh
Journal:  J Mach Learn Res       Date:  2015       Impact factor: 3.654

Review 2.  Gene regulatory network inference: data integration in dynamic models-a review.

Authors:  Michael Hecker; Sandro Lambeck; Susanne Toepfer; Eugene van Someren; Reinhard Guthke
Journal:  Biosystems       Date:  2008-12-27       Impact factor: 1.973

3.  scRMD: Imputation for single cell RNA-seq data via robust matrix decomposition.

Authors:  Chong Chen; Changjing Wu; Linjie Wu; Xiaochen Wang; Minghua Deng; Ruibin Xi
Journal:  Bioinformatics       Date:  2020-03-02       Impact factor: 6.937

4.  A UNIFIED STATISTICAL FRAMEWORK FOR SINGLE CELL AND BULK RNA SEQUENCING DATA.

Authors:  Lingxue Zhu; Jing Lei; Bernie Devlin; Kathryn Roeder
Journal:  Ann Appl Stat       Date:  2018-03-09       Impact factor: 2.083

5.  Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data.

Authors:  Shouguo Gao; Xujing Wang
Journal:  BMC Bioinformatics       Date:  2011-08-31       Impact factor: 3.169

6.  Gene Network Reconstruction by Integration of Prior Biological Knowledge.

Authors:  Yupeng Li; Scott A Jackson
Journal:  G3 (Bethesda)       Date:  2015-03-30       Impact factor: 3.154

7.  Targeted disruption of DNMT1, DNMT3A and DNMT3B in human embryonic stem cells.

Authors:  Jing Liao; Rahul Karnik; Hongcang Gu; Michael J Ziller; Kendell Clement; Alexander M Tsankov; Veronika Akopian; Casey A Gifford; Julie Donaghey; Christina Galonska; Ramona Pop; Deepak Reyon; Shengdar Q Tsai; William Mallard; J Keith Joung; John L Rinn; Andreas Gnirke; Alexander Meissner
Journal:  Nat Genet       Date:  2015-03-30       Impact factor: 38.330

8.  A survey of human brain transcriptome diversity at the single cell level.

Authors:  Spyros Darmanis; Steven A Sloan; Ye Zhang; Martin Enge; Christine Caneda; Lawrence M Shuer; Melanie G Hayden Gephart; Ben A Barres; Stephen R Quake
Journal:  Proc Natl Acad Sci U S A       Date:  2015-05-18       Impact factor: 11.205

9.  Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model.

Authors:  F William Townes; Stephanie C Hicks; Martin J Aryee; Rafael A Irizarry
Journal:  Genome Biol       Date:  2019-12-23       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.