Literature DB >> 31028141

scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets.

Yingxin Lin1, Shila Ghazanfar1,2, Kevin Y X Wang1, Johann A Gagnon-Bartsch3, Kitty K Lo1, Xianbin Su4,5, Ze-Guang Han4,5, John T Ormerod1, Terence P Speed6,7, Pengyi Yang8,2, Jean Yee Hwa Yang8,2.   

Abstract

Concerted examination of multiple collections of single-cell RNA sequencing (RNA-seq) data promises further biological insights that cannot be uncovered with individual datasets. Here we present scMerge, an algorithm that integrates multiple single-cell RNA-seq datasets using factor analysis of stably expressed genes and pseudoreplicates across datasets. Using a large collection of public datasets, we benchmark scMerge against published methods and demonstrate that it consistently provides improved cell type separation by removing unwanted factors; scMerge can also enhance biological discovery through robust data integration, which we show through the inference of development trajectory in a liver dataset collection.

Keywords:  data integration; factor analysis; normalization; pseudoreplications; single-cell RNA-seq data

Year:  2019        PMID: 31028141      PMCID: PMC6525515          DOI: 10.1073/pnas.1820006116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  50 in total

1.  Human housekeeping genes are compact.

Authors:  Eli Eisenberg; Erez Y Levanon
Journal:  Trends Genet       Date:  2003-07       Impact factor: 11.639

2.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

3.  Finding community structure in very large networks.

Authors:  Aaron Clauset; M E J Newman; Cristopher Moore
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-12-06

4.  Adjusting batch effects in microarray expression data using empirical Bayes methods.

Authors:  W Evan Johnson; Cheng Li; Ariel Rabinovic
Journal:  Biostatistics       Date:  2006-04-21       Impact factor: 5.899

5.  Mammalian genes are transcribed with widely different bursting kinetics.

Authors:  David M Suter; Nacho Molina; David Gatfield; Kim Schneider; Ueli Schibler; Felix Naef
Journal:  Science       Date:  2011-03-17       Impact factor: 47.728

Review 6.  From a common progenitor to distinct liver epithelial phenotypes.

Authors:  Anne Müsch
Journal:  Curr Opin Cell Biol       Date:  2018-03-02       Impact factor: 8.382

7.  Human housekeeping genes, revisited.

Authors:  Eli Eisenberg; Erez Y Levanon
Journal:  Trends Genet       Date:  2013-06-27       Impact factor: 11.639

8.  Single-cell transcriptomics reveals receptor transformations during olfactory neurogenesis.

Authors:  Naresh K Hanchate; Kunio Kondoh; Zhonghua Lu; Donghui Kuang; Xiaolan Ye; Xiaojie Qiu; Lior Pachter; Cole Trapnell; Linda B Buck
Journal:  Science       Date:  2015-11-05       Impact factor: 47.728

9.  Integrated single cell data analysis reveals cell specific networks and novel coactivation markers.

Authors:  Shila Ghazanfar; Adam J Bisogni; John T Ormerod; David M Lin; Jean Y H Yang
Journal:  BMC Syst Biol       Date:  2016-12-05

10.  SCnorm: robust normalization of single-cell RNA-seq data.

Authors:  Rhonda Bacher; Li-Fang Chu; Ning Leng; Audrey P Gasch; James A Thomson; Ron M Stewart; Michael Newton; Christina Kendziorski
Journal:  Nat Methods       Date:  2017-04-17       Impact factor: 28.547

View more
  38 in total

1.  Investigating higher-order interactions in single-cell data with scHOT.

Authors:  John C Marioni; Jean Yee Hwa Yang; Shila Ghazanfar; Yingxin Lin; Xianbin Su; David Ming Lin; Ellis Patrick; Ze-Guang Han
Journal:  Nat Methods       Date:  2020-07-13       Impact factor: 28.547

2.  Evaluating stably expressed genes in single cells.

Authors:  Yingxin Lin; Shila Ghazanfar; Dario Strbenac; Andy Wang; Ellis Patrick; David M Lin; Terence Speed; Jean Y H Yang; Pengyi Yang
Journal:  Gigascience       Date:  2019-09-01       Impact factor: 6.524

3.  Unbiased integration of single cell transcriptome replicates.

Authors:  Martin Loza; Shunsuke Teraguchi; Daron M Standley; Diego Diez
Journal:  NAR Genom Bioinform       Date:  2022-03-15

4.  RUV-III-NB: normalization of single cell RNA-seq data.

Authors:  Agus Salim; Ramyar Molania; Jianan Wang; Alysha De Livera; Rachel Thijssen; Terence P Speed
Journal:  Nucleic Acids Res       Date:  2022-09-09       Impact factor: 19.160

5.  scFeatures: multi-view representations of single-cell and spatial data for disease outcome prediction.

Authors:  Yue Cao; Yingxin Lin; Ellis Patrick; Pengyi Yang; Jean Yee Hwa Yang
Journal:  Bioinformatics       Date:  2022-10-14       Impact factor: 6.931

6.  Robust integration of multiple single-cell RNA sequencing datasets using a single reference space.

Authors:  Yang Liu; Tao Wang; Bin Zhou; Deyou Zheng
Journal:  Nat Biotechnol       Date:  2021-03-25       Impact factor: 54.908

Review 7.  Single-Cell RNA-Seq Technologies and Computational Analysis Tools: Application in Cancer Research.

Authors:  Qianqian Song; Liang Liu
Journal:  Methods Mol Biol       Date:  2022

Review 8.  Orchestrating single-cell analysis with Bioconductor.

Authors:  Robert A Amezquita; Aaron T L Lun; Etienne Becht; Vince J Carey; Lindsay N Carpp; Ludwig Geistlinger; Federico Marini; Kevin Rue-Albrecht; Davide Risso; Charlotte Soneson; Levi Waldron; Hervé Pagès; Mike L Smith; Wolfgang Huber; Martin Morgan; Raphael Gottardo; Stephanie C Hicks
Journal:  Nat Methods       Date:  2019-12-02       Impact factor: 28.547

9.  Pseudoreplication in genomic-scale data sets.

Authors:  Robin S Waples; Ryan K Waples; Eric J Ward
Journal:  Mol Ecol Resour       Date:  2021-09-07       Impact factor: 8.678

10.  FR-Match: robust matching of cell type clusters from single cell RNA sequencing data using the Friedman-Rafsky non-parametric test.

Authors:  Yun Zhang; Brian D Aevermann; Trygve E Bakken; Jeremy A Miller; Rebecca D Hodge; Ed S Lein; Richard H Scheuermann
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 13.994

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.