Literature DB >> 35561178

EDClust: an EM-MM hybrid method for cell clustering in multiple-subject single-cell RNA sequencing.

Xin Wei1, Ziyi Li2, Hongkai Ji3, Hao Wu1.   

Abstract

MOTIVATION: Single-cell RNA sequencing (scRNA-seq) has revolutionized biological research by enabling the measurement of transcriptomic profiles at the single-cell level. With the increasing application of scRNA-seq in larger-scale studies, the problem of appropriately clustering cells emerges when the scRNA-seq data are from multiple subjects. One challenge is the subject-specific variation; systematic heterogeneity from multiple subjects may have a significant impact on clustering accuracy. Existing methods seeking to address such effects suffer from several limitations.
RESULTS: We develop a novel statistical method, EDClust, for multi-subject scRNA-seq cell clustering. EDClust models the sequence read counts by a mixture of Dirichlet-multinomial distributions and explicitly accounts for cell-type heterogeneity, subject heterogeneity and clustering uncertainty. An EM-MM hybrid algorithm is derived for maximizing the data likelihood and clustering the cells. We perform a series of simulation studies to evaluate the proposed method and demonstrate the outstanding performance of EDClust. Comprehensive benchmarking on four real scRNA-seq datasets with various tissue types and species demonstrates the substantial accuracy improvement of EDClust compared to existing methods.
AVAILABILITY AND IMPLEMENTATION: The R package is freely available at https://github.com/weix21/EDClust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2022        PMID: 35561178      PMCID: PMC9113373          DOI: 10.1093/bioinformatics/btac168

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


  37 in total

Review 1.  Estimating F-statistics.

Authors:  B S Weir; W G Hill
Journal:  Annu Rev Genet       Date:  2002-06-11       Impact factor: 16.830

2.  Counting absolute numbers of molecules using unique molecular identifiers.

Authors:  Teemu Kivioja; Anna Vähärautio; Kasper Karlsson; Martin Bonke; Martin Enge; Sten Linnarsson; Jussi Taipale
Journal:  Nat Methods       Date:  2011-11-20       Impact factor: 28.547

3.  MM Algorithms for Some Discrete Multivariate Distributions.

Authors:  Hua Zhou; Kenneth Lange
Journal:  J Comput Graph Stat       Date:  2010-09-01       Impact factor: 2.302

4.  Smart-seq2 for sensitive full-length transcriptome profiling in single cells.

Authors:  Simone Picelli; Åsa K Björklund; Omid R Faridani; Sven Sagasser; Gösta Winberg; Rickard Sandberg
Journal:  Nat Methods       Date:  2013-09-22       Impact factor: 28.547

5.  Single-Cell Multi-omic Integration Compares and Contrasts Features of Brain Cell Identity.

Authors:  Joshua D Welch; Velina Kozareva; Ashley Ferreira; Charles Vanderburg; Carly Martin; Evan Z Macosko
Journal:  Cell       Date:  2019-06-06       Impact factor: 41.582

6.  InDrops and Drop-seq technologies for single-cell sequencing.

Authors:  Allon M Klein; Evan Macosko
Journal:  Lab Chip       Date:  2017-07-25       Impact factor: 6.799

7.  A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure.

Authors:  Maayan Baron; Adrian Veres; Samuel L Wolock; Aubrey L Faust; Renaud Gaujoux; Amedeo Vetere; Jennifer Hyoje Ryu; Bridget K Wagner; Shai S Shen-Orr; Allon M Klein; Douglas A Melton; Itai Yanai
Journal:  Cell Syst       Date:  2016-09-22       Impact factor: 10.304

8.  Single-cell analysis and sorting using droplet-based microfluidics.

Authors:  Linas Mazutis; John Gilbert; W Lloyd Ung; David A Weitz; Andrew D Griffiths; John A Heyman
Journal:  Nat Protoc       Date:  2013-04-04       Impact factor: 13.491

9.  Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.

Authors:  Laleh Haghverdi; Aaron T L Lun; Michael D Morgan; John C Marioni
Journal:  Nat Biotechnol       Date:  2018-04-02       Impact factor: 54.908

10.  An integrative Bayesian Dirichlet-multinomial regression model for the analysis of taxonomic abundances in microbiome data.

Authors:  W Duncan Wadsworth; Raffaele Argiento; Michele Guindani; Jessica Galloway-Pena; Samuel A Shelburne; Marina Vannucci
Journal:  BMC Bioinformatics       Date:  2017-02-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.