Literature DB >> 31243426

Machine learning and statistical methods for clustering single-cell RNA-sequencing data.

Raphael Petegrosso1, Zhuliu Li1, Rui Kuang1.   

Abstract

Single-cell RNAsequencing (scRNA-seq) technologies have enabled the large-scale whole-transcriptome profiling of each individual single cell in a cell population. A core analysis of the scRNA-seq transcriptome profiles is to cluster the single cells to reveal cell subtypes and infer cell lineages based on the relations among the cells. This article reviews the machine learning and statistical methods for clustering scRNA-seq transcriptomes developed in the past few years. The review focuses on how conventional clustering techniques such as hierarchical clustering, graph-based clustering, mixture models, $k$-means, ensemble learning, neural networks and density-based clustering are modified or customized to tackle the unique challenges in scRNA-seq data analysis, such as the dropout of low-expression genes, low and uneven read coverage of transcripts, highly variable total mRNAs from single cells and ambiguous cell markers in the presence of technical biases and irrelevant confounding biological variations. We review how cell-specific normalization, the imputation of dropouts and dimension reduction methods can be applied with new statistical or optimization strategies to improve the clustering of single cells. We will also introduce those more advanced approaches to cluster scRNA-seq transcriptomes in time series data and multiple cell populations and to detect rare cell types. Several software packages developed to support the cluster analysis of scRNA-seq data are also reviewed and experimentally compared to evaluate their performance and efficiency. Finally, we conclude with useful observations and possible future directions in scRNA-seq data analytics. AVAILABILITY: All the source code and data are available at https://github.com/kuanglab/single-cell-review.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  clustering; machine learning; scRNA sequencing; single-cell technology

Year:  2019        PMID: 31243426     DOI: 10.1093/bib/bbz063

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  19 in total

1.  A comprehensive comparison of supervised and unsupervised methods for cell type identification in single-cell RNA-seq.

Authors:  Xiaobo Sun; Xiaochu Lin; Ziyi Li; Hao Wu
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  Deep learning tackles single-cell analysis-a survey of deep learning for scRNA-seq analysis.

Authors:  Mario Flores; Zhentao Liu; Tinghe Zhang; Md Musaddaqui Hasib; Yu-Chiao Chiu; Zhenqing Ye; Karla Paniagua; Sumin Jo; Jianqiu Zhang; Shou-Jiang Gao; Yu-Fang Jin; Yidong Chen; Yufei Huang
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

3.  Consensus clustering of single-cell RNA-seq data by enhancing network affinity.

Authors:  Yaxuan Cui; Shaoqiang Zhang; Ying Liang; Xiangyun Wang; Thomas N Ferraro; Yong Chen
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

4.  Dirichlet process mixture models for single-cell RNA-seq clustering.

Authors:  Nigatu A Adossa; Kalle T Rytkönen; Laura L Elo
Journal:  Biol Open       Date:  2022-04-04       Impact factor: 2.422

5.  scPADGRN: A preconditioned ADMM approach for reconstructing dynamic gene regulatory network using single-cell RNA sequencing data.

Authors:  Xiao Zheng; Yuan Huang; Xiufen Zou
Journal:  PLoS Comput Biol       Date:  2020-07-27       Impact factor: 4.475

6.  Clinical Influencing Factors of Acute Myocardial Infarction Based on Improved Machine Learning.

Authors:  Hongwei Du; Linxing Feng; Yan Xu; Enbo Zhan; Wei Xu
Journal:  J Healthc Eng       Date:  2021-03-27       Impact factor: 2.682

Review 7.  A Sight on Single-Cell Transcriptomics in Plants Through the Prism of Cell-Based Computational Modeling Approaches: Benefits and Challenges for Data Analysis.

Authors:  Aleksandr Bobrovskikh; Alexey Doroshkov; Stefano Mazzoleni; Fabrizio Cartenì; Francesco Giannino; Ulyana Zubairova
Journal:  Front Genet       Date:  2021-05-21       Impact factor: 4.599

Review 8.  Single-cell biology to decode the immune cellular composition of kidney inflammation.

Authors:  Stefan Bonn; Christian F Krebs; Yu Zhao; Ulf Panzer
Journal:  Cell Tissue Res       Date:  2021-06-14       Impact factor: 4.051

9.  Defective Cardiomyocyte Maturation in the Congenitally Hypoplastic Right Ventricle: A Wrong Molecular Trajectory Leading to Heart Failure?

Authors:  Paolo Madeddu
Journal:  J Am Heart Assoc       Date:  2020-10-16       Impact factor: 5.501

Review 10.  Human Cell Atlas and cell-type authentication for regenerative medicine.

Authors:  Yulia Panina; Peter Karagiannis; Andreas Kurtz; Glyn N Stacey; Wataru Fujibuchi
Journal:  Exp Mol Med       Date:  2020-09-15       Impact factor: 8.718

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.