Literature DB >> 31675098

Evaluation of single-cell classifiers for single-cell RNA sequencing data sets.

Xinlei Zhao1,2, Shuang Wu2, Nan Fang1, Xiao Sun1, Jue Fan2.   

Abstract

Single-cell RNA sequencing (scRNA-seq) has been rapidly developing and widely applied in biological and medical research. Identification of cell types in scRNA-seq data sets is an essential step before in-depth investigations of their functional and pathological roles. However, the conventional workflow based on clustering and marker genes is not scalable for an increasingly large number of scRNA-seq data sets due to complicated procedures and manual annotation. Therefore, a number of tools have been developed recently to predict cell types in new data sets using reference data sets. These methods have not been generally adapted due to a lack of tool benchmarking and user guidance. In this article, we performed a comprehensive and impartial evaluation of nine classification software tools specifically designed for scRNA-seq data sets. Results showed that Seurat based on random forest, SingleR based on correlation analysis and CaSTLe based on XGBoost performed better than others. A simple ensemble voting of all tools can improve the predictive accuracy. Under nonideal situations, such as small-sized and class-imbalanced reference data sets, tools based on cluster-level similarities have superior performance. However, even with the function of assigning 'unassigned' labels, it is still challenging to catch novel cell types by solely using any of the single-cell classifiers. This article provides a guideline for researchers to select and apply suitable classification tools in their analysis workflows and sheds some lights on potential direction of future improvement on classification tools.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Keywords:  benchmark; classification; comparative analysis; single-cell RNA-seq

Year:  2020        PMID: 31675098     DOI: 10.1093/bib/bbz096

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  17 in total

1.  A comprehensive comparison of supervised and unsupervised methods for cell type identification in single-cell RNA-seq.

Authors:  Xiaobo Sun; Xiaochu Lin; Ziyi Li; Hao Wu
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  Potential biomarkers in the fibrosis progression of nonalcoholic steatohepatitis (NASH).

Authors:  Z Wang; Z Zhao; Y Xia; Z Cai; C Wang; Y Shen; R Liu; H Qin; J Jia; G Yuan
Journal:  J Endocrinol Invest       Date:  2022-02-28       Impact factor: 4.256

3.  Cell-type modeling in spatial transcriptomics data elucidates spatially variable colocalization and communication between cell-types in mouse brain.

Authors:  Francisco Jose Grisanti Canozo; Zhen Zuo; James F Martin; Md Abul Hassan Samee
Journal:  Cell Syst       Date:  2021-10-08       Impact factor: 10.304

Review 4.  Breaking the Immune Complexity of the Tumor Microenvironment Using Single-Cell Technologies.

Authors:  Simone Caligola; Francesco De Sanctis; Stefania Canè; Stefano Ugel
Journal:  Front Genet       Date:  2022-05-16       Impact factor: 4.772

5.  scDLC: a deep learning framework to classify large sample single-cell RNA-seq data.

Authors:  Yan Zhou; Minjiao Peng; Bin Yang; Tiejun Tong; Baoxue Zhang; Niansheng Tang
Journal:  BMC Genomics       Date:  2022-07-12       Impact factor: 4.547

6.  scMAGIC: accurately annotating single cells using two rounds of reference-based classification.

Authors:  Yu Zhang; Feng Zhang; Zekun Wang; Siyi Wu; Weidong Tian
Journal:  Nucleic Acids Res       Date:  2022-05-06       Impact factor: 19.160

7.  Vec2image: an explainable artificial intelligence model for the feature representation and classification of high-dimensional biological data by vector-to-image conversion.

Authors:  Hui Tang; Xiangtian Yu; Rui Liu; Tao Zeng
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

8.  Identification of Tumor Mutation Burden, Microsatellite Instability, and Somatic Copy Number Alteration Derived Nine Gene Signatures to Predict Clinical Outcomes in STAD.

Authors:  Chuanzhi Chen; Yi Chen; Xin Jin; Yongfeng Ding; Junjie Jiang; Haohao Wang; Yan Yang; Wu Lin; Xiangliu Chen; Yingying Huang; Lisong Teng
Journal:  Front Mol Biosci       Date:  2022-04-11

9.  Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults.

Authors:  Yang Wu; Haofei Hu; Jinlin Cai; Runtian Chen; Xin Zuo; Heng Cheng; Dewen Yan
Journal:  Front Public Health       Date:  2021-06-29

10.  Single-cell classification using graph convolutional networks.

Authors:  Tianyu Wang; Jun Bai; Sheida Nabavi
Journal:  BMC Bioinformatics       Date:  2021-07-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.