Literature DB >> 35021202

A comprehensive comparison of supervised and unsupervised methods for cell type identification in single-cell RNA-seq.

Xiaobo Sun1, Xiaochu Lin2, Ziyi Li3, Hao Wu2.   

Abstract

The cell type identification is among the most important tasks in single-cell RNA-sequencing (scRNA-seq) analysis. Many in silico methods have been developed and can be roughly categorized as either supervised or unsupervised. In this study, we investigated the performances of 8 supervised and 10 unsupervised cell type identification methods using 14 public scRNA-seq datasets of different tissues, sequencing protocols and species. We investigated the impacts of a number of factors, including total amount of cells, number of cell types, sequencing depth, batch effects, reference bias, cell population imbalance, unknown/novel cell type, and computational efficiency and scalability. Instead of merely comparing individual methods, we focused on factors' impacts on the general category of supervised and unsupervised methods. We found that in most scenarios, the supervised methods outperformed the unsupervised methods, except for the identification of unknown cell types. This is particularly true when the supervised methods use a reference dataset with high informational sufficiency, low complexity and high similarity to the query dataset. However, such outperformance could be undermined by some undesired dataset properties investigated in this study, which lead to uninformative and biased reference datasets. In these scenarios, unsupervised methods could be comparable to supervised methods. Our study not only explained the cell typing methods' behaviors under different experimental settings but also provided a general guideline for the choice of method according to the scientific goal and dataset properties. Finally, our evaluation workflow is implemented as a modularized R pipeline that allows future evaluation of new methods. Availability: All the source codes are available at https://github.com/xsun28/scRNAIdent.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  cell type identification; scRNA-seq; supervised learning; unsupervised clustering

Mesh:

Year:  2022        PMID: 35021202      PMCID: PMC8921620          DOI: 10.1093/bib/bbab567

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  44 in total

1.  FateID infers cell fate bias in multipotent progenitors from single-cell RNA-seq data.

Authors:  Josip S Herman; Dominic Grün
Journal:  Nat Methods       Date:  2018-04-09       Impact factor: 28.547

2.  RNA Sequencing of Single Human Islet Cells Reveals Type 2 Diabetes Genes.

Authors:  Yurong Xin; Jinrang Kim; Haruka Okamoto; Min Ni; Yi Wei; Christina Adler; Andrew J Murphy; George D Yancopoulos; Calvin Lin; Jesper Gromada
Journal:  Cell Metab       Date:  2016-09-22       Impact factor: 27.287

3.  Construction of a human cell landscape at single-cell level.

Authors:  Xiaoping Han; Ziming Zhou; Lijiang Fei; Huiyu Sun; Renying Wang; Yao Chen; Haide Chen; Jingjing Wang; Huanna Tang; Wenhao Ge; Yincong Zhou; Fang Ye; Mengmeng Jiang; Junqing Wu; Yanyu Xiao; Xiaoning Jia; Tingyue Zhang; Xiaojie Ma; Qi Zhang; Xueli Bai; Shujing Lai; Chengxuan Yu; Lijun Zhu; Rui Lin; Yuchi Gao; Min Wang; Yiqing Wu; Jianming Zhang; Renya Zhan; Saiyong Zhu; Hailan Hu; Changchun Wang; Ming Chen; He Huang; Tingbo Liang; Jianghua Chen; Weilin Wang; Dan Zhang; Guoji Guo
Journal:  Nature       Date:  2020-03-25       Impact factor: 49.962

4.  Robust enumeration of cell subsets from tissue expression profiles.

Authors:  Aaron M Newman; Chih Long Liu; Michael R Green; Andrew J Gentles; Weiguo Feng; Yue Xu; Chuong D Hoang; Maximilian Diehn; Ash A Alizadeh
Journal:  Nat Methods       Date:  2015-03-30       Impact factor: 28.547

5.  Molecular Diversity of Midbrain Development in Mouse, Human, and Stem Cells.

Authors:  Gioele La Manno; Daniel Gyllborg; Simone Codeluppi; Kaneyasu Nishimura; Carmen Salto; Amit Zeisel; Lars E Borm; Simon R W Stott; Enrique M Toledo; J Carlos Villaescusa; Peter Lönnerberg; Jesper Ryge; Roger A Barker; Ernest Arenas; Sten Linnarsson
Journal:  Cell       Date:  2016-10-06       Impact factor: 41.582

6.  Batch effects and the effective design of single-cell gene expression studies.

Authors:  Po-Yuan Tung; John D Blischak; Chiaowen Joyce Hsiao; David A Knowles; Jonathan E Burnett; Jonathan K Pritchard; Yoav Gilad
Journal:  Sci Rep       Date:  2017-01-03       Impact factor: 4.379

7.  Exploring the single-cell RNA-seq analysis landscape with the scRNA-tools database.

Authors:  Luke Zappia; Belinda Phipson; Alicia Oshlack
Journal:  PLoS Comput Biol       Date:  2018-06-25       Impact factor: 4.475

8.  Fast, sensitive and accurate integration of single-cell data with Harmony.

Authors:  Ilya Korsunsky; Nghia Millard; Jean Fan; Kamil Slowikowski; Fan Zhang; Kevin Wei; Yuriy Baglaenko; Michael Brenner; Po-Ru Loh; Soumya Raychaudhuri
Journal:  Nat Methods       Date:  2019-11-18       Impact factor: 28.547

9.  Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage.

Authors:  Dvir Aran; Agnieszka P Looney; Leqian Liu; Esther Wu; Valerie Fong; Austin Hsu; Suzanna Chak; Ram P Naikawadi; Paul J Wolters; Adam R Abate; Atul J Butte; Mallar Bhattacharya
Journal:  Nat Immunol       Date:  2019-01-14       Impact factor: 25.606

10.  Evaluation of methods to assign cell type labels to cell clusters from single-cell RNA-sequencing data.

Authors:  J Javier Diaz-Mejia; Elaine C Meng; Alexander R Pico; Sonya A MacParland; Troy Ketela; Trevor J Pugh; Gary D Bader; John H Morris
Journal:  F1000Res       Date:  2019-03-15
View more
  1 in total

1.  D3K: The Dissimilarity-Density-Dynamic Radius K-means Clustering Algorithm for scRNA-Seq Data.

Authors:  Guoyun Liu; Manzhi Li; Hongtao Wang; Shijun Lin; Junlin Xu; Ruixi Li; Min Tang; Chun Li
Journal:  Front Genet       Date:  2022-07-01       Impact factor: 4.772

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.