Literature DB >> 33098418

Single-cell RNA-seq data semi-supervised clustering and annotation via structural regularized domain adaptation.

Liang Chen1, Qiuyan He1, Yuyao Zhai2, Minghua Deng1,3,4.   

Abstract

MOTIVATION: The rapid development of single-cell RNA sequencing (scRNA-seq) technologies allows us to explore tissue heterogeneity at the cellular level. The identification of cell types plays an essential role in the analysis of scRNA-seq data, which, in turn, influences the discovery of regulatory genes that induce heterogeneity. As the scale of sequencing data increases, the classical method of combining clustering and differential expression analysis to annotate cells becomes more costly in terms of both labor and resources. Existing scRNA-seq supervised classification method can alleviate this issue through learning a classifier trained on the labeled reference data and then making a prediction based on the unlabeled target data. However, such label transference strategy carries with risks, such as susceptibility to batch effect and further compromise of inherent discrimination of target data.
RESULTS: In this article, inspired by unsupervised domain adaptation, we propose a flexible single cell semi-supervised clustering and annotation framework, scSemiCluster, which integrates the reference data and target data for training. We utilize structure similarity regularization on the reference domain to restrict the clustering solutions of the target domain. We also incorporates pairwise constraints in the feature learning process such that cells belonging to the same cluster are close to each other, and cells belonging to different clusters are far from each other in the latent space. Notably, without explicit domain alignment and batch effect correction, scSemiCluster outperforms other state-of-the-art, single-cell supervised classification and semi-supervised clustering annotation algorithms in both simulation and real data. To the best of our knowledge, we are the first to use both deep discriminative clustering and deep generative clustering techniques in the single-cell field. AVAILABILITYAND IMPLEMENTATION: An implementation of scSemiCluster is available from https://github.com/xuebaliang/scSemiCluster. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year:  2021        PMID: 33098418     DOI: 10.1093/bioinformatics/btaa908

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  scMAGIC: accurately annotating single cells using two rounds of reference-based classification.

Authors:  Yu Zhang; Feng Zhang; Zekun Wang; Siyi Wu; Weidong Tian
Journal:  Nucleic Acids Res       Date:  2022-05-06       Impact factor: 19.160

2.  Identifying tumor cells at the single-cell level using machine learning.

Authors:  Jan Dohmen; Artem Baranovskii; Jonathan Ronen; Bora Uyar; Vedran Franke; Altuna Akalin
Journal:  Genome Biol       Date:  2022-05-30       Impact factor: 17.906

3.  Precision DNA Mixture Interpretation with Single-Cell Profiling.

Authors:  Jianye Ge; Jonathan L King; Amy Smuts; Bruce Budowle
Journal:  Genes (Basel)       Date:  2021-10-20       Impact factor: 4.096

4.  Clustering CITE-seq data with a canonical correlation-based deep learning method.

Authors:  Musu Yuan; Liang Chen; Minghua Deng
Journal:  Front Genet       Date:  2022-08-22       Impact factor: 4.772

5.  An active learning approach for clustering single-cell RNA-seq data.

Authors:  Xiang Lin; Haoran Liu; Zhi Wei; Senjuti Basu Roy; Nan Gao
Journal:  Lab Invest       Date:  2021-07-09       Impact factor: 5.662

6.  Evaluation of some aspects in supervised cell type identification for single-cell RNA-seq: classifier, feature selection, and reference construction.

Authors:  Wenjing Ma; Kenong Su; Hao Wu
Journal:  Genome Biol       Date:  2021-09-09       Impact factor: 13.583

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.