Literature DB >> 27663498

Robust classification of single-cell transcriptome data by nonnegative matrix factorization.

Chunxuan Shao1,2, Thomas Höfer1,2.   

Abstract

MOTIVATION: Single-cell transcriptome data provide unprecedented resolution to study heterogeneity in cell populations and present a challenge for unsupervised classification. Popular methods, like principal component analysis (PCA), often suffer from the high level of noise in the data.
RESULTS: Here we adapt Nonnegative Matrix Factorization (NMF) to study the problem of identifying subpopulations in single-cell transcriptome data. In contrast to the conventional gene-centered view of NMF, identifying metagenes, we used NMF in a cell-centered direction, identifying cell subtypes ('metacells'). Using three different datasets (based on RT-qPCR and single cell RNA-seq data, respectively), we show that NMF outperforms PCA in identifying subpopulations in an accurate and robust way, without the need for prior feature selection; moreover, NMF successfully recovered the broad classes on a large dataset (thousands of single-cell transcriptomes), as identified by a computationally sophisticated method. NMF allows to identify feature genes in a direct, unbiased manner. We propose novel approaches for determining a biologically meaningful number of subpopulations based on minimizing the ambiguity of classification. In conclusion, our study shows that NMF is a robust, informative and simple method for the unsupervised learning of cell subtypes from single-cell gene expression data.
AVAILABILITY AND IMPLEMENTATION: https://github.com/ccshao/nimfa CONTACTS: c.shao@Dkfz-Heidelberg.de or t.hoefer@Dkfz-Heidelberg.deSupplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2016        PMID: 27663498     DOI: 10.1093/bioinformatics/btw607

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Clustering and classification methods for single-cell RNA-sequencing data.

Authors:  Ren Qi; Anjun Ma; Qin Ma; Quan Zou
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

2.  scRCMF: Identification of Cell Subpopulations and Transition States From Single-Cell Transcriptomes.

Authors:  Xiaoying Zheng; Suoqin Jin; Qing Nie; Xiufen Zou
Journal:  IEEE Trans Biomed Eng       Date:  2019-08-23       Impact factor: 4.538

Review 3.  The triumphs and limitations of computational methods for scRNA-seq.

Authors:  Peter V Kharchenko
Journal:  Nat Methods       Date:  2021-06-21       Impact factor: 28.547

4.  Alignment and integration of spatial transcriptomics data.

Authors:  Ron Zeira; Max Land; Alexander Strzalkowski; Benjamin J Raphael
Journal:  Nat Methods       Date:  2022-05-16       Impact factor: 47.990

5.  Single-cell multiomics analysis reveals regulatory programs in clear cell renal cell carcinoma.

Authors:  Zhilin Long; Chengfang Sun; Min Tang; Yin Wang; Jiayan Ma; Jichuan Yu; Jingchao Wei; Jianzhu Ma; Bohan Wang; Qi Xie; Jiaming Wen
Journal:  Cell Discov       Date:  2022-07-19       Impact factor: 38.079

6.  Unified single-cell analysis of testis gene regulation and pathology in five mouse strains.

Authors:  Min Jung; Daniel Wells; Jannette Rusch; Suhaira Ahmad; Jonathan Marchini; Simon R Myers; Donald F Conrad
Journal:  Elife       Date:  2019-06-25       Impact factor: 8.140

7.  ClusterMine: A knowledge-integrated clustering approach based on expression profiles of gene sets.

Authors:  Hong-Dong Li; Yunpei Xu; Xiaoshu Zhu; Quan Liu; Gilbert S Omenn; Jianxin Wang
Journal:  J Bioinform Comput Biol       Date:  2020-06       Impact factor: 1.122

8.  Similarity and Dissimilarity Regularized Nonnegative Matrix Factorization for Single-Cell RNA-seq Analysis.

Authors:  Ya-Li Zhu; Sha-Sha Yuan; Jin-Xing Liu
Journal:  Interdiscip Sci       Date:  2021-07-06       Impact factor: 2.233

9.  Functional Virtual Flow Cytometry: A Visual Analytic Approach for Characterizing Single-Cell Gene Expression Patterns.

Authors:  Zhi Han; Travis Johnson; Jie Zhang; Xuan Zhang; Kun Huang
Journal:  Biomed Res Int       Date:  2017-07-17       Impact factor: 3.411

10.  Boosting scRNA-seq data clustering by cluster-aware feature weighting.

Authors:  Rui-Yi Li; Jihong Guan; Shuigeng Zhou
Journal:  BMC Bioinformatics       Date:  2021-06-02       Impact factor: 3.307

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.