Literature DB >> 34156956

Unsupervised Cluster Analysis and Gene Marker Extraction of scRNA-seq Data Based On Non-Negative Matrix Factorization.

Chuan-Yuan Wang, Ying-Lian Gao, Xiang-Zhen Kong, Jin-Xing Liu, Chun-Hou Zheng.   

Abstract

The development of single-cell RNA sequencing (scRNA-seq) technology has made it possible to measure gene expression levels at the resolution of a single cell, which further reveals the complex growth processes of cells such as mutation and differentiation. Recognizing cell heterogeneity is one of the most critical tasks in scRNA-seq research. To solve it, we propose a non-negative matrix factorization framework based on multi-subspace cell similarity learning for unsupervised scRNA-seq data analysis (MscNMF). MscNMF includes three parts: data decomposition, similarity learning, and similarity fusion. The three work together to complete the data similarity learning task. MscNMF can learn the gene features and cell features of different subspaces, and the correlation and heterogeneity between cells will be more prominent in multi-subspaces. The redundant information and noise in each low-dimensional feature space are eliminated, and its gene weight information can be further analyzed to calculate the optimal number of subpopulations. The final cell similarity learning will be more satisfactory due to the fusion of cell similarity information in different subspaces. The advantage of MscNMF is that it can calculate the number of cell types and the rank of Non-negative matrix factorization (NMF) reasonably. Experiments on eight real scRNA-seq datasets show that MscNMF can effectively perform clustering tasks and extract useful genetic markers. To verify its clustering performance, the framework is compared with other latest clustering algorithms and satisfactory results are obtained. The code of MscNMF is free available for academic (https://github.com/wangchuanyuan1/project-MscNMF).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 34156956     DOI: 10.1109/JBHI.2021.3091506

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  2 in total

1.  Dissecting Cellular Heterogeneity Based on Network Denoising of scRNA-seq Using Local Scaling Self-Diffusion.

Authors:  Xin Duan; Wei Wang; Minghui Tang; Feng Gao; Xudong Lin
Journal:  Front Genet       Date:  2022-01-10       Impact factor: 4.599

2.  One-Step Robust Low-Rank Subspace Segmentation for Tumor Sample Clustering.

Authors:  Jian Liu; Yuhu Cheng; Xuesong Wang; Shuguang Ge
Journal:  Comput Intell Neurosci       Date:  2021-12-08
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.