| Literature DB >> 34076860 |
Jin-Xing Liu1, Chuan-Yuan Wang1, Ying-Lian Gao2, Yulin Zhang3, Juan Wang4, Sheng-Jun Li1.
Abstract
High-throughput sequencing of single-cell gene expression reveals a complex mechanism of individual cell's heterogeneity in a population. An important purpose for analyzing single-cell RNA sequencing (scRNA-seq) data is to identify cell subtypes and functions by cell clustering. To deal with high levels of noise and cellular heterogeneity, we introduced a new single cell data analysis model called Adaptive Total-Variation Regularized Low-Rank Representation (ATV-LRR). In scRNA-seq data, ATV-LRR can reconstruct the low-rank subspace structure to learn the similarity of cells. The low-rank representation can not only segment multiple linear subspaces, but also extract important information. Moreover, adaptive total variation also can remove cell noise and preserve cell feature details by learning the gradient information of the data. At the same time, to analyze scRNA-seq data with unknown prior information, we introduced the maximum eigenvalue method into the ATV-LRR model to automatically identify cell populations. The final clustering results show that the ATV-LRR model can detect cell types more effectively and stably.Keywords: Clustering; Low rank; Sample subspace; Single-cell RNA sequencing data; Total variation
Year: 2021 PMID: 34076860 DOI: 10.1007/s12539-021-00444-5
Source DB: PubMed Journal: Interdiscip Sci ISSN: 1867-1462 Impact factor: 2.233