| Literature DB >> 32425984 |
Ruiqing Zheng1, Zhenlan Liang1, Xiang Chen1, Yu Tian1, Chen Cao2, Min Li1.
Abstract
The rapid development of single-cell transcriptome sequencing technology has provided us with a cell-level perspective to study biological problems. Identification of cell types is one of the fundamental issues in computational analysis of single-cell data. Due to the large amount of noise from single-cell technologies and high dimension of expression profiles, traditional clustering methods are not so applicable to solve it. To address the problem, we have designed an adaptive sparse subspace clustering method, called AdaptiveSSC, to identify cell types. AdaptiveSSC is based on the assumption that the expression of cells with the same type lies in the same subspace; one cell can be expressed as a linear combination of the other cells. Moreover, it uses a data-driven adaptive sparse constraint to construct the similarity matrix. The comparison results of 10 scRNA-seq datasets show that AdaptiveSSC outperforms original subspace clustering and other state-of-art methods in most cases. Moreover, the learned similarity matrix can also be integrated with a modified t-SNE to obtain an improved visualization result.Entities:
Keywords: adaptive sparse strategy; similarity learning; single cell RNA-seq; subspace clustering; visualization
Year: 2020 PMID: 32425984 PMCID: PMC7212354 DOI: 10.3389/fgene.2020.00407
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1The pipeline of AdaptiveSSC to identify and visualize cell types from scRNA-seq data.
Single cell RNA-seq datasets.
| Darmanis (Darmanis et al., | 420 | 22,085 | SMARTer |
| Kolod (Kolodziejczyk et al., | 704 | 10,685 | Smart-Seq2 |
| Treutlein (Treutlein et al., | 80 | 959 | SMARTer |
| Yan (Yan et al., | 90 | 20,214 | Tang et al., |
| Ting (Ting et al., | 114 | 14,405 | Single CTC RNA-Seq |
| Engel (Engel et al., | 203 | 23,337 | Smart-seq2 |
| Kumar (Kumar et al., | 361 | 11,497 | SMARTer |
| Vento (Vento-Tormo et al., | 5,418 | 33,693 | Smart-seq2 |
| Baron (Baron et al., | 8,569 | 20,125 | inDrop |
| Shekhar (Shekhar et al., | 26,830 | 13,166 | Drop-seq |
Figure 2The corresponding NMI and ARI with different values of λ on eight datasets.
Figure 3The corresponding (A) NMI and (B) ARI of SIMLR, MPSSC, SNN-cliq, RAFSIL, Seurat, SinNLRR, SSC, and AdaptiveSSC on 10 datasets.
Figure 4The visualization of t-SNE, SIMLR, MPSSC, and AdaptiveSSC on (A) Darmanis and (B) Treutelin.