| Literature DB >> 34252936 |
Ziyang Wei1,2, Shuqin Zhang2,3,4.
Abstract
MOTIVATION: Single-cell RNA sequencing (scRNA-seq) technology has been widely applied to capture the heterogeneity of different cell types within complex tissues. An essential step in scRNA-seq data analysis is the annotation of cell types. Traditional cell-type annotation is mainly clustering the cells first, and then using the aggregated cluster-level expression profiles and the marker genes to label each cluster. Such methods are greatly dependent on the clustering results, which are insufficient for accurate annotation.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34252936 PMCID: PMC8686678 DOI: 10.1093/bioinformatics/btab286
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Summary of the 10 real datasets
| Data/references | Protocol |
| Cell types | Tissues |
|---|---|---|---|---|
| Baron ( | InDrop | 20 125 × 1937 | 14 | Pancreatic islets |
| Bladder (Consortium | 10X | 23 433 × 2500 | 4 | Bladder |
| Chen ( | Drop-seq | 23 284 × 14 437 | 47 | Hypothalamus |
| Kidney (Consortium | 10X | 23 433 × 2781 | 8 | Kidney |
| Lung (Consortium | 10X | 23 433 × 835 | 4 | Lung |
| Marrow (Consortium | 10X | 23 433 × 1732 | 14 | Marrow |
| PBMC10X ( | 10X | 32 738 × 2638 | 6 | Blood |
| PBMCSeqWell ( | SeqWell | 6173 × 3694 | 6 | Blood |
| Seger ( | Smart-Seq | 25 525 × 1099 | 9 | Pancreatic islet |
| Tongue ( | 10X | 23 433 × 7538 | 3 | Tongue |
Fig. 2.Visualization of the cells in ‘PBMC10X’
Fig. 3.Cell-type annotation with the labeled cells determined by marker genes. ‘CALLR’ uses two thresholds the 75th and 85th percentile for representative cell selection
Fig. 4.Classification accuracy for different ratios of labeled cells
Fig. 5.Performance of CALLR for different values of μ on the dataset ‘Lung’