Abstract
Cell clustering is one of the most common routines in single cell RNA-seq data analysis, and a number of specialized methods are available for it. Existing evaluations of these methods ignore an important biological characteristic: the structure of a population of cells is hierarchical. Ignoring this hierarchy can produce misleading evaluation results. In this work, we develop two new metrics that take into account the hierarchical structure of cell types. We illustrate the application of the new metrics in constructed examples as well as several real single cell datasets and show that they provide more biologically plausible results.
Keywords: Clustering; Gene expression; Single cell RNA-seq
Year: 2020 PMID: 32450895 PMCID: PMC7249323 DOI: 10.1186/s13059-020-02027-x
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1 Illustrative examples for using RI/MI and wRI/wMI to evaluate the clustering results. a, b Two examples of hierarchical relationships among a group of A1, A2, B1, and B2 cells. Text under the trees indicates cell types from R, reference; C1, clustering 1; and C2, clustering 2. c Confusion matrices of the two clusterings and measures of clustering performance under reference a or b
A list of datasets used in this work
| Dataset | Protocol | No. of cells | No. of cell types | Sample |
|---|---|---|---|---|
| | 10x | 3971 | 8 | PBMC |
| | SMARTer | 531 | 9 | Human ES |
| | SMARTer | 466 | 9 | Human brain |
| | SMARTer | 1140 | 5 | PBMC |
Fig. 2 Results from PBMC1 dataset. a The reference hierarchy used in evaluation. b RI and wRI for five clustering methods. c NMI and wNMI for five clustering methods. d Confusion matrix for Seurat. e Confusion matrix for SC3
Agreement of pairwise relationship between reference (R) and clustering results (C)
| R \ C | Related | Separated |
|---|---|---|
| Related | | |
| Separated | | |
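
A pairwise agreement table of this kind is what the standard (unweighted) Rand index, RI, is computed from: every pair of cells is classified as related or separated in the reference and in the clustering, and RI is the fraction of pairs on which the two agree. The sketch below is a minimal illustration of that conventional definition only; it is not the weighted wRI proposed in the paper, and the function names and toy labels are hypothetical.

```python
from itertools import combinations


def pair_counts(reference, clustering):
    """Classify every pair of cells as related/separated in R and in C.

    Returns a dict keyed by (status_in_R, status_in_C).
    """
    counts = {
        ("related", "related"): 0,
        ("related", "separated"): 0,
        ("separated", "related"): 0,
        ("separated", "separated"): 0,
    }
    for i, j in combinations(range(len(reference)), 2):
        in_r = "related" if reference[i] == reference[j] else "separated"
        in_c = "related" if clustering[i] == clustering[j] else "separated"
        counts[(in_r, in_c)] += 1
    return counts


def rand_index(reference, clustering):
    """Standard Rand index: fraction of pairs on which R and C agree."""
    counts = pair_counts(reference, clustering)
    agree = counts[("related", "related")] + counts[("separated", "separated")]
    return agree / sum(counts.values())


# Toy labels (hypothetical): the clustering merges A1 and A2 into one cluster.
reference = ["A1", "A1", "A2", "A2", "B1", "B1"]
clustering = [0, 0, 0, 0, 1, 1]
print(pair_counts(reference, clustering))
print(rand_index(reference, clustering))
```

In this toy case the four A1/A2 pairs that the clustering merges are all counted as full disagreements, regardless of how close A1 and A2 are in the cell-type hierarchy.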
Cell membership agreement between reference (R) and clustering results (C)
| R \ C | 1 | 2 | ⋯ | I |
|---|---|---|---|---|
| 1 | | | ⋯ | |
| 2 | | | ⋯ | |
| ⋯ | ⋯ | ⋯ | ⋯ | ⋯ |
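
A cell membership table like this one is the contingency table from which mutual information (MI) and its normalized form (NMI) are conventionally computed. The sketch below shows the standard unweighted calculation, assuming natural logarithms and normalization by the arithmetic mean of the two entropies; the paper's exact normalization and its weighted wMI/wNMI variants are not reproduced here, and all function names are hypothetical.

```python
from collections import Counter
from math import log


def contingency(reference, clustering):
    """Count cells for each (reference type, cluster) combination."""
    return Counter(zip(reference, clustering))


def entropy(labels):
    """Shannon entropy (natural log) of a label vector."""
    n = len(labels)
    return -sum((k / n) * log(k / n) for k in Counter(labels).values())


def mutual_information(reference, clustering):
    """Standard MI computed from the contingency table and its margins."""
    n = len(reference)
    joint = contingency(reference, clustering)
    r_tot = Counter(reference)   # row totals
    c_tot = Counter(clustering)  # column totals
    return sum(
        (n_rc / n) * log(n * n_rc / (r_tot[r] * c_tot[c]))
        for (r, c), n_rc in joint.items()
    )


def nmi(reference, clustering):
    """MI normalized by the mean of the two entropies (an assumed choice)."""
    h_r, h_c = entropy(reference), entropy(clustering)
    if h_r + h_c == 0:
        return 1.0  # both labelings are a single group
    return 2 * mutual_information(reference, clustering) / (h_r + h_c)


# Toy labels (hypothetical): same partition under different cluster names.
print(nmi(["A", "A", "B", "B"], [1, 1, 0, 0]))
```

Other normalizations (geometric mean or maximum of the entropies) are also in common use and give different values whenever the two entropies differ.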