| Literature DB >> 31574155 |
Yun-Ching Chen1, Abhilash Suresh1, Chingiz Underbayev2, Clare Sun2, Komudi Singh1, Fayaz Seifuddin1, Adrian Wiestner2, Mehdi Pirooznia1.
Abstract
BACKGROUND: In single-cell RNA-sequencing analysis, clustering cells into groups and differentiating cell groups by differentially expressed (DE) genes are 2 separate steps for investigating cell identity. However, the ability to differentiate between cell groups could be affected by clustering. This interdependency often creates a bottleneck in the analysis pipeline, requiring researchers to repeat these 2 steps multiple times by setting different clustering parameters to identify a set of cell groups that are more differentiated and biologically relevant.Entities:
Keywords: Seurat; cell ontology; clustering; single-cell RNA-sequencing
Mesh:
Year: 2019 PMID: 31574155 PMCID: PMC6771546 DOI: 10.1093/gigascience/giz121
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:IKAP workflow. See Online Methods for details. SNN: shared nearest neighbor.
Figure 2:Major cell groups identified for PBMC_8K (A, B, and C) and the mouse cortex dataset (D, E, and F). (A) Shown are tSNE plots for the 7 major groups identified by IKAP with cell types labeled (top) and expression of known marker genes (bottom): CD3E for T cells, CD79A for B cells, GNLY for NK cells, and LYZ for monocytes. (B) The heat map for expression of the top 5 DE genes (by expression fold change) from each group in (A). Rows are genes and columns are cells. (C) Performance summary of 3 candidate sets proposed by IKAP (left) and the 20 trial sets (right). Note that the number of candidate sets can vary for different datasets. Running time is shown at the bottom. The dashed blue lines indicate the number of cell groups (top) and the median log2 fold change (bottom) of the best set (PC9K7). (D) The tSNE plot for 8 major groups identified by IKAP in the mouse cortex dataset consistent with previously annotated cell types. (E) The heat map for expression of marker genes annotated for major cell types in Zeisel et al. 2015 [6] (blue) and DE genes identified by IKAP for groups 3 and 7 in (D) (red). (F) The heat map indicates the proportion of overlapping cells between IKAP-identified major groups and major cell types annotated in Zeisel et al. 2015 [6].
Figure 3:Examples of cell ontology proposed by IKAP. Three cell ontology examples were built by recursively running IKAP on the biggest groups (circled in red) for the mouse cortex dataset (A), PBMC_4K (D), and PBMC_8K (E). Putative cell types were labeled. Unknown types were left as blanks. (B) Shown is the tSNE plot for major groups and subgroups of group 7 presented in the mouse cortex ontology in (A). (C) The heat map shows expression of DE genes identified by IKAP (red) and annotated in Zeisel et al. 2015 [6] (blue) for subgroups of group 7 in (A) (labeled at bottom). (F) Heat maps show expression of selected DE genes that differentiate T-cell subtypes in PBMC_4K (top; subgroups labeled according to the ontology in [D]) and in PBMC_8K (bottom; subgroups labeled according to the ontology in [E]). Subgroups with similar expression profiles are linked by lines between PBMC_4K and PBMC_8K.