| Literature DB >> 28798928 |
Zhi Han1,2, Travis Johnson2, Jie Zhang2,3, Xuan Zhang1, Kun Huang2.
Abstract
We presented a novel workflow for detecting distribution patterns in cell populations based on single-cell transcriptome study. With the fast adoption of single-cell analysis, a challenge to researchers is how to effectively extract gene features to meaningfully separate the cell population. Considering that coexpressed genes are often functionally or structurally related and the number of coexpressed modules is much smaller than the number of genes, our workflow uses gene coexpression modules as features instead of individual genes. Thus, when the coexpressed modules are summarized into eigengenes, not only can we interactively explore the distribution of cells but also we can promptly interpret the gene features. The interactive visualization is aided by a novel application of spatial statistical analysis to the scatter plots using a clustering index parameter. This parameter helps to highlight interesting 2D patterns in the scatter plot matrix (SPLOM). We demonstrated the effectiveness of the workflow using two large single-cell studies. In the Allen Brain scRNA-seq dataset, the visual analytics suggested a new hypothesis such as the involvement of glutamate metabolism in the separation of the brain cells. In a large glioblastoma study, a sample with a unique cell migration related signature was identified.Entities:
Mesh:
Year: 2017 PMID: 28798928 PMCID: PMC5536134 DOI: 10.1155/2017/3035481
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The workflow of the functional virtual flow cytometry system.
Figure 2Workflow for weighted GCNA and eigengene calculation.
The seven gene modules whose eigengenes show long tail distributions.
| Eigengene # | Index | Size | Kurtosis | Enrichment/notes |
|---|---|---|---|---|
| 1 | 3 | 38 | 10.7844 | 32 predicted genes: three genes are immunoglobulins and two are T cell receptors, acute lymphocytic leukemia ( |
|
| ||||
| 2 | 6 | 35 | 5.0379 | Ion transport ( |
|
| ||||
| 3 | 12 | 18 | 8.5550 | Glutamate decarboxylation to succinate ( |
|
| ||||
| 4 | 13 | 17 | 19.9492 | Development of lower uro neuro e15.5 BladdPelvicGanglion Sox10 top-relative-expression-ranked 1000 (1.227 |
|
| ||||
| 5 | 28 | 11 | 4.9068 | Hydrogen ion transmembrane transport ( |
|
| ||||
| 6 | 48 | 6 | 3.8686 | NADH metabolic process ( |
|
| ||||
| 7 | 60 | 5 | 12.5680 | Mostly predicted genes |
Figure 3Colored SPLOM for the seven long tail eigengenes from the Allen Brain scRNA-seq data. The subplot in the ith row, jth column of the matrix is a scatter plot of the ith eigengene against the jth eigengene. Along the diagonal are histogram plots of each eigengene.
Figure 4Four example scatterplots with broad classes annotated in different colors.
Figure 5(a) The 3D plot for the first three principal components using all genes for the cells. (b) The 3D plot for the first three principal components using genes in the gene modules in Table 1 for the cells.
Figure 6Colored SPLOM for the long tail eigengenes from the brain tumor study. The subplot in the ith row, jth column of the matrix is a scatter plot of the ith eigengene against the jth eigengene. Along the diagonal are histogram plots of each eigengene.
Figure 7(a) The scatter plot between eigengene #4 (x-axis) and eigengene #11 (y-axis). (b) The scatter plot between eigengenes #4 (x-axis) and #6 (y-axis).