| Literature DB >> 32411685 |
Fei Yuan1,2, XiaoYong Pan3, Tao Zeng4, Yu-Hang Zhang5, Lei Chen6,7, Zijun Gan5, Tao Huang5, Yu-Dong Cai1.
Abstract
Single-cell sequencing technologies have emerged to address new and longstanding biological and biomedical questions. Previous studies focused on the analysis of bulk tissue samples composed of millions of cells. However, the genomes within the cells of an individual multicellular organism are not always the same. In this study, we aimed to identify the crucial and characteristically expressed genes that may play functional roles in tissue development and organogenesis, by analyzing a single-cell transcriptomic atlas of mice. We identified the most relevant gene features and decision rules classifying 18 cell categories, providing a list of genes that may perform important functions in the process of tissue development because of their tissue-specific expression patterns. These genes may serve as biomarkers to identify the origin of unknown cell subgroups so as to recognize specific cell stages/states during the dynamic process, and also be applied as potential therapy targets for developmental disorders.Entities:
Keywords: cell type; expression rule; multi-class classification; single-cell transcriptomics; tissue development
Year: 2020 PMID: 32411685 PMCID: PMC7201067 DOI: 10.3389/fbioe.2020.00350
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Sample size of each tissue.
| Index | Tissue | Sample size |
| 1 | Bladder | 1638 |
| 2 | Brain microglia | 4762 |
| 3 | Brain neurons | 5799 |
| 4 | Colon | 4149 |
| 5 | Fat | 5862 |
| 6 | Heart | 7115 |
| 7 | Kidney | 865 |
| 8 | Liver | 981 |
| 9 | Lung | 1923 |
| 10 | Mammary | 2663 |
| 11 | Marrow | 5355 |
| 12 | Muscle | 2102 |
| 13 | Pancreas | 1961 |
| 14 | Skin | 2464 |
| 15 | Spleen | 1718 |
| 16 | Thymus | 1580 |
| 17 | Tongue | 1432 |
| 18 | Trachea | 1391 |
FIGURE 1The entire procedures for analyzing the single-cell expression profiles of mouse cells in 18 tissues.
FIGURE 2IFS curves for IFS with RF on the feature list yielded by mRMR and MCFS methods, respectively. The best MCC for RF on the list yielded by mRMR method is 0.882 when top 2265 features are used. The highest MCC for RF on the list yielded by MCFS method is 0.892 when top 1170 features are adopted.
Performance and optimum number of features of IFS with RF when using different feature ranking methods.
| Feature ranking | Number of optimum features | MCC | Overall accuracy |
| mRMR | 2265 | 0.882 | 0.890 |
| MCFS | 1170 | 0.892 | 0.899 |
FIGURE 3Bar chart to show accuracies on 18 tissues yielded by the optimum RF classifiers on the feature lists of mRMR and MCFS methods.
FIGURE 4Venn diagrams to show the intersection of optimum features for RF and PART based on the feature lists of mRMR and MCFS methods. (A) Venn diagram to show the intersection of optimum features for RF; (B) Venn diagram to show the intersection of optimum features for PART.
FIGURE 5IFS curves for IFS with PART on the feature list yielded by mRMR and MCFS methods, respectively. The best MCC for PART on the list yielded by mRMR method is 0.709 when top 200 features are used. The highest MCC for PART on the list yielded by MCFS method is 0.781 when top 400 features are adopted.
Performance and optimum number of features of IFS with PART when using different feature ranking methods.
| Feature ranking | Number of optimum features | Number of classification rules | MCC | Overall accuracy |
| mRMR | 200 | 7085 | 0.709 | 0.730 |
| MCFS | 400 | 7413 | 0.781 | 0.798 |
FIGURE 6Bar chart to show accuracies on 18 tissues yielded by the optimum PART classifiers on the feature lists of mRMR and MCFS methods.