| Literature DB >> 32195242 |
Quan Cheng1,2, Jing Li3, Fan Fan1, Hui Cao4, Zi-Yu Dai1, Ze-Yu Wang1, Song-Shan Feng1.
Abstract
Glioblastoma (GBM) is one of the most common and aggressive primary adult brain tumors. Tumor heterogeneity poses a great challenge to the treatment of GBM, which is determined by both heterogeneous GBM cells and a complex tumor microenvironment. Single-cell RNA sequencing (scRNA-seq) enables the transcriptomes of great deal of individual cells to be assayed in an unbiased manner and has been applied in head and neck cancer, breast cancer, blood disease, and so on. In this study, based on the scRNA-seq results of infiltrating neoplastic cells in GBM, computational methods were applied to screen core biomarkers that can distinguish the discrepancy between GBM tumor and pericarcinomatous environment. The gene expression profiles of GBM from 2343 tumor cells and 1246 periphery cells were analyzed by maximum relevance minimum redundancy (mRMR). Upon further analysis of the feature lists yielded by the mRMR method, 31 important genes were extracted that may be essential biomarkers for GBM tumor cells. Besides, an optimal classification model using a support vector machine (SVM) algorithm as the classifier was also built. Our results provided insights of GBM mechanisms and may be useful for GBM diagnosis and therapy.Entities:
Keywords: glioblastoma biomarkers; mRMR method; pericarcinomatous environment; scRNA-seq; support vector machine
Year: 2020 PMID: 32195242 PMCID: PMC7066068 DOI: 10.3389/fbioe.2020.00167
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
FIGURE 1The IFS curve of the top 100 mRMR genes. The x-axis was the number of genes and the y-axis was the prediction performance, i.e., LOOCV MCC. The peak MCC was 0.812 when 31 genes were used. These 31 genes were selected as optimal GBM biomarker genes.
The 31 selected GBM biomarker genes.
| Rank | Gene | Rank | Gene |
| 1 | TMSB4X | 17 | VIM |
| 2 | IPCEF1 | 18 | ATP1A2 |
| 3 | MTSS1 | 19 | RPL41 |
| 4 | S100A10 | 20 | EGR3 |
| 5 | HTRA1 | 21 | OMG |
| 6 | DHRS9 | 22 | LDHA |
| 7 | TPI1 | 23 | P2RY12 |
| 8 | SNX22 | 24 | SPOCK1 |
| 9 | FCGBP | 25 | NAMPT |
| 10 | TMSB10 | 26 | C1QL2 |
| 11 | CCL3 | 27 | PTN |
| 12 | SLC6A1 | 28 | CCL4 |
| 13 | SMOC1 | 29 | PDZD2 |
| 14 | SEC61G | 30 | LGALS1 |
| 15 | TGFBI | 31 | CLDN10 |
| 16 | CDR1 |
The confusion matrix of the 31 selected genes.
| Predicted GBM | Predicted non-GBM | |
| Actual GBM | 2220 | 123 |
| Actual non-GBM | 181 | 1065 |
FIGURE 2The t-SNE plots of predicted GBM cells and predicted non-GBM cells. (A) The t-SNE plots of predicted GBM cells. It can be seen that the false positive samples (red dots) and the true positive samples (black dots) were mixed and they were difficult to classify. (B) The t-SNE plots of predicted non-GBM cells. It can be seen that the false negative samples (black dots) and the true negative samples (red dots) were mixed. These t-SNE plots suggested that the GBM tissues may contain non-GBM cells and the non-GBM tissues may contain GBM cells, but most cells from the corresponding tissue were similar and the machine learning algorithm we used can get the robust single cell biomarkers even when there were tissue purity issues.
The GO enrichment results of the 31 selected genes.
| GO term | FDR | Genes | |
| GO:0007155 cell adhesion | 0.0068 | 8.26 | EGR3, LGALS1, OMG, PTN, S100A10, CCL4, SPOCK1, TGFBI, CLDN10, MTSS1, PDZD2, P2RY12 |
| GO:0022610 biological adhesion | 0.0068 | 8.74 | EGR3, LGALS1, OMG, PTN, S100A10, CCL4, SPOCK1, TGFBI, CLDN10, MTSS1, PDZD2, P2RY12 |
| GO:0031012 extracellular matrix | 0.0029 | 1.57 | LGALS1, OMG, HTRA1, PTN, SPOCK1, TGFBI, VIM, SMOC1 |
| GO:0005615 extracellular space | 0.0107 | 1.56 | LGALS1, OMG, HTRA1, PTN, CCL3, CCL4, SPOCK1, TGFBI, TMSB4X, TPI1, NAMPT |
| GO:0005576 extracellular region | 0.0107 | 1.87 | ATP1A2, LDHA, LGALS1, OMG, HTRA1, PTN, S100A10, CCL3, CCL4, SPOCK1, TGFBI, TMSB4X, TPI1, VIM, FCGBP, NAMPT, PDZD2, SMOC1, C1QL2 |
| GO:0005578 proteinaceous extracellular matrix | 0.0107 | 2.30 | LGALS1, OMG, PTN, SPOCK1, TGFBI, SMOC1 |
| GO:0044421 extracellular region part | 0.0108 | 2.89 | ATP1A2, LDHA, LGALS1, OMG, HTRA1, PTN, S100A10, CCL3, CCL4, SPOCK1, TGFBI, TMSB4X, TPI1, VIM, FCGBP, NAMPT, SMOC1 |