| Literature DB >> 26436699 |
Chuan-Bao Zhang1,2,3,4, Ping Zhu5, Pei Yang1,2,3,4, Jin-Quan Cai6, Zhi-Liang Wang2,3,4, Qing-Bin Li6, Zhao-Shi Bao2,4, Wei Zhang2,4, Tao Jiang1,2,3,4.
Abstract
Anaplastic gliomas are characterized by variable clinical and genetic features, but there are few studies focusing on the substratification of anaplastic gliomas. To identify a more objective and applicable classification of anaplastic gliomas, we analyzed whole genome mRNA expression profiling of four independent datasets. Univariate Cox regression, linear risk score formula and receiver operating characteristic (ROC) curve were applied to derive a gene signature with best prognostic performance. The corresponding clinical and molecular information were further analyzed for interpretation of the different prognosis and the independence of the signature. Gene ontology (GO), Gene Set Variation Analysis (GSVA) and Gene Set Enrichment Analysis (GSEA) were performed for functional annotation of the differences. We found a three-gene signature, by applying which, the anaplastic gliomas could be divided into low risk and high risk groups. The two groups showed a high concordance with grade II and grade IV gliomas, respectively. The high risk group was more aggressive and complex. The three-gene signature showed diagnostic and prognostic value in anaplastic gliomas.Entities:
Keywords: anaplastic glioma; diagnosis; mRNA; prognosis; signature
Mesh:
Substances:
Year: 2015 PMID: 26436699 PMCID: PMC4742201 DOI: 10.18632/oncotarget.5421
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Top 10 prognostic probes identified by Cox regression
| Probe ID | Symbol | Hazard Ratio | β | AUC | |
|---|---|---|---|---|---|
| A_24_P21161 | GPR85 | 0.368 | −1.000 | 1.16E-05 | 0.7188 |
| A_23_P215687 | GPR85 | 0.135 | −2.002 | 1.55E-05 | 0.8244 |
| A_24_P300021 | SHOX2 | 3.151 | 1.148 | 2.00E-05 | 0.8846 |
| A_23_P134684 | HMBOX1 | 0.103 | −2.273 | 5.18E-05 | 0.9382 |
| A_24_P102293 | SLITRK5 | 0.552 | −0.594 | 5.44E-05 | 0.9182 |
| A_23_P210323 | CEP68 | 0.149 | −1.904 | 5.87E-05 | 0.9239 |
| A_24_P417526 | FRG1B | 0.217 | −1.528 | 7.51E-05 | 0.9346 |
| A_23_P138574 | ATE1 | 0.112 | −2.189 | 7.83E-05 | 0.9379 |
| A_23_P3602 | NUDT7 | 0.348 | −1.056 | 9.58E-05 | 0.9290 |
| A_24_P167984 | ATMIN | 0.052 | −2.957 | 0.0001195 | 0.9274 |
the maximum of AUC.
Figure 1Prognostic value of the signature in training and validation sets and the grade II and grade IV like properties of anaplastic gliomas
Patients in low risk group showed a better prognosis than those in high risk group. The two groups also respectively showed similar prognosis with grade II and grade IV gliomas. A, D. CGGA data; B, E. GSE16011 data; C, F. REMBRANDT data; L, low risk group; H, high risk group; II, WHO grade II; IV, WHO grade IV.
Figure 2Distribution of risk score, OS, gene expression and clinical or molecular pathological features in CGGA, GSE16011 and REMBRANDT datasets
Figure 3Unsupervised hierarchical clustering of WHO grade II–IV glioma patients based on the expression of the three genes
Grade II and grade IV gliomas clustered distinctively while grade III gliomas showed a mix of both branches. The vast majority of low risk anaplastic gliomas (green in risk group) clustered closely to grade II gliomas (green in grade) while the high risk ones (red in risk group) clustered in the branch of grade IV (red in grade). A. CGGA data; B. GSE16011 data; C. REMBRANDT data.
Characteristics of patients in low risk and high risk group in four datasets
| CGGA | 16011 | REMBRANDT | TCGA | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LR | HR | LR | HR | LR | HR | LR | HR | ||||||
| Sample Size | 17 | 17 | 40 | 40 | 34 | 33 | 131 | 132 | |||||
| F | 8 | 8 | 15 | 13 | 12 | 8 | 48 | 65 | |||||
| Gender | M | 9 | 9 | > .05 | 25 | 27 | > .05 | 13 | 16 | > .05 | 83 | 67 | = 0.05 |
| NA | 0 | 0 | 0 | 0 | 9 | 9 | 0 | 0 | |||||
| Age | 38.4 ± 11.2 | 45.1 ± 15.3 | > .05 | 43.1 ± 11.8 | 51.9 ± 13.7 | < .01 | NA | NA | 41.6 ± 12.7 | 49.3 ± 13.2 | < 0.01 | ||
| AA | 4 | 4 | 8 | 6 | 20 | 23 | 48 | 76 | |||||
| Histology | AO | 8 | 3 | > .05 | 25 | 17 | > .05 | 12 | 9 | > .05 | 50 | 32 | < 0.01 |
| AOA | 5 | 10 | 7 | 17 | 2 | 1 | 33 | 24 | |||||
| WT | 6 | 8 | 8 | 18 | NA | NA | 9 | 64 | |||||
| IDH1 mutation | Mut | 11 | 9 | > .05 | 21 | 15 | > .05 | NA | NA | 122 | 68 | < 0.01 | |
| NA | 0 | 0 | 11 | 7 | NA | NA | 0 | 0 | |||||
| Radiotherapy | Y | 17 | 15 | 35 | 31 | NA | NA | NA | NA | ||||
| N | 0 | 2 | > .05 | 0 | 0 | > .05 | NA | NA | NA | NA | |||
| NA | 0 | 0 | 5 | 9 | NA | NA | NA | NA | |||||
| Chemotherapy | Y | 16 | 11 | 9 | 6 | NA | NA | NA | NA | ||||
| N | 1 | 6 | > .05 | 26 | 27 | > .05 | NA | NA | NA | NA | |||
| NA | 0 | 0 | 5 | 7 | NA | NA | NA | NA | |||||
| KPS | 87.6 ± 9.2 | 75.9 ± 12.3 | < 0.01 | 86.2 ± 18.0 | 80.0 ± 20.8 | > .05 | NA | NA | NA | NA | |||
P value for age and KPS: t test; p value for others: chi-square test or Fisher's exact test; LR, low risk group; HR, high risk group; F, female; M male; NA, not available; WT, wild type; Mut, mutation; Y, underwent radiotherapy/chemotherapy; N, not underwent radiotherapy/chemotherapy; KPS, Karnofsky performance status.
Univariate and mutivariate cox analysis in TCGA anaplastic glioma samples
| Univatiate | Multivariate | |||||||
|---|---|---|---|---|---|---|---|---|
| HR | 95% CI | HR | 95% CI | |||||
| Lower | Upper | Lower | Upper | |||||
| Gender | 0.898 | 0.549 | 1.467 | 0.666 | ||||
| Age | 1.069 | 1.046 | 1.092 | < 0.001 | 1.072 | 1.047 | 1.097 | < 0.001 |
| Histology | 1.376 | 1.039 | 1.821 | 0.026 | 1.390 | 1.040 | 1.857 | 0.026 |
| Risk score | 1.146 | 1.086 | 1.208 | < 0.001 | 1.069 | 1.004 | 1.137 | 0.036 |
| IDH mutation | 0.350 | 0.211 | 0.582 | < 0.001 | 0.557 | 0.288 | 1.074 | 0.081 |
In cox regression analysis: gender was defined as 1, male, 0, female; histology was defined as 1, oligodendroglioma, 2, oligoastrocytoma, 3, astrocytoma; IDH gene mutation was defined as 1, mutated, 0, wild type.
Figure 4Functional annotation of risk groups
A. GO analysis revealed the significant association of the genes with increased expression in high risk group with six main pathways. Column height: gene counts; point height: enrichment p value. B. Gene set variation analysis of proliferation associated genes in three datasets. The risk score (upper panel) was calculated with the formula described above and ranked from left to right. Gene set enrichment score (lower panel) of proliferation was analyzed by GSVA package of R. These genes showed higher expression with the risk score going from low to high. C. The top ten enriched pathways in high risk group, analyzed by gene set enrichment analysis of TCGA RNAseq data. D. three representative plots of GSEA from C.