| Literature DB >> 29163774 |
Anan Yin1, Amandine Etcheverry2,3,4, Yalong He1, Marc Aubry3,5, Jill Barnholtz-Sloan6, Luhua Zhang1,7, Xinggang Mao1, Weijun Chen1, Bolin Liu8, Wei Zhang1, Jean Mosser2,3,4,5, Xiang Zhang1.
Abstract
Molecular and clinical heterogeneity critically hinders better treatment outcome for glioblastomas (GBMs); integrative analysis of genomic and epigenomic data may provide useful information for improving personalized medicine. By applying training-validation approach, we identified a novel hypomethylation signature comprising of three CpGs at non-CpG island (CGI) open sea regions for GBMs. The hypomethylation signature consistently predicted poor prognosis of GBMs in a series of discovery and validation datasets. It was demonstrated as an independent prognostic indicator, and showed interrelationships with known molecular marks such as MGMT promoter methylation status, and glioma CpG island methylator phenotype (G-CIMP) or IDH1 mutations. Bioinformatic analysis found that the hypomethylation signature was closely associated with the transcriptional status of an EGFR/VEGFA/ANXA1-centered gene network. The integrative molecular analysis finally revealed that the gene network defined two distinct clinically relevant molecular subtypes reminiscent of different immature neuroglial lineages in GBMs. The novel hypomethylation signature and relevant gene network may provide new insights into prognostic classification, molecular characterization, and treatment development for GBMs.Entities:
Keywords: gene network; glioblastomas; molecular classification; non-CpG island hypomethylation; precision oncology
Year: 2017 PMID: 29163774 PMCID: PMC5685695 DOI: 10.18632/oncotarget.19171
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Identification of the novel three-CpGs signature for glioblastomas (GBMs)
(A) all patient cohorts and molecular data sets that were included for the study. (B) schematic diagram of the entire workflow for the study. (C) characteristics of the three-CpGs panel; Cox coefficients were calculated within the training set – GSE22891. (D) the effects of DNA methylation on locus-specific gene expression across tumors (left) and expression levels between GBMs and non-tumor brain tissues (right) for each CpGs; molecular data of GBMs (n=386) and controls (n = 10) were obtained from TCGA.
Figure 2The survival correlation of the three-CpGs signature in each dataset
(A) the hypomethylation signature predicted overall survival (OS) in two training sets – GSE22891 and TCGA. (B) It was also correlated with different OS in the testing test – GSE50923. (C) The three-CpGs signature was further validated in three independent validation cohorts, by yielding apparent OS difference in GSE36278 and Rennes cohort, and a trend for significance in GSE60274. (D) The three-CpGs signature was also able to identify patients with different prognoses within MGMT methylated tumors (left), unmethylated tumors (middle), and G-CIMP-negative tumors (right) among all available patients; the prognostic abilities were also confirmed by dataset-level meta-analysis, which was shown aside in a manner of forest plot.
Results of Cox regression analyses in rennes cohorts
| Variables | Univariate Cox model | Multivariate Cox model | ||||
|---|---|---|---|---|---|---|
| HR | 95%CI | HR | 95%CI | |||
| Rennes cohorts ( | ||||||
| Patient age | 1.026 | 1.005–1.049 | 0.016 | 1.031 | 1.007–1.056 | 0.011 |
| KPS | 1.000 | 0.986–1.013 | 0.942 | |||
| Three-CpGs signature | 0.354 | 0.209–0.601 | < 0.001 | 0.381 | 0.220–0.662 | 0.001 |
| MGMT methylation status | 2.567 | 1.650–3.994 | < 0.001 | 2.835 | 1.756–4.576 | < 0.001 |
| G-CIMP status | 4.901 | 0.681–35.261 | 0.114 | |||
| TCGA gene expression subtypesb | 1.149 | 0.943–1.400 | 0.167 | |||
| Gender | 0.866 | 0.558–1.343 | 0.519 | |||
| Extent of surgery (total/partial/biopsy) | 0.642 | 0.437–0.944 | 0.024 | 0.549 | 0.358–0.841 | 0.006 |
HR = hazard ratio; CI = confidence interval; KPS = Karnofsky performance score; G-CIMP = gliomaCpG island methylator phenotype.
aRennes cohorts included GSE22891 and Rennes cohort (n = 106).
bTCGA gene expression subtypes includes mesenchymal, classical, proneural, and neural subtypes.
In bold type were reported statistically significant results.
Figure 3Functional relevance of the hypomethylation signature
(A) GSEA enrichment plots for representative functional gene sets enriched in low-risk and high-risk tumors from TCGA. (B) based on the top differentially expressed genes (699 genes) between the risk groups in TCGA, a novel 88-gene interaction network was constructed by STRING database, which was centered on EGFR (25 connection nodes), VEGFA (16 connection nodes), and ANXA1 (12 connection nodes; left); the top featured functional groups for the gene network classifiers were identified by DAVID database, showing that those genes were mostly involved in biological processes related to cancer and neural cell development (middle); each bar was indicated by the most representative annotations (with the smallest P value) for each functional groups, and was ordered by group enrichment score, that was the geometric mean of member's p-values in a corresponding annotation cluster; hierarchical clustering on the gene network classifiers clearly separated signatures of immature oligodendrocytes (IO) including non-myelinated oligodendrocytes and oligodendrocyte progenitor cells, from immature astroctyes (IA) (postnatal 1 to 8 days), also suggesting the relevance of the gene network to neural cell development (right). (C) Pearson correlation analysis showed that the risk scores of the hypomethylation signature were consistently and strongly in positive correlation with the expression scores of the gene network not only in the deriving TCGA, but also in two independent databases – Rennes (GSE22891 and Rennes cohort collectively) and GEO (GSE36278 and GSE60274 collectively); only samples with corresponding DNA methylation and gene expression data were analyzed.
Figure 4Molecular and clinical characterization of the two distinct subtypes of GBMs defined by the EGFR/VEGFA/ANXA1-centerred gene network using TCGA multi-dimensional data
(A) the heat maps of K-means (k = 2) clustering on the gene network signature; each row represented a gene which was ordered according to the log2 fold change value calculated from TCGA; each column represented a sample; for each sample (n = 561), subgroup correlation, multi-level molecular features, and enrichment levels for signatures specific to distinct neural cell lineages were indicated; P values for Fisher’ exact test, Chi-square test, and GSEA were accordingly shown; (B) representative functional gene sets enriched in each subtype were also shown in a manner of enrichment plot; (C–D) the volcano plots of the differentially expressed microRNAs and proteins between the subtypes; the top-ranked ones (absolute log2 fold change > 0.5) were indicated; (E) the subtypes showed strong clinical correlations: the IO-like tumors were significantly associated with longer overall survival (OS) than the IA-like ones (left); incorporation of concurrent gain of chr.19/20 further identified a subgroup with more unfavorable prognosis within the IA-like tumors (right); (F) among TCGA patients treated with radiation (RT) and temozolomide (TMZ), the use of bevacizumab (either first-line or at progression) did confer a clear OS benefit to those with the IA-like tumors but was associated with similar OS among the IO-like tumors; recurrent, secondary or previously treated cases were excluded in this interaction analysis.