| Literature DB >> 32904984 |
Cymon N Kersch1, Cheryl J Claunch2,3, Prakash Ambady1, Elmar Bucher2,3, Daniel L Schwartz4,5, Ramon F Barajas3,4,6, Jeffrey J Iliff7, Tyler Risom8, Laura Heiser2,3, Leslie L Muldoon1, James E Korkola2,3, Joe W Gray2,3, Edward A Neuwelt1,9,10.
Abstract
BACKGROUND: Glioblastoma is a rapidly fatal brain cancer that exhibits extensive intra- and intertumoral heterogeneity. Improving survival will require the development of personalized treatment strategies that can stratify tumors into subtypes that differ in therapeutic vulnerability and outcomes. Glioblastoma stratification has been hampered by intratumoral heterogeneity, limiting our ability to compare tumors in a consistent manner. Here, we develop methods that mitigate the impact of intratumoral heterogeneity on transcriptomic-based patient stratification.Entities:
Keywords: gene signature; glioblastoma; heterogeneity; transcriptomics
Year: 2020 PMID: 32904984 PMCID: PMC7462280 DOI: 10.1093/noajnl/vdaa093
Source DB: PubMed Journal: Neurooncol Adv ISSN: 2632-2498
Figure 1.Variation in glioblastoma sample gene expression is primarily explained by histologic structure. (A) Representative image demonstrating the histologic structures that were microdissected, subject to RNAseq and archived in the IvyGAP database by the Allen Brain Institute, scale bar = 800 µm. (B–D) Analysis of the 1000 most variable genes in the IvyGAP data set. (B) Principle component analysis of dimensions 1 (Dim1) and 2 (Dim2) demonstrates that most variation in the data is explained by the histologic structure from which the RNA was extracted. Individual samples (symbol) are colored by the structure they is from; ellipse level = 0.66. (C) Correlation network analysis shows samples (nodes) from a histologic structure cluster. Colors depict the structure samples came from; edge length represents the degree of correlation between samples. (D) Dendrogram of hierarchically clustered (k = 4) samples demonstrating structures with the most similarity.
Figure 2.Biological processes enriched in glioblastoma structures. Enrichment map visualizations of GSEA demonstrating gene ontology (GO) biological processes enriched in the (A) LE and IT, (B) CT, (C) HBV and MVP, and (D) PNZ and PAN, relative to the rest of the tumor. Nodes represent GO terms. Functionally related clusters were manually circled and labeled. Node color represents the structure enriched (purple: LE/IT; green: CT; dark orange: HBV/MVP; blue: PNZ/PAN). Node size within each structure quadrant is proportional to the number of genes within each GO term. Edge thickness signifies degree of overlap between GO terms (number of genes shared between 2 gene sets).
Figure 3.Molecular subtype classification depends on the structure sampled, with CT able to distinguish biologically distinct subtypes. (A) Expression of subtype gene sets (y-axis) in IvyGAP samples (x-axis) from each region show that sample structure is a main contributor to expression of subtype gene signatures. Genes corresponding to each subtype were organized independently by unsupervised hierarchical clustering. (B) Subtype classification for samples from subjects with data from ≥4 different regions. CT* represents subtype analysis applying data z-scored across only CT samples. (C) Unsupervised hierarchical clustering of IvyGAP CT samples (data z-scored across CT samples) showing 3 main clusters with signatures of proneural, classical, and mesenchymal subtypes. (D) Unsupervised hierarchical clustering of TCGA CT-classified samples demonstrating 3 main clusters with signatures of proneural, classical, and mesenchymal subtypes. CT** represents TCGA samples predicted to be composed of predominantly CT based on our structure prediction gene signature. (E) Enrichment of hallmark gene sets in proneural and mesenchymal subtypes, characterized based on CT sample analysis. Proneural and mesenchymal tumors have enrichment of cell cycle checkpoints and immune processes, respectively. FDR, false discovery rate; NES, normalized enrichment score.
Figure 4.Established prognostic gene signature expression is driven by glioblastoma structure. (A) A survival prediction gene set demonstrates differential expression in tumor structures, with opposite expression in the IT/LE and the PAN/PNZ/HBV/MVP. Transcripts were organized independently by unsupervised hierarchical clustering. (B) Prognostic prediction for samples from subjects with data from ≥4 structures, with prognosis determined by sample metagene score. A single subject can be predicted to have high- or low-risk depending on which structure is analyzed. *CT normalized represents CT samples that underwent z-score normalization utilizing only the CT samples rather than all structure samples. (C) Kaplan–Meier survival analysis of all IvyGAP samples. (D) Kaplan–Meier analysis using a metagene score calculated applying CT samples, demonstrating a trend in survival stratification. (E) Kaplan–Meier analysis using a metagene score calculated applying only HBV samples, significantly and incorrectly stratified long versus short survivors. For survival analyses, metagene scores were used to risk stratify (poor prognosis: metagene score >0; good prognosis: metagene score <0).
Figure 5.Novel prognostic gene signature created utilizing CT expression data. (A) Risk score and hazard ratio (HR) prediction equation created using a novel prognostic model for glioblastoma. The risk score is the sum of the products of defined weighting factors with the corresponding predictors, MGMT promoter methylation status (0: not methylated; 1: methylated), patient age (in years), and normalized expression values of 6 genes. Kaplan–Meier survival analysis of (B) IvyGAP CT samples, (C) CT-predicted TCGA samples, (D) all IvyGAP samples, and (E) all TCGA samples dichotomized into high- and low-risk groups by MGMT methylation (left) and predicted HR (right). For MGMT methylation, survival was evaluated by separating samples into methylated (low-risk) or unmethylated (high-risk) groups. Tertiles of HR values were used to risk stratify (high-risk: HR > quantile (⅔); low-risk: HR < quantile (⅔)). Shading on survival lines correspond to 95% confidence intervals. *MGMT promoter methylation status. **Samples predicted to be predominantly CT.