| Literature DB >> 36182941 |
K H Brian Lam1,2,3, Phedias Diamandis4,5,6,7.
Abstract
Glioblastoma is often subdivided into three transcriptional subtypes (classical, proneural, mesenchymal) based on bulk RNA signatures that correlate with distinct genetic and clinical features. Potential cellular-level differences of these subgroups, such as the relative proportions of glioblastoma's hallmark histopathologic features (e.g. brain infiltration, microvascular proliferation), may provide insight into their distinct phenotypes but are, however, not well understood. Here we leverage machine learning and reference proteomic profiles derived from micro-dissected samples of these major histomorphologic glioblastoma features to deconvolute and estimate niche proportions in an independent proteogenomically-characterized cohort. This approach revealed a strong association of the proneural transcriptional subtype with a diffusely infiltrating phenotype. Similarly, enrichment of a microvascular proliferation proteomic signature was seen within the mesenchymal subtype. This study is the first to link differences in the cellular pathology signatures and transcriptional profiles of glioblastoma, providing potential new insights into the genetic drivers and poor treatment response of specific subsets of glioblastomas.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36182941 PMCID: PMC9526702 DOI: 10.1038/s41597-022-01716-5
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Deconvolution and estimation of histomorphological niches within bulk glioblastoma proteomes. (A) Schematic overview of our methodology to estimate niche proportions using reference microdissected proteomic profiles and classifying bulk tumor samples via a random forest algorithm. Hematoxylin and Eosin (H&E) images detailing the anatomical niches within GBM: leading edge (LE), infiltrating tumor (IT), cellular tumor (CT), microvascular proliferations (MVP), and palisading cells around necrosis (PAN). (B) Multidimensional scaling of CPTAC samples based on all proteins using principal component analysis highlights distinct grouping of TCGA subtypes (n = 110). (C) Gene Set Enrichment Analysis (GSEA) based on all samples and their comparisons against other sample types highlights similarities in pathways between the Normal brain samples and the proneural subgroup. Normalized enrichment score (NES) is derived from the GSEA output and accounts for differences in gene set size and in correlations between gene sets in the expression dataset. (D) Random forest algorithm trained on a proteomic dataset of histomorphological features classifies CPTAC proteomic samples into niche like signatures. Cases are classified into niches based on the major niche contribution. The machine learning classifications on the X-axis represent the most abundant feature. (E) A stacked bar chart highlights the variability of decision tree probabilities across the tumors and normal brain samples (n = 108). Machine learning classified proteomes show concordance with H&E slide images for (F) LE, (G) MVP, (H) CT, (I) IT, and (J) PAN -like signatures. These H & E images are representative sections and not whole slide images. Source data are provided as a Source Data file.
Fig. 2Association of the proneural subgroup with the infiltrating tumor phenotype. Differentially enriched protein (DEP) analysis by volcano plot comparing (A) randomized groupings of tumor samples (n = 86, randomly distributed into two groups of 43) and (B) machine learning classified tumor samples highlights distinct phenotypic tumors (n = 86, FDR 0.05, S0 > 0.1). (C) Unsupervised hierarchical clustering of CPTAC proteomic samples by Pearson correlation utilizing all proteins highlights an association between the infiltrative-like signature and the proneural subgroup (n = 110). (D) Multidimensional scaling of CPTAC samples based on all proteins using principal component analysis highlights increasing abundances of the synaptic marker SYN from left to right (n = 110). (E) Distribution of expected and observed abundances of IT-like signature tumor samples based on the total number of samples identified as IT-like signature tumors by the random forest classifier. (F) Comparison of CAMK2B by boxplot highlights enrichment within the proneural subgroup against other tumor subtypes; proneural vs mesenchymal (FDR = 3.97e-12), proneural vs classical (FDR = 5.18e-11), proneural vs IDH mutant (FDR = 0.86). (G) Comparison of SYP by boxplot highlights enrichment within the proneural subgroup against other tumor subtypes; proneural vs mesenchymal (FDR = 1.30e-12), proneural to classical (FDR = 4.15e-11), proneural to IDH mutant (FDR = 1). (H) Comparison of SNAP25 by boxplot highlights enrichment within the proneural subgroup against other tumor subtypes; proneural vs mesenchymal (FDR = 8.51e-18), proneural vs classical (FDR = 8.01e-12), proneural vs IDH mutant (FDR = 0.36). (I) Comparison of NEFH by boxplot highlights enrichment within the proneural subgroup against other tumor subtypes; proneural vs mesenchymal (FDR = 7.71e-5), proneural vs classical (FDR = 9.56e-8), proneural vs IDH mutant (FDR = 1). Data are presented as median values +/− IQR and min/max values (whiskers). P values were first calculated based on proteins from all samples utilizing a one-tailed t-test and then adjusted for the Benjamini-Hochberg correction (n = 108). (J) Schematic summary of the various features of the TCGA GBM subgroups, including genetics, clinical and histomorphologic correlates. Source data are provided as a Source Data file.