| Literature DB >> 32642694 |
L C Stetson1, Quinn T Ostrom2,3, Daniela Schlatzer4, Peter Liao1, Karen Devine1, Kristin Waite1,5, Marta E Couce6, Peggy L R Harris7, Amber Kerstetter-Fogle7, Michael E Berens8, Andrew E Sloan1,7, Mohammad M Islam9, Vilashini Rajaratnam9, Shama P Mirza9, Mark R Chance1,4, Jill S Barnholtz-Sloan1,5.
Abstract
BACKGROUND: Improving the care of patients with glioblastoma (GB) requires accurate and reliable predictors of patient prognosis. Unfortunately, while protein markers are an effective readout of cellular function, proteomics has been underutilized in GB prognostic marker discovery.Entities:
Keywords: glioblastoma; mass spectrometry; proteomics; survival
Year: 2020 PMID: 32642694 PMCID: PMC7212893 DOI: 10.1093/noajnl/vdaa039
Source DB: PubMed Journal: Neurooncol Adv ISSN: 2632-2498
Figure 1.Experimental schematic for discovery, verification, and multisample datasets. In the discovery dataset protein and mRNA were isolated from 27 snap-frozen GB samples from the Ohio Brain Tumor Study, RNAseq and LC-MS/MS were performed on the samples. In the publicly available verification dataset protein was extracted using SDS-PAGE fractionation and quantified using LC-MS/MS from 8 snap-frozen GB samples. Finally, using a multisampling approach 3 distinct samples were taken from 6 different GB patients and a total of 18 samples were quantified using LC-MS/MS.
Hallmark Clinical Characteristics of 3 Independent GB Study Datasets
| Discovery Dataset | HerouxMirza Dataset | Multisample Dataset | ||||
|---|---|---|---|---|---|---|
| STS ( | LTS ( | STS ( | LTS ( | STS ( | LTS ( | |
| Median age at diagnosis (range) | 58 (39–77) | 61 (48–83) | 64.5 (53–67) | 56.5 (48–65) | 67.5 (64–74) | 59 (54–64) |
| Median overall survival (months) (range) | 5.98 (3.37–10.12) | 25.59 (19.70–66.51) | 4.77 (1.27–9.70) | 28.09 (20.47–35.70) | 6.51 (1.65–7.34) | 30.87 (18.50–43.24) |
| Male ( | 11 | 5 | 2 | 1 | 2 | 1 |
| Concurrent radiation and temozolomide ( | 12 | 14 | NA | NA | 1 | 2 |
| IDH mutation ( | 1 (NT = 6) | 1 (NT = 8) | NT | NT | 0 (NT = 2) | 0 (NT = 2) |
| MGMT methylation ( | 5 (NT = 5) | 5 (NT = 6) | 0 | 1 | 1 (NT = 2) | 0 (NT = 2) |
| KPS ≥ 70 ( | 3 (missing = 3) | 11 (missing = 3) | 4 (missing = 2) | 0 (missing = 2) | 0 (missing = 2) | 1 (missing = 1) |
NT, not tested; NA, not available.
Figure 2.(A) Heatmap shows the z-score for the 469 proteins that were significantly differentially abundant (FDR P-value < .05) between STS and LTS in the discovery dataset. Individual patient samples are in columns. A patient’s survival (STS, OS ≤10 months or LTS, OS ≥18 months) is indicated by color-coded labeling in horizontal bars above the heatmap. (B) Volcano plot shows differential protein abundance between STS and LTS in the discovery dataset, where −logFC indicates increased protein abundance in STS and +logFC indicates increased protein abundance in LTS. Each dot represents a protein and they are color coded as follows: gray (not significant), green (logFC >1), blue (P < .05), and red (logFC > 1 and P < .05). (C) Heatmap shows the z-score for the 67 proteins that were significantly differentially abundant (FDR P-value < .05) between STS and LTS in the discovery dataset and verification dataset. Individual patient samples are in columns. A patient’s survival (STS, OS ≤10 months or LTS, OS ≥18 months) is indicated by color-coded labeling in horizontal bars above the heatmap.
Patient-Level mRNA–Protein Pearson’s Correlation for the Discovery Dataset
| Patient ID |
| Pearson Correlation Coefficient |
|---|---|---|
| LTS_1 | 1.43E-23 | 0.20 |
| LTS_2 | 3.74E-19 | 0.18 |
| LTS_4 | 1.47E-21 | 0.19 |
| LTS_5 | 1.92E-23 | 0.20 |
| LTS_6 | 6.79E-27 | 0.22 |
| LTS_7 | 2.17E-24 | 0.21 |
| LTS_8 | 2.60E-22 | 0.20 |
| LTS_9 | 1.61E-31 | 0.24 |
| LTS_10 | 3.66E-18 | 0.18 |
| LTS_11 | 1.37E-27 | 0.22 |
| LTS_12 | 1.90E-32 | 0.24 |
| LTS_13 | 2.87E-35 | 0.25 |
| LTS_14 | 1.11E-18 | 0.18 |
| STS_1 | 2.43E-26 | 0.22 |
| STS_2 | 4.33E-30 | 0.23 |
| STS_3 | 3.02E-30 | 0.23 |
| STS_5 | 1.08E-45 | 0.29 |
| STS_6 | 4.95E-28 | 0.22 |
| STS_7 | 1.81E-20 | 0.19 |
| STS_11 | 7.84E-35 | 0.25 |
Figure 3.Scatter plot of logFC values differential expression in the discovery dataset for both RNAseq and protein abundance. For 2369 gene/protein pairs the logFC in the RNAseq data is plotted versus the logFC in the protein LC/MS-MS data. Each dot represents a gene/protein and they are color coded relative to the correlation between protein abundance changes and gene expression changes, where blue is no change (−1 < logFC < 1 for both datasets), green is concordant (logFC < −1 or logFC > 1 for both datasets), and red is discordant (logFC < −1 for protein and logFC > 1 for gene expression or vice versa).
Figure 4.(A) Hierarchical clustering of GB samples in the multisample dataset using z-score of 2256. Samples are in columns and are color coded based on the patient, patient’s overall survival (STS, OS <10 months or LTS, OS >16 months), and tumor sample location (solid tumor, infiltrated brain, necrotic core, or enhancing margin). (B) Multidimensional scaling plot shows the distribution of samples from the multisample dataset based on the patient’s overall survival (top), patient sample (middle), and tumor sample location (bottom).