| Literature DB >> 34830758 |
Christopher P Wardell1,2, Emilie Darrigues2,3, Annick De Loose3, Madison P Lee3, Murat Gokden4, Issam Makhoul2,5, Alan J Tackett2,6, Analiz Rodriguez1,2,3.
Abstract
Brain metastases (BM) are the most common brain tumors in adults occurring in up to 40% of all cancer patients. Multi-omics approaches allow for understanding molecular mechanisms and identification of markers with prognostic significance. In this study, we profile 130 BM using genomics and transcriptomics and correlate molecular characteristics to clinical parameters. The most common tumor origins for BM were lung (40%) followed by melanoma (21%) and breast (15%). Melanoma and lung BMs contained more deleterious mutations than other subtypes (p < 0.001). Mutational signatures suggested that the bulk of the mutations were gained before metastasis. A novel copy number event centered around the MCL1 gene was found in 75% of all samples, suggesting a broader role in promoting metastasis. Unsupervised hierarchical cluster analysis of transcriptional signatures available in 65 samples based on the hallmarks of cancer revealed four distinct clusters. Melanoma samples formed a distinctive cluster in comparison to other BM subtypes. Characteristics of molecular profiles did not correlate with survival. However, patients with self-identified black race or those who did not receive radiation correlated with poor survival. These data identify potential new drivers of brain metastatic progression. Our data also suggest further investigation of sociodemographic and clinical features is needed in BM cohorts.Entities:
Keywords: bioinformatics; brain metastases; genomics; transcriptomics
Year: 2021 PMID: 34830758 PMCID: PMC8615723 DOI: 10.3390/cancers13225598
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Patient characteristics. The UAMS and dbGaP data sets were of similar sizes and compositions.
| Feature (%) | All Patients ( | UAMS Patients ( | dbGaP Data Set ( |
|---|---|---|---|
| Sex: | |||
| Female | 76 (58%) | 34 (50%) | 42 (68%) |
| Male | 54 (42%) | 34 (50%) | 20 (32%) |
| Age (mean, standard deviation, range) | 57, 11.5, 19–82 | 58, 12.3, 19–82 | 56, 10.6, 35–80 |
| Smoking | 82 (63%) | 41 (60%) | 41 (66%) |
| Prior radiation | 23 (18%) | 13 (19%) | 10 (16%) |
| Disease origin | |||
| Breast | 27 (21%) | 10 (15%) | 17 (27%) |
| Lung | 57 (44%) | 27 (40%) | 30 (48%) |
| Melanoma | 16 (12%) | 14 (21%) | 2 (3%) |
| Other | 30 (23%) | 17 (25%) | 13 (21%) |
| Median survival | NA | 372 days, 95% CI 299–555 | NA |
Figure 1Oncoplot of frequently mutated genes in BM. The 17 significantly mutated genes were selected, and ordered by their overall frequency. Each column represents a single sample and each row a gene, with samples grouped by subtype. The barplot above and to the right represent the total number of deleterious mutations in each sample and the type of mutations detected in each gene, respectively. The stacked barplot at the bottom shows the contributions of each of the 6 mutation classes in each sample.
Figure 2Mutational signatures detected in BM samples. Hierarchical clustering of the contribution of the three signatures suggests that most mutations were acquired before metastasis occurred, as they reflect the etiology of the primary disease (UV exposure for melanoma and tobacco exposure for lung cancers), with the remainder dominated by an age-related signature.
Figure 3(A): Copy number events detected in BM. Red denotes gain and blue loss, with broad events coloring the entire arm and focal events represented using colored bars at the site of the events. Only affected chromosomes are shown. Candidate genes for focal events are labelled and events only identified in the dbGaP data set are labelled with an asterisk. Detailed tabulation is shown in Table 2 and Table 3. (B): Correlation plot of copy number events. Positive correlations with significant p-values after Bonferroni multiple testing correction (p < 1.7 × 10−4) were found for multiple pairs of features and were not restricted to specific subtypes. Pearson correlation is labelled with a color scheme from negative (red) to positive (blue) and larger colored squares represent smaller p-values.
Broad focal copy number events in BM.
| Description | Total Frequency ( | Frequency UAMS ( | Frequency dbGaP ( | |||
|---|---|---|---|---|---|---|
| Gain1q | 1.69 × 10−7 | 4.13 × 10−4 | 3.84 × 10−2 | 55% | 56% | 53% |
| Del9p | 3.66 × 10−7 | 1.03 × 10−5 | 3.35 × 10−2 | 67% | 72% | 58% |
| Del9q | 6.15 × 10−4 | 5.14 × 10−2 | 1.98 × 10−3 | 53% | 49% | 60% |
| Del10q | 4.12 × 10−3 | 5.14 × 10−2 | 1.20 × 10−2 | 51% | 49% | 61% |
| Del17p | 7.16 × 10−4 | 4.22 × 10−2 | 1.98 × 10−3 | 58% | 51% | 73% |
| Del19q | 2.78 × 10−4 | 5.14 × 10−2 | 3.35 × 10−2 | 38% | 31% | 50% |
| Del22q | 1.11 × 10−7 | 5.14 × 10−2 | 5.05 × 10−5 | 57% | 38% | 74% |
Significant focal copy number events in BM, in order of overall frequency. Five features listed at the bottom of the table were only detectable in the comparison data set due to the coverage of the DNA capture used.
| Description | Genomic Coordinates (hg19) | Size (Megabases) | RefSeq Genes in Window | Candidate Gene | Total Frequency ( | Frequency UAMS ( | Frequency in dbGaP Data ( | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Gain1q21.3 | chr1:149459146–151435225 | 2.0 | Many | MCL1 | 1.78 × 10−7 | 1.83 × 10−4 | 3.25 × 10−3 | 75% | 78% | 73% |
| Gain8q24.21 | chr8:128600697–128904379 | 0.3 | 3 | MYC | 2.17 × 10−3 | 4.50 × 10−5 | NA | 68% | 69% | 68% |
| Del9p21.3 | chr9:21853980–22169008 | 0.3 | 2 | CDKN2A | 3.76 × 10−14 | 2.73 × 10−3 | 6.30 × 10−11 | 67% | 72% | 61% |
| Del10q26.2 | chr10:129878108–131344459 | 1.5 | 2 | MGMT | 2.33 × 10−6 | 7.26 × 10−4 | NA | 60% | 65% | 55% |
| Gain7q31.2 | chr7:116187728–116418819 | 0.2 | 2 | MET | 3.62 × 10−3 | 3.89 × 10−4 | NA | 56% | 57% | 55% |
| Gain7p11.2 | chr7:55220368–55419259 | 0.2 | 2 | EGFR | 7.99 × 10−3 | 6.91 × 10−4 | NA | 55% | 60% | 50% |
| Del11p15.4 | chr11:3677960–3833468 | 0.2 | 4 | NUP98 | 3.80 × 10−4 | 7.82 × 10−2 | 3.10 × 10−3 | 48% | 50% | 47% |
| Del4q35.2 | chr4:187489123–187657360 | 0.2 | 1 | FAT1 | 5.42 × 10−7 | 1.00 × 10−2 | 4.67 × 10−4 | 48% | 60% | 35% |
| Gain17q12 | chr17:37856575–38035342 | 0.2 | 4 | ERBB2 | 9.90 × 10−15 | 1.88 × 10−9 | 3.84 × 10−7 | 46% | 54% | 37% |
| Gain19q12 | chr19:30313521–30466344 | 0.2 | 2 | CCNE1 | 4.20 × 10−2 | 4.77 × 10−4 | NA | 42% | 53% | 31% |
| Gain14q13.2 | chr14:35873861–36509568 | 0.6 | 4 | NFKBIA | 1.09 × 10−5 | 3.96 × 10−3 | 1.34 × 10−2 | 42% | 47% | 35% |
| Gain11q13.3 | chr11:68944739–69588069 | 0.6 | 4 | CCND1 | 4.19 × 10−7 | 1.79 × 10−4 | NA | 41% | 43% | 39% |
| Gain3q26.2 | chr3:168496014–169105035 | 0.6 | 2 | MECOM * | 1.78 × 10−7 | NA | 5.62 × 10−6 | 79% | ||
| Del22q13.33 | chr22:47340083–51304566 | 4.0 | Many | Unknown * | 7.93 × 10−4 | NA | 9.82 × 10−4 | 79% | ||
| Del19p13.3 | chr19:1595678–2171104 | 0.6 | Many | Unknown * | 3.06 × 10−4 | NA | 1.80 × 10−4 | 73% | ||
| Del21q22.2 | chr21:40162170–40831414 | 0.7 | 5 | PSMG1 * | 1.25 × 10−6 | NA | 7.37 × 10−7 | 63% | ||
| Del14q32.13 | chr14:96158774–99640737 | 3.5 | Many | VRK1 * | 1.85 × 10−14 | NA | 2.52 × 10−13 | 45% |
Figure 4(A): Unsupervised hierarchical clustering of expression signatures based on the hallmarks of cancer in 65 UAMS BM samples. Scores are scaled between 0 (blue) and 1 (red), with higher values representing higher transcript abundances. Four clusters were formed. Notably, clusters C and D are predominantly melanoma and lung BM samples, respectively, and cluster A has a very transcriptionally active phenotype. (B): Principal component analysis of RNA sequencing data from 65 UAMS samples. Letters denote the RNA cluster from Figure 4A and colors denote the origin of disease. Melanoma BMs (orange letters) form a distinct cluster in both analyses, but the hallmark clusters are not replicated in the PCA, suggesting a genuine biological origin and not a batch effect or simple reflection of the origin of metastases. Signature names and references are detailed in order in Table S3.
Figure 5(A–D) Kaplan–Meier plots of overall survival for clinical features with univariate significance. (E): Forest plot of the hazard ratios (HR) for a multivariate Cox proportional hazards survival model (HR followed by 95% confidence interval in parentheses).