| Literature DB >> 29765148 |
Marieke Lydia Kuijjer1,2, Joseph Nathaniel Paulson3,4,5, Peter Salzman6, Wei Ding7, John Quackenbush3,4,8.
Abstract
BACKGROUND: With the onset of next-generation sequencing technologies, we have made great progress in identifying recurrent mutational drivers of cancer. As cancer tissues are now frequently screened for specific sets of mutations, a large amount of samples has become available for analysis. Classification of patients with similar mutation profiles may help identifying subgroups of patients who might benefit from specific types of treatment. However, classification based on somatic mutations is challenging due to the sparseness and heterogeneity of the data.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29765148 PMCID: PMC5988673 DOI: 10.1038/s41416-018-0109-7
Source DB: PubMed Journal: Br J Cancer ISSN: 0007-0920 Impact factor: 7.640
Fig. 1Mutated genes and pathways across the 23 cancer types. a, b Violin plots visualising the distribution of a the number of mutated genes and b the number of mutated pathways, for each of the 23 cancer types. c, d The fraction of mutated genes (c) and pathways (d) plotted against the number of samples in each cancer type. R: Pearson’s correlation coefficient, p: Pearson’s correlation test p-value
Fig. 2Distance between samples using different dissimilarity metrics. Boxplots visualise the median distance between samples using a gene mutation scores and b pathway mutation scores across the 23 cancer types. The line in the centre of the box denotes the median, the box edges denote the first and third quartiles of the data and the error bars denote 1.5× the interquartile range
Fig. 3De-sparsified mutation data retains histological and other clinical information that might define cancer subtypes. a Word cloud of the most associated variables with the top 5 principal components across the 23 cancer types. b First two coordinates of the principal components for UCEC samples. Each point represents an individual sample. Colours represent the histological type of the cancer, orange being endometrioid endometrial adenocarcinoma, purple being mixed serous and endometrioid and cyan representing serous endometrial adenocarcinoma samples. c Heatmap of the −log10 p-values from PC regression against clinical variables
Fig. 4Identification of prognostic subtypes. a Kaplan–Meier plots depicting overall survival for patients with different prognostic subtypes in ACC, LAML and LGG. Overall survival of patients in the poor prognostic group is shown in red. Plots include the number of samples in each subtype, as well as the log-rank test p-values. b Heatmaps of pathway mutation scores (shown on a log-scale). The subtypes are visualised in the column dendrograms, with the same colour coding as used in a
Drug targets enriched for mutations in cancer-specific subtypes
| Cancer |
| Patients | Enr. | Drugs |
|---|---|---|---|---|
| ACC | 2 | 12/90 | 41.4 | Idasanutlin, nutlin-3 |
| BLCA | 2 | 25/130 | 6.6 | Amuvatinib, BMS-817378, cabozantinib, golvatinib, OSI-930, PD-153035, PLX-4720, ZM-39923 |
| BRCA | 2 | 16/956 | 7.6 | SGC-CBP30 |
| GBM | 2 | 62/280 | 5.4 | Amuvatinib, BMS-817378, cabozantinib, golvatinib, lestaurtinib, OSI-930, PLX-4720 |
| HNSC | 2 | 69/305 | 4.7 | Amuvatinib, AS-703026, atiprimod, AZ-628, AZD1480, baricitinib, CEP-32496, curcumol, DCC-2618, LY2784544, LY3009120, MEK162, PD-198306, peficitinib, refametinib, Ro-5126766, ruxolitinib-(S), TG-101209, trametinib, vemurafenib, XL019, ZM-39923 |
| KIRC | 2 | 19/419 | 9.2 | AG-490 |
| KIRP | 2 | 16/164 | 4.8 | Bortezomib |
| LAML | 2 | 23/181 | 13.2 | Lonafarnib |
| LGG | 2 | 36/287 | 12.6 | Atiprimod, LY2784544, TG-101209 |
| LUSC | 2 | 28/177 | 6.4 | ZM-39923 |
| OV | 3 | 13/445 | 3.3 | Bortezomib |
| PRAD | 2 | 13/239 | 20.7 | SGC-CBP30 |
| READ | 2 | 49/116 | 7.4 | PD-153035 |
| THCA | 2 | 284/357 | 5.6 | Lonafarnib |
| UCS | 2 | 24/57 | 4.3 | Wortmannin |
k: the level at which the subtyping dendrogram was cut, Patients: the number of patients in a subtype, versus the number of patients of that cancer type in our data set, Enr.: enrichment, or the observed over expected ratio, Drugs: drugs of which targets were enriched for mutations in the subtype
Fig. 5Pan-cancer mutation subtyping results. Stacked bar charts of a the number of the different cancer types in each of the nine subtypes, b the number of the different subtypes per cancer type. c Enrichment of subtypes for specific cancer types. Test scores are calculated by multiplying the odds ratio to 1 for significant results (Bonferroni-adjusted p < 0.05) and to 0 for nonsignificant results. Significant scores with odds ratios >4 are visualised in blue. d Four overarching sets of mutations are associated (+) with multiple subtypes. e Word clouds visualise enrichment for specific words associated with each of the four sets of pan-cancer pathways