| Literature DB >> 30412573 |
Rebecca C Poulos1,2, Yuen T Wong1, Regina Ryan1, Herbert Pang3, Jason W H Wong1,4.
Abstract
Driver mutations are the genetic variants responsible for oncogenesis, but how specific somatic mutational events arise in cells remains poorly understood. Mutational signatures derive from the frequency of mutated trinucleotides in a given cancer sample, and they provide an avenue for investigating the underlying mutational processes that operate in cancer. Here we analyse somatic mutations from 7,815 cancer exomes from The Cancer Genome Atlas (TCGA) across 26 cancer types. We curate a list of 50 known cancer driver mutations by analysing recurrence in our cohort and annotations of known cancer-associated genes from the Cancer Gene Census, IntOGen database and Cancer Genome Interpreter. We then use these datasets to perform binary univariate logistic regression and establish the statistical relationship between individual driver mutations and known mutational signatures across different cancer types. Our analysis led to the identification of 39 significant associations between driver mutations and mutational signatures (P < 0.004, with a false discovery rate of < 5%). We first validate our methodology by establishing statistical links for known and novel associations between driver mutations and the mutational signature arising from Polymerase Epsilon proofreading deficiency. We then examine associations between driver mutations and mutational signatures for AID/APOBEC enzyme activity and deficient mismatch repair. We also identify negative associations (odds ratio < 1) between mutational signatures and driver mutations, and here we examine the role of aging and cigarette smoke mutagenesis in the generation of driver mutations in IDH1 and KRAS in brain cancers and lung adenocarcinomas respectively. Our study provides statistical foundations for hypothesised links between otherwise independent biological processes and we uncover previously unexplored relationships between driver mutations and mutagenic processes during cancer development. These associations give insights into how cancers acquire advantageous mutations and can provide direction to guide further mechanistic studies into cancer pathogenesis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30412573 PMCID: PMC6249022 DOI: 10.1371/journal.pgen.1007779
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Cancer types, abbreviations and cohort sizes for the whole-exome sequenced samples analysed in this study.
| Cancer types | Abbreviation | Number of samples | Fraction of cohort (%) |
|---|---|---|---|
| Adrenocortical carcinoma | ACC | 43 | 0.55 |
| Bladder urothelial carcinoma | BLCA | 398 | 5.09 |
| Breast invasive carcinoma | BRCA | 805 | 10.3 |
| Cervical squamous cell carcinoma and endocervical adenocarcinoma | CESC | 296 | 3.79 |
| Cholangiocarcinoma | CHOL | 42 | 0.54 |
| Colon adenocarcinoma (COAD) and rectum adenocarcinoma (READ) | CRC | 553 | 7.08 |
| Lymphoid neoplasm diffuse large B-cell lymphoma | DLBC | 48 | 0.61 |
| Esophageal carcinoma | ESCA | 180 | 2.3 |
| Glioblastoma multiforme | GBM | 362 | 4.63 |
| Head and neck squamous cell carcinoma | HNSC | 490 | 6.27 |
| Kidney renal clear cell carcinoma | KIRC | 290 | 3.71 |
| Kidney renal papillary cell carcinoma | KIRP | 243 | 3.11 |
| Brain lower grade glioma | LGG | 202 | 2.58 |
| Liver hepatocellular carcinoma | LIHC | 355 | 4.54 |
| Lung adenocarcinoma | LUAD | 536 | 6.86 |
| Lung squamous cell carcinoma | LUSC | 487 | 6.23 |
| Mesothelioma | MESO | 46 | 0.59 |
| Ovarian serous cystadenocarcinoma | OV | 420 | 5.37 |
| Pancreatic adenocarcinoma | PAAD | 117 | 1.5 |
| Prostate adenocarcinoma | PRAD | 191 | 2.44 |
| Sarcoma | SARC | 212 | 2.71 |
| Skin cutaneous melanoma | SKCM | 450 | 5.76 |
| Stomach adenocarcinoma | STAD | 418 | 5.35 |
| Thyroid carcinoma | THCA | 52 | 0.67 |
| Uterine corpus endometrial carcinoma | UCEC | 525 | 6.72 |
| Uterine carcinosarcoma | UCS | 54 | 0.69 |
Significant associations between mutational signatures and driver mutations within individual cancer types, alongside proposed aetiology and frequency of the trinucleotide context in the associated mutational signature.
| Mutational signature | Proposed aetiology of mutational signature* | Driver mutation | Protein change | Odds ratio (OR) | Cancer type | Mutated samples in cancer type (%) | Trinucleotide context | Frequency of trinucleotide context in signature | |
|---|---|---|---|---|---|---|---|---|---|
| Amount (%) | Rank | ||||||||
| Signature 1 | Spontaneous deamination of 5-methylcytosine | p.R132H | Less than 1 | GBM | 4.97 | A[C>T]G | 17.16 | High | |
| p.R132H | Less than 1 | LGG | 57.43 | A[C>T]G | 17.16 | High | |||
| Signature 2 | Activity of the AID/APOBEC family of cytidine deaminases | p.E545K | Greater than 1 | LUSC | 3.29 | T[C>T]A | 41.99 | High | |
| p.E545K | Greater than 1 | HNSC | 4.9 | T[C>T]A | 41.99 | High | |||
| p.E545K | Greater than 1 | CESC | 14.19 | T[C>T]A | 41.99 | High | |||
| p.E545K | Greater than 1 | BRCA | 6.34 | T[C>T]A | 41.99 | High | |||
| p.S310F | Greater than 1 | BLCA | 4.77 | T[C>T]C | 8.2 | High | |||
| p.E545K | Greater than 1 | BLCA | 7.29 | T[C>T]A | 41.99 | High | |||
| p.E542K | Greater than 1 | LUSC | 2.26 | T[C>T]A | 41.99 | High | |||
| p.E542K | Greater than 1 | BRCA | 3.85 | T[C>T]A | 41.99 | High | |||
| p.E542K | Greater than 1 | BLCA | 4.52 | T[C>T]A | 41.99 | High | |||
| Signature 6 | Defective DNA mismatch repair | p.H1047R | Greater than 1 | STAD | 3.83 | A[T>C]G | 2.17 | Low | |
| p.G13D | Greater than 1 | UCEC | 2.29 | G[C>T]C | 7.73 | High | |||
| p.G13D | Greater than 1 | STAD | 2.87 | G[C>T]C | 7.73 | High | |||
| p.G12D | Greater than 1 | UCEC | 6.29 | A[C>T]C | 1.63 | Low | |||
| p.V600E | Greater than 1 | CRC | 9.22 | G[T>A]G | 0.06 | Low | |||
| p.R233 | Greater than 1 | UCEC | 4.95 | A[C>T]G | 9.08 | High | |||
| p.H1047R | Greater than 1 | CRC | 3.44 | A[T>C]G | 2.17 | Low | |||
| Signature 7 | Ultraviolet light exposure | p.V600M | Greater than 1 | SKCM | 8.44 | A[C>T]T | 0.43 | Low | |
| Signature 10 | Altered activity of the error-prone polymerase | p.P286R | Greater than 1 | UCEC | 4 | C[C>G]T | 0 | Low | |
| p.R88Q | Greater than 1 | UCEC | 5.9 | T[C>T]G | 21.41 | High | |||
| p.R88Q | Greater than 1 | CRC | 2.53 | T[C>T]G | 21.41 | High | |||
| p.R213 | Greater than 1 | CRC | 2.17 | T[C>T]G | 21.41 | High | |||
| p.R130Q | Greater than 1 | UCEC | 4.76 | T[C>T]G | 21.41 | High | |||
| p.R1989 | Greater than 1 | UCEC | 3.81 | T[C>T]G | 21.41 | High | |||
| Signature 13 | p.P179R | Greater than 1 | UCEC | 4.38 | C[C>G]C | 0.09 | Low | ||
| p.E545K | Greater than 1 | BLCA | 7.29 | T[C>T]A | 11.38 | High | |||
| p.E545K | Greater than 1 | HNSC | 4.9 | T[C>T]A | 11.38 | High | |||
| p.E545K | Greater than 1 | BRCA | 6.34 | T[C>T]A | 11.38 | High | |||
| p.E542K | Greater than 1 | UCEC | 3.24 | T[C>T]A | 11.38 | High | |||
| p.E542K | Greater than 1 | LUSC | 2.26 | T[C>T]A | 11.38 | High | |||
| p.S310F | Greater than 1 | BLCA | 4.77 | T[C>T]C | 1.5 | Low | |||
| p.E545K | Greater than 1 | LUSC | 3.29 | T[C>T]A | 11.38 | High | |||
| Signature 14 | Loss of mismatch repair and polymerase proofreading | p.R130Q | Greater than 1 | UCEC | 4.76 | T[C>T]G | 0.94 | Low | |
| Signature 20 | Loss of mismatch repair and polymerase proofreading | p.G12D | Greater than 1 | UCEC | 6.29 | A[C>T]C | 2.22 | Low | |
| p.N1459S | Greater than 1 | UCEC | 5.14 | A[T>C]T | 0.73 | Low | |||
| Signature 26 | Defective DNA mismatch repair | p.H1047R | Greater than 1 | CRC | 3.44 | A[T>C]G | 5.18 | High | |
| p.V600E | Greater than 1 | CRC | 9.22 | G[T>A]G | 0.14 | Low | |||
| p.R273C | Greater than 1 | UCEC | 2.1 | G[C>T]G | 2.25 | Low | |||
* Proposed aetiology obtained from refs [5] and [9] and ‘Signatures of Mutational Processes in Human Cancer’ curated by the Catalogue of Somatic Mutations in Cancer (COSMIC) database [6, 7];
# List of abbreviations for each cancer type are given in ;
^ Trinucleotide frequencies for mutational signatures are obtained from ‘Signatures of Mutational Processes in Human Cancer’ curated by the COSMIC database [6, 7]. Amount designates the percentage of all mutations in the signature that occur in the trinucleotide context of the specific driver mutation. The rank designates whether that mutation in its trinucleotide context is high (amount > 5%) or low (amount ≤ 5%) in the mutational signature.