| Literature DB >> 35668106 |
Peter Georgeson1,2, Tabitha A Harrison3, Bernard J Pope1,2,4, Syed H Zaidi5, Conghui Qu3, Robert S Steinfelder3, Yi Lin3, Jihoon E Joo1,2, Khalid Mahmood1,2,4, Mark Clendenning1,2, Romy Walker1,2, Efrat L Amitay6, Sonja I Berndt7, Hermann Brenner6,8,9, Peter T Campbell10, Yin Cao11,12,13, Andrew T Chan14,15,16,17,18,19, Jenny Chang-Claude20,21, Kimberly F Doheny22, David A Drew14,15, Jane C Figueiredo23,24, Amy J French25, Steven Gallinger5,26, Marios Giannakis17,27,28, Graham G Giles29,30,31, Andrea Gsur32, Marc J Gunter33, Michael Hoffmeister6, Li Hsu3,34, Wen-Yi Huang7, Paul Limburg35, JoAnn E Manson18,28, Victor Moreno36,37,38,39, Rami Nassir40, Jonathan A Nowak41, Mireia Obón-Santacana36,37, Shuji Ogino17,18,41,42, Amanda I Phipps3,43, John D Potter3,44, Robert E Schoen45, Wei Sun3, Amanda E Toland46, Quang M Trinh5, Tomotaka Ugai18,41, Finlay A Macrae47,48,49, Christophe Rosty1,2,50,51, Thomas J Hudson5, Mark A Jenkins2,30, Stephen N Thibodeau25, Ingrid M Winship49,52, Ulrike Peters3,43, Daniel D Buchanan53,54,55.
Abstract
Carriers of germline biallelic pathogenic variants in the MUTYH gene have a high risk of colorectal cancer. We test 5649 colorectal cancers to evaluate the discriminatory potential of a tumor mutational signature specific to MUTYH for identifying biallelic carriers and classifying variants of uncertain clinical significance (VUS). Using a tumor and matched germline targeted multi-gene panel approach, our classifier identifies all biallelic MUTYH carriers and all known non-carriers in an independent test set of 3019 colorectal cancers (accuracy = 100% (95% confidence interval 99.87-100%)). All monoallelic MUTYH carriers are classified with the non-MUTYH carriers. The classifier provides evidence for a pathogenic classification for two VUS and a benign classification for five VUS. Somatic hotspot mutations KRAS p.G12C and PIK3CA p.Q546K are associated with colorectal cancers from biallelic MUTYH carriers compared with non-carriers (p = 2 × 10-23 and p = 6 × 10-11, respectively). Here, we demonstrate the potential application of mutational signatures to tumor sequencing workflows to improve the identification of biallelic MUTYH carriers.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35668106 PMCID: PMC9170691 DOI: 10.1038/s41467-022-30916-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Overview of the analysis steps and groups of CRC tumor sequencing data included in the study, totaling 5649 CRCs.
The SBS18/SBS36 TMS threshold was established using 102 CRCs down-sampled from whole-exome sequenced (WES) to intersect with the 1.34 Mb capture used to sequence the CRC tumors in the validation set. The 2528 CRCs sequenced with 1.34 Mb capture as part of the validation set were used to refine the SBS18/SBS36 classifier by including the somatic mutation count and TMS reconstruction error. The accuracy of the refined classifier was assessed using 3019 CRC tumors sequenced with a 1.96 Mb capture as part of the test set. The refined classifier was subsequently applied to 79 CRCs from monoallelic MUTYH pathogenic variant carriers, and CRCs defined as potential MUTYH biallelics and MUTYH uncertain status to determine its utility in variant classification. CI confidence interval, CIDR Center for Inherited Disease Research, CRC colorectal cancer, GECCO Genetic Epidemiology of Colorectal cancer Consortium, Mb megabase, OICR Ontario Institute of Cancer Research, PV pathogenic variant, SBS single bases substitution, TMS tumor mutational signature, VUS variant of uncertain clinical significance.
Fig. 2Observed tumor mutational signature profiles for 19 CRCs from germline biallelic MUTYH pathogenic variant carriers, and 17 CRCs carrying more than one MUTYH pathogenic or potentially pathogenic variant but not two pathogenic variants (potential biallelic).
All the CRCs from the germline biallelic MUTYH PV carriers exhibit dominant SBS18 and/or SBS36 tumor mutational signature. Source data are provided as a Source Data file.
Fig. 3The distribution of SBS18/SBS36 tumor mutational signature, reconstruction error and somatic single nucleotide variant (SNV) count by the five tumor classification categories.
Distribution of a SBS18/SBS36 tumor mutational signature, b tumor mutational signature reconstruction error, and c adjusted somatic SNV count across 5649 CRCs in the training set, validation set, and test set, grouped by germline pathogenic variant status (tumor classifications). The red horizontal line in each figure indicates the cut-offs that were determined based on the training set and validation set tumors. All boxes correspond to the 25th and 75th percentiles and the whiskers represent 1.5× the inter-quartile range (IQR) extending from the boxes. Lines at the middle of each box show the median. Individual observations are shown beyond the whiskers. Source data are provided as a Source Data file.
Fig. 4Distribution of SBS18/SBS36, somatic single nucleotide variants (SNVs), and tumor mutational signature (TMS) reconstruction error across CRCs from training, validation, and test sets.
a The CRCs from the biallelic MUTYH pathogenic variant carriers cluster together based on high SBS18/SBS36 TMS and low TMS reconstruction error highlighting the need to include TMS reconstruction error in classifier, and b CRCs with greater than 95% likelihood of arising from biallelic MUTYH pathogenic variants based on TMS. The number of SNV mutations used in determining TMS (horizontal axis) and the TMS reconstruction error (vertical axis) demonstrates the importance of low reconstruction error (<39%) and sufficient somatic mutation count (≥9) for correctly classifying tumors from biallelic MUTYH pathogenic variant carriers (true positives). Source data are provided as a Source Data file.
Participants categorized into the potential MUTYH biallelic group, based on either carrying a germline pathogenic variant and one or more VUSs, or multiple VUSs.
| ID | AgeDx | Sex | Source | ClinVar | GnomAD | CADD | REVEL | SBS18/36 (%) | Error (%) | Somatic mutations | TMS-based prediction | Variant reclassification | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C2904 | 70–79 | F | Germline Germline | c.91delG p.A31PfsTer27 c.1141G>T p.G381W | PV VUS | None None | 23.0 23.8 | None 0.521 | 62 | 38.1 | 15 | Positive | VUS->PV |
| O1016 | 40–49 | F | Germline Germline | c.536A>G p.Y179C c.577-5A>G | PV VUS | 0.001 None | 24.7 16.9 | 0.963 None | 56.1 | 17.1 | 14 | Positive | VUS->PV |
| C5024 | 50–59 | M | Germline Germline Germline | c.536A>G p.Y179C c.933+3A>C c.1465G>A p.A489T | PV VUS VUS | 1.5 × 10−3 6.5 × 10−5 6.5 × 10−5 | 24.7 6.4 29.4 | 0.963 None 0.724 | 76.7 | 17.1 | 26 | Positive | None |
| C0302 | 60–69 | M | Germline Germline | c.1187G>A p.G396D c.1276C>T p.R426C | PV VUS | 0.003 0.001 | 29.4 22.9 | 0.551 0.615 | 46.7 | 23.9 | 11 | Negative | VUS->Benign |
| C4622 | 40–49 | M | Germline Germline | c.1187G>A p.G396D c.912C>G p.S304R | PV VUS | 0.003 3 × 10−5 | 29.4 12.8 | 0.551 0.229 | 26.4 | 45.4 | 9 | Negative | VUS->Benign |
| O1569 | 30–39 | F | Germline Germline | c.1187G>A p.G396D c.821G>A p.R274Q | PV VUS | 0.003 0.0002 | 29.4 33 | 0.551 0.229 | 24.9 | 45.8 | 10 | Negative | VUS->Benign |
| C5299 | 40–49 | M | Germline Somatic | c.1187G>A p.G396D c.1596C>A p.F532L | PV VUS | 0.003 2.8 × 10−5 | 29.4 12.3 | 0.551 0.063 | 8.7 | 1.5 | 175 | Negative | None |
| C4551 | 40–49 | F | Germline Somatic | c.389-1G>A c.926G>A p.R309H | PV VUS | 1.2 × 10−5 0.0005 | 2.3 13.9 | 0.592 0.293 | 0 | 3.6 | 120 | Negative | None |
| O0497 | 70–79 | F | Germline Somatic | c.1187G>A p.G396D c.607C>T p.R203C | PV VUS | 0.003 3.7 × 10−5 | 29.4 23.9 | 0.551 0.358 | 0 | 4.6 | 19 | Negative | None |
| C1060 | 70–79 | M | Germline Germline | c.1465G>A p.A489T c.933+3A>C | VUS VUS | 6.5 × 10−5 6.5 × 10−5 | 6.4 29.4 | None 0.724 | 0 | 50.3 | 4 | Negative | None |
| O2193 | 60–69 | F | Germline Germline | c.1431G>C p.T477T c.932G>A p.R311K | VUS VUS | 0.006 2.6 × 10−4 | 4.6 5.3 | 0.039 0.24 | 53 | 54.7 | 9 | Negative | None |
| O3288 | 80–89 | F | Germline Germline | c.1420C>T p.R474C c.603G>T p.M201I | VUS VUS | 3.2 × 10−5 None | 23.2 16.8 | 0.546 0.262 | 9.9 | 41.7 | 10 | Negative | None |
| O3593 | 60–69 | F | Germline Germline | c.1276C>T p.R426C c.389-13C>G | VUS VUS | 0.001 6.5E−5 | 22.9 14.9 | 0.615 None | 9 | 10.7 | 251 | Negative | None |
| O1434 | 60–69 | M | Germline | c.925C>T p.R309C (H) | VUS | 5.4 × 10−4 | 13.90 | 0.592 | 0.9 | 25.8 | 14 | Negative | VUS->Benign |
| O3484 | 30–39 | M | Germline | c.925C>T p.R309C (H) | VUS | 5.5 × 10−4 | 13.90 | 0.592 | 0 | 39.4 | 4 | Negative | VUS->Benign |
| C1701 | 30–39 | F | Germline Germline Germline Germline | c.1431G>C p.T477T (H) c.74G>A p.G25D c.53C>T p.P18L c.165+37_1650+39delGTT | VUS VUS VUS VUS | 0.006 1.1 × 10−3 1.1 × 10−3 None | 4.6 14.2 16.7 12.8 | 0.039 0.111 0.2 None | 51.6 | 39.8 | 6 | Negative | VUS->Benign |
| O3625 | 70–79 | M | Somatic Somatic | c.1286G>T p.G429V c.404T>A p.V135D | None None | None None | 22.3 28.9 | 0.658 0.898 | 3.2 | 6.6 | 86 | Negative | None |
The characteristics of participants and each of the variants identified including ClinVar classification, CADD and REVEL prediction scores, and gnomAD allele frequency, as well as the features of the optimized classifier: SBS18 + SBS36 (>51% for positivity), TMS reconstruction error (<39% for positivity), and somatic mutation count (≥9 for positivity) and the TMS-based pathogenicity prediction (positive for biallelic inactivation, negative for no biallelic inactivation). We indicate the seven VUSs that the classifier provides evidence for reclassification as either likely pathogenic or likely benign. AgeDx age of diagnosis, PV pathogenic variant, TMS tumor mutational signature, VUS variant of uncertain significance, (H) homozygous for germline variant; CADD score >20.0 or REVEL score >0.6 considered predicted pathogenic.
Fig. 5Somatic mutation landscape of the 19 CRCs from biallelic MUTYH pathogenic variant carriers, as well as the 3 CRCs from carriers of variants of uncertain clinical significance that were reclassified as likely pathogenic in this study.
The 40 most commonly mutated CRC genes[48] are included, as well as known CRC genes ALK, CSMD1, POLE, and POLD1. KRAS was found to be significantly more commonly mutated in our biallelic MUTYH carrier CRCs. Source data are provided as a Source Data file. AgeDx age of diagnosis, TMB tumor mutational burden (mutations/Mb), TMS tumor mutational signature.
Significantly enriched individual somatic mutations, as well as genes significantly affected by non-synonymous somatic mutations, observed in more than two MUTYH positive tumors.
| Gene | Variant (context) | SBS18/36 relative likelihood ( | |||
|---|---|---|---|---|---|
| c.34G>T p.G12C (CCA>A) | 16/19 (84%) | 127/5364 (2.4%) | 2 × 10−23 | 62% | |
| c.1636C>A p.Q546K (GCA>A) | 7/19 (37%) | 36/5364 (0.7%) | 6 × 10−11 | 83% | |
| Gene-wide | 17/19 (89%) | 2025/5364 (38%) | 5 × 10−6 | 58% vs 17% | |
| Gene-wide | 9/19 (47%) | 592/5364 (11%) | 8 × 10−5 | 35% vs 12% | |
| Gene-wide | 10/19 (53%) | 934/5364 (17%) | 5 × 10−4 | 60% vs 12% | |
| Gene-wide | 3/19 (16%) | 55/5364 (1.0%) | 1 × 10−3 | 42% vs 20% | |
| Gene-wide | 5/19 (26%) | 420/5364 (8%) | 0.01 | 36% vs 13% | |
| Gene-wide | 6/19 (32%) | 638/5364 (12%) | 0.02 | 36% vs 13% | |
| Gene-wide | 4/19 (21%) | 308/5364 (6%) | 0.02 | 53% vs 13% | |
| Gene-wide | 17/19 (89%) | 3468/5364 (65%) | 0.03 | 45% vs 18% | |
| Gene-wide | 4/19 (21%) | 388/5352 (7%) | 0.045 | 47% vs 13% |
Somatic mutations observed in the significantly enriched genes in MUTYH positives were more often associated with the trinucleotide contexts related to the SBS18/36 tumor mutational signatures (TMS) as measured by the SBS18/36 relative likelihood. P-values were calculated with Fisher’s exact test (two-sided).