| Literature DB >> 31469826 |
Cavin Wong1,2, Fei Chen3, Najmeh Alirezaie4, Yifan Wang1,2, Adeline Cuggia1,2, Ayelet Borgida5, Spring Holter5, Tatiana Lenko1,2, Celine Domecq1,2, Gloria M Petersen6, Sapna Syngal7, Randall Brand8, Anil K Rustgi9, Michele L Cote10, Elena Stoffel11, Sara H Olson12, Nicholas J Roberts13,14, Mohammad R Akbari15,16, Jacek Majewski4, Alison P Klein13,14, Celia M T Greenwood17,18,19, Steven Gallinger5, George Zogopoulos1,2.
Abstract
Pancreatic adenocarcinoma (PC) is a lethal malignancy that is familial or associated with genetic syndromes in 10% of cases. Gene-based surveillance strategies for at-risk individuals may improve clinical outcomes. However, familial PC (FPC) is plagued by genetic heterogeneity and the genetic basis for the majority of FPC remains elusive, hampering the development of gene-based surveillance programs. The study was powered to identify genes with a cumulative pathogenic variant prevalence of at least 3%, which includes the most prevalent PC susceptibility gene, BRCA2. Since the majority of known PC susceptibility genes are involved in DNA repair, we focused on genes implicated in these pathways. We performed a region-based association study using the Mixed-Effects Score Test, followed by leave-one-out characterization of PC-associated gene regions and variants to identify the genes and variants driving risk associations. We evaluated 398 cases from two case series and 987 controls without a personal history of cancer. The first case series consisted of 109 patients with either FPC (n = 101) or PC at ≤50 years of age (n = 8). The second case series was composed of 289 unselected PC cases. We validated this discovery strategy by identifying known pathogenic BRCA2 variants, and also identified SMG1, encoding a serine/threonine protein kinase, to be significantly associated with PC following correction for multiple testing (p = 3.22x10-7). The SMG1 association was validated in a second independent series of 532 FPC cases and 753 controls (p<0.0062, OR = 1.88, 95%CI 1.17-3.03). We showed segregation of the c.4249A>G SMG1 variant in 3 affected relatives in a FPC kindred, and we found c.103G>A to be a recurrent SMG1 variant associating with PC in both the discovery and validation series. These results suggest that SMG1 is a novel PC susceptibility gene, and we identified specific SMG1 gene variants associated with PC risk.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31469826 PMCID: PMC6742418 DOI: 10.1371/journal.pgen.1008344
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Schematic of gene association study design.
Series A, cases at high risk for hereditary PC. Series B, unselected prospectively collected PC cases. WES, whole-exome sequencing. NGS, next-generation sequencing. SNVs, single nucleotide variants. INDELs, insertions/deletions.
List of genes with a p-value ≤ 0.05 in MiST.
| Gene | p-value | q-value |
|---|---|---|
| 0.01463 | 0.1527 | |
| 0.00118 | 0.0404 | |
| 0.01271 | 0.1387 | |
| 0.02218 | 0.1717 | |
| 0.01008 | 0.1338 | |
| 0.03549 | 0.2184 | |
| 0.00770 | 0.1264 | |
| 0.02971 | 0.2128 | |
| 0.01156 | 0.1387 | |
| 0.03025 | 0.2128 | |
| 0.03356 | 0.2151 | |
| 0.00288 | 0.06287 | |
| 0.02212 | 0.1717 | |
| 0.03918 | 0.2187 | |
| 0.01053 | ||
| 0.04734 | 0.2417 | |
| 0.01572 | 0.1528 | |
| 0.00032 | 0.01287 | |
| 0.03344 | 0.2151 | |
| 0.04628 | 0.2415 | |
| 0.00286 | 0.06287 | |
| 0.02609 | 0.1957 | |
| 0.03406 | 0.2151 | |
| 0.02034 | 0.1717 | |
| 0.02130 | 0.1717 | |
| 0.03719 | 0.2187 | |
| 0.03889 | 0.2187 | |
| 0.00024 | 0.01147 | |
| 0.03103 | 0.2128 | |
| 0.04856 | 0.2428 | |
| 0.00902 | 0.1273 | |
| 0.02094 | 0.1717 | |
| 0.01592 | 0.1528 | |
| 0.00843 | 0.1264 | |
| 0.01053 | ||
| 0.01820 | 0.1680 | |
| 0.01059 | 0.1338 | |
| 0.04173 | 0.2226 | |
| 0.04089 | 0.2226 | |
| 7.73E-05 | ||
| 0.03889 | 0.2187 | |
| 0.01053 | ||
| 0.00735 | 0.1264 | |
| 0.00556 | 0.1113 | |
| 0.00270 | 0.06287 | |
| 0.00234 | 0.06287 | |
| 0.00791 | 0.1264 | |
| 0.01238 | 0.1387 |
The 48 DNA repair genes with a significant association (p ≤ 0.05) in MiST and their corresponding q-values are listed. Bolded genes have a p-value that is significant following multiple testing correction by Bonferroni (p ≤ 0.00021). Genes with an asterisk have a q-value <0.05, but were not significant at the Bonferroni significance threshold.
Fig 2The–log p-value graphs for the LOO-W analysis for BRCA2, BRCA1 and SMG1.
A decrease in the–log p-value is an increase in p-value signifying the window dropped contains variants driving the association with PC risk. Any window with an increase in p-value was analyzed by LOO-V for potential variants of interest. A) LOO-W for BRCA2. B) LOO-W for BRCA1. C) LOO-W for SMG1.
Fig 3The–log p-value graphs for LOO-V analysis for each significant window for BRCA2, BRCA1, SMG1.
A decrease in the–log p-value corresponds to an increase in p-value, signifying the dropped variant is potentially driving the association with PC risk. The dotted line represents the–log p-value for an increase in p-value of 35% compared with the p-value of the window. This is the threshold for identifying a candidate variant. A) LOO-V for window 1 to window 4 of BRCA2. B) LOO-V for window 1 and window 2 of BRCA1. C) LOO-V for window 1 and window 2 of SMG1.
Summary of mutations identified in the discovery series by the leave-one-out analysis for 2 known susceptibility genes and a candidate PC susceptibility gene.
| Gene | Chromosomal Position | Mutation | Type | CADD | Case MAF | Control MAF | EVS | ExAC | 1000s | HSF/ MES | p-value increase (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 13:32893369 | c.223G>C:p.A75P | Missense | 23.8 | 0.0026 | 0.0005 | 3.1E-4 | 1.6E-4 | . | . | >55% | |
| 13:32900750 | c.631G>A:p.V211I | Missense | 26 | 0.0013 | 0 | . | . | . | BD | >65% | |
| 13:32903604 | Frameshift | 24.3 | 0.0013 | 0 | . | 4.9E-5 | . | . | >55% | ||
| 13:32906541 | Frameshift | 26.4 | 0.0013 | 0 | . | . | . | . | >65% | ||
| 13:32906766 | c.1151C>T:p.S384F | Missense | 23.4 | 0.0039 | 0 | 1.1E-3 | 6.8E-4 | . | . | >105% | |
| 13:32907395 | c.1780A>T:p.I594L | Missense | 20.5 | 0.0013 | 0 | . | 1.6E-5 | . | . | >45% | |
| 13:32911601 | Stopgain | 37 | 0.0013 | 0 | . | . | . | . | >105% | ||
| 13:32912060 | c.3568C>T:p.R1190W | Missense | 25.3 | 0.0013 | 0 | . | 1.1E-4 | . | . | >65% | |
| 13:32912663 | Stopgain | 41 | 0.0013 | 0 | . | . | . | . | >105% | ||
| 13:32913077 | c.4585G>A:p.G1529R | Missense | 29.2 | 0.0026 | 0 | 4.6E-4 | 4.2E-4 | . | CA | >105% | |
| 13:32913182 | Frameshift | 25.6 | 0.0013 | 0 | 8E-5 | . | . | . | >105% | ||
| 13:32913554 | Frameshift | 34 | 0.0013 | 0 | . | . | . | . | >105% | ||
| 13:32914437 | Frameshift | 35 | 0.0026 | 0 | . | 2.6E-4 | . | . | >105% | ||
| 13:32918706 | c.6853A>G:p.I2285V | Missense | 25.5 | 0.0026 | 0 | 2.3E-4 | 2.7E-4 | . | CA | >105% | |
| 13:32920979 | c.6953G>A:p.R2318Q | Missense | 35 | 0.0013 | 0 | . | 1.6E-5 | . | . | >45% | |
| 13:32928996 | Splicing | 23 | 0.0013 | 0 | . | . | . | . | >35% | ||
| 13:32936782 | c.7928C>G:p.A2643G | Missense | 32 | 0.0013 | 0 | 7.7E-5 | 2.5E-5 | . | . | >35% | |
| 13:32937618 | c.8279G>T:p.G2760V | Missense | 31 | 0.0013 | 0 | . | . | . | CD | >35% | |
| 13:32950851 | Stopgain | 51 | 0.0013 | 0 | . | . | . | . | >35% | ||
| 17:41276044 | Frameshift | 32 | 0.0013 | 0 | . | 2.2E-4 | . | . | >105% | ||
| 17:41245422 | Frameshift | 22.3 | 0.0013 | 0 | . | . | . | . | >105% | ||
| 16:18908268 | c.103G>A:p.A35T | Missense | 21.8 | 0.0088 | 0.0005 | 0.0027 | 0.019 | . | . | >105% | |
| 16:18908112 | c.256+2delGA | Splicing | 24.1 | 0.0039 | 0 | . | . | . | . | >105% | |
| 16:18908113 | c.256+2delTC | Splicing | 24.1 | 0.0013 | 0 | . | . | . | . | >105% | |
| 16:18887501 | c.1835T>A:p.I612K | Missense | 17.25 | 0.0013 | 0 | . | 0.043 | . | . | >65% | |
| 16:18881315 | c.2494A>G:p.N832D | Missense | 12.03 | 0.0026 | 0 | . | 0.0021 | . | . | >105% | |
| 16:18872021 | c.3773A>G:p.N1258S | Missense | 4.325 | 0.0013 | 0 | . | 4.2E-5 | . | CA | >105% | |
| 16:18870914 | c.3917C>T:p.P1306L | Missense | 8.947 | 0.0052 | 0.0005 | 0.0018 | 0.0017 | . | . | >105% | |
| 16:18866212 | c.4249A>G:p.I1417V | Missense | 6.2 | 0.0026 | 0 | . | 1.7E-5 | . | CD/CA | >105% | |
| 16:18863489 | c.4952C>G:p.S1651C | Missense | 19.77 | 0.0013 | 0 | 8.4E-5 | 2.0E-4 | . | CD | >45% | |
| 16:18848732 | c.7447G>A:p.V2483I | Missense | 18.38 | 0.0013 | 0 | . | . | . | . | >55% | |
| 16:18844389 | c.8665G>A:p.G2889S | Missense | 11.67 | 0.0013 | 0 | . | 0.0013 | . | CA | >85% | |
| 16:18841066 | c.9145C>T:p.L3049F | Missense | 12.59 | 0.0013 | 0 | . | . | . | . | >85% | |
| 16:18823386 | c.10685T>C:p.L3562P | Missense | 18.78 | 0.0013 | 0 | . | . | . | . | >35% | |
| 16:18820956 | c.10921A>G:p.I3641V | Missense | 18.52 | 0.0013 | 0 | . | . | . | . | >35% |
The variants correspond to the following transcripts: BRCA2 NM_000059, BRCA1 NM_007294, and SMG1 NM_015092. MAF for our case series, control series, and 3 public databases as well as the CADD score are shown. The p-value increase observed for the leave-one-out variant test and the prediction on splicing for missense variants are indicated. CADD, combined annotation depletion dependent. MAF, minor allele frequency. EVS, Exome Variant Server. ExAC, Exome Aggregation Consortium. 1000s, 1000 genomes project. HSF, Human Splicing Finder. MES, MaxEntScan. BD, broken splice donor site. CA, creation of splice acceptor site. CD, creation of splice donor site.
Clinical characteristics of carriers of the 14 SMG1 variants identified in LOO-analysis.
| ID | Variant | Sex | Ethnic Ancestry | Age of PC diagnosis (in years) | Stage at diagnosis | Other cancer diagnoses with age of diagnosis (in years) |
|---|---|---|---|---|---|---|
| A-26 | M | European | 70 | IIA | . | |
| A-78 | M | European | 53 | III | . | |
| A-79 | F | European | 75 | IV | Basal Cell Carcinoma, 65, Squamous Cell Carcinoma, 66 | |
| A-100 | M | European | 77 | III | Skin (unknown type), 70 | |
| B-8 | F | European | 78 | IV | Breast 48, 63, Bladder 61 | |
| B-11 | M | European | 70 | IV | . | |
| B-17 | M | Asian | 67 | IV | . | |
| B-24 | M | European | 75 | IV | . | |
| B-35 | F | European | 58 | III | . | |
| B-75 | M | European | 70 | III | . | |
| B-95 | F | European | 51 | IV | . | |
| B-99 | M | European | 61 | IV | . | |
| B-105 | F | European | 83 | IV | . | |
| B-149 | M | Asian | 65 | IV | . | |
| B-165 | M | Asian | 58 | IIB | . | |
| B-166 | M | Asian | 70 | I-III | . | |
| B-168 | M | European | 49 | IV | . | |
| B-191 | F | Asian | 76 | III | . | |
| B-207 | M | Asian | 79 | IIA | . | |
| B-221 | F | Asian | 76 | II | . | |
| B-242 | F | Asian | 64 | III | . | |
| B-247 | F | European | 76 | IIB | Melanoma, 75 | |
| B-253 | F | European | 60 | III | . | |
| B-259 | F | European | 92 | IV | . | |
| B-263 | M | Central/South American | 72 | IV | . | |
| B-276 | M | European | 69 | IV | . | |
| B-281 | F | European | 63 | II | . |
The ethnic ancestry column denotes the ancestry group the individual clustered with in the PCA. M, male; F, female.
Fig 4Pedigrees for two SMG1 variant carriers with segregation opportunities.
A) In family A-78, both the proband and the maternal aunt were included in our case series and were identified to be carriers of the c.4249A>G (p.I1417V) SMG1 variant. *SMGI c.4249A>G carrier status for the maternal uncle with PC was inferred by genotyping his son. B) In family B-105, the maternal aunt was included in our case series and was identified to be a carrier of the c.4952C>G (p.S1651C) SMG1 variant. The proband was found to not be a carrier of this SMG1 mutation.
Fig 5A lollipop diagram depicting the mutation landscape for SMG1 variants identified in both the discovery and validation series.
SMG1 consists of 4 main functional domains represented by the blue (N-terminal domain—amino acid (aa) 32–140), red (FAT domain—aa 1131–1866), green (PIKK domain—aa 2121–2482), and orange (FATC domain—aa 3629–3661). Variants identified in the discovery series are depicted by the upward lollipops and variants identified in the validation series are depicted by the downward lollipops. The red lollipop located in the N-terminal domain represents the recurrent p.A35T variant identified in both the discovery and validation series. The purple and orange lollipops located in the FAT domain represent the p.I1417V variant, which segregated in kindred A-78, and the p.S1651C variant identified in kindred B-105, respectively. The dark green and black lollipops represent the amino acid position adjacent to the splicing variants identified. The light blue lollipop represents the non-frameshift variant identified.