| Literature DB >> 26913838 |
Maxime P Vallée1, Tonya L Di Sera2, David A Nix3, Andrew M Paquette4, Michael T Parsons5, Russel Bell4, Andrea Hoffman5, Frans B L Hogervorst6, David E Goldgar7, Amanda B Spurdle5, Sean V Tavtigian4.
Abstract
Clinical mutation screening of the cancer susceptibility genes BRCA1 and BRCA2 generates many unclassified variants (UVs). Most of these UVs are either rare missense substitutions or nucleotide substitutions near the splice junctions of the protein coding exons. Previously, we developed a quantitative method for evaluation of BRCA gene UVs-the "integrated evaluation"-that combines a sequence analysis-based prior probability of pathogenicity with patient and/or tumor observational data to arrive at a posterior probability of pathogenicity. One limitation of the sequence analysis-based prior has been that it evaluates UVs from the perspective of missense substitution severity but not probability to disrupt normal mRNA splicing. Here, we calibrated output from the splice-site fitness program MaxEntScan to generate spliceogenicity-based prior probabilities of pathogenicity for BRCA gene variants; these range from 0.97 for variants with high probability to damage a donor or acceptor to 0.02 for exonic variants that do not impact a splice junction and are unlikely to create a de novo donor. We created a database http://priors.hci.utah.edu/PRIORS/ that provides the combined missense substitution severity and spliceogenicity-based probability of pathogenicity for BRCA gene single-nucleotide substitutions. We also updated the BRCA gene Ex-UV LOVD, available at http://hci-exlovd.hci.utah.edu, with 77 re-evaluable variants.Entities:
Keywords: BRCA1; BRCA2; cancer predisposition; rare variant; unclassified variant; variant of uncertain significance
Mesh:
Substances:
Year: 2016 PMID: 26913838 PMCID: PMC4907813 DOI: 10.1002/humu.22973
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Illustration of the MaxEntScan sliding window approach. A: BRCA1 cDNA reference sequence from 4,865 to 4,873, highlighting nucleotide C4868 in exon 16. B: Sliding a MaxEntScan donor window across an innocuous possible substitution, c.4868C>T. C: Sliding a MaxEntScan donor window across the de novo donor creating substitution c.4868C>G; note the relatively high MaxEntScan donor score at the sixth frame of the sliding window. D: MaxEntScan evaluation of the intron 16 splice donor; note that this score is actually lower than the score of c.4868C>G.
MaxEntScan Scores for Spliceogenic Variants from DBASS3 and DBASS5 Compared with Scores for Wild‐Type Splice Junctions for Disease Genes
| MaxEntScan score for wild‐type splice junction | MaxEntScan score for variant sequence damaging splice junction function | MaxEntScan score for wild‐type k‐mer sequence underlying de novo sites | MaxEntScan score for variant k‐mer sequence creating de novo sites | ||
|---|---|---|---|---|---|
| Number of sequence variants | Ave (SD) | Ave (SD) | Ave (SD) | Ave (SD) | |
| DBASS5 wild‐type donors | 336 | 8.02 (2.10) | 0.59 (2.88) | n/a | n/a |
| DBASS3 wild‐type acceptors | 240 | 8.21 (2.74) | 1.53 (3.54) | n/a | n/a |
| DBASS5 de novo donors | 71 | 7.57 (2.42) | n/a | 1.47 (4.94) | 7.29 (3.63) |
| DBASS3 de novo acceptors | 12 | 7.14 (2.26) | n/a | 1.90 (3.62) | 8.33 (2.39) |
|
| 110 | 8.02 (2.31) | n/a | n/a | n/a |
|
| 110 | 7.98 (2.44) | n/a | n/a | n/a |
Scores for the relevant wild‐type splice junction k‐mer of exons included in the analysis.
Scores for the k‐mer containing spliceogenic variants from DBASS. Excludes scores for BRCA1, BRCA2, and ATM.
Figure 2Distribution of MaxEntScan donor splice‐site scores. Red: Reference sequence splice donors. Purple: All possible single‐nucleotide substitutions to the reference splice donors. Green: All possible single‐nucleotide substitutions to the open‐reading frames of BRCA1 and BRCA2.
Figure 3Distribution of MaxEntScan acceptor splice‐site scores. Red: Reference sequence splice acceptors. Purple: All possible single‐nucleotide substitutions to the reference splice acceptors. Green: All possible single‐nucleotide substitutions to the open‐reading frames of BRCA1 and BRCA2.
MaxEntScan Z‐Score‐Based Assessment of Sequence Variants That Damage Wild‐Type Splice Junctions, Recorded in DBASS5, DBASS3, or the BIC
| MES | DBASS5 | DBASS5 (%) | BIC donors | #Possible | Possible |
|---|---|---|---|---|---|
| Potential splice donor damage | |||||
| (1.5, +inf) | 0 | 0.00 | 0 | 6 | 0.00 |
| (1, 1.5] | 0 | 0.00 | 0 | 77 | 0.00 |
| (0.5, 1] | 2 | 0.74 | 0 | 125 | 0.00 |
| (0, 0.5] | 1 | 0.37 | 0 | 161 | 0.00 |
| (−0.5, 0] | 2 | 0.74 | 1 | 133 | 0.75 |
| (−1, −0.5] | 2 | 0.74 | 1 | 99 | 1.01 |
| (−1.5, −1] | 11 | 4.06 | 3 | 104 | 2.88 |
| (−2, −1.5] | 15 | 5.54 | 3 | 58 | 5.17 |
| (−2.5, −2] | 36 | 13.28 | 8 | 126 | 6.35 |
| (−inf, −2.5] | 202 | 74.54 | 49 | 353 | 13.88 |
| Potential splice acceptor damage | |||||
| (1.5, +inf) | 1 | 0.55 | 0 | 202 | 0.00 |
| (1, 1.5] | 1 | 0.55 | 0 | 238 | 0.00 |
| (0.5, 1] | 1 | 0.55 | 0 | 438 | 0.00 |
| (0, 0.5] | 5 | 2.75 | 0 | 530 | 0.00 |
| (−0.5, 0] | 5 | 2.75 | 0 | 489 | 0.00 |
| (−1, −0.5] | 7 | 3.85 | 0 | 339 | 0.00 |
| (−1.5, −1] | 8 | 4.40 | 3 | 314 | 0.96 |
| (−2, −1.5] | 27 | 14.84 | 6 | 243 | 2.47 |
| (−2.5, −2] | 36 | 19.78 | 9 | 151 | 5.96 |
| (−inf, −2.5] | 91 | 50.00 | 39 | 368 | 10.60 |
BRCA1 and BRCA2 sequence variants recorded in DBASS were removed to avoid double counting.
Sequence variants deposited in the BIC (http://research.nhgri.nih.gov/bic/) that are located at the BRCA1 or BRCA2 native donor and acceptor sites. Evidence for spliceogenicity is not recorded in BIC for all these sequence variants.
Qualitative Z‐Score Ranges and Probabilities of Pathogenicity for Potential Damage to Reference Splice Junctions
| Qualitative category | MES | Alpha | 95% CI | Number of variants used | Prior probability |
|---|---|---|---|---|---|
| Improved | Score improved versus reference sequence | 0.00 | (0.00, 0.08) | 46 | 0.04 |
| Minimal | Donor: | 0.00 | (0.00, 0.08) | 24 | 0.04 |
| Acc: | |||||
| Moderate | Donor: −2 ≤ | 0.34 | (0.15, 0.55) | 66 | 0.34 |
| Acc: −1.5 ≤ | |||||
| High | Donor: | 0.97 | (0.82, 1.00) | 94 | 0.97 |
| Acc: |
Figure 4Enrichment for observed exonic substitutions as a function of MaxEntScan de novo donor or de novo acceptor scores. Red: Analysis of de novo donor enrichment as a function of MES donor score. Blue: analysis of de novo acceptor enrichment as a function of MES acceptor score.
Qualitative Z‐Score Ranges and Probabilities of Pathogenicity for Creation of Exonic De Novo Donors That Would Either Create a Frameshift or Alter a Key Functional Domain
| Qualitative category | MES | Alpha | 95% CI | Number of variants used | Prior probability |
|---|---|---|---|---|---|
| Weak/null and low |
| 0.01 | (0.00, 0.04) | 977 | 0.02 |
| Moderate | −2 <= | 0.30 | (0.00, 0.88) | 7 | 0.30 |
| Increased |
| 0.64 | (0.06, 0.98) | 8 | 0.64 |
Potentially spliceogenic sequence variants that have a higher MES score than the wild‐type donor for their exon are promoted to the next more severe qualitative category.
Summary of Published BRCA Gene Sequence Variant Spliceogenicity Assay Results
| No aberration | Aberration | Percentage with aberration (95% CI) | |
|---|---|---|---|
| Acceptor variants | |||
| High | 0 | 43 | 100.0 (88.0–100.0) |
| Moderate | 17 | 13 | 43.3 (25.5–62.6) |
| Minimal or improved | 18 | 1 | 5.3 (0.1–26.0) |
| Donor variants | |||
| High | 2 | 67 | 97.1 (89.9–99.6) |
| Moderate | 5 | 34 | 87.2 (72.6–95.7) |
| Minimal or improved | 7 | 1 | 12.5 (0.3–52.7) |
| Combined donor and acceptor variants | |||
| High | 2 | 110 | 98.2 (93.7–99.8) |
| Moderate | 22 | 47 | 68.1 (55.8–78.8) |
| Minimal or improved | 25 | 2 | 7.4 (0.9–24.3) |
| De novo donor variants | |||
| Increased | 4 | 9 | 69.2 (38.6–90.9) |
| Moderate | 2 | 2 | 50.0 (6.8–93.2) |
| Weak/null and low | 187 | 35 | 15.8 (11.2–21.2) |
Added one discordant observation in order to estimate a 95% confidence interval.
One of the variants in this category, BRCA1 c.591C>T, is IARC class 1, neutral [Dosil et al., 2010; de la Hoya et al, submitted].
[Friedman et al., 1994, 1995; Gayther et al., 1995; Petrij‐Bosch et al., 1997; Xu et al., 1997; Hoffman et al., 1998; Mazoyer et al., 1998; Fetzer et al., 1999; Ozcelik et al., 1999; Pyne et al., 1999; Santarosa et al., 1999; Scholl et al., 1999; Hartikainen et al., 2000; Pyne et al., 2000; Laskie Ostrow et al., 2001; Vega et al., 2001; Claes et al., 2002; Fackenthal et al., 2002; Howlett et al., 2002; Krajc et al., 2002; Meindl, 2002; Agata et al., 2003; Claes et al., 2003; Campos et al., 2003; Hofmann et al., 2003; Keaton et al., 2003; Yang et al., 2003; Brose et al., 2004; Sharp et al., 2004; Tesoriero et al., 2005; Bonatti et al., 2006; Chen et al., 2006; Chenevix‐Trench et al., 2006; Beristain et al., 2007; Ang et al., 2007; Anczukow et al., 2008; Bonnet et al., 2008; Farrugia et al., 2008; Goina et al., 2008; Kwong et al., 2008; Machackova et al., 2008; Spearman et al., 2008; Caux‐Moncoutier et al., 2009; Gutierrez‐Enriquez et al., 2009; Li et al., 2009; Vreeswijk et al., 2009; Willems et al., 2009; Dosil et al., 2010; Gaildrat et al., 2010; Hansen et al., 2010; Rouleau et al., 2010; Sanz et al., 2010; Steffensen et al., 2010; Walker et al., 2010; Whiley et al., 2010; Brandao et al., 2011; Thery et al., 2011; Whiley et al., 2011; Zhang et al., 2011; Acedo et al., 2012; Gaildrat et al., 2012; Houdayer et al., 2012; Joose et al. 2012; Menendez et al., 2012; Thomassen et al., 2012; Wappenschmidt et al., 2012; Colombo et al., 2013; Di Giacomo et al., 2013; Parsons et al., 2013; de Garibay et al., 2014; Santos et al., 2014; Whiley et al., 2014a; Acedo et al., 2015; Ahlborn et al., 2015].
BRCA1 and BRCA2 Sequence Variants Reclassified as a Consequence of Updated Prior Probabilities of Pathogenicity
| Gene | HGVS (nucleotide) | HGVS (amino acid) | BIC | Motif location |
| Published prior | Splicing prior | Missense prior | Selected prior | |
|---|---|---|---|---|---|---|---|---|---|---|
|
| c.4479_4484+2dup | N/A | IVS14+2ins8 | Donor |
| 0.26 | 0.97 | N/A | 0.97 | |
| mRNA analysis: 8 bp retention of intron 14 [Whiley et al., | ||||||||||
|
| c.4868C>G | p.A1623G | A1623G | De novo donor | −2 ≤ | 0.01 | 0.64b | 0.02 | 0.64 | |
| mRNA analysis: 119 bp deletion of exon 16, variant also present in wild‐type transcripts [Walker et al., | ||||||||||
|
| c.5278‐14C>G | N/A | IVS20‐14C>G | Acceptor |
| 0.26 | 0.04 | N/A | 0.04 | |
| mRNA analysis: no aberration detected [Spearman et al., | ||||||||||
|
| c.5467+5G>C | N/A | IVS23+5G>C | Donor | −2 ≤ | 0.26 | 0.34 | N/A | 0.34 | |
| Exon 23 deletion [Whiley et al., | ||||||||||
|
| c.632‐16A>C | N/A | IVS7‐16A>C | Acceptor | Increased MES score | 0.26 | 0.04 | N/A | 0.04 | |
| mRNA analysis: no aberration detected [Thomassen et al., | ||||||||||
This variant was originally reported as c.4484+2ins8; insertion is due to duplication of GGAAAGGT.
De novo prior upgraded to the next higher qualitative category because the z‐score is higher than that of the corresponding wild‐type donor.
Pathology likelihood ratios (LRs) based on breast tumor ER, grade, or TN status and using revised estimates from Spurdle et al. (in press). Cosegregation, co‐occurrence, and family history LRs are from the cited source of observational data.
Class strata as described in Plon et al. (2008).