| Literature DB >> 28166811 |
Yuya Kobayashi1, Shan Yang2, Keith Nykamp2, John Garcia2, Stephen E Lincoln2, Scott E Topper2.
Abstract
BACKGROUND: The frequency of a variant in the general population is a key criterion used in the clinical interpretation of sequence variants. With certain exceptions, such as founder mutations, the rarity of a variant is a prerequisite for pathogenicity. However, defining the threshold at which a variant should be considered "too common" is challenging and therefore diagnostic laboratories have typically set conservative allele frequency thresholds.Entities:
Keywords: ACMG ISV guidelines; Allele-frequency threshold; ExAC; Variant interpretation
Mesh:
Year: 2017 PMID: 28166811 PMCID: PMC5295186 DOI: 10.1186/s13073-017-0403-7
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Concept diagram for using pathogenic variant frequency distributions to establish allele frequency thresholds. Depicted is a density plot of pathogenic variants in a hypothetical gene. x-axis: allele frequency; y-axis: number of pathogenic variants. The arrows (labeled A, B, and C) highlight three different scenarios of how allele frequencies of previously uncharacterized variants can be evaluated in the context of the pathogenic variant frequency distribution. In scenario A, the uncharacterized variant has an allele frequency that is highly consistent with known pathogenic variants. Because many benign variants are also rare or private, the allele frequency of this variant provide little weight towards either classification. In scenario B, the uncharacterized variant has an allele frequency that is consistent with known pathogenic variants, but is more common than the vast majority of them. The likelihood of such a variant being pathogenic is substantially reduced. In scenario C, the uncharacterized variant has an allele frequency outside of the pathogenic variant frequency distribution. The likelihood of such a variant being pathogenic is extremely low, as it is more common than any other previously characterized pathogenic variant
Fig. 2Allele counts of BRCA1 and BRCA2 pathogenic and likely pathogenic variants. Histograms of the allele counts of pathogenic and likely pathogenic variants in the ExAC dataset for (a) BRCA1 and (b) BRCA2. x-axis: allele count; y-axis: number of unique sequence variants. The red portion represents pathogenic NMDpositive variants and the orange portion represents pathogenic NMDnegative variants (missense, intronic, in-frame indels, and truncations expected to avoid NMD)
Most common pathogenic BRCA1 and BRCA2 variants in ExAC
| Gene | Variant | Effect | Allele count | MAF | Type |
|---|---|---|---|---|---|
|
| NM_007294.3:c.68_69delAG | p.Glu23Valfs*17 | 29 | 0.024% | Founder [ |
|
| NM_007294.3:c.5266dupC | p.Gln1756Profs*74 | 19 | 0.016% | Founder [ |
|
| NM_007294.3:c.181 T > G | p.Cys61Gly | 8 | 0.0067% | Founder [ |
|
| NM_007294.3:c.4035delA | p.Glu1346Lysfs*20 | 5 | 0.0041% | Founder [ |
|
| NM_007294.3:c.1687C > T | p.Gln563* | 5 | 0.0041% | Founder [ |
|
| NM_000059.3:c.5946delT | p.Ser1982Argfs*22 | 32 | 0.027% | Founder [ |
|
| NM_000059.3:c.3847_3848delGT | p.Val1283Lysfs*2 | 11 | 0.011% | Recurrent [ |
|
| NM_000059.3:c.658_659delGT | p.Val220Ilefs*4 | 6 | 0.0061% | Recurrent [ |
|
| NM_000059.3:c.7480C > T | p.Arg2494* | 6 | 0.0050% | Founder [ |
|
| NM_000059.3:c.3545_3546delTT | p.Phe1182* | 4 | 0.0033% | Recurrent [ |
|
| NM_000059.3:c.3599_3600delGT | p.Cys1200* | 4 | 0.0033% | Recurrent [ |
|
| NM_000059.3:c.5576_5579delTTAA | p.Ile1859Lysfs*3 | 4 | 0.0033% | Recurrent [ |
|
| NM_000059.3:c.7069_7070delCT | p.Leu2357Valfs*2 | 4 | 0.0033% | Recurrent [ |
|
| NM_000059.3:c.9118-2A > G | Splice acceptor | 4 | 0.0033% | Recurrent [ |
Fourteen pathogenic variants in BRCA1 and BRCA2 were observed in more than three individuals in the ExAC cohort. All of these variants have been previously reported as founder or common recurrent variants. Some of these recurrent variants are suggested to be founder mutations, but have not been confirmed as such. MAF minor allele frequency
Frequencies of NMDpositive variants in hereditary cancer, primary ciliary dyskinesia (PCD), and arrhythmia/cardiomyopathy genes in ExAC
| Variant allele frequencies in ExAC: | ||||||
|---|---|---|---|---|---|---|
| Gene | Inheritance | <0.005% | 0.005–0.01% | 0.01–0.05% | 0.05–0.1% | ≥0.1% |
| Hereditary cancer | ||||||
|
| Dominant | 6 | - | - | - | - |
|
| Recessive | 104 | 1 | - | - | - |
|
| Dominant | 22 | - | - | - | - |
|
| Recessive | 38 | 2 | 1 | - | - |
|
| Dominant | 3 | 1 | - | - | - |
|
| Dominant | 67 | - | 2 | - | - |
|
| Dominant | 103 | 2 | 1 | - | - |
|
| Recessive | 32 | - | 1 | - | - |
|
| Dominant | 5 | - | - | - | - |
|
| Dominant | 4 | - | - | - | - |
|
| Dominant | 29 | 3 | 2 | - | 1 |
|
| Dominant | 11 | 1 | - | - | - |
|
| Recessive | 9 | 1 | - | - | - |
|
| Recessive | 23 | 1 | - | - | - |
|
| Dominant | - | - | - | - | - |
|
| Dominant | 5 | - | - | - | - |
|
| Recessive | 18 | 1 | - | - | - |
|
| Dominant | 7 | 1 | - | - | - |
|
| Dominant | 25 | - | 1 | - | - |
|
| Recessive | 19 | 3 | 2 | - | 1 |
|
| Recessive | 33 | 1 | 1 | - | - |
|
| Dominant | 23 | 1 | - | - | - |
|
| Dominant | 35 | 2 | 1 | - | - |
|
| Dominant | 32 | - | - | - | - |
|
| Dominant | 2 | - | - | - | - |
|
| Dominant | 2 | - | - | - | - |
|
| Recessive | 42 | 2 | 3 | - | - |
|
| Recessive | 19 | 3 | - | - | - |
|
| Dominant | 2 | - | - | - | - |
|
| Dominant | 1 | - | 1 | - | - |
|
| Dominant | - | - | - | - | - |
|
| Dominant | 1 | - | - | - | - |
|
| Dominant | - | - | - | - | - |
| Primary ciliary dyskinesia | ||||||
|
| Recessive | 36 | 3 | - | - | - |
|
| Recessive | 7 | 1 | 1 | - | - |
|
| Recessive | 3 | - | - | - | - |
|
| Recessive | 6 | - | - | - | - |
|
| Recessive | 16 | 1 | - | - | - |
|
| Recessive | 22 | 2 | - | - | - |
|
| Recessive | 32 | 1 | 1 | - | - |
|
| Recessive | 17 | - | 1 | - | - |
|
| Recessive | - | - | - | - | - |
|
| Recessive | 19 | 2 | 1 | - | - |
|
| Recessive | 12 | - | - | - | - |
|
| Recessive | 16 | 2 | 1 | - | - |
|
| Recessive | 80 | 1 | 2 | - | - |
|
| Recessive | 113 | - | 2 | - | - |
|
| Recessive | 15 | - | 1 | - | - |
|
| Recessive | 21 | 1 | 1 | - | - |
|
| Recessive | 3 | - | - | - | - |
|
| Recessive | 22 | - | 2 | - | - |
|
| Recessive | 18 | 1 | - | - | - |
|
| Recessive | 1 | - | - | - | - |
|
| Recessive | 9 | 1 | 2 | - | - |
|
| Recessive | 27 | 1 | - | - | - |
|
| Recessive | 4 | - | - | - | - |
|
| Recessive | 21 | 2 | - | - | - |
|
| Recessive | 16 | - | - | - | - |
| Arrhythmia and cardiomyopathy | ||||||
|
| Dominant | 2 | - | - | - | - |
|
| Dominant | 6 | - | - | - | - |
|
| Recessive | 15 | - | - | - | - |
|
| Recessive | 5 | - | - | - | - |
|
| Dominant | 13 | - | - | - | - |
|
| Dominant | 19 | - | - | - | - |
|
| Recessive | 21 | - | - | - | - |
|
| Dominant | - | - | - | - | - |
|
| Dominant | 5 | - | - | - | - |
|
| Recessive | 7 | - | - | - | - |
|
| Dominant | - | - | - | - | - |
|
| Dominant | 5 | - | 2 | - | - |
|
| Dominant | 15 | - | 1 | - | - |
|
| Dominant | - | - | - | - | - |
|
| Dominant | 1 | - | - | - | - |
|
| Dominant | 17 | - | - | - | - |
|
| Dominant | 1 | - | - | - | - |
|
| Dominant | 18 | - | 1 | - | - |
|
| Dominant | - | - | - | - | - |
|
| Dominant | 9 | - | - | - | - |
|
| Recessive | 6 | 1 | 2 | - | - |
All NMDpositive variants for genes associated with hereditary cancer, primary ciliary dyskinesia, and arrhythmia/cardiomyopathy were binned based on their allele frequencies in the ExAC dataset. In all genes listed, loss of protein function has been established as the mechanism of disease. Several of the genes listed are associated with both dominant and recessive inheritance patterns; the listed inheritance patterns are specifically for those associated with the disorders most relevant to the clinical area, and the mechanism of disease is loss of function. For example, BRCA2 is associated with autosomal dominant hereditary breast and ovarian cancer, as well as autosomal recessive Fanconi anemia. DES and DSP genes are associated with both dominant and recessive arrhythmia and cardiomyopathy disorders. However, the loss-of-function mechanism has only been firmly established with the recessive disorders
ExAC allele frequencies of the 74 hypertrophic cardiomyopathy “gold standard” missense variants
| Gene | Variant | Effect | Classification | MAF |
|---|---|---|---|---|
|
| NM_000256.3:c.772G > A | p.Glu258Lys | Pathogenic | 0.0039%a |
|
| NM_000257.2:c.2389G > A | p.Ala797Thr | Pathogenic | 0.0033% |
|
| NM_000256.3:c.1504C > T | p.Arg502Trp | Pathogenic | 0.0025% |
|
| NM_000257.2:c.2167C > T | p.Arg723Cys | Pathogenic | 0.0025% |
|
| NM_000363.4:c.485G > A | p.Arg162Gln | Pathogenic | 0.0025% |
|
| NM_000257.2:c.1988G > A | p.Arg663His | Pathogenic | 0.0016% |
|
| NM_000256.3:c.1484G > A | p.Arg495Gln | Pathogenic | 0.00083% |
|
| NM_000432.3:c.173G > A | p.Arg58Gln | Pathogenic | 0.00083% |
|
| NM_000432.3:c.64G > A | p.Glu22Lys | Pathogenic | 0.00083% |
|
| NM_000363.4:c.433C > T | p.Arg145Trp | Pathogenic | 0.00083% |
|
| NM_000257.2:c.2609G > A | p.Arg870His | Pathogenic | 0.00082%b |
|
| NM_001001430.1:c.274C > T | p.Arg92Trp | Pathogenic | 0.00082% |
|
| NM_000363.4:c.433C > G | p.Arg145Gly | Pathogenic | 0.00080% |
|
| NM_000256.3:c.1351G > C | p.Glu451Gln | Pathogenic | 0 |
|
| NM_000256.3:c.1505G > A | p.Arg502Gln | Pathogenic | 0 |
|
| NM_000256.3:c.2265C > A | p.Asn755Lys | Pathogenic | 0 |
|
| NM_000257.2:c.1207C > T | p.Arg403Trp | Pathogenic | 0 |
|
| NM_000257.2:c.1208G > A | p.Arg403Gln | Pathogenic | 0 |
|
| NM_000257.2:c.1357C > T | p.Arg453Cys | Pathogenic | 0 |
|
| NM_000257.2:c.1750G > C | p.Gly584Arg | Pathogenic | 0 |
|
| NM_000257.2:c.1816G > A | p.Val606Met | Pathogenic | 0 |
|
| NM_000257.2:c.2146G > A | p.Gly716Arg | Pathogenic | 0 |
|
| NM_000257.2:c.2155C > T | p.Arg719Trp | Pathogenic | 0 |
|
| NM_000257.2:c.2156G > A | p.Arg719Gln | Pathogenic | 0 |
|
| NM_000257.2:c.2167C > G | p.Arg723Gly | Pathogenic | 0 |
|
| NM_000257.2:c.2221G > T | p.Gly741Trp | Pathogenic | 0 |
|
| NM_000257.2:c.2717A > G | p.Asp906Gly | Pathogenic | 0 |
|
| NM_000257.2:c.2722C > G | p.Leu908Val | Pathogenic | 0 |
|
| NM_000257.2:c.2770G > A | p.Glu924Lys | Pathogenic | 0 |
|
| NM_000257.2:c.2788G > A | p.Glu930Lys | Pathogenic | 0 |
|
| NM_000257.2:c.4135G > A | p.Ala1379Thr | Pathogenic | 0 |
|
| NM_000257.2:c.438G > T | p.Lys146Asn | Pathogenic | 0 |
|
| NM_000257.2:c.767G > A | p.Gly256Glu | Pathogenic | 0 |
|
| NM_000363.4:c.470C > T | p.Ala157Val | Pathogenic | 0 |
|
| NM_000363.4:c.557G > A | p.Arg186Gln | Pathogenic | 0 |
|
| NM_000363.4:c.575G > A | p.Arg192His | Pathogenic | 0 |
|
| NM_001001430.1:c.236 T > A | p.Ile79Asn | Pathogenic | 0 |
|
| NM_001001430.1:c.275G > A | p.Arg92Gln | Pathogenic | 0 |
|
| NM_001001430.1:c.421C > T | p.Arg141Trp | Pathogenic | 0 |
|
| NM_000366.5:c.523G > A | p.Asp175Asn | Pathogenic | 0 |
|
| NM_000366.5:c.688G > A | p.Asp230Asn | Pathogenic | 0 |
|
| NM_000256.3:c.2686G > A | p.Val896Met | Likely benign | 1.28%a |
|
| NM_000256.3:c.833G > A | p.Gly278Glu | Likely benign | 0.29%a |
|
| NM_000363.4:c.244C > T | p.Pro82Ser | Likely benign | 0.28% |
|
| NM_000256.3:c.565G > A | p.Val189Ile | Likely benign | 0.27% |
|
| NM_000256.3:c.3413G > A | p.Arg1138His | Likely benign | 0.13% |
|
| NM_000256.3:c.1519G > A | p.Gly507Arg | Likely benign | 0.068% |
|
| NM_000256.3:c.3004C > T | p.Arg1002Trp | Likely benign | 0.067% |
|
| NM_000363.4:c.235C > T | p.Arg79Cys | Likely benign | 0.043% |
|
| NM_000256.3:c.1564G > A | p.Ala522Thr | Likely benign | 0.039% |
|
| NM_000256.3:c.440C > T | p.Pro147Leu | Likely benign | 0.038%a |
|
| NM_000257.2:c.3981C > A | p.Asn1327Lys | Likely benign | 0.010% |
|
| NM_000256.3:c.1147C > G | p.Leu383Val | Likely benign | 0.0089% |
|
| NM_000256.3:c.842G > A | p.Arg281Gln | Likely benign | 0.0070%a |
|
| NM_000256.3:c.1633C > A | p.Leu545Met | Likely benign | 0.0034% |
|
| NM_000256.3:c.1246G > A | p.Gly416Ser | Likely benign | 0.0029% |
|
| NM_000256.3:c.2063C > A | p.Thr688Lys | Likely benign | 0.0019%a |
|
| NM_000256.3:c.3142C > T | p.Arg1048Cys | Likely benign | 0.0017% |
|
| NM_001001430.1:c.805A > T | p.Asn269Tyr | Likely benign | 0.0017%a |
|
| NM_000256.3:c.2410C > A | p.Leu804Met | Likely benign | 0 |
|
| NM_000257.2:c.321 T > G | p.Asp107Glu | Likely benign | 0 |
|
| NM_000257.2:c.4555A > T | p.Ser1519Cys | Likely benign | 0 |
|
| NM_000257.2:c.8A > C | p.Asp3Ala | Likely benign | 0 |
|
| NM_000363.4:c.244C > A | p.Pro82Thr | Likely benign | 0 |
|
| NM_000363.4:c.253 T > A | p.Leu85Met | Likely benign | 0 |
|
| NM_000363.4:c.257C > A | p.Ala86Asp | Likely benign | 0 |
|
| NM_001001430.1:c.682C > G | p.Gln228Glu | Likely benign | 0 |
|
| NM_000256.3:c.706A > G | p.Ser236Gly | Benign | 10.79% |
|
| NM_000256.3:c.472G > A | p.Val158Met | Benign | 9.04%a |
|
| NM_001001430.1:c.758A > G | p.Lys253Arg | Benign | 5.07% |
|
| NM_000257.2:c.4472C > G | p.Ser1491Cys | Benign | 0.75% |
|
| NM_000256.3:c.977G > A | p.Arg326Gln | Benign | 0.55% |
|
| NM_000256.3:c.1144C > T | p.Arg382Trp | Benign | 0.42% |
|
| NM_000256.3:c.2498C > T | p.Ala833Val | Benign | 0.22% |
Jordan et al. [28] previously published a curated list of 74 “gold standard” missense variants in genes associated with hypertrophic cardiomyopathy. All pathogenic variants were observed at allele frequencies of less than 0.005% in the ExAC dataset
aLoci covered by less than 80,000 total alleles in ExAC
bA variant that did not pass the variant calling quality filter in ExAC
Fig. 3Number of publications for variants observed at an allele frequency greater than 0.01%. a The top box plot, in blue, summarizes the publication of a randomly selected set of 100 variants with a consensus classification of benign in ClinVar and having an allele frequency greater than 0.01%. The bottom box plot, in red, summarizes the publication counts of all 129 variants with a consensus classification of pathogenic in ClinVar and having an allele frequency greater than 0.01%. The box represents the 25th–75th percentile range, with the median publication count depicted as the horizontal line in the middle. The “whiskers” represents the maximum and minimum values. b Scatter plot of allele frequencies and publication counts of the same set of variants. Each red circle represents a pathogenic variant and each blue circle represents a benign variant. x-axis: allele frequency; y-axis: publication counts. Four pathogenic variants that are extreme outliers were excluded for display purposes: (1) GJB2 NM_004004.5:c.35delG (p.Gly12Valfs*2) has an allele frequency of 0.60% and reported in 496 publications; (2) CFTR NM_000492.3:c.1521_1523delCTT (p.Phe508del) has an allele frequency of 0.68% and reported in 966 publications; (3) SERPINA1 NM_001127701.1:c.1096G > A (p.Glu366Lys) has an allele frequency of 1.2% and reported in 56 publications; and (4) BTD NM_000060.3:c.1330G > C (p.Asp444His) has an allele frequency of 3.2% and reported in 46 publications