| Literature DB >> 30696458 |
Roddy Walsh1,2,3, Francesco Mazzarotto4,5,6, Nicola Whiffin4,7,8, Rachel Buchan4,7, William Midwinter4,7, Alicja Wilk4,7, Nicholas Li4,8, Leanne Felkin4,7, Nathan Ingold4, Risha Govind4,7, Mian Ahmad4,7, Erica Mazaika4,7, Mona Allouba7,9, Xiaolei Zhang4,7, Antonio de Marvao8, Sharlene M Day10, Euan Ashley11, Steven D Colan12, Michelle Michels13, Alexandre C Pereira14, Daniel Jacoby15, Carolyn Y Ho16, Kate L Thomson17,18, Hugh Watkins18,19, Paul J R Barton4,7, Iacopo Olivotto5, Stuart A Cook4,7,20,21, James S Ware22,23,24.
Abstract
BACKGROUND: International guidelines for variant interpretation in Mendelian disease set stringent criteria to report a variant as (likely) pathogenic, prioritising control of false-positive rate over test sensitivity and diagnostic yield. Genetic testing is also more likely informative in individuals with well-characterised variants from extensively studied European-ancestry populations. Inherited cardiomyopathies are relatively common Mendelian diseases that allow empirical calibration and assessment of this framework.Entities:
Keywords: ACMG/AMP guidelines; Hypertrophic cardiomyopathy; Mendelian genetics; Variant interpretation
Mesh:
Year: 2019 PMID: 30696458 PMCID: PMC6350371 DOI: 10.1186/s13073-019-0616-z
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1The use of etiological fractions to evaluate variant classification methods. Illustration of how EFs can be used to evaluate methods for distinguishing pathogenic from benign variants (for a hypothetical gene). The overall EF of 0.85 [1] is based on a case frequency of 9.5% and a reference frequency of 1.5%. The aim of variant classification methods is to fully distinguish between pathogenic variants (producing an EF of 1.0 with frequency equal to case excess [2]) and benign variants (producing an EF of 0 with frequency equal to population reference, here ExAC [3]). We propose that an EF of 0.95 would be required to indicate a likely pathogenic variant
Etiological fractions and odds ratios for established HCM genes
| Gene | Transcript | Number of cases | Case frequency (variants/total) | ExAC frequency (variants/total) | Odds ratio (OR) | Etiological fraction (EF) | |
|---|---|---|---|---|---|---|---|
|
| |||||||
|
| ENST00000355349 | 6112 | 13.89% (849/6112) | 1.11% (672/60,469) | < 0.0001 | 14.4 (12.9–15.9) | 0.930 (0.923–0.938) |
|
| ENST00000545968 | 6179 | 9.35% (578/6179) | 1.21% (555/45,794) | < 0.0001 | 8.4 (7.5–9.5) | 0.881 (0.868–0.895) |
|
| ENST00000367318 | 6103 | 1.69% (103/6103) | 0.15% (86/57,018) | < 0.0001 | 11.4 (8.5–15.2) | 0.912 (0.889–0.935) |
|
| ENST00000344887 | 6047 | 2.10% (127/6047) | 0.15% (79/52,607) | < 0.0001 | 14.3 (10.8–18.9) | 0.930 (0.912–0.948) |
|
| ENST00000403994 | 4447 | 1.44% (64/4447) | 0.07% (42/58,642) | < 0.0001 | 20.4 (13.8–30.1) | 0.951 (0.933–0.969) |
|
| ENST00000228841 | 4185 | 1.03% (43/4185) | 0.11% (69/60,521) | < 0.0001 | 9.1 (6.2–13.3) | 0.890 (0.851–0.930) |
|
| ENST00000395869 | 4185 | 0.84% (35/4185) | 0.14% (85/60,605) | < 0.0001 | 6.0 (4.0–8.9) | 0.833 (0.772–0.895) |
|
| ENST00000290378 | 4185 | 0.53% (22/4185) | 0.06% (37/60,198) | < 0.0001 | 8.6 (5.1–14.6) | 0.884 (0.826–0.941) |
|
| ENST00000357525 | 5440 | 0.17% (9/5440) | 0.02% (15/60,475) | < 0.0001 | 6.7 (2.9–15.3) | 0.850 (0.737–0.964) |
|
| ENST00000533783 | 4866 | 0.62% (30/4866) | 0.19% (115/60,647) | < 0.0001 | 3.3 (2.2–4.9) | 0.694 (0.579–0.808) |
|
| ENST00000370690 | 2061 | 0.78% (16/2061) | 0.09% (53/60,278) | < 0.0001 | 8.9 (5.1–15.6) | 0.888 (0.826–0.949) |
|
| ENST00000232975 | 3335 | 0.24% (8/3335) | 0.06% (33/59,192) | 0.0013 | 4.3 (2.0–9.3) | 0.768 (0.598–0.938) |
|
| ENST00000325888 | 448 | 3.79% (17/448) | 2.15% (1225/56,897) | 0.0314 | 1.8 (1.1–2.9) | 0.442 (0.172–0.712) |
|
| ENST00000590592 | 3189 | 2.26% (72/3189) | 1.20% (683/57,035) | < 0.0001 | 1.9 (1.5–2.4) | 0.475 (0.353–0.597) |
|
| |||||||
|
| ENST00000545968 | 6179 | 9.16% (566/6179) | 0.09% (40/45,794) | < 0.0001 | 115.3 (83.6–159.1) | 0.991 (0.988–0.995) |
|
| ENST00000367318 | 6103 | 0.18% (11/6103) | 0.03% (17/57,018) | < 0.0001 | 6.1 (2.8–12.9) | 0.835 (0.722–0.948) |
|
| ENST00000344887 | 6047 | 0.08% (5/6047) | 0.01% (5/52,607) | 0.0019 | 8.7 (2.5–30.1) | 0.885 (0.757–1.013) |
|
| ENST00000357525 | 5440 | 0.17% (9/5440) | 0.01% (4/60,475) | < 0.0001 | 25.1 (7.7–81.4) | 0.960 (0.917–1.003) |
|
| ENST00000533783 | 4866 | 0.14% (7/4866) | 0.02% (14/60,647) | 0.0006 | 6.2 (2.5–15.5) | 0.840 (0.705–0.974) |
|
| ENST00000370690 | 2061 | 0.15% (3/2061) | 0.00% (0/60,278) | < 0.0001 | 205.0 (10.6–3969.8) | 0.995 (0.981–1.009) |
Displayed are the cumulative frequency of rare variants (rare defined by ExAC filtering allele frequency < 4 × 10−5 [22]), Fisher’s exact test p values and estimates of odds ratio and etiological fraction (with 95% confidence intervals) for non-truncating and truncating variants in HCM genes. The etiological fraction can be interpreted as an estimate of the probability that a rare variant, found in an individual with HCM, is causative. This suggests that the majority of variants are pathogenic when detected in cases, and for some genes, the probability that an individual variant is pathogenic is > 0.9, before considering variant-specific segregation of functional data. Only variant classes with a significant excess of variants in case cohorts over ExAC are displayed
Fig. 2Distribution of rare variants in HCM and ExAC cohorts for 6 genes with HCM clustering. Clustering analyses identify regions enriched for disease-associated variation, and therefore within which variants have a high likelihood of pathogenicity. For six HCM genes, the location of rare missense and single amino acid inframe indel variants found in cases (all variants regardless of clinical classification) and controls are shown alongside a cartoon of the cDNA structure. Darker grey indicates higher variant density (overlapping variants not plotted separately). Regions in which variants cluster significantly in cases are shown in red, and regions with clustering in population controls (ExAC) are shown in yellow. The HCM clusters detected were: MYH7 (residues 167–931), MYBPC3 (485–502, 1248–1266), TNNI3 (141–209), TNNT2 (79–179), MYL3 (143–180) and CSRP3 (44–71). For MYH7, existing functional annotations (as described in the “Discussion” section) are superimposed: In green, key residues of the converter kinetic domain and myosin mesa surface area enriched in disease-associated variants (Homburger et al. [37]); in blue, sites of inter- and intramolecular interaction between pairs of myosin heads (Alamo et al. [38]); and in grey, regions previously identified as constrained (intolerant of variation as evidenced by depletion of protein-altering variation in population controls), with the darker shades indicating higher constraint (Samocha et al. [36]). The coordinates describe amino-acid position within the canonical protein sequence
Refinement of etiological fractions for 6 HCM genes using variant clustering and functional prediction scores
| Gene | Case excess | EF (whole gene) | Predictor method | Prioritised variants | Variants not prioritised | ||
|---|---|---|---|---|---|---|---|
| Case freq. | EF | Case freq. | EF | ||||
|
| 12.76% | 0.930 (0.923–0.938) |
|
|
| 3.17% | 0.746 (0.706–0.785) |
| Consensus | 12.55% | 0.940 (0.933–0.947) | 1.32% | 0.783 (0.728–0.839) | |||
| MetaSVM | 12.53% | 0.944 (0.937–0.951) | 1.34% | 0.739 (0.675–0.804) | |||
| MetaLR | 13.29% | 0.944 (0.938–0.951) | 0.58% ( | 0.406 (0.185–0.627) | |||
|
| 7.98% | 0.879 (0.865–0.893) |
|
|
| 6.39% | 0.830 (0.809–0.850) |
|
|
|
|
|
| |||
|
|
|
|
|
| |||
|
|
|
|
|
| |||
|
| 1.54% | 0.912 (0.889–0.935) |
|
|
| 0.46% | 0.787 (0.699–0.874) |
| Consensus | 1.20% | 0.909 (0.880–0.937) | 0.49% | 0.832 (0.730–0.934) | |||
| MetaSVM | 1.11% | 0.894 (0.861–0.927) | 0.58% | 0.905 (0.848–0.961) | |||
| MetaLR | 1.11% | 0.889 (0.856–0.923) | 0.58% | 0.921 (0.872–0.971) | |||
|
| 1.95% | 0.930 (0.912–0.948) |
|
|
| 0.18% ( | 0.457 (0.140–0.774) |
| Consensus | 1.93% | 0.957 (0.943–0.970) | 0.17% ( | 0.566 (0.280–0.852) | |||
| MetaSVM | 1.77% | 0.939 (0.921–0.957) | 0.33% | 0.873 (0.803–0.944) | |||
| MetaLR | 1.87% | 0.932 (0.913–0.951) | 0.23% | 0.903 (0.833–0.973) | |||
|
| 0.70% | 0.833 (0.772–0.895) | HCM cluster | 0.55% | 0.925 (0.886–0.965) | 0.29% ( | 0.655 (0.455–0.856) |
| Consensus | 0.79% | 0.869 (0.817–0.921) | 0.05% ( | 0.310 (0–1) | |||
| MetaSVM | 0.50% | 0.840 (0.763–0.917) | 0.34% | 0.833 (0.735–0.930) | |||
| MetaLR | 0.53% | 0.809 (0.722–0.897) | 0.31% | 0.883 (0.809–0.958) | |||
|
| 0.41% | 0.683 (0.563–0.803) | HCM cluster | 0.43% | 0.882 (0.821–0.943) | 0.16% ( | 0.158 (0–0.724) |
| Consensus | 0.58% | 0.735 (0.630–0.839) | 0.02% ( | – | |||
| MetaSVM | 0.53% | 0.779 (0.687–0.871) | 0.07% ( | – | |||
| MetaLR | 0.55% | 0.751 (0.651–0.852) | 0.05% ( | – | |||
Comparison of performance of variant clustering and consensus functional prediction scores in enriching for disease-associated non-truncating/missense variants in 6 HCM genes where the clustering of case variants was detected. For each gene, the EF of all rare variants is shown, followed by the EF of variants prioritised by the approach, and the EF of the remaining variants that are not prioritised. Clustering analyses identified regions of 4 genes with an EF ≥ 0.95 (bold), and generally outperformed consensus functional prediction scores. Fisher’s exact p values for comparison of rare variation in cases and ExAC reference samples were < 0.0001 unless otherwise noted. For MYBPC3 (italics), the FATHMM predictor was not included in the consensus scores due to its poor performance for this gene, which also affected the MetaSVM and MetaLR consensus scores
Fig. 3Proposed adaptation of ACMG/AMP guidelines for rule PM1, relating to the relative frequencies of non-truncating variants in case cohorts and population controls
Fig. 4Effect of EF-based approach to variant classification in HCM cohorts. a Proportion of cases from the OMGL/LMM HCM cohorts with variants in 8 sarcomeric genes (only rare variants, ExAC filtering frequency < 4 × 10−5, are shown, excluding non-essential splice site variants). Coloured shading represents the clinical classification of the original diagnostic laboratory (OMGL and LMM), and, for variants originally classified as VUS, the proportion that could be reclassified as Likely Pathogenic based on occurrence within a gene or region with EF ≥ 0.95. Eighty-nine variants in 123 cases for MYH7, 12 variants in 27 cases for MYBPC3, 18 variants in 34 cases for TNNI3, 15 variants in 18 cases for TNNT2 and 22 variants in 33 cases for TPM1 would be upgraded based on this analysis. b Proportion of cases in a prospective HCM cohort classified as actionable based on application of fixed and automatable ACMG/AMP rules, alongside the addition of manual curation of published evidence and the proposed EF-calibrated PM1 rules. Thirty-one extra cases (4.5%) are upgraded with EF-based rules compared to just 4 (0.6%) with manual curation. c Comparison of indexed LV mass in cases with pathogenic variants, VUS in high EF (≥ 0.95) regions, and VUS in low EF regions (< 0.95) in MYH7/MYBPC3 as well as genotype-negative cases, from the prospective HCM cohort. The clinical phenotype of individuals with VUS at locations anticipated to be pathogenic is indistinguishable from known pathogenic/likely pathogenic variants, while individuals with VUS in other regions have a clinical phenotype more similar to individuals without a sarcomere variant. d Kaplan-Meier survival curve for the overall composite endpoint (including mortality, ventricular arrhythmia and heart failure composites) of the SHaRe cardiomyopathy registry stratified by genotype (HCM cases with pathogenic variants, VUS in high EF region (≥ 0.95), VUS in lower EF regions (< 0.95), and genotype-negative cases)
Fig. 5Distribution of rare variants in CPVT and ExAC cohorts for RYR2. All rare RYR2 non-truncating (missense and single amino acid inframe indel variants) variants in 1355 CPVT cases (well-phenotyped and referral) and ExAC are shown alongside a cartoon of the cDNA structure. Darker grey indicates higher variant density (overlapping variants not plotted separately). Three regions enriched for disease-associated variation were identified (shown in red)—residues 2138–2538, 3935–4196 and 4721–4959. Exons used in previously defined hotspot regions (original 41 exons and refined 21 exons) are highlighted as shown