| Literature DB >> 29179779 |
Rajarshi Ghosh1,2, Ninad Oak1,2, Sharon E Plon3,4.
Abstract
BACKGROUND: The American College of Medical Genetics and American College of Pathologists (ACMG/AMP) variant classification guidelines for clinical reporting are widely used in diagnostic laboratories for variant interpretation. The ACMG/AMP guidelines recommend complete concordance of predictions among all in silico algorithms used without specifying the number or types of algorithms. The subjective nature of this recommendation contributes to discordance of variant classification among clinical laboratories and prevents definitive classification of variants.Entities:
Keywords: ACMG; ClinVar; Clinical genetics; Diagnostics; In silico algorithm; ROC; Variant interpretation
Mesh:
Year: 2017 PMID: 29179779 PMCID: PMC5704597 DOI: 10.1186/s13059-017-1353-5
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Concordance among predictions of 18 algorithms for variants in ClinVar. Binary predictions made by 18 algorithms for each pathogenic or benign variants in ClinVar are shown in the upper and lower panels. Each variant is along a row and an orange, green, or white tile depicts a pathogenic, benign, or missing data call, respectively, by the corresponding algorithm. A total of 14,819 variants with ClinVar review status one star or above (a) and 2966 variants with ClinVar review status two stars or above (b) are shown
Concordance rate of different combination of algorithms
| Variant assertion in ClinVar | Variant source | Algorithms | Variants (n) | Concordance ( | False concordance (n (%)) |
|---|---|---|---|---|---|
| Benign | ClinVar* | All 18 | 7346 | 382 (5.2) | 57 (0.8) |
| Pathogenic | ClinVar* | All 18 | 7473 | 2930 (39.2) | 2 (0.03) |
| Benign | ClinVar** | All 18 | 1914 | 86 (4.5) | 12 (0.6) |
| Pathogenic | ClinVar** | All 18 | 1052 | 492 (46.8) | 0 (0) |
| Benign | ClinVar* | Polyphen, SIFT, CADD, PROVEAN, MutationTaster | 7346 | 2464 (33.5) | 815 (11.1) |
| Pathogenic | ClinVar* | Polyphen, SIFT, CADD, PROVEAN, MutationTaster | 7473 | 5904 (79.0) | 68 (0.9) |
| Benign | ClinVar* | Polyphen, SIFT, CADD | 7346 | 3392 (46.2) | 1340 (18.2) |
| Pathogenic | ClinVar* | Polyphen, SIFT, CADD | 7473 | 6342 (84.9) | 156 (2.1) |
ClinVar *: ClinVar variants with one star or above review status
ClinVar **: ClinVar variants with two stars or above review status
Fig. 2Concordance among algorithms. a Distribution of proportion of variants that had concordant calls by any given pair of algorithms (among 18 algorithms) for benign (green) and pathogenic (orange) variants in ClinVar. b Scatterplots of true concordance (variant assertion matches ClinVar assertion) vs false concordance (variant assertion does not match ClinVar assertion) for combinations of three, four, or five algorithms at a time. An orange and a green point depict the true and false concordance of a combination for benign and pathogenic variants, respectively, in ClinVar. The rugs on top and bottom, left and right represent the distribution of false and true concordances, respectively. c Hierarchical clustering of 25 algorithms with scores for 14,819 variants in ClinVar. Red rectangles indicate robust clusters with an AU p value of > 0.99 (see “Methods”)
Concordance among combinations of algorithms across all variants using publicly available thresholds.
| Algorithms (n) | Algorithms | Overall true concordance (%) | Overall false concordance (%) |
|---|---|---|---|
| 2 | REVEL, MetaSVM | 83.4 | 7.8 |
| 3 | VEST3, REVEL, MetaSVM* | 75.6 | 4.1 |
| 4 | Polyphen2, REVEL, MetaSVM, Eigen* | 69.3 | 4.1 |
| 5 | Provean, Polyphen2, REVEL, MetaSVM, Eigen | 64.5 | 3.2 |
Asterisks indicate that there were combinations with higher concordance but they included MetaSVM and MetaLR (see text)
Fig. 3Performance analysis of algorithms. The AUC of a ROC are plotted for 25 algorithms. Vertical dotted line indicates an AUC of 0.9 and 99% confidence intervals for each AUC are shown. Blue dots indicate AUC > 0.89. a AUCs of the algorithms across different datasets shown in the panels and described in text. b AUCs of the algorithms across different datasets (represented in panels) to address type I circularity as described in text. The same plots for ClinVar Status * and ClinVar Status ** as in Fig. 3a are used in 3b for comparison. Any instance of ** represents variants with ClinVar review status of two stars or above. Ensemble predictors are indicated by dark green labels on the y-axis