| Literature DB >> 34230640 |
C Cubuk1, A Garrett1, S Choi1, L King1, C Loveday1, B Torr1, G J Burghel2, M Durkie3, A Callaway4,5, R Robinson6, J Drummond7, I Berry6, A Wallace2, D Eccles5,8, M Tischkowitz7,9, N Whiffin10, J S Ware11,12, H Hanson1,13, C Turnbull14,15.
Abstract
PURPOSE: Where multiple in silico tools are concordant, the American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) framework affords supporting evidence toward pathogenicity or benignity, equivalent to a likelihood ratio of ~2. However, limited availability of "clinical truth sets" and prior use in tool training limits their utility for evaluation of tool performance.Entities:
Mesh:
Year: 2021 PMID: 34230640 PMCID: PMC8553612 DOI: 10.1038/s41436-021-01265-z
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.864
Forty-five in silico tools evaluated including components, training and test data sets.
| Tool name | Data output | Components of tool | Training data set(s) | Test (validation) data set(s) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Categorical | Continuous | Evolutionary conservation | Protein features | Amino acid features | Nucleotide features | Clinical/population variation | Clinical classifications: ClinVar | Clinical classifications: HGMD | Clinical classifications: other | Nonclinical: human variation | Nonclinical: functional studies | Clinical classifications: ClinVar | Clinical classifications: HGMD | Clinical classifications: other | Nonclinical: human variation | Nonclinical: functional studies | ||
| Align-GVGD | ✓ | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| BayesDEL | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||
| CADD | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||
| CHASM | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||
| ClinPred | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||
| Condel | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||
| DANN | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| Eigen | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| FATHMM | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||
| FATHMM-MKL | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| fitCons | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||||
| GAVIN | ✓ | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||
| GenoCanyon | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||||
| GERP++ | ✗ | ✓ | ✓ | ✓ | ||||||||||||||
| Grantham score | ✗ | ✓ | ✓ | ✓ | ||||||||||||||
| LRT | ✗ | ✓ | ✓ | ✓ | ||||||||||||||
| M-CAP | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||
| Meta-SNP | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||||
| MetaLR | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| MetaSVM | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||||
| MLP | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| MSC | ✓ | ✗ | – | – | – | – | – | ✓ | ✓ | ✓ | ✓ | |||||||
| MutationAssessor | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||||
| MutationTaster2 | ✓ | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| MutPred | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| MVP | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||
| PANTHER | ✗ | ✓ | ✓ | ✓ | ✓ | |||||||||||||
| PhastCons | PhastCons100way | ✗ | ✓ | ✓ | ✓ | |||||||||||||
| PhastCons20way | ✗ | ✓ | ✓ | ✓ | ||||||||||||||
| PhD-SNPg | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||||
| PhyloP | PhyloP100 verebrate | ✗ | ✓ | ✓ | ✓ | |||||||||||||
| PhyloP20 mammalian | ✗ | ✓ | ✓ | ✓ | ||||||||||||||
| Pmut | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||
| PolyPhen-2 | HumDiv | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||
| HumVar | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||||
| PON-P2 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||||
| PredictSNP | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| primateAI | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| PROVEAN | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||||
| REVEL | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||
| rfPred | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||
| SIFT | ✗ | ✓ | ✓ | ✓ | ✓ | |||||||||||||
| SiPhy | ✗ | ✓ | ✓ | ✓ | ✓ | |||||||||||||
| SNAP2 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| SNPs3D | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||||
| SuSPect (disease-susceptibility-based SAV phenotype prediction) | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||||||
| VEST3 | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| VEST4 | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
See Supplementary Table 1 for additional details on outputs, thresholds, variant types covered, and references.
Fig. 1Balanced accuracy for 70 tool–threshold combinations for seven truth sets.
Rates of true positive (TP), false negative (FN), true negative (TN), and false positive (FP) tool calls against functional truth sets also shown, along with rates of tool calls of deleterious (DEL, pink), tolerated (TOL, blue) or indeterminate (gray).
Positive likelihood ratios (PLR) and negative likelihood ratios (NLR) for pathogenicity for REVEL and Meta-SNP predictions using binary thresholds and score bands, examining the mean of results for each gene for BRCA1, BRCA2, MSH2, PTEN, and TP53 and the unweighted result for all variants (see also Supplementary Tables 10 and 11).
| REVEL | Meta-SNP | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | All | Mean | All | ||||||
| PLR | NLR | PLR | NLR | PLR | NLR | PLR | NLR | ||
| Binary | 0.9 | 45.9 (9.63–302.) | 1.98 (1.25–5.67) | 5.01 (4.59–5.48) | 1.69 (1.54–1.84) | 32.9 (3.08–470) | 1.01 (0.085–26.2) | 49.2 (14.9–162) | 1.01 (0.308–3.36) |
| 0.8 | 6.24 (4.83–8.22) | 4.05 (3.37–4.98) | 3.15 (3.00–3.31) | 3.02 (2.87–3.17) | 23.5 (8.69–203) | 1.40 (0.773–5.99) | 15.2 (12.7–18.2) | 1.38 (1.15–1.65) | |
| 0.7 | 3.13 (2.75–3.58) | 7.20 (6.27–8.33) | 2.33 (2.26–2.41) | 5.39 (5.21–5.57) | 6.80 (5.27–9.09) | 3.64 (2.87–4.77) | 5.68 (5.33–6.06) | 3.43 (3.22–3.66) | |
| 0.5 | 1.49 (1.40–1.59) | 19.9 (18.7–21.2) | 1.37 (1.35–1.39) | 12.4 (12.2–12.6) | 2.79 (2.49–3.14) | 9.98 (8.81–11.3) | 2.75 (2.66–2.85) | 8.62 (8.32–8.92) | |
| 0.4 | 1.18 (1.14–1.23) | 19.0 (18.2–19.7) | 1.15 (1.13–1.16) | 10.6 (10.4–10.7) | 2.02 (1.86–2.20) | 15.4 (14.1–16.8) | 1.88 (1.84–1.93) | 14.3 (14.0–14.7) | |
| 0.1 | 1.00 (1.00–1.00) | 1.87 (1.87–1.88) | 1.00 (1.00–1.00) | 16.7 (16.7–16.7) | 1.05 (1.03–1.07) | 25.1 (24.6–25.6) | 1.02 (1.02–1.03) | 41.7 (41.5–41.9) | |
| Band | 0.9–1.0 | 102 (17.9–701) | Compared to REVEL < 0.7 | 6.19 (5.74–6.67) | Compared to REVEL < 0.7 | 122. (10.8–1,799) | Compared to Meta-SNP < 0.7 | 157 (47.9–518) | Compared to Meta-SNP < 0.7 |
| 0.8–1.0 | 6.74 (5.24–8.82) | 3.14 (3.00–3.28) | 42.9 (14.4–406.) | 24.9 (21.0–29.5) | |||||
| 0.8–0.9 | 6.77 (5.07–9.15) | 3.77 (3.50–4.06) | 42.7 (14.2–406.) | 24.6 (20.7–29.2) | |||||
| 0.7–0.8 | 3.07 (2.23–4.28) | 2.93 (2.62–3.26) | 6.64 (5.01–9.07) | 5.62 (5.20–6.07) | |||||
| 0.5–0.7 | Compared to REVEL > 0.7 | 7.44 (6.64–8.38) | Compared to REVEL > 0.7 | 4.95 (4.81–5.10) | Compared to Meta-SNP > 0.7 | 3.55 (2.87–4.51) | Compared to Meta-SNP > 0.7 | 3.19 (3.02–3.37) | |
| 0.4–0.5 | 55.1 (50.0–60.7) | 25.3 (24.8–25.8) | 14.4 (11.1–19.1) | 10.4 (10.0–11.0) | |||||
| 0–0.5 | 30.4 (27.1–34.1) | 16.8 (16.4–17.3) | 10.6 (8.32–14.2) | 8.92 (8.42–9.45) | |||||
| 0–0.4 | 34.3 (31.5–37.3) | 18.2 (17.9–18.6) | 19.4 (15.6–24.9) | 18.1 (17.1–19.1) | |||||