| Literature DB >> 28050887 |
Iain D Kerr1, Hannah C Cox2, Kelsey Moyes2, Brent Evans2, Brianna C Burdett2, Aric van Kan2, Heather McElroy2, Paris J Vail2, Krystal L Brown2, Dechie B Sumampong2, Nicholas J Monteferrante2, Kennedy L Hardman2, Aaron Theisen2, Erin Mundt2, Richard J Wenstrup2, Julie M Eggington2.
Abstract
Missense variants represent a significant proportion of variants identified in clinical genetic testing. In the absence of strong clinical or functional evidence, the American College of Medical Genetics recommends that these findings be classified as variants of uncertain significance (VUS). VUSs may be reclassified to better inform patient care when new evidence is available. It is critical that the methods used for reclassification are robust in order to prevent inappropriate medical management strategies and unnecessary, life-altering surgeries. In an effort to provide evidence for classification, several in silico algorithms have been developed that attempt to predict the functional impact of missense variants through amino acid sequence conservation analysis. We report an analysis comparing internally derived, evidence-based classifications with the results obtained from six commonly used algorithms. We compiled a dataset of 1118 variants in BRCA1, BRCA2, MLH1, and MSH2 previously classified by our laboratory's evidence-based variant classification program. We compared internally derived classifications with those obtained from the following in silico tools: Align-GVGD, CONDEL, Grantham Analysis, MAPP-MMR, PolyPhen-2, and SIFT. Despite being based on similar underlying principles, all algorithms displayed marked divergence in accuracy, specificity, and sensitivity. Overall, accuracy ranged from 58.7 to 90.8% while the Matthews Correlation Coefficient ranged from 0.26-0.65. CONDEL, a weighted average of multiple algorithms, did not perform significantly better than its individual components evaluated here. These results suggest that the in silico algorithms evaluated here do not provide reliable evidence regarding the clinical significance of missense variants in genes associated with hereditary cancer.Entities:
Keywords: HBOC; Lynch syndrome; Missense mutation; Sequence conservation; Variant classification; Variants of uncertain significance
Year: 2017 PMID: 28050887 PMCID: PMC5386911 DOI: 10.1007/s12687-016-0289-x
Source DB: PubMed Journal: J Community Genet ISSN: 1868-310X
Description of in silico tools
| In silico tool | Algorithm parameters | Notes |
|---|---|---|
| SIFT | • Sequence conservation | |
| PolyPhen-2 | • Sequence conservation | |
| Grantham matrix score | • Biochemical properties of amino acids | • Consideration of composition, polarity, and molecular volume of amino acids |
| Align-GVGD | • Sequence/biochemical variation | • Extension of Grantham matrix score to sequence alignments |
| MAPP-MMR | • Sequence conservation | • Only used for |
| CONDEL server | • Sequence conservation | • Weighted combination of SIFT, PolyPhen-2 and MutationAssessor scores |
SIFT sorting intolerant from tolerant, PolyPhen-2 polymorphism phenotyping v2, MAPP-MMR multivariate analysis of protein polymorphism, mismatch repair, CONDEL server CONsensus DELeteriousness score of missense SNVs
Performance of in silico tools
| Algorithm | Total variants | Accuracy | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|---|
| Overall performance | ||||||
| Align-GVGD | 368 | 90.8% | 84.1% | 91.7% | 57.8% | 97.7% |
| SIFT | 1118 | 60.2% | 99.0% | 56.4% | 18.2% | 99.8% |
| PolyPhen-2 | 1118 | 58.7% | 90.0% | 55.6% | 16.6% | 98.3% |
| PolyPhen-2 | 1118 | 71.1% | 81.0% | 70.1% | 21.0% | 97.4% |
| CONDEL | 1109 | 69.8% | 84.4% | 68.4% | 20.2% | 97.9% |
| Grantham | 866 | 67.1% | 78.2% | 66.0% | 18.5% | 96.8% |
| MAPP-MMRa | 71 | 74.6% | 100.0% | 56.1% | 62.5% | 100.0% |
|
| ||||||
| Align-GVGD | 103 | 97.1% | 91.7% | 97.8% | 84.6% | 98.9% |
| SIFT | 419 | 48.9% | 100.0% | 41.5% | 19.9% | 100.0% |
| PolyPhen-2 | 419 | 55.6% | 83.0% | 51.6% | 19.9% | 95.5% |
| PolyPhen-2 | 419 | 69.2% | 67.9% | 69.4% | 24.3% | 93.7% |
| CONDEL | 414 | 67.4% | 76.0% | 66.2% | 23.6% | 95.3% |
| Grantham | 329 | 69.0% | 90.2% | 66.0% | 27.4% | 97.9% |
|
| ||||||
| Align-GVGD | 165 | 92.1% | 100.0% | 92.1% | 7.1% | 100.0% |
| SIFT | 599 | 60.9% | 100.0% | 59.9% | 6.4% | 100.0% |
| PolyPhen-2 | 599 | 73.6% | 100.0% | 72.9% | 9.2% | 100.0% |
| PolyPhen-2 | 599 | 67.8% | 100.0% | 66.9% | 7.7% | 100.0% |
| CONDEL | 596 | 71.8% | 100.0% | 71.0% | 8.7% | 100.0% |
| Grantham | 455 | 64.4% | 53.8% | 64.7% | 4.3% | 97.9% |
|
| ||||||
| Align-GVGD | 49 | 79.6% | 76.2% | 82.1% | 76.2% | 82.1% |
| SIFT | 49 | 71.4% | 100.0% | 50.0% | 60.0% | 100.0% |
| PolyPhen-2 | 49 | 59.2% | 95.2% | 32.1% | 51.3% | 90.0% |
| PolyPhen-2 | 49 | 67.3% | 95.2% | 46.4% | 57.1% | 92.9% |
| CONDEL | 49 | 69.4% | 90.5% | 53.6% | 59.4% | 88.2% |
| Grantham | 39 | 71.8% | 68.8% | 73.9% | 64.7% | 77.3% |
| MAPP-MMR | 36 | 86.1% | 100.0% | 68.8% | 80.0% | 100.0% |
|
| ||||||
| Align-GVGD | 51 | 84.3% | 90.0% | 82.9% | 56.3% | 97.1% |
| SIFT | 51 | 52.9% | 90.0% | 43.9% | 28.1% | 94.7% |
| PolyPhen-2 | 51 | 56.9% | 100.0% | 46.3% | 31.3% | 100.0% |
| PolyPhen-2 | 51 | 60.8% | 90.0% | 53.7% | 32.1% | 95.7% |
| CONDEL | 50 | 66.0% | 88.9% | 61.0% | 33.3% | 96.2% |
| Grantham | 43 | 76.7% | 75.0% | 77.1% | 42.9% | 93.1% |
| MAPP-MMR | 35 | 62.9% | 100.0% | 48.0% | 43.5% | 100.0% |
aOnly used for MLH1 and MSH2
Fig. 1Overall (top) and gene-specific (bottom) Matthews correlation coefficients for in silico tools relative to laboratory classification
Case examples of discrepant classifications
| Laboratory classification | In silico classification(s) |
|---|---|
|
| |
| Benign | Benign |
| • Phenotypic evidence based on family history weighting algorithm (Pruss et al. | • PolyPhen-2 (HumDiv and HumVar) |
| Pathogenic | |
| • Align-GVGD, SIFT | |
|
| |
| Benign | Pathogenic/Likely Pathogenic |
| • | • Align-GVGD, PolyPhen-2 (HumDiv and HumVar), SIFT |
|
| |
| Benign | Pathogenic/Likely Pathogenic |
| • Phenotypic evidence based on family history weighting algorithm (Pruss et al. | • Align-GVGD, PolyPhen-2 (HumDiv and HumVar), SIFT, MAPP-MMR |