| Literature DB >> 25802880 |
Lauren C Walters-Sen1, Sayaka Hashimoto2, Devon Lamb Thrush3, Shalini Reshmi4, Julie M Gastier-Foster5, Caroline Astbury4, Robert E Pyatt4.
Abstract
Current practice by clinical diagnostic laboratories is to utilize online prediction programs to help determine the significance of novel variants in a given gene sequence. However, these programs vary widely in their methods and ability to correctly predict the pathogenicity of a given sequence change. The performance of 17 publicly available pathogenicity prediction programs was assayed using a dataset consisting of 122 credibly pathogenic and benign variants in genes associated with the RASopathy family of disorders and limb-girdle muscular dystrophy. Performance metrics were compared between the programs to determine the most accurate program for loss-of-function and gain-of-function mechanisms. No one program correctly predicted the pathogenicity of all variants analyzed. A major hindrance to the analysis was the lack of output from a significant portion of the programs. The best performer was MutPred, which had a weighted accuracy of 82.6% in the full dataset. Surprisingly, combining the results of the top three programs did not increase the ability to predict pathogenicity over the top performer alone. As the increasing number of sequence changes in larger datasets will require interpretation, the current study demonstrates that extreme caution must be taken when reporting pathogenicity based on statistical online protein prediction programs in the absence of functional studies.Entities:
Keywords: Diagnostics; pathogenicity; prediction; sequencing; variants
Year: 2014 PMID: 25802880 PMCID: PMC4367082 DOI: 10.1002/mgg3.116
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Figure 1Scheme for the selection of variants used in this study. Functional studies are the gold standard by which to establish the disease association (pathogenic) or normal variation (benign) status of any sequence variant. Unfortunately, functional studies have only been preformed for a small portion of identified variants. To ensure the greatest likelihood that we were using pathogenic and benign variants to examine this series of prediction programs, we used this selection criteria to establish our dataset. Thirty-six additional benign variants were extracted from Ensembl as few had been identified by clinical sequencing analysis. For detailed descriptions of criteria, please see the Materials and Methods section.
Figure 2Percentage of correct predictions. The ability of the prediction programs to correctly assign either pathogenic (black) or benign (white) status to variants in the RASopathy dataset (A) and the LGMD dataset (B) is shown. The program used and the number of variants with prediction outputs (pathogenic, benign) are listed below the graph. Percentages were generated by dividing the number of variants predicted correctly by the number of variants with prediction outputs for each class (pathogenic or benign). The RASopathy dataset contained 35 credibly pathogenic variants and 19 credibly benign variants. The LGMD dataset contained 36 credibly pathogenic variants and 32 credibly benign variants.
Statistical measures for program performance in the RASopathy dataset
| Program | PPV | NPV | Specificity | Sensitivity | Accuracy | PWeight | WAccuracy |
|---|---|---|---|---|---|---|---|
| PolyPhen2-HumDiv | 0.955 | 0.682 | 0.938 | 0.750 | 0.818 | 0.815 | 0.667 |
| PolyPhen2-HumVar | 1.000 | 0.680 | 1.000 | 0.692 | 0.814 | 0.796 | 0.648 |
| SIFT | 0.931 | 0.889 | 0.889 | 0.931 | 0.915 | 0.870 | 0.796 |
| PMut | 0.800 | 0.556 | 0.769 | 0.600 | 0.667 | 0.611 | 0.407 |
| SNPs3D | 0.913 | 0.818 | 0.818 | 0.913 | 0.882 | 0.680 | 0.600 |
| PANTHER | 1.000 | 0.800 | 1.000 | 0.889 | 0.923 | 0.765 | 0.706 |
| FATHMM-Weighted | 0.900 | 0.696 | 0.842 | 0.795 | 0.811 | 1.000 | 0.811 |
| FATHMM-Unweighted | 0.857 | 0.459 | 0.895 | 0.375 | 0.569 | 1.000 | 0.569 |
| MutationTaster | 0.739 | 1.000 | 0.200 | 1.000 | 0.755 | 0.925 | 0.698 |
| Condel | 0.926 | 0.708 | 0.895 | 0.781 | 0.824 | 1.000 | 0.824 |
| PROVEAN | 1.000 | 0.900 | 1.000 | 0.917 | 0.952 | 0.778 | 0.741 |
| Mutation assessor | 1.000 | 0.500 | 1.000 | 0.304 | 0.590 | 0.422 | 0.426 |
| MutPred | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.830 | 0.830 |
| nsSNPAnalyzer | 0.913 | 0.526 | 0.833 | 0.700 | 0.738 | 1.000 | 0.738 |
| PhD-SNP | 1.000 | 0.938 | 1.000 | 0.938 | 0.968 | 0.574 | 0.556 |
| SNAP | 0.955 | 0.846 | 0.917 | 0.913 | 0.914 | 0.648 | 0.593 |
| SNPs&GO | 1.000 | 0.708 | 1.000 | 0.563 | 0.788 | 0.660 | 0.520 |
PPV, positive predictive value; NPV, negative predictive value; PWeight, performance weight; WAccuracy, weighted accuracy.
See Materials and Methods section for calculations.
Figure 3Accuracy of prediction programs in the RASopathy and LGMD datasets. Both accuracy (white) and weighted accuracy (black) are shown for the prediction programs analyzed. The number of variants with usable prediction calls are listed for each individual program. (A) Gain-of-function RASopathy variants (n = 54); (B) Loss-of-function LGMD variants (n = 68).
Statistical measures for program performance in the LGMD dataset
| Program | PPV | NPV | Specificity | Sensitivity | Accuracy | PWeight | WAccuracy |
|---|---|---|---|---|---|---|---|
| PolyPhen2-HumDiv | 0.698 | 0.714 | 0.536 | 0.833 | 0.703 | 0.941 | 0.662 |
| PolyPhen2-HumVar | 0.700 | 0.704 | 0.679 | 0.724 | 0.702 | 0.838 | 0.588 |
| SIFT | 0.677 | 0.595 | 0.688 | 0.583 | 0.632 | 1.000 | 0.632 |
| PMut | 0.611 | 0.556 | 0.588 | 0.579 | 0.583 | 0.529 | 0.309 |
| SNPs3D | 0.947 | 0.500 | 0.875 | 0.720 | 0.758 | 0.805 | 0.610 |
| PANTHER | 0.778 | 0.609 | 0.778 | 0.609 | 0.683 | 0.719 | 0.491 |
| FATHMM-Weighted | 0.636 | 0.800 | 0.500 | 0.875 | 0.688 | 1.000 | 0.688 |
| FATHMM-Unweighted | 0.593 | 0.512 | 0.656 | 0.444 | 0.544 | 1.000 | 0.544 |
| MutationTaster | 0.738 | 1.000 | 0.450 | 1.000 | 0.784 | 0.750 | 0.588 |
| Condel | 0.722 | 0.688 | 0.688 | 0.722 | 0.706 | 1.000 | 0.706 |
| PROVEAN | 0.647 | 0.531 | 0.739 | 0.423 | 0.571 | 0.721 | 0.412 |
| Mutation assessor | 1.000 | 0.478 | 1.000 | 0.250 | 0.556 | 0.587 | 0.326 |
| MutPred | 0.912 | 0.926 | 0.893 | 0.939 | 0.918 | 0.897 | 0.824 |
| nsSNPAnalyzer | 0.923 | 0.333 | 0.667 | 0.750 | 0.737 | 1.000 | 0.737 |
| PhD-SNP | 0.762 | 0.700 | 0.737 | 0.727 | 0.732 | 0.574 | 0.420 |
| SNAP | 0.833 | 0.900 | 0.692 | 0.952 | 0.853 | 0.500 | 0.426 |
| SNPs&GO | 0.846 | 0.600 | 0.818 | 0.647 | 0.714 | 0.609 | 0.435 |
PPV, positive predictive value; NPV, negative predictive value; PWeight, performance weight; WAccuracy, weighted accuracy.
See Materials and Methods section for calculations.
Figure 4Performance of the combined program algorithm. The number of variants in each category (bars) and the percentage of correct predictions (line) are shown for each dataset when using the combined MutPred/Condel/FATHMM-Weighted method. RASopathy, n = 53; LGMD, n = 68; Combined, n = 121.