| Literature DB >> 25967940 |
Ivone U S Leong1, Alexander Stuckey2, Daniel Lai3, Jonathan R Skinner4,5,6, Donald R Love7.
Abstract
BACKGROUND: Long QT syndrome (LQTS) is an autosomal dominant condition predisposing to sudden death from malignant arrhythmia. Genetic testing identifies many missense single nucleotide variants of uncertain pathogenicity. Establishing genetic pathogenicity is an essential prerequisite to family cascade screening. Many laboratories use in silico prediction tools, either alone or in combination, or metaservers, in order to predict pathogenicity; however, their accuracy in the context of LQTS is unknown. We evaluated the accuracy of five in silico programs and two metaservers in the analysis of LQTS 1-3 gene variants.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25967940 PMCID: PMC4630850 DOI: 10.1186/s12881-015-0176-z
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
prediction tools and metaservers used in the current study
|
|
|
|
|
|
|---|---|---|---|---|
| PolyPhen-2, version 2.2.2 [ | Protein sequence and structure |
| ||
| SNPs&GO [ | Supervised learning (support vector machine) |
| ||
| SIFT, version 5.2.0 [ | Sequence and evolutionary conservation |
| ||
| PROVEAN, version 1.1 [ | Sequence and evolutionary conservation |
| ||
| SNAP [ | Supervised learning (neural networks) |
| ||
| Meta-SNP [ | Metaserver | PANTHER | Sequence and evolutionary conservation |
|
| PhD-SNP | Supervised-learning (support vector machines) | |||
| SIFT | Sequence and evolutionary conservation | |||
| SNAP | Supervised learning (neural networks) | |||
| PredictSNP [ | Metaserver | MAPP | Sequence and evolutionary conservation |
|
| PhD-SNP | Supervised-learning (support vector machines) | |||
| PolyPhen-1 | Protein sequence and structure | |||
| PolyPhen-2 | Protein sequence and structure | |||
| SIFT | Sequence and evolutionary conservation | |||
| SNAP | Supervised learning (neural networks) |
Conditions for SNV data output from two, three, four and all missense prediction tools in order to be considered tolerated and damaging
|
|
|
|
|---|---|---|
| Two tools | Unanimous neutral/tolerated/benign | Unanimous damaging/disease/deleterious/non-neutral |
| One output is damaging/disease/deleterious/non-neutral | ||
| Three tools | Unanimous neutral/tolerated/benign | Unanimous damaging/disease/deleterious/non-neutral |
| Two outputs are neutral/tolerated/benign | Two outputs are damaging/disease/deleterious/non-neutral | |
| Four tools | Unanimous neutral/tolerated/benign | Unanimous damaging/disease/deleterious/non-neutral |
| Three outputs are neutral/tolerate/benign | Two or more outputs are damaging/disease/deleterious/non-neutral | |
| All tools | Unanimous neutral/tolerated/benign | Unanimous damaging/disease/deleterious/non-neutral |
| Three outputs are neutral/tolerated/benign | Three or more outputs are damaging/disease/deleterious/non-neutral |
The total number of functionally characterised , and gene variants investigated in this study
|
|
|
|
|
|---|---|---|---|
|
| 101 | 8 | 109 |
|
| 82 | 8 | 90 |
|
| 99 | 14 | 113 |
| Total | 312 |
Area under the curve (AUC) calculated from Receiver operating curves for each of the prediction tools alone using their respective probabilities/scores for , , and all genes combined
|
|
|
|
|
| ||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| PolyPhen-2 | 0.942 | (0.877-1.000) | 0.774 | (0.624-0.924) | 0.626 | (0.485-0.767) | 0.769 | (0.688-0.850) |
| SNPs&GO | 0.933 | (0.882-0.984) | 0.864 | (0.752-0.976) | 0.666 | (0.528-0.803) | 0.781 | (0.714-0.849) |
| SIFT | 0.834 | (0.687-0.982) | 0.819 | (0.729-0.910) | 0.643 | (0.450-0.836) | 0.715 | (0.602-0.828) |
| PROVEAN | 0.943 | (0.891-0.995) | 0.904 | (0.840-0.968) | 0.631 | (0.456-0.807) | 0.786 | (0.689-0.883) |
| SNAP | 0.689 | (0.500-0.877) | 0.397 | (0.194-0.600) | 0.590 | (0.421-0.758) | 0.627 | (0.522-0.732) |
| Meta-SNP | 0.959 | (0.918-0.999) | 0.905 | (0.826-0.984) | 0.639 | (0.506-0.772) | 0.839 | (0.781-0.897) |
| PredictSNP | 0.713 | (0.571-0.856) | 0.549 | (0.373-0.725) | 0.555 | (0.397-0.714) | 0.603 | (0.513-0.693) |
Figure 1The accuracy (acc), sensitivity (sens), specificity (spec) and MCC scores of all the different combinations of in silico prediction tools and metaservers for variants in the KCNQ1, KCNH2, SCN5A genes, and all genes combined. The sensitivity and specificity percentages highlighted in light green represent the combination of in silico tools and metaservers with high sensitivity or specificity percentages; dark green represents the combination of in silico tools or metaservers with the best performance; light blue represents the combination of in silico tools and metaservers with high MCC scores, and dark blue represents the combination of in silico tools or metaservers with the best performance; light orange represents the combination of in silico tools and metaservers with high accuracy percentages; yellow represents the combination of in silico tools or metaservers with the best performance; red represents the combination of in silico tools and metaservers with MCC scores below 0.
Figure 2The accuracy (acc), sensitivity (sens), specificity (spec) and MCC scores of all the different combinations of in silico prediction tools and metaservers for the whole of SCN5A, only the amino-/carboxyl-terminus and transmembrane domain (N/TM/C) of SCN5A, and only the loop regions of SCN5A. The sensitivity and specificity percentages highlighted in light green represent the combination of in silico tools and metaservers with high sensitivity or specificity percentages; dark green represents the combination of in silico tools or metaservers with the best performance; light blue represents the combination of in silico tools and metaservers with high MCC scores, and dark blue represents the combination of in silico tools or metaservers with the best performance; light orange represents the combination of in silico tools and metaservers with high accuracy percentages; yellow represents the combination of in silico tools or metaservers with the best performance; red represents the combination of in silico tools and metaservers with MCC scores below 0.
Figure 3Receiver operating characteristic (ROC) curves for KCNQ1, KCNH2, SCN5A and all genes combined for each of the single in silico prediction tools and the two metaservers.
Figure 4Procedure used for assessing new missense mutations using in silico prediction tools for KCNQ1, KCNH2 and SCN5A. The in silico prediction tools for SCN5A are italicised as SNAP & PROVEAN should be used with caution for the prediction of SCN5A SNVs.