| Literature DB >> 23819482 |
Emidio Capriotti1, Remo Calabrese, Piero Fariselli, Pier Luigi Martelli, Russ B Altman, Rita Casadio.
Abstract
BACKGROUND: SNPs&GO is a method for the prediction of deleterious Single Amino acid Polymorphisms (SAPs) using protein functional annotation. In this work, we present the web server implementation of SNPs&GO (WS-SNPs&GO). The server is based on Support Vector Machines (SVM) and for a given protein, its input comprises: the sequence and/or its three-dimensional structure (when available), a set of target variations and its functional Gene Ontology (GO) terms. The output of the server provides, for each protein variation, the probabilities to be associated to human diseases.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23819482 PMCID: PMC3665478 DOI: 10.1186/1471-2164-14-S3-S6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Composition of the datasets
| SAP-SEQ | SAP-3D | SAP-NEW | |
|---|---|---|---|
| Total variations | 38,460 | 6,630 | 1,489 |
| Disease related variations | 19,230 | 3,342 | 960 |
| Neutral variations | 19,230 | 3,288 | 529 |
| Proteins | 9,067 | 721* | 271 |
*protein sequences of SAP-SEQ, endowed with structure, that are used to train/test SNPs&GO3d.
Figure 1Schematic view of SNPs&GO (panel A) and SNPs&GO. From the left to the right, the SNPs&GO and SNPs&GO3d input web pages, the flow chart of the sequence and structure-based methods and two examples of the returned outputs.
Performance of the different methods on the SAP-SEQ dataset
| Method | Qtot | P[D] | S[D] | P[N] | S[N] | MCC | AUC |
|---|---|---|---|---|---|---|---|
| PhD-SNP | 0.76 | 0.77 | 0.75 | 0.76 | 0.78 | 0.52 | 0.83 |
| SNPs&GO | 0.81 | 0.82 | 0.79 | 0.80 | 0.82 | 0.61 | 0.88 |
D=disease related, N=neutral. For index definition see "Measures of performance".
Performance of the different methods on the SAP-3D dataset
| Method | Otot | P[D] | S[D] | P[N] | S[N] | MCC | AUC |
|---|---|---|---|---|---|---|---|
| PhD-SNP | 0.79 | 0.83 | 0.74 | 0.76 | 0.84 | 0.58 | 0.87 |
| S3D-PROF | 0.81 | 0.80 | 0.84 | 0.83 | 0.78 | 0.63 | 0.88 |
| SNPs&GO | 0.81 | 0.80 | 0.83 | 0.82 | 0.78 | 0.61 | 0.89 |
| SNPs&GO3d | 0.84 | 0.82 | 0.86 | 0.85 | 0.81 | 0.68 | 0.91 |
D=disease related, N=neutral. For index definition see "Measures of performance".
Performance of the different methods on the SAP-NEW dataset
| Method | Qtot | P[D] | S[D] | P[N] | S[N] | MCC | AUC |
|---|---|---|---|---|---|---|---|
| PhD-SNP | 0.70 | 0.77 | 0.75 | 0.57 | 0.61 | 0.35 | 0.74 |
| S3D-PROF | 0.80 | 0.80 | 0.84 | 0.80 | 0.76 | 0.60 | 0.87 |
| SNPs&GO | 0.79 | 0.84 | 0.83 | 0.71 | 0.71 | 0.54 | 0.85 |
| SNPs&GO3d | 0.83 | 0.84 | 0.86 | 0.84 | 0.80 | 0.67 | 0.91 |
D=disease related, N=neutral. For index definition see "Measures of performance
Figure 2Performance of SNP&GO and SNPs&GO. In panel A the ROC curves of both methods are shown. In panels B and C the performances of SNP&GO and SNPs&GO3d as a function of the Reliability index (RI) are reported.