| Literature DB >> 20021687 |
Jagat S Chauhan1, Nitish K Mishra, Gajendra P S Raghava.
Abstract
BACKGROUND: One of the major challenges in post-genomic era is to provide functional annotations for large number of proteins arising from genome sequencing projects. The function of many proteins depends on their interaction with small molecules or ligands. ATP is one such important ligand that plays critical role as a coenzyme in the functionality of many proteins. There is a need to develop method for identifying ATP interacting residues in a ATP binding proteins (ABPs), in order to understand mechanism of protein-ligands interaction.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20021687 PMCID: PMC2803200 DOI: 10.1186/1471-2105-10-434
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Percentage composition of ATP interacting and non-interacting residues.
The performance of SVM model (learning parameter: g: 0.1 c: 2 j: 3) using amino acid sequence (The SVM parameter g (in RBF kernel), c: parameter for trade-off between training error & margin, j: cost-factor)
| Thres | Sen | Spec | Accuracy | MCC |
|---|---|---|---|---|
| -1 | 100 | 1.73 | 50.87 | 0.09 |
| -0.9 | 99.87 | 2.88 | 51.37 | 0.11 |
| -0.8 | 99.67 | 4.39 | 52.03 | 0.13 |
| -0.7 | 99.25 | 6.51 | 52.88 | 0.15 |
| -0.6 | 98.36 | 10.31 | 54.34 | 0.18 |
| -0.5 | 96.89 | 15.78 | 56.33 | 0.22 |
| -0.4 | 93.75 | 23.54 | 58.64 | 0.24 |
| -0.3 | 88.94 | 32.9 | 60.92 | 0.26 |
| -0.2 | 83.4 | 43.31 | 63.36 | 0.29 |
| 0.1 | 54.6 | 76.99 | 65.79 | 0.32 |
| 0.2 | 43.11 | 84.98 | 64.04 | 0.31 |
| 0.3 | 33.94 | 91.16 | 62.55 | 0.31 |
| 0.4 | 25.7 | 94.7 | 60.2 | 0.28 |
| 0.5 | 18.07 | 97.15 | 57.61 | 0.25 |
| 0.6 | 12.64 | 98.53 | 55.58 | 0.22 |
| 0.7 | 8.71 | 99.18 | 53.94 | 0.19 |
| 0.8 | 6.22 | 99.54 | 52.88 | 0.16 |
| 0.9 | 3.93 | 99.77 | 51.85 | 0.13 |
| 1 | 1.87 | 99.84 | 50.85 | 0.08 |
(Bold values indicate the point where sensitivity and specificity is roughly equal with maximum MCC.)
The performance of SVM model using binary pattern of different window size patterns.
| Window size | Threshold | Sensitivity | Specificity | Accuracy | MCC | parameters |
|---|---|---|---|---|---|---|
| 7 | 0 | 60.99 | 64.47 | 62.73 | 0.25 | g:0.1 c:1 j:1 |
| 9 | 0 | 61.25 | 67.41 | 64.33 | 0.29 | g:0.1 c:1 j:1 |
| 11 | 0 | 63.2 | 64.01 | 63.61 | 0.27 | g:0.1 c:3 j:3 |
| 13 | 0 | 63.4 | 64.81 | 64.11 | 0.28 | g:0.1 c:2 j:1 |
| 15 | 0 | 61.62 | 67.11 | 64.37 | 0.29 | g:0.1 c:1 j:1 |
| 19 | 0 | 61.98 | 69.44 | 65.71 | 0.32 | g:0.1 c:1 j:1 |
| 21 | 0 | 60.86 | 69.96 | 65.41 | 0.31 | g:0.1 c:1 j:1 |
| 23 | 0 | 63.65 | 68.52 | 66.08 | 0.32 | g:0.1 c:2 j:2 |
| 25 | 0 | 63.32 | 70.26 | 66.79 | 0.34 | g:0.1 c:2 j:1 |
(Bold values indicate the values where accuracy highest and sensitivity and specificity are roughly equal)
Figure 2ROC plot shows performance of SVM modules developed using amino acid sequence and PSSM profile.
The Performance of SVM model (Learning Parameter: g: 0.01 c: 4 j: 1) Using PSI-BLAST Profile
| Threshold | Sensitivity | Specificity | Accuracy | MCC |
|---|---|---|---|---|
| -1 | 98.52 | 15.47 | 57 | 0.25 |
| -0.9 | 97.93 | 20.43 | 59.18 | 0.29 |
| -0.8 | 96.55 | 25.2 | 60.87 | 0.31 |
| -0.7 | 95.07 | 30.68 | 62.88 | 0.34 |
| -0.6 | 93.27 | 36.96 | 65.11 | 0.37 |
| -0.5 | 90.87 | 43.59 | 67.23 | 0.39 |
| -0.4 | 88.44 | 50.1 | 69.27 | 0.42 |
| -0.3 | 85.48 | 56.34 | 70.91 | 0.44 |
| -0.2 | 82 | 63.34 | 72.67 | 0.46 |
| 0.2 | 65.41 | 84.4 | 74.9 | 0.51 |
| 0.3 | 60.32 | 87.78 | 74.05 | 0.5 |
| 0.4 | 55.85 | 90.6 | 73.23 | 0.5 |
| 0.5 | 51.22 | 92.97 | 72.09 | 0.49 |
| 0.6 | 46.39 | 94.58 | 70.48 | 0.47 |
| 0.7 | 40.21 | 96.12 | 68.17 | 0.44 |
| 0.8 | 34.63 | 97.44 | 66.03 | 0.41 |
| 0.9 | 28.65 | 98.03 | 63.34 | 0.37 |
| 1 | 21.78 | 98.92 | 60.35 | 0.33 |
(Italic-bold values indicate the point where sensitivity and specificity is roughly equal and Bold values indicate point where maximum Accuracy and MCC.)
The Performance of SVM model (Learning Parameter: g: 0.001 c: 4 j: 1) Using seven physiochemical properties.
| Threshold | Sensitivity | Specificity | Accuracy | MCC |
|---|---|---|---|---|
| -0.9 | 93.68 | 18.71 | 56.19 | 0.19 |
| -0.8 | 92.35 | 22.05 | 57.2 | 0.2 |
| -0.7 | 90 | 26.85 | 58.43 | 0.22 |
| -0.6 | 87.35 | 31.23 | 59.29 | 0.22 |
| -0.5 | 84.3 | 36.72 | 60.51 | 0.24 |
| -0.4 | 80.43 | 41.79 | 61.11 | 0.24 |
| -0.3 | 76.32 | 47.35 | 61.84 | 0.25 |
| -0.2 | 72.45 | 52.68 | 62.57 | 0.26 |
| -0.1 | 68.25 | 57.78 | 63.01 | 0.26 |
| 0.1 | 57.95 | 68.08 | 63.01 | 0.26 |
| 0.2 | 52.55 | 72.75 | 62.65 | 0.26 |
| 0.3 | 47.15 | 77.05 | 62.1 | 0.25 |
| 0.4 | 41.56 | 80.86 | 61.21 | 0.24 |
| 0.5 | 36.56 | 84.77 | 60.66 | 0.24 |
| 0.6 | 31.79 | 87.42 | 59.6 | 0.23 |
| 0.7 | 26.99 | 89.57 | 58.28 | 0.21 |
| 0.8 | 23.15 | 92.55 | 57.85 | 0.22 |
| 0.9 | 19.44 | 94.34 | 56.89 | 0.21 |
(Bold values indicate point where maximum Accuracy and MCC.)