| Literature DB >> 23369171 |
Chi-Wei Chen1, Jerome Lin, Yen-Wei Chu.
Abstract
BACKGROUND: Mutation of a single amino acid residue can cause changes in a protein, which could then lead to a loss of protein function. Predicting the protein stability changes can provide several possible candidates for the novel protein designing. Although many prediction tools are available, the conflicting prediction results from different tools could cause confusion to users.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23369171 PMCID: PMC3549852 DOI: 10.1186/1471-2105-14-S2-S5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Data Processing of iStable. After collecting two datasets used for training I-Mutant2.0 and PoPMuSic2.0, we integrated them into a non redundant dataset of protein stability change data, with the information of secondary structure and RSA value on the mutant site included.
Summaries of element predictors
| Predictors | References | URLs |
|---|---|---|
| [ | ||
| [ | ||
| [ | ||
| [ | ||
| [ |
List of chosen predictors used in the construction of iStable with the corresponding references and URLs.
Figure 2Grid computing architecture of iStable. When a user input the mutant protein's information through graphical user interface, the input/output dispatcher will pass the relative information to element predictors. After the results from predictors are collected into repository module, prediction layer will active the prediction program and the output result will be send to data visualization layer through input/output dispatcher, finally the integrated result will be presented to the user.
Figure 3Workflow of iStable. Illustration of how iStable prediction proceeds after the user has input the data of interested target protein.
Comparison of prediction result with M1311
| Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|
| 0.944 | 0.981 | 0.969 | 0.930 | |
| 0.555 | 0.922 | 0.800 | 0.530 | |
| 0.702 | 0.973 | 0.883 | 0.734 | |
| 0.893 | 0.991 | 0.958 | 0.906 | |
| 0.772 | 0.975 | 0.907 | 0.789 | |
| 0.775 | 0.956 | 0.896 | 0.761 | |
| 0.313 | 0.941 | 0.724 | 0.341 | |
| 0.579 | 0.823 | 0.742 | 0.411 | |
| 0.737 | 0.984 | 0.902 | 0.779 |
I-Mutant_PDB: I-Mutant2.0 prediction strategy using PDB ID.
I-Mutant_SEQ: I-Mutant2.0 prediction strategy using protein sequence.
AUTO-MUTE_RF: AUTO-MUTE Random Forest prediction model.
AUTO-MUTE_SVM: AUTO-MUTE SVM prediction model.
MUPRO_ SVM: MUPRO SVM prediction model.
Comparison of prediction result with M1820
| Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|
| 0.456 | 0.900 | 0.752 | 0.409 | |
| 0.198 | 0.906 | 0.670 | 0.148 | |
| 0.212 | 0.899 | 0.670 | 0.155 | |
| 0.129 | 0.985 | 0.700 | 0.234 | |
| 0.067 | 0.965 | 0.666 | 0.072 | |
| 0.276 | 0.885 | 0.682 | 0.206 | |
| 0.303 | 0.952 | 0.736 | 0.352 | |
| 0.370 | 0.757 | 0.628 | 0.133 | |
| 0.113 | 0.984 | 0.693 | 0.212 |
Comparison of prediction result with M3131
| Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|
| 0.688 | 0.941 | 0.857 | 0.669 | |
| 0.377 | 0.916 | 0.736 | 0.357 | |
| 0.457 | 0.934 | 0.775 | 0.464 | |
| 0.511 | 0.989 | 0.829 | 0.615 | |
| 0.420 | 0.969 | 0.786 | 0.499 | |
| 0.526 | 0.908 | 0.780 | 0.480 | |
| 0.308 | 0.945 | 0.733 | 0.348 | |
| 0.474 | 0.780 | 0.678 | 0.261 | |
| 0.425 | 0.980 | 0.795 | 0.527 |
comparison of algorithms
| WS+SEQ | SEQ | |||||||
|---|---|---|---|---|---|---|---|---|
| Methods | Sn | Sp | Acc | MCC | Sn | Sp | Acc | MCC |
| 0.688 | 0.941 | 0.857 | 0.625 | 0.812 | ||||
| 0.764 | 0.760 | 0.495 | 0.583 | 0.759 | 0.700 | 0.337 | ||
| 0.694 | 0.910 | 0.838 | 0.627 | 0.630 | 0.894 | 0.806 | 0.550 | |
| 0.584 | 0.965 | 0.838 | 0.627 | 0.605 | 0.651 | 0.327 | ||
| 0.685 | 0.888 | 0.820 | 0.588 | 0.649 | 0.868 | 0.795 | 0.529 | |
| 0.425 | 0.795 | 0.527 | N/A | N/A | N/A | N/A | ||
SEQ: Sequence scheme; WS: Website result scheme
Figure 4Evaluation of predicted ddG. Correlation plot of the experimental observed and the predicted values of ddG based on iStable.
Evaluation of different combinations for prediction performance
| Strategies | Sn | Sp | Acc | MCC |
|---|---|---|---|---|
| 0.688 | 0.941 | 0.857 | 0.669 | |
| 0.627 | 0.960 | 0.849 | 0.652 | |
| 0.658 | 0.925 | 0.836 | 0.622 | |
| 0.701 | 0.745 | 0.731 | 0.484 |
SEQ: Sequence scheme; WS: Website result scheme
Performance comparison of iStable_SEQ and sequential models
| Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|
| 0.625 | 0.906 | 0.812 | 0.564 | |
| 0.457 | 0.934 | 0.775 | 0.464 | |
| 0.526 | 0.908 | 0.780 | 0.480 |
Comparison of performance based on secondary structure
| Secondary Structure | Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|---|
| iStable | 0.702 | 0.933 | 0.850 | 0.666 | |
| I-Mutant_PDB | 0.415 | 0.901 | 0.728 | 0.371 | |
| I-Mutant_SEQ | 0.520 | 0.929 | 0.784 | 0.509 | |
| AUTO-MUTE_RF | 0.563 | 0.987 | 0.834 | 0.647 | |
| AUTO-MUTE_SVM | 0.495 | 0.957 | 0.792 | 0.536 | |
| MUPRO_SVM | 0.639 | 0.915 | 0.818 | 0.591 | |
| PoPMuSiC2.0 | 0.250 | 0.957 | 0.708 | 0.311 | |
| CUPSAT | 0.541 | 0.778 | 0.693 | 0.323 | |
| iStable | 0.691 | 0.946 | 0.876 | 0.676 | |
| I-Mutant_PDB | 0.348 | 0.944 | 0.782 | 0.385 | |
| I-Mutant_SEQ | 0.495 | 0.948 | 0.825 | 0.520 | |
| AUTO-MUTE_RF | 0.455 | 0.984 | 0.838 | 0.567 | |
| AUTO-MUTE_SVM | 0.297 | 0.996 | 0.805 | 0.426 | |
| MUPRO_SVM | 0.417 | 0.904 | 0.770 | 0.370 | |
| PoPMuSiC2.0 | 0.310 | 0.956 | 0.776 | 0.363 | |
| CUPSAT | 0.417 | 0.796 | 0.697 | 0.213 | |
| iStable | 0.680 | 0.943 | 0.847 | 0.666 | |
| I-Mutant_PDB | 0.365 | 0.893 | 0.699 | 0.311 | |
| I-Mutant_SEQ | 0.358 | 0.924 | 0.716 | 0.354 | |
| AUTO-MUTE_RF | 0.479 | 0.995 | 0.805 | 0.595 | |
| AUTO-MUTE_SVM | 0.386 | 0.954 | 0.745 | 0.434 | |
| MUPRO_SVM | 0.485 | 0.900 | 0.748 | 0.433 | |
| PoPMuSiC2.0 | 0.330 | 0.889 | 0.688 | 0.270 | |
| CUPSAT | 0.474 | 0.766 | 0.662 | 0.249 |
Helix: α helix; Sheet: β sheet; Other: turns and coil.
Comparison of performance based on RSA range
| RSA range | Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|---|
| iStable | 0.640 | 0.946 | 0.869 | 0.634 | |
| I-Mutant_PDB | 0.197 | 0.942 | 0.757 | 0.208 | |
| I-Mutant_SEQ | 0.394 | 0.947 | 0.809 | 0.428 | |
| AUTO-MUTE_RF | 0.387 | 0.988 | 0.839 | 0.528 | |
| AUTO-MUTE_SVM | 0.254 | 0.989 | 0.806 | 0.403 | |
| MUPRO_SVM | 0.445 | 0.922 | 0.803 | 0.423 | |
| PoPMuSiC2.0 | 0.201 | 0.969 | 0.778 | 0.285 | |
| CUPSAT | 0.381 | 0.822 | 0.714 | 0.209 | |
| iStable | 0.684 | 0.954 | 0.854 | 0.684 | |
| I-Mutant_PDB | 0.458 | 0.911 | 0.746 | 0.427 | |
| I-Mutant_SEQ | 0.537 | 0.940 | 0.792 | 0.542 | |
| AUTO-MUTE_RF | 0.604 | 0.981 | 0.843 | 0.665 | |
| AUTO-MUTE_SVM | 0.510 | 0.967 | 0.799 | 0.566 | |
| MUPRO_SVM | 0.508 | 0.905 | 0.759 | 0.460 | |
| PoPMuSiC2.0 | 0.146 | 0.963 | 0.667 | 0.189 | |
| CUPSAT | 0.536 | 0.781 | 0.692 | 0.323 | |
| iStable | 0.782 | 0.920 | 0.853 | 0.712 | |
| I-Mutant_PDB | 0.527 | 0.818 | 0.683 | 0.363 | |
| I-Mutant_SEQ | 0.502 | 0.927 | 0.728 | 0.480 | |
| AUTO-MUTE_RF | 0.598 | 0.993 | 0.807 | 0.653 | |
| AUTO-MUTE_SVM | 0.565 | 0.933 | 0.760 | 0.543 | |
| MUPRO_SVM | 0.665 | 0.902 | 0.788 | 0.587 | |
| PoPMuSiC2.0 | 0.439 | 0.857 | 0.658 | 0.329 | |
| CUPSAT | 0.513 | 0.661 | 0.592 | 0.177 |
Figure 5Individual performance of different window size. By comparing accuracy and MCC, a window size of 11 showed the best performance for the both parameters.
Evaluation of iStable prediction results with data from different protein superfamilies
| Protein categories | Predictors | Sn | Sp | Acc | MCC |
|---|---|---|---|---|---|
| iStable | 0.550 | 0.943 | 0.795 | 0.567 | |
| I-Mutant_PDB | 0.550 | 0.852 | 0.742 | 0.439 | |
| I-Mutant_SEQ | 0.300 | 0.943 | 0.704 | 0.343 | |
| AUTO-MUTE_RF | 0.250 | 0.971 | 0.704 | 0.359 | |
| AUTO-MUTE_SVM | 0.250 | 0.943 | 0.684 | 0.262 | |
| MUPRO_SVM | 0.450 | 0.857 | 0.704 | 0.395 | |
| PoPMuSiC2.0 | 0.400 | 0.910 | 0.724 | 0.355 | |
| CUPSAT | 0.350 | 0.552 | 0.476 | -0.073 | |
| iStable | 0.451 | 0.797 | 0.720 | 0.334 | |
| I-Mutant_PDB | 0.253 | 0.869 | 0.756 | 0.135 | |
| I-Mutant_SEQ | 0.242 | 0.878 | 0.762 | 0.131 | |
| AUTO-MUTE_RF | 0.138 | 0.978 | 0.825 | 0.217 | |
| AUTO-MUTE_SVM | 0.057 | 0.965 | 0.800 | 0.049 | |
| MUPRO_SVM | 0.281 | 0.859 | 0.753 | 0.144 | |
| PoPMuSiC2.0 | 0.344 | 0.931 | 0.824 | 0.328 | |
| CUPSAT | 0.390 | 0.740 | 0.676 | 0.112 | |
| iStable | 0.357 | 0.943 | 0.831 | 0.379 | |
| I-Mutant_PDB | 0.207 | 0.858 | 0.733 | 0.088 | |
| I-Mutant_SEQ | 0.361 | 0.798 | 0.714 | 0.161 | |
| AUTO-MUTE_RF | 0.129 | 0.965 | 0.805 | 0.145 | |
| AUTO-MUTE_SVM | 0.079 | 0.970 | 0.799 | 0.100 | |
| MUPRO_SVM | 0.204 | 0.864 | 0.737 | 0.076 | |
| PoPMuSiC2.0 | 0.100 | 0.964 | 0.798 | 0.091 | |
| CUPSAT | 0.461 | 0.778 | 0.717 | 0.216 |
Evaluation of iStable prediction results with data from pH-temperature ranges by accuracy
| pH | < = 6 | 6~8 | > 8 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| I-Mutant_PDB | 0.38 | 0.68 | 0.55 | 0.22 | 0.42 | 0.73 | 0.18 | 0.69 | 0.08 |
| I-Mutant_SEQ | 0.46 | 0.79 | 0.77 | 0.25 | 0.62 | 0.73 | 0.18 | 0.46 | 0.42 |
| AUTO-MUTE_SVM | 0.38 | 0.81 | 0.97 | 0.20 | 0.43 | 0.79 | 0.09 | 0.69 | 0.33 |
| AUTO-MUTE_RF | 0.48 | 0.97 | 1.00 | 0.28 | 0.47 | 0.85 | 0.09 | 0.77 | 0.83 |
| MUPRO | 0.46 | 0.69 | 0.90 | 0.40 | 0.49 | 0.88 | 0.27 | 0.85 | 0.58 |
| PoPMuSiC | 0.13 | 0.17 | 0.35 | 0.29 | 0.36 | 0.60 | 0.09 | 0.38 | 0.17 |
| CUPSAT | 0.57 | 0.59 | 0.55 | 0.38 | 0.59 | 0.54 | 0.27 | 0.54 | 0.50 |
| iStable | 0.61 | 0.94 | 1.00 | 0.55 | 0.77 | 0.88 | 0.27 | 0.77 | 0.75 |