| Literature DB >> 23468845 |
R Geetha Ramani1, Shomona Gracia Jacob.
Abstract
Prediction of secondary site mutations that reinstate mutated p53 to normalcy has been the focus of intense research in the recent past owing to the fact that p53 mutants have been implicated in more than half of all human cancers and restoration of p53 causes tumor regression. However laboratory investigations are more often laborious and resource intensive but computational techniques could well surmount these drawbacks. In view of this, we formulated a novel approach utilizing computational techniques to predict the transcriptional activity of multiple site (one-site to five-site) p53 mutants. The optimal MCC obtained by the proposed approach on prediction of one-site, two-site, three-site, four-site and five-site mutants were 0.775,0.341,0.784,0.916 and 0.655 respectively, the highest reported thus far in literature. We have also demonstrated that 2D and 3D features generate higher prediction accuracy of p53 activity and our findings revealed the optimal results for prediction of p53 status, reported till date. We believe detection of the secondary site mutations that suppress tumor growth may facilitate better understanding of the relationship between p53 structure and function and further knowledge on the molecular mechanisms and biological activity of p53, a targeted source for cancer therapy. We expect that our prediction methods and reported results may provide useful insights on p53 functional mechanisms and generate more avenues for utilizing computational techniques in biological data analysis.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23468845 PMCID: PMC3572112 DOI: 10.1371/journal.pone.0055401
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Site-Specific P53 Mutant Records.
|
|
|
|
|
|
| 1. | 1 | 8 | 54 | 62 |
| 2. | 2 | 57 | 16319 | 16376 |
| 3. | 3 | 63 | 49 | 112 |
| 4. | 4 | 7 | 24 | 31 |
| 5. | 5 | 6 | 2 | 8 |
| Training records | 16589 | |||
Figure 1Novel Computational Approach to Predict Site-Specific P53 Mutant Transcriptional Activity.
Performance of Attribute Evaluator Algorithms on Site-Wise P53 Mutants Transcriptional Activity.
|
|
|
| |||
|
|
|
|
| ||
| 1 | One | 11 | 19 | 19 | 19 |
| 2 | Two | 52 | 50 | 40 | 40 |
| 3 | Three | 35 | 417 | 417 | 417 |
| 4 | Four | 16 | 73 | 73 | 73 |
| 5 | Five | 154 | 154 | 154 | 154 |
Figure 2The IFS Curves for one-site, two-site, three-site, and four-site p53 mutants.
In the IFS curve, the x-axis is the number of features used for classification, and the y-axis is the Mathew’s correlation coefficients (MCC). (A) The IFS curve for one-site p53 mutants. The peak of MCC is 0.775 with 7 features. The top 7 features derived by the CFS Subset Evaluator approach form the optimal feature set for one-site p53 mutants. (B) The IFS curve for two-site p53 mutants. The peak of MCC is 0.341 with 52 features. The top 52 features derived by the CFS Subset Evaluator approach form the optimal feature set for two-site p53 mutants. (C) The IFS curve for three-site p53 mutants. The peak of MCC is 0.784 with 30 features. The top 30 features derived from the CFS Subset Evaluator approach form the optimal feature set for three-site p53 mutants. (D) The IFS curve for four-site p53 mutants. The peak of MCC is 0.916 with 15 features. The top 15 features derived from the CFS Subset Evaluator approach form the optimal feature set for four-site p53 mutants.
Optimal Performance of Novel Predictor Methods on Site-Wise P53 Mutants Transcriptional Activity.
|
|
|
|
|
|
|
|
|
| Site -1 | Independent Predictor | CFS+ADBS | 7 | 0.775 | 95.2 | 0.952 | 0.78 |
| Site -2 | Imbalanced Predictor | CFS+Random Committee | 52 | 0.341 | 99.2 | 0.992 | 0.178 |
| Site -3 | Balanced Predictor | CFS+Bayes Network | 30 | 0.784 | 89.3 | 0.893 | 0.894 |
| Site -4 | Balanced Predictor | CFS+Bayes Network | 15 | 0.916 | 96.8 | 0.968 | 0.991 |
| Site -5 | Imbalanced Predictor | CFS+RCRT | 1–154 | 0.655 | 87.5 | 0.875 | 0.625 |
Performance Comparison of Site-1 P53 Mutants Transcriptional Activity.
| S.No | Attribute Evaluator | Prediction techniques | Features |
|
|
|
|
| 1 |
|
|
|
|
|
| 0.773 |
| Bayesian Network Learning | -0.087 | 82.3 | 0.823 | 0.122 | |||
| Random Committee | 0.416 | 88.7 | 0.887 | 0.451 | |||
| 2 |
|
|
|
|
|
| 0.671 |
| Bayesian Network Learning | -0.087 | 82.3 | 0.823 | 0.122 | |||
| Random Committee | 0 | 87.1 | 0.871 | 0.129 | |||
|
|
|
|
|
|
|
| 0.671 |
| Bayesian Network Learning | -0.087 | 82.3 | 0.823 | 0.122 | |||
| Random Committee | 0.333 | 88.7 | 0.887 | 0.238 | |||
|
|
|
|
|
|
|
| 0.671 |
| Bayesian Network Learning | -0.087 | 82.3 | 0.823 | 0.122 | |||
| Random Committee | 0.333 | 88.7 | 0.887 | 0.238 |
Performance Comparison of Site-2 P53 Mutants Transcriptional Activity.
| S.No | Attribute Evaluator | Prediction techniques | Features |
|
|
|
|
| 1 |
| Adaboost (Decision Stump) | 52 | 0 | 99.7 | 0.997 | 0.003 |
| Bayesian Network Learning | .162 | 97.2 | 0.972 | 0.475 | |||
|
|
|
|
| 0.178 | |||
| 2 | Information Gain | Adaboost (Decision Stump) | 50 | 0 | 99.6 | 0.996 | 0.003 |
| Bayesian Network Learning | -0.001 | 96.5 | .965 | 0.003 | |||
| Random Committee | 0 | 99.7 | 0.997 | 0.003 | |||
| 3 | Gain Ratio | Adaboost (Decision Stump) | 40 | 0 | 99.7 | 0.997 | 0.003 |
| Bayesian Network Learning | 0.13 | 98.8 | .989 | 0.213 | |||
| Random Committee | .146 | 99.6 | .996 | 0.073 | |||
| 4 | Symmetric Uncertainty | Adaboost (Decision Stump) | 40 | 0 | 99.7 | 0.997 | 0.003 |
| Bayesian Network Learning | .132 | 99.8 | .998 | 0.231 | |||
| Random Committee | .159 | 99.6 | .996 | 0.073 |
Performance Comparison of Site-3 P53 Mutants Transcriptional Activity.
| S.No | Attribute Evaluator | Prediction techniques | Features |
|
|
|
|
| 1 |
| Adaboost (Decision Stump) | 35 | 0.498 | 75 | 0.75 | 0.751 |
|
|
|
|
| 0.866 | |||
| Random Committee | 0.57 | 78.6 | 0.786 | 0.788 | |||
| 2 | Information Gain | Adaboost (Decision Stump) | 417 | 0.451 | 73.2 | 0.732 | 0.705 |
| Bayesian Network Learning | 0.358 | 67 | 0.67 | 0.689 | |||
| Random Committee | 0.311 | 66.1 | 0.661 | 0.65 | |||
| 3 | Gain Ratio | Adaboost (Decision Stump) | 417 | 0.469 | 74.1 | 0.741 | 0.717 |
| Bayesian Network Learning | 0.358 | 67 | 0.67 | 0.689 | |||
| Random Committee | 0.311 | 66.5 | 0.665 | 0.65 | |||
| 4 | Symmetric Uncertainty | Adaboost (Decision Stump) | 417 | 0.469 | 74.1 | 0.741 | 0.717 |
| Bayesian Network Learning | 0.358 | 67 | 0.67 | 0.689 | |||
| Random Committee | 0.367 | 68.8 | 0.688 | 0.68 |
Performance Comparison of Site-4 P53 Mutants Transcriptional Activity.
| S.No | Attribute Evaluator | Prediction techniques | Features |
|
|
|
|
| 1 |
| Adaboost (Decision Stump) | 16 | 0.812 | 93.5 | 0.935 | 0.779 |
|
|
|
|
| 0.889 | |||
| Random Committee | 0.392 | 80.6 | 0.806 | 0.539 | |||
| 2 | Information Gain | Adaboost (Decision Stump) | 73 | 0.812 | 93.5 | .935 | 0.779 |
| Bayesian Network Learning | 0.321 | .774 | .774 | 0.529 | |||
| Random Committee | 0.354 | 80.6 | 0.806 | 0.438 | |||
| 3 | Gain Ratio | Adaboost (Decision Stump) | 73 | 0.812 | 93.5 | 0.935 | 0.779 |
| Bayesian Network Learning | 0.321 | .774 | .774 | 0.529 | |||
| Random Committee | 0.483 | 83.9 | .839 | 0.548 | |||
| 4 | Symmetric Uncertainty | Adaboost (Decision Stump) | 73 | 0.812 | 93.5 | 0.935 | 0.779 |
| Bayesian Network Learning | 0.321 | .774 | .774 | 0.529 | |||
| Random Committee | 0.517 | 83.9 | 0.839 | 0.649 |
Performance Comparison of Site-5 P53 Mutants Transcriptional Activity.
| S.No | Attribute Evaluator | Prediction techniques | Features |
|
|
|
|
| 1 |
|
| 154 |
|
|
| 0.625 |
| Bayesian Network Learning | 0 | 75 | 0.75 | 0.25 | |||
|
|
|
|
| 0.625 | |||
| 2 | Information Gain | Adaboost (Decision Stump) | 154 |
|
|
| 1 |
| Bayesian Network Learning | 0 | 75 | 0.75 | 0.25 | |||
| Random Committee | 0.655 | 87.5 | 0.875 | 0.625 | |||
| 3 | Gain Ratio | Adaboost (Decision Stump) | 154 |
|
|
| 1 |
| Bayesian Network Learning | 0 | 75 | 0.75 | 0.25 | |||
| Random Committee | 0.655 | 87.5 | 0.875 | 0.625 | |||
| 4 | Symmetric Uncertainty | Adaboost (Decision Stump) | 154 |
|
|
| 1 |
| Bayesian Network Learning | 0 | 75 | 0.75 | 0.25 | |||
| Random Committee | 0.655 | 87.5 | 0.875 | 0.625 |
Figure 3Site-wise Feature Relevance Graph.
The sites are represented in purple as solid diamonds. The optimal features for each site are represented by directed edges from the site to the feature. Site-specific features are displayed in different colours.
Comparison to Previous Work on P53 Mutants Transcriptional Activity Prediction.
|
|
|
|
| ||
|
|
|
|
| ||
| 1 | One | 8 | 0.678 | 7 | 0.775 |
| 2 | Two | 50 | 0.314 | 52 | 0.341 |
| 3 | Three | 282 | 0.705 | 30 | 0.784 |
| 4 | Four | 25 | 0.907 | 15 | 0.916 |
| 5 | Five | Not Reported | 1 | 0.655 | |