| Literature DB >> 19558493 |
Jonas Carlsson1, Thierry Soussi, Bengt Persson.
Abstract
A method has been developed to predict the effects of mutations in the p53 cancer suppressor gene. The new method uses novel parameters combined with previously established parameters. The most important parameter is the stability measure of the mutated structure calculated using molecular modelling. For each mutant, a severity score is reported, which can be used for classification into deleterious and nondeleterious. Both structural features and sequence properties are taken into account. The method has a prediction accuracy of 77% on all mutants and 88% on breast cancer mutations affecting WAF1 promoter binding. When compared with earlier methods, using the same dataset, our method clearly performs better. As a result of the severity score calculated for every mutant, valuable knowledge can be gained regarding p53, a protein that is believed to be involved in over 50% of all human cancers.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19558493 PMCID: PMC2730554 DOI: 10.1111/j.1742-4658.2009.07124.x
Source DB: PubMed Journal: FEBS J ISSN: 1742-464X Impact factor: 5.542
Fig. 1Comparison of calculated energy (A), positional conservation (B) and transactivating activity (C) of p53 mutants. The structure is based on the 1tsr crystal structure of p53. In (A), p53 is coloured according to the calculated energy for mutants at each position. Red indicates high energy and blue low energy. In (B), the colours illustrate conservation, where red corresponds to highly conserved and blue to nonconserved residues. In (C), the positions are colour coded from red to blue, where red indicates most severe and blue wild-type activity.
Description of the 12 parameters used to predict the severity of p53 mutants. Asterisks denote parameters calculated using icm.
| Parameter | Explanation |
|---|---|
| Accessibility* | Percentage of amino acid residues buried inside the protein when a sphere with the size of a water molecule van der Waals’ radius is rolled over the protein surface |
| Similarity of the surroundings* | Measure of the percentage of amino acid residues inside a sphere of 5 Å that have the same polarity or charge as the wild-type |
| DNA/zinc | If the amino acid residue is, according to Martin |
| Pocket/cavity* | A cavity is a volume inside the protein that is not occupied by any atom from the protein and not accessible from the outside. A pocket is a cleft into the protein with volume and depth above default values in |
| Calculated energy* | The calculated energy of the protein after residue exchange |
| Average calculated energy* | The average calculated energy of all 19 possible residue exchanges at a given position |
| Secondary structure* | If the exchanged residue is located in a regular secondary structure element, determined by the DSSP algorithm [ |
| Hydrophobicity difference | Change in hydrophobicity value according to the Kyte and Doolittle scale [ |
| Size difference | Change in size between native and new amino acid residue as defined in Protscale [ |
| Amino acid similarity | The amino acid similarity between native and mutated residues, as classified in C |
| Polarity change | If the mutant causes polarity or charge changes. Change equals unity and no change equals zero |
| Conservation | Percentage conservation at each position using p53 homologues of the vertebrate subphylum. The species included are listed in |
Prediction accuracy (%) for each of the six test runs on p53 cancer mutants, where each run was trained on five-sixths of the mutants and tested on the remaining one-sixth.
| Test run | Total | Class 1 (< 25% activity) | Class 2 (> 25% activity) |
|---|---|---|---|
| 1 | 81 | 74 | 85 |
| 2 | 76 | 73 | 77 |
| 3 | 79 | 79 | 79 |
| 4 | 75 | 70 | 78 |
| 5 | 76 | 70 | 82 |
| 6 | 74 | 77 | 72 |
| Total | 77 | 74 | 79 |
Fig. 2ROC curve. True positive rate (TPR) and false positive rate (FPR) depending on the cut-off value used to discriminate between the two severity classes in the test data. The broken line represents prediction on test data and the full line on training data. The straight line represents a random classification and the cross indicates the cut-off value used in PREDMUT.
Effect of cut-off value on the prediction accuracy. The prediction accuracy, specificity, sensitivity, number of mutants classified and MCC values on training data using different activity thresholds to delineate between severe and nonsevere mutants.
| Class 1 | Class 2 | |||||||
|---|---|---|---|---|---|---|---|---|
| Activity cut-off value (%) | Prediction accuracy (%) | Specificity (%) | Sensitivity (%) | Number of mutants | Specificity (%) | Sensitivity (%) | Number of mutants | MCC |
| 1 | 78.9 | 73.1 | 31.5 | 130 | 79.7 | 95.9 | 1018 | 0.38 |
| 2 | 78.4 | 76.1 | 35.9 | 155 | 78.8 | 95.5 | 993 | 0.42 |
| 3 | 76.1 | 78.4 | 36.2 | 172 | 75.7 | 95.2 | 976 | 0.41 |
| 5 | 73.9 | 81.5 | 39.1 | 206 | 72.2 | 94.7 | 942 | 0.43 |
| 10 | 72.3 | 83.4 | 51.7 | 336 | 67.7 | 90.8 | 812 | 0.47 |
| 15 | 78.1 | 79.6 | 75.1 | 541 | 76.5 | 80.8 | 607 | 0.56 |
| 20 | 78.4 | 79.3 | 80.8 | 642 | 77.5 | 75.9 | 524 | 0.57 |
| 25 | 78.7 | 81.0 | 82.3 | 669 | 75.6 | 74.1 | 479 | 0.57 |
| 30 | 77.8 | 78.0 | 84.6 | 706 | 77.4 | 68.8 | 442 | 0.54 |
| 40 | 76.9 | 75.2 | 88.8 | 773 | 80.5 | 61.2 | 375 | 0.53 |
Fig. 3Activity versus frequency. The WAF1 activity of p53 mutations is plotted against the number of times they are found in human cancer patients. The most frequent mutations, the hotspot mutations, are not included. However, they all have activity below 25%.
Parameter weights calculated by PREDMUT and PLS for the WAF1 promoter, together with parameter classification. General property parameters are completely protein nonspecific, position-specific parameters are dependent on the position in the protein and mutant-specific parameters depend on the position and type of amino acid residue substitution.
| Parameter | Weight PREDMUT | Weight PLS | Class |
|---|---|---|---|
| Accessibility | 22 | 20 | Position specific |
| Conservation | 16 | 24 | Position specific |
| Average calculated energy | 13 | 14 | Position specific |
| Size change | 12 | 6 | General property |
| Calculated energy | 11 | 8 | Mutant specific |
| Similar amino acids | 8 | 9 | General property |
| Hydrophobicity difference | −7 | 3 | General property |
| Secondary structure | −4 | −1 | Position specific |
| Polarity change | −2 | 0 | General property |
| Pocket/cavity | 2 | −6 | Position specific |
| Surrounding amino acids | −1 | −1 | Position specific |
Parameter weights for all promoters. (A) Average and individual weights for all parameters for each promoter. Values are sorted in descending order according to the absolute value of the average weight. (B) Average and individual weights for the grouped parameters for each promoter. Values are sorted in descending order according to the absolute value of the average weight. Parameters that are similar are grouped together. Energy = Energy of mutant + Average energy of mutant. General properties = Similar amino acids + Size change + Hydrophobicity difference + Polarity change. Other = Surrounding amino acids + Two-dimensional structure + Pocket/cavity.
| Parameter | WAF1 | MDM2 | BAX | 14-3-3-σ | AIP | GAD45 | NOXA | p53R2 | Average |
|---|---|---|---|---|---|---|---|---|---|
| Conservation | 16 | 24 | 25 | 30 | 27 | 21 | 15 | 21 | 22 |
| Accessibility | 22 | 15 | 7 | 14 | 16 | 27 | 31 | 43 | 22 |
| Average calculated energy | 13 | 10 | 28 | 14 | 24 | 25 | 12 | 11 | 17 |
| Calculated energy | 11 | 10 | 14 | 18 | 8 | 8 | 23 | 0 | 11 |
| Similar amino acids | 8 | 7 | 6 | 9 | 4 | 3 | 6 | 8 | 7 |
| Size change | 12 | 15 | −1 | −5 | 9 | −3 | 5 | 1 | 6 |
| Hydrophobicity difference | −7 | −10 | 2 | 7 | 0 | 3 | 0 | 7 | 4 |
| Surrounding amino acids | −1 | 5 | 10 | −1 | 6 | −2 | −6 | −1 | 4 |
| Two-dimensional structure | −4 | 2 | −2 | −1 | 2 | 3 | 2 | −2 | 2 |
| Polarity change | −2 | 0 | 2 | 2 | 4 | 3 | 0 | −4 | 2 |
| Pocket/cavity | 2 | 2 | 2 | 0 | 1 | 1 | 0 | 2 | 1 |
| Energy | 24 | 20 | 42 | 32 | 32 | 33 | 35 | 11 | 29 |
| Conservation | 16 | 24 | 25 | 30 | 27 | 21 | 15 | 21 | 22 |
| Accessibility | 22 | 15 | 7 | 14 | 16 | 27 | 31 | 43 | 22 |
| General properties | 30 | 32 | 11 | 23 | 16 | 13 | 11 | 20 | 19 |
| Other | 7 | 10 | 15 | 2 | 9 | 6 | 8 | 5 | 8 |
Fig. 4Energy diagram. Cumulative frequency of severe and nonsevere mutants, respectively, plotted against the normalized average calculated energy for all mutants.
Promoter prediction results (%) for eight p53-related promoters.
| Promoter | Training set | Test set |
|---|---|---|
| WAF1 | 79 | 77 |
| MDM2 | 76 | 72 |
| BAX | 77 | 74 |
| 14-3-3-σ | 77 | 74 |
| AIP | 78 | 75 |
| GAD45 | 80 | 74 |
| NOXA | 80 | 75 |
| p53R2 | 80 | 75 |
Cross-correlation between parameters. Parameters that show the highest pairwise correlation coefficients are shown. All other correlation coefficients are below 0.3, with the majority below 0.1.
| Parameter | Calculated energy | ||
|---|---|---|---|
| Average calculated energy | 0.48 | ||
| Parameter | Conservation | Two-dimensional structure | |
| Accessibility | 0.45 | 0.46 | |
| Parameter | Hydrophobicity difference | Similarity change | Size change |
| Polarity change | 0.43 | 0.47 | 0.25 |
| Hydrophobicity difference | 0.63 | 0.34 | |
| Similarity change | 0.43 |
Prediction accuracy (%) for the best of the methods tested and their respective MCC values.
| Prediction method | Total prediction accuracy | Class 1 (< 25% activity) | Class 2 (> 25% activity) | MCC |
|---|---|---|---|---|
| SVM (p = 5) | 76.7 | 82.5 | 68.6 | 0.52 |
| PLS | 73.3 | 86.7 | 63.0 | 0.50 |
| PREDMUT | 76.6 | 73.7 | 78.7 | 0.52 |
Prediction accuracy (%) depending on the size of the safety margin (%) used around the cut-off value. Mutants with a severity score inside the safety margin were classified as unknown.
| Safety margin | Total prediction accuracy | Class 1 (< 25% activity) | Class 2 (> 25% activity) | Unknown | MCC |
|---|---|---|---|---|---|
| 0 | 76.6 | 78.7 | 73.6 | 0 | 0.52 |
| 5 | 78.3 | 80.4 | 75.3 | 11.4 | 0.55 |
| 10 | 80.2 | 83.4 | 75.8 | 23.3 | 0.59 |
| 15 | 82.6 | 85.6 | 78.3 | 34.9 | 0.64 |
| 20 | 85.5 | 89.1 | 80.5 | 46.0 | 0.70 |
| 25 | 87.6 | 91.1 | 82.6 | 54.9 | 0.74 |
Mutants with very different behaviour depending on which promoter is measured. The top half shows mutants in which the activity for the p53R2 and NOXA promoters is similar to that of the wild-type, whereas the activity for all the other promoters measured is almost zero. The bottom half shows mutants that affect WAF1 and MDM2 more severely than the other promoters.
| Mutant | Promoter | Activity (%) | Activity for the other promoters (%) |
|---|---|---|---|
| M243T | p53R2/NOXA | 82–128 | 0–27 |
| G244D | p53R2 | 131 | 0–2 |
| M246I | p53R2 | 143 | 0–2 |
| M246L | p53R2 | 97 | 0–1 |
| M246V | p53R2 | 56 | 0–1 |
| C275S | p53R2 | 223 | 0–1 |
| Q192R | WAF1 | 32 | 67–135 |
| D208E | WAF1/MDM2 | 2–12 | 36–96 |
| T256A | WAF1 | 11 | 40–86 |
| N263D | WAF1/MDM2 | 1–18 | 54–108 |
| V272A | WAF1/MDM2 | 1–3 | 32–49 |
| A276T | WAF1/MDM2 | 2–20 | 53–221 |
| R283C | MDM2 | 0 | 25–153 |