| Literature DB >> 25057121 |
Manuel Giollo, Alberto J M Martin, Ian Walsh, Carlo Ferrari, Silvio C E Tosatto.
Abstract
BACKGROUND: The rapid growth of un-annotated missense variants poses challenges requiring novel strategies for their interpretation. From the thermodynamic point of view, amino acid changes can lead to a change in the internal energy of a protein and induce structural rearrangements. This is of great relevance for the study of diseases and protein design, justifying the development of prediction methods for variant-induced stability changes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25057121 PMCID: PMC4083412 DOI: 10.1186/1471-2164-15-S4-S7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the 10 pairs of mesophilic and thermophilic proteins used in the case study, their similarity and the environmental conditions (pH and Temperature) used to perform the test [31].
| Mesophile | Extremophile | Alignment | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PDB code | Species | pH | T (°C) | PDB code | Species | pH | T (°C) | Identity | Gaps | |
| Adenylate kinase | 1AK2A | 7 | 38 | 1ZIPA | 7 | 65 | 90/223 (40.4%) | 9 | ||
| Phosphoglycerate Kinase | 3PGKA | 6,6 | 30 | 1PHPA | 7 | 65 | 210/420 (50.0%) | 31 | ||
| Reductase | 1LVLA | 7 | 30 | 1EBDA | 7 | 65 | 192/466 (41.2%) | 19 | ||
| Lactate Dehydrogenase | 1LDMA | 7,9 | 11 | 1LDNA | 7 | 65 | 111/335 (33.1%) | 25 | ||
| TATA box binding protein | 1VOKA | 7 | 20 | 1PCZA | 7 | 98 | 75/198 (37.9%) | 22 | ||
| Subtilisin | 1ST3A | 7 | 20 | 1THMA | 6 | 60 | 132/282 (46.8%) | 16 | ||
| Carboxy Peptidase | 2CTCA | 7 | 38 | 1OBRA | 6 | 60 | 93/346 (26.9%) | 62 | ||
| Glyceraldehyde-3-phosphate | 1GADO | 7 | 37 | 1GD1O | 7 | 65 | 194/335 (57.9%) | 6 | ||
| Neutral Protease | 1NPCA | 7 | 30 | 1THLA | 7 | 80 | 231/318 (72.6%) | 2 | ||
| Phosphofructo Kinase | 2PFKD | 7 | 37 | 3PFKA | 7 | 65 | 172/320 (53.8%) | 20 | ||
Regression performance comparison of NeEMO with other methods on the ten-fold cross-validation test.
|
|
|
| |||||
|---|---|---|---|---|---|---|---|
| Auto-Mute | 1,144 | 0.640 | 0.635 | 0.456 | |||
| I-Mutant 2.0 | 2,171 | 0.642 | 0.623 | 0.467 | |||
| I-Mutant 3.0 | 2,112 | 0.620 | 0.623 | 0.434 | |||
| MuPro | 2,398 | 0.606 | 0.571 | 0.416 | |||
| PoPMuSiC 2.0 | 2,399 | 0.623 | 0.617 | 0.445 | |||
The evaluation is performed only on mutations where both methods were able to make a prediction. In addition, the other methods are likely to have used the test samples in their training.
Correlation measure performance of different NeEMO versions on the IM_631 dataset.
| r | ρ | τ | ||
|---|---|---|---|---|
| All | NeEMO | 0.666 | 0.644 | 0.465 |
| NeEMO_NOCC | 0.637 | 0.626 | 0.447 | |
| NeEMO_NORING | 0.618 | 0.603 | 0.430 | |
| Helix | NeEMO | 0.645 | 0.612 | 0.436 |
| NeEMO_NOCC | 0.613 | 0.607 | 0.430 | |
| NeEMO_NORING | 0.585 | 0.600 | 0.424 | |
| Beta | NeEMO | 0.716 | 0.687 | 0.506 |
| strand | NeEMO_NOCC | 0.694 | 0.672 | 0.490 |
| NeEMO_NORING | 0.690 | 0.662 | 0.482 | |
| Coil | NeEMO | 0.581 | 0.588 | 0.418 |
| NeEMO_NOCC | 0.546 | 0.560 | 0.391 | |
| NeEMO_NORING | 0.502 | 0.501 | 0.350 | |
| Exposed | NeEMO | 0.603 | 0.551 | 0.391 |
| NeEMO_NOCC | 0.553 | 0.516 | 0.360 | |
| NeEMO_NORING | 0.522 | 0.498 | 0.350 | |
| Buried | NeEMO | 0.638 | 0.614 | 0.441 |
| NeEMO_NOCC | 0.612 | 0.593 | 0.422 | |
| NeEMO_NORING | 0.591 | 0.559 | 0.397 | |
NeEMO uses all input features, NeEMO_NOCC does not use node centralities, NeEMO_NORING does not use any RIN feature. Comparisons are shown for the entire dataset, on each of the 3 different secondary structure states and occurring in amino acids exposed to the solvent (e, RSA > 25%) or buried (b, RSA <= 25%).
Performance of different methods on the independent S350 dataset.
| All mutations | Common mutations | Common mutations -10% | |||||||
|---|---|---|---|---|---|---|---|---|---|
| n | σ | n | σ | n | σ | ||||
| Automute | 315 | 0.46 | 1.42 | 299 | 0.44 | 1.45 | 264 | 0.60 | 1.06 |
| CUPSAT | 346 | 0.37 | 1.46 | 299 | 0.37 | 1.50 | 264 | 0.50 | 1.10 |
| Dmutant | 350 | 0.48 | 1.38 | 299 | 0.46 | 1.44 | 264 | 0.63 | 1.05 |
| Eris | 334 | 0.35 | 1.49 | 299 | 0.35 | 1.52 | 264 | 0.55 | 1.07 |
| I-Mutant 2.0 | 346 | 0.29 | 1.50 | 299 | 0.27 | 1.56 | 264 | 0.39 | 1.16 |
| I-Mutant 3.0 | 338 | 0.53 | 1.35 | 299 | 0.53 | 1.37 | 264 | 0.71 | 1.00 |
| MuPro | 350 | 0.41 | 1.43 | 299 | 0.41 | 1.48 | 264 | 0.49 | 1.12 |
| PoPMuSiC 1.0 | 350 | 0.62 | 1.23 | 299 | 0.63 | 1.26 | 264 | 0.72 | 0.93 |
| PoPMuSiC 2.0 | 350 | 0.67 | 1.16 | 299 | 0.67 | 1.21 | 264 | ||
| NeEMO | 350 | 299 | 264 | 0.79 | 0.88 | ||||
The comparison is reported (a) for all the mutations in the dataset, (b) the maximal subset of mutations where each tool is able to make a prediction and (c) the maximal subset where 10% of outliers are removed. The number of mutations (n) is shown together with the Pearson correlation (r) and distance from the real ΔΔG values (σ). The best prediction in each column is shown in bold.
NeEMO predictions on the mesophilic and thermophilic mutations.
| T → M | M → T | ||||||
|---|---|---|---|---|---|---|---|
| Mesophile | Thermophile | Increase | Decrease | Energy | Increase | Decrease | Energy |
| 1AK2A | 1ZIPA | 28 | 56.66 | 50 | 63 | 38.92 | |
| 3PGKA | 1PHPA | 66 | 55.22 | 42 | -42.47 | ||
| 1LVLA | 1EBDA | 102 | 50.85 | 71 | -63.93 | ||
| 1LDMA | 1LDNA | 46 | 94.32 | 81 | -1.99 | ||
| 1VOKA | 1PCZA | 18 | 78.02 | 42 | 53 | 11.63 | |
| 1ST3A | 1THMA | 35 | 75.91 | 48 | 77 | 34.62 | |
| 2CTCA | 1OBRA | 73 | 51.41 | 81 | 9.46 | ||
| 1GADO | 1GD1O | 50 | 23.93 | 54 | -14.69 | ||
| 1NPCA | 1THLA | 20 | 33.95 | 32 | -2.37 | ||
| 2PFKD | 3PFKA | 60 | 20.48 | 50 | 79 | 41.91 | |
| 498 | 633 | ||||||
Amount of reciprocal variants in mesophilc and thermophilic predicted to increase or decrease the stability of the 10 proteins, and their cumulative energy. Cases where predictions support our hypothesis of symmetric ΔΔG behavior of variants are highlighted in bold.