| Literature DB >> 29722803 |
Dinara R Usmanova1, Natalya S Bogatyreva2,3,4, Joan Ariño Bernad5, Aleksandra A Eremina6, Anastasiya A Gorshkova7, German M Kanevskiy8, Lyubov R Lonishin9, Alexander V Meister10, Alisa G Yakupova7, Fyodor A Kondrashov11, Dmitry N Ivankov4,11.
Abstract
Motivation: Computational prediction of the effect of mutations on protein stability is used by researchers in many fields. The utility of the prediction methods is affected by their accuracy and bias. Bias, a systematic shift of the predicted change of stability, has been noted as an issue for several methods, but has not been investigated systematically. Presence of the bias may lead to misleading results especially when exploring the effects of combination of different mutations.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29722803 PMCID: PMC6198859 DOI: 10.1093/bioinformatics/bty340
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Design of the protocol. Two example protein structures, A and B, differing by one amino acid residue are shown on the left. If we measure free energy change for the forward and reverse mutations, their sum must be zero: ΔΔGAB+ΔΔGBA=0 (arrows on the left). For predicted destabilizations (shown on the right), their sum may not be zero due to errors. Here, the case is given, when ΔΔGAB+ΔΔGBA>0
Bias for single substitutions
| Program | Bias, kcal/mol | Binary fraction of errors | |
|---|---|---|---|
| FoldX | 0.74 ± 0.05 | −0.15 (10−11) | 0.35 |
| Eris | 1.25 ± 0.11 | −0.39 (2 × 10−49) | 0.27 |
| Rosetta | 2.08 ± 0.12 | −0.06 (0.04) | 0.51 |
| I-Mutant | 0.80 ± 0.01 | −0.13 (3 × 10−8) | 0.74 |
Note: Bias is given for an individual substitution, i.e. bias = (ΔΔGAB + ΔΔGBA)/2, with the standard error of mean. r, Pearson correlation coefficient with the associated P-value. Binary fraction of errors is the fraction of errors in binary classification. A pair was considered as correctly classified if the signs of forward and reverse change of stability were opposite.
Fig. 2.The bias for single substitutions for FoldX, Eris, Rosetta and I-Mutant. (a–d) The relationship between predicted changes of stability ΔΔGAB and ΔΔGBA for the forward and reverse mutations, where the structures A and B differ by a single substitution. The ‘ideal’ relationship ΔΔGAB+ΔΔGBA=0 is shown as a solid line. Because of symmetry of the protocol, every pair of structures A and B is plotted as (A; B) and (B; A), so the plot is symmetric relative to the y=x line. ‘prR’ and ‘p’ are the Pearson correlation coefficient and the associated P-value. (e–h) The histograms of the sum of two changes of stability ΔΔGAB+ΔΔGBA for the forward and reverse mutations for FoldX, Eris, Rosetta and I-Mutant, respectively
Fig. 3.The bias for multiple mutations for FoldX, Eris and Rosetta. The individual value of the bias depending on the number of amino acid substitutions separating protein variants A and B in pair of structures. The error bars represent 3 standard errors of mean
Fig. 4.The bias for different modifications of the default protocol of FoldX. The error bars represent standard error of mean