| Literature DB >> 27464796 |
Giulia Menichetti1, Piero Fariselli2, Daniel Remondini1.
Abstract
Proteins fold using a two-state or multi-state kinetic mechanisms, but up to now there is not a first-principle model to explain this different behavior. We exploit the network properties of protein structures by introducing novel observables to address the problem of classifying the different types of folding kinetics. These observables display a plain physical meaning, in terms of vibrational modes, possible configurations compatible with the native protein structure, and folding cooperativity. The relevance of these observables is supported by a classification performance up to 90%, even with simple classifiers such as discriminant analysis.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27464796 PMCID: PMC4964642 DOI: 10.1038/srep30367
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1An example of different PCN representations for protein 1A6N, with MS folding kinetics and 151 C residues.
In Panel A the whole PCN is displayed. In Panel B, once calculated the number of diagonals b = 4 needed to break the protein network in more than one component, 3 backbone diagonals were removed. In Panel C the related Long-range Interaction Network (LIN) is shown1114 in which no backbone diagonal is present.
Classification performances of the newly defined observables and Matthews correlation coefficient MCC, based on quadratic discriminant analysis.
| (a) Classification performances | |||||
|---|---|---|---|---|---|
| % | |||||
| 76.6 ± 1.3 | 74.7 ± 1.4 | 70.7 ± 1.8 | 85.2 ± 1.4 | 78.4 ± 1.2 | |
| 76.7 ± 1.4 | 70.7 ± 1.4 | 75.9 ± 2.3 | |||
| 77.6 ± 1.1 | 87.3 ± 1.4 | 76.5 ± 1.9 | |||
| 75.9 ± 1.2 | 84.5 ± 1.3 | ||||
| 80.4 ± 1.8 | |||||
| 0.57 ± 0.02 | 0.53 ± 0.02 | 0.46 ± 0.03 | 0.69 ± 0.03 | 0.60 ± 0.02 | |
| 0.58 ± 0.02 | 0.46 ± 0.03 | 0.57 ± 0.04 | |||
| 0.59 ± 0.02 | 0.74 ± 0.03 | 0.56 ± 0.04 | |||
| 0.52 ± 0.02 | 0.67 ± 0.03 | ||||
| 0.59 ± 0.04 | |||||
The tables show the performances of couples of observables, with the performance of the single observables along the diagonal; the best overall performance is bold-typed. The results presented are the average values of 10-fold cross-validation over 10000 instances and their standard deviation.
Figure 2Scatterplots of two top-ranking classification couples of observables.
Panel A: λ and R0 (classification = 88.3% ± 1.1%, MCC = 0.76 ± 0.02). Panel B: S and R0 (classification = 84.5% ± 1.3%, MCC = 0.67 ± 0.03).
Classification performances of the observables used in literature and Matthews correlation coefficient MCC.
| (a) Classification performances | ||||
|---|---|---|---|---|
| % | 〈 | |||
| 78.2 ± 1.5 | 73.7 ± 2.1 | 78.4 ± 1.7 | ||
| 〈 | 57.3 ± 1.4 | 72.6 ± 2.8 | 62.7 ± 1.9 | |
| 68.4 ± 0.7 | 70.4 ± 2.1 | |||
| 54.4 ± 2.4 | ||||
| 0.54 ± 0.03 | 0.44 ± 0.05 | 0.54 ± 0.04 | ||
| 〈 | 0.17 ± 0.03 | 0.44 ± 0.06 | 0.22 ± 0.04 | |
| 0.36 ± 0.01 | 0.44 ± 0.04 | |||
| 0.14 ± 0.05 | ||||
The tables show the performances of couples of observables, with the performance of the single observables along the diagonal; the best overall performance is bold-typed. The results presented are the average values of 10-fold cross-validation over 10000 instances and their standard deviation.