| Literature DB >> 16845101 |
M Michael Gromiha1, A Mary Thangakani, S Selvaraj.
Abstract
We have developed a web server, FOLD-RATE, for predicting the folding rates of proteins from their amino acid sequences. The relationship between amino acid properties and protein folding rates has been systematically analyzed and a statistical method based on linear regression technique has been proposed for predicting the folding rate of proteins. We found that the classification of proteins into different structural classes shows an excellent correlation between amino acid properties and folding rates of two and three-state proteins. Consequently, different regression equations have been developed for proteins belonging to all-alpha, all-beta and mixed class. We observed an excellent agreement between predicted and experimentally observed folding rates of proteins; the correlation coefficients are, 0.99, 0.97 and 0.90, respectively, for all-alpha, all-beta and mixed class proteins. The prediction server is freely available at http://psfs.cbrc.jp/fold-rate/.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16845101 PMCID: PMC1538837 DOI: 10.1093/nar/gkl043
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Predicted folding rates in a set of 77 two and three-state proteins
| PDB code | Experimentala | ln( | Deviation |
|---|---|---|---|
| All-α proteins | |||
| 1LMB | 8.50 | 8.45 | 0.05 |
| 2ABD | 6.55 | 6.33 | 0.22 |
| 1IMQ | 7.31 | 7.20 | 0.11 |
| 2PDD | 9.80 | 9.54 | 0.26 |
| 1HRC | 8.76 | 8.66 | 0.10 |
| 1YCC | 9.62 | 9.74 | −0.12 |
| 256B | 12.20 | 12.42 | −0.22 |
| 1VII | 11.52 | 11.47 | 0.05 |
| 1BDD | 11.75 | 11.72 | 0.03 |
| 1l8W | 1.61 | 1.61 | 0.00 |
| 1ENH | 10.53 | 10.49 | 0.04 |
| 1EBD | 9.68 | 9.90 | −0.22 |
| | |||
| | |||
| | |||
| | |||
| All-β proteins | |||
| 1NYF | 4.54 | 4.34 | 0.20 |
| 1PKS | −1.05 | −0.62 | −0.43 |
| 1SHG | 1.41 | 1.57 | −0.16 |
| 1SRL | 4.04 | 4.09 | −0.05 |
| 1FNF-9 | −0.91 | −0.96 | 0.05 |
| 1TEN | 1.06 | 1.22 | −0.16 |
| 1WIT | 0.41 | 0.18 | 0.23 |
| 1CSP | 6.98 | 6.75 | 0.23 |
| 1MJC | 5.24 | 5.70 | −0.46 |
| 2AIT | 4.20 | 4.05 | 0.15 |
| 1PNJ | −1.10 | −1.77 | 0.67 |
| 1SHF | 4.50 | 4.78 | −0.28 |
| 1C9O | 7.20 | 7.24 | −0.04 |
| 1G6P | 6.30 | 6.11 | 0.19 |
| 1LOP | 6.60 | 6.57 | 0.03 |
| 1PIN | 9.44 | 9.63 | −0.19 |
| 1C8C | 6.91 | 7.04 | −0.13 |
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| Mixed-class proteins | |||
| 1APS | −1.48 | −1.18 | −0.30 |
| 1HDN | 2.70 | 2.45 | 0.25 |
| 1URN | 5.73 | 5.36 | 0.37 |
| 2HQI | 0.18 | 0.47 | −0.29 |
| 1PBA | 6.80 | 6.92 | −0.12 |
| 1UBQ | 7.33 | 6.63 | 0.70 |
| 2PTL | 4.10 | 3.77 | 0.33 |
| 1FKB | 1.46 | 1.23 | 0.23 |
| 1COA | 3.87 | 3.73 | 0.14 |
| 1DIV | 6.58 | 6.87 | −0.29 |
| 2VIK | 6.80 | 6.39 | 0.41 |
| 1CIS | 3.87 | 3.30 | 0.57 |
| 1PCA | 6.80 | 6.68 | 0.12 |
| 1HZ6 | 4.10 | 4.64 | −0.54 |
| 1PGB | 6.00 | 5.75 | 0.25 |
| 2CI2 | 3.90 | 3.88 | 0.02 |
| 1AYE | 6.80 | 7.62 | −0.82 |
| 1RIS | 5.90 | 6.08 | −0.18 |
| 1POH | 2.70 | 2.45 | 0.25 |
| 1BRS | 3.40 | 3.20 | 0.20 |
| 1UBQ | 5.90 | 6.63 | −0.73 |
| 2ACY | 0.92 | 1.07 | −0.15 |
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
| | |||
aExperimental folding rates are obtained from Galzitskaya et al. (13), Ivankov and Finkelstein (18) and Gromiha (19). The three-state proteins are shown in italics.
Figure 1Relationship between experimental and predicted ln(kf) values using multiple regression model with jack-knife test in a set of 77 two and three-state proteins.
Figure 2Web based prediction of protein folding rates. (a) First page showing the input format (amino acid sequence in single letter code; an example is shown for λ repressor (1LMB) and structural class information (all-α). (b) The query sequence, amino acid composition, type of the protein and predicted folding rate are shown. The ln(kf) value is predicted to be 8.44/s.