| Literature DB >> 20038291 |
Ranjit Prasad Bahadur1, Pinak Chakrabarti.
Abstract
BACKGROUND: Setting the rules for the identification of a stable conformation of a protein is of utmost importance for the efficient generation of structures in computer simulation. For structure prediction, a considerable number of possible models are generated from which the best model has to be selected.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20038291 PMCID: PMC2809062 DOI: 10.1186/1472-6807-9-76
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Average values of partner number (
| Residue | <PN> | <ASA> |
|---|---|---|
| Gly | 7.4 (2.2) | 26.6 (24.5) |
| Ala | 8.6 (2.5) | 28.1 (30.9) |
| Ser | 7.9 (2.6) | 39.2 (33.2) |
| Cys | 10.0 (2.3) | 17.1 (21.0) |
| Thr | 8.5 (2.6) | 44.2 (36.0) |
| Asp | 7.9 (2.5) | 58.1 (37.2) |
| Pro | 7.7 (2.6) | 54.2 (39.5) |
| Asn | 8.3 (2.7) | 57.9 (40.8) |
| Val | 10.3 (2.6) | 24.1 (32.0) |
| Glu | 8.4 (2.5) | 73.4 (41.9) |
| Gln | 9.0 (2.7) | 68.6 (43.3) |
| His | 9.7 (2.9) | 53.8 (44.6) |
| Leu | 11.0 (2.7) | 28.8 (38.0) |
| Ile | 11.0 (2.7) | 25.0 (35.2) |
| Met | 11.2 (3.1) | 35.5 (45.8) |
| Lys | 8.4 (2.4) | 95.8 (42.9) |
| Phe | 11.9 (2.9) | 31.0 (39.8) |
| Tyr | 11.5 (3.1) | 45.5 (45.0) |
| Arg | 10.1 (3.1) | 85.5 (53.3) |
| Trp | 12.6 (3.2) | 43.5 (47.6) |
Data taken from [26]. The standard deviations are in parenthesis.
Average values of Rs and Rp in various protein structural classesa
| Number of structures | Rs | Rp | |
|---|---|---|---|
| All-α | 157 | 112 (63) | 30 (20) |
| All-β | 142 | 115 (60) | 31 (19) |
| αβ b | 133 | 149 (70) | 42 (23) |
| Overall | 432 | 143 (91) | 39 (23) |
aAccording to CATH [59]. bIncluding a/β and a+β.
Standard deviations are in parentheses. Normalized (dividing the values obtained from equations (2) and (3) by the number of residues) values are given in italics.
Identification of the native structure from decoys in PROSTAR decoy sets using different scoring functionsa
| Parameters | Misfold | Ifu | Asilomar | Pdberr and sgpa |
|---|---|---|---|---|
| Rs b | 24/24 | 22/43 | 41/41 | 5/5 |
| Rp b | 20/24 | 21/43 | 36/41 | 5/5 |
| Atomic KBPc | 24/24 | 32/43 | 37/41 | 5/5 |
| RAPDFd | 24/24 | 30/43 | 37/41 | 5/5 |
| CDFd | 19/24 | 21/43 | 35/41 | 5/5 |
| Residue contact potentiale | 24/24 | 22/43 | 35/41 | 4/5 |
PROSTAR website [31].
aThe first number of each column is the number of correctly identified decoys, and the second one after the slash is the total number of decoys. With either of the first two parameters the native structure is correctly identified if its value is smaller than that from any other structure in the decoy set. The results with the other parameters are taken from [14].
bThe parameters developed in this study.
cThe atomic Knowledge-Based Potential from Lu and Skolnick [14].
dRAPDF and CDF are atomic and residue-based potentials, respectively, from Samudrala and Moult [13].
eResidue-based quasichemical potential from Skolnick et al[33].
Figure 1Scatter plot of R1ctf, along with its decoys.
Identification of native structure from decoys constructed by homology modeling
| Parameters | hg_structal | ig_structal_hires |
|---|---|---|
| Rs | 23/29 | 18/20 |
| Rp | 15/29 | 17/20 |
Dataset taken from [19]http://dd.compbio.washington.edu. The first column in each category is the number of correctly identified decoys, and the second column is the total number of decoys.
Performance of the different scoring function for predicting the native structure among the best near-native structures submitted in CASP7
| % of the native structureb | |||
|---|---|---|---|
| Methoda | Znat | Rank1 | Rank10 |
| Modcheck | 1.99 | 49.47 | 72.63 |
| RAPDF | -2.09 | 57.89 | 81.05 |
| DFIRE | -1.25 | 62.11 | 75.79 |
| ProQ | 1.51 | 9.47 | 33.68 |
| ProQ_SSE | 1.76 | 14.74 | 44.21 |
| FRST | -2.41 | 58.95 | 75.79 |
| QMEAN3 | -2.27 | 62.11 | 78.95 |
| Rp | 1.69 | 53.52 | 91.55 |
| Rs | 2.17 | 71.83 | 98.59 |
Znat corresponds to the average Z-score of the native structure.
a Except the last two functions, the performance of others are based on the data provided in Table 6 of [22].
b % of the native structure with rank 1 or within rank 10 from among all the solutions submitted in CASP7.
Rs and Rp for two proteins having the same fold belonging to the β class
| Name of the protein | Number of residues | Number of aligned residues | Rs | Rp |
|---|---|---|---|---|
| Azurin ( | 126 | 84 | 1.12 | 0.33 |
| Plastocyanin ( | 99 | 84 | 1.33 | 0.46 |
The structures are aligned using the software SSM at EBI http://www.ebi.ac.uk/msd-srv/ssm. The values calculated considering only the aligned amino acid residues are given in parenthesis. To quantify the sequence structure compatibility between the structures, two more parameters are computed over the aligned residues. (Rs)azu/pcy = 0.97 and (Rs)pcy/azu = 0.89. Each term contributing to the former corresponds to (ASAazu -