| Literature DB >> 22373394 |
Md Shariful Islam Bhuyan1, Xin Gao.
Abstract
BACKGROUND: Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22373394 PMCID: PMC3287466 DOI: 10.1186/1471-2105-12-S14-S10
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Protein dihedral angle. This figure illustrates different protein dihedral angles. ϕ, ψ and ω constitute backbone dihedral angles and χ1, χ2 and χ3 denote side-chain dihedral angles.
Figure 2Interaction graph for residue chain. This figure gives an example of the interaction graph for a protein sequence of seven amino acids. C denotes the alpha carbon of the ith residue. Apart from the neighboring relationships, the are two more edges in this map namely (C2, C4) and (C4, C6).
Figure 3Markov random field for interaction graph in Figure 2. This figure illustrates the corresponding Markov random field structure of the interaction graph from Figure 2. Square nodes are backbone random variables for which conformation is known and circular nodes are side-chain random variables for which conformation is unknown. x denotes the backbone conformation of the ith residue. Since this is given, the variable is not capitalized. X denotes the side-chain conformation R of the ith residue. Since this is not given, the variable is capitalized.
Comparison of rotamer libraries for side-chain prediction
| Amino acid | Dihedral angle | P10 | P02 | D10 | D02 |
|---|---|---|---|---|---|
| 55.76 | 50.16 | 50.09 | |||
| 67.13 | 61.94 | 61.84 | |||
| 87.81 | 86.13 | 85.85 | |||
| 90.58 | 86.94 | 86.99 | |||
| 78.33 | 69.53 | 69.31 | |||
| 78.27 | 72.47 | 73.12 | |||
| 77.93 | 63.33 | 62.06 | |||
| 91.18 | 86.91 | 87.05 | |||
| 83.70 | 74.89 | 74.20 | |||
| 86.90 | 72.95 | 73.03 | |||
| 82.20 | 80.96 | 80.92 | |||
| 68.06 | 53.49 | 53.01 | |||
| 86.38 | 72.64 | 72.68 | |||
| 70.67 | 63.62 | 62.47 | |||
| 66.39 | 62.36 | 61.71 | |||
| 71.54 | 60.03 | 60.85 | |||
| 71.35 | 63.60 | 61.41 | |||
| 72.02 | 66.43 | 66.28 | |||
| 80.05 | 73.80 | 73.43 | |||
The first column contains amino acid names. The second column denotes the combination of dihedral angles for which the accuracy is reported. Starting from the third column, the accuracy for our proposed library with Dunbrack’s 2010 library as input, our proposed library with Dunbrack’s 2002 library as input, Dunbrack’s backbone-dependent library proposed in 2010, and Dunbrack’s backbone-dependent library proposed in 2002 is reported, respectively.
Comparison of rotamer libraries for rotamer ranking
| Average rank of correct rotamer | ||
|---|---|---|
| 1.738 | ||
| 2.9517 | ||
| Average probability of finding correct rotamers at top 1 position | ||
| 0.6470 | ||
| 0.4127 | ||
| Average probability of finding correct rotamer at top 2 positions | ||
| 0.8899 | ||
| 0.7053 | ||
| Average probability of finding correct rotamer at top 3 positions | ||
| 0.9265 | ||
| 0.7479 | ||
The top part of the table shows the comparison of the average rank of the first correct rotamers between our protein-dependent rotamer library with Dunbrack’s 2010 library as input (P10) and Dunbrack’s backbone-dependent library proposed in 2010 (D10). The other part of the table shows the comparison of the average probability of finding correct rotamers in the top 1, 2 and 3 rotamers of P10 and D10, respectively.
Comparison of inference algorithms
| Comparison | ||||
|---|---|---|---|---|
| Accuracy of | 80.03 | 79.54 | 76.45 | |
| Accuracy of | 60.72 | 60.33 | 55.58 | |
| Average rank of | 1.51 | 1.54 | 1.59 | |
| Average rank of | 2.25 | 2.33 | 2.45 | |
| Average execution time (in seconds) | 29.99 | 63.19 | 189.94 | |
Comparison of the average accuracy for side-chain prediction, the average rank of the first correct rotamers, and the average running time for loopy belief propagation (LBP), generalized belief propagation (GBP), mean field approximation (MF), and tree re-weighted belief propagation (TRBP).