| Literature DB >> 22427840 |
Ashwin Subramani1, Christodoulos A Floudas.
Abstract
The prediction of the correct β-sheet topology for pure β and mixed α/β proteins is a critical intermediate step toward the three dimensional protein structure prediction. The predicted beta sheet topology provides distance constraints between sequentially separated residues, which reduces the three dimensional search space for a protein structure prediction algorithm. Here, we present a novel mixed integer linear optimization based framework for the prediction of β-sheet topology in β and mixed α/β proteins. The objective is to maximize the total strand-to-strand contact potential of the protein. A large number of physical constraints are applied to provide biologically meaningful topology results. The formulation permits the creation of a rank-ordered list of preferred β-sheet arrangements. Finally, the generated topologies are re-ranked using a fully atomistic approach involving torsion angle dynamics and clustering. For a large, non-redundant data set of 2102 β and mixed α/β proteins with at least 3 strands taken from the PDB, the proposed approach provides the top 5 solutions with average precision and recall greater than 78%. Consistent results are obtained in the β-sheet topology prediction for blind targets provided during the CASP8 and CASP9 experiments, as well as for actual and predicted secondary structures. The β-sheet topology prediction algorithm, BeST, is available to the scientific community at http://selene.princeton.edu/BeST/.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22427840 PMCID: PMC3302896 DOI: 10.1371/journal.pone.0032461
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The number of motifs possible for a protein with strands ().
| Strands | Number of Motifs | Strands | Number of Motifs |
| 2 | 2 | 3 | 12 |
| 4 | 96 | 5 | 960 |
| 6 | 11520 | 7 | 161280 |
| 8 | 2580480 | 9 | 46448640 |
| 10 | 928972800 | 11 |
|
Figure 1Changes in the average precision, recall and mcc values over the number of solutions.
Figure 2Top five results for proteins in the blind target test set from CASP8 and CASP9.