| Literature DB >> 21625446 |
Abstract
A complete macromolecule modeling package must be able to solve the simplest structure prediction problems. Despite recent successes in high resolution structure modeling and design, the Rosetta software suite fares poorly on small protein and RNA puzzles, some as small as four residues. To illustrate these problems, this manuscript presents Rosetta results for four well-defined test cases: the 20-residue mini-protein Trp cage, an even smaller disulfide-stabilized conotoxin, the reactive loop of a serine protease inhibitor, and a UUCG RNA tetraloop. In contrast to previous Rosetta studies, several lines of evidence indicate that conformational sampling is not the major bottleneck in modeling these small systems. Instead, approximations and omissions in the Rosetta all-atom energy function currently preclude discriminating experimentally observed conformations from de novo models at atomic resolution. These molecular "puzzles" should serve as useful model systems for developers wishing to make foundational improvements to this powerful modeling suite.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21625446 PMCID: PMC3098862 DOI: 10.1371/journal.pone.0020044
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Small “puzzles” for high resolution Rosetta tests.
(A) Trp cage, (B) α-conotoxin GI, (C) Reactive loop of chymotrypsin inhibitor from barley, (D) the UUCG tetraloop (RNA). Each panel shows experimental structures side-by-side with lowest energy Rosetta de novo model discovered in extensive runs (see Fig. 2 and Table 1).
Figure 2All-atom energy vs. RMSD plots for de novo modeling of the four puzzles and for optimizing experimental (“native”) conformations.
Panels correspond exactly to panels in Fig. 1. In protein cases (A)-(C), the default Rosetta all-atom energy function for de novo protein modeling (score12) is plotted against Cα RMSD. In the RNA case (D), the FARFAR energy function (which contains torsional terms for RNA, an orientation-dependent solvation function, and a carbon-hydrogen-bond model [19]) is plotted against all-heavy-atom RMSD. The conformational sampling algorithms (ABRELAX, SWA, etc.) used in the runs are denoted in the figure and described in detail in Methods.
Comparison of best scoring Rosetta models with optimized experimental models.
| Lowest energy | Lowest energy optimized native model | |||||
| Puzzle | Length | energy | rmsd | energy | rmsd | energy gap |
| A. Trp cage | 20 | –40.4 | 2.14 | –38.1 | 0.66 | –2.27 |
| B. α-conotoxin GI | 13 | 8.3 | 2.83 | 10.5 | 0.35 | –2.19 |
| C. chym. inhib. loop | 11 | –102.0 | 1.77 | –98.1 | 0.69 | –3.99 |
| D. UUCG RNA | 4 | –63.0 | 4.26 | –53.2 | 0.68 | –9.86 |
Rosetta all-atom energy (“score12”) for protein cases A–C[49], and Rosetta FARFAR energy for RNA case D [19]. A Rosetta score unit is approximately 0.5–1 kcal/mol[50].
Cα RMSD for proteins; all-atom RMSD for RNA.
Energy of de novo model minus energy of optimized native. A negative sign (observed in all cases) signifies an energy function error.
The entire RNA construct is 8 residues, but only 4 residues are built de novo.