| Literature DB >> 24584194 |
Parin Sripakdeevong1, Mirko Cevec2, Andrew T Chang3, Michèle C Erat4, Melanie Ziegeler2, Qin Zhao5, George E Fox6, Xiaolian Gao6, Scott D Kennedy7, Ryszard Kierzek8, Edward P Nikonowicz3, Harald Schwalbe2, Roland K O Sigel9, Douglas H Turner10, Rhiju Das11.
Abstract
Structured noncoding RNAs underlie fundamental cellular processes, but determining their three-dimensional structures remains challenging. We demonstrate that integrating ¹H NMR chemical shift data with Rosetta de novo modeling can be used to consistently determine high-resolution RNA structures. On a benchmark set of 23 noncanonical RNA motifs, including 11 'blind' targets, chemical-shift Rosetta for RNA (CS-Rosetta-RNA) recovered experimental structures with high accuracy (0.6-2.0 Å all-heavy-atom r.m.s. deviation) in 18 cases.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24584194 PMCID: PMC3985481 DOI: 10.1038/nmeth.2876
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Figure 1The CS-ROSETTA-RNA method illustrated on a UUAAGU hairpin. (a) The crystallographic structure (PDB: 1FJG). (b) Rosetta near-native model with a 0.52 Å all-heavy-atom rmsd to the crystallographic structure (rmsd calculated over the entire loop, excluding the flexible G6 extra-helical bulge). Two-dimensional schematics follow the Leontis and Westhof nomenclature[20]. (c) Plot of the Rosetta energy vs. rmsd to the crystallographic structure for all Rosetta models before the inclusion of the chemical shift pseudo-energy term. (d) Plot of back-calculated chemical shifts from the Rosetta near-native model vs. experimental 1H chemical shift values (rmsdshift= 0.19 ppm). (e) Plot of the average rmsdshift of all Rosetta models in 0.5-Å rmsd bins from the crystallographic structure. (f) Plot of the Rosetta energy vs. rmsd to the crystallographic structure for all Rosetta models after the inclusion of the chemical shift pseudo-energy term. With chemical shift data, the near-native model shown in b becomes the lowest energy model overall (green circle).
The CS-ROSETTA-RNA method benchmarked on 23 RNA motifs.
| Motif name | PDB | Nnt | rmsd-top1 | rmsd-top5 |
|---|---|---|---|---|
| Single G:G mismatch | 1F5G | 6 | ||
| UUCG tetraloop | 2KOC | 6 | ||
| Tandem GA:AG mismatch | 1MIS | 8 | ||
| Tandem UG:UA mismatch | 2JSE | 8 | 3.02 | 2.52 |
| 16S rRNA UUAAGU loop | 1FJG | 8 | ||
| HIV-1 TAR apical loop | 1ANR | 8 | 5.86 | 5.86 |
| tRNAiMet ASL | 1SZY | 9 | 3.89 | |
| Conserved SRP internal loop | 1LNT | 12 | ||
| R2 retrotransposon 4×4 loop | 2L8F | 12 | ||
| Hepatitis C virus IRES IIa | 2PN4 | 13 | 3.21 | |
| GAAA tetraloop-receptor | 2R8S | 15 | ||
| Sc.ai5γ 3-way junction | 2LU0 | 16 | 3.66 | |
|
| ||||
| UAAC tetraloop | 4A4R | 6 | ||
| UCAC tetraloop | 4A4S | 6 | ||
| UGAC tetraloop | 4A4U | 6 | 3.60 | |
| UUAC tetraloop | 4A4T | 6 | ||
| Chimp HAR1 GAA loop | 2LHP | 7 | 2.88 | 2.88 |
| Human HAR1 GAA loop | 2LUB | 7 | 2.26 | 2.03 |
| GU:UAU internal loop | – | 9 | ||
| tRNAGly ASL (cuUCCaa) | 2LBL | 9 | 3.28 | |
| tRNAGly ASL (cuUCCcg) | 2LBK | 9 | 3.42 | |
| tRNAGly ASL (uuGCCaa) | 2LBJ | 9 | 3.08 | 2.93 |
| 5′-GAGU/3′-UGAG loop | 2LX1 | 12 | ||
|
| ||||
| rmsd < 1.50 Å | – | – | 11/23 | 14/23 |
| rmsd < 2.00 Å | – | – | 12/23 | 18/23 |
Additional information and full motif names provided in Supplementary Tables 1 and 3.
PDB ID of reference experimental structure.
Motif size, the number of nucleotides in the modeled RNA motif. Each motif consists of noncanonical core nucleotides closed by boundary canonical (W.C. or G:U wobble) base pairs.
All-heavy-atom rmsd over all nucleotides, excluding the boundary canonical base pairs after alignment over all nucleotides. Nucleotides found to be extra-helical bulges (both unpaired and unstacked) in the reference experimental structure were excluded from both the alignment and the rmsd calculation. Bold text indicates rmsd better than 2.0 Å.
All-heavy-atom rmsd of the first-ranked (lowest energy) model to the experimental structure.
Lowest all-heavy-atom rmsd to the experimental structure among the five lowest energy cluster centers.
The 4 UNAC tetraloops were treated as separate motifs despite adopting similar conformations due to being blind targets.
The experimental structure was solved by the Sigel group at University of Zurich and has not yet been deposited into the PDB database.
The sequence of the 7-nt anticodon loop is given in parentheses with the anticodon triplet in uppercase.
Figure 2Comparison of experimental and CS-ROSETTA-RNA models for diverse RNA motifs. (a) Conserved 4×4 internal loop from the SRP RNA (PDB: 1LNT). (b) GAAA tetraloop-receptor tertiary interaction motif (PDB: 2R8S). (c) 3-way junction from yeast mitochondrial group II intron Sc.ai5γ (PDB: 2LU0). (d) 5′-GAGU-3′/3′-UGAG-5′ self-complementary internal loop (PDB: 2LX1). (e) 5′-GU-3′/3′-UAU-5′ internal loop from a group II intron. (f) Glycine tRNA(UCC) anticodon stem-loop from Bacillus subtilis (PDB: 2LBL). (g) UCAC tetraloop (PDB: 4A4S). The CS-ROSETTA-RNA models (shown in color) are overlaid on the experimental structures (shown in white). The rmsds between CS-ROSETTA-RNA models (energy cluster rank) and the experimental structure are (a) 0.81 Å (first), (b) 0.68 Å (first), (c) 1.74 Å (fourth), (d) 1.10 Å (first), (e) 1.37 (first), (f) 1.41 Å (third), and (g) 1.00 Å (first). The two-dimensional schematics are annotated based on the experimental structure and follow the Leontis and Westhof nomenclature[20].