| Literature DB >> 28472503 |
Robert M Hanson1, Xiang-Jun Lu2.
Abstract
Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org).Entities:
Mesh:
Substances:
Year: 2017 PMID: 28472503 PMCID: PMC5570162 DOI: 10.1093/nar/gkx365
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Definitions of key nucleic acid structural components in DSSR [reproduced from Figure 1 of reference (9)]. (A) Nucleotides are recognized using standard atom names and base planarity. This method works for both the standard bases (A, C, G, T and U), and those of modified nucleotides, regardless of their tautomeric or protonation states. (B) Bases are assigned a standard reference frame (25) that is independent of sequence identity: purines and pyrimidines are symmetrically placed with respect to the sugar. (C) The standard base frame is derived from an idealized Watson-Crick base pair, and defines three base edges (Watson-Crick, minor groove, and Major groove) that are used to classify pairing interactions. (D) Base pairs are identified from the co-planarity of base rings and the occurrence of hydrogen bonds. This geometric algorithm can find canonical (Watson-Crick and G–U wobble) as well as non-canonical pairs. Higher-order (three or more) co-planar base associations, termed multiplets, are also detected. (E) Helices are defined by stacking interactions of base pairs, regardless of pairing type (canonical or otherwise) or backbone connectivity (covalently connected or broken). A helix consists of at least two base pairs. The same algorithm is applied to identify continuous base stacks that are outside of helical regions, by using bases instead of pairs as the assembly unit. Nucleotides not involved in base-stacking interactions are collected into one separate group. A stem is defined as a special type of helix, made up of canonical pairs and with a continuous backbone along each strand. Coaxial stacking is defined by the presence of two or more stems within one helix. An isolated canonical pair is one that is not contained within a stem. (F) ‘Closed’ loops of various types (hairpin, bulge, internal, and junction loops) are delineated by stems or isolated pairs, and specified by the lengths of the intervening, consecutive nucleotide segments. A kissing-loop motif entails formation of one or more canonical pairs between the bases in different hairpin loops. Single-stranded segments that lie outside loops are separately listed.
DSSR-derived features and DSSR-specific selections in Jmol, using yeast phenylalanine tRNA (1ehz) as an example
| Accessible features (16 keys) | ["pairs", "multiplets", "helices", "stems", "isoCanonPairs", "coaxStacks", "hairpins", "bulges", "iloops", "junctions", "kissingLoops", "ssSegments", "stacks", "nonStack", "hbonds", "nts"] |
| Actual counts (1ehz) | {"pairs":34, "multiplets":4, "helices":2, "stems":4, "isoCanonPairs":1, "coaxStacks":2, "hairpins":3, "junctions":2, "kissingLoops":1, "ssSegments":1, "stacks":11, "nonStack":4, "hbonds":118, "nts":76} |
| Base pair (G1–C72) | {"index":1, "nt1":"|1|A|G|1||||", "nt2":"|1|A|C|72||||", "bp":"G-C", "name":"WC", "Saenger":"19-XIX", "LW":"cWW", "DSSR":"cW-W"} |
| Nucleotide (2MG10) | {"nt_name":"2MG", "nt_id":"|1|A|2MG|10||||","is_modified":true, "chi":169.599, "puckering":"C3΄-endo"} |
| Jmol (SQL) selections | SELECT hairpins |
| SELECT within(dssr, "nts WHERE is_modified") | |
| SELECT within(dssr, "pairs WHERE name != 'WC'") |
Figure 2.A screenshot of the DSSR-JSmol web interface, highlighting the two reverse Hoogsteen pairs (U8–A14 and 5MU54–1MA58) of yeast phenylalanine tRNA (PDB id: 1ehz). (A) DSSR-derived structural features integrated into Jmol. (B) The main JSmol viewer canvas for visualization and interactive manipulations. (C) Common representation styles for selected structural features. (D) A simple text input field for advanced users to enter (short) Jmol script commands. (E) Structure input by PDB id, file upload (drag-and-drop), or selecting one from the twelve sample RNA structures. (F) Utilities to toggle between two states for six common cases. (G) Export of coordinate file or PNG image. (H) Links to online resources for DSSR and Jmol.
Figure 3.Sample molecular images illustrating selected RNA structural features enabled by the DSSR-Jmol integration. (A) Structure of yeast phenylalanine tRNA (PDB id: 1ehz) in step diagram with bases labeled, highlighting (in red) the [2,1,5,0] four-way junction loop in 3D. (B) Structure of the Pistol self-cleaving ribozyme (PDB id: 5ktj), showcasing (in red) the vertical helix in space-filling representation. The helix is composed of six short stems stabilized via coaxial stacking interactions. (C) Structure of the xpt-pbuX guanine riboswitch in complex with hypoxanthine (PDB id: 4fe5) in ‘base blocks’ representation. The three-way junction loop encompassing the metabolite (in space-filling representation) is color-coded by base identity: A, red; C, yellow; G, green; U, cyan. The loop-loop interaction (a kissing-loop motif) at the top is highlighted in red. (D) Structure of the yeast GAL4 protein-DNA complex (PDB id: 1d66) in Jmol cartoon representation. DNA is color-coded by DSSR stems, and protein is in translucent brown. The DNA helix is broken into two stems due to a non-Watson-Crick pair (G11 with C28 in gray) in the middle. (E) Structure of the Thermus thermophilus 30S ribosomal subunit in complex with the antibiotics (PDB id: 1fjg) using a step diagram. The 16S ribosomal RNA is color-coded in spectrum with the 5΄-end in blue and the 3΄-end in red.