| Literature DB >> 24189228 |
Marco Marcia1, Elisabeth Humphris-Narayanan, Kevin S Keating, Srinivas Somarowthu, Kanagalaghatta Rajashankar, Anna Marie Pyle.
Abstract
Structured RNA molecules are key players in ensuring cellular viability. It is now emerging that, like proteins, the functions of many nucleic acids are dictated by their tertiary folds. At the same time, the number of known crystal structures of nucleic acids is also increasing rapidly. In this context, molecular replacement will become an increasingly useful technique for phasing nucleic acid crystallographic data in the near future. Here, strategies to select, create and refine molecular-replacement search models for nucleic acids are discussed. Using examples taken primarily from research on group II introns, it is shown that nucleic acids are amenable to different and potentially more flexible and sophisticated molecular-replacement searches than proteins. These observations specifically aim to encourage future crystallographic studies on the newly discovered repertoire of noncoding transcripts.Entities:
Keywords: RCrane; RNA structure; de novo structure design; homology modeling; long noncoding RNA; nucleic acid sequence homology
Mesh:
Substances:
Year: 2013 PMID: 24189228 PMCID: PMC3817690 DOI: 10.1107/S0907444913013218
Source DB: PubMed Journal: Acta Crystallogr D Biol Crystallogr ISSN: 0907-4449
Figure 1Statistics for nucleic acid versus protein structures. (a) Pie chart describing the percentage of X-ray structures solved containing different types of macromolecules. Nucleic acids are significantly underrepresented relative to proteins. Statistics were calculated from the PDB as of 19 February 2013. (b) Distribution of coding versus noncoding transcripts in the cell, as derived from genomic data analysis (Washietl et al., 2007 ▶; The Encode Project Consortium, 2012 ▶). An increasingly large number of noncoding elements have been shown to possess distinct tertiary structures; therefore, the quantity and the variety of nucleic acid crystallography targets are rapidly increasing.
Different MR strategies for nucleic acids of different lengths
| Short | Medium | Long | |
|---|---|---|---|
| Size (nt) | <30–40 | 40–200 | >200 |
| Typical secondary/tertiary structure | Hairpins | Combinations of hairpins | Complex tertiary structures |
| Availability of experimental models | 83.5 | 13.1 | 3.4 |
| Identification of structural homologues | Based on size, independently of sequence | By structure-similarity algorithm (sequence covariation) | By structure-similarity algorithm (sequence covariation) |
| Strategies to improve MR success | Generally unnecessary | Pruning bases/bases and sugars | Pruning bases/bases and sugars |
| Deletion of loops and junctions | Using only selected domains | ||
| Supporting the MR search using preliminary experimental phases | |||
| Limitations of MR using experimental models | Internal helical symmetry | R.m.s.d. | R.m.s.d. |
| Reference models in the absence of experimental data | Ideal helices modeled manually | Combinations of ideal helices modeled manually or | Combinations of ideal helices modeled manually (rare) or |
| Three-dimensional motifs modeled | Three-dimensional motifs modeled | ||
| Homology models | Homology models | ||
| Limitations of MR using | Internal helical symmetry | Difficulty in assigning small helical domains | Difficulty in assigning small helical domains |
| R.m.s.d. | R.m.s.d. | ||
| References | Baikalov & Dickerson (1998 | Scott (2012 | Humphris-Narayanan & Pyle (2012 |
Indicates the percentage of X-ray structures of nucleic acids of the corresponding size (statistics drawn from the PDB on 19 February 2013).
Root-mean-square deviation between the MR search model and the target structure.
Figure 2The low sensitivity of MR searches to nucleic acid sequence conservation. Group II intron structures were derived from the O. iheyensis group II intron structure (PDB entry 3igi) using (a) the original intron sequence (green tones), (b) a randomly generated sequence (blue tones) or (c) an ‘opposite’ sequence (gray tones). For each sequence, the backbone was distorted using FRODA with increasing values of the r.m.s.d. (from 0 Å in the darkest color to 4 Å in the lightest color). The TFZ scores from Phaser are plotted for each MR run. A TFZ value of 8 or higher was observed to confidently indicate correct solutions for protein structures (McCoy, 2007 ▶) and is thus taken here as a cutoff level for successful solutions. Although such a TFZ scale may not necessarily apply exactly to nucleic acid structures, we observed that our solutions with TFZ > 8 were generally associated with interpretable electron-density maps.
Figure 3Minimal MR models for nucleic acid structures. The structure of the group II intron can be successfully solved (TFZ > 8 in Phaser and interpretable electron-density maps) using only the sugar-phosphate backbone distorted by up to 2 Å r.m.s.d. (a) or only the phosphate groups distorted by up to 1 Å r.m.s.d. (b) but not by using only the coordinates of the nucleobases (no Phaser solution was obtained) (c).
Figure 4Multi-domain MR searches for phasing nucleic acid structures. (a) The experimental structure-factor amplitudes of an O. iheyensis group II intron data set (PDB entry 4faw) can be phased with a Phaser multi-domain MR search using ten individual intron subdomains as a starting model. (b) The resulting σ-weighted 2F o − F c electron-density map is shown around the active-site motifs in blue mesh (1.5σ contour level). The positive signal in the σ-weighted F o − F c map (green mesh, 3.0σ) is shown at the expected position of the catalytic metal ions, which were not included in the search model (M1/M2 and K1; yellow and purple spheres, respectively). (c) Successful solutions (TFZ > 8) could be obtained using intron subdomain structures distorted by up to 1 Å r.m.s.d. with respect to their original structure in the model (PDB entry 3igi).
Figure 5Phased MR. The D2 domain of the group II intron (black cartoon diagram; top left panel) was not placed correctly in multi-domain MR searches (see Fig. 4 ▶). However, it can be assigned to its expected position in the electron density (right panel) by MOLREP if information from weak experimental phases is provided (bottom left panel). The solvent-flattened experimental electron-density map is depicted as a blue mesh at a 1.5σ contour level.
Figure 6De novo design of nucleic acid domains to be used as MR search ensembles. Using a target sequence and a fragment library (left-hand panel), de novo design techniques can be used to build hundreds or thousands of models (middle panel). Each model could be scored based on an energy function and, if applicable, its accuracy of fit to an initial experimental density. Promising models could then be used as starting points for further tracing and refinement. Iterative model rebuilding and energy-guided optimization could significantly increase the quality of the final solution (right-hand panel, blue structure).