| Literature DB >> 30670629 |
Almudena Ponce-Salvatierra1, Katarzyna Merdas1, Chandran Nithin1, Pritha Ghosh1, Sunandan Mukherjee1, Janusz M Bujnicki2,3.
Abstract
RNA molecules are master regulators of cells. They are involved in a variety of molecular processes: they transmit genetic information, sense cellular signals and communicate responses, and even catalyze chemical reactions. As in the case of proteins, RNA function is dictated by its structure and by its ability to adopt different conformations, which in turn is encoded in the sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore the majority of known RNAs remain structurally uncharacterized. To address this problem, predictive computational methods were developed based on the accumulated knowledge of RNA structures determined so far, the physical basis of the RNA folding, and taking into account evolutionary considerations, such as conservation of functionally important motifs. However, all theoretical methods suffer from various limitations, and they are generally unable to accurately predict structures for RNA sequences longer than 100-nt residues unless aided by additional experimental data. In this article, we review experimental methods that can generate data usable by computational methods, as well as computational approaches for RNA structure prediction that can utilize data from experimental analyses. We outline methods and data types that can be potentially useful for RNA 3D structure modeling but are not commonly used by the existing software, suggesting directions for future development.Entities:
Keywords: RNA structure; computational biochemistry; integrative modeling; molecular modeling
Mesh:
Substances:
Year: 2019 PMID: 30670629 PMCID: PMC6367127 DOI: 10.1042/BSR20180430
Source DB: PubMed Journal: Biosci Rep ISSN: 0144-8463 Impact factor: 3.840
Experimental methods commonly used for RNA structure determination
| Experimental technique | Data type | Main disadvantages and bottlenecks | Number of 3D structures in the PDB | Resolution [Å] of 3D structures in the PDB | RNA length (nt) in 3D structures in the PDB | ||||
|---|---|---|---|---|---|---|---|---|---|
| Original | Interpreted | RNA | RNP | RNA | RNP | RNA | RNP | ||
| MX | Diffraction patterns | Electron density maps | • Obtaining well-diffracting crystals, especially for flexible or disordered molecules | 807 | 1680 | 0.61–11.5 | 1.14–11.5 | 4–2880 | 2–3396 |
| SAXS/SANS | Electron/neutron scattering patterns | Low-resolution envelope | • Aggregation at high concentration | NA | NA | NA | NA | NA | NA |
| EM (including cryo-EM) | 2D projections of sample | 3D density map | • Challenges of working with a liquid sample | 29 | 478 | 4.5–18.8 | 2.5–34 | 12–2904 | 2–5070 |
| Solution NMR | Free induction decay signal | NMR spectrum/atomic interactions | • Aggregation | 503 | 120 | NA | NA | 5–155 | 4–101 |
| ssNMR | Free induction decay signal | NMR spectrum/atomic interactions | • Broad line widths and resonance overlap | 1 | 0 | NA | NA | 26 | NA |
| SHAPE/ Probing methods | Stops/mutations in cDNA reversely transcribed from modified RNA | Reactivity profile | • Reproducibility | NA | NA | NA | NA | NA | NA |
| FRET | Dampening of energy | Distances between interacting chromophores | • Reduced signal-to-noise ratio associated with acquiring the complete spectrum | 1 | 0 | NA | NA | NA | 49 |
| Fiber Diffraction | Diffraction patterns | Electron density maps | • Loss of structural information due to cylindrical averaging of diffraction data | 0 | 5 | NA | 2.9–3.5 | NA | 3 - 3 |
| Combination of experimental techniques | • Combining data obtained by different techniques | 4 | 1 | NA | NA | 47–192 | 58 | ||
| Total | 1345 | 2284 | |||||||
Figure 1Schematic illustration of selected experimental methods applicable to RNA structure determination: from data acquisition, through data processing, to the generation of an atomic model, 3D shape, or secondary structure representation
Main computational approaches for RNA 3D structure modeling
| Modeling technique | Obligatory input data for the target RNA (prior knowledge) | Non-obligatory additional data often used | Main advantages | Main disadvantages |
|---|---|---|---|---|
| Comparative modeling | • Target RNA sequence | • Secondary structure | • Computationally inexpensive and hence very fast | • Does not work without the template |
| Fragment assembly | • Target RNA sequence | • Secondary structure | • Computationally inexpensive and hence very fast | • Usually depends on the accuracy of predicted secondary structure |
| • Target RNA sequence | • Secondary structure | • Often capable of modeling small RNA molecules (<100 nt long) based on sequence data alone, without any additional information | • Computationally expensive and hence relatively slow | |
| • Target RNA sequence | • Enables modeling based on first principles, only with the knowledge of the RNA sequence and environmental conditions, but without any restraints | • Computationally very expensive and hence very slow | ||
| Integrative/hybrid modeling | • Target RNA sequence | • Most data types used are non-obligatory, but the modeling procedure is mostly data-driven | • Enables modeling based on the combination of various types of experimental data | • Applicable to large systems |
Figure 2Information flow from experimental techniques to modeling approaches
Figure 3Data workflow in RNA 3D structure determination, with various experimental and theoretical approaches, and combinations thereof