| Literature DB >> 34432024 |
Mariusz Popenda1, Tomasz Zok2, Joanna Sarzynska1, Agnieszka Korpeta2, Ryszard W Adamiak1,2, Maciej Antczak1,2, Marta Szachniuk1,2.
Abstract
Computational methods to predict RNA 3D structure have more and more practical applications in molecular biology and medicine. Therefore, it is crucial to intensify efforts to improve the accuracy and quality of predicted three-dimensional structures. A significant role in this is played by the RNA-Puzzles initiative that collects, evaluates, and shares RNAs built computationally within currently nearly 30 challenges. RNA-Puzzles datasets, subjected to multi-criteria analysis, allow revealing the strengths and weaknesses of computer prediction methods. Here, we study the issue of entangled RNA fragments in the predicted RNA 3D structure models. By entanglement, we mean an arrangement of two structural elements such that one of them passes through the other. We propose the classification of entanglements driven by their topology and components. It distinguishes two general classes, interlaces and lassos, and subclasses characterized by element types-loops, dinucleotide steps, open single-stranded fragments-and puncture multiplicity. Our computational pipeline for entanglement detection, applied for 1,017 non-redundant models from RNA-Puzzles, has shown the frequency of different entanglements and allowed identifying 138 structures with intersected assemblies.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34432024 PMCID: PMC8464073 DOI: 10.1093/nar/gkab716
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Putative types of RNA 3D structure entanglements are shown using simplified hairpin representations as an example. Entangled structure elements are color-coded: loops in cyan, dinucleotide steps in orange, and single-stranded fragments in magenta. Intersection sites are marked with black beads.
Figure 2.The pipeline for entanglement identification in RNA 3D structures.
Figure 3.The number of (A) non-redundant models with and without entanglements predicted within every RNA-Puzzles challenge, and (B) lasso and interlace-type entanglements found in these models.
Detailed information on the non-redundant test dataset divided by puzzles
| Models | Entanglements | |||||
|---|---|---|---|---|---|---|
| Puzzle | Target | Length | RNA type | Entangled/total | Number | Types |
| PZ01 | 3MEI | 46 | RNA dimer | 0 / 14 | 0 | n/a |
| PZ02 | 3P59 | 100 | RNA nanosquare | 0 / 12 | 0 | n/a |
| PZ03 | 3OWZ | 84 | Glycine riboswitch | 0 / 12 | 0 | n/a |
| PZ04 | 3V7E | 126 | SAM-I riboswitch aptamer | 0 / 27 | 0 | n/a |
| PZ05 | 4P9R | 188 | Lariat-capping ribozyme | 9 / 24 | 14 | 5xL&L, 1xD&L,4xL(D), 4xL(L) |
| PZ06 | 4GXY | 168 | Adenosylcobalamin riboswitch | 5 / 34 | 9 | 3xD&D, 2xL(D), 1xL(L), 3xD(D) |
| PZ07 | 4R4V | 370 | VS ribozyme | 8 / 51 | 15 | 1xL&L, 5xD&L, 1xD&D, 4xL(D), 2xL(L), 2xD(D) |
| PZ08 | 4L81 | 96 | SAM-I/IV riboswitch | 5 / 42 | 6 | 1xD&L, 3xL(S), 1xL(D), 1xL(L) |
| PZ09 | 5KPY | 71 | 5-hydroxytryptophan aptamer | 0 / 32 | 0 | n/a |
| PZ10 | 4LCK | 171 | T-box - tRNA complex | 0 / 26 | 0 | n/a |
| PZ11 | 5LYS | 57 | 7SK 5′-hairpin riboregulator | 0 / 53 | 0 | n/a |
| PZ12 | 4QLM | 125 | Ydao riboswitch | 3 / 51 | 4 | 3xL(D), 1xL(L) |
| PZ13 | 4XW7 | 71 | ZTP riboswitch | 9 / 55 | 13 | 3xL&L, 1xL(S), 4xL(D), 5xL(L) |
| PZ14a | 5DDO | 61 | L-glutamine riboswitch (free) | 3 / 46 | 6 | 4xL(S), 2xL(D) |
| PZ14b | 5DDP | 61 | L-glutamine riboswitch (bound) | 2 / 56 | 3 | 2xL(D), 1xL(L) |
| PZ15 | 5DI4 | 68 | Hammerhead ribozyme | 17 / 70 | 30 | 10xL&L, 5xL(S), 14xL(D), 1xL(L) |
| PZ17 | 5K7C | 62 | Pistol ribozyme | 35 / 105 | 50 | 1xL&L, 32xL(S), 9xL(D), 8xL(L) |
| PZ18 | 5TPY | 71 | Exonuclease resistant RNA | 9 / 52 | 10 | 9xL(S), 1xL(D) |
| PZ19 | 5T5A | 62 | Twister sister ribozyme | 2 / 54 | 3 | 3xL(S) |
| PZ20 | 5Y85 | 68 | Twister sister ribozyme | 2 / 40 | 2 | 1xL(S), 1xL(D) |
| PZ21 | 5NWQ | 41 | Guanidine-III riboswitch | 9 / 51 | 9 | 9xL(S) |
| PZ24 | 6OL3 | 112 | Viral non-coding RNA | 19 / 88 | 27 | 2xL&L, 3xD&L, 10xL(S), 9xL(D), 3xL(L) |
The number of RNA models without and with entanglements
| Entanglements per model | 0 | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|---|
| No. of models | 859 | 94 | 32 | 6 | 5 | 1 |
The number of entanglements found in the analyzed dataset
|
| |||
| Class | L&L | D&L | D&D |
| No. of entanglements | 22 | 10 | 4 |
|
| |||
| Class | L(S) | L(D) | L(L) |
| Entanglements | 77 | 56 | 27 |
| Class | D(S) | D(D) | D(L) |
| No. of entanglements | 0 | 5 | 0 |
Figure 4.(A) The number of entangled and entanglement-free models, and (B) the distribution of entanglement types in RNA 3D structures with and without pseudoknots.
Figure 5.Example RNA 3D structure models predicted within RNA-Puzzles, with entanglements depicted in the secondary and tertiary structure visualization: (A) PZ17C5 (62nt) with L(S) entanglement, (B) PZ13D8 (71nt) with L&L entanglement, and (C) PZ05A1 (188nt) with L&L, L&D and L(D) entanglements.