| Literature DB >> 29236971 |
Maciej Antczak1, Mariusz Popenda2, Tomasz Zok1,3, Michal Zurkowski1, Ryszard W Adamiak1,2, Marta Szachniuk1,2.
Abstract
Motivation: Understanding the formation, architecture and roles of pseudoknots in RNA structures are one of the most difficult challenges in RNA computational biology and structural bioinformatics. Methods predicting pseudoknots typically perform this with poor accuracy, often despite experimental data incorporation. Existing bioinformatic approaches differ in terms of pseudoknots' recognition and revealing their nature. A few ways of pseudoknot classification exist, most common ones refer to a genus or order. Following the latter one, we propose new algorithms that identify pseudoknots in RNA structure provided in BPSEQ format, determine their order and encode in dot-bracket-letter notation. The proposed encoding aims to illustrate the hierarchy of RNA folding.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29236971 PMCID: PMC5905660 DOI: 10.1093/bioinformatics/btx783
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Subsequent tiers of cyanocobalamin aptamer (1DDY, chain A) folding pathway from (a) single-stranded form, through creation of (b) a hairpin and (c) first order pseudoknot of H-type, to (d) the final structure with second order pseudoknot of L-type
Base pair encoding in DBL notation
| Region order ( | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|---|
| Base pair representation: | () | [] | {} | < > | A a | B b | C c | D d | E e |
Fig. 2.DBL representations of cyanocobalamin aptamer (1DDY, chain A) secondary structure encoded by (a) HYB and (b) FCFS, the corresponding arc diagrams and fscoreII values
A number of instances in DS1 and DS2 which include k = 1…13 disjoint pseudoknots per one structure
| # Pseudoknots per str. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # Structures in | 154 | 18 | 8 | 1 | 0 | 5 | 6 | 1 | 4 | 4 | 5 | 2 | 1 |
| # Structures in | 221 | 25 | 8 | 1 | 0 | 5 | 6 | 1 | 4 | 4 | 5 | 2 | 1 |
A number of structures with pseudoknot order psorder = 1…8, found in particular datasets
| Pseudoknot order | 1 | 2 | 3 | 4 | 5 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| FCFS | 162 | 36 | 7 | 3 | 1 | 138 | 52 | 59 | 16 | 9 | 3 | 5 | 1 |
| EG | 160 | 32 | 12 | 4 | 1 | 133 | 61 | 57 | 15 | 6 | 5 | 5 | 1 |
| EC | 160 | 35 | 9 | 4 | 1 | 132 | 63 | 55 | 15 | 6 | 7 | 4 | 1 |
| DP | 159 | 33 | 11 | 5 | 1 | 132 | 62 | 56 | 16 | 6 | 6 | 5 | 0 |
| HYB | 161 | 33 | 9 | 5 | 1 | 137 | 58 | 58 | 14 | 8 | 5 | 3 | 0 |
| (c) | (d) | ||||||||||||
| All algorithms | 123 | 6 | 0 | 0 | 0 | 87 | 17 | 7 | 0 | 0 | 0 | 0 | 0 |
All-against-all algorithm comparison for dataset upon fscoreII
| FCFS | EG | EC | DP | HYB | # Duels won | # Battles won | |
|---|---|---|---|---|---|---|---|
| FCFS | – | 0 | 1 | 0 | 0 | 1 | 0 |
| EG | 75 | – | 22 | 1 | 2 | 100 | 1 |
| EC | 75 | 6 | – | 0 | 2 | 83 | 0 |
| DP | 79 | 6 | 22 | – | 1 | 108 | 0 |
| HYB | 78 | 12 | 23 | 7 | – | 120 | 7 |
| # Duels lost | 307 | 24 | 68 | 8 | 5 | – | – |
| # Battles lost | 71 | 0 | 1 | 0 | 0 | – | – |
All-against-all algorithm comparison for dataset upon fscoreII
| FCFS | EG | EC | DP | HYB | # Duels won | # Battles won | |
|---|---|---|---|---|---|---|---|
| FCFS | – | 0 | 15 | 0 | 0 | 15 | 0 |
| EG | 169 | – | 94 | 1 | 5 | 269 | 0 |
| EC | 108 | 6 | – | 0 | 5 | 119 | 0 |
| DP | 170 | 18 | 95 | – | 5 | 288 | 1 |
| HYB | 167 | 28 | 98 | 25 | – | 318 | 25 |
| # Duels lost | 614 | 52 | 302 | 26 | 15 | – | – |
| # Battles lost | 107 | 0 | 15 | 0 | 0 | – | – |
Percentage of instances from dataset DS1 and DS2 for which Pareto optimal solution was found by the algorithm
| Dataset | FCFS | EG | EC | DP | HYB |
|---|---|---|---|---|---|
| 62.68% | 94.25% | 88.52% | 96.17% | 99.04% | |
| 39.58% | 89.75% | 63.60% | 91.17% | 98.59% |
A number of the i-th order regions identified in pseudoknots of RNA from ribosomal subunit from human mitochondria (3J7Y, chain A)
| Region order ( | 0 | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|---|
| # FCFS-identified | 76 | 12 | 8 | 2 | 1 | 1 | 0 |
| # EG/EC/DP-identified | 76 | 13 | 6 | 2 | 1 | 1 | 1 |
| # HYB-identified | 76 | 13 | 7 | 3 | 1 | 0 | 0 |
Fig. 3.A distribution of regions with non-zero order in the structure of RNA from ribosomal subunit from human mitochondria (3J7Y, chain A) and their encoding by considered algorithms
Fig. 4.Arc diagrams of pseudoknot-involved regions in RNA from ribosomal subunit from human mitochondria (3J7Y, chain A) corresponding to HYB (top) and FCFS (bottom) results