| Literature DB >> 20823308 |
Yuki Kato1, Kengo Sato, Michiaki Hamada, Yoshihide Watanabe, Kiyoshi Asai, Tatsuya Akutsu.
Abstract
MOTIVATION: Considerable attention has been focused on predicting RNA-RNA interaction since it is a key to identifying possible targets of non-coding small RNAs that regulate gene expression post-transcriptionally. A number of computational studies have so far been devoted to predicting joint secondary structures or binding sites under a specific class of interactions. In general, there is a trade-off between range of interaction type and efficiency of a prediction algorithm, and thus efficient computational methods for predicting comprehensive type of interaction are still awaited.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20823308 PMCID: PMC2935440 DOI: 10.1093/bioinformatics/btq372
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.An example of RNA–RNA interaction containing two kissing hairpins. A broken line represents an internal base pair, and a black circle indicates a base that constitutes an external base pair (binding site).
Fig. 2.An illustration of the factorization [Equation (3)] of the posterior probability P(σ ∣ a, b) of a joint structure σ. A broken line shows an internal or external base pair.
Fig. 3.An illustration of variables used in the IP formulation. This variable setting enables the model to represent a kissing hairpin.
Fig. 4.An illustration of several constraints of the IP formulation. In each of the three diagrams, at most one variable shown by a broken (curved) line can take a value 1.
Fig. 5.An illustration of stacked pairing constraints of the IP formulation.
Comparison with competitive methods for joint structure prediction
| Antisense-target | No. of sites | Sensitivity | PPV | Time (s) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| RactIP | inRNAs | inteRNA | RactIP | inRNAs | inteRNA | RactIP | inRNAs | inteRNA | RactIP | ||
| CopA-CopT | 2 | 1.000 | 1.000 | 0.731 | 0.754 | 0.846 | 0.655 | 0.860 | 0.917 | 0.691 | 0.13 |
| DIS-DIS | 1 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.05 |
| IncRNA54-RepZ | 1 | 0.813 | 0.875 | 0.958 | 0.736 | 0.792 | 0.836 | 0.772 | 0.831 | 0.893 | 0.10 |
| R1inv-R2inv | 1 | 1.000 | 0.900 | 0.800 | 1.000 | 0.900 | 0.889 | 1.000 | 0.900 | 0.842 | 0.03 |
| Tar-Tar* | 1 | 1.000 | 1.000 | 1.000 | 0.875 | 0.875 | 0.875 | 0.933 | 0.933 | 0.933 | 0.03 |
| Average | 0.963 | 0.955 | 0.898 | 0.873 | 0.883 | 0.851 | 0.913 | 0.916 | 0.872 | ||
The five RNA–RNA interaction pairs were predicted by RactIP, inRNAs (the joint structure prediction model) (Salari et al., 2010a) and inteRNA (the Loop Model) (Aksay et al., 2007; Alkan et al., 2006). In the table, No. of sites represents the number of binding sites. We set the parameters for RactIP as α=0.5, θ=0.5 and θ=0.2. Running time of RactIP was measured on Mac OS X 10.6 running Intel Core 2 Duo 2.13 GHz with 2 GB memory. Note that computation time of inRNAs is reported to be at most 4000 s for long sequences on Sun Fire X4600 2.6 GHz with 64 GB memory (Salari et al., 2010a).
Comparison with competitive methods for binding site prediction
| Antisense-target | No. of sites | Sensitivity | PPV | Time (s) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RactIP | inRNAs | IntaRNA | RactIP | inRNAs | IntaRNA | RactIP | inRNAs | IntaRNA | RactIP | inRNAs | IntaRNA | ||
| CopA-CopT | 2 | 0.815 | 0.889 | 1.000 | 0.579 | 0.828 | 0.391 | 0.677 | 0.857 | 0.562 | 0.14 | 0.21 | 0.14 |
| DIS-DIS | 1 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.04 | 0.03 | 0.04 |
| IncRNA54-RepZ | 1 | 0.750 | 1.000 | 0.738 | 0.783 | 0.889 | 0.850 | 0.766 | 0.941 | 0.790 | 0.10 | 2.56 | 0.11 |
| R1inv-R2inv | 1 | 1.000 | 1.000 | 1.000 | 1.000 | 0.778 | 1.000 | 1.000 | 0.875 | 1.000 | 0.03 | 0.03 | 0.02 |
| Tar-Tar* | 1 | 1.000 | 1.000 | 1.000 | 0.833 | 0.833 | 0.833 | 0.909 | 0.909 | 0.909 | 0.02 | 0.03 | 0.03 |
| DsrA-RpoS | 1 | 0.654 | 0.808 | 0.808 | 0.739 | 0.778 | 0.778 | 0.694 | 0.793 | 0.793 | 0.06 | 6.80 | 0.05 |
| GcvB-argT | 1 | 0.950 | 0.950 | 0.950 | 1.000 | 0.864 | 0.950 | 0.974 | 0.905 | 0.950 | 0.04 | 8.07 | 0.03 |
| GcvB-dppA | 1 | 0.941 | 1.000 | 1.000 | 0.593 | 0.850 | 0.586 | 0.727 | 0.919 | 0.739 | 0.05 | 5.59 | 0.04 |
| GcvB-gltI | 1 | 1.000 | 0.750 | 0.000 | 1.000 | 0.500 | 0.000 | 1.000 | 0.600 | 0.000 | 0.05 | 2.74 | 0.04 |
| GcvB-livJ | 1 | 0.955 | 0.634 | 0.955 | 0.955 | 0.824 | 0.955 | 0.955 | 0.717 | 0.955 | 0.04 | 6.10 | 0.04 |
| GcvB-livK | 1 | 0.958 | 0.540 | 0.542 | 0.958 | 0.570 | 0.565 | 0.958 | 0.555 | 0.553 | 0.04 | 3.24 | 0.03 |
| GcvB-oppA | 1 | 1.000 | 1.000 | 1.000 | 1.000 | 0.733 | 0.957 | 1.000 | 0.846 | 0.978 | 0.05 | 8.23 | 0.04 |
| GcvB-STM4351 | 1 | 0.880 | 0.760 | 0.760 | 1.000 | 1.000 | 0.905 | 0.936 | 0.864 | 0.826 | 0.04 | 2.59 | 0.04 |
| IstR-tisAB | 1 | 0.778 | 0.722 | 0.879 | 1.000 | 1.000 | 0.960 | 0.875 | 0.839 | 0.918 | 0.05 | 8.24 | 0.05 |
| MicA-ompA | 1 | 0.875 | 1.000 | 1.000 | 0.875 | 1.000 | 1.000 | 0.875 | 1.000 | 1.000 | 0.32 | 3.29 | 0.04 |
| MicA-lamB | 1 | 0.565 | 1.000 | 1.000 | 0.867 | 1.000 | 0.821 | 0.684 | 1.000 | 0.902 | 0.08 | 5.38 | 0.08 |
| MicC-ompC | 1 | 0.727 | 1.000 | 1.000 | 0.889 | 1.000 | 0.537 | 0.800 | 1.000 | 0.699 | 0.07 | 8.11 | 0.06 |
| MicF-ompF | 1 | 0.833 | 0.960 | 0.960 | 0.769 | 0.960 | 0.960 | 0.800 | 0.960 | 0.960 | 0.73 | 17.82 | 0.83 |
| OxyS-fhlA | 2 | 0.563 | 0.813 | 0.500 | 0.818 | 1.000 | 1.000 | 0.667 | 0.897 | 0.667 | 0.32 | 0.21 | 0.39 |
| RyhB-sdhD | 1 | 0.824 | 0.618 | 0.588 | 0.824 | 0.955 | 1.000 | 0.824 | 0.750 | 0.741 | 0.07 | 7.74 | 0.06 |
| RyhB-sodB | 1 | 1.000 | 1.000 | 1.000 | 0.391 | 1.000 | 0.818 | 0.563 | 1.000 | 0.900 | 0.19 | 3.23 | 0.21 |
| SgrS-ptsG | 1 | 0.739 | 0.566 | 0.739 | 1.000 | 0.765 | 1.000 | 0.850 | 0.651 | 0.850 | 0.06 | 12.07 | 0.05 |
| Spot42-galK | 1 | 0.682 | 0.432 | 0.409 | 0.698 | 0.760 | 0.643 | 0.690 | 0.551 | 0.500 | 0.13 | 5.94 | 0.13 |
| Average | 0.847 | 0.845 | 0.819 | 0.851 | 0.865 | 0.805 | 0.836 | 0.845 | 0.791 | ||||
The 23 RNA–RNA interaction pairs were predicted by RactIP, inRNAs (the binding site prediction model) (Salari et al., 2010a) and IntaRNA (Busch et al., 2008). We set the parameters for RactIP as α=0.5, θ=0.3 and θ=0.5. Running time of RactIP and IntaRNA was measured on Mac OS X 10.6 running on Intel Core 2 Duo 2.13 GHz with 2 GB memory. Computation time of inRNAs measured on Intel Core 2 Duo 2.53 GHz with 4 GB memory was given by R. Salari (personal communication).
Comparison of accuracy and running time for joint structure prediction
| Antisense target | Sensitivity | PPV | Time | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| RactIP | rip | rip+RactIP | RactIP | rip | rip+RactIP | RactIP | rip | rip+RactIP | RactIP (s) | rip | |
| DIS-DIS | 1.000 | 0.500 | 0.500 | 1.000 | 0.500 | 0.500 | 1.000 | 0.500 | 0.500 | 0.05 | 19 m 40 s |
| IncRNA54-RepZ | 0.813 | 0.562 | 1.000 | 0.736 | 0.500 | 0.889 | 0.772 | 0.529 | 0.941 | 0.10 | 860 m |
| R1inv-R2inv | 1.000 | 0.900 | 1.000 | 1.000 | 0.900 | 1.000 | 1.000 | 0.900 | 1.000 | 0.03 | 37 s |
| Tar-Tar* | 1.000 | 1.000 | 1.000 | 0.875 | 0.875 | 0.875 | 0.933 | 0.933 | 0.933 | 0.03 | 9.5 s |
The four RNA–RNA interaction pairs were predicted by RactIP, rip (Huang et al., 2009, 2010) and RactIP with base-pairing probabilities calculated by rip (rip+RactIP). We set the parameters for RactIP as α=0.5, θ=0.5 and θ=0.2. Running time of RactIP was measured on Mac OS X 10.6 running on Intel Core 2 Duo 2.13 GHz with 2 GB memory. Computation time of rip was measured on linux kernel 2.6.30 running on Intel Xeon 3.33 GHz with 32 GB memory.