| Literature DB >> 27993777 |
Sinan Ugur Umu1,2, Paul P Gardner1,2,3.
Abstract
Motivation: The aim of this study is to assess the performance of RNA-RNA interaction prediction tools for all domains of life.Entities:
Mesh:
Substances:
Year: 2017 PMID: 27993777 PMCID: PMC5408919 DOI: 10.1093/bioinformatics/btw728
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.The distribution of scores for RNA–RNA interaction prediction algorithms. (A) RNAduplex gave the highest median TPR (sensitivity) followed by IntaRNA. (B) RNAup was the most precise algorithm based on PPV score followed by the other accessibility based methods IntaRNA and RNAplex. (C) RNAup was the best prediction algorithm based on median MCC score, with IntaRNA and RNAplex giving similar scores. RactIP produced the worst overall MCC (Color version of this figure is available at Bioinformatics online.)
Total run time of algorithms, and the cumulative TPR, PPV and MCC scores
| Algorithm | Total run time (s) on | TPR | PPV | MCC |
|---|---|---|---|---|
|
| (Sensitivity) | (Precision) | ||
| AccessFold | 596.44 | 0.38 | 0.31 | 0.35 |
| bifold | 404.63 | 0.37 | 0.31 | 0.34 |
| bistaRNA | 102.29 | 0.15 | 0.16 | 0.15 |
| DuplexFold |
| 0.48 | 0.17 | 0.29 |
| IntaRNA | 24.44 |
|
|
|
| NUPACK | 794.2 | 0.42 | 0.42 | 0.42 |
| pairfold | 90.24 | 0.39 | 0.29 | 0.34 |
| ractIP | 87.62 | 0.16 | 0.06 | 0.1 |
| RIsearch |
| 0.36 | 0.45 | 0.40 |
| RNAcofold | 15.28 | 0.41 | 0.32 | 0.36 |
| RNAduplex | 6.45 |
| 0.12 | 0.27 |
| RNAhybrid | 32.84 |
| 0.12 | 0.26 |
| RNAplex | 17.19 | 0.55 |
|
|
| RNAup | 137.48 | 0.51 |
|
|
| ssearch |
|
| 0.1 | 0.23 |
The cumulative scores (i.e. TPR, PPV, MCC) are calculated by adding individual TP, FP and FN values for all predictions.
The test of significance results of selected algorithms on bacterial sRNAs.
| Algorithm | Total # of significant ( | Total # of significant ( | Median rank of native interactions |
|---|---|---|---|
| AccessFold | 15 | 17 | 41.75 |
| DuplexFold | 2 | 8 | 63.5 |
| IntaRNA | |||
| RIsearch | 13 | 14 | 52.25 |
| RNAduplex | 8 | 11 | 54.25 |
| RNAhybrid | 5 | 6 | 76 |
| RNAplex | |||
| RNAup |
Higher is better for the second and third columns. Lower is better for the fourth column.
Fig. 2.This heatmap shows MCC values of each tool for entire dataset. The red cells display a higher MCC value denoting a better prediction. Similar methods are mostly clustered together based on these predictions (dendrogram at top). Row labels show the type of interactions. Predictions for the single archaeal sRNA are on the last row. An in depth examination of these results show that the algorithms are poor at predicting human miRNA-mRNA interactions (av. MCC: 0.22), snoRNAs (weaker for H/ACA as expected, av. MCC: 0.09), mouse piRNAs (av. MCC: 0.07). Conversely, they perform best on Arabidopsis miRNAs (av. MCC: 0.72), siRNAs (av. MCC: 0.71) and bacterial sRNAs (av. MCC: 0.40), which is most likely an effect of high complementarity in binding regions for these