| Literature DB >> 25760244 |
Adrien Pain1, Alban Ott, Hamza Amine, Tatiana Rochat, Philippe Bouloc, Daniel Gautheret.
Abstract
Most bacterial regulatory RNAs exert their function through base-pairing with target RNAs. Computational prediction of targets is a busy research field that offers biologists a variety of web sites and software. However, it is difficult for a non-expert to evaluate how reliable those programs are. Here, we provide a simple benchmark for bacterial sRNA target prediction based on trusted E. coli sRNA/target pairs. We use this benchmark to assess the most recent RNA target predictors as well as earlier programs for RNA-RNA hybrid prediction. Moreover, we consider how the definition of mRNA boundaries can impact overall predictions. Recent algorithms that exploit both conservation of targets and accessibility information offer improved accuracy over previous software. However, even with the best predictors, the number of true biological targets with low scores and non-targets with high scores remains puzzling.Entities:
Keywords: bacteria; sRNA; sRNA target prediction
Mesh:
Substances:
Year: 2015 PMID: 25760244 PMCID: PMC4615726 DOI: 10.1080/15476286.2015.1020269
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Figure 1.Two representations of target predictor performances. (A) Rank distribution of trusted targets (lower=better). For each program, the distribution shows the ranks of the best ranking target of each sRNA (22 sRNAs). Horizontal lines and numbers indicate median ranks, red dots indicate mean ranks. (B) ROC-like curves. For each program, the curve shows the number of trusted pairs predicted by the program (Y axis) among the X best ranking predictions (X-axis).
Area under ROC-like curve (AUC) for target prediction with 9 programs, using default UTR or “real” RNA-seq-derived UTRs
| Program | AUC default UTR | AUC real UTR | AUC gain (%) |
|---|---|---|---|
| CopraRNA | 46.41 | na | na |
| IntaRNA | 27.02 | 28.83 | 6.3 |
| RNAplex | 20.81 | 21.88 | 4.9 |
| RNAup | 22.75 | 27.59 | 17.5 |
| TargetRNA2 | 18.52 | na | na |
| RNAhybrid | 4.14 | 4.38 | 5.5 |
| RNAduplex | 3.86 | 3.55 | −8.7 |
| RNAcofold | 0.57 | 1.42 | 59.9 |
| Pairfold | 0.25 | 1.22 | 79.5 |
Programs CopraRNA and TargetRNA2 can be run only using default UTRs.
Recall of experimentally demonstrated base pairs (%)
| IntaRNA/CopraRNA | 76.7 |
| RNAplex | 73.6 |
| TargetRNA2 | 55.9 |
| RNAup | 78.9 |
IntaRNA and CopraRNA predict the same hybrid region.
Average run time for matching 1 sRNA to all 4317 E. coli mRNAs
| TargetRNA2 (web site) | 1 min |
| RNAplex (incl. accessibility calculation) | 1 min |
| RNAhybrid | 2 min |
| RNAcofold | 23 min |
| RNAup | 55 min |
| IntaRNA | 66 min |
| RNAduplex | 75 min |
| CopraRNA (website) | 105 min |
| Pairfold | 120 min |