| Literature DB >> 28559915 |
Arthur C Oliveira1, Luiz A Bovolenta2, Pedro G Nachtigall1, Marcos E Herkenhoff1, Ney Lemke2, Danillo Pinhal1.
Abstract
Target prediction is generally the first step toward recognition of bona fide microRNA (miRNA)-target interactions in living cells. Several target prediction tools are now available, which use distinct criteria and stringency to provide the best set of candidate targets for a single miRNA or a subset of miRNAs. However, there are many false-negative predictions, and consensus about the optimum strategy to select and use the output information provided by the target prediction tools is lacking. We compared the performance of four tools cited in literature-TargetScan (TS), miRanda-mirSVR (MR), Pita, and RNA22 (R22), and we determined the most effective approach for analyzing target prediction data (individual, union, or intersection). For this purpose, we calculated the sensitivity, specificity, precision, and correlation of these approaches using 10 miRNAs (miR-1-3p, miR-17-5p, miR-21-5p, miR-24-3p, miR-29a-3p, miR-34a-5p, miR-124-3p, miR-125b-5p, miR-145-5p, and miR-155-5p) and 1,400 genes (700 validated and 700 non-validated) as targets of these miRNAs. The four tools provided a subset of high-quality predictions and returned few false-positive predictions; however, they could not identify several known true targets. We demonstrate that union of TS/MR and TS/MR/R22 enhanced the quality of in silico prediction analysis of miRNA targets. We conclude that the union rather than the intersection of the aforementioned tools is the best strategy for maximizing performance while minimizing the loss of time and resources in subsequent in vivo and in vitro experiments for functional validation of miRNA-target interactions.Entities:
Keywords: Pita; RNA22; TargetScan; bioinformatics; in silico prediction; miRanda-mirSVR; non-coding RNA
Year: 2017 PMID: 28559915 PMCID: PMC5432626 DOI: 10.3389/fgene.2017.00059
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Summary of the target prediction tools analyzed.
| TargetScan | miRanda-mirSVR | Pita | RNA22 | |
|---|---|---|---|---|
| Website | https://cm.jefferson. edu/rna22/ | |||
| Version | v7.1 (06/2016) | V3.3a (08/2010) | V6 (08/2008) | V2 (04/2015) |
| Predictions downloaded | Conserved sites | Good mirSVR score, Conserved miRNA | Seed 7- or 8-mer and conservation score 0.9 or higher | Base pair: >12 Folding energy: ≤ -12 kcal/mol |
| Reference | ||||
Sensitivity, specificity, and precision of the target prediction methods.
| Method | Tool | Sensitivity | Specificity | Precision |
|---|---|---|---|---|
| Individual tool | TargetScan | 0.524 ± 0.004 | 0.984 ± 0.005 | 0.971 ± 0.004 |
| miRanda-mirSVR | 0.617 ± 0.012 | 0.954 ± 0.006 | 0.930 ± 0.010 | |
| Pita | 0.336 ± 0.009 | 0.992 ± 0.004 | 0.977 ± 0.011 | |
| RNA22 | 0.336 ± 0.009 | 0.893 ± 0.019 | 0.805 ± 0.027 | |
| Union | TS + MR + PT + R22 | 0.825 ± 0.008 | 0.862 ± 0.020 | 0.857 ± 0.017 |
| TS + MR + PT | 0.710 ± 0.006 | 0.949 ± 0.007 | 0.933 ± 0.009 | |
| TS + MR + R22 | 0.822 ± 0.007 | 0.862 ± 0.020 | 0.857 ± 0.017 | |
| TS + PT + R22 | 0.757 ± 0.038 | 0.879 ± 0.028 | 0.863 ± 0.023 | |
| MR + PT + R22 | 0.784 ± 0.006 | 0.865 ± 0.019 | 0.853 ± 0.018 | |
| TS + MR | 0.706 ± 0.006 | 0.949 ± 0.008 | 0.932 ± 0.009 | |
| TS + PT | 0.558 ± 0.007 | 0.983 ± 0.005 | 0.970 ± 0.009 | |
| TS + R22 | 0.716 ± 0.004 | 0.885 ± 0.21 | 0.862 ± 0.21 | |
| MR + PT | 0.624 ± 0.016 | 0.954 ± 0.006 | 0.931 ± 0.010 | |
| MR + R22 | 0.773 ± 0.008 | 0.865 ± 0.019 | 0.851 ± 0.019 | |
| PT + R22 | 0.613 ± 0.005 | 0.889 ± 0.020 | 0.847 ± 0.023 | |
| Intersection | TS + MR + PT + R22 | 0.139 ± 0.006 | 0.998 ± 0.003 | 0.984 ± 0.018 |
| TS + MR + PT | 0.279 ± 0.014 | 0.994 ± 0.003 | 0.980 ± 0.010 | |
| TS + MR + R22 | 0.201 ± 0.007 | 0.995 ± 0.005 | 0.976 ± 0.022 | |
| TS + PT + R22 | 0.150 ± 0.006 | 0.997 ± 0.002 | 0.976 ± 0.016 | |
| MR + PT + R22 | 0.151 ± 0.005 | 0.997 ± 0.003 | 0.978 ± 0.018 | |
| TS + MR | 0.435 ± 0.013 | 0.989 ± 0.004 | 0.976 ± 0.009 | |
| TS + PT | 0.298 ± 0.012 | 0.993 ± 0.003 | 0.978 ± 0.009 | |
| TS + R22 | 0.249 ± 0.006 | 0.992 ± 0.004 | 0.969 ± 0.014 | |
| MR + PT | 0.312 ± 0.013 | 0.993 ± 0.003 | 0.977 ± 0.010 | |
| MR + R22 | 0.285 ± 0.008 | 0.981 ± 0.001 | 0.940 ± 0.005 | |
| PT + R22 | 0.164 ± 0.004 | 0.996 ± 0.003 | 0.974 ± 0.018 | |
| Majority vote | 0.581 ± 0.11 | 0.973 ± 0.005 | 0.955 ± 0.008 | |
Specificity values of random string predictions.
| Strings length | TargetScan | miRanda-mirSVR | Pita | RNA22 |
|---|---|---|---|---|
| 500 nt | 0.9489 | 0.9446 | 0.8874 | 0.9338 |
| 1,000 nt | 0.8966 | 0.8903 | 0.7917 | 0.8766 |
| 2,500 nt | 0.7668 | 0.7550 | 0.5562 | 0.7466 |
| 5,000 nt | 0.5846 | 0.5596 | 0.3081 | 0.6008 |
| 500 + 1,000 + 2,500 + 5,000 | 0.7992 | 0.7874 | 0.6359 | 0.7895 |
| 500 + 1,000 + 2,500 | 0.8708 | 0.8633 | 0.7451 | 0.8523 |
Summary of the learning attributes of each tool.
| Groups | Attributes | TargetScan | miRanda-mirSVR | Pita | RNA22 |
|---|---|---|---|---|---|
| Duplex features | Seed match | X | X | X | X |
| 3′ contribution | X | X | X | X | |
| SPS | X | ||||
| Heteroduplex free energy | X | ||||
| X | |||||
| Local context features | SA | X | X | X | |
| Flanking AU | X | X | X | ||
| Global context features | TA | X | |||
| Paring position | X | X | |||
| 3′UTR length | X | X | |||
| Sequence length | X | ||||
| Conservation | X | X | X | ||
| – | Others | X | X | ||
Positive and negative aspects of the target prediction tools analyzed.
| Tool | Positive aspects | Negative aspects |
|---|---|---|
| TargetScan | - Friendly user database | - Predictions are the similar for all members of a miRNA family |
| - Highest number of organism available | - Not possible to change parameter cutoffs | |
| miRanda-mirSVR | - Possible to change parameter cutoffs in source code only | - Database not so friendly |
| - mirSVR score not available in source code | ||
| Pita | - Possible to change parameter cutoffs | - Not shows interactive view of miRNA-target pairing |
| - Enable online predictions of users miRNA and 3′UTR | ||
| RNA22 | - Friendly user database | - Source code takes too long to run |
| - Allows predictions in multiple sources | ||
| - Possible to change parameter cutoffs | ||