| Literature DB >> 27207945 |
Jun Ding1, Xiaoman Li2, Haiyan Hu1.
Abstract
MOTIVATION: The identification of microRNA (miRNA) target sites is fundamentally important for studying gene regulation. There are dozens of computational methods available for miRNA target site prediction. Despite their existence, we still cannot reliably identify miRNA target sites, partially due to our limited understanding of the characteristics of miRNA target sites. The recently published CLASH (crosslinking ligation and sequencing of hybrids) data provide an unprecedented opportunity to study the characteristics of miRNA target sites and improve miRNA target site prediction methods.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27207945 PMCID: PMC5018371 DOI: 10.1093/bioinformatics/btw318
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Features selected by four different methods
Recall and precision of different methods on five testing datasets
| Lasso logistic | Randomized logistic | STEP-wise logistic | Random forest | TarPmiR | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Recall | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall | Precision | |
| T1 | 0.8549 | 0.7765 | 0.8539 | 0.7785 | 0.8559 | 0.7795 | 0.8740 | 0.8283 | 0.5514 | 0.1905 |
| T2 | 0.8736 | 0.7713 | 0.8746 | 0.7730 | 0.8751 | 0.7736 | 0.8921 | 0.8296 | 0.5227 | 0.1626 |
| T3 | 0.8315 | 0.7626 | 0.8319 | 0.7898 | 0.8320 | 0.7904 | 0.8686 | 0.8253 | 0.5303 | 0.1661 |
| T4 | 0.836 | 0.7871 | 0.8411 | 0.7903 | 0.838 | 0.7894 | 0.8776 | 0.8266 | 0.5507 | 0.1902 |
| T5 | 0.8856 | 0.7639 | 0.8878 | 0.7662 | 0.8895 | 0.7649 | 0.8989 | 0.8173 | 0.5583 | 0.1909 |
Comparison of four methods on independent datasets
| Dataset | # of miRNAs input | Performance measurement | TarPmiR | miRanda | TargetScan V2010 | miRmap | TargetScan V2015 |
|---|---|---|---|---|---|---|---|
| I | 60 | # of predictions | 240 605 | 246 311 | 219 304 | 504 447 | 215 885 |
| % of correct predictions | 11 904/16 041=74.2% | 7061/16 041=44.0% | 6248/16 041=39.0% | 7121/16 041 =44.4% | 7472/16 041=46.6% | ||
| Recall | 0.742 | 0.440 | 0.390 | 0.444 | 0.466 | ||
| Precision | 0.0495 | 0.0287 | 0.0285 | 0.014 | 0.0346 | ||
| 120 | # of predictions | 481 135 | 476 827 | 461 280 | 906 654 | 446 074 | |
| % of correct predictions | 13 846/16 041=86.3% | 9683/16 041=60.4% | 8969/16 041=55.9% | 10 342/16 041=64.5% | 10 614/16 041=66.2% | ||
| Recall | 0.863 | 0.604 | 0.559 | 0.645 | 0.662 | ||
| Precision | 0.0288 | 0.0203 | 0.0194 | 0.0114 | 0.0238 | ||
| II | 60 | # of predictions | 469 752 | 453 880 | 437 791 | 971 238 | 399 746 |
| % of correct predictions | 34 301/43 251 =79.3% | 20 378/43 251 =47.1% | 17 556/43 251 =40.6% | 20 543/43 251 =47.5% | 19 442/43 251=46.1% | ||
| Recall | 0.793 | 0.471 | 0.406 | 0.475 | 0.461 | ||
| Precision | 0.0730 | 0.0449 | 0.0401 | 0.0211 | 0.0486 | ||
| 120 | # of predictions | 961 112 | 902 611 | 922 373 | 1 952 258 | 832 842 | |
| % of correct predictions | 38 821/43 251= 89.8% | 23 762/43 251=54.9% | 24 578/43 251= 56.8% | 25 667/43 251= 59.3% | 27 980/43 251=64.7% | ||
| Recall | 0.898 | 0.549 | 0.568 | 0.593 | 0.647 | ||
| Precision | 0.0403 | 0.0263 | 0.0266 | 0.0131 | 0.0336 | ||
| III | 119 | # of predictions | 285 491 | 439 485 | 875 442 | 341 773 | 382 173 |
| % of correct predictions | 10 766/11 080=97.2% | 9069/11 080=81.8% | 10 084/11 080=91.0% | 7840/11 080=70.8% | 10 334/11 080=93.3% | ||
| Recall | 0.972 | 0.818 | 0.910 | 0.708 | 0.933 | ||
| Precision | 0.0377 | 0.0206 | 0.0115 | 0.0229 | 0.0270 | ||
| IV | 50 | # of predicted interactions | 184 842 | 172 256 | 141 717 | 173 378 | 149 142 |
| % of correct predictions | 31 779/60 818=52.3% | 25 326/60 818=41.6% | 19 873/60 818=32.7% | 19 785/60 818=32.5% | 23 757/60 818=39.1% | ||
| Recall | 0.523 | 0.416 | 0.327 | 0.325 | 0.391 | ||
| Precision | 0.172 | 0.147 | 0.140 | 0.114 | 0.159 | ||
| 100 | # of predicted interactions | 412 149 | 337 863 | 286 667 | 413 213 | 298 004 | |
| % of correct predictions | 52 955/100 608=52.6% | 41 722/100 608=41.5% | 32 649/100 608=32.5% | 33 412/100 608=33.2% | 37 616/100 608=37.4% | ||
| Recall | 0.526 | 0.415 | 0.325 | 0.332 | 0.374 | ||
| Precision | 0.128 | 0.123 | 0.114 | 0.081 | 0.126 |
Comparison of different methods on the CLASH dataset
| Method | TP | FN | FP | Recall TP/(TP+FN) | Precision TP/(TP+FP) | F1-score |
|---|---|---|---|---|---|---|
| TarPmiR | 4695 | 3819 | 19 950 | 0.551 | 0.191 | 0.284 |
| miRanda | 3852 | 4662 | 51 849 | 0.452 | 0.069 | 0.120 |
| TargetScan V2010 | 1164 | 7350 | 10 281 | 0.136 | 0.101 | 0.116 |
| TargetScan V2015 | 2368 | 6146 | 10 182 | 0.278 | 0.189 | 0.225 |
| Mirmap | 1821 | 6693 | 30 746 | 0.214 | 0.056 | 0.089 |