| Literature DB >> 21441577 |
Daniel C Ellwanger1, Florian A Büttner, Hans-Werner Mewes, Volker Stümpflen.
Abstract
MOTIVATION: Pairing between the target sequence and the 6-8 nt long seed sequence of the miRNA presents the most important feature for miRNA target site prediction. Novel high-throughput technologies such as Argonaute HITS-CLIP afford meanwhile a detailed study of miRNA:mRNA duplices. These interaction maps enable a first discrimination between functional and non-functional target sites in a bulky fashion. Prediction algorithms apply different seed paradigms to identify miRNA target sites. Therefore, a quantitative assessment of miRNA target site prediction is of major interest.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21441577 PMCID: PMC3087955 DOI: 10.1093/bioinformatics/btr149
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Determined canonical seed types
| Functional | Non-functional | |||||
|---|---|---|---|---|---|---|
| Seed type | Frequency | % | Frequency | % | LOR | |
| 6 | 1793 | 24 | 20 746 | 32 | −0.12 | 1.20 |
| 6 | 1382 | 19 | 13 500 | 21 | −0.04 | 2.57 |
| 6 | 1755 | 24 | 17 954 | 28 | −0.06 | 2.26 |
| 7 | 760 | 10 | 5036 | 8 | 0.12 | 2.03 |
| 7 | 959 | 13 | 5250 | 8 | 0.21 | 1.34 |
| 8 | 693 | 9 | 2203 | 3 | 0.44 | 7.60E−132 |
aLog odds ratio based on sampling.
Fig. 1.Definition of seed types. The seed types were termed by the start position relative to the 5′-end of the miRNA and the length of the consecutive seed match. The defined set of canonical seed types can be surjectively projected to the seed type set of (Bartel, 2009). Equivalent definitions could be found for 6merγ, 6merβ and 7merβ. In the case of miRNAs having a seed sequence beginning with an uracile, 7merα complies with 7mer-A1 and 8merα is equal to 8mer. Otherwise 6merβ equates 7mer-A1 and 7mer β complies with 8mer. If the first position within the target sequence is not an adenine, 8merα equates 7mer-m8 and 7merα is equal to 6mer. Additionally, our set considered 6mer matches that are complementary to the first position of a miRNA seed (6merα). Common target site prediction tools focus on seeds of length seven and eight to increase precision.
Fig. 2.Accuracy evaluation. (A and C) The impact of each seed type on miRNA target site prediction was determined by means of recall and specificity. The effect of the (default) seed type selection is shown for several prediction algorithms. These values present respectively the minimum specificity and the maximum recall of the tools. The dashed line shows an average random prediction. (B and D) Removing non-conserved target sites increases the precision, but lowers the recall. Note that panels (C and D) do not reflect the ranking of predictions based on the algorithms' scoring schemes.
Fig. 3.Heatmap showing the seed type distribution for each miRNA. The colors affected by the row z-score indicate the bias of miRNAs to prefer targets holding a specific seed type. A red/blue coloration implies a higher/lower usage of a seed type compared to other miRNAs.
Fig. 4.Observed and expected fraction of conserved seed matches for each seed type illustrated for functional target sites.
Default miRNA seed type selection of prediction algorithms
| Seed type | ||||||
|---|---|---|---|---|---|---|
| Algorithm | 6 | 6 | 6 | 7 | 7 | 8 |
| PITA | ✓ | ✓ | ✓ | ✓ | ||
| TargetScan | ✓ | ✓ | ✓ | ✓ | ||
| PicTar | ✓ | ✓ | ✓ | |||
| EIMMO | ✓ | ✓ | ✓ | |||
| TargetSpy S. | ✓ | ✓ | ✓ | |||
| PACMIT | ✓ | ✓ | ||||
aConfigurable seed length, default seed types ensure high precision.
bIf miRNA seed sequence starts with an adenine, guanine, cytosine.
cIf miRNA seed sequence starts with an uracile.