| Literature DB >> 30670076 |
Weijun Liu1,2, Xiaowei Wang3.
Abstract
We perform a large-scale RNA sequencing study to experimentally identify genes that are downregulated by 25 miRNAs. This RNA-seq dataset is combined with public miRNA target binding data to systematically identify miRNA targeting features that are characteristic of both miRNA binding and target downregulation. By integrating these common features in a machine learning framework, we develop and validate an improved computational model for genome-wide miRNA target prediction. All prediction data can be accessed at miRDB ( http://mirdb.org ).Entities:
Keywords: CLIP-seq; MicroRNA; RNA-seq; Target prediction
Mesh:
Substances:
Year: 2019 PMID: 30670076 PMCID: PMC6341724 DOI: 10.1186/s13059-019-1629-z
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Twenty-five miRNAs analyzed in the RNA-seq experiments
| miRNA name | miRNA sequence | Identified targets |
|---|---|---|
| hsa-let-7c-5p | UGAGGUAGUAGGUUGUAUGGUU | 31 |
| hsa-miR-107 | AGCAGCAUUGUACAGGGCUAUCA | 35 |
| hsa-miR-10a-5p | UACCCUGUAGAUCCGAAUUUGUG | 32 |
| hsa-miR-124-3p | UAAGGCACGCGGUGAAUGCC | 151 |
| hsa-miR-126-3p | UCGUACCGUGAGUAAUAAUGCG | 11 |
| hsa-miR-126-5p | CAUUAUUACUUUUGGUACGCG | 48 |
| hsa-miR-133b | UUUGGUCCCCUUCAACCAGCUA | 108 |
| hsa-miR-142-3p | UGUAGUGUUUCCUACUUUAUGGA | 108 |
| hsa-miR-145-5p | GUCCAGUUUUCCCAGGAAUCCCU | 82 |
| hsa-miR-146a-5p | UGAGAACUGAAUUCCAUGGGUU | 42 |
| hsa-miR-155-5p | UUAAUGCUAAUCGUGAUAGGGGU | 154 |
| hsa-miR-15a-5p | UAGCAGCACAUAAUGGUUUGUG | 108 |
| hsa-miR-16-5p | UAGCAGCACGUAAAUAUUGGCG | 122 |
| hsa-miR-17-5p | CAAAGUGCUUACAGUGCAGGUAG | 74 |
| hsa-miR-193b-3p | AACUGGCCCUCAAAGUCCCGCU | 102 |
| hsa-miR-200a-3p | UAACACUGUCUGGUAACGAUGU | 35 |
| hsa-miR-200b-3p | UAAUACUGCCUGGUAAUGAUGA | 126 |
| hsa-miR-200c-3p | UAAUACUGCCGGGUAAUGAUGGA | 93 |
| hsa-miR-206 | UGGAAUGUAAGGAAGUGUGUGG | 206 |
| hsa-miR-210-3p | CUGUGCGUGUGACAGCGGCUGA | 43 |
| hsa-miR-21-5p | UAGCUUAUCAGACUGAUGUUGA | 11 |
| hsa-miR-31-5p | AGGCAAGAUGCUGGCAUAGCU | 85 |
| hsa-miR-34a-5p | UGGCAGUGUCUUAGCUGGUUGU | 155 |
| hsa-miR-9-3p | AUAAAGCUAGAUAACCGAAAGU | 182 |
| hsa-miR-9-5p | UCUUUGGUUAUCUAGCUGUAUGA | 106 |
Enrichment of miRNA seed match in the target sites
| Seed type | Matching positions in miRNA | Downregulated targets | Non-targets | Enrichment ratio |
|---|---|---|---|---|
| Seed6 | pos 2–7 | 0.86 | 0.36 | 2.40 |
| Seed7a | pos 1–7 | 0.46 | 0.13 | 3.45 |
| Seed7b | pos 2–8 | 0.62 | 0.15 | 4.18 |
| Seed7A1 | pos 2–7 + A at target pos 1 | 0.52 | 0.13 | 4.10 |
| Seed8 | pos 1–8 | 0.26 | 0.05 | 5.48 |
| Seed8A1 | pos 2–8 + A at target pos1 | 0.30 | 0.04 | 6.83 |
| Seed7b_not_U | Exclude miRNAs with 5′-U | 0.60 | 0.15 | 3.97 |
| Seed8_not_U | Exclude miRNAs with 5′-U | 0.19 | 0.06 | 3.32 |
| Seed8A1_not_U | Exclude miRNAs with 5′-U | 0.34 | 0.05 | 6.95 |
Fig. 1The impact of miRNA seed types on target downregulation. Six seed types were evaluated (see Table 2 for seed definitions). a Percentage of downregulated genes containing individual seed types in relation to gene expression changes. All 25 miRNAs were included in the analysis. b Analysis of a subset of 8 miRNAs that do not contain 5′-U
Summary of top-ranking miRNA targeting features identified by RFE analysis
| Feature name | RFE rank | Targets | Non-targets | |
|---|---|---|---|---|
| Seed 8A1, conserved | 1 | 0.184 | 0.018 | 2.8E−245 |
| Seed7b, low phyloP score | 2 | 0.273 | 0.445 | 3.2E−84 |
| GC content of target site | 3 | 1.554 | 1.901 | 4.9E−117 |
| UTR length (log2) | 4 | 10.960 | 11.430 | 1.5E−114 |
| Seed7A1, non-conserved | 5 | 0.142 | 0.341 | 9.5E−134 |
| Seed7A1, low phyloP score | 6 | 0.137 | 0.339 | 1.3E−138 |
| AG count | 7 | 0.517 | 0.774 | 2.6E−73 |
| Seed8A1, low phyloP score | 8 | 0.200 | 0.126 | 1.5E−29 |
| Pentamer motif match | 9 | 0.052 | 0.022 | 2.2E−19 |
| Free energy of seed binding (log2) | 10 | − 2.583 | − 2.596 | 2.6E−01 |
| Distance to UTR end (log2) | 11 | 8.403 | 9.125 | 4.7E−126 |
| Seed8A1, moderate phyloP score | 12 | 0.047 | 0.006 | 7.5E−53 |
| CA count | 13 | 0.758 | 0.743 | 2.7E−01 |
| Seed7b, conserved | 14 | 0.124 | 0.048 | 8.7E−55 |
| Seed8A1, high phyloP score | 15 | 0.146 | 0.009 | 7.3E−218 |
| Seed7A1, high phyloP score | 16 | 0.036 | 0.014 | 1.0E−15 |
| Seed7b, high phyloP score | 17 | 0.093 | 0.022 | 8.7E−74 |
| CT count | 18 | 0.893 | 0.829 | 3.5E−05 |
| CG count | 19 | 0.106 | 0.128 | 6.4E−04 |
| TA count | 20 | 0.871 | 0.655 | 2.1E−45 |
Fig. 2Comparison of miRNA target prediction algorithms using the HITS-CLIP dataset. MirTarget and four other target prediction algorithms were included in the analysis. a Receiver operating characteristic (ROC) curve analysis to evaluate the rate of false positive prediction in relation to the rate of true positive prediction. b Precision-recall (PR) curve analysis to evaluate prediction precision in relation to the recall rate
Fig. 3Comparison of target prediction algorithms using microarray data. Microarray profiling data were analyzed to identify target upregulation resulting from concurrent inhibition of 25 miRNAs. a Correlation of target upregulation and target prediction scores computed by 5 individual algorithms, as measured by Pearson correlation coefficient. b Average level of expression upregulation for predicted targets. For each algorithm, 100 top-scoring predicted targets per miRNA on average were included in the analysis