Literature DB >> 23716633

miRmap web: Comprehensive microRNA target prediction online.

Charles E Vejnar¹, Matthias Blum, Evgeny M Zdobnov.

Abstract

MicroRNAs (miRNAs) posttranscriptionally repress the expression of protein-coding genes. Based on the partial complementarity between miRNA and messenger RNA pairs with a mandatory so-called 'seed' sequence, many thousands of potential targets can be identified. Our open-source software library, miRmap, ranks these potential targets with a biologically meaningful criterion, the repression strength. MiRmap combines thermodynamic, evolutionary, probabilistic and sequence-based features, which cover features from TargetScan, PITA, PACMIT and miRanda. Our miRmap web application offers a user-friendly and feature-rich resource for browsing precomputed miRNA target predictions for model organisms, as well as for predicting and ranking targets for user-submitted sequences. MiRmap web integrates sorting, filtering and exporting of results from multiple queries, as well as providing programmatic access, and is available at http://mirmap.ezlab.org.

Entities: Chemical Gene Species

Mesh：

Substances：

Year: 2013 PMID： 23716633 PMCID： PMC3692044 DOI： 10.1093/nar/gkt430

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

MicroRNAs (miRNAs) are short (∼22 nt) noncoding RNAs that guide the RNA-induced silencing complex (RISC) to posttranscriptionally repress the expression of protein-coding genes by binding to targeted messenger RNAs (mRNAs) (1–3). While the detailed mechanism of this guidance is not yet resolved, binding between the mRNA and the miRNA from position 2–7 (or 8) from the 5′ end usually triggers RISC-mediated repression (4). In the miRNA–mRNA binding map based on Ago HITS-CLIP (HIgh-Throughput Sequencing of RNA isolated by CrossLinking ImmunoPrecipitation) (5), 73% of the studied target sites have a seed match. While searching for seed matches in 3′-UTR is a widely used feature for miRNA target prediction, it yields a very high number of potential targets. For example, human 3′-UTRs have 9.5 million potential targets defined as 7-mer seed matches of all human miRNAs, which means that on average a miRNA potentially targets ∼25% of the human genes. While this estimation is probably an upper bound, and as in a specific cell or tissue, not all miRNAs and all genes are expressed, thereby reducing the number of potential targets, prioritization of predictions is necessary to experimentally test the most probable targets first. We recently proposed a ranking function of miRNA-mediated repression strength parameterized from experimentally measured effects on mRNA or protein levels resulting in prediction of biologically meaningful potential targets (6). Implemented as an open-source Python software library, the miRmap library implements a comprehensive range of features using thermodynamic, probabilistic, evolutionary and sequence-based approaches to rank potential miRNA targets using information beyond the seed match. We compared (6) the power of individual approaches to predict the repression strength of miRNA–mRNA pairs, assessed using data from transcriptomics, immunopurification, proteomics and polysome fractionation high-throughput experiments, and combined the features with a linear model to predict the miRNA repression strength as the ‘miRmap score’. This score doubles the performance of the TargetScan context score measured as the proportion of explained variance of a transcriptomics data set (6). Moreover, our comprehensive set of features covers features included in many other tools (Figure 1). For example, PITA (8) ranks targets only with ‘ΔG total’, and PACMIT (9) uses a combination of ‘ΔG open’ and ‘P.over binomial’. As the miRmap score is more predictive than these individual features (6), miRmap can be used as a reference tool. Here, we present ‘miRmap web’, a web application based on a public REST (REpresentational State Transfer) service. With miRmap web, biologists can easily, through a user-friendly and feature-rich graphical interface, browse precomputed miRmap predictions from model organisms, and also predict and rank miRNA targets on their own sequences.

Figure 1.

Feature relative importance and performance of the miRmap model (as R2) compared with similar miRNA target predictions software. Similar features of each software to miRmap are marked with a black dot, with their performance also evaluated as R2. R2 is the proportion of variance explained by the model on transcriptomics data (6). TargetScan conservation score (Pct, probability of conserved targeting) (7) R2 was computed directly with TargetScan predictions, as Pct, which is based on the BLS feature (marked as a dark gray dot), is not included in miRmap.

IMPLEMENTATION

miRmap features and model

Thermodynamic features of miRmap, which we described previously (6), include ‘ΔG duplex’ and ‘ΔG binding’ that evaluate the energy of the miRNA–mRNA duplex, ‘ΔG open’ that evaluates the energy necessary to unfold the 3′-UTR and make the area accessible for RISC binding and ‘ΔG total’ [also named ‘ΔΔG’ by Kertesz et al. (8)] that sum ‘ΔG duplex’ and ‘ΔG open’. Since the publication of the miRmap library, we have added new features, ‘ΔG seed duplex’ and ‘ΔG seed binding’, that specifically evaluate the seed region (10,12). Our model (6) explains 12.7% (R2) of variance on the Grimson et al. data set (13), whereas the TargetScan context score (‘AU content’, ‘3′ pairing’ and ‘UTR position’) explains 7.49% with the same type of linear model (6). With the addition of the ‘ΔG seed duplex’ and ‘ΔG seed binding’ features, our model now explains 14.4% of the variance (Figure 1). It is worth nothing that TargetScan predictions also include an independent filter on conservation (7), named Pct. This score, based on the ‘Branch Length Score (BLS)’ feature, explains 4.88% of the variance. As described in our previous publication (6), fold-change distribution confirmed greater enrichment of stronger experimental repression with greater miRmap score (Supplementary Figure S1). This analysis was also applied specifically to 6-mer seeds and also showed increased performance (Supplementary Figure S2). The sequence-based features are, as described by the TargetScan context score (13), as follows: ‘AU content’ that is a weighted count of A and U nucleotides around the seed match, ‘UTR position’ that takes into account the enhanced repression for sites that are close to one end of the 3′-UTR and ‘3′ pairing’ that evaluates the contribution of the pairing on the opposite side to the miRNA seed. As only limited regions of 3′-UTRs have regulatory or structural roles, we introduced features in miRmap with probabilistic and evolutionary points of view. In the probabilistic approach, we modeled the 3′-UTR sequence composition with a Markov process (order 1) and determined the expected probability of finding at least n occurrences of the seed match, either with a binomial approximation (‘P.over binomial’) or with an exact solution (‘P.over exact’). The evolutionary approach assesses the conservation of the target sites compared with the rest of the 3′-UTR as measured by the ‘BLS’ and ‘PhyloP’ features. The ‘BLS’ evaluates the evolutionary time during which the seed match was present in a given set of species (14), while the ‘PhyloP’ feature detects negative selection pressure on the target sites within a statistical framework (11). MiRmap brings together common features of miRNA target prediction and extends these with novel features including ‘ΔG binding’, ‘ΔG seed binding’, ‘P.over exact’ and ‘phyloP’ to provide the first truly comprehensive open-source library for target predictions.

Precomputed predictions

As well as allowing predictions for user-submitted sequences, miRmap includes predictions for the annotated miRNAs [miRBase 19 (15)] and genes [Ensembl 69 (16)] of eight species: Human, Chimpanzee, Mouse, Rat, Cow, Chicken, Zebrafish and Opossum.

Web server implementation

As our service allows both interrogating a database and computing predictions, we developed a framework that ensures a reliable and responsive service. All computations on the server side are performed asynchronously: multiple users can search and compute predictions at the same time without impacting other users, with our web server computing capability as the only limit. This implementation was written using the Tornado framework (tornadoweb.org) and deployed using Nginx (nginx.org) reverse proxy.

Client implementation

MiRmap web can be used as a web application or as a REST service. The web application is a single web page for ‘click&play’ usage integrating the form for the user query and the dynamically updated results table. Users can also easily retrieve prediction data sets or predict targets in a more programmatic way using the REST service. We used the widely used ExtJS Javascript toolkit (sencha.com) to build our web application and obtain a homogeneous look and feel as well as flexible configurability. A detailed documentation of the REST service, as well as examples, is included in help section of miRmap Web site.

USAGE

Input

MiRmap web includes facilities for both browsing precomputed miRNA target predictions, and for online computing of miRNA targets on user-submitted sequences. The first step is to select a species (Figure 2). The user may then ‘Select’ or input the ‘Sequence’ of a miRNA and/or a protein-coding gene. It is important to note that for browsing precomputed predictions, users may select a miRNA or a gene, or both, with an auto-complete capability to facilitate rapid selections The gene selection and sequence input options may be combined such that the user may input a miRNA sequence and select a gene/transcript by name, or vice versa. When a gene name or identifier is entered, the canonical transcript of that gene is selected; it is also possible to select a specific transcript by directly entering the transcript identifier. When the form is valid, the user may then submit his/her query by clicking on the ‘Get targets’ button.

Figure 2.

miRmap web application page describing the integration of the user query, results table with filtering and sorting parts.

RESULTS

Results from user-submitted queries return feature values for each miRNA–mRNA pair by default (the ‘Pair’ view), where each record is expandable to access additional details about each predicted target site, or for each target site by selecting the ‘Site’ view. By default, only a selection of the complete set of all miRmap features is displayed, and intuitive options from the toolbar allow users to display all features or customize their feature selections. Additional toolbar options enable (i) switching between ‘Percentile’ and ‘Raw’ score formats, (ii) multicolumn ordered sorting and (iii) flexible filtering of the results table to suit the user’s requirements. Raw miRmap scores for each feature, e.g. ‘ΔG duplex’ in kcal/mol and ‘P.over exact’ as a probability, can be converted to percentiles to simplify and standardize a scale from strong to weak repression. This is achieved by ranking targets for each species from the weakest to the strongest predicted repression for each individual feature and assigning scores to percentiles from 0 to 100%, with 100 representing the strongest repression. Users therefore do not have to be familiar with the ranges of different feature values, nor their directions, i.e. whether larger or smaller values correspond to higher or lower repression. Results are sorted by default according to their miRmap scores, but users may define their own sorting functions on each selected feature through a simple ‘drag&drop’ ordering of each column in the results table. In addition, filters may be applied to limit results to only canonical transcripts for each gene as well as by setting minimum or maximum scores for each feature using sliding selectors. These customized results tables may then be easily exported as CSV (comma-separated values) formatted files and saved for viewing in common spreadsheet applications such as Excel, or as BED files for viewing in the UCSC (17) genome browser.

CONCLUSIONS

With this integrated web interface, featuring easy filtering and sorting options, miRmap web aims to provide biologists interested in a particular miRNA or gene the means to prioritize their research with confident ranking of miRNA targets using the miRmap model. In contrast to most other miRNA target prediction web interfaces, miRmap web covers a wide range of possible usage by providing both precomputed and online predictions. Updated 3′-UTR annotations and newly discovered miRNAs can therefore be easily computationally tested with a state-of-the-art analytical tool.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Figures 1–2.

17 in total

1. Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.

Authors: Alexander Stark; Michael F Lin; Pouya Kheradpour; Jakob S Pedersen; Leopold Parts; Joseph W Carlson; Madeline A Crosby; Matthew D Rasmussen; Sushmita Roy; Ameya N Deoras; J Graham Ruby; Julius Brennecke; Emily Hodges; Angie S Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V Han; Morgan L Maeder; Benjamin J Polansky; Bryanne E Robson; Stein Aerts; Jacques van Helden; Bassem Hassan; Donald G Gilbert; Deborah A Eastman; Michael Rice; Michael Weir; Matthew W Hahn; Yongkyu Park; Colin N Dewey; Lior Pachter; W James Kent; David Haussler; Eric C Lai; David P Bartel; Gregory J Hannon; Thomas C Kaufman; Michael B Eisen; Andrew G Clark; Douglas Smith; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal: Nature Date: 2007-11-08 Impact factor: 49.962

2. The role of site accessibility in microRNA target recognition.

Authors: Michael Kertesz; Nicola Iovino; Ulrich Unnerstall; Ulrike Gaul; Eran Segal
Journal: Nat Genet Date: 2007-09-23 Impact factor: 38.330

3. Relative contribution of sequence and structure features to the mRNA binding of Argonaute/EIF2C-miRNA complexes and the degradation of miRNA targets.

Authors: Jean Hausser; Markus Landthaler; Lukasz Jaskiewicz; Dimos Gaidatzis; Mihaela Zavolan
Journal: Genome Res Date: 2009-09-18 Impact factor: 9.043

4. Detection of nonneutral substitution rates on mammalian phylogenies.

Authors: Katherine S Pollard; Melissa J Hubisz; Kate R Rosenbloom; Adam Siepel
Journal: Genome Res Date: 2009-10-26 Impact factor: 9.043

Review 5. MicroRNA targeting in mammalian genomes: genes and mechanisms.

Authors: S A Muljo; C Kanellopoulou; L Aravind
Journal: Wiley Interdiscip Rev Syst Biol Med Date: 2010 Mar-Apr

6. Most mammalian mRNAs are conserved targets of microRNAs.

Authors: Robin C Friedman; Kyle Kai-How Farh; Christopher B Burge; David P Bartel
Journal: Genome Res Date: 2008-10-27 Impact factor: 9.043

Review 7. MicroRNAs: target recognition and regulatory functions.

Authors: David P Bartel
Journal: Cell Date: 2009-01-23 Impact factor: 41.582

8. Principles of microRNA-target recognition.

Authors: Julius Brennecke; Alexander Stark; Robert B Russell; Stephen M Cohen
Journal: PLoS Biol Date: 2005-03 Impact factor: 8.029

9. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps.

Authors: Sung Wook Chi; Julie B Zang; Aldo Mele; Robert B Darnell
Journal: Nature Date: 2009-06-17 Impact factor: 49.962

10. MiRmap: comprehensive prediction of microRNA target repression strength.

Authors: Charles E Vejnar; Evgeny M Zdobnov
Journal: Nucleic Acids Res Date: 2012-10-02 Impact factor: 16.971

52 in total

1. Expression quantitative trait loci (eQTLs) in microRNA genes are enriched for schizophrenia and bipolar disorder association signals.

Authors: V S Williamson; M Mamdani; G O McMichael; A H Kim; D Lee; S Bacanu; V I Vladimirov
Journal: Psychol Med Date: 2015-03-30 Impact factor: 7.723

2. Regulation of the Human Fc-Neonatal Receptor alpha-Chain Gene FCGRT by MicroRNA-3181.

Authors: Daniel C Ferguson; Javier G Blanco
Journal: Pharm Res Date: 2018-01-04 Impact factor: 4.200

3. Analysis of APOBEC3A/3B germline deletion polymorphism in breast, cervical and oral cancers from South India and its impact on miRNA regulation.

Authors: Sundaramoorthy Revathidevi; Mayakannan Manikandan; Arunagiri Kuha Deva Magendhra Rao; Vilvanathan Vinothkumar; Ganesan Arunkumar; Kottayasamy Seenivasagam Rajkumar; Rajendran Ramani; Ramamurthy Rajaraman; Chandrasekar Ajay; Arasambattu Kannan Munirajan
Journal: Tumour Biol Date: 2016-05-07

4. Prediction of potential molecular markers of bovine mastitis by meta-analysis of differentially expressed genes using combined p value and robust rank aggregation.

Authors: Anushri Umesh; Praveen Kumar Guttula; Mukesh Kumar Gupta
Journal: Trop Anim Health Prod Date: 2022-08-19 Impact factor: 1.893

5. A positive role of microRNA-15b on regulation of osteoblast differentiation.

Authors: S Vimalraj; Nicola C Partridge; N Selvamurugan
Journal: J Cell Physiol Date: 2014-09 Impact factor: 6.384

6. MicroRNA-203 suppresses proliferation in liver cancer associated with PIK3CA, p38 MAPK, c-Jun, and GSK3 signaling.

Authors: Annie Zhang; Jaganathan Lakshmanan; Amirreza Motameni; Brian G Harbrecht
Journal: Mol Cell Biochem Date: 2017-09-08 Impact factor: 3.396

7. The biological functions of target genes in pan-cancers and cell lines were predicted by miR-375 microarray data from GEO database and bioinformatics.

Authors: Jiang-Hui Zeng; Xu-Zhi Liang; Hui-Hua Lan; Xu Zhu; Xiu-Yun Liang
Journal: PLoS One Date: 2018-10-31 Impact factor: 3.240

8. Introduction to Bioinformatics Resources for Post-transcriptional Regulation of Gene Expression.

Authors: Eliana Destefanis; Erik Dassi
Journal: Methods Mol Biol Date: 2022

9. Long noncoding RNA IL6-AS1 is highly expressed in chronic obstructive pulmonary disease and is associated with interleukin 6 by targeting miR-149-5p and early B-cell factor 1.

Authors: Erkang Yi; Jiahuan Zhang; Mengning Zheng; Yi Zhang; Chunxiao Liang; Binwei Hao; Wei Hong; Biting Lin; Jinding Pu; Zhiwei Lin; Peiyu Huang; Bing Li; Yumin Zhou; Pixin Ran
Journal: Clin Transl Med Date: 2021-07

Review 10. Bioinformatics Accelerates the Major Tetrad: A Real Boost for the Pharmaceutical Industry.

Authors: Tapan Behl; Ishnoor Kaur; Aayush Sehgal; Sukhbir Singh; Saurabh Bhatia; Ahmed Al-Harrasi; Gokhan Zengin; Elena Emilia Babes; Ciprian Brisc; Manuela Stoicescu; Mirela Marioara Toma; Cristian Sava; Simona Gabriela Bungau
Journal: Int J Mol Sci Date: 2021-06-08 Impact factor: 5.923