Literature DB >> 31504780

miRDB: an online database for prediction of functional microRNA targets.

Yuhao Chen1,2, Xiaowei Wang1.   

Abstract

MicroRNAs (miRNAs) are small noncoding RNAs that act as master regulators in many biological processes. miRNAs function mainly by downregulating the expression of their gene targets. Thus, accurate prediction of miRNA targets is critical for characterization of miRNA functions. To this end, we have developed an online database, miRDB, for miRNA target prediction and functional annotations. Recently, we have performed major updates for miRDB. Specifically, by employing an improved algorithm for miRNA target prediction, we now present updated transcriptome-wide target prediction data in miRDB, including 3.5 million predicted targets regulated by 7000 miRNAs in five species. Further, we have implemented the new prediction algorithm into a web server, allowing custom target prediction with user-provided sequences. Another new database feature is the prediction of cell-specific miRNA targets. miRDB now hosts the expression profiles of over 1000 cell lines and presents target prediction data that are tailored for specific cell models. At last, a new web query interface has been added to miRDB for prediction of miRNA functions by integrative analysis of target prediction and Gene Ontology data. All data in miRDB are freely accessible at http://mirdb.org.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31504780      PMCID: PMC6943051          DOI: 10.1093/nar/gkz757

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

MicroRNAs (miRNAs) are small noncoding RNAs that regulate many biological processes (1,2). About 2000 human miRNAs have been reported in miRBase (3). Computational and experimental analyses indicate that most known protein-coding genes are regulated by miRNAs at both post-transcriptional and translational levels (4–6). miRNAs function mainly by downregulating the expression of their gene targets. Thus, accurate prediction of miRNA targets is critical for characterizing miRNA functions. At present, reliable identification of miRNA targets is still a major challenge, and many researchers choose to use computational tools to predict candidate gene targets for further experimental validation. To facilitate the process of candidate target selection, we have previously developed an online database, miRDB, for miRNA target prediction and functional annotations (7,8). Here, we present major updates to miRDB, most noticeably the presentation of updated target prediction data based on an improved computational algorithm. Other new features include prediction of miRNA targets in specific cell models, and prediction of miRNA-regulated biological processes by integrative analysis of target prediction and Gene Ontology (GO) data. The web server interface of miRDB has been updated to present these new database features. All data in miRDB are freely accessible at http://mirdb.org.

DATABASE UPDATES

Presentation of updated target prediction data

We have recently developed an improved computational model for miRNA target prediction. Details of this prediction model have been described in our recent publication (9). The workflow of the model development process is presented in Figure 1. One unique aspect of the algorithm development process is the quality as well as the comprehensiveness of the training data. Specifically, we have performed a large-scale RNA-seq study to globally profile the impact on target expression by individual miRNAs. To our knowledge, our RNA-seq profiling dataset, consisting of 1.5 billion reads from 52 RNA samples, represents the largest of its kind for miRNA target analysis. By focusing on transcripts that are downregulated by miRNA overexpression, we were able to discover and further quantify miRNA targeting features that are characteristic of target downregulation. On the other hand, we also analyzed public CLIP-ligation data (10,11) to identify paired miRNA/target transcripts that reside in the same miRISC complex. In this way, we were able to identify features that are associated with miRNA target binding. Next, by integratively analyzing both miRNA binding and target downregulation data, we were able to identify significant targeting features that are common for both processes. At last, a support vector machine (SVM) model, MirTarget, was trained with the identified features for miRNA target prediction. Comparative analysis using independent datasets indicates that MirTarget has improved performance over other existing prediction algorithms (9).
Figure 1.

Overview of the updated miRNA target prediction algorithm, MirTarget. Training data were derived from both miRNA binding and target expression data. MirTarget was trained under an SVM machine learning framework.

Overview of the updated miRNA target prediction algorithm, MirTarget. Training data were derived from both miRNA binding and target expression data. MirTarget was trained under an SVM machine learning framework. With MirTarget, we performed transcriptome-wide miRNA target prediction for five species: human, mouse, rat, dog and chicken. Specifically, the miRNA sequences were downloaded from miRBase version 22 (3); target transcript sequences were retrieved from the NCBI RefSeq database (12) and further parsed with BioPerl to extract the 3’-UTR sequences. Unlike many existing algorithms, evolutionary conservation of the target binding site is not a required feature for miRNA target prediction with MirTarget. In this way, both conserved and nonconserved targets can be predicted by MirTarget. For each candidate target site, MirTarget generates a probability score as computed by the underlying SVM modeling tool. This score reflects statistical confidence of the prediction results. If a transcript contains multiple candidate target sites, individual site scores are combined to compute a final score for the entire transcript, as described in detail in (9). MirTarget prediction scores are in the range of 0–100, and candidate transcripts with scores ≥ 50 are presented as predicted miRNA targets in miRDB. In total, 3.5 million gene targets were predicted to be regulated by 7000 miRNAs across five species in the current version of miRDB (Version 6.0, Table 1). In comparison, 2.1 million gene targets and 6700 miRNAs were included in the previous version (Version 5.0). On average, there are 497 gene targets per miRNA across the five species, an increase of 58% from the previous version. The significant increase in gene target number is mainly a result of newly implemented MirTarget features such as integrative analysis of multiple miRNA seed types in a single prediction model and more comprehensive assessment of cross-species conservation of the seed binding sites. Specifically for humans, the number of predicted targets per miRNA is 606, which is significantly higher than other species. This likely reflects the relatively rich annotations of the human transcriptome as compared to other transcriptomes.
Table 1.

Summary statistics of miRDB target prediction data

Species Mature miRNAs Total gene targets Unique gene targets
Human26561 610 51029 161
Mouse1978986 41622 499
Rat764187 30312 612
Dog453170 43516 710
Chicken1235565 22021 577
Total 7086 3 519 884 102 559
Previous Version (Version 5.0)
Human2588947 94117 925
Mouse1912634 00918 639
Rat764179 53915 489
Dog453128 70313 150
Chicken992214 81612 911
Total 6709 2 105 008 78 114
Summary statistics of miRDB target prediction data miRDB presents a flexible web server interface for miRNA target retrieval. The default query form, the Target Search page, allows the users to retrieve target prediction data for one specific miRNA or gene target at a time. In addition, an advanced query form, the Target Mining page, enables the search for multiple miRNAs or gene targets at the same time. The Target Mining page also presents additional search filters to enable various combinations of search strategies based on user preference for miRNA and target selection.

Custom target prediction by implementing MirTarget into a web server

We performed a major update on the custom prediction function of miRDB by implementing the new MirTarget algorithm. miRDB allows the users to provide custom miRNA or gene target sequences for transcriptome-wide prediction of gene targets or miRNA regulators in one of the five species: human, mouse, rat, dog or chicken. The custom sequence length is in the range of 17–30 nt for miRNA and 100–30 000 nt for gene target. The users should first select the species and search type (miRNA or gene target), and then input their custom sequence. Then, the Perl script implementing the MirTarget algorithm takes the web form inputs and starts the target prediction process. Two precompiled sequence files are used to predict potential miRNA/target pairs: one contains the 3’-UTR sequences from all known genes in the five species and the other one contains the sequences of species-specific miRNAs, as collated from miRBase version 22. The target prediction process is as follows. First, the web server script collects the miRNA or candidate sequence from the web form. If the users input a custom miRNA sequence, the server script will import all 3’-UTR sequences from the selected species for target prediction; on the other hand, if the users input a candidate target sequence, all miRNA sequences from the selected species will be imported. Next, for every miRNA/candidate target pair, the server script scans for miRNA seed binding sites and generates targeting features for MirTarget prediction. The prediction data are presented as an annotation table for all miRNA/candidate target pairs, including target prediction scores and the miRNA/target sequences. The prediction results are sorted in descending order as ranked by the target score. Then, the web server script imports the sorted results for web presentation, including target rank, target prediction score, miRNA name, target gene symbol and description. Additionally, the users have the option to review the details of the prediction result for every miRNA/target pair, including miRNA sequence and target sequence with highlighted miRNA seed binding positions. Depending on the number of predicted targets for the input miRNA, the whole prediction process typically completes in about 30–60 s.

Target expression profiles in specific cell models

By default, miRDB presents miRNA target prediction data for all known genes in the genome. However, not all potential miRNA targets are functionally relevant in a given cell. Thus, researchers often need to perform target analysis in the context of specific cell models. To facilitate the selection of cell-specific miRNA targets, miRDB presents a Target Expression page, which enables the users to combine target prediction data with target expression profiles from over 1000 cell lines (Figure 2A). Specifically, we downloaded RNA-seq gene expression profiling data from two large-scale transcriptome studies (13,14) that were deposited in Expression Atlas (15). Combined together, these studies have profiled RNA expression in 1178 cell lines by RNA-seq analysis. Gene expression levels are represented as normalized RPKM read counts (Reads Per Kilobase of transcript per Million mapped reads). Based on the expression values, we defined four gene groups: high expression (RPKM > 20), moderate expression (RPKM 5–20), low expression (RPKM 1–5) and no detectable expression (RPKM < 1). On average, there are about 11 000 genes with detectable expression per cell line. Gene targets with high or moderate expression in a specific cell model are more likely to be functionally impacted by miRNA regulation. As shown in Figure 2B, the users may further limit target selection by defining a desired gene expression threshold. The expression level of each gene target is presented together with MirTarget prediction score. By integrating target prediction and expression data, the users can quickly identify cell-specific targets for further experimental validation.
Figure 2.

miRDB target expression analysis. miRDB hosts the expression profiles of over 1000 cell lines. (A) A screenshot for selection of specific cell models. (B) A screenshot for integrative presentation of both target prediction and expression data for the selected cell model.

miRDB target expression analysis. miRDB hosts the expression profiles of over 1000 cell lines. (A) A screenshot for selection of specific cell models. (B) A screenshot for integrative presentation of both target prediction and expression data for the selected cell model.

Prediction of miRNA functions by target ontology analysis

The function of a miRNA is defined by its gene targets. Thus, biological pathways regulated by miRNAs can be inferred by target analysis. However, as one miRNA can potentially regulate hundreds of gene targets, it is a challenge to reliably identify significant pathways impacted by miRNA regulation. One popular approach for miRNA functional prediction is to perform target enrichment analysis, i.e. identifying pathways or functional categories that are statistically enriched in miRNA targets. To this end, we have implemented a new web interface for target ontology analysis. As presented in the Target Ontology page, the users may first retrieve all predicted targets for a specific miRNA of interest, and then directly submit the target list for GO enrichment analysis (Figure 3). The GO enrichment analysis is performed by employing the PANTHER web server engine (16) using up-to-date GO terms (17). By providing a new web query interface, miRDB integrates both target prediction and GO enrichment analyses, and presents a streamlined pipeline for prediction of miRNA functions.
Figure 3.

A screenshot for miRDB target ontology analysis. miRNA target prediction data and GO data were integratively analyzed to predict miRNA functions.

A screenshot for miRDB target ontology analysis. miRNA target prediction data and GO data were integratively analyzed to predict miRNA functions. In summary, we have performed major updates on miRDB, including implementation of a new miRNA target prediction algorithm as well as presentation of new database features for prediction of miRNA functions. By combining miRNA target prediction data with other biological data such as cell-specific expression profiles or GO annotations, we expect these new miRDB features to be helpful for researchers to quickly identify relevant miRNA functions in specific experimental systems. In the future, we will continue to make improvements to the target prediction algorithm as well as integrate more heterogeneous types of data in miRDB for flexible analysis of miRNA functions in various experimental settings.

DATA AVAILABILITY

All data in miRDB are freely accessible at http://mirdb.org.
  17 in total

Review 1.  The functions of animal microRNAs.

Authors:  Victor Ambros
Journal:  Nature       Date:  2004-09-16       Impact factor: 49.962

Review 2.  How microRNAs control cell division, differentiation and death.

Authors:  Eric A Miska
Journal:  Curr Opin Genet Dev       Date:  2005-10       Impact factor: 5.578

3.  Widespread changes in protein synthesis induced by microRNAs.

Authors:  Matthias Selbach; Björn Schwanhäusser; Nadine Thierfelder; Zhuo Fang; Raya Khanin; Nikolaus Rajewsky
Journal:  Nature       Date:  2008-07-30       Impact factor: 49.962

4.  Unambiguous identification of miRNA:target site interactions by different types of ligation reactions.

Authors:  Stefanie Grosswendt; Andrei Filipchyk; Mark Manzano; Filippos Klironomos; Marcel Schilling; Margareta Herzog; Eva Gottwein; Nikolaus Rajewsky
Journal:  Mol Cell       Date:  2014-05-22       Impact factor: 17.970

5.  miRDB: a microRNA target prediction and functional annotation database with a wiki interface.

Authors:  Xiaowei Wang
Journal:  RNA       Date:  2008-04-21       Impact factor: 4.942

6.  A comprehensive transcriptional portrait of human cancer cell lines.

Authors:  Christiaan Klijn; Steffen Durinck; Eric W Stawiski; Peter M Haverty; Zhaoshi Jiang; Hanbin Liu; Jeremiah Degenhardt; Oleg Mayba; Florian Gnad; Jinfeng Liu; Gregoire Pau; Jens Reeder; Yi Cao; Kiran Mukhyala; Suresh K Selvaraj; Mamie Yu; Gregory J Zynda; Matthew J Brauer; Thomas D Wu; Robert C Gentleman; Gerard Manning; Robert L Yauch; Richard Bourgon; David Stokoe; Zora Modrusan; Richard M Neve; Frederic J de Sauvage; Jeffrey Settleman; Somasekar Seshagiri; Zemin Zhang
Journal:  Nat Biotechnol       Date:  2014-12-08       Impact factor: 54.908

7.  Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs.

Authors:  Lee P Lim; Nelson C Lau; Philip Garrett-Engele; Andrew Grimson; Janell M Schelter; John Castle; David P Bartel; Peter S Linsley; Jason M Johnson
Journal:  Nature       Date:  2005-01-30       Impact factor: 49.962

8.  Mapping the human miRNA interactome by CLASH reveals frequent noncanonical binding.

Authors:  Aleksandra Helwak; Grzegorz Kudla; Tatiana Dudnakova; David Tollervey
Journal:  Cell       Date:  2013-04-25       Impact factor: 41.582

9.  The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity.

Authors:  Jordi Barretina; Giordano Caponigro; Nicolas Stransky; Kavitha Venkatesan; Adam A Margolin; Sungjoon Kim; Christopher J Wilson; Joseph Lehár; Gregory V Kryukov; Dmitriy Sonkin; Anupama Reddy; Manway Liu; Lauren Murray; Michael F Berger; John E Monahan; Paula Morais; Jodi Meltzer; Adam Korejwa; Judit Jané-Valbuena; Felipa A Mapa; Joseph Thibault; Eva Bric-Furlong; Pichai Raman; Aaron Shipway; Ingo H Engels; Jill Cheng; Guoying K Yu; Jianjun Yu; Peter Aspesi; Melanie de Silva; Kalpana Jagtap; Michael D Jones; Li Wang; Charles Hatton; Emanuele Palescandolo; Supriya Gupta; Scott Mahan; Carrie Sougnez; Robert C Onofrio; Ted Liefeld; Laura MacConaill; Wendy Winckler; Michael Reich; Nanxin Li; Jill P Mesirov; Stacey B Gabriel; Gad Getz; Kristin Ardlie; Vivien Chan; Vic E Myer; Barbara L Weber; Jeff Porter; Markus Warmuth; Peter Finan; Jennifer L Harris; Matthew Meyerson; Todd R Golub; Michael P Morrissey; William R Sellers; Robert Schlegel; Levi A Garraway
Journal:  Nature       Date:  2012-03-28       Impact factor: 49.962

10.  miRDB: an online resource for microRNA target prediction and functional annotations.

Authors:  Nathan Wong; Xiaowei Wang
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 16.971

View more
  550 in total

Review 1.  Profilin: many facets of a small protein.

Authors:  Rhonda J Davey; Pierre Dj Moens
Journal:  Biophys Rev       Date:  2020-07-13

2.  Differential expression and bioinformatic analysis of circRNA in nonalcoholic steatohepatitis cirrhosis.

Authors:  Yuexin Fan; Yang Zheng; Jiahui Wang; Tiejian Zhao; Tianjian Liang
Journal:  Int J Clin Exp Pathol       Date:  2020-11-01

3.  Exploring the Potential Key IncRNAs with Endometriosis by Construction of a ceRNA Network.

Authors:  Chenglei Gu; Yuanguang Meng; Qingqing Meng; Wensheng Fan; Mingxia Ye; Qian Zhang; Nina Zhang; Lian Li
Journal:  Int J Gen Med       Date:  2021-08-05

4.  Identification of novel survival-related lncRNA-miRNA-mRNA competing endogenous RNA network associated with immune infiltration in colorectal cancer.

Authors:  Jianxin Li; Ting Han; Xin Wang; Yinchun Wang; Qingqiang Yang
Journal:  Am J Transl Res       Date:  2021-06-15       Impact factor: 4.060

5.  Bioinformatics Gene Analysis of Potential Biomarkers and Therapeutic Targets for Unstable Atherosclerotic Plaque-Related Stroke.

Authors:  Shaojiong Zhou; Shuo Liu; Xiaoqiang Liu; Weiduan Zhuang
Journal:  J Mol Neurosci       Date:  2020-11-06       Impact factor: 3.444

6.  Pleiotropic Locus 15q24.1 Reveals a Gender-Specific Association with Neovascular but Not Atrophic Age-Related Macular Degeneration (AMD).

Authors:  Christina Kiel; Tobias Strunz; Felix Grassmann; Bernhard H F Weber
Journal:  Cells       Date:  2020-10-08       Impact factor: 6.600

7.  Novel role of ASH1L histone methyltransferase in anaplastic thyroid carcinoma.

Authors:  Bin Xu; Tingting Qin; Jingcheng Yu; Thomas J Giordano; Maureen A Sartor; Ronald J Koenig
Journal:  J Biol Chem       Date:  2020-05-12       Impact factor: 5.157

Review 8.  Cholino-ncRNAs modulate sex-specific- and age-related acetylcholine signals.

Authors:  Nimrod Madrer; Hermona Soreq
Journal:  FEBS Lett       Date:  2020-05-04       Impact factor: 4.124

9.  The circRNA-miRNA-mRNA regulatory network in systemic lupus erythematosus.

Authors:  Junhui Zhang; Yuan Liu; Guixiu Shi
Journal:  Clin Rheumatol       Date:  2020-06-12       Impact factor: 2.980

10.  Low expression of miR-195 is associated with cell proliferation, glycolysis and poor survival in estrogen receptor (ER)-positive but not in triple negative breast cancer.

Authors:  Yoshihisa Tokumaru; Masanori Oshi; Ankit Patel; Eriko Katsuta; Li Yan; Fernando A Angarita; Subhamoy Dasgupta; Masayuki Nagahashi; Nobuhisa Matsuhashi; Manabu Futamura; Kazuhiro Yoshida; Kazuaki Takabe
Journal:  Am J Cancer Res       Date:  2021-06-15       Impact factor: 6.166

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.