Literature DB >> 15980567

miRU: an automated plant miRNA target prediction server.

Yuanji Zhang1.   

Abstract

MicroRNAs (miRNAs) play important roles in gene expression regulation in animals and plants. Since plant miRNAs recognize their target mRNAs by near-perfect base pairing, computational sequence similarity search can be used to identify potential targets. A web-based integrated computing system, miRU, has been developed for plant miRNA target gene prediction in any plant, if a large number of sequences are available. Given a mature miRNA sequence from a plant species, the system thoroughly searches for potential complementary target sites with mismatches tolerable in miRNA-target recognition. True or false positives are estimated based on the number and type of mismatches in the target site, and on the evolutionary conservation of target complementarity in another genome which can be selected according to miRNA conservation. The output for predicted targets, ordered by mismatch scores, includes complementary sequences with mismatches highlighted in colors, original gene sequences and associated functional annotations. The miRU web server is available at http://bioinfo3.noble.org/miRU.htm.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15980567      PMCID: PMC1160144          DOI: 10.1093/nar/gki383

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

MicroRNAs (miRNAs) are endogenously encoded small RNAs that can regulate gene expressions by base-pairing to protein-coding mRNAs for degradation or translation repression. Numerous miRNAs have been identified from genomes of many animals and plants, such as fruit fly, nematode, zebrafish, chicken, mouse, human, Arabidopsis, rice and maize. miRNA genes are abundant in humans, estimated to account for ∼1% of the total predicted genes. In Arabidopsis, at least 43 distinct miRNA families consisting of 111 members have been reported and archived in ‘The miRNA Registry’ thus far () (1). Although the function of most miRNAs remains unknown, a number of miRNAs have been shown to play important roles in developmental timing, cell death, cell proliferation, hematopoiesis and patterning of the nervous system in animals, and stress responses, and leaf and flower development in plants (2–6). Finding regulatory mRNA targets is essential to understanding the biological functions of miRNAs. Different methods are needed to predict animal and plant miRNA targets. While miRNA–target duplex free energy may be important for animal miRNA target prediction (7,8), plant miRNA targets can be predicted by sequence similarity since plant miRNA seems to bind almost perfectly to its cognate mRNA (7,9). Computational tools have been developed to predict plant miRNA targets (9–11), but none is in the web server format. Rhoades et al. (9) used PatScan (12) to predict plant miRNA targets with ≤3 mismatches. Jones-Rhoades and Bartel (11) used their own unpublished programs, together with PatScan, and the prediction seems to be more comprehensive. Wang et al. (10) deployed Smith–Waterman algorithm in miRNA target prediction, but failed to detect all previously identified targets (10). Since most biology laboratories involved in plant miRNA research may not have necessary bioinformatic resources for target prediction, a publicly accessible web application for plant miRNA target prediction has been developed. The tool allows systematic search for miRNA complementary targets in any plant whose genome sequence or a large number of expressed sequence tags (ESTs) are available. Backed by an exhaustive search algorithm, the tool is able to find all potential targets with the given mismatches. False positives are reduced by limiting the number of mismatches and by ensuring the target complementarity conservation in another plant species (11).

INPUT TO THE SERVER

The server has a user-friendly and intuitive input interface, as shown in Figure 1. The user is required to enter a mature miRNA sequence in 5′→3′direction. Although miRNAs are usually 21–24 nt (4), the input sequence can be in the range of 19–28 nt in length to accommodate an siRNA input, as the tool can also be used to search for siRNA targets and off-targets. To predict target genes, the user has to specify an mRNA dataset for the intended organism. Currently, the system includes genome mRNAs or ESTs and other transcripts-assembled Gene Indices (13) for 28 plant species downloaded from The Institute for Genome Research (TIGR) at . With the above input information, a Perl script at the backend will then do an exhaustive sequence similarity search, using an algorithm modified from BLAST (14) (see Additional File 1).
Figure 1

Data input web interface for plant miRNA target prediction.

To reduce false positives in predicted targets, the user can limit the number of mismatches, which are classified into three types and are assigned different scores; the higher scores are for more detrimental mismatches for miRNA function: G:U wobble pairings (each assigned 0.5 scores), insertions/deletions (indels) (2.0) and all other non-canonical Watson–Crick pairings (1.0). The total score for an alignment is calculated based on 20 nt. When the query is longer than 20 nt, scores for all possible consecutive 20 nt subsequences are computed and the minimum score is output as the total score for the query-subject alignment. Since target complementarity to the miRNA 5′ end seems to be critical to the target site function (15–18), any mismatch other than G:U wobble in positions 2–7 at the 5′ end is further penalized 0.5 points in the score. Based on the observation that both miRNAs and their target sites are evolutionarily conserved across genomes (18–20), the conservation of target complementarity in another genome can be used to further reduce false positives in plant miRNA target prediction (11). Furthermore, such analysis will also provide useful information about conserved regulatory roles of homologous miRNAs in different species. To use this strategy in the server, the information of the homologous miRNA and the mRNA dataset of the second genome should be provided for the system to do another search. Then the system compares potential targets to find whether homologous genes are predicted to be targeted by the homologous miRNAs in both genomes. Genes are considered to be homologous if they share ≥1 Pfam domains (21). All mRNA datasets are preprocessed by aligning to Pfam-A seed domain sequences (Pfam 16.0, which contains 7677 families, available at ). For Arabidopsis and rice genome mRNA datasets, the corresponding protein datasets are used for functional domain identification using HMMER (22) with E-value ≤0.1 as the significance level. Since HMMER does not allow DNA–protein comparisons, all gene index datasets are searched against Pfam-A seed domain sequences using blastx program (14) with E-value cut-off of 10−5. TIGR's ‘Eukaryotic Gene Orthologs’ dataset (23) is also used for determining homology relationships in the Gene Index datasets. The search results are parsed and stored in a MySQL database to facilitate the comparisons of target conservation in any two genomes.

OUTPUT TO THE USER

The output report consists of three parts (Figure 2). The first part is a summary of search input parameters, including the query sequence, mismatches allowed and target dataset. The next section is a list of predicted complementary targets displayed in the order of mismatch scores. Information shown for each predicted target includes gene identifier, target site position, mismatch score, number of mismatches and target complementary sequence with mismatches highlighted in colors (green for G·U mismatches, purple for indels and red for all other mismatches). The target is indicated if its complementarity is conserved in another genome. Therefore, the target list includes conserved targets that are highly likely to be true targets. It also includes targets whose counterparts in the second genome are not found. Some of these targets may still be true targets since the dataset to be compared for most plants are ESTs sampled from the genomes and may miss the conserved targets. The last part of the output is the target gene sequences in FASTA format, which includes the definition line for the original functional annotation. The target site in the gene can easily be located as it is highlighted in colors (Figure 2).
Figure 2

Result page of plant miRNA target prediction.

To verify the tool, its prediction was compared with two published prediction results (6,11). The prediction of Arabidopsis miRNA targets by Jones-Rhoades and Bartel (11) seems to be highly reliable since more than half of the predicted targets were experimentally verified as true targets. In this work, Arabidopsis miRNAs conserved in rice, as listed in Supplementary Table S1 in Jones-Rhoades and Bartel (11), were used as queries for the tool to predict target genes and the result can be found in Additional File 2. All the reported potential target genes were successfully detected by this tool. Recently, Sunkar and Zhu (6) identified stress-regulated miRNAs from Arabidopsis. They also predicted the potential target genes for these miRNAs using the criteria modified from Rhoades et al. (9). The new algorithm detected all their predicted targets. Moreover, the result indicates that Sunkar and Zhu's prediction seems to be incomplete. For example, a total of 23 targets were predicted by Sunkar and Zhu for ten miRNAs identified in their experiment, while this server predicts 203 potential targets in total (see Additional File 3).

SUMMARY

The server aims at predicting plant miRNA targets with the highest sensitivity and selectivity by using a search algorithm which guarantees finding all homologous sequences within given mismatches, and by applying current knowledge about miRNA targets to minimize false positives. As a practical tool, it should aid biologists in plant miRNA research.

SUPPLEMENTARY MATERIAL

Supplementary Material is available at NAR Online.
  22 in total

Review 1.  MicroRNAs: genomics, biogenesis, mechanism, and function.

Authors:  David P Bartel
Journal:  Cell       Date:  2004-01-23       Impact factor: 41.582

2.  The microRNA Registry.

Authors:  Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 3.  Role of microRNAs in plant and animal development.

Authors:  James C Carrington; Victor Ambros
Journal:  Science       Date:  2003-07-18       Impact factor: 47.728

4.  The TIGR gene indices: reconstruction and representation of expressed gene sequences.

Authors:  J Quackenbush; F Liang; I Holt; G Pertea; J Upton
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

5.  Cross-referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA).

Authors:  Yuandan Lee; Razvan Sultana; Geo Pertea; Jennifer Cho; Svetlana Karamycheva; Jennifer Tsai; Babak Parvizi; Foo Cheung; Valentin Antonescu; Joseph White; Ingeborg Holt; Feng Liang; John Quackenbush
Journal:  Genome Res       Date:  2002-03       Impact factor: 9.043

6.  Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes.

Authors:  Eric Bonnet; Jan Wuyts; Pierre Rouzé; Yves Van de Peer
Journal:  Proc Natl Acad Sci U S A       Date:  2004-07-22       Impact factor: 11.205

7.  Computational identification of plant microRNAs and their targets, including a stress-induced miRNA.

Authors:  Matthew W Jones-Rhoades; David P Bartel
Journal:  Mol Cell       Date:  2004-06-18       Impact factor: 17.970

8.  microRNA-mediated repression of rolled leaf1 specifies maize leaf polarity.

Authors:  Michelle T Juarez; Jonathan S Kui; Julie Thomas; Bradley A Heller; Marja C P Timmermans
Journal:  Nature       Date:  2004-03-04       Impact factor: 49.962

9.  The Pfam protein families database.

Authors:  Alex Bateman; Lachlan Coin; Richard Durbin; Robert D Finn; Volker Hollich; Sam Griffiths-Jones; Ajay Khanna; Mhairi Marshall; Simon Moxon; Erik L L Sonnhammer; David J Studholme; Corin Yeats; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

10.  Identification of Drosophila MicroRNA targets.

Authors:  Alexander Stark; Julius Brennecke; Robert B Russell; Stephen M Cohen
Journal:  PLoS Biol       Date:  2003-10-13       Impact factor: 8.029

View more
  143 in total

1.  MicroRNA profiles and their control of male gametophyte development in rice.

Authors:  Hua Peng; Jun Chun; Tao-bo Ai; Yong-ao Tong; Rong Zhang; Ming-ming Zhao; Fang Chen; Sheng-hua Wang
Journal:  Plant Mol Biol       Date:  2012-03-09       Impact factor: 4.076

2.  Computational identification of microRNAs and their targets from the expressed sequence tags of horsegram (Macrotyloma uniflorum (Lam.) Verdc.).

Authors:  Jyoti Bhardwaj; Hasan Mohammad; Sudesh Kumar Yadav
Journal:  J Struct Funct Genomics       Date:  2010-10-27

3.  Bioinformatic prediction and experimental validation of a microRNA-directed tandem trans-acting siRNA cascade in Arabidopsis.

Authors:  Ho-Ming Chen; Yi-Hang Li; Shu-Hsing Wu
Journal:  Proc Natl Acad Sci U S A       Date:  2007-02-20       Impact factor: 11.205

Review 4.  Dicer-like (DCL) proteins in plants.

Authors:  Qingpo Liu; Ying Feng; Zhujun Zhu
Journal:  Funct Integr Genomics       Date:  2009-02-17       Impact factor: 3.410

5.  Construction of Parallel Analysis of RNA Ends (PARE) libraries for the study of cleaved miRNA targets and the RNA degradome.

Authors:  Marcelo A German; Shujun Luo; Gary Schroth; Blake C Meyers; Pamela J Green
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

6.  Genome-wide survey of rice microRNAs and microRNA-target pairs in the root of a novel auxin-resistant mutant.

Authors:  Yijun Meng; Fangliang Huang; Qingyun Shi; Junjie Cao; Dijun Chen; Jinwei Zhang; Jun Ni; Ping Wu; Ming Chen
Journal:  Planta       Date:  2009-08-05       Impact factor: 4.116

7.  Differential expression of miRNAs in response to salt stress in maize roots.

Authors:  Dong Ding; Lifang Zhang; Hang Wang; Zhijie Liu; Zuxin Zhang; Yonglian Zheng
Journal:  Ann Bot       Date:  2008-10-24       Impact factor: 4.357

8.  miR398 and miR408 are up-regulated in response to water deficit in Medicago truncatula.

Authors:  Inês Trindade; Cláudio Capitão; Tamas Dalmay; Manuel Pedro Fevereiro; Dulce Metelo Dos Santos
Journal:  Planta       Date:  2009-12-11       Impact factor: 4.116

9.  Submergence-responsive MicroRNAs are potentially involved in the regulation of morphological and metabolic adaptations in maize root cells.

Authors:  Zuxin Zhang; Liya Wei; Xilin Zou; Yongsheng Tao; Zhijie Liu; Yonglian Zheng
Journal:  Ann Bot       Date:  2008-07-31       Impact factor: 4.357

10.  Identification of soybean microRNAs and their targets.

Authors:  Baohong Zhang; Xiaoping Pan; Edmund J Stellwag
Journal:  Planta       Date:  2008-09-25       Impact factor: 4.116

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.