| Literature DB >> 21602264 |
Christopher L Plaisier1, J Christopher Bare, Nitin S Baliga.
Abstract
Transcriptome profiling studies have produced staggering numbers of gene co-expression signatures for a variety of biological systems. A significant fraction of these signatures will be partially or fully explained by miRNA-mediated targeted transcript degradation. miRvestigator takes as input lists of co-expressed genes from Caenorhabditis elegans, Drosophila melanogaster, G. gallus, Homo sapiens, Mus musculus or Rattus norvegicus and identifies the specific miRNAs that are likely to bind to 3' un-translated region (UTR) sequences to mediate the observed co-regulation. The novelty of our approach is the miRvestigator hidden Markov model (HMM) algorithm which systematically computes a similarity P-value for each unique miRNA seed sequence from the miRNA database miRBase to an overrepresented sequence motif identified within the 3'-UTR of the query genes. We have made this miRNA discovery tool accessible to the community by integrating our HMM algorithm with a proven algorithm for de novo discovery of miRNA seed sequences and wrapping these algorithms into a user-friendly interface. Additionally, the miRvestigator web server also produces a list of putative miRNA binding sites within 3'-UTRs of the query transcripts to facilitate the design of validation experiments. The miRvestigator is freely available at http://mirvestigator.systemsbiology.net.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21602264 PMCID: PMC3125776 DOI: 10.1093/nar/gkr374
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.This implementation diagram describes the miRvestigator web server currently residing on the Amazon Web Services Elastic Compute Cloud (EC2) as an Extra Large Instance. The miRvestigator is organized into two levels, a web tier for user interaction and the backend where the bulk of computation occurs. Results are stored in a MySQL database which also contains the 3′-UTR sequence and miRNA seed sequences for each of the six species. The 3′-UTR sequences are generated by integrating information from the UCSC genome browser and NCBI Entrez gene database. The miRNA seed sequences are retrieved from miRBase.
Figure 2.miRvestigator results for an in vivo study where 372 genes were up-regulated when miR-122 was inhibited by antagomir in mice. (A) Summary of the results for the run. The table contains a motif logo plot, the top miRNA(s) complementary to the motif, the base pairing for each miRNA seed to the motif, the Viterbi P-Value of the significance of the complementarity between the motif and the miRNA seed, and the percent of the input sequences with a predicted miRNA site. The summary table contains an entry for each motif size (6 or 8 bp). (B) Table of top miRNA matches sorted by complementarity to the identified motif. The miRNA name(s) are listed for each unique miRNA seed sequence (each miRNA name is a link to the miRBase entry), the unique miRNA seed, the seed model, the complementary base pairing of the miRNA seed sequence to the motif and the Viterbi P-Value for the significance of the complementarity. (C) Table of predicted miRNA-binding sites from Weeder. The official gene symbols or Entrez gene identifiers are listed for each predicted binding site (which is a link to NCBI Entrez gene database entry), the unmapped identifier, the sequence of the site, the location of the start of the site relative to the stop codon, the similarity of the site to the identified motif (100% = perfect complementarity, 95% = 1-bp difference, etc.) and the duplexing free energy for the reverse complement of the motif consensus to the predicted target site.