Literature DB >> 18957447

The database of experimentally supported targets: a functional update of TarBase.

Giorgos L Papadopoulos1, Martin Reczko, Victor A Simossis, Praveen Sethupathy, Artemis G Hatzigeorgiou.   

Abstract

TarBase5.0 is a database which houses a manually curated collection of experimentally supported microRNA (miRNA) targets in several animal species of central scientific interest, plants and viruses. MiRNAs are small non-coding RNA molecules that exhibit an inhibitory effect on gene expression, interfering with the stability and translational efficiency of the targeted mature messenger RNAs. Even though several computational programs exist to predict miRNA targets, there is a need for a comprehensive collection and description of miRNA targets with experimental support. Here we introduce a substantially extended version of this resource. The current version includes more than 1300 experimentally supported targets. Each target site is described by the miRNA that binds it, the gene in which it occurs, the nature of the experiments that were conducted to test it, the sufficiency of the site to induce translational repression and/or cleavage, and the paper from which all these data were extracted. Additionally, the database is functionally linked to several other relevant and useful databases such as Ensembl, Hugo, UCSC and SwissProt. The TarBase5.0 database can be queried or downloaded from http://microrna.gr/tarbase.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18957447      PMCID: PMC2686456          DOI: 10.1093/nar/gkn809

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Mature microRNA (miRNA) molecules are approximately 22-nucleotide-long single-stranded RNAs that generally repress the expression of protein coding genes. Specifically, they preferentially bind to 3′ untranslated regions (UTRs) of messenger RNAs (mRNAs) and interfere with their stability and translational efficiency (1,2). The first miRNAs and their target genes were identified via classical forward genetic techniques in 1993, but it was not until 2001 that many more miRNAs were discovered experimentally and found to be abundant and widespread (3–5). Since then there has been a dramatic growth in the number of annotated miRNAs (Figure 1).
Figure 1.

The growth of the human miRNA genes in mirBase database and the growth of the human experimentally determined miRNA target interactions in TarBase.

The growth of the human miRNA genes in mirBase database and the growth of the human experimentally determined miRNA target interactions in TarBase. A crucial aspect of the functional analysis of miRNAs is the annotation of their protein-coding targets. A number of computational algorithms have been developed for the prediction of such targets (6). Although these programs are very important to guide wet lab experiments, they still lack in sensitivity and specificity (7,8). In parallel, and as support for these programs, a number of experimental procedures have been developed to provide indirect or direct support for predicted miRNA–target interactions and results from a growing number of such experiments have been published (Figure 1). The need for a systematic documentation of such experimentally supported targets was covered by the first version of TarBase (9). In 2006, the database recorded over 550 entries with miRNA–target interactions in human, mouse, fruit fly, worm and zebrafish. Here we present a substantially updated and extended version of this database, TarBase5.0, documenting over 1300 entries. The new database now contains all of the information included in the previous version, plus: Additionally, both the underlying SQL database and the user interface have been extensively redesigned with several added external links such as a direct link to the paper's abstract on the PubMed site. specific cells lines (if any) used in the experiments, cell-type-specific expression of the gene product and its potential involvement in carcinogenesis, differential expression of miRNAs in specific tissues, developmental or pathological events that a specific miRNA is involved in and any annotated types of miRNA-related mis-regulation in those events, HGNC Symbols as defined by HUGO (in order to provide consistent gene naming). The TarBase5.0 database can be directly accessed from the http://microrna.gr/tarbase web page.

METHOD AND RESULTS

TarBase5.0 contains data extracted from a total of 203 scientific papers resulting in 1333 entries describing a regulatory interaction between a miRNA and a target 3′ UTR (summarized in Table 1).
Table 1.

A list of all TarBase5.0 entries

OrganismNumber of papersNumber of entriesMicroarray datapSILAC data
Homo sapiens110285328474
Mus musculus2810513
D. melanogaster2377
C. elegans1814
Plants2130
Danio rerio11
Rat22
Total203514341474
A list of all TarBase5.0 entries The TarBase5.0 data set contains miRNA targets that tested either positive (induces target gene repression) or negative (no influence on target gene expression). For each experiment with a positive outcome the target site is described by the miRNA that binds it, the gene in which it occurs, the nature of the experiments that were conducted to test it, the sufficiency of the site to induce translational repression and/or cleavage, and the paper from which all these data is extracted. Additionally, for each miRNA and protein-coding gene, the database contains links to several other relevant and useful databases such as Ensembl (10), Hugo (11), UCSC genome browser (12) and SwissProt (13). There are a number of direct and indirect experimental procedures that have been developed to test a possible miRNA–mRNA interaction. The entries in Tarbase5.0 are classified into four categories: TRUE or FALSE in the cases where an assay provides direct experimental evidence, or MICROARRAY and/or pSILAC in the cases that present only indirect evidence from high-throughput techniques to measure miRNA-mediated global transcriptomic or proteomic changes. All of these approaches make use of technology for miRNA knock down or overexpression. To overexpress a miRNA, expression constructs can be engineered using the mature miRNA, the precursor (hairpin) miRNA, or the pri-miRNA sequence for transfection into in vitro or in vivo transformed cells. Also, silencing of a specific miRNA can be accomplished by introducing chemically modified oligonucleotides that are perfectly complimentary to the mature miRNA (antagomirs) (14). These methods for modifying miRNA expression allow for several types of follow-up techniques to quantify and interpret differences in target gene expression. Below we provide a more detailed description of each of the four categories: TRUE or FALSE: The most commonly used method for providing direct experimental evidence is the reporter gene assay. In its simplest form, an expression vector containing a reporter gene [i.e. Luciferase or Green Fluorescent Protein (GFP)] is first modified by cloning the predicted target 3′UTR downstream of the reporter gene, and then transfected into a cell line of interest in the absence and presence of the cognate miRNA. Despite the general utility of this approach to assay for 3′UTR-mediated effects on reporter protein expression, it is not informative for the precise location of the miRNA response element (MRE) or number of miRNA target sites in the 3′UTR. Integration of the reporter gene assay with site directed mutagenesis of the predicted MRE (and, further, restoring the complementarity of the miRNA–MRE interaction by mutating the mature miRNA sequence) yields a much more specific and direct result. To measure effects on reporter mRNA levels, the most commonly applied technique is quantitative RT-PCR (qRT-PCR). Measuring effects on both protein and mRNA levels can help provide information about the mode of miRNA-mediated silencing: mRNA translational repression or immediate RISC-mediated mRNA cleavage and degradation. A miRNA–MRE interaction is reported as TRUE or FALSE based on the results of the reporter gene assay. MICROARRAY and/or pSILAC: These high-throughput approaches measure global changes in the transcriptome (15) or proteome (8,16) given the presence or absence of a miRNA. Despite their power for large-scale analysis, these techniques only provide indirect evidence about a miRNA's targets since it is not possible to distinguish between primary direct targets and secondary indirect targets. Other high-throughput methods like degradome sequencing (17,18) are also immensely useful but only in the scenarios where a miRNA induces RISC-mediated mRNA cleavage. In order to facilitate user interaction, the query function is divided into several functionally related subgroups. The initial screen of the TarBase5.0 user interface allows users to query based on miRNA, gene and organism. For more advanced queries, the user can utilize the extended query options. In this case, the search menus are arranged into four functionally related groups. The first group contains the fields with information about the miRNA–target interaction: the validity of the interaction (field ‘Support Type’, either true or false), the function of the interaction which can be either translational repression or mRNA cleavage (field ‘DataType’), the sufficiency of a single target site to exert the specific function (field ‘S_S_S’) and the number of miRNA response elements present in the specific UTR (field ‘MRE’). The second group contains the fields that refer to the experimental methods that led to the reported result. The field ‘Direct Support’ refers to experimental procedures that provide direct evidence regarding the miRNA–target interaction (i.e. reporter gene assays) while ‘Indirect Support’ refers to experimental procedures that provide more global, system-wide miRNA-mediated effects (i.e. microarrays). The third group corresponds to biological properties of the miRNA or target gene: biological functions (field ‘Protein Type’), specific expression profiles (field ‘miRNA Expression’) or the physiological processes in which this interaction is involved (field ‘Event or Pathology’). The fourth and final group contains some general query features such as the scientific paper (searchable by Author or PMID). The results are presented in a similar format as the query fields. By default, the results screen (Figure 2) shows only the repression type, the miRNA identifier, the target gene identified by the HGNC symbol (if it is a human gene), the common gene name, the Refseq isoform id (particularly relevant in cases of gene variants or SNP haplotypes), the affected biological processes and the paper containing the information presented. Users can opt to view more detailed information by clicking on the ‘+’ box so that the expanded results view is opened (Figure 2). The additional information is divided into three categories: miRNA information, gene information and experimental conditions.
Figure 2.

Example of a result screen for a TarBase query. The context-specific links to other resources are indicated by the blue arrows.

Example of a result screen for a TarBase query. The context-specific links to other resources are indicated by the blue arrows. The ‘miRNA information’ category contains the properties of the specific miRNA such as the miRNA's sequence [extracted from miRBase (19)], the number and sequences of the MREs, their locations within the gene's 3′UTR, and the affected tissues (extracted from the paper). The ‘Gene information’ category gathers mostly biological properties of the target gene like the protein type, Ensembl and SwissProt IDs and chromosome location, providing direct links to Ensembl, SwissProt and the UCSC browser respectively. Moreover, expression profiles and tumor involvement are also provided for human genes (information extracted from the Ensembl eGenetics database). Finally, the ‘Experimental conditions’ category provides the nature of the direct or indirect evidence for the miRNA–target gene interaction. The cell lines used to carry out the specific experiment are also presented in order to render the experimental conditions more complete and reproducible.

CONCLUSION

Even though several computational programs exist to predict miRNA targets, the necessity for a systematic collection and description of miRNA targets with experimental support led to the development of TarBase. The continuously expanding number of known and newly identified miRNAs and their targets, combined with their central role in biological systems, renders this field particularly dependent on centralized information that is accurate, up-to-date, comprehensive and easy to browse or download. In order to satisfy these requirements, we have made extensive updates and modifications and present the new version of the database, TarBase5.0.

AVAILABILITY

TarBase is freely available at http://microrna.gr/tarbase. The TarBase data files can be freely downloaded and used according to the GNU Public License. The relevant literature is reviewed for new entries and the database is updated quarterly.

FUNDING

Funding for open access charge: Aristeia Award from General Secretary Research and Technology, Greece. Conflict of interest statement. None declared.
  19 in total

1.  Identification of novel genes coding for small expressed RNAs.

Authors:  M Lagos-Quintana; R Rauhut; W Lendeckel; T Tuschl
Journal:  Science       Date:  2001-10-26       Impact factor: 47.728

Review 2.  Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs.

Authors:  David P Bartel; Chang-Zheng Chen
Journal:  Nat Rev Genet       Date:  2004-05       Impact factor: 53.242

3.  Specificity of microRNA target selection in translational repression.

Authors:  John G Doench; Phillip A Sharp
Journal:  Genes Dev       Date:  2004-03-10       Impact factor: 11.361

Review 4.  UCSC genome browser: deep support for molecular biomedical research.

Authors:  Mary E Mangan; Jennifer M Williams; Scott M Lathe; Donna Karolchik; Warren C Lathe
Journal:  Biotechnol Annu Rev       Date:  2008

5.  Widespread changes in protein synthesis induced by microRNAs.

Authors:  Matthias Selbach; Björn Schwanhäusser; Nadine Thierfelder; Zhuo Fang; Raya Khanin; Nikolaus Rajewsky
Journal:  Nature       Date:  2008-07-30       Impact factor: 49.962

6.  An abundant class of tiny RNAs with probable regulatory roles in Caenorhabditis elegans.

Authors:  N C Lau; L P Lim; E G Weinstein; D P Bartel
Journal:  Science       Date:  2001-10-26       Impact factor: 47.728

7.  An extensive class of small RNAs in Caenorhabditis elegans.

Authors:  R C Lee; V Ambros
Journal:  Science       Date:  2001-10-26       Impact factor: 47.728

8.  The impact of microRNAs on protein output.

Authors:  Daehyun Baek; Judit Villén; Chanseok Shin; Fernando D Camargo; Steven P Gygi; David P Bartel
Journal:  Nature       Date:  2008-07-30       Impact factor: 49.962

9.  Global identification of microRNA-target RNA pairs by parallel analysis of RNA ends.

Authors:  Marcelo A German; Manoj Pillay; Dong-Hoon Jeong; Amit Hetawal; Shujun Luo; Prakash Janardhanan; Vimal Kannan; Linda A Rymarquis; Kan Nobuta; Rana German; Emanuele De Paoli; Cheng Lu; Gary Schroth; Blake C Meyers; Pamela J Green
Journal:  Nat Biotechnol       Date:  2008-06-09       Impact factor: 54.908

10.  Endogenous siRNA and miRNA targets identified by sequencing of the Arabidopsis degradome.

Authors:  Charles Addo-Quaye; Tifani W Eshoo; David P Bartel; Michael J Axtell
Journal:  Curr Biol       Date:  2008-05-08       Impact factor: 10.834

View more
  200 in total

1.  Computational methods for the identification of microRNA targets.

Authors:  Yang Dai; Xiaofeng Zhou
Journal:  Open Access Bioinformatics       Date:  2010-05-01

2.  Integration of MicroRNA databases to study MicroRNAs associated with multiple sclerosis.

Authors:  Charlotte Angerstein; Michael Hecker; Brigitte Katrin Paap; Dirk Koczan; Madhan Thamilarasan; Hans-Jürgen Thiesen; Uwe Klaus Zettl
Journal:  Mol Neurobiol       Date:  2012-05-02       Impact factor: 5.590

Review 3.  MicroRNA in ischemic stroke etiology and pathology.

Authors:  Cameron Rink; Savita Khanna
Journal:  Physiol Genomics       Date:  2010-09-14       Impact factor: 3.107

4.  Identification of microRNAs associated with hyperthermia-induced cellular stress response.

Authors:  Gerald J Wilmink; Caleb L Roth; Bennett L Ibey; Norma Ketchum; Joshua Bernhard; Cesario Z Cerna; William P Roach
Journal:  Cell Stress Chaperones       Date:  2010-03-30       Impact factor: 3.667

5.  MicroRNA, mRNA, and protein expression link development and aging in human and macaque brain.

Authors:  Mehmet Somel; Song Guo; Ning Fu; Zheng Yan; Hai Yang Hu; Ying Xu; Yuan Yuan; Zhibin Ning; Yuhui Hu; Corinna Menzel; Hao Hu; Michael Lachmann; Rong Zeng; Wei Chen; Philipp Khaitovich
Journal:  Genome Res       Date:  2010-07-20       Impact factor: 9.043

6.  Computational functional genomics based analysis of pain-relevant micro-RNAs.

Authors:  Jörn Lötsch; Ellen Niederberger; Alfred Ultsch
Journal:  Hum Genet       Date:  2015-09-18       Impact factor: 4.132

7.  AccessFold: predicting RNA-RNA interactions with consideration for competing self-structure.

Authors:  Laura DiChiacchio; Michael F Sloma; David H Mathews
Journal:  Bioinformatics       Date:  2015-11-20       Impact factor: 6.937

8.  Prenatal arsenic exposure and the epigenome: altered microRNAs associated with innate and adaptive immune signaling in newborn cord blood.

Authors:  Julia E Rager; Kathryn A Bailey; Lisa Smeester; Sloane K Miller; Joel S Parker; Jessica E Laine; Zuzana Drobná; Jenna Currier; Christelle Douillet; Andrew F Olshan; Marisela Rubio-Andrade; Miroslav Stýblo; Gonzalo García-Vargas; Rebecca C Fry
Journal:  Environ Mol Mutagen       Date:  2013-12-10       Impact factor: 3.216

9.  Post-transcriptional regulation of human breast cancer cell proteome by unliganded estrogen receptor β via microRNAs.

Authors:  Giovanni Nassa; Roberta Tarallo; Giorgio Giurato; Maria Rosaria De Filippo; Maria Ravo; Francesca Rizzo; Claudia Stellato; Concetta Ambrosino; Marc Baumann; Niina Lietzèn; Tuula A Nyman; Alessandro Weisz
Journal:  Mol Cell Proteomics       Date:  2014-02-13       Impact factor: 5.911

Review 10.  Integrating omics technologies to study pulmonary physiology and pathology at the systems level.

Authors:  Ravi Ramesh Pathak; Vrushank Davé
Journal:  Cell Physiol Biochem       Date:  2014-04-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.