| Literature DB >> 26424082 |
Zhi-Ping Liu1, Canglin Wu2, Hongyu Miao2, Hulin Wu3.
Abstract
Transcriptional and post-transcriptional regulation of gene expression is of fundamental importance to numerous biological processes. Nowadays, an increasing amount of gene regulatory relationships have been documented in various databases and literature. However, to more efficiently exploit such knowledge for biomedical research and applications, it is necessary to construct a genome-wide regulatory network database to integrate the information on gene regulatory relationships that are widely scattered in many different places. Therefore, in this work, we build a knowledge-based database, named 'RegNetwork', of gene regulatory networks for human and mouse by collecting and integrating the documented regulatory interactions among transcription factors (TFs), microRNAs (miRNAs) and target genes from 25 selected databases. Moreover, we also inferred and incorporated potential regulatory relationships based on transcription factor binding site (TFBS) motifs into RegNetwork. As a result, RegNetwork contains a comprehensive set of experimentally observed or predicted transcriptional and post-transcriptional regulatory relationships, and the database framework is flexibly designed for potential extensions to include gene regulatory networks for other organisms in the future. Based on RegNetwork, we characterized the statistical and topological properties of genome-wide regulatory networks for human and mouse, we also extracted and interpreted simple yet important network motifs that involve the interplays between TF-miRNA and their targets. In summary, RegNetwork provides an integrated resource on the prior information for gene regulatory relationships, and it enables us to further investigate context-specific transcriptional and post-transcriptional regulatory interactions based on domain-specific experimental data. Database URL: http://www.regnetworkweb.org.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26424082 PMCID: PMC4589691 DOI: 10.1093/database/bav095
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.The basic regulatory circuit involving TF, miRNA and target gene (A) and the schematic illustration of the mechanisms of transcriptional and post-transcriptional regulation of gene expression (B). In total, five types of regulatory relationships are considered among TF, miRNA and target gene.
The databases used to build the RegNetwork database by collecting knowledge on gene regulatory relationships in human and mouse
| Database | Description | Species | Website | Reference | Version/access date |
|---|---|---|---|---|---|
| BioGRID is an online interaction repository with data compiled through comprehensive curation efforts | Mouse | ( | Version 3.2.100 | ||
| Ensembl is to provide a centralized resource for geneticists, molecular biologists and other researchers studying the genomes of our own species and other vertebrates and model organisms | Human and mouse | ( | Release 71 (March 2013) | ||
| FANTOM | Functional Annotation Of Mammalian genome and is an international research consortium to assign functional annotations to the full-length complementary DNAs (cDNAs) | Human and mouse | ( | 5 March 2010 | |
| GenBank | A comprehensive database developed by NCBI, NIH, which contains publicly available nucleotide sequences for more than 250 00 formally described species | Human and mouse | ( | 14 August 2012 | |
| HPRD is a curated human protein-protein interaction database | Human | ( | Release 9 | ||
| IntAct is a database system of molecular interaction data. All interactions are derived from literature curation or direct user submissions | Mouse | ( | 16 October 2012 | ||
| JASPAR | An open-access database of annotated, matrix-based transcription factor binding site (TFBS) profiles for multicellular eukaryotes | Human and mouse | ( | 12 October 2009 | |
| KEGG is a widely used pathway database resource for understanding high-level linkage functions and utilities of biological system | Human and mouse | ( | 5 December 2012 | ||
| Liftover | A UCSC tool converts genome coordinates and genome annotation files between assemblies | Mouse | ( | 7 March 2012 | |
| MicroCosm | MicroCosm Targets (formerly miRBase Targets) is a web resource containing computationally predicted targets for microRNAs across many species | Human and mouse | ( | Version v5 | |
| DIANA-microT is a combined computational- experimental approach predicts mouse microRNA targets | Human and mouse | ( | Version v3.0 | ||
| miRanda is a miRNA target prediction method based on dynamic programming algorithm | Human and mouse | ( | Release August 2010 | ||
| miRBase database is a searchable database of published miRNA sequences and annotation | Human and mouse | ( | Release 18 | ||
| miRecords is a resource for animal miRNA-target interactions. The validated targets component is used, which is a large, high-quality database of experimentally validated miRNA targets | Human and mouse | ( | 25 November 2010 | ||
| miRTarBase is a database which curates experimentally validated microRNA-target interactions | Human and mouse | ( | Release 2.5 (October 2011) | ||
| PicTar is a computational method for identifying common targets of microRNAs | Human and mouse | ( | 26 March 2007 | ||
| RefSeq | RefSeq provides a non-redundant collection of sequences representing genomic data, transcripts and proteins | Human and mouse | ( | 19 May2013 | |
| STRING is a database of known and predicted protein interactions | Mouse | ( | Version 9.05 | ||
| Tarbase collectes available miRNA targets derived from all contemporary experimental techniques (gene specific and high-throughput) | Human and mouse | ( | Version 5.0 | ||
| TargetScan is an algorithm to predict biological targets of miRNAs by searching for the presence of conserved 8mer and 7mer sites that match the seed region of each miRNA | Human and mouse | ( | Release 5.2 | ||
| TRANSFAC | Transfac database is a manually curated database of eukaryotic transcription factors, their genomic binding sites (TFBS) and DNA binding profiles | Human and mouse | ( | TRANSFAC 7.0 | |
| TransmiR is a transcription factor-microRNA regulation database | Human and mouse | ( | Version 1.2 | ||
| Transcriptional Regulatory Element Database (TRED) is an integrated repository repository for both cis- and trans- regulatory elements in mammals. It contains the curated regulations between TF and target gene | Human and mouse | ( | 12 February 2012 | ||
| UniProt | UniProt is a catalog of information on proteins and it is a central repository of protein sequence and function | Human and mouse | ( | Release July2012 | |
| The University of California, Santa Cruz Genome Browser is a database of genomic sequence and annotation data for a wide variety of organisms | Human and mouse | ( | mm10, GRCm38 (December 2011) |
The ‘Species’ column shows whether the information in a database is available for human, mouse or both. Twenty-five databases are used to build the RegNetwork and they are ordered alphabetically here, among which 17 of these databases in italic contain the regulatory relationships, and the rest provide other necessary information (e.g. annotations) for the database construction.
Figure 2.The flowchart for RegNetwork construction.
Figure 3.Schematic illustration of pairing TF and genes by TFBSs. When the documented TFBS ‘MA0017’ is found in the promoter regions of ‘Gene2’ and ‘Gene 5’, TF NR2F1 is predicted to have a potential to regulate the two genes accordingly.
Figure 4.The web user interface of RegNetwork.
The basic statistics of the regulatory networks of human and mouse in RegNetwork
| Element | Description | Number | |
|---|---|---|---|
| Human | Mouse | ||
| Node | All nodes included in the regulatory network | 23 079 | 20 738 |
| Edge | All regulatory relationships included in the regulatory network | 369 277 | 323 636 |
| TF | The documented TFs included in the regulatory network | 1456 | 1328 |
| miRNA | The miRNAs included in the regulatory network | 1904 | 1290 |
| Gene | The target genes included in the regulatory network | 19 719 | 18 120 |
| TF–gene | The ‘TF–gene’ regulations included in the regulatory network | 149 841 | 94 876 |
| TF–TF | The ‘TF’–‘TF gene’ self-regulations included the regulatory network | 361 | 129 |
| TF–miRNA | The ‘TF–miRNA gene’ regulations included in the regulatory network | 21 744 | 25 574 |
| miRNA–gene | The ‘miRNA–target gene’ regulations included in the regulatory network | 171 477 | 176 512 |
| miRNA–TF | The ‘miRNA–TF gene’ regulations included in the regulatory network | 25 854 | 26 545 |
Selected measures in the established regulatory networks for human and mouse
| Parameter | Value | |
|---|---|---|
| Human | Mouse | |
| Clustering coefficient | 0.118 | 0.101 |
| Connected components | 3 | 1 |
| Network diameter | 8 | 8 |
| Shortest paths | 42 727 382 | 36 743 196 |
| Characteristic path length | 3.200 | 3.229 |
| Average number of neighbors | 31.391 | 30.548 |
The definitions of these measures are the same as in Refs. (43, 45).
Figure 5.The node degree distributions of the established regulatory networks in human (A) and mouse (B). A power law distribution in the form of is fitted in each subfigure, respectively. The results show that the node degrees satisfy the power-law distribution, i.e. , in human, , in mouse.
Figure 6.The regulatory relationships of a KEGG gene set for the human T cell receptor signaling pathway in RegNetwork. TF, miRNA and gene are in different colors and the transcriptional and post-transcriptional interplays are shown in red and blue, respectively.
The three-node network motifs ‘TF–miRNA–gene’ in human and mouse regulatory networks
The motifs are ranked by the absolute Z-Scores of network motifs in human. The higher the Z-Score, the more enriched is a motif (threshold is 2 as suggested in FANDOM (47)).