| Literature DB >> 24271392 |
Serdar Turkarslan1, Elisabeth J Wurtmann, Wei-Ju Wu, Ning Jiang, J Christopher Bare, Karen Foley, David J Reiss, Pavel Novichkov, Nitin S Baliga.
Abstract
The ease of generating high-throughput data has enabled investigations into organismal complexity at the systems level through the inference of networks of interactions among the various cellular components (genes, RNAs, proteins and metabolites). The wider scientific community, however, currently has limited access to tools for network inference, visualization and analysis because these tasks often require advanced computational knowledge and expensive computing resources. We have designed the network portal (http://networks.systemsbiology.net) to serve as a modular database for the integration of user uploaded and public data, with inference algorithms and tools for the storage, visualization and analysis of biological networks. The portal is fully integrated into the Gaggle framework to seamlessly exchange data with desktop and web applications and to allow the user to create, save and modify workspaces, and it includes social networking capabilities for collaborative projects. While the current release of the database contains networks for 13 prokaryotic organisms from diverse phylogenetic clades (4678 co-regulated gene modules, 3466 regulators and 9291 cis-regulatory motifs), it will be rapidly populated with prokaryotic and eukaryotic organisms as relevant data become available in public repositories and through user input. The modular architecture, simple data formats and open API support community development of the portal.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24271392 PMCID: PMC3964938 DOI: 10.1093/nar/gkt1190
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The network portal Framework. The network portal currently implements the Python–cMonkey algorithm for network inference. Publicly available gene-expression data and genomic information is collected from various databases along with functional associations from EMBL STRING. Conditionally co-regulated clusters of genes (modules) and motifs discovered by cMonkey are stored in the database. The most probable influences on these modules are identified by Inferelator, using a TF list collected from MicrobesOnline and JCVI-CMR. A Django-based Web interface dynamically creates module-centered views for Network, Functions, Genes, Regulators and Motifs. Further investigations of the networks can be performed by using interoperability and automation frameworks provided by Gaggle and Workflow, respectively.
Organisms currently in the network portal
| Organism | Domain: phylum | Number of genes | Number of gene-expression comparisons | Gene-expression data source | Number of regulators |
|---|---|---|---|---|---|
| Archaea: Euryarchaeota | 2701 | 1661 | ( | 125 | |
| Archaea: Euryarchaeota | 1863 | 58 | ( | 57 | |
| Bacteria: Bacteroidetes | 4902 | 324 | MicrobesOnline | 263 | |
| Bacteria: Firmicutes | 3995 | 111 | MicrobesOnline | 259 | |
| Bacteria: Firmicutes | 5501 | 151 | MicrobesOnline | 376 | |
| Bacteria: Firmicutes | 4313 | 138 | MicrobesOnline | 319 | |
| Bacteria: Cyanobacteria | 2717 | 129 | GEO | 90 | |
| Bacteria: Proteobacteria | 4341 | 165 | MicrobesOnline | 231 | |
| Bacteria: Proteobacteria | 5646 | 635 | MicrobesOnline | 475 | |
| Bacteria: Proteobacteria | 4497 | 868 | ( | 310 | |
| Bacteria: Proteobacteria | 1711 | 114 | MicrobesOnline | 39 | |
| Bacteria: Proteobacteria | 3519 | 87 | MicrobesOnline | 156 | |
| Bacteria: Proteobacteria | 3661 | 383 | MicrobesOnline | 128 |
Figure 2.An example module page. (A) The landing page for each module presents a summary view of the module, including an interactive plot of gene-expression profiles across conditions, motif locations upstream of the member genes and summary statistics. Tabs located on top of the page provide access to other visualization tools. (B) The motif table shows motif logos for de novo identified upstream regulatory motifs and E-value statistics. For selected organisms, a link to analyze the motifs using RegPredict is provided. (C) Interactive network visualization is created by using Cytoscape Web. An edge connects the module and each of its gene members, motifs and regulators. Clicking on a node opens up overlay window with detailed information.