| Literature DB >> 23685435 |
Eiru Kim1, Hanhae Kim, Insuk Lee.
Abstract
Revolutionary DNA sequencing technology has enabled affordable genome sequencing for numerous species. Thousands of species already have completely decoded genomes, and tens of thousands more are in progress. Naturally, parallel expansion of the functional parts list library is anticipated, yet genome-level understanding of function also requires maps of functional relationships, such as functional protein networks. Such networks have been constructed for many sequenced species including common model organisms. Nevertheless, the majority of species with sequenced genomes still have no protein network models available. Moreover, biologists might want to obtain protein networks for their species of interest on completion of the genome projects. Therefore, there is high demand for accessible means to automatically construct genome-scale protein networks based on sequence information from genome projects only. Here, we present a public web server, JiffyNet, specifically designed to instantly construct genome-scale protein networks based on associalogs (functional associations transferred from a template network by orthology) for a query species with only protein sequences provided. Assessment of the networks by JiffyNet demonstrated generally high predictive ability for pathway annotations. Furthermore, JiffyNet provides network visualization and analysis pages for wide variety of molecular concepts to facilitate network-guided hypothesis generation. JiffyNet is freely accessible at http://www.jiffynet.org.Entities:
Mesh:
Year: 2013 PMID: 23685435 PMCID: PMC3692116 DOI: 10.1093/nar/gkt419
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(a) Illustration of associalog, which is an inferred functional association between protein A′ and B′ in a query species. A′ and B′ are orthologs of template species protein A and B, respectively. (b) Schematic summary of network construction and analysis processes implemented in JiffyNet web server.
Template networks for the JiffyNet webserver
| Template network | Template species | No. of proteins (coverage of coding genome) | No. of functional associations |
|---|---|---|---|
| EcoliNet (draft version) | 4117 (99%) | 120 510 | |
| YeastNet (version 2) | 5483 (95%) | 102 803 | |
| WormNet (version 2) | 15 139 (75%) | 999 367 | |
| HumanNet (version 1) | 16 242 (87%) | 476 399 | |
| AraNet (version 1) | 19 647 (73%) | 1 062 222 | |
| RiceNet (version 1) | 18 377 (45%) | 588 221 |
Figure 2.Visualization and analysis of subnetworks for molecular concepts. The ‘Results and Analysis’ page provides two links, one for a list of subnetworks by GO terms and the other by KEGG pathway terms. The linked page for GO terms lists all available terms in multiple pages (the given example has 35 pages of GO term list). These terms can be filtered for each category of GO annotation (BP for biological process, CC for cellular component or MF for molecular function). One can use keywords to search for terms of interest. For example, the keyword p53 lists up to five GO terms containing the word. Clicking linked term ‘p53 binding’ opens a new page showing a subnetwork for the molecular concept of ‘p53 binding’. Each node shows both the template and query species protein names for a pair of orthologs. The user can specify different confidence levels for edges by using the slider bar for ‘LLS threshold.’ Using viewer tools, one can zoom in or out and move the network or individual nodes. The linked page also shows a list of subnetwork proteins ranked by sum of LLS scores.
Figure 3.Quality assessment of protein networks for (a) giant panda (A.melanoleuca), (b) soybean (G. max) and (c) rice (O. sativa). Graphs of KEGG pathway precision as a function of coding genome coverage (upper panels) and the AUC for KEGG pathways (lower panels) are shown. In the upper panel plots, each data point represents a bin of 1000 network links, sorted by likelihood score. AUC scores are summarized as box-and-whisker plots. We created networks using JiffyNet by selecting HumanNet and WormNet as template networks for panda (panda-Jiffy-HumanNet and panda-Jiffy-WormNet, respectively), AraNet and RiceNet for soybean (soybean-Jiffy-AraNet and soybean-Jiffy-RiceNet, respectively) and AraNet for rice (rice-Jiffy-AraNet). For comparison purposes, we show networks generated for the same species using the BIPS, which creates networks on the basis of protein–protein interactions between orthologs in other species. In addition, we used BIPS to define core networks by identifying protein pairs that share GO annotations (panda-BIPS-core, soybean-BIPS-core and rice-BIPS-core). For panda, HumanNet was a more effective template than WormNet and covered a larger portion of the panda coding genome (75%). For soybean, AraNet and RiceNet were similarly effective templates, and the resultant networks covered 35–40% of coding genome. For the giant panda network, median AUC scores were 0.803 and 0.681, respectively, using the HumanNet and WormNet templates. For soybean, AUC scores were 0.708 and 0.702 using the AraNet and RiceNet templates, respectively. An AUC of 0.5 is expected if results are due to chance alone. We also compared JiffyNet results for rice (rice-Jiffy-AraNet) to a high-quality protein network constructed by integrating 24 different types of datasets (RiceNet).