| Literature DB >> 24813450 |
Ara Cho1, Junha Shin1, Sohyun Hwang2, Chanyoung Kim1, Hongseok Shim1, Hyojin Kim1, Hanhae Kim1, Insuk Lee3.
Abstract
High-throughput experimental technologies gradually shift the paradigm of biological research from hypothesis-validation toward hypothesis-generation science. Translating diverse types of large-scale experimental data into testable hypotheses, however, remains a daunting task. We previously demonstrated that heterogeneous genomics data can be integrated into a single genome-scale gene network with high prediction power for ribonucleic acid interference (RNAi) phenotypes in Caenorhabditis elegans, a popular metazoan model in the study of developmental biology, neurobiology and genetics. Here, we present WormNet version 3 (v3), which is a new network-assisted hypothesis-generating server for C. elegans. WormNet v3 includes major updates to the base gene network, which substantially improved predictions of RNAi phenotypes. The server generates various gene network-based hypotheses using three complementary network methods: (i) a phenotype-centric approach to 'find new members for a pathway'; (ii) a gene-centric approach to 'infer functions from network neighbors' and (iii) a context-centric approach to 'find context-associated hub genes', which is a new method to identify key genes that mediate physiology within a specific context. For example, we demonstrated that the context-centric approach can be used to identify potential molecular targets of toxic chemicals. WormNet v3 is freely accessible at http://www.inetbio.org/wormnet.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24813450 PMCID: PMC4086142 DOI: 10.1093/nar/gku367
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(a) A Venn diagram of genes (the percentage coverage of the coding genome is indicated in the parenthesis) and links in WormNet v2 and WormNet v3. (b) A comparison of the prediction power between WormNet v2 and WormNet v3 using a total of 478 RNAi phenotypes, which contain between 5 and 500 genes, collected from WormBase239 (13). Network precision was measured by calculating the percentage of gene pairs that share RNAi phenotypes for different coverage of the coding genome. WormNet v3 shows superior performance over the entire range of the genome coverage. The assessment of new links in WormNet v3 (WormNet v3 specific links) and the excluded old links (WormNet v2 specific links) confirmed that the improved precision of WormNet v3 is attributable to the new network links that have been included in this update.
Figure 2.(a) A schematic illustration of the three network-assisted prediction methods. (b) Screen shots of the prediction results that are returned by the ‘find context-associated hub genes’ method. The analysis returned a table of hub genes that are predicted to be associated with the biological context characterized by the submitted differentially expressed genes (DEGs). If a user clicks a candidate hub (e.g. C14C10.1 shown in the table), a new web page displays the network of the hub gene and its neighbors. The network shows all links among the submitted DEGs as well as links from the hub to its neighbors that overlap with the given DEGs. By clicking a node or an edge of the network, users can also view in the lower panel detailed information about the gene or the co-functional link.
Figure 3.(a) A bar graph that shows the success rates of the predictions for the ‘extended life span’ genes. To assess the effectiveness of the phenotype-centric prediction, we performed a simulation in which 372 genes for ‘extended life span’ collected from WormBase239 (13) were predicted by network connectivity to the seed genes derived from each of three independent genome-wide RNAi screens: 29 genes from Hansen et al. (14), 85 genes from Hamilton et al. (15) and 61 genes from Curran et al. (16). For each seed gene set, the WormNet server prioritized new candidate genes for extended life span. The efficiency of each prediction was measured by the success rate, which was computed as the percentage of recapitulated non-seed genes of the 372 known genes for extended life span among the top 50, 100 and 200 candidates. The success rates ranged from 24 to 38% among the top 50 candidates for the three query sets. This range was slightly lower among the top 100 or 200 candidates. Notably, the success rate was significantly reduced when the base gene network in WormNet v2 was used, which demonstrates the significant improvement in network quality in WormNet v3. (b) A performance is measured by the number of correctly predicted RNAi phenotype annotations (y-axis) for the given rank threshold (x-axis). WormNet v3 performs slightly but consistently better than WormNet v2. Both versions of WormNet outperform randomized predictions (the curve represents the average performance of 100 random predictions). (c) A ROC curve that shows high performance of the context-centric prediction method for hypoxia response for associated genes annotated by the RNAi phenotype. Predictions based on context-associated hub (CAH) genes outperform those based on DEGs. TPR, true positive rate; FPR, false positive rate; randomized, random prediction.