| Literature DB >> 26527726 |
Eiru Kim1, Sohyun Hwang2, Hyojin Kim1, Hongseok Shim1, Byunghee Kang1, Sunmo Yang1, Jae Ho Shim1, Seung Yeon Shin1, Edward M Marcotte3, Insuk Lee4.
Abstract
Laboratory mouse, Mus musculus, is one of the most important animal tools in biomedical research. Functional characterization of the mouse genes, hence, has been a long-standing goal in mammalian and human genetics. Although large-scale knockout phenotyping is under progress by international collaborative efforts, a large portion of mouse genome is still poorly characterized for cellular functions and associations with disease phenotypes. A genome-scale functional network of mouse genes, MouseNet, was previously developed in context of MouseFunc competition, which allowed only limited input data for network inferences. Here, we present an improved mouse co-functional network, MouseNet v2 (available at http://www.inetbio.org/mousenet), which covers 17 714 genes (>88% of coding genome) with 788 080 links, along with a companion web server for network-assisted functional hypothesis generation. The network database has been substantially improved by large expansion of genomics data. For example, MouseNet v2 database contains 183 co-expression networks inferred from 8154 public microarray samples. We demonstrated that MouseNet v2 is predictive for mammalian phenotypes as well as human diseases, which suggests its usefulness in discovery of novel disease genes and dissection of disease pathways. Furthermore, MouseNet v2 database provides functional networks for eight other vertebrate models used in various research fields.Entities:
Mesh:
Year: 2015 PMID: 26527726 PMCID: PMC4702832 DOI: 10.1093/nar/gkv1155
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
MouseNet v2 and component networks inferred from 13 distinct data types
| Network | Description | Genes | Links |
|---|---|---|---|
| MouseNet v2 | Integrated network | 17 714 | 788 080 |
| Component networks | |||
| MM-CX | By co-expression of mouse genes | 14 087 | 180 037 |
| MM-GN | By gene neighborhood of two bacterial orthologs of mouse genes in prokaryotic genomes | 4608 | 157 769 |
| MM-PG | By phylogenetic profile similarity across species | 6376 | 231 833 |
| MM-LC | By literature curated mouse PPIs downloaded from iRefIndex v14.0 | 5289 | 11 678 |
| DM-CX | By co-expression of fly ( | 4047 | 31 951 |
| DM-LC | By literature curated fly orthologous PPIs | 1316 | 3240 |
| HS-HT | By high-throughput human orthologous PPIs | 4558 | 16 487 |
| HS-LC | By literature curated human orthologous PPIs | 13 676 | 163 754 |
| SC-CC | By co-citation of yeast ( | 3094 | 43 148 |
| SC-CX | By co-expression of yeast orthologs | 1996 | 40 804 |
| SC-GT | By genetic interactions of yeast orthologs | 1692 | 12 526 |
| SC-HT | By high-throughput yeast orthologous PPIs | 2304 | 41 735 |
| SC-LC | By literature curated yeast orthologous PPIs | 2553 | 25 891 |
Figure 1.Performance assessment of publicly available functional gene networks for the laboratory mouse. The results of assessment based on precision of network gene pairs for the same protein complexes by CORUM database (A), for protein–protein interactions by Reactome database (B), for the same human diseases by OMIM database (C) or GWAS catalog (D) for the given coverage of mouse coding genome suggests that MouseNet v2 generally performs better than other mouse functional gene networks including MouseNet v1.
Figure 2.Overview of network-assisted research tools in the MouseNet v2 database. MouseNet v2 provides two network search options, one is for finding new member genes of a pathway/trait and the other is for inferring functions for a query gene from its network neighbors. In addition, MouseNet v2 provides all network information including eight other vertebrate species, enabling various network analysis for mouse and other vertebrates with the integrated networks, component networks for distinct types of data, and co-expression networks for different biological contexts.
Figure 3.Validating predictions by MouseNet v2. (A) Validation of predictions for new members for a pathways using mouse phenotype and human disease database. If known genes for a MP or OMIM term are well connected to each other in the network, network-based prediction would predict new genes for the same MP or OMIM term. The interconnectivity among the known genes for a phenotype was analyzed by ROC curve which was then summarized into AUC. MouseNet v2 shows substantially higher distribution of AUCs for 5424 MP terms and 56 mouse OMIM terms compared with randomized networks. (B) Validation of predictions for new functional concepts for a query gene. We have run the prediction for KEGG pathway terms, and count the number of mouse genes whose correct KEGG annotation was retrieved within top N ranks. For example, known KEGG annotations for ∼60% of tested mouse genes was retrieved within top 10 predictions by MouseNet v2 ‘Infer functions from network neighbors’ option, whereas only ∼5% was so by randomized networks. (C) Validation of predictions for new member genes for a pathways in chicken using spatiotemporal expression data of chicken genes based on GEISHA database. MouseNet v2 shows substantially higher distribution of AUCs for 1749 spatiotemporal expression sets by GEISHA database compared with randomized networks.