| Literature DB >> 25943544 |
Junha Shin1, Sunmo Yang1, Eiru Kim1, Chan Yeong Kim1, Hongseok Shim1, Ara Cho1, Hyojin Kim1, Sohyun Hwang1, Jung Eun Shim1, Insuk Lee2.
Abstract
Drosophila melanogaster (fruit fly) has been a popular model organism in animal genetics due to the high accessibility of reverse-genetics tools. In addition, the close relationship between the Drosophila and human genomes rationalizes the use of Drosophila as an invertebrate model for human neurobiology and disease research. A platform technology for predicting candidate genes or functions would further enhance the usefulness of this long-established model organism for gene-to-phenotype mapping. Recently, the power of network prioritization for gene-to-phenotype mapping has been demonstrated in many organisms. Here we present a network prioritization server dedicated to Drosophila that covers ∼95% of the coding genome. This server, dubbed FlyNet, has several distinctive features, including (i) prioritization for both genes and functions; (ii) two complementary network algorithms: direct neighborhood and network diffusion; (iii) spatiotemporal-specific networks as an additional prioritization strategy for traits associated with a specific developmental stage or tissue and (iv) prioritization for human disease genes. FlyNet is expected to serve as a versatile hypothesis-generation platform for genes and functions in the study of basic animal genetics, developmental biology and human disease. FlyNet is available for free at http://www.inetbio.org/flynet.Entities:
Mesh:
Year: 2015 PMID: 25943544 PMCID: PMC4489278 DOI: 10.1093/nar/gkv453
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of the 21 component networks for FlyNet
| Code | Description | # Genes | # Links |
|---|---|---|---|
| DM-CC | Co-citation of two fly genes across Medline and PubMed Central | 6027 | 503 475 |
| DM-CX | Co-expression of two fly genes in high-dimensional gene expression data from the GEO database ( | 11 718 | 275 033 |
| DM-DC | Co-occurrence of protein domains between two fly genes ( | 4407 | 7604 |
| DM-GN | Chromosomal proximity between bacterial orthologs of two fly genes in bacterial genomes ( | 1979 | 15 820 |
| DM-HT | Protein–protein interactions (PPIs) identified by high-throughput assays in iRefWeb database ( | 7759 | 25 519 |
| DM-LC | Protein–protein interactions (PPIs) identified by small-scale experiments collected via literature curation in iRefWeb database ( | 1202 | 2226 |
| DM-PG | Phylogenetic profile similarity between two fly genes | 3357 | 80 506 |
| AT-CC | Orthology transfer of co-citation links in an | 1747 | 17 501 |
| AT-CX | Orthology transfer of co-expression links in an | 1105 | 9455 |
| AT-HT | Orthology transfer of high-throughput PPI in an | 1013 | 2823 |
| AT-LC | Orthology transfer of literature curated PPI in an | 856 | 1977 |
| CE-CX | Orthology transfer of co-expression links in a | 1434 | 17 497 |
| DR-CX | Orthology transfer of co-expression links in a | 3223 | 55 515 |
| HS-CX | Orthology transfer of co-expression links in a | 3366 | 32 482 |
| HS-HT | Orthology transfer of high-throughput PPI in a | 2741 | 12 520 |
| HS-LC | Orthology transfer of literature curated PPI in a | 5254 | 50 488 |
| SC-CC | Orthology transfer of co-citation links in a | 2449 | 48 473 |
| SC-CX | Orthology transfer of co-expression links in a | 1674 | 18 488 |
| SC-GT | Orthology transfer of genetic interactions in a | 1254 | 6482 |
| SC-HT | Orthology transfer of high-throughput PPI in a | 1622 | 18 300 |
| SC-LC | Orthology transfer of literature-curated PPI in a | 2016 | 16 481 |
| FlyNet | Integrated network | 13 119 | 779 484 |
Figure 1.The assessment of FlyNet using gene pairs derived from (A) FlyReactome annotations and (B) GenomeRNAi phenotype terms. The precision of the gene pairs was measured every 1000 pairs, which were sorted by the network edge weight scores. For the assessment by GenomeRNAi, the coverage of the fly genome was used, because the number of gene pairs by the RNAi phenotype is so large (316 924) that the recall for the majority of the gene pairs is not feasible.
Figure 2.Fly prioritizer analyses. (A) The assessment of function prioritization using 5434 KEGG annotations for 2690 fly genes. The number of correctly predicted KEGG terms (y-axis) for the given top rank (x-axis) was assessed for FlyNet and random networks. (B) The results of the STN enrichment analysis for three GO-BP terms: compound eye development, sensory perception of smell and spermatogenesis. The logarithm of the number of unique network neighbors for each developmental stage or tissue type was calculated for all guide genes and represented as bar graphs. The codes for the four developmental stages are: EB, embryo; LV, larvae; PP, pupae and AD, adult. The codes for the 10 tissue types are: AG, accessory gland; CC, carcass; CN, central nervous system; DS, digestive system; FB, fat body; HD, head; ID, imaginal disc; OV, ovaries; SG, salivary gland and TT, testes. (C) A heat map of the STN scores for spermatogenesis after sorting for the most enriched tissue type, the accessory gland (AG). (D) The assessment of gene prioritization for 389 RNAi phenotypes derived from the GenomeRNAi database. (E) The discovery rates for the top 50, 100 and 200 novel candidate genes by FlyNet with three independent RNAi screens for the Imd pathway based on the Imd pathway genes annotated by the FlyReactome database. The discovery rate by random chance is ∼1.6%, which is indicated by the red dotted line.
Figure 3.Human prioritizer analyses. Novel candidate genes for autism, epilepsy and schizophrenia were predicted by FlyNet, and top-ranked genes were validated using (A) 138 neurological disease genes identified from a recent mutagenesis screen for X-chromosome fly genes and (B) 800 genes with de novo mutations for autism, epilepsy or schizophrenia. The discovery rates for the top 50, 100 and 200 candidates are represented on the bar graph, and the expected discovery rates by random chance are indicated by the dotted lines.