| Literature DB >> 20576703 |
David Warde-Farley1, Sylva L Donaldson, Ovi Comes, Khalid Zuberi, Rashad Badrawi, Pauline Chao, Max Franz, Chris Grouios, Farzana Kazi, Christian Tannus Lopes, Anson Maitland, Sara Mostafavi, Jason Montojo, Quentin Shao, George Wright, Gary D Bader, Quaid Morris.
Abstract
GeneMANIA (http://www.genemania.org) is a flexible, user-friendly web interface for generating hypotheses about gene function, analyzing gene lists and prioritizing genes for functional assays. Given a query list, GeneMANIA extends the list with functionally similar genes that it identifies using available genomics and proteomics data. GeneMANIA also reports weights that indicate the predictive value of each selected data set for the query. Six organisms are currently supported (Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus, Homo sapiens and Saccharomyces cerevisiae) and hundreds of data sets have been collected from GEO, BioGRID, Pathway Commons and I2D, as well as organism-specific functional genomics data sets. Users can select arbitrary subsets of the data sets associated with an organism to perform their analyses and can upload their own data sets to analyze. The GeneMANIA algorithm performs as well or better than other gene function prediction methods on yeast and mouse benchmarks. The high accuracy of the GeneMANIA prediction algorithm, an intuitive user interface and large database make GeneMANIA a useful tool for any biologist.Entities:
Mesh:
Year: 2010 PMID: 20576703 PMCID: PMC2896186 DOI: 10.1093/nar/gkq537
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Images from http://www.genemania.org. (A) The initial query screen, with the advanced options panel expanded, providing the user the ability to select desired networks, choose a network weighting method and the number of genes to return. (B) The results page for the mouse default query, which is a set of genes involved in leukemia. Users can examine information associated with each gene and network by expanding their entries on the corresponding panel. We include linkouts to model organism databases (FlyBase, WormBase, SGD and TAIR) and to the Arabidopsis resource BAR (19) that is useful to plant users.
Figure 2.Effects of data set selection on network topology. GeneMANIA showing the results of two yeast queries. (A) The yeast cell-cycle default query, using all default parameters. (B) The yeast cell-cycle default query, using default network weighting method. Only shared protein domain data sets are selected.
Figure 3.Effects of network weighting method selection on network topology. GeneMANIA showing the results of two human queries. (A) The human DNA repair and replication default query, using all default parameters. (B) The human DNA repair and replication default query, using the default data set selection but with the ‘Equal by data type’ network weighing method selected. Some genes and interactions found in this query that were not present in the default query (shown in A) are indicated by arrows.
Number of networks per organism
| Network type | Organism | |||||
|---|---|---|---|---|---|---|
| Worm | Fly | Human | Mouse | Yeast | ||
| Co-expression | 56 | 10 | 40 | 69 | 46 | 55 |
| Physical interaction | 11 | 8 | 8 | 113 | 28 | 64 |
| Genetic interaction | 1 | 4 | 2 | 1 | 0 | 16 |
| Shared protein domains | 2 | 2 | 2 | 2 | 2 | 2 |
| Co-localization | 1 | 1 | 7 | 2 | 2 | 1 |
| Pathways | 0 | 0 | 0 | 5 | 0 | 0 |
| Predicted interaction | 23 | 50 | 1 | 33 | 48 | 21 |
| Other | 0 | 1 | 0 | 0 | 3 | 4 |
Co-expression and shared protein domain network links are weighted continuously from 0…1, physical and genetic interaction networks are binary. The ‘other’ category consists of organism-specific functional genomics networks, such as from the MouseFunc competition.