| Literature DB >> 12702209 |
Barry R Zeeberg1, Weimin Feng, Geoffrey Wang, May D Wang, Anthony T Fojo, Margot Sunshine, Sudarshan Narasimhan, David W Kane, William C Reinhold, Samir Lababidi, Kimberly J Bussey, Joseph Riss, J Carl Barrett, John N Weinstein.
Abstract
We have developed GoMiner, a program package that organizes lists of 'interesting' genes (for example, under- and overexpressed genes from a microarray experiment) for biological interpretation in the context of the Gene Ontology. GoMiner provides quantitative and statistical output files and two useful visualizations. The first is a tree-like structure analogous to that in the AmiGO browser and the second is a compact, dynamically interactive 'directed acyclic graph'. Genes displayed in GoMiner are linked to major public bioinformatics resources.Entities:
Mesh:
Year: 2003 PMID: 12702209 PMCID: PMC154579 DOI: 10.1186/gb-2003-4-4-r28
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1GoMiner displays for microarray gene-expression data on prostate cancer cell line DU145 and a subline (RC0.1) selected for resistance to a topoisomerase 1 inhibitor. (a) Tree-like display showing underexpressed genes (green down-arrows), overexpressed genes (red up-arrows), and unchanged genes (gray circles) in the GO 'Apoptosis Regulator' category and its subcategories. The blue number indicates a 2.4-fold enrichment of changed genes in this category. The p-value (Fisher's exact) indicates that, despite this degree of enrichment, the small total number of genes (14) in this category prevents statistical significance. (b) Dynamically generated SVG graphic of the 'Biological Process' DAG with genes in the GO 'Apoptosis Regulator' category opened in a pull-down list by mousing-over. Categories enriched more than 1.5-fold with flagged genes are color-coded red; those depleted more than 1.5-fold are blue. The rest of the categories are gray.
Figure 2Schematic of GoMiner architecture and data flow.
Two-by-two contingency table for flagged and unflagged genes in a GO category
| Flagged genes | Non-flagged genes | Total | |
| In category | |||
| Not in category | ( | ||
| Total |
nf is the number of flagged genes in the category, n is the total number of genes in the category, Nf is the number of flagged genes on the microarray, and N is the total number of genes on the microarray. All numbers are those obtained after dereplicating multiple instances of the same gene.