| Literature DB >> 15998470 |
Barry R Zeeberg1, Haiying Qin, Sudarshan Narasimhan, Margot Sunshine, Hong Cao, David W Kane, Mark Reimers, Robert M Stephens, David Bryant, Stanley K Burt, Eldad Elnekave, Danielle M Hari, Thomas A Wynn, Charlotte Cunningham-Rundles, Donn M Stewart, David Nelson, John N Weinstein.
Abstract
BACKGROUND: We previously developed GoMiner, an application that organizes lists of 'interesting' genes (for example, under-and overexpressed genes from a microarray experiment) for biological interpretation in the context of the Gene Ontology. The original version of GoMiner was oriented toward visualization and interpretation of the results from a single microarray (or other high-throughput experimental platform), using a graphical user interface. Although that version can be used to examine the results from a number of microarrays one at a time, that is a rather tedious task, and original GoMiner includes no apparatus for obtaining a global picture of results from an experiment that consists of multiple microarrays. We wanted to provide a computational resource that automates the analysis of multiple microarrays and then integrates the results across all of them in useful exportable output files and visualizations.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15998470 PMCID: PMC1190154 DOI: 10.1186/1471-2105-6-168
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic of stand-alone and web versions of High-Throughput GoMiner architecture and data flow.
Figure 2Screen shot of High-Throughput GoMiner results in Excel for GO categories enriched in genes with altered expression. The 30 GO categories with FDR = 0.10 are color-coded red; the other GO categories are color-coded blue.
Figure 3Clustered image map (CIM) [5,6] showing GO categories versus genes for genes with altered expression in a patient with CVID. Yellow indicates absence of the gene from the GO category. Red and green indicate over-and underexpressed genes, respectively. Clustering was performed with the Pearson correlation metric and average linkage algorithm. Instructions for using CIMminer to generate the CIMs in this paper are given in Supplementary Materials [see Additional file 5].
Clusters Of Categories Derived From The CIM (Figure 3)
| 1 | Exogenous Antigen | antigen presentation exogenous antigen |
| antigen processing exogenous antigen via MHC class II | ||
| 2 | Xenobiotic | xenobiotic metabolism |
| response to xenobiotic stimulus | ||
| 3a | Signaling | cell surface receptor linked signal transduction |
| G-protein coupled receptor protein signaling pathway | ||
| 4 | Homeostasis | calcium ion homostasis |
| di-tri-valent inorganic cation homeostasis | ||
| cell ion homeostasis | ||
| cation homeostasis | ||
| metal ion homeostasis | ||
| ion homeostasis | ||
| 5a | Response | response to chemical substance |
| response to abiotic stimulus | ||
| 3b | Signaling | cell-cell signaling |
| 6 | Taxis | taxis |
| chemotaxis | ||
| 5b | Response | cellular defense response |
| 5c | Response | response to stimulus |
| response to external stimulus | ||
| organismal physiological process | ||
| immune response | ||
| response to biotic stimulus | ||
| defense response | ||
| 5d | Response | response to wounding |
| 5e | Response | response to pest pathogen parasite |
| response to stress | ||
| 7 | Adhesion | cell adhesion |
| 5f | Response | humoral immune response |
| humoral defense mechansim | ||
Figure 4Clustered image map (CIM) 5,6 showing transcription factor binding sites versus GO categories in a patient with CVID. Red indicate FDR = 0.0, and yellow indicates FDR > 1.0 or a missing value. Clustering was performed with the Pearson correlation metric and average linkage algorithm. The inset is a blow-up of the first few transcription factor binding site names. A full-size version in which all the transcription factor binding site names are readable is available in Supplementary Materials [see Additional file 8]. There are 35 rather than 30 GO categories because this result was computed with a more recent version of the GO Consortium database.
Names and Consensus Sequences for Transcription Factors that Co-Regulate the Changed Genes in the GO Category 'G-protein Coupled Receptor Protein Signaling Pathway' and a Large Core of 'Response' Categories (Figure 4 and Supplementary Materials [see Additional file 8])
| AP-1-IL-3 | TGAGTCA |
| AP-1-involucrin-H2 | TGCCTCA |
| ASP-CYP21 | CTCTGTGG |
| AlphaCE2Maf-Bfsp1 | TGCTGAC |
| B2_RS | TCCTATCA |
| CP2-consensus | GCNMNANCMAG |
| CPSI-B1 | TCTCCCA |
| EBF/Olf-1_site_2 | TCCCNNRRGRR |
| EBV-ZRE2 | TGAGCAA |
| FHX-type-A-CS | WMARYAAAYA |
| GAGA_box/CT_element | AGAGARRRR |
| GH-CSE2 | AATAAAT |
| GRE_CS7 | WCTGWTCT |
| GRE_CS8 | AGAWCAGW |
| GT-2B_RS | CCAGCTG |
| GT-IIBa-SV40 | ACAGCTG |
| HNF5-erk1 | TATTTGT |
| HiNF-Ahist | AGAAATG |
| IL4-P0/P1 | ATTTTCC |
| Initiator_CS | CTCANTCT |
| MEF-2-consensus | YTWWAAATAR |
| MEF-2_CS | YTAWAAATAR |
| NF-Y-consensus | BVDCCAATVVVVD |
| PuF_RS | GGGTGGG |
| RadLV-core | TGTGGTCA |
| Runx_CS | AACCACA |
| Six5_CS | TCARRTTNC |
| Sp1-VGF_1 | AGGGAGG |
| TCF-2-alpha_CS | SAGGAAGY |
| TCR-beta-site-6 | AATACAA |
| TRE.1 | TGACTCA |
| c-Myc_RS1 | TCTCTTA |
| c-mos_DS3 | GTTTTAA |
| delta-rpL7 | GGAGGCTG |
| forkhead_CS | WAARYAAAYW |
| p300-consensus | GGGAGTG |
Figure 5Time series for GO categories with low FDR for overexpressed genes. The data were obtained from a study of schistosomiasis in a murine model [37-40] over the course of 20 weeks after infection. 3D bar graph visualization in Excel. (Elnekave et al, in preparation).
Figure 6CIM [5,6] with hierarchically clustered categories (Pearson correlation, average linkage clustering) versus time (Elnekave et al., in preparation).