| Literature DB >> 16845060 |
Maurice Scheer1, Frank Klawonn, Richard Münch, Andreas Grote, Karsten Hiller, Claudia Choi, Ina Koch, Max Schobert, Elisabeth Härtig, Ulrich Klages, Dieter Jahn.
Abstract
A novel program suite was implemented for the functional interpretation of high-throughput gene expression data based on the identification of Gene Ontology (GO) nodes. The focus of the analysis lies on the interpretation of microarray data from prokaryotes. The three well established statistical methods of the threshold value-based Fisher's exact test, as well as the threshold value-independent Kolmogorov-Smirnov and Student's t-test were employed in order to identify the groups of genes with a significantly altered expression profile. Furthermore, we provide the application of the rank-based unpaired Wilcoxon's test for a GO-based microarray data interpretation. Further features of the program include recognition of the alternative gene names and the correction for multiple testing. Obtained results are visualized interactively both as a table and as a GO subgraph including all significant nodes. Currently, JProGO enables the analysis of microarray data from more than 20 different prokaryotic species, including all important model organisms, and thus constitutes a useful web service for the microbial research community. JProGO is freely accessible via the web at the following address: http://www.jprogo.de.Entities:
Mesh:
Year: 2006 PMID: 16845060 PMCID: PMC1538798 DOI: 10.1093/nar/gkl329
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of all 23 prokaryotic species currently available in JProGO together with the number of genes
| Organism | Gene number |
|---|---|
| 5603 | |
| 4101 | |
| 3737 | |
| 2373 | |
| 3099 | |
| 4291 | |
| 1580 | |
| 2981 | |
| 2855 | |
| 1770 | |
| 3918 | |
| 4187 | |
| Mycoplasma genitalium (G-37) | 480 |
| 688 | |
| 5573 | |
| 5351 | |
| 1374 | |
| 834 | |
| 4412 | |
| 2595 | |
| 2094 | |
| 1697 | |
| 4090 |
Only chromosomal protein-coding genes are considered; data were imported from PRODORIC database (January 2006) (19).
Figure 1JProGO web interface data submission form.
Figure 2Workflow of the analysis process of JProGO.
Example for the application of JProGO for a microarray dataset from E.coli (21) that investigates the influence of ArcA, a global transcriptional regulator on genes of aerobic function (compare to Figure 3)
| GO category | GO accession nos | GO name | |
|---|---|---|---|
| Molecular function | GO:0005215 | Transporter activity | 1.8845E-6 |
| Biological process | GO:0005996 | Monosaccharide metabolism | 4.71 97E-6 |
| Biological process | GO:0019318 | Hexose metabolism | 1.2557E-5 |
| Biological process | GO:0006066 | Alcohol metabolism | 1.3639E-5 |
| Biological process | GO:0006810 | Transport | 1.5379E-5 |
| Biological process | GO:0006351 | Transcription, DNA-dependent | 2.1493E-5 |
| Biological process | GO:0051179 | Localization | 2.2139E-5 |
| Biological process | GO:0051234 | Establishment of localization | 2.2139E-5 |
| Biological process | GO:0015980 | Energy derivation by oxidation of organic compounds | 4.5974E-5 |
| Molecular function | GO:0003676 | Nucleic acid binding | 4.9689E-5 |
Table view showing the significant GO nodes with their P-values.
Figure 3Example for the application of JProGO for a microarray dataset from E.coli (21) that investigates the influence of ArcA, a global transcriptional regulator on genes of aerobic function. View of the subgraph induced by the significant GO nodes (thick border, see also Table 1) and the root node (‘all’). The node's P-value is reflected by its size and brightness whereas nodes with lower P-values are larger and brighter. In addition, the node's colour represents its GO sub-ontology (category) which is either molecular function (red), biological process (green) or cellular component (blue).