| Literature DB >> 17586825 |
Nicolas Goffard1, Georg Weiller.
Abstract
PathExpress is a web-based tool developed to interpret gene expression data obtained from microarray experiments by identifying the most relevant metabolic pathways associated with a subset of genes (e.g. differentially expressed genes). A graphical pathway representation permits the visualization of the expressed genes in a functional context. Based on the publicly accessible KEGG Ligand database, PathExpress can be adapted to any organism and is currently available for seven Affymetrix genome arrays. About 20% of the probe sets of each array have been assigned to Enzyme Commission numbers by homology relationship and linked to corresponding metabolic pathways. PathExpress is available at http://bioinfoserver.rsbs.anu.edu.au/utils/PathExpress/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17586825 PMCID: PMC1933187 DOI: 10.1093/nar/gkm261
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Representation of metabolic pathways and sub-pathways. The directed graph contains two types of nodes, compounds (labelled with their KEGG identifier and represented as ellipses) and reactions (labelled with the EC number of the enzyme involved and represented as boxes). Greyed reactions show that the corresponding enzyme has been identified in a given genome array. Directed arcs between two different nodes represent the consumption or the production of compounds by a reaction. The presented pathway contains eight enzymes with probe sets assignments. Two sub-pathways can be considered, containing three and four enzymes (surrounded by a black line).
Available Affymetrix genome arrays and assignment statistics
| Affymetrix Genome Array | Organism | Sequences | Assigned sequences | EC |
|---|---|---|---|---|
| ATH1 Genome Array | 22 765 | 5 177 | 823 | |
| Drosophila Genome 2.0 Array | 18 952 | 3 107 | 724 | |
| E. coli Genome 2.0 Array | 10 208 | 2 245 | 803 | |
| Human Genome U133 Plus 2.0 Array | 39 070 | 3 332 | 658 | |
| Medicago Genome Array | 50 900 | 8 981 | 953 | |
| Rice Genome Array | 57 194 | 10 068 | 923 | |
| Soybean Genome Array | 37 618 | 6 502 | 803 | |
| Yeast Genome 2.0 Array | 5 814 | 1 471 | 601 | |
| Yeast Genome 2.0 Array | 5 028 | 1 333 | 566 |
aNumber of probe set sequences.
bNumber of probe set sequences assigned to an EC number.
cNumber of distinct EC numbers corresponding to the probe set sequences.
Figure 2.Screenshots of the PathExpress web interface. (a) The list of metabolic pathways whose enzyme composition intersects with the enzymes corresponding to a list of submitted identifiers, ordered by increasing P-value. Each row reports information concerning the pathway's name and the number of enzymes. The comparison of groups is reported with the number of submitted enzymes involved in the pathway and the P-value for finding the group by chance, associated with the corresponding adjusted P-value. The significant pathways are highlighted in red (‘carbon fixation’, ‘glycolysis/gluconeogenesis’ and ‘pentose phosphate pathway’ in this example). (b) Graphical representation of the glycolysis/gluconeogenesis pathway for enzymes identified in the Affymetrix ATH1 Genome Array (Arabidopsis thaliana). Reactions mediated by enzymes found in the genome array are highlighted in grey whereas those where the enzymes are also present in the query are highlighted in yellow. (c) Example of the detail page of an enzymatic reaction. Each enzyme is reported with its EC number linked to the KEGG database (Entry), its recommended and alternative names, the pathways in which this enzyme is involved (glycolysis/gluconeogenesis in this example) and the list of probe sets assigned. For each probe set, the identifier and description of the best match in Swiss-Prot is displayed. The ‘detail’ button is linked to the complete blast report. The corresponding row is highlighted in yellow if the probe set belongs to the submitted list of identifiers.
Comparison of PathExpress with existing tools
| Software | Pathways | Input | Output | Comments | References |
|---|---|---|---|---|---|
| PathExpress | KEGG | List of identifiers (Affymetrix identifiers and/or Gene accession numbers) | Statistically significant pathways and sub-pathways with graphical visualisation | Affymetrix data but extendable to any organism or set of sequences | |
| ArrayXPath II | GenMAPP, KEGG, BioCarta, PharmGKB | Clustered gene expression profile | Statistically significant pathways with graphical visualisation | ( | |
| KOBAS | KEGG | List of KO identifiers | Statistically significant pathways | Annotation of a set of genes or proteins with KO terms | ( |
| PathMAPA | KEGG, TAIR, NCBI, GO | Expression data file (GenePix, Affymetrix) | Statistically significant pathways, enzymes, genes with graphical visualisation | ( | |
| PathwayExplorer | KEGG, BioCarta, GenMAPP | Expression data file (Tab-delimitated) | Statistically significant pathways with graphical visualisation | Well-annotated genomes | ( |
| Pathway Miner | GenMAPP, KEGG, BioCarta | List of Gene Accession numbers with gene expression values | Statistically significant pathways with graphical visualisation | ( |