| Literature DB >> 19920988 |
M F W Te Pas1, S van Hemert, B Hulsegge, A J W Hoekman, M H Pool, J M J Rebel, M A Smits.
Abstract
Pathway information provides insight into the biological processes underlying microarray data. Pathway information is widely available for humans and laboratory animals in databases through the internet, but less for other species, for example, livestock. Many software packages use species-specific gene IDs that cannot handle genomics data from other species. We developed a species-independent method to search pathways databases to analyse microarray data. Three PERL scripts were developed that use the names of the genes on the microarray. (1) Add synonyms of gene names by searching the Gene Ontology (GO) database. (2) Search the Kyoto Encyclopaedia of Genes and Genomes (KEGG) database for pathway information using this GO-enriched gene list. (3) Combine the pathway data with the microarray data and visualize the results using color codes indicating regulation. To demonstrate the power of the method, we used a previously reported chicken microarray experiment investigating line-specific reactions to Salmonella infection as an example.Entities:
Year: 2008 PMID: 19920988 PMCID: PMC2775695 DOI: 10.1155/2008/719468
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Figure 1Flow diagram of the pathway analysis tool. The method comprises four steps, of which the first three steps are automated by PERL scripts. Step 1. Use the gene names list of the microarray annotation to search the Gene Ontology database for synonyms of the gene names and add to the list. Step 2. Use the updated gene name list to search the pathway database for pathway information of the genes and add pathway names and web links to the file. Step 3. Visualize the microarray results on the pathways. Step 4. Construct networks of pathways and generate biological knowledge from the results.
Box 1
Figure 2The number of genes on the microarray found per pathway.
Pathways returned by searching the KEGG database. The list of gene names of the microarray was first updated for all synonyms using the Gene Ontology database before searching the KEGG database.
| Pathways category | Number of |
|---|---|
| pathways | |
| A: total number of pathways returned by KEGG search | 178 |
| B: positive associated pathways (with links to other pathways) | 57 (22) |
| C: no regulation* | 14 |
| D: false positive** | 30 |
| E: (too) limited information to conclude*** | 35 |
| F: uncertain**** | 42 |
*None of the genes in the pathways present on the microarray were regulated. These pathways were considered not related to the chicken line differences for Salmonella infection response.
**Plant or prokaryote-specific pathways—recognized due to gene name similarity. Almost always these pathways were recognized by a very limited number of genes on the microarray.
***The genes with microarray data on the pathway are too limited in number or their locations on the pathway are too scattered to conclude on the relevance of the pathway for the studied traits.
****The available microarray data of the genes on the pathway show either limited or confusing up- or downregulatory patterns.
Figure 3The number of pathways per gene returned by the KEGG database search.