| Literature DB >> 18615189 |
Issa Coulibaly1, Grier P Page.
Abstract
While it is possible to interpret microarray experiments a single gene at a time, most studies generate long lists of differentially expressed genes whose interpretation requires the integration of prior biological knowledge. This prior knowledge is stored in various public and private databases and covers several aspects of gene function and biological information. In this review, we will describe the tools and places where to find prior accurate biological information and how to process and incorporate them to interpret microarray data analyses. Here, we highlight selected tools and resources for gene class level ontology analysis (Section 2), gene coexpression analysis (Section 3), gene network analysis (Section 4), biological pathway analysis (Section 5), analysis of transcriptional regulation (Section 6), and omics data integration (Section 7). The overall goal of this review is to provide researchers with tools and information to facilitate the interpretation of microarray data.Entities:
Year: 2008 PMID: 18615189 PMCID: PMC2443398 DOI: 10.1155/2008/893941
Source DB: PubMed Journal: Int J Plant Genomics ISSN: 1687-5389
Recapitulative list of GO annotations tools.
| Tool name | Statistical model | GO abstraction level | GO visualization | Multiple testing | Type of array | Other annotation | OS |
|---|---|---|---|---|---|---|---|
| Onto-Express | hypergeometric, Fisher's exact test, binomial, | Available | DAG | Bonferroni, Holm, Sidak, FDR | 172 commercial arrays | Chromosomal position | Any |
| FatiGO+ | Fisher's exact test | Available | One level at a time | FDR | User-provided | KEGG pathways, SwissPROT keywords | Any |
| FuncAssociate | Fisher's exact test | Not available | Not available | Monte Carlo simulation | User-provided | Not available | Web-based |
| GoToolBox | hypergeometric test, Fisher's exact test or binomial | Available | One level at a time | Bonferroni | User-provided | Not available | Any |
| CLENCH2 | Hypergeometric, binomial, | Static global | DAG | None | User-provided | Not available | Windows |
| BiNGO | Hypergeometric, binomial | Available, GOSlim | DAG | FDR, Bonferroni | commercial arrays | Not available | |
| GoSurfer |
| Lowest level | DAG | FDR | Affymetrix only | Not available | Windows |
Additional links for pathways databases on the internet.
| Database name | Description | URL |
|---|---|---|
| PathDB | Biochemical pathways, compounds and metabolism |
|
| UM-BBD | University of Minnesota biocatalysis and biodegradation database |
|
| BIND | Biomolecular interaction network database |
|
| BRITE | Biomolecular relations in information transmission and expression, part of KEGG |
|
| PAJEK | Program for large network analysis |
|
| DDIB | Database of domain interactions and binding |
|
| DIP | Database of interacting proteins: experimentally determined protein-protein interactions |
|
| IntAct project | Protein-protein interaction data |
|
| InterDom | Putative protein domain interactions |
|
| PSIbase | Interaction of proteins with known 3D structures | |
| Reactome | A knowledgebase of biological pathways |
|
| STRING | Predicted functional associations between proteins |
|
| TRANSPATH | Gene regulatory networks and microarray analysis |
|
Databases for transcription factors available on the internet.
| Databse name | Description | URL |
|---|---|---|
| ACTIVITY | Functional DNA/RNA site activity |
|
| DoOP | Database of orthologous promoters: chordates and plants |
|
| EPD | Eukaryotic promoter database |
|
| JASPAR | PSSMs for transcription factor DNA-binding sites |
|
| MAPPER | Putative transcription factor binding sites in various genomes |
|
| TESS | Transcription element search system |
|
| TRANSCompel | Composite regulatory elements affecting gene transcription in eukaryotes |
|
| TRED | Transcriptional regulatory element database |
|
| TRRD | Transcription regulatory regions of eukaryotic genes |
|
| AthaMap | Genome-wide map of putative
transcription factor binding sites in |
|
| DATF | Database of |
|
Proteomics databases available on the internet.
| Databse name | Description | URL |
|---|---|---|
| RPD | Rice proteome database |
|
| ANPD | Arabidopsis nucleolar protein database |
|
| AMPD | Arabidopsis mitochondrial protein database |
|
| PA-GOSUB | Protein sequences from model organisma, GO assignement and subcellular localization |
|
| Swiss-Prot | A curated protein sequence database which strives to provide a high level of annotation |
|
| AAindex | Database of various physicochemical and biochemical properties of amino acids and pairs of amino acids |
|
| Prosite | Database of protein domains, families and functional sites, as well as associated patterns and profiles |
|
| PLANT-PIs | Database of information on the distribution and functional properties of protease inhibitors in higher plants |
|
| GeneFarm | Annotation of Arabidopsis genes and proteins |
|
Main features of the types of bioinformatics tools used for the analysis of DNA microarray data.
| Tools and resources | Goal | Methods |
|---|---|---|
| Class level functional Annotation | Determine a biological meaning to groups of related genes identified by microarray analysis | Overrepresentation test of gene ontology (GO) terms |
| Gene coexpression | Identify common expression patterns between genes in order to infer biological function | Correlation tests of gene expression |
| Gene network Analysis | Capture the interconnectedness of cellular components in order to explain biological phenomena | Systems biology approach |
| Gene network reconstruction | Develop models for gene regulatory networks | Bayesian inference theory Mutual information theory |
| Network visualization | Display a simplified view of large amount biological components and their interactions | Graph theory |
| Network exploration | Associate network nodes and edges with biological information | Incorporate heterogeneous data from various databases |
| Biological pathway resources | Map biological pathways information into inferred network | Collect and process information from pathway databases |
| Transcriptional regulation analysis | Identify transcription factors that regulate gene expression | Overrepresentation test of regulatory motifs in promoter regions of related genes |