| Literature DB >> 18487275 |
Abstract
Gene Ontology (GO) analysis has become a commonly used approach for functional studies of large-scale genomic or transcriptomic data. Although there have been a lot of software with GO-related analysis functions, new tools are still needed to meet the requirements for data generated by newly developed technologies or for advanced analysis purpose. Here, we present a Gene Ontology Enrichment Analysis Software Toolkit (GOEAST), an easy-to-use web-based toolkit that identifies statistically overrepresented GO terms within given gene sets. Compared with available GO analysis tools, GOEAST has the following improved features: (i) GOEAST displays enriched GO terms in graphical format according to their relationships in the hierarchical tree of each GO category (biological process, molecular function and cellular component), therefore, provides better understanding of the correlations among enriched GO terms; (ii) GOEAST supports analysis for data from various sources (probe or probe set IDs of Affymetrix, Illumina, Agilent or customized microarrays, as well as different gene identifiers) and multiple species (about 60 prokaryote and eukaryote species); (iii) One unique feature of GOEAST is to allow cross comparison of the GO enrichment status of multiple experiments to identify functional correlations among them. GOEAST also provides rigorous statistical tests to enhance the reliability of analysis results. GOEAST is freely accessible at http://omicslab.genetics.ac.cn/GOEAST/Entities:
Mesh:
Year: 2008 PMID: 18487275 PMCID: PMC2447756 DOI: 10.1093/nar/gkn276
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The GOEAST graphical output of enriched GO terms in the biological process category for a sample experiment. Boxes represent GO terms, labeled by its GO ID, term definition, and detailed information, organized as ‘q/m|t/k (P-value)’ (see Materials and Methods section for the meanings of q, m, t and k). Significantly enriched GO terms are marked yellow. The degree of color saturation of each node is positively correlated with the enrichment significance of the corresponding GO term. Nonsignificant GO terms within the hierarchical tree are shown as white boxes. Branches of the GO hierarchical tree without significantly enriched GO terms are not shown. Arrows represent connections between different GO terms. Red arrows represent relationships between two enriched GO terms, black solid arrows represent relationships between enriched and unenriched terms and black dashed arrows represent relationships between two unenriched GO terms.
Figure 2.Graphical output of Multi-GOEAST analysis results for differentially expressed genes in Jmjd1a and Jmjd2c knockdown mice. Red and green boxes represent enriched GO terms only found in Jmjd1a knockdown mice (first experiment) or Jmjd2c knockdown mice (second experiment), respectively. Yellow boxes represent commonly enriched GO terms in both experiments. The saturation degrees of all colors represent the significance of enrichment for corresponding GO terms.