| Literature DB >> 26329719 |
Ashley J Waardenberg1,2, Samuel D Basset3, Romaric Bouveret4,5, Richard P Harvey6,7,8,9.
Abstract
BACKGROUND: Gene ontology (GO) enrichment is commonly used for inferring biological meaning from systems biology experiments. However, determining differential GO and pathway enrichment between DNA-binding experiments or using the GO structure to classify experiments has received little attention.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26329719 PMCID: PMC4557902 DOI: 10.1186/s12859-015-0701-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Overview of the CompGO pipeline and implemented functions. a The “annotateBedFromDb” function annotates DNA coordinates from BED files against transcript coordinates from a reference genome, “getFnAnot_genome” queries gene lists using the RDAVIDWebService and returns statistics and counts of each GO term and “doZtrans.single” calculates the log odds ratio of GO term enrichment. Note: users can supply their own background genome regions; by default the whole genome is used. b Given log odds ratios, multiple experiments can be reduced into a single matrix for Principle Component Analysis or Hierarchical Clustering, via “plotPCA” and “plotDendrogram” respectively. c Differentially Enriched GOs (DiEGOs) between pairs of experiments are calculated via the differential log odds ratio and top DiEGOs can be visualized via Directed Acyclic Graphs, “plotZRankedDAG”, and top differentially enriched pathways via “viewKEGG”. CompGO functions are colored red
Fig. 2Example functionality of CompGO using published ChiP-seq data. 1000 BED coordinates were selected at random and form part of the example dataset packages with CompGO. a Differentially enriched GO and pathway terms. b Hierarchical clustering 1. c Principle Component Analysis. d Direct comparison of z-scores with Jaccard Coefficient overlaid (Eq. 5) onto terms. e Directed Acyclic Graph. f KEGG Pathway colored by which experiment the Gene was mapped to. a, b, e and f utilise Eq. 3 in their rankings. d utilises Eq. 1
Fig. 3Application of CompGO to experimental data. Direct comparison of z-scores with Jaccard Coefficient overlaid onto terms for a NKX2-51 vs. NKX2-52 and b NKX2-51 vs. ELK4. c Principle Component Analysis. d Hierarchical clustering
Fig. 4Comparison of CompGO to the overlap method. Differentially enriched GO terms using the overlap method (p ≤ 0.05) for a NKX2-51 vs. NKX2-52 and b NKX2-51 vs. ELK4. c Log odds ratio of NKX2-51 vs. NKX2-52 versus differential p-values returned from DAVID, scored as the difference between –log10 transformed vales. d Log odds ratio of NKX2-51 vs. ELK4 versus differential p-values returned from DAVID, scored as the difference between –log10 transformed vales. Red dots indicate GO terms determined significant and specific using the overlap method. Blue dots are GO terms returned as significant from CompGO. Blue dots with red centres are GO terms returned as significant by both methods. Black dots are non-significant terms using both approaches. For (b) and (c) the density distribution of log odds ratios returned by CompGO is on the top of each panel