| Literature DB >> 21762485 |
Edward Dere1, Raymond Lo, Trine Celius, Jason Matthews, Timothy R Zacharewski.
Abstract
BACKGROUND: The aryl hydrocarbon receptor (AhR) is a ligand-activated transcription factor (TF) that mediates responses to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). Integration of TCDD-induced genome-wide AhR enrichment, differential gene expression and computational dioxin response element (DRE) analyses further elucidate the hepatic AhR regulatory network.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21762485 PMCID: PMC3160422 DOI: 10.1186/1471-2164-12-365
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Summary of AhR enrichment within . Cyp1a1 is represented by two RefSeq sequences (NM_009992 and NM_001136059, dark blue tracks) that have different TSSs (dark blue box at far left). The rectangles and lines represent exons and introns, respectively, and the UTRs are depicted as the thinner rectangles. Arrowhead direction indicates the orientation of the gene. The grey boxes above represent the Affymetrix 2.0R mouse tiling array probe locations across the Cyp1a1 genic regions. The location and matrix similarity (MS) scores of the consensus DREs are represented by the purple histogram. The highlighted yellow box identifies bona fide functional DREs (matrix similarity (MS) score ≥ 0.8473) involved in AhR-mediated Cyp1a1 gene expression. The red boxes identify regions of significant AhR enrichments (FDR < 0.01) based on the moving average (MA) profile by TileMap. The green histogram plots the log2 fold enrichment values for each individual probe.
Distribution and density analysis of TCDD-induced AhR enriched regionsa in the mouse genome.
| Genic Regionc | ||||||||
|---|---|---|---|---|---|---|---|---|
| Genome | Intergenic DNAb | Intragenic DNAb | 10kb upstream | 5' UTR | CDS | 3' UTR | ||
| AhR enrichment | 14,446 | 4,163 | 10,283 | 4,601 | 2,569 | 7,499 | 225 | |
| Enrichment densityd | 5.44 | 2.62 | 9.64 | 18.65 | 17.29 | 7.21 | 7.29 | |
| AhR enrichment | 974 | 344 | 630 | 306 | 132 | 507 | 9 | |
| Enrichment densityd | 0.37 | 0.22 | 0.59 | 1.24 | 0.89 | 0.49 | 0.29 | |
a AhR enriched regions with a FDR < 0.01
b intergenic, intragenic and gene regions are defined as previously described in Additional File 1
c regions are defined using the genomic locations in the refGene database from the UCSC Genome Browser
d density calculated per million base pairs
Figure 2Characterization of TCDD-induced AhR enriched regions at 2 and 24 hrs (FDR < 0.01). Frequency analysis of enriched regions relative to log2 fold enrichment at 2 hr (A) and 24 hr (B) illustrating enrichment values in intragenic (light green) and intergenic (dark green) DNA regions. Distribution of enriched regions relative to region width (C) at 2 hrs (light red) and 24 hrs (dark red) identified 90.5% of enriched sites were ≤ 1,500 bp. Comparison of AhR enriched regions at 2 and 24 hrs identified 899 overlapping regions (D). Analysis of the fold enrichment values for the 899 overlapping regions at 2 and 24 hrs identified a positive correlation (two-tailed P-value < 0.0001, Pearson correlation coefficient = 0.4853; E).
Figure 3TCDD-induced AhR enrichment (FDR < 0.01) densities in the proximal promoter (10 kb upstream and 5 kb downstream of a TSS) at 2 hrs (A) and 24 hrs (B). The bars represent the number of enriched regions in each 200 bp window. The number of DRE cores in 100 bp non-overlapping windows is superimposed (line) illustrating the overlap between AhR enriched regions and DRE cores in the proximal promoter region.
Figure 4Confirmation of hepatic TCDD-induced AhR enrichment identified by ChIP-chip analysis (FDR < 0.01) at 2 hrs by ChIP-PCR. Selected regions were chosen for verification based on position relative to a TSS, ChIP-chip fold enrichment and the presence or lack of a DRE core within the region of enrichment (A). Immunoprecipitated DNA was measured by QRTPCR and AhR enrichment was calculated as fold induction above IgG controls. The color intensity of each box represents the mean value of three independent replicates. NS = not significant compared to IgG controls (p < 0.05). 2 hr ChIP-chip enrichment values are provided in Additional File 1.
Distribution of DRE cores in AhR enriched regionsa.
| Number of DRE coresb | Number of AhR enriched regions | |
|---|---|---|
| 2 hr | 24 hr | |
| 0 | 8,353 | 472 |
| 1 | 3,705 | 289 |
| 2 | 1,372 | 121 |
| 3 | 544 | 46 |
| 4 | 223 | 16 |
| 5 | 109 | 12 |
| 6 | 67 | 7 |
| 7 | 25 | 5 |
| 8 | 15 | 0 |
| 9 | 11 | 3 |
| 10 | 7 | 1 |
| 11 | 5 | 0 |
| 12 | 3 | 1 |
| 13 | 0 | 0 |
| 14 | 3 | 1 |
| 15 | 3 | 0 |
| 16 | 1 | 0 |
a AhR enriched regions with a FDR < 0.01
b 5'-GCGTG-3' core sequence
Figure 5Mapping TCDD-induced AhR enriched regions (FDR < 0.01) with DRE locations. Regions of enrichment identified in the intergenic (purple) and intragenic (blue) DNA regions of the genome at 2 hrs (A) and 24 hrs (B) were searched for high scoring (putative functional) DRE sequences (matrix similarity score ≥ 0.8473; dark blue and dark purple segments) and low scoring DRE sequences (matrix similarity score < 0.8473; mid blue and mid purple segments) using a position weight matrix developed from bona fide functional DREs [8]. Light blue and light purple segments represent regions with no DRE core sequence. A total of 6,595 enriched regions (6,093 at 2 hrs and 502 at 24 hrs) contained at least one DRE core (5'-GCGTG-3'). 50% of these regions were within 135 bp of a DRE core (based on the location of maximum enrichment within the enriched region; C).
Significantly over-represent transcription factor module families in TCDD-induced AhR enriched regionsa.
| TF Module Family | Module Description | 2 hr AhR enriched regions | 24 hr AhR enriched regions | ||||||
|---|---|---|---|---|---|---|---|---|---|
| # of matches | Expected # of matches | Over-representaionb | Z-Score | # of matches | Expected # of matches | Over-representaionb | Z-Score | ||
| AHR | AhR-ARNT heterodimer | 9,447 | 4,278.30 | 2.21 | 79.03 | 851 | 297.44 | 2.86 | 32.07 |
| SP1 | GC-Box factors SP1/GC | 19,356 | 12,839.04 | 1.51 | 57.54 | 1,346 | 892.61 | 1.51 | 15.17 |
| HIF | Hypoxia inducible factor, bHLH/PAS protein family | 8,763 | 4,841.13 | 1.81 | 56.37 | 569 | 336.57 | 1.69 | 12.64 |
| E2F | E2F-Myc activator/cell cycle regulator | 18,247 | 12,444.88 | 1.47 | 52.04 | 1,266 | 865.21 | 1.46 | 13.62 |
| ZBP | Zinc binding protein factors | 25,542 | 18,518.20 | 1.38 | 51.65 | 1,739 | 1,287.45 | 1.35 | 12.58 |
| NRF1 | Nuclear respiratory factor 1 | 3.475 | 1,494.81 | 2.32 | 51.21 | 252 | 103.92 | 2.42 | 14.48 |
| ZF5 | ZF5 POZ domain zinc finger | 3,156 | 1,442.92 | 2.19 | 45.09 | 205 | 100.32 | 2.04 | 10.40 |
| NF1 | Nuclear factor 1 | 13,047 | 8,876.90 | 1.47 | 44.27 | 886 | 617.15 | 1.44 | 10.81 |
| NR2F | Nuclear receptor subfamily 2 factors | 44,774 | 36,390.64 | 1.23 | 44.02 | 3,180 | 2,530.00 | 1.26 | 12.93 |
| EGR | EGR/nerve growth factor included protein c | 22,224 | 16,794.62 | 1.32 | 41.92 | 1,541 | 1,167.62 | 1.32 | 11. |
| PPAR | Peroxisome proliferator-activated receptor | 24,808 | 19,035.70 | 1.30 | 41.87 | 1,752 | 1,323.43 | 1.32 | 11.78 |
| RXR | RXR heterodimer binding sites | 41,027 | 33,441.00 | 1.23 | 41.54 | 2,932 | 2,324.93 | 1.26 | 12.60 |
| WHN | Winged helix binding sites | 3,030 | 1,477.23 | 2.05 | 40.39 | 206 | 102.70 | 2.01 | 10.14 |
a AhR enriched regions with a FDR < 0.01
b total number of matches in enriched regions/expected number of matches in genome
Complete list of over-represented TF module families are provided in Additional Files 6 and 7
Figure 6. The non-repetitive over-represented motifs from each region are shown with their consensus and reverse complement sequence, and the Gibbs motif sampler score. Over-represented motifs were associated with specific TFBSs in JASPAR and TRANSFAC based on the consensus sequence alignments and E-value scores.
Figure 7Molecular and cellar functions over-represented by genes associated with significant AhR enrichment (FDR < 0.01) containing a DRE core. The 4,544 and 332 unique genes with AhR enrichment with a DRE core at 2 hrs (A) and 24 hrs (B), respectively, were analyzed using Ingenuity Pathway Analysis for enriched biological functions using Fisher's Exact Test (p < 0.01; orange line). The blue bars represent the log Odds value calculated from the p-value of each functional group.
Figure 8Mapping TCDD-induced AhR enriched regions (FDR < 0.01) and DRE analysis to genes. The 10,283 and 660 AhR enrichments within the intragenic DNA regions at 2 and 24 hrs (blue shaded areas in Figures 5A-B) mapped to 5,307 (A) and 591 (B) distinct genes based on the refGene data from the UCSC Genome Browser. These genes were searched for the presence of high (matrix similarity score (MS) ≥ 0.8473; dark grey areas) and low (MS score < 0.84731; light grey areas) scoring DRE sequences, and the absence of a DRE core (white areas) within the region of AhR enrichment. Comparing 2 and 24 hrs data identified 575 overlapping genes with AhR enrichment and 513 of these genes contained a DRE core within the region of enrichment (C).
Figure 9Circos plots integrating DRE analysis, AhR enrichment (2 hrs; FDR < 0.01) and heatmaps for hepatic differential gene expression responses (|fold change| ≥ 1.5 and P1(t) > 0.999) induced by TCDD across the genome (A) and chromosome 9 (B). The inset legend image provides information represented by each data ring. DRE matrix similarity (MS) scores and AhR enrichment values increase radially outward. The time points for the gene expression heatmaps also increase radial outward. The arc of each heatmap wedge maps directly to the location of the gene in the genome. The arc length is proportional to the length of the transcribed region. Circos plots for the other chromosomes are provided in Additional File 12.
Distribution and AhR enrichment and DRE analysis of differentially expressed genes elicited by TCDD.
| 2 hr | 4 hr | 8 hr | 12 hr | 18 hr | 24 hr | 72 hr | 168 hr | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Up-regulated | Total | 68 | 255 | 341 | 218 | 236 | 287 | 267 | 719 | |
| With AhR enrichmentb | 55 | 200 | 202 | 148 | 172 | 186 | 164 | 243 | ||
| With AhR enrichmentb + DRE corec | 47 | 168 | 171 | 126 | 146 | 156 | 135 | 181 | ||
| Down-regulated | Total | 18 | 233 | 116 | 168 | 105 | 237 | 261 | 218 | |
| With AhR enrichmentb | 10 | 123 | 76 | 102 | 59 | 131 | 137 | 123 | ||
| With AhR enrichmentb + DRE corec | 6 | 79 | 49 | 63 | 30 | 72 | 79 | 70 | ||
a |fold-change| ≥ 1.5 and P1(t) > 0.999
b AhR enriched regions at 2 hrs with FDR < 0.01
c 5'-GCGTG-3' core sequence within AhR enriched region
Functional enrichment analysis of differently regulateda genes with AhR enrichmentb using DAVID.
| Category | Team | Gene Count | Fold enrichment | P-value |
|---|---|---|---|---|
| GOTERM_BP_3 | GO:0006629 ~ lipid metabolic process | 76 | 2.23 | 5.53E-11 |
| GOTERM_BP_3 | GO:0044255 ~ cellular lipid metabolic process | 53 | 2.23 | 7.80E-08 |
| GOTERM_BP_3 | GO:0008610 ~ lipid biosynthetic process | 32 | 2.30 | 2.28E-05 |
| GOTERM_BP_3 | GO:0048523 ~ negative regulation of cellular process | 94 | 1.60 | 4.46E-06 |
| GOTERM_BP_3 | GO:0048519 ~ negative regulation of biological process | 101 | 1.54 | 7.83E-06 |
| GOTERM_BP_3 | GO:0031324 ~ negative regulation of cellular metabolic process | 45 | 1.85 | 8.99E-05 |
| GOTERM_BP_3 | GO:0051172 ~ negative regulation of nitrogen compound metabolic process | 39 | 1.96 | 9.68E-05 |
| GOTERM_BP_3 | GO:0009890 ~ negative regulation of biosynthetic process | 40 | 1.86 | 2.33E-04 |
| GOTERM_BP_3 | GO:0009892 ~ negative regulation of metabolic process | 46 | 1.74 | 3.20E-04 |
| GOTERM_BP_3 | GO:0010605 ~ negative regulation of macromolecule metabolic process | 43 | 1.72 | 6.88E-04 |
| GOTERM_BP_3 | GO:0009410 ~ response to xenobiotic stimulus | 8 | 10.59 | 3.54E-06 |
| GOTERM_BP_3 | GO:0006805 ~ xenobiotic metabolic process | 7 | 11.59 | 1.12E-05 |
| GOTERM_BP_3 | GO:0018894 ~ dibenzo-p-dioxin metabolic process | 3 | 19.86 | 7.31E-03 |
| GOTERM_BP_3 | GO:0009404 ~ toxin metabolic process | 3 | 11.92 | 2.28E-02 |
| GOTERM_BP_3 | GO:0051272 ~ positive regulation of call motion | 9 | 4.58 | 6.00E-04 |
| GOTERM_BP_3 | GO:0051270 ~ regulation of cell motion | 15 | 2.84 | 7.55E-04 |
| GOTERM_BP_3 | GO:0040017 ~ Positive regulation of locomotion | 9 | 4.26 | 1.01E-03 |
| GOTERM_BP_3 | GO:0030334 ~ regulation of cell migration | 13 | 2.87 | 1.76E-03 |
| GOTERM_BP_3 | GO:0040012 ~ regulation of locomotion | 13 | 2.39 | 7.93E-03 |
| GOTERM_BP_3 | GO:0040012 ~ regulation of localization | 32 | 1.61 | 9.36E-03 |
| GOTERM_BP_3 | GO:0048518 ~ positive regulation of biological process | 106 | 1.44 | 7.45E-05 |
| GOTERM_BP_3 | GO:0048522 ~ positive regulation of cellular process | 94 | 1.45 | 2.03E-04 |
| GOTERM_BP_3 | GO:0009893 ~ positive regulation of metabolic process | 51 | 1.53 | 2.64E-03 |
| GOTERM_BP_3 | GO:0010604 ~ positive regulation of macromolecule metabolic process | 48 | 1.54 | 3.02E-03 |
| GOTERM_BP_3 | GO:0031325 ~ positive regulation of cellular | 47 | 1.48 | 6.67E-03 |
| GOTERM_BP_3 | GO:0009891 ~ positive regulation of biosynthetic | 38 | 1.38 | 3.96E-02 |
| GOTERM_BP_3 | GO:0031325 ~ positive regulation of nitrogen compound metabolic process | 36 | 1.39 | 4.40E-02 |
| GOTERM_BP_3 | GO:0005996 ~ monosaccharide metabolic process | 21 | 2.27 | 9.50E-04 |
| GOTERM_BP_3 | GO:0005975 ~ carbohydrate metabolic process | 37 | 1.67 | 2.63E-03 |
| GOTERM_BP_3 | GO:0016051 ~ carbohydrate biosynthetic process | 11 | 2.70 | 7.19E-03 |
| GOTERM_BP_3 | GO:0044262 ~ cellular carbohydrate metabolic process | 27 | 1.70 | 9.34E-03 |
a |fold-change| ≥ 1.5 and P1(t) >0.999 at one or more time points
b AhR enriched regions at 2 hrs with FDR < 0.01
Figure 10Molecular and cellar functions over-represented by differentially regulated genes (|fold change| ≥ 1.5, P1(t) > 0.999) associated with significant AhR enrichment (FDR < 0.01) at 2 hrs. The 900 differentially regulated genes with AhR enrichment were analyzed using Ingenuity Pathway Analysis for enriched biological functions using Fisher's Exact Test (p < 0.01; orange line). The blue bars represent the log Odds value calculated from the p-value of each functional group.