| Literature DB >> 18611952 |
Elizabeth D Wederell1, Mikhail Bilenky, Rebecca Cullum, Nina Thiessen, Melis Dagpinar, Allen Delaney, Richard Varhol, YongJun Zhao, Thomas Zeng, Bridget Bernier, Matthew Ingham, Martin Hirst, Gordon Robertson, Marco A Marra, Steven Jones, Pamela A Hoodless.
Abstract
Foxa2 (HNF3 beta) is a one of three, closely related transcription factors that are critical to the development and function of the mouse liver. We have used chromatin immunoprecipitation and massively parallel Illumina 1G sequencing (ChIP-Seq) to create a genome-wide profile of in vivo Foxa2-binding sites in the adult liver. More than 65% of the approximately 11.5 k genomic sites associated with Foxa2 binding, mapped to extended gene regions of annotated genes, while more than 30% of intragenic sites were located within first introns. 20.5% of all sites were further than 50 kb from any annotated gene, suggesting an association with novel gene regions. QPCR analysis demonstrated a strong positive correlation between peak height and fold enrichment for Foxa2-binding sites. We measured the relationship between Foxa2 and liver gene expression by overlapping Foxa2-binding sites with a SAGE transcriptome profile, and found that 43.5% of genes expressed in the liver were also associated with Foxa2 binding. We also identified potential Foxa2-interacting transcription factors whose motifs were enriched near Foxa2-binding sites. Our comprehensive results for in vivo Foxa2-binding sites in the mouse liver will contribute to resolving transcriptional regulatory networks that are important for adult liver function.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18611952 PMCID: PMC2504304 DOI: 10.1093/nar/gkn382
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Distances from sites similar to Foxa2-binding sequences to the peak maximum, normalized to peak width. This graph shows that Foxa2-binding sequence sites tended to cluster around peak maxima. (B) Peak enrichment in Foxa2-like sequences, as assessed with PWM scan scores. The Foxa2 PWM used was created from the aligned sequences used to generate the sequence logo shown in Supplementary Figure 2 (see Methods section). The graph indicates that peak genomic sequences contained higher scoring Foxa2-binding sequences than randomized peak sequences; that is, peaks were enriched relative to a random expectation.
Global peak properties and peak locations relative to UCSC mm8 known genes
| Minimum peak height | 10 |
| # Peaks | 11 475 |
| # Individual DNA reads under peaks | 285 176 |
| Average (median) peak height | 16 ( |
| Average (median) peak length (bp) | 727.5 (654) |
| # Peaks within ±1 kb TSS | 694 (6.0%) |
| # Peaks in 10 kb upstream TSS region | 1800 (15.7%) |
| # Intragenic peaks | 5575 (48.6%) |
| # Peaks in 1st intron | 1852 (16.1%) (33.2% of intragenic) |
| # Peaks within extended gene region of annotated genes | 7808 (68.0%) |
| # Peaks >50 kb from a gene | 2356 (20.5%) |
| # Genes (extended gene region) with peaks | 5060 |
| # Genes (extended gene region) with peak and SAGE tag | 2781 (43.5% of genes with tag) |
aPeak maximum falls within these regions.
While relatively few peaks (6%) had a maximum within 1 kb of a TSS, a higher than expected percentage of peaks (33.2%) had a maximum within a first intron. A peak was associated with a gene if the peak's maximum was within a region that extended from 10 kb upstream from the gene's TSS to 1 kb downstream of its transcription termination site. Genes with a peak and SAGE tag refers to peaks with maxima within the extended region of a gene that also had SAGE tag(s).
Figure 2.Characterized target Foxa2-binding sites. (A) Table of six characterized target sites in five target genes showing peak presence or absence, peak height and fold enrichment by qPCR. All three sites that contained a peak also showed enrichment above 2-fold. (B) UCSC genome browser mm8 screenshots of four target sites at two of the target genes. Of the HNF4α enhancer and promoter sites, only the enhancer was covered by a peak in our adult liver dataset. The Ttr gene has two closely related promoter sites from different publications, both of which were overlapped by a single peak in our dataset. An additional track displaying the Foxa2-like binding sequence sites from scanning our 10-mer model is also shown (‘Foxa2 logo sites’).
Figure 3.Characterization of novel target sites associated with known Foxa2-target genes. (A) UCSC genome browser screenshots of peaks associated with the Alb and Afp genes. The Alb novel site is an additional location to the characterized target site; both sites were overlapped by peaks, indicating binding by Foxa2 in the liver. The Afp novel site is an alternate location 5′ to the site characterized in embryonic liver. (B) QCPR fold enrichment for four novel target sites in characterized Foxa2-target genes averaged over three biological replicate Foxa2 ChIP experiments. Both the fold enrichment and the peak height are shown; the red line indicates the minimum peak height threshold for our dataset (i.e. 10 XSETs). All enrichments were calculated using IgG enrichment as a control in all graphs showing qPCR results. All four novel target sites had fold enrichments above 13.
Figure 4.QPCR fold enrichment of annotated genes containing Foxa2-binding site peaks. All genes shown contained SAGE tag(s) indicating that they were expressed in the adult liver. QPCR fold enrichment and peak height are shown for each target; the red line indicates the peak height threshold of 10 XSETs. As previously, IgG enrichment served as the control. (A) Annotated genes that have characterized roles in liver function. Peaks are divided into those that contained a Foxa2-binding sequence and those that did not. All eight targets showed fold enrichments above 4. (B) Annotated genes that have not previously been shown to be expressed in the liver, with peaks divided into those that contained a Foxa2-binding sequence and those that did not. Dock8 and Cask, which did not show enrichment, were the only targets out of 41 peaks tested that were not enriched. The peak height of these two targets was close to the peak height threshold. The remaining six targets all showed fold enrichments above 6. (C) Transcription factors that have a characterized role in liver function in literature. All six targets showed fold enrichments above 5. (D) Transcription factors with uncharacterized roles in the liver. Expression of these transcription factors in the liver is a novel finding. All four targets showed fold enrichments above 5.
Figure 5.Examples of peaks located >50 kb away from annotated genes. (A) UCSC genome browser mm8 screenshots of two remote peaks in regions containing no known genes. The first peak (unknown #4) lies close to a mouse EST, while the second peak (unknown #5) overlaps and is close to highly conserved sequence regions. (B) Graph of qPCR results of all six peaks lying in unknown gene regions. Four peaks contained Foxa2-binding sequences and two peaks did not. As previously, both fold enrichment by qPCR and peak height are shown. Fold enrichments are relative to IgG control. All six peaks showed fold enrichments above 19.
Figure 6.Positive correlation between fold enrichment by qPCR and peak height. Each data point represents the average fold enrichment of three separate Foxa2 ChIP experiments. As indicated, taller peaks showed greater fold enrichments. For this dataset, peak height can be considered to be a predictor of expected fold enrichment.
Figure 7.Foxa2 knockdown in Hepa1-6 cells and its effect on expression of targets identified by ChIP–Seq. Foxa2 siRNA treatment results in ∼50% knockdown of Foxa2 expression compared to control siRNA. Upon Foxa2 knockdown, expression of both known (Alb, HNF4α) and novel (HNF6, Hes1, Fgf1, Vtn) decreased. Expression of HNF1β increased, while expression of Cited2 and Gtls3 remained unchanged. The expression of Foxa1 was unaffected by Foxa2 knockdown, indicating the specificity of the Foxa2 siRNA. *P < 0.05.
The top 20 enrichment-ranked TRANSFAC v9.3 TFBS models that co-occurred within ±250 bp of Foxa2-binding sequences within peaks or occurred in within ±250 bp of a peak maximum in peaks without Foxa2 sites
| TFBS model | Model # | No. of peaks | TF in liver? | No. of genes | Level 5 GO Biological Process | Percentage (%) | |
|---|---|---|---|---|---|---|---|
| Peaks with exact FoxA2 sequence | |||||||
| FOX_Q2 | M00809 | 995 | 231.3 | Family | 621 | Cellular metabolic process | 25.0 |
| PAX6_Q2 | M00979 | 138 | 161.3 | No | 102 | Biopolymer modification/Cellular protein metabolic process | 29.0 |
| HFH3_01 | M00289 | 448 | 140.9 | No | 302 | RNA metabolic process/Cellular protein metabolic process | 21.4 |
| MAZR_01 | M00491 | 139 | 106.3 | Yes (PCR) | 83 | RNA metabolic process | 30.8 |
| HNF3B_01 | M00131 | 569 | 82.4 | Yes (tag) | 340 | Cellular protein metabolic process | 26.2 |
| HNF4_Q6_01 | M01031 | 791 | 58.8 | Yes (tag) | 462 | Cellular protein metabolic process | 24.0 |
| MAZ_Q6 | M00649 | 245 | 56.8 | Yes (tag) | 162 | Regulation of cellular metabolic process | 29.4 |
| HNF4_DR1_Q3 | M00764 | 754 | 53.3 | Yes (tag) | 462 | RNA metabolic process | 22.9 |
| SMAD3_Q6 | M00701 | 97 | 44.0 | Yes (PCR) | 64 | Cellular protein metabolic process | 33.3 |
| COUPTF_Q6 | M01036 | 479 | 43.8 | Family | 270 | Cellular protein metabolic process | 21.3 |
| PAX_Q6 | M00808 | 706 | 41.5 | Family | 433 | Cellular protein metabolic process | 26.5 |
| HNF1_Q6 | M00790 | 844 | 41.2 | Yes (tag) | 465 | Regulation of cellular metabolic process | 21.5 |
| GATA4_Q3 | M00632 | 1285 | 37.6 | Yes (tag) | 700 | Cellular protein metabolic process | 24.9 |
| HNF1_C | M00206 | 1029 | 34.8 | Yes (tag) | 549 | Cellular protein metabolic process | 29.7 |
| EVI1_04 | M00081 | 1028 | 33.5 | Yes (PCR) | 565 | Cellular protein metabolic process | 25.9 |
| PPARG_03 | M00528 | 522 | 32.2 | Yes (tag) | 310 | RNA metabolic process | 19.3 |
| LDSPOLYA_B | M00317 | 240 | 30.5 | No | 129 | RNA metabolic process | 28.8 |
| FREAC2_01 | M00290 | 34 | 30.5 | No | 23 | Regulation of cellular metabolic process/RNA metabolic process/Transcription | 43.8 |
| KROX_Q6 | M00982 | 13 | 29.0 | Family | 11 | Cell surface receptor linked signal transduction | 50 |
| FOXJ2_01 | M00422 | 124 | 27.6 | Yes (tag) | 75 | RNA metabolic process/Regulation of cellular metabolic process | 23.2 |
| Peaks with no FoxA2 sequence | |||||||
| MAZR_01 | M00491 | 138 | 82.9 | Yes (PCR) | 77 | Cellular protein metabolic process | 29.2 |
| PAX6_Q2 | M00979 | 69 | 57.0 | No | 44 | Intracellular signaling cascade | 31.8 |
| MAZ_Q6 | M00649 | 269 | 47.0 | Yes (tag) | 164 | RNA metabolic process/Transcription/Regulation of cellular metabolic process | 31.2 |
| KROX_Q6 | M00982 | 25 | 39.0 | Family | 20 | Regulation of cellular metabolic process/RNA metabolic process/Transcription/Cellular protein metabolic process | 33.3 |
| HNF4_Q6_01 | M01031 | 576 | 21.7 | Yes (tag) | 361 | Cellular protein metabolic process | 25.7 |
| UF1H3BETA_Q6 | M01068 | 2 | 19.9 | No | 1 | 15 different terms | 100 |
| HNF4_DR1_Q3 | M00764 | 537 | 17.4 | Yes (tag) | 321 | Cellular protein metabolic process | 21.9 |
| SMAD3_Q6 | M00701 | 51 | 15.5 | Yes (PCR) | 31 | Intracellular signaling cascade/RNA metabolic process | 19.2 |
| NRF1_Q6 | M00652 | 5 | 14.9 | Yes (tag) | 6 | Response to oxidative stress/Carboxylic acid metabolic process/Cellular protein metabolic process/Mitochondrion organization and biogenesis/Macromolecule biosynthetic process/DNA metabolic process | 33.3 |
| COUPTF_Q6 | M01036 | 333 | 14.7 | Family | 225 | Cellular protein metabolic process | 26.6 |
| MIF_01 | M00279 | 6 | 12.3 | Yes (PCR) | 4 | Cytoskeleton organization and biogenesis/Establishment of cellular localization | 100 |
| SMAD4_Q6 | M00733 | 46 | 11.8 | Yes (tag) | 29 | Phosphate metabolic process/Cellular protein metabolic process | 28.6 |
| ATF_B | M00338 | 4 | 11.7 | Family | 3 | RNA metabolic process/Regulation of cellular metabolic process/Transcription | 100 |
| PPARG_03 | M00528 | 423 | 11.1 | Yes (tag) | 246 | Cellular protein metabolic process | 24.8 |
| PAX_Q6 | M00808 | 483 | 10.9 | Family | 287 | Cellular protein metabolic process | 30.9 |
| SF1_Q6 | M00727 | 118 | 10.9 | Yes (tag) | 83 | Regulation of cellular metabolic process | 32.6 |
| BRN2_01 | M00145 | 75 | 10.8 | No | 21 | Regulation of cellular metabolic process | 50 |
| RREB1_01 | M00257 | 9 | 10.4 | Yes (tag) | 7 | Regulation of cellular metabolic process/RNA metabolic process/Transcription | 50 |
| SP3_Q3 | M00665 | 59 | 10.0 | Yes (tag) | 36 | Cellular protein metabolic process | 47.6 |
| LHX3_01 | M00510 | 11 | 8.3 | No | 5 | Regulation of cellular metabolic process/RNA metabolic process/Transcription | 66.7 |
Included in the table are: (i) the number of peaks that contained the TFBS model; (ii) the Z-score, which represents the difference between the MATCH v2.6 PWM scan enrichment of real peak sequences and the mean for randomized sequences, divided by the standard deviation for randomized sequences (see Methods section); (iii) an indication of whether the transcription factor is expressed in the adult liver, based either on SAGE tag(s) from our adult liver library or RT–PCR amplification in the adult liver; (iv) the number of annotated genes that contained at least one peak within its extended gene region (10 kb upstream of the TSS to 1 kb downstream of the TTS); (v) the ‘level 5’ Gene Ontology Biological Process category assigned to the largest subgroup within these gene groups and (vi) the corresponding percentage of genes in the subgroup. TFBS models that co-occurred with a Foxa2-binding sequence included both transcription factors that have previously been demonstrated to interact with Foxa2 (HNF4 and members of the Coup-TF family) and also novel potential interacting transcription factors (Smad3 and Maz). Ten of the TFBS models that co-occurred with Foxa2-like binding sequences in peaks containing such sequences also occurred in peaks that did not contain Foxa2-like binding sequences.
aRegulation of biological process; cellular metabolic process; macromolecule metabolic process; primary metabolic process; cell organization and biogenesis; regulation of metabolic process; regulation of cellular process; organelle organization and biogenesis; biopolymer metabolic process; nucleobase, nucleoside, nucleotide and nucleic acid metabolic process; chromosome organization and biogenesis; regulation of cellular metabolic process; RNA metabolic process; transcription; DNA metabolic process.
Figure 8.The distribution of TFBS sequence models relative to the corresponding Foxa2-binding sequence for three of the top 20 cotranscription factors in peaks containing a Foxa2-binding sequence. (A) The distribution of GATA4 TFBS models (Transfac model M00632). Motifs for this transcription factor show a uniform distribution across the 500 bp sequence. (B) The distribution of Pax6 TFBS models (Transfac model M00979). Pax6 motifs clustered at a distance of ∼40 bp away from the Foxa2-binding sequence (P = 1.1E-3). (C) The spatial distribution of HNF1 TFBS models (TRANSFAC model M00790). HNF1 motifs are spatially concentrated near the Foxa2-binding sequence (P = 5.7E-9).