| Literature DB >> 33674350 |
Jungkyun Seo1,2,3,4, D Dewran Koçak3,4,5, Luke C Bartelt3,6, Courtney A Williams3,4, Alejandro Barrera1,3,4, Charles A Gersbach2,3,4,5,6,7, Timothy E Reddy1,2,3,4,5,6,8.
Abstract
The AP-1 transcription factor (TF) dimer contributes to many biological processes and environmental responses. AP-1 can be composed of many interchangeable subunits. Unambiguously determining the binding locations of these subunits in the human genome is challenging because of variable antibody specificity and affinity. Here, we definitively establish the genome-wide binding patterns of five AP-1 subunits by using CRISPR to introduce a common antibody tag on each subunit. We find limited evidence for strong dimerization preferences between subunits at steady state and find that, under a stimulus, dimerization patterns reflect changes in the transcriptome. Further, our analysis suggests that canonical AP-1 motifs indiscriminately recruit all AP-1 subunits to genomic sites, which we term AP-1 hotspots. We find that AP-1 hotspots are predictive of cell type-specific gene expression and of genomic responses to glucocorticoid signaling (more so than super-enhancers) and are significantly enriched in disease-associated genetic variants. Together, these results support a model where promiscuous binding of many AP-1 subunits to the same genomic location play a key role in regulating cell type-specific gene expression and environmental responses.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33674350 PMCID: PMC8015846 DOI: 10.1101/gr.267898.120
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Characterization and dissection of AP-1 subunit occupancy. (A) Schema depicting the genome-engineering strategy. (B) Western blot of engineered cell lines, confirming tagging of the desired gene. (C) Bar plot showing the distribution of AP-1 binding classes across the genome defined by the varying levels of subunit occupancy in response to dexamethasone. (D) Browser tracks of various AP-1 binding configurations. Chromatin signatures and TF occupancy at sites enriched by various AP-1 configurations after GR activations in A549 cells are shown. The identified configurations are depicted by red (AP-1 hotspots), blue (dimer), and green (singleton), respectively. (E) Sitepro plot showing the AP-1 motif per bp per peak for distinct AP-1 binding classes. P-values were calculated using a two-sided Student's t-test based on the normalized motif density within a 1-kb window of peaks. (***) P < 0.001. (F) Sitepro plot showing 95% confidence intervals (C.I.) for the differences in the normalized AP-1 motif density between different AP-1 binding classes across flanking regions of peaks; 95% C.I.’s were calculated using a two-sided Student's t-test based on each of a 7-bp sliding window from the center of the peaks. (G) Sitepro plot showing DNA shape information featured by propeller twist for each AP-1 binding class. P-values were calculated using a two-sided Student's t-test based on quantified DNA shape information within a 100-bp window centered on the motif. (***) P < 0.001, not significant (N.S.) = P > 0.05. (H) Sitepro plot showing GC-matched sequence conservation defined by phastCons100way scores for each AP-1 binding class. P-values were calculated using a two-sided Student's t-test based on the phastCons100way scores within a 400-bp window centered on the motif. (***) P < 0.001, N.S. = P > 0.05.
Figure 2.Genomic and chromatin landscape of distinct AP-1 configurations. (A) The distribution of proportion that AP-1 hotspots are localized within super-enhancers and typical enhancers marked by H3K27ac before and after dex treatment. (B) Bar plot showing the odds ratio between super-enhancers and different AP-1 configurations after GR activation. Error bars represent 95% confidence intervals at a given time point. (C) Relative de novo motif enrichment of AP-1 hotspots. Z-scores are based on distribution of significances for given motifs using bootstrapped replicates of other AP-1 binding modes that matched in number for a de novo motif enrichment test. Significant motifs are highlighted. (D) Spatial distribution of TFs and histone marks for distinct AP-1 binding classes. P-values were calculated using a two-sided Student's t-test based on ChIP-seq signal intensity within 1 kb centered on each peak. (***) P < 0.001, N.S. = P > 0.05.
Figure 3.Genomic landscape of various AP-1 binding modes in glucocorticoid response. (A) Cumulative distribution of relative distance of AP-1 binding classes to the nearest genomic loci occupied by GR. (B) Heat map showing the means of the Jaccard index of overlaps between AP-1 classes and genomic occupancies of TFs. The means are based on the distribution of the indexes from bootstrapped replicates of genomic regions that matched in number. (C–E) Spatial distribution of dex-responsive TFs and histone marks for distinct AP-1 binding classes: (C) glucocorticoid receptor (GR); (D) coactivator EP300; (E) enhancer activity defined by genome self-transcribing active regulatory region-seq (STARR-seq). P-values were calculated using a two-sided Student's t-test based on ChIP-seq signal intensity within 1 kb centered on each peak. (***) P < 0.001. (F) The distribution of proportions of dynamic GR bindings explained by pre-established factors including AP-1s, EP300, and super-enhancers at a pre-dex time point.
Figure 4.Changes in AP-1 hotspot binding link to changes in gene expression. (A–C) Temporal gene expression trajectory plot showing log2 fold change in gene expression mapped to the gains of each AP-1 binding classes in response to dex according to (A) eQTLs, (B) Activity by Contact defined by Hi-C coupled with chromatin accessibility (DNase-seq) and H3K27ac, and (C) the proximity to the nearest genes. (D–F) Temporal gene expression trajectory plot showing log2 fold change in gene expression mapped to the losses of each AP-1 binding classes in response to dex according to (D) eQTLs, (E) Activity by Contact, and (F) the proximity to the nearest genes. (G–I) Temporal gene expression trajectory plot showing log2 fold change in gene expression mapped to maintained AP-1 binding classes in response to dex according to (G) eQTLs, (H) Activity by Contact, and (I) the proximity to the nearest genes. P-values were calculated using a Wilcoxon rank-sum test based on changes in log2 folds at 12 h. (**) P < 0.01, (***) P < 0.001, N.S. = P > 0.05.
Figure 5.Distinct genomic and regulatory features of cell type–specific AP-1hotspots. (A) Bar plot showing the number of distinct AP-1 binding modes including AP-1 hotspots, AP-1 heterodimer, and singleton in A549 and K562 cells. (B) Venn diagram showing overlap between AP-1 hotspots in A549 and K562. (C) Sitepro plot showing the AP-1 motif per bp per peak for distinct AP-1 binding classes in K562 cells. P-values were calculated using a two-sided Student's t-test based on the normalized motif density within 1 kb of peaks. (***) P < 0.001. (D) Box plots showing the distribution of rank-correlation for TFs and histone marks between cell type–specific and –shared AP-1 hotspots. Each rank-correlation is based on the effect size for the Mann–Whitney U test between size- and number-matched bootstrapped replicates from cell type–specific and –shared AP-1 hotspots for binding activity of TFs. (E) Cumulative distribution of distance to nearest TSS for cell type–specific and –shared AP-1 hotspots. (F) Bar plot displaying the proportion of cell type–specific and –shared AP-1 hotspot loci by genomic annotation. (G) Bar plot showing the distribution of log2 fold change in gene expression in response to dex at a 12-h time point by genomic annotation for pre-established cell type–specific and –shared AP-1 hotspots. y-axis represents genomic annotations: (a) TSS, (b) Promoter-TSS, (c) Noncoding, (d) Intron, (e) Intergenic, (f) Exon, (g) 5′ UTR, and (h) 3′ UTR. (H) MSigDB Perturbation ontology enrichment using the genomic region-based binomial test for cell type–specific AP-1 hotspots in A549 and K562 cells. y-axis shows the cell type used in the perturbation study (Cell type, PubMed PMID for each data set). The color and size of dots represent the cell types and adjusted P-value, respectively.
Figure 6.Disease-associated genetic variants and GO enrichment in distinct AP-1 binding classes. (A) Bar plot showing the proportion of each AP-1 binding class that contains at least single GWAS SNP. (B) Box plot showing the distribution of trait-associated SNP density (SNP/100 kb sequence) for distinct AP-1 binding classes. Each dot represents the SNP density based on the same number of loci repeatedly subsampled from each class. P-values were calculated using Wilcoxon rank-sum test. (***) P < 0.001, N.S. = P > 0.05. (C) GO enrichments using the genomic region-based binomial test (left) and gene-based hypergeometric test (right) for distinct AP-1 binding classes. The color and size of dots represent the different AP-1 binding classes and the adjusted P-values for a given annotation, respectively. (D) Disease Ontology (DO) enrichments using the genomic region-based binomial (left) and gene-based hypergeometric test (right) for distinct AP-1 binding classes. The color and size of dots represent the different AP-1 binding class and the adjusted P-value for a given annotation, respectively.