| Literature DB >> 34731603 |
Seraina Blümli1, Nicola Wiechens1, Meng-Ying Wu1, Vijender Singh2, Marek Gierlinski3, Gabriele Schweikert3, Nick Gilbert4, Catherine Naughton5, Ramasubramanian Sundaramoorthy1, Joby Varghese5, Robert Gourlay5, Renata Soares5, David Clark6, Tom Owen-Hughes7.
Abstract
The ARID1A subunit of SWI/SNF chromatin remodeling complexes is a potent tumor suppressor. Here, a degron is applied to detect rapid loss of chromatin accessibility at thousands of loci where ARID1A acts to generate accessible minidomains of nucleosomes. Loss of ARID1A also results in the redistribution of the coactivator EP300. Co-incident EP300 dissociation and lost chromatin accessibility at enhancer elements are highly enriched adjacent to rapidly downregulated genes. In contrast, sites of gained EP300 occupancy are linked to genes that are transcriptionally upregulated. These chromatin changes are associated with a small number of genes that are differentially expressed in the first hours following loss of ARID1A. Indirect or adaptive changes dominate the transcriptome following growth for days after loss of ARID1A and result in strong engagement with cancer pathways. The identification of this hierarchy suggests sites for intervention in ARID1A-driven diseases.Entities:
Keywords: ARID1A; BAF; EP300; SWI/SNF; cancer; chromatin remodeling; enhancer transcription; nucleosomes
Mesh:
Substances:
Year: 2021 PMID: 34731603 PMCID: PMC8578704 DOI: 10.1016/j.celrep.2021.109943
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1Establishment of a degron to specifically degrade the ARID1A subunit of SWI/SNF complexes
(A) Western blot showing ARID1A depletion in mouse ESCs. mESCs expressing endogenous ARID1A protein homozygously tagged with mAID-GFP after the addition of 500 μM auxin for indicated time points, and ARID1A abundance in cells lines where ARID1A is knocked out or not tagged.
(B) The abundance of total cell proteins prior to and 2 h following growth in the presence of auxin indicates that ARID1A is specifically targeted for degradation via the degron.
(C) The abundance of SMARCA4-associated proteins determined by mass spectrometry is indicated as fold change of protein abundance prior to and 2 h following addition of auxin to cell media. Other than BAF-specific subunits ARID1A, DPF1, and DPF2 other subunits remain substantially associated with SMARCA4.
Figure 2Chromatin changes at different sites over time following ARID1A degradation
(A) Density plots of lost (red) and gained (blue) ATAC-seq signals changing at time points (2, 6, 18, 54, and 162 h) following degradation of ARID1A and in knockout (ARID1A−/−) mESCs. The list of sites is ordered by the time at which a change of >1.5-fold is first detected. (Last panel) Enrichment for ChIP-seq signal of the ATPase SMARCA4 (de Dieuleveult et al., 2016) at the same ATAC sites.
(B) Distribution of SMARCA4 ChIP-seq peaks and lost ATAC changes across ESC chromatin states (Pintacuda et al., 2017). Color code for chromatin states defined by histone modifications and transcription factor binding below the panel.
(C) Motif-based sequence analysis of ATAC peaks lost after 2, 6, 18, 54, and 162 h of ARID1A depletion. E values given show the significance of the motif as reported by MEME-ChIP (Motif Analysis of Large Nucleotide Datasets) (Machanick and Bailey, 2011). Only factors with an E value of 1 × 10 to 1 × 100 or lower are included.
Figure 3ARID1A organizes clusters of nucleosomes adjacent to the binding sites of pluripotency transcription factors
(A) The centers of nucleosomal fragments protected from MNase digestion obtained prior to (red) and 2 h following degradation of ARID1A (blue) are shown aligned to the consensus binding site for CTCF.
(B) The distribution of chemical cleavage directed from histone H4 S47C (Voong et al., 2016) aligned to CTCF sites is shown in green. The distance to the adjacent nucleosome (d) and the width of the distribution of histone contacts (w) are indicated.
(C and D) Distributions of the same datasets at combined SOX2/OCT4 consensus motifs.
(E) Plot of the distance (d) between adjacent nucleosomes and the width of the distribution (w) at the DNA binding motifs for a range of transcription factors (see Figure S2).
(F) The limits of the distribution of ARID1A-sensitive transcription factor-bound nucleosomes have properties expected of a barrier capable of setting the locations of adjacent nucleosomes. In the absence of ARID1A a static distribution of nucleosomes across the factor binding site causes adjacent nucleosomes to be positioned heterogeneously.
Figure 4Loss of enhancer chromatin accessibility is enriched at downregulated genes
(A) The numbers of genes changing significantly (red) at each time point represented as volcano plots. Right panels indicate changes following incubation of mESCs that do not have ARID1A mAID tagged with auxin.
(B) Transcription for genes changing (FDR < 0.05) after 2 and 6 h following ARID1A degradation and those changing later is plotted for groups sorted into quintiles based on the fold change at 6 and 162 h. A group of 500 genes selected based on no change to transcription following depletion of ARID1A is included for comparison. Changes in TT-seq at each time point are represented as a heatmap in (B). Also shown are the changes in an ARID1A −/− line and control lines in which ARID1A is not degron tagged. The average log2 fold change in expression for genes in each quintile at each time point is plotted as a histogram for early changing genes in (G) and for late genes in (H).
(C) Loss of ATAC-seq (> 1.5-fold) in the 50 kb region (excluding promoters) adjacent to differentially expressed genes.
(D) Loss of ATAC-seq (> 1.5-fold) at promoters (−500 to +500 bp from the TSS).
(E) Gain of ATAC-seq in regions 50 kb either side of the promoters of each TT-seq gene.
(F) Gained promoter ATAC-seq.
(G and H) Quintiles represent: 1, top 20% of repressed genes; 2, top 20%–40% of repressed genes; 3, top 40%–60% of genes; 4, top 20%–40% of activated genes; 5, top 20% of activated genes.
(I and J) Bar graphs showing the percentage of genes in each differentially expressed group of genes that have loss of chromatin accessibility within 50 kb on either side of the TSS. The different colored bars represent the percentage of genes that have sites of lost chromatin accessibility first detected at successive time points according to the key.
(K and L) −Log10 probability mass function for the enrichments in (I) and (J) are plotted indicating the likelihood that the enrichments would occur by chance.
(M and N) Percentages of genes in each group with promoter chromatin changes. In each histogram bars are colored according to the time at which changes in ATAC-seq are detected as indicated in the key.
Figure 5ARID1A-dependent EP300 occupancy in different chromatin contexts is associated with upregulation and downregulation of gene expression
(A) The number of sites at which EP300 occupancy (FDR < 0.2 and fold change > 1.2) changes are detected are shown for each time point following degradation of ARID1A.
(B) Intersects between called changes in ATAC-seq, EP300 occupancy and histone acetylation.
(C) The distribution of the chromatin changes indicated in (B) across promoter and enhancer chromatin states (Pintacuda et al., 2017).
(D and E) Heatmaps showing enrichment for selected factors at the genes where chromatin changes indicated in (B) occur. Heatmaps are centered with 2 kb on either side.
(F) Average changes in transcription detected by TT-seq for quintiles of genes sorted by fold-change in expression and changing (FDR < 0.05) at 2, 6 and 18 h.
(G–M) The percentage of differentially expressed genes shown in (F) that have the indicated chromatin changes within a 50-kb flanking region that excludes the promoter (G–J). Sites of gained EP300 and histone acetylation are promoter proximal, so the proportion of genes with these changes is calculated including promoters (K–M). Data relating to the genes differentially expressed at 2, 6, and 18 h are shown in the top middle and bottom row of histograms. Each histogram is organized as a block of data for each differentially expressed quintile of genes that are first differentially expressed at the time point. For each quintile a different colored bar indicates the chromatin changes first detected at that time point. (M) Sites of histone acetylation accumulated during 2–18 h that coincide with gained EP300 occupancy at each time point.
Figure 6Changes to chromatin and transcription following ARID1A degradation at the Lef1 and Lefty2 loci
(A–C) Distribution of selected factors at the Lef1 (A), Lefty2 (B), and Nodal (C) loci. Changes to chromatin accessibility (ATAC-seq), EP300, histone H3K27 acetylation, histone H3K27 methylation, and nascent transcription (TT-seq) are shown at the indicated times following addition of auxin. Sites of lost and gained occupancy are indicated by red and green lines.
Figure 7Acute and chronic changes to cancer and stem cell signaling following loss of ARID1A
(A) Illustration of the two pathways by which ARID1A and EP300 act in upregulation and downregulation of gene expression. At sites adjacent to downregulated genes, chromatin accessibility (orange dots) and EP300 occupancy (blue dots) are reduced. In contrast, upregulated genes are linked to loss of EP300 occupancy in the absence of chromatin changes. These changes are enriched at genes that change expression after 2 h. Changes in expression are detected at many more genes in subsequent hours and days, but not mechanistically linked to loss of ARID1A.
(B) The top pathways enriched following chronic ARID1A loss are illustrated together with the progressive engagement with these pathways over time, indicated by −log10 p values.
(C) Genes associated with the pathways indicated that change (FDR < 0.05, fold change > log2 0.5-fold) up to 6 h following loss of ARID1A are indicated together with the log2 transcriptional changes at all time points. The early changing components of these pathways are likely to drive downstream changes that propagate further engagement with pathways. Several early changing regulators contribute to multiple pathways. This suggests that changes to transcription of a small number of genes drives engagement with the biological pathways that shape the ARID1A−/− phenotype.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Mouse monoclonal anti-EP300 | Santa Cruz Biotechnology | sc-48343; RRID: |
| Mouse monoclonal anti-EP300 | Abcam | ab14984; RRID: |
| Rabbit polyclonal anti-H3K27ac | Abcam | ab4729; RRID: |
| Rabbit monoclonal anti-H3K27me3 | Cell Signaling Technology | #9733; RRID: |
| Rabbit polyclonal anti-ARID1A | Sigma-Aldrich | HPA005456; RRID: |
| Mouse monoclonal anti-Actin, clone AC15 | Sigma-Aldrich | A5441; RRID: |
| Rabbit polyclonal anti-SMARCA4 | Abcam | ab11064; RRID: |
| ESGRO® Leukemia Inhibitory Factor (LIF), 10 million units/1 mL | Sigma-Aldrich | ESG1107 |
| auxin analog 1-naphthaleneacetic acid, NAA | Sigma-Aldrich | N0640 |
| HAT supplement (50x) | ThermoFisher | 21060017 |
| HT supplement (100x) | ThermoFisher | 11067030 |
| Cre recombinase | N/A | |
| Flp recombinase | N/A | |
| Lipofectamine LTX Reagent with PLUS Reagent | ThermoFisher | 15338030 |
| 6-thioguanine | Sigma-Aldrich | A4882 |
| Puromycin | Sigma-Aldrich | 540411 |
| Normocin | Invivogen | ant-nr-1 |
| 4-thiouridine | Sigma-Aldrich | T4509 |
| TRIzol | Invitrogen | 15596026 |
| MTSEA-Biotin | Biotum | 90066/90066-1 |
| Triethylammonium bicarbonate buffer pH 8.5, TEAB | Sigma-Aldrich | T7408 |
| Pierce TCEP, Tris(2-carboxyethyl)phosphine hydrochloride | ThermoFisher | 20490 |
| Pierce Universal nuclease | Pierce | 88702 |
| Iodoacetamide | Sigma-Aldrich | I6125 |
| Lysyl endopeptidase, Lys-C | FUJIFILM Wako Pure Chemical Corporation | 125-05061 |
| Pierce Trypsin | ThermoFisher | 90058 |
| Sera-Mag Speed Beads | VWR | CAT# 09-981-121 |
| Sera-Mag Speed Beads | VWR | CAT# 09-981-123 |
| C7BzO | Sigma-Aldrich | C0856 |
| Roche, Nuclease S7, Micrococcal nuclease | Roche | 50-100-3364 |
| Leukocyte Alkaline Phosphatase Kit | Sigma-Aldrich | 86R |
| TURBO DNA-free kit | ThermoFisher | AM1907 |
| μMACS Streptavidin MicroBeads | Miltenyi Biotec | 130-074-101 |
| NEBNext Ultra II Directional RNA library prep Kit for Illumina with Sample Purification | New England Biolabs | E7765S |
| NEBNext Multiplex Oligos for Illumina (Index Primers Set) | New England Biolabs | E7710S |
| NEBNext Ultra II DNA Library Prep with Sample Purification Beads | New England Biolabs | E7103L |
| NEBNext Multiplex Oligos for Illumina (Dual Index Primers Set) | New England Biolabs | E7600S |
| Nextera Index kit | Illumina | 15055289 |
| Nextera DNA library Prep Kit | Illumina | 15028121 |
| NEBNext High Fidelity 2X PCR Master Mix | New England Biolabs | M0541L |
| EZQ Protein Quantification Kit | ThermoFisher | R33200 |
| C18 Sep-Pak cartridges | Waters | WAT036945 |
| CBQCA Protein Quantitation Kit | ThermoFisher | C6667 |
| TMT10plex Isobaric Label Reagent Set | ThermoFisher | 90111 |
| Dynabeads® Protein A for Immunoprecipitation | ThermoFisher | 10002D |
| Dynabeads® Protein G for Immunoprecipitation | ThermoFisher | 10004D |
| Agencourt AMPure XP beads | beckman coulter | A63881 |
| PreCR Repair Mix | New England Biolabs | M0309 |
| SimpleChIP Enzymatic Chromatin IP Kit | Cell Signaling Technologies | 9003S |
| NEBNext End Prep | New England Biolabs | E7442 |
| NEBNext Adaptor for Illumina | New England Biolabs | E7335 |
| NEBNext Q5 Hot Start HiFi Master Mix | New England Biolabs | M0543S |
| Raw NGS data | This paper | GEO: |
| Processed NGS data (bigwig files) | This paper | GEO: |
| Proteomics data ARID1A-IP-MS | This paper | PRIDE Project accession: PXD021824 |
| Proteomics data EP300-IP-MS | This paper | PRIDE Project accession: PXD021624 |
| Proteomics data SMARCA4-IP-MS | This paper | PRIDE Project accession: PXD021631 |
| Proteomics data TMT analysis | This paper | PRIDE Project accession: PXD021636 |
| ChIP data for CTCF | GSE28247, SRR172853-SRR172854 | |
| ChIP data for REST | GSE27841, SRR122473 | |
| ChIP data for SOX2, OCT4, NANOG | GSE44286, SRR713341, SRR713340, SRR713342 | |
| ChIP data for KLF4 | GSE11431, SRR002000-SRR002003, SRR002016-SRR002019, SRR001988-SRR001991 | |
| ChIP data for TAF1 | GSE30959 | |
| ChIP data for MAFK | Snyder lab, 2014 | Mouse Encode ENCFF599GJE |
| ChIP data for RNAPolIIS5P | GSE34520, SRR391032-SRR391033 | |
| ChIP data for H3K4me1, H3K4me3 | GSE89211, SRR4453260 SRR4453261 | |
| ChIP data for BRD4 | Young Lab, Whitehead Institute for Biomedical Research | GSE36561, SRR500928 |
| ChIP data for MED1 | GSE22562, SRR058987, SRR058988 | |
| ChIP data for MLL3/MLL4 | GSE98063, SRR5466740-SRR5466741 | |
| ChIP data for SMARCA4 | GSE64825, SRR1747925, SRR1747926 | |
| ChIP data for CBP | SRR1014797 | |
| Chip Data for EZH2 | GSE123174, SRR8267520-SRR8267521 | |
| ChIP data for H3 K27me3 | GSE89211, SRR4453259 | |
| E14TG2a.4 mESC, | N/A | |
| ET905, mES E14TG2a.4 TIR1 clone #905 | This paper | N/A |
| AAG57, mES E14TG2a.4 TIR1, ARID1A-mini-auxin-inducible-GFP-tagged clone #57 | This paper | N/A |
| ARID1A-ko, mES E14TG2a.4 TIR1, homozygous ARID1A ko #53 | This paper | N/A |
| gRNAs site for the C-terminal ARID1A tagging sense: CTGATGAACTCATTGGTTTC | This paper | N/A |
| gRNAs site for the C-terminal ARID1A tagging antisense: CGGCTGTCATGACTGGCCAA | This paper | N/A |
| gRNA site of mouse ARID1A ko, sense: GTGTGGAGTCTGGGACCCATA | This paper | N/A |
| gRNA site of mouse ARID1A ko, antisense: GCGGTACCCCATGACCATGCA | This paper | N/A |
| pAW5-EF1α-OsTIR1 | This study | N/A |
| pBS-mARID1A-AID-GFP-donor | This study | N/A |
| pX335-gRNA-C-terminal-ARID1A | This study | N/A |
| pU6 puro-gRNA-C-terminal-ARID1A | This study | N/A |
| pBabeD-U6-mARID1A-gRNA-KO-ex2 s | This study | N/A |
| pX335-mARID1A-gRNA-KO-ex2-as | This study | N/A |
| FastQC | ||
| Trim Galore | ||
| cutadapt | ||
| Trimmomatic | N/A | |
| MaxQuant | ||
| Perseus Software | RRID: | |
| Bowtie2 | ||
| Samtools | ||
| MACS2 | ||
| STAR aligner | ||
| deepTools | ||
| BEDTools | ||
| edgeR | ||
| Diffbind | ||