| Literature DB >> 31759822 |
Roberto Ferrari1, Lara Isabel de Llobet Cucalon2, Chiara Di Vona3, François Le Dilly2, Enrique Vidal2, Antonios Lioutas2, Javier Quilez Oliete2, Laura Jochem4, Erin Cutts4, Giorgio Dieci5, Alessandro Vannini6, Martin Teichmann7, Susana de la Luna8, Miguel Beato9.
Abstract
How repetitive elements, epigenetic modifications, and architectural proteins interact ensuring proper genome expression remains poorly understood. Here, we report regulatory mechanisms unveiling a central role of Alu elements (AEs) and RNA polymerase III transcription factor C (TFIIIC) in structurally and functionally modulating the genome via chromatin looping and histone acetylation. Upon serum deprivation, a subset of AEs pre-marked by the activity-dependent neuroprotector homeobox Protein (ADNP) and located near cell-cycle genes recruits TFIIIC, which alters their chromatin accessibility by direct acetylation of histone H3 lysine-18 (H3K18). This facilitates the contacts of AEs with distant CTCF sites near promoter of other cell-cycle genes, which also become hyperacetylated at H3K18. These changes ensure basal transcription of cell-cycle genes and are critical for their re-activation upon serum re-exposure. Our study reveals how direct manipulation of the epigenetic state of AEs by a general transcription factor regulates 3D genome folding and expression.Entities:
Keywords: 3D genome structure; ADPN; Alu elements; CTCF; H3K18ac; Pol II; TFIIIC; breast cancer; cell cycle; serum starvation
Mesh:
Substances:
Year: 2019 PMID: 31759822 PMCID: PMC7014570 DOI: 10.1016/j.molcel.2019.10.020
Source DB: PubMed Journal: Mol Cell ISSN: 1097-2765 Impact factor: 17.970
Figure 1TFIIIC Occupancy at AEs Increases upon Serum Depletion in Tumor and Non-Tumor Cells
(A and B) Venn diagram of overlapping peaks for TFIIIC (GTF3C2) (A) and CTCF (B) in T47D in the presence (+S, red) or absence (–S, gray) of serum.
(C) Stacked plot for TFIIIC peaks over AEs, tDNA, or other loci in T47D in the presence (+S) or absence (–S) of serum. A comparison to a random set of genomic regions of same size as those in –S is also included (p value for the +S versus –S comparison, chi-square test due to differences in sample size; for the –S versus random comparison, two-tailed paired t test).
(D) Bar plot with the number of TFIIIC-bound AEs within a ±5 kb window around all human TSSs in T47D in the absence (–S) and presence of serum (+S) (p value, Kolmogorov-Smirnov test).
(E) cis-regulatory annotation system (CEAS) plots of the change in TFIIIC average binding in T47D in the presence (+S) or absence (–S) of serum (+S subtracted from –S) at TFIIIC-associated promoters (red). The profile of a random set of genes of the same size of the TFIIIC-associated promoters (purple), as well as the average for all human TSSs (black) are included.
(F) Genome browser view of the representative cell-cycle-regulated locus HELLS with ChIP-seq tag counts for TFIIIC in the presence (+S) or absence (–S) of serum. The AE bound by TFIIIC is highlighted with a gray rectangle. HELLS’ genomic structure and direction of transcription (arrow) are shown.
(G) CEAS plots of TFIIIC enrichment in T47D cells in the presence (+S), absence (–S), or absence followed by serum addition for 30 min (–S/+S-30 min). The graphs are plotted over the summit of TFIIIC peaks in the –S condition (–log10 of the Poisson p value).
Figure 2ADNP Guides TFIIIC Selective Recruitment to AEs and TFIIIC-Associated Promoters
(A) Area plot ranking unique spectral counts (average of three replicates) of proteins identified in a TFIIIC-RIME experiment using GTF3C2 as bait in T47D cells grown in the presence (+S) or absence of serum (–S). Data from control immunoprecipitations using normal IgG in the presence of serum are also included (blue). Prey proteins belonging to the ChAHP complex are shown in purple.
(B) Direct interaction of recombinant ADNP and TFIIIC. For TFIIIC expression, insect cells were infected with a single baculovirus expressing GTF3C1-3XFLAG and the rest of the TFIIIC subunits (GTF3C2-6) untagged. Left panel: Coomassie staining of anti-FLAG immuno-purifications of insect cell lysates overexpressing FLAG-TFIIIC (lane 1), HA-ADNP (lane 2), or both simultaneously (lane 3). Note that GTF3C1 immunopurification via the FLAG-tag pulled down the whole TFIIIC complex (lane 1). ADNP co-purifies with TFIIIC (lane 3). GTF3C5 fractionates on top of the heavy IgG chain (IgGH). Right panel: Coomassie staining of anti-HA immunopurification of lysates expressing FLAG-TFIIIC alone (lane 4), HA-ADNP alone (lane 5), or both together (lane 6). The presence of the TFIIIC subunits is detected in the ADNP immuno-complexes. The position of the IgG heavy and light chains (IgGH and IgGL, respectively) is indicated. The identity of the bands was confirmed by mass spectrometry and western blotting with HA and GTF3C2 antibodies (Figure S2A).
(C) Heatmap of ADNP-eGFP binding in K562 cells (GEO: GSE105573) spanning ±1 kb across all TFIIIC-bound AEs in T47D, ranked from high to low ADNP-eGFP enrichment. Color bar scale with increasing shades of color stands for higher enrichment (–log10 of the Poisson p value).
(D) Venn diagram with the number of TFIIIC peaks upon ADNP depletion (siADNP, yellow) compared to control cells (siCTRL, blue) in serum-starved T47D cells.
(E) Sitepro profile of TFIIIC occupancy in control (siCTRL) and ADNP-depleted cells (siADNP) in serum-starved T47D cells. The graphs are plotted over the summit of TFIIIC peaks in the siCTRL cells (p value, two-tailed unpaired t test). ADNP depletion levels are shown in Figures S2E and S2F.
(F) Genome browser view of representative TFIIIC-associated genes FEM1A and HELLS showing TFIIIC occupancy in ADNP-depleted (siADNP) and T47D control cells (siCTRL). ADNP binding from GEO: GSE105573 is represented by the blue track (eGFP-ADNP). The gene structure and the direction of transcription (arrow) are shown at the bottom.
Figure 3TFIIIC-Associated HAT Activity Acetylates H3K18 at Nucleosomes over AEs in Response to Serum Depletion
(A) ATAC-seq signal enrichment in T47D grown in the presence (+S) or in the absence of serum (–S) across TFIIIC-bound AEs detected in the absence of serum (–S) (–log10 of the Poisson p value). p value, two-tailed t test comparing each condition to its random control.
(B) Average profile of H3K18ac enrichment in T47D grown in the presence (+S,) or absence of serum (–S) across all TFIIIC-bound AEs (–log10 of the Poisson p value). p value, two-tailed paired t test.
(C) Sitepro average plot of nucleosome signals derived from ATAC-seq by selecting sequenced fragments of 180–250 bp (Buenrostro et al., 2013) spanning a 1 kb region over all the AEs bound by TFIIIC in T47D grown in the presence (+S) or absence (–S) of serum. Note the presence of a peak in the center of the region representing the AE (gray rectangle). SS causes a decrease in the nucleosome signal for both the AE-nucleosome and the surrounding ones (two-tailed unpaired t test).
(D) Genome browser view of representative cell-cycle-related TFIIIC-associated promoters with normalized counts of ATAC-derived nucleosome signal for T47D grown in the presence (+S) or absence (–S) of serum. The position of the AE bound by TFIIIC positive for H3K18ac is indicated by a yellow rectangle. AE nucleosomes reduced upon SS are highlighted by a gray rectangle.
(E) Venn diagram showing the total number of AEs bound by TFIIIC and those acetylated in H3K18 in serum-starved T47D.
(F) Venn diagram showing the total number of TFIIIC-associated Pol II promoters and those enriched in H3K18ac in T47D grown in the absence of serum.
(G) Plot for p300 occupancy across all TFIIIC-bound AEs spanning a 6 kb region in T47D grown in the presence (+S) or absence (–S) of serum.
(H) H3K18ac immunostaining (red) in serum-starved T47D cells knocked down for GTF3C1 (siGTF3C1) or control (siCTRL). DAPI was used to stain nuclei (blue). Two different fields are shown. Scale bar, 100 μm. GTF3C1 depletion levels are shown in Figure S3H and quantification in Figure S3I.
Figure 4TFIIIC Controls Gene Expression in Response to SS via AEs-Mediated Long-Range Chromatin Looping
(A) Hi-C analysis of intra-TAD contacts represented as log2 fold change (Observed / Expected: O/E) and 95% confidence interval (CI) of the contact enrichment between TFIIIC-bound AEs and Pol II promoters for genes differentially expressed (changed) or not affected (unchanged) by SS in T47D cells. Posterior probability (PPr) = 0.93.
(B) Heatmap matrix of Hi-C interactions of PLIN4/5-UHRF1 loci (left panel) and CCNF locus (right panel) for TFIIIC-bound AEs in T47D in the presence or absence of serum. Color scale bar is reported with red representing values from 200 and above, white values equal to 100, and blue values equal to 0. The loops representing the higher frequency of interactions of the AEs and the corresponding genes are framed (Hi-C data, top panel). TADs are indicated by blue rectangles. The regions with changes in their interactions upon SS are framed with gray rectangles and zoomed out to help the visualization of the genome browser tracks of mRNA-seq (A and B indicate biological replicates; the two strands are represented separately), and ChIP-seq data for the protein/histone marks are indicated. The gene structure and transcription direction are reported at the bottom of the figure. Gray bold rectangles highlight the mRNA-seq regions with significant changes for the PLIN4/5, UHRF1, and CCNF loci. Red bold rectangles highlight the position of TFIIIC binding to the AEs.
(C) Heatmaps of differential gene expression for siGTF3C5 and siCTRL T47D cells in the absence of serum compared to mock-transfected cells in the absence of serum. Color bar scale stands for log2FC of normalized RNA expression in each condition. Only the genes that changed their expression significantly in siGTF3C5 and not in siCTRL are shown. Two classes of genes were designated as TFIIIC-activated or TFIIIC-repressed genes. Cell-cycle-related genes whose expression was drastically reduced by siGTF3C5 are indicated, including UHRF1 and CCNF further analyzed in (B).
(D) TFIIIC contact enrichment for the TFIIIC-activated and TFIIIC-repressed genes using Hi-C data in the absence of serum (p value for logistic regression comparing the two groups).
(E) Venn diagram showing the overlap between the TFIIIC-activated genes and those bound by TFIIIC either directly within a 10 kb region or via DNA looping (TFIIIC-looped genes).
Figure 5Serum Depletion Induces TFIIIC and CTCF Interaction to Allow Long-Range DNA Looping and H3K18 Hyperacetylation of Distal TFIIIC-Looped Genes
(A) Boxplot of CTCF binding events within a 10 kb region around the TSSs of TFIIIC-looped genes or of a random dataset of TSS of the same size (p value, Friedman X2 test).
(B) Genome browser view of representative cell-cycle-related TFIIIC-activated genes CCND1, CCNF, CENPE, and POLQ. ChIP-seq data of CTCF and TFIIIC in T47D in the presence (+S) or absence (–S) of serum. Note that multiple CTCF binding sites (gray boxes) are present at the 5′ end of these genes but no TFIIIC is detected. Transcription directionality is indicated with an arrow.
(C) Immunoprecipitation of CTCF in soluble extracts from T47D cell grown in the presence (+) or absence (–) of serum (“beads” are used as a specificity control). Input lysates (10%) and immunoprecipitates were probed with CTCF and GTF3C2 antibodies (as a marker for TFIIIC).
(D) Hi-C analysis of TFIIIC and CTCF contacts represented as log2 FC (O / E∗; 95% CI) of the specific CTCF-TFIIIC contacts (O = observed) compared to the expected (E∗) CTCF and TFIIIC additive effect for both +S and –S conditions. The PPr value indicates a high probability of an increase in TFIIIC/CTCF Hi-C contacts in the absence versus presence of serum.
(E) Genomic representation of a 1 Mb region of chromosome 17 containing an AE-TFIIIC-CTCF loop induced by SS (framed with a circle in the Hi-C heatmaps). Color scale bar is reported with red representing values from 200 and above, white values equal to 100, and blue values equal to 0. ChIP-seq data for CTCF and TFIIIC in the presence (+S) or absence (–S) of serum is included at the top and bottom, respectively. Matrix heatmaps with Hi-C data are also included and aligned with the ChIP-seq data. Gray rectangles highlight the position of the AE-TFIIIC and the CTCF interacting regions.
(F) Heatmap representation of H3K18ac spanning a 20 kb region of all human promoters in T47D cells grown in the presence or absence of serum. Biased clustering shows promoters with increased H3K18ac in the absence of serum, and the cluster contains several TFIIIC-looped genes (selected examples are listed).
(G) Stacked plot representing changes in H3K18ac induced by SS in TFIIIC-looped genes. Note that around 70% of them display increased acetylation.
Figure 6Impaired DNA Looping at TFIIIC-Looped Genes by TFIIIC Depletion or AEs Deletion Abrogates the Reactivation of Gene Expression upon Serum Re-exposure
(A) Changes in specific intra-TADs contacts made by TFIIIC in T47D treated with siGTF3C5 or siCTRL in the absence of serum. Data are the log2 FC of observed versus expected (95% CI) of Hi-C data. The changes in siGTF3C5 versus siCTRL show significant (PPr = 0.99) decrease of total intra-TADs contacts compared to mock-transfected (–S) versus siCTRL.
(B) Hi-C matrix heatmaps of the UHRF1 locus for siCTRL and siGTF3C5 cells in the absence of serum. Color scale bar is reported with red representing values from 200 and above, white values equal to 100, and blue values equal to 0. The looping between the AE bound by TFIIIC and its respective targets (PLIN4/5 and UHRF1) is indicated by a box when detected and an arrow when lost. Note that even in the absence of serum the Alu-FEM1A loops back to the PLIN4/5 locus upon siGTF3C5 (as reported for the condition of presence of serum in Figure 4B).
(C) Venn diagram showing the overlap between the TFIIIC-looped and the serum-dependent genes compared to the regions showing compartment switch (Figure S4B). Note the higher overlap of the regions showing compartment changes with the serum-dependent genes (Fisher’s exact test).
(D) RNA-seq expression analysis in T47D cells of TFIIIC-activated genes after serum re-exposure for the indicated times in conditions of siCTRL and siGTF3C5. Expression data are reported as RPM (∗∗p < 1x10−12, two-tailed paired t test in comparisons for each time-point). The comparison between 0 to 3 h in siCTRL cells is shown to highlight the rapidity of gene activation after serum addition (∗∗∗p < 1x10−20). GTF3C5-depleted levels were maintained during the time course (Figure S6B).
(E) Schematic representation of the CRISPR-Cas9 approach to delete the TFIIIC-bound AE located between the PLIN4/5 and UHRF1 loci in chromosome 17. The wild-type (WT) and the deleted alleles are shown. The targeted AE is shown as a purple box, the position of the guide RNAs (gRNA1 and gRNA2) is marked with triangles, and the primers used for the screen (Figure S6C) are indicated with blue arrows. Arrows indicate the chromatin interactions in +S (red) and –S (black) conditions based on Figure 4B.
(F) UHRF1 expression by qRT-PCR in T47D parental cells (WT) and AE-deleted (Clone11) in the absence of serum (mean ± SEM of biological duplicates with data in WT cells normalized to 1; one-tailed t test).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| rabbit polyclonal antibody anti-GTF3C2 | This paper | NA |
| mouse monoclonal antibody anti-GTF3C2 | Abcam | RRID: |
| rabbit polyclonal antibody anti-BDP1 | NA | |
| rabbit polyclonal antibody anti-RPC39 | NA | |
| rabbit polyclonal antibody anti-H3K18ac | Active motif | RRID: |
| rabbit polyclonal antibody anti-CTCF | Millipore | RRID: |
| rabbit monoclonal antibody anti-CTCF | Abcam | ab128873 |
| rabbit polyclonal antibody anti-Pol II | Cell Signaling | (NTD) (D8L4Y, Cat# 14958 RRID: |
| rabbit affinity-purified anti-GTF3C5/TFIIIC63 | Bethyl | RRID: |
| rabbit affinity-purified anti-GTF3C1/TFIIIC220 | Novus Biologicals | RRID: |
| rabbit affinity-purified anti-GTF3C4/TFIIIC90 | Abcam | RRID: |
| mouse Monoclonal antibody anti-tubulin | Sigma | RRID: |
| rabbit anti-GTF3C1/TFIIIC220 Antibody, Affinity Purified ChIP | Bethyl | RRID: |
| rabbit anti-GTF3C1/TFIIIC220 Antibody, Affinity Purified WB | Bethyl | RRID: |
| mouse monoclonal [102C1a] to ADNP | Abcam | RRID: |
| Rabbit polyclonal HA tag antibody | Abcam | RRID: |
| rabbit affinity-purified anti-EP300 | Santa Cruz | RRID: |
| rabbit affinity-purified anti-EP300 | Santa Cruz | RRID: |
| Rabbit affinity-purified anti-H3K9ac | Active Motif | RRID: |
| Mouse Monoclonal anti-H3K27ac | Millipore | RRID: |
| Rabbit polyclonal to Histone H1.2 | Abcam | RRID: |
| Horseradish peroxidase conjugated anti-mouse | Amersham | NA931V |
| Horseradish peroxidase conjugated anti-rabbit | Amersham | NA934V |
| Anti-DYKDDDDK resin | Genscript | L00432 |
| Anti-HA Agarose | Pierce | 26181 |
| One Shot Stbl3 Chemically Competent | Invitrogen | C7373-03 |
| RPMI Red phenol | GIBCO | 42401-018 |
| RPMI No Phenol Red | GIBCO | 32404-014 |
| DMEM Phenol Red | GIBCO | 41965-039 |
| DMEM Phenol Red | GIBCO | 21063-029 |
| DMEM/F12 Phenol Red | GIBCO | 11330-032 |
| DMEM/F12 No Phenol Red | GIBCO | 11039-021 |
| EMEM Phenol Red | GIBCO | 31095-029 |
| EMEM No Phenol Red | GIBCO | 51200-038 |
| Fetal Bovine Serum | GIBCO | 10270-106 |
| Fetal Bovine Serum, charcoal stripped | GIBCO | 12676029 |
| 0.5% Trypsin-EDTA 1x | GIBCO | 25300-054 |
| L-Glutamine 200 mM 100x | GIBCO | 25030-024 |
| Penicillin-Streptomycin | GIBCO | 15140-122 |
| Human Insulin (Humulin regular) | Lilly | U100 |
| EGF | SIGMA | E-9644 |
| Hydrocortisone | SIGMA | H-0888 |
| Horse Serum | Life technologies | 16050122 |
| Cholera Toxin | SIGMA | C8052 |
| Trizol Reagent | Ambion | 15596018 |
| Lipofectamine 3000 | Invitrogen | 11668-019 |
| Proteinase K | ThermoFisher Scientific | AM2546 |
| Protein G Plus / Protein A Agarose | Millipore | IP05 |
| protease inhibitor cocktail, cOmplete EDTA-free | Roche | 05 892 791 001 |
| Na3VO4 | ||
| MboI | New England BioLabs | r0147-mboi |
| Herculase II Fusion DNA Polymerase | Agilent | 600675 |
| AMPure XP beads | Beckman Coulter | A63881 |
| Endopeptidase LysC, | Wako | 125-05061 |
| Sequencing grade Trypsin | Promega | V5111 |
| Ovation Ultralow DR Multiplex System 9-16 kit | NUGEN | 0535-32 |
| Micro BCA Protein Assay Kit | ThermoFisher Scientific | 23235 |
| qScript cDNA Synthesis kit | Quanta Biosciences | 95047-025 |
| LightCycler FastStart DNA Master SYBR Green I kit | Roche | 03 515 885 001 |
| Illumina TruSeq kit | Illumina | 20020594 |
| TruSeq Stranded Total RNA Library Prep Human/Mouse/Rat | Illumina | 20020596 |
| Pierce Coomassie (Bradford) kit | Thermo Fisher | 23200 |
| NEBNext DNA Library Prep Kit | New England BioLabs | NEB #E7645 |
| Qubit HS kit | Thermo Fisher | Q32854 |
| FISH Tag RNA Multicolor Kit | Thermo Fisher | mp32956 |
| Gibson Assembly Master | New England Biolabs | E2611S |
| BsmbI | Thermo Fisher | ER0451 |
| Anti-DYKDDDDK G1 Affinity Resin | Genscript | L00432 |
| Anti-HA Agarose | Thermo Fisher Scientific | 26181 |
| NuPAGE MOPS SDS Running Buffer | Novex by Life Technology | NP0001 |
| NuPAGE 4-12% Bis Tris Gels | Invitrogen | NP0335box |
| Skim Milk Powder | Sigma | 70166-500G |
| XCell SureLock Mini-Cell Electrophoresis System | Thermo Fisher | EI0001 |
| ChIP-seq, RNA-seq, Hi-C data | This study | GSE120162 |
| Human reference genome NCBI build 38, GRCh38 | Genome Reference Consortium | |
| CTCF ChIP-seq of T47D in -S | GSE53463 | |
| ChIP-seq of eGFP-ADNP in K562 cells | GSE105573 | |
| ChIP-seq of mouse Adnp | GSE97945 | |
| Gene expression data of breast cancer samples | ||
| EP300 ChIP-seq in T98G | GSE21026 | |
| Proteomic data: PRIDE | This study | |
| Mendeley | This study | |
| T47D | ATCC | CRL-2865 |
| T98G | ATCC | CRL-1690 |
| IMR90 | ATCC | CCL-186 |
| MCF10A | ATCC | CRL-10317 |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| siGENOME against human | Dharmacon | D-020031-02 |
| siGENOME Non-Targeting siRNA Pool #2 | Dharmacon | D-001206-14 |
| siGENOME against human | Dharmacon | LQ-012581-00-0002 |
| siGENOME against human | Dharmacon | LQ-012857-01-0002 |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| This study | NA | |
| pDECKO_mCherry | Addgene | 78534 |
| pBIG2abc vector | Addgene | 80617 |
| Samtools | ||
| FACS Diva Software v6.1.2 | Becton Dickinson | NA |
| ModFit v3.2 | Verity Software | NA |
| NA | ||
| Trimmomatic | ||
| STAR | NA | |
| ChIP-seq Enrich | ||
| DAVID | ||
| ImageJ | ||
| Xcalibur software v2.2 | Thermo Fisher Scientific | OPTON-30487 |
| Proteome Discoverer software suite v1.4 | Thermo Fisher Scientific | OPTON-30812 |
| Mascot search engine v2.5 | Matrix Science | NA |
| SAINT software | ||
| NucMap | ||
| Gene expression analysis script | This paper | |
| Hi-C analysis pipeline | ||