| Literature DB >> 33626359 |
Rebecca Broome1, Igor Chernukhin1, Stacey Jamieson2, Kamal Kishore1, Evangelia K Papachristou1, Shi-Qing Mao1, Carmen Gonzalez Tejedo1, Areeb Mahtey3, Vasiliki Theodorou4, Arnoud J Groen1, Clive D'Santos1, Shankar Balasubramanian5, Anca Madalina Farcas6, Rasmus Siersbæk7, Jason S Carroll8.
Abstract
Estrogen receptor-α (ER) drives tumor development in ER-positive (ER+) breast cancer. The transcription factor GATA3 has been closely linked to ER function, but its precise role in this setting remains unclear. Quantitative proteomics was used to assess changes to the ER complex in response to GATA3 depletion. Unexpectedly, few proteins were lost from the ER complex in the absence of GATA3, with the only major change being depletion of the dioxygenase TET2. TET2 binding constituted a near-total subset of ER binding in multiple breast cancer models, with loss of TET2 associated with reduced activation of proliferative pathways. TET2 knockdown did not appear to change global methylated cytosine (5mC) levels; however, oxidation of 5mC to 5-hydroxymethylcytosine (5hmC) was significantly reduced, and these events occurred at ER enhancers. These findings implicate TET2 in the maintenance of 5hmC at ER sites, providing a potential mechanism for TET2-mediated regulation of ER target genes. CrownEntities:
Keywords: 5hmC; GATA3; TET2; enhancers; estrogen receptor; gene regulation
Mesh:
Substances:
Year: 2021 PMID: 33626359 PMCID: PMC7921846 DOI: 10.1016/j.celrep.2021.108776
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1TET2 is recruited to the ER complex by GATA3
(A) ER qPLEX-RIME in MCF7 cells showing changes to the ER complex after GATA3 knockdown (48 h). Four replicates of ER RIME and one pooled immunoglobulin G (IgG) control RIME for each condition were included in the 10plex tandem mass tag (TMT) mass spectrometry (MS) run. Significantly enriched or depleted proteins according to the adjusted p value are highlighted in red (adjusted p value [p adj] ≤ 0.05, after multiple testing correction using Benjamini-Hochberg procedure).
(B) Overlap of lists of Uniprot IDs of specific interactors for ER, GATA3, and TET2. Specific interactors were defined as those occurring in at least two out of three independent replicates. Proteins that appeared in any one of the three IgG control RIME experiments were excluded. ER, GATA3, and TET2 shared a total of 379 common interactors by RIME. Several key ER complex proteins were among them, highlighted in the central portion of the diagram.
See also Figures S1, S2, and S5A–S5C.
Figure 2TET2 binds to ER enhancers in breast cancer cells
(A and B) Venn diagrams indicating positional overlap of ER and TET2 ChIP-seq peaks in ER+ breast cancer cell lines MCF7 and ZR-75-1 (A) and two ER+ PDX models, namely, STG195 and AB555 (B). Heatmaps below each Venn diagram illustrate the ChIP-seq signal intensity for ER and TET2 at ER/TET2 shared sites (top) and “TET2 low” sites where TET2 peaks were not called (bottom). Mouse schematic was created with BioRender.com.
(C and D) UCSC genome browser tracks indicating overlap of TET2 and ER peaks at ER target genes RARA and GREB1 in ER+ breast cancer cell lines MCF7 and ZR-75-1 and ER+ PDX models STG195 and AB555, respectively.
(E) Pie charts show classification of ER and TET2 binding sites according to genomic location for all the models tested. Promoters were defined as regions inclusive of 1 kb downstream and 2 kb upstream of the TSS.
(F) UCSC genome browser tracks demonstrating ER binding sites upstream of the TET2 TSS in two ER+ breast cancer cell lines (MCF7 and ZR-75-1) and two ER+ PDX models (STG195 and AB555). Scale bar indicates 5 kb. Cell line ChIPs were performed in biological quadruplicate.
See also Figures S3 and S5A–S5C.
Figure 3TET2, ER, and GATA3 regulate similar genes
(A) Heatmaps depicting the top 500 induced and top 500 repressed TET2-regulated genes according to log2 fold change. Color scale represents the relative expression (Z score) of genes across the two conditions (control and knockdown), calculated separately within each comparison (TET2 knockdown [siTET2] versus non-targeting control siRNA [siNT], siESR1 versus siNT, and siGATA3 versus siNT). Hierarchical clustering of genes in the leftmost (siTET2) heatmap is preserved across all three heatmaps. Columns represent independent biological replicates (n = 6). siRNA treatments were performed for 48 h.
(B) Pairwise correlations of data used for the heatmaps in (A).
(C) Graph showing the cumulative fraction of total ER/TET2 shared binding sites (n = 15,945, MCF7 cells) within up to 100 kb of the TSSs of the following three groups of genes: genes upregulated by siTET2 (n = 2,144, red line), genes downregulated by siTET2 (n = 2,269, blue line) (p ≤ 0.05), and genes unchanging in response to siTET2 (constant genes, gray lines). Constant genes were randomly selected from those with p > 0.5 and mean expression > 1.0. Grey lines indicate analysis based on constant genes: the dotted line indicates analysis matched to the number of downregulated genes, and the solid line indicates analysis matched to number of upregulated genes.
(D) Left: bar plot displaying –log10(false discovery rate [FDR]) for GO analysis of the top 500 induced and top 500 repressed TET2-regulated genes according to log2 fold change. Only categories with FDR ≤ 0.05 (threshold indicated by dotted line) are shown. Right: bar plot displaying –log10(FDR) for GO analysis of the top 500 induced and top 500 repressed TET2-regulated genes according to log2 fold change, sub-selected from genes also significantly (p ≤ 0.05) regulated by both GATA3 and ER silencing. The top 6 enriched categories are shown for repressed genes and the top 2 for induced genes. Dotted line indicates FDR of 0.05. Enriched processes were identified using the biological process category level 3 of the GO hierarchy (GOTERM_BP_3).
See also Figures S4A–S4F.
Figure 4ER is required to recruit TET2 to a subset of enhancer elements
(A) MA plot showing log2 fold change in ER binding under control versus siTET2 conditions against the log2 mean intensity of ChIP-seq signal for all ER sites (20,386 peaks).
(B) Normalized tag density of ER ChIP-seq signal under control (siNT) and siTET2 (siTET2) conditions within all ER peaks. ∗∗∗∗p ≤ 0.0001.
(C) Average plot showing normalized signal enrichment of ER ChIP-seq under control (siNT) or siTET2 conditions within all ER peaks. siRNA treatments were performed for 72 h. ChIPs were performed in biological triplicate.
(D) MA plot showing log2 fold change in TET2 binding in response to fulvestrant treatment (100 nM, 3 h) against the log2 mean intensity of TET2 ChIP-seq signal for all TET2 sites (20,599 peaks). “Lost” sites (n = 1,810) and “gained” sites (n = 64) according to DiffBind analysis (p ≤ 0.05) are highlighted in red.
(E) Normalized tag density of ER ChIP-seq signal at unchanging (common) (n = 18,725) and lost (n = 1,810) TET2 sites in response to fulvestrant treatment.
(F) Motif frequency (number of motifs divided by the total number of peaks in each category) of ER, FOXA1, and GATA3 motifs for lost, common, and background sites. Background values were obtained using random open chromatin regions from an MCF7 MNase dataset (EBI Array Express E-MTAB-1958) and are expressed as the average ± SD of two separate background values calculated matched to the number of sites in the lost and common cohorts. Significance against background is indicated; ∗p ≤ 0.05, ∗∗p ≤ 0.01, ∗∗∗∗p ≤ 0.0001.
(G) UCSC genome browser tracks showing TET2 binding in response to treatment with vehicle (ethanol, 3 h) or fulvestrant (100 nM, 3 h) at significantly (p ≤ 0.05) depleted sites according to Diffbind analysis, within 50 kb of the TSS of two key ER target genes (PGR and XBP1). For the gene schematics below each track, lines indicate introns, boxes indicate exons, and arrowheads indicate the direction of transcription. Scale bar indicates 10 kb.
(H) UCSC genome browser tracks showing TET2 binding in response to treatment with vehicle (ethanol, 3 h) or fulvestrant (100 nM, 3 h) at unchanged sites according to Diffbind analysis. ChIPs were performed in biological triplicate. For the gene schematics below each track, lines indicate introns, boxes indicate exons, and arrowheads indicate the direction of transcription. Scale bar indicates 10 kb.
See also Figures S4G–S4I and S5.
Figure 5ER and TET2 are recruited to GATA3 mutant-specific binding sites
(A) Schematic representation of the human GATA3 transcription factor (Uniprot: P23771), with the two transactivation (TA1 and TA2) and the two zinc finger (ZnF1 and ZnF2) domains illustrated. The insertion resulting in a frameshift (“409fs”) mutation at amino acid 409 (COSMIC genomic mutation ID: COSV60515158) generates the variant GATA3 that is extended by 62 amino acids.
(B) The GATA3 409 frameshift mutant was generated using CRISPR-based gene editing. PRM-based proteomics using a peptide common to both wild-type (WT) and mutant GATA3 (GATA3 WT amino acids 389–399, sequence NSSFNPAALSR) or a peptide specific to the elongated GATA3 mutant (GATA3 mutant amino acids 489–496, sequence IMFATLQR) was used to confirm the presence of the elongated GATA3 mutant. MCF7 wild-type (WT) cells were run in parallel as controls. The endogenous (light) peptide peak (where found) is shown in the top two chromatograms of each sub-panel, while the peak of the spiked-in (heavy) standard peptide is shown in the bottom two chromatograms of each sub-panel.
(C) Heatmaps illustrating the ChIP-seq signal intensity for GATA3, ER, and TET2 in WT and GATA3 mutant (MUT) MCF7 cells, focusing on the sites in which mutant GATA3 is specifically enriched (n = 450 sites). ChIPs were performed in biological triplicate.
(D) Average plots showing normalized signal enrichment of GATA3, ER, or TET2 ChIP-seq at GATA3 sites common between mutant and WT cells (top, n = 34,845 sites) or at the GATA3 sites specifically gained in the mutant cells (bottom, n = 450 from C). Lines illustrate the signal enrichment for the respective factors in GATA3 mutant cells, and dotted lines indicate the enrichment in MCF7 WT cells.
Figure 6TET2 is required for maintaining 5hmC globally and at ER enhancer elements
(A) Mass spectrometry was used to assess levels of 5mC or 5hmC in genomic DNA isolated from MCF7 cells treated with either siNT or siTET2 for various durations. Results represent mean ± SD (n ≥ 4). Results are expressed as % of total cytosines.
(B) MA plots showing log2 fold change in 5mC + 5hmC (left, MMS readout) and 5hmC exclusively (right, RRHP readout) under control versus siTET2 conditions. Each datapoint represents an individual 5mC or 5hmC residue. siRNA treatments were performed for 72 h. Results represent biological duplicates.
(C) MMS signal (5mC + 5hmC) and RRHP signal (5hmC) were assessed at ER peak regions (left) or ER/TET2 overlapping peak regions (right) under control (siNT) or siTET2 conditions. The total numbers of sites analyzed within each category are as follows: ER MMS sites = 8,463, ER RRHP sites = 10,104, ER/TET2 MMS sites = 3,762, and ER/TET2 RRHP sites = 4,512.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Rabbit monoclonal anti-β-actin (13E5) (used for Western Blot) | Cell Signaling Technology | Cat# 4970; RRID: |
| Mouse monoclonal anti-ERα (6F11) (used for Western Blot) | Leica | Cat# NCL-L-ER-6F11; RRID: |
| Rabbit polyclonal anti-ERα (used for ChIP) | Abcam | Cat# ab3575; RRID: |
| Rabbit polyclonal anti-ERα (used for ChIP) | Millipore | Cat# 06-935; RRID: |
| Mouse monoclonal anti-GATA3 (HG3-31) (used for ChIP, Western Blot) | Santa Cruz Biotechnology | Cat# sc268; RRID: |
| Rabbit polyclonal anti-GATA3 (used for ChIP, RIME) | Abcam | Cat# ab106625; RRID: |
| Rabbit polyclonal anti-IgG isotype control (used for RIME) | Abcam | Cat# ab171870; RRID: |
| Rabbit polyclonal anti-TET2 (used for ChIP, RIME) | Abcam | Cat# ab94580; RRID: |
| IRDye 800 CW Goat anti-Mouse IgG | Li-Cor | Cat# 925-32210; RRID: |
| IRDye 680LT Goat anti-Rabbit | Li-Cor | Cat# 926-68021; RRID: |
| One Shot TOP10 chemically competent | Thermo Fisher (Invitrogen) | Cat #C404010 |
| Patient-derived breast cancer xenograft models AB555 and STG195 | Caldas Lab, University of Cambridge ( | |
| Fulvestrant (ICI-182780, ZD 9238) | Selleckchem | Cat# S1191 |
| Disuccinimidyl glutarate (DSG) | Santa Cruz Biotechnology | Cat# CAS79642-50-5 |
| SpikeTides peptides for targeted proteomics (Parallel Reaction Monitoring), see | Custom-designed, synthesized by JPT Peptide Technologies | N/A |
| TruSeq Stranded mRNA Library Prep Kit | Illumina | Cat# RS-122-2101 |
| Methyl Midi-seq (MMS) | Zymo Research (outsourced) | N/A |
| Reduced Representation Hydroxymethylation Profiling (RRHP) | Zymo Research (outsourced) | N/A |
| Panomics Nuclear Extraction Kit for Use with Transcription Factor Assays | Panomics | Cat# 13938, AY2002 |
| Ultra-Micro C18 Spin Columns | Harvard Apparatus | Cat# 74-7226 |
| iST 96x Sample Preparation Kit | Preomics | Cat# P.O.00027 |
| ThruPlex DNA-seq kit | Rubicon Genomics | Cat# R400407 |
| DNA HT Dual Index Kit - 96N Set A | Takara | Cat# R400660 |
| ChIP-seq, RNA-seq, MMS and RRHP datasets | This paper | GEO: |
| RIME, qPLEX-RIME and whole proteome datasets | This paper | ProteomeXchange Consortium via PRIDE ( |
| PRM datasets | This paper | Panorama Public database: PXD019726 (also available via |
| Human: MCF7 | ATCC | HTB-22 |
| Human: T-47D | ATCC | HTB-133 |
| Human: ZR-75-1 | ATCC | CRL-1500 |
| TET2 forward primer for qRT-PCR 5′- ATTCTCGATTGTC | This paper | N/A |
| TET2 reverse primer for qRT-PCR 5′- CATGTTTGGACTT | This paper | N/A |
| UBC forward primer for qRT-PCR 5′- ATTTGGGTCGCG | This paper | N/A |
| UBC reverse primer for qRT-PCR 5′- TGCCTTGACATTC | This paper | N/A |
| RARα forward primer for ChIP-qPCR 5′- GCTGGGTCCT | This paper | N/A |
| RARα reverse primer for ChIP-qPCR 5′- CCGGGATAAAGCCACTCCAA-3′ | This paper | N/A |
| GREB1 forward primer for ChIP-qPCR 5′- GAAGGGCAGAGCTGATAACG-3′ | This paper | N/A |
| GREB1 reverse primer for ChIP-qPCR 5′- GACCCAGTTGCCACACTTTT-3′ | This paper | N/A |
| MYC forward primer for ChIP-qPCR 5′- GCTCTGGGCACACACATTGG-3′ | This paper | N/A |
| MYC reverse primer for ChIP-qPCR 5′- GGCTCACCCTTGCTGATGCT-3′ | This paper | N/A |
| Negative control region forward primer for ChIP-qPCR 5′- GCCACCAGCCTGCTTTCTGT-3′ | This paper | N/A |
| Negative control region reverse primer for ChIP-qPCR 5′- CGTGGATGGGTCCGAGAAAC-3′ | This paper | N/A |
| ON-TARGETplus SMARTpool siRNAs against ER | Dharmacon (Horizon Discovery) | Cat# L-003401-00 |
| ON-TARGETplus SMARTpool siRNAs against GATA3 | Dharmacon (Horizon Discovery) | Cat# L-003781-00 |
| ON-TARGETplus SMARTpool siRNAs against TET2 | Dharmacon (Horizon Discovery) | Cat# L-013776-03 |
| ON-TARGETplus SMARTpool non-targeting control siRNAs | Dharmacon (Horizon Discovery) | Cat# D-001810-10 |
| Guide RNA (target sequence) targeting wild-type GATA3, including 5 base-pair 3′ overhang facilitating ligation into the GeneArt CRISPR Nuclease Vector: 5′- AGTGGCTGAAGGGCGAGATGGTTTT-3′ (PAM = TGG) | This paper | N/A |
| Guide RNA (reverse complement) targeting wild-type GATA3, including 5 base-pair 3′ overhang facilitating ligation into the GeneArt CRISPR Nuclease Vector: 5′- CATCTCGCCCTTCAGCCACTCGGTG-3′ (PAM = TGG) | This paper | N/A |
| U6 Forward Primer for Sanger sequencing 5′- GGACTA | Standard sequence | N/A |
| Custom primer for sequencing CRISPR/Cas9-edited clones: sense 5′-GCATCCAGACCAGAAACCGA-3′ | This paper | N/A |
| Custom primer for sequencing CRISPR/Cas9-edited clones: antisense 5′-TGAAACCCTCAACGGCAACT-3′ | This paper | N/A |
| GeneArt CRISPR Nuclease Vector | GeneArt | Cat# A21174 |
| STAR v. 2.5.2b | ||
| DESeq2 | ||
| Skyline-daily software v.19.0.9.190 | MacCoss Lab, University of Washington | |
| Proteome Discoverer v. 1.4 or 2.1 | Thermo Scientific | Cat# OPTON-30945 |
| qPLEXanalyzer | ||
| Image Studio v. 4.0 software | Li-Cor | N/A |
| BioRad CFX Maestro software v. 1.1 | BioRad | N/A |
| bowtie2 v. 2.2.6 | ||
| MACS2 v. 2.0.10.20131216 | N/A | |
| DiffBind | ||
| R v. 3.5.1 or later | R Project | |
| MATLAB | MathWorks | |
| MEME Suite (v. 4.9.1): FIMO | ||
| MEME Suite (v. 4.9.1): MEME | ||
| MEME Suite (v. 4.9.1): DREME | ||
| MEME Suite (v. 4.9.1): TOMTOM | ||
| Prism v. 8 | GraphPad | |
| Protein (UniProt ID) | Peptide sequence | Peptide sequence (modified) | Isotope | Mass [m/z] |
|---|---|---|---|---|
| EAGPPAFYRPNSDNR | EAGPPAFYRPNSDNR | light | 564.3 | |
| EAGPPAFYRPNSDNR | EAGPPAFYRPNSDNR | heavy | 567.6 | |
| AANLWPSPLMIK | AANLWPSPLMIK | light | 670.9 | |
| AANLWPSPLMIK | AANLWPSPLMIK | heavy | 674.9 | |
| ELVHMINWAK | ELVHMINWAK | light | 620.8 | |
| ELVHMINWAK | ELVHMINWAK | heavy | 624.8 | |
| VSPDFTQESR | VSPDFTQESR | light | 583.3 | |
| VSPDFTQESR | VSPDFTQESR | heavy | 588.3 | |
| EGSFFGQTK | EGSFFGQTK | light | 500.7 | |
| EGSFFGQTK | EGSFFGQTK | heavy | 504.7 | |
| VSDVDEFGSVEAQEEK | VSDVDEFGSVEAQEEK | light | 884.4 | |
| VSDVDEFGSVEAQEEK | VSDVDEFGSVEAQEEK | heavy | 888.4 | |
| SGAIQVLSSFR | SGAIQVLSSFR | light | 582.8 | |
| SGAIQVLSSFR | SGAIQVLSSFR | heavy | 587.8 | |
| QLAELLR | QLAELLR | light | 421.8 | |
| QLAELLR | QLAELLR | heavy | 426.8 | |
| YPSQDPLSK | YPSQDPLSK | light | 517.8 | |
| YPSQDPLSK | YPSQDPLSK | heavy | 521.8 | |
| YGPDYVPQK | YGPDYVPQK | light | 533.8 | |
| YGPDYVPQK | YGPDYVPQK | heavy | 537.8 | |
| IMFATLQR | IMFATLQR | light | 490.3 | |
| IMFATLQR | IMFATLQR | heavy | 495.3 | |
| SSLWCLCSNH | SSLWC[+57.021464]LC[+57.021464]SNH | light | 632.3 | |
| SSLWCLCSNH | SSLWC[+57.021464]LC[+57.021464]SNH | heavy | 635.8 | |
| ALGSHHTASPWNLSPFSK | ALGSHHTASPWNLSPFSK | light | 646.3 | |
| ALGSHHTASPWNLSPFSK | ALGSHHTASPWNLSPFSK | heavy | 649.0 | |
| DVSPDPSLSTPGSAGSAR | DVSPDPSLSTPGSAGSAR | light | 850.9 | |
| DVSPDPSLSTPGSAGSAR | DVSPDPSLSTPGSAGSAR | heavy | 855.9 | |
| ECVNCGATSTPLWR | EC[+57.021464]VNC[+57.021464]GATSTPLWR | light | 825.9 | |
| ECVNCGATSTPLWR | EC[+57.021464]VNC[+57.021464]GATSTPLWR | heavy | 830.9 | |
| AGTSCANCQTTTTTLWR | AGTSC[+57.021464]ANC[+57.021464]QTTTTTLWR | light | 964.9 | |
| AGTSCANCQTTTTTLWR | AGTSC[+57.021464]ANC[+57.021464]QTTTTTLWR | heavy | 969.9 | |
| NSSFNPAALSR | NSSFNPAALSR | light | 582.3 | |
| NSSFNPAALSR | NSSFNPAALSR | heavy | 587.3 | |
| VAPEEHPVLLTEAPLNPK | VAPEEHPVLLTEAPLNPK | light | 652.0 | |
| VAPEEHPVLLTEAPLNPK | VAPEEHPVLLTEAPLNPK | heavy | 654.7 | |
| EITALAPSTMK | EITALAPSTMK | light | 581.3 | |
| EITALAPSTMK | EITALAPSTMK | heavy | 585.3 |