Literature DB >> 31372638

A CRISPR-Cas9 screen identifies essential CTCF anchor sites for estrogen receptor-driven breast cancer cell proliferation.

Gozde Korkmaz1, Zohar Manber2, Rui Lopes1, Stefan Prekovic1, Karianne Schuurman1, Yongsoo Kim1, Hans Teunissen3, Koen Flach3, Elzo de Wit3, Giorgio G Galli4, Wilbert Zwart1,5, Ran Elkon2, Reuven Agami1,6.   

Abstract

Estrogen receptor α (ERα) is an enhancer activating transcription factor, a key driver of breast cancer and a main target for cancer therapy. ERα-mediated gene regulation requires proper chromatin-conformation to facilitate interactions between ERα-bound enhancers and their target promoters. A major determinant of chromatin structure is the CCCTC-binding factor (CTCF), that dimerizes and together with cohesin stabilizes chromatin loops and forms the boundaries of topologically associated domains. However, whether CTCF-binding elements (CBEs) are essential for ERα-driven cell proliferation is unknown. To address this question in a global manner, we implemented a CRISPR-based functional genetic screen targeting CBEs located in the vicinity of ERα-bound enhancers. We identified four functional CBEs and demonstrated the role of one of them in inducing chromatin conformation changes in favor of activation of PREX1, a key ERα target gene in breast cancer. Indeed, high PREX1 expression is a bona-fide marker of ERα-dependency in cell lines, and is associated with good outcome after anti-hormonal treatment. Altogether, our data show that distinct CTCF-mediated chromatin structures are required for ERα- driven breast cancer cell proliferation.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31372638      PMCID: PMC6765117          DOI: 10.1093/nar/gkz675

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Approximately 75% of all human breast tumors expresses ERα, a hormone receptor encoded by the ESR1 gene that functions as a hormone-regulated transcription factor (1,2). ERα is generally activated upon binding of its endogenous ligand estradiol (E2), which results in the recruitment of a large spectrum of coregulators to form an active transcription complex and associate to the chromatin via Estrogen Responsive Elements (EREs) (3). Importantly, ERα rarely binds promoters but regulates gene expression by binding cis-regulatory enhancer elements, which interact in 3D genomic space with promoter regions of responsive genes through chromatin looping (4–6). ERα is considered the main driver in these tumors, and current treatment of ERα-positive breast cancer resolve around endocrine therapies like tamoxifen, which prevents cofactor recruitment and transcription complex formation (7) or fulvestrant that induces receptor degradation (8). Other treatments are based on inhibiting the synthesis of estrogen with aromatase inhibitors, which prevents receptor activation. However, resistance to treatment is common and ∼80% of metastases with acquired resistance to endocrine therapeutics still express ERα (9–11). Multiple drivers of endocrine therapy resistance have been identified, including ESR1 activating point mutants (12), ERα phosphorylation (13), overexpression of coregulators (14) and epigenetic reprogramming of the ERα cistrome (15,16). Nonetheless, other mechanisms of endocrine therapy resistance in breast cancer are likely to exist. Eukaryotic gene expression is tightly controlled by a set of steps that guarantee precise spatial and temporal gene expression patterns (17). The regulation of initiation of transcription by regulatory DNA elements is one of the most important events in gene expression (18). Regulatory DNA elements can be classified into two main branches according to their mode of activity as trans-acting factors and cis-regulatory elements (19). Cis-regulatory elements include promoters (20), enhancers (21), silencers (22) and insulators (23). They coordinate the timing and strength of gene expression during development and differentiation (24), growth and stress conditions. While promoters are located immediately upstream of their target genes, enhancers can be located far away from their target promoters on the linear chromosome template and rely on chromatin structure (DNA looping) to activate gene expression (25). One of the most critical determinants of chromatin structure is the zinc-finger DNA binding protein CTCF (CCCTC-binding factor) (26,27). Together with cohesin (28), CTCF functions to insulate enhancers and facilitate correct enhancer/target-promoter interactions by creating chromatin territories dubbed as topologically active domains (TADs) (29,30). Two convergent linear CTCF sites (31) are held together by a cohesin ring and form ‘insulated neighborhoods’ (27) (chromatin loop structures) which come together to form megabase-scale TADs (32). Therefore, it is likely that genetic or epigenetic perturbations of CTCF binding elements (CBEs) may affect chromatin structure leading to local rewiring of promoter-enhancer contacts which potentially can change gene expression patterns (27,33,34) and as a result cause a phenotypic changes in cell behavior (35,36). Indeed, indications for a link between CTCF and ERα-mediated gene expression and phenotypic changes were recently made (15,37). By an integrative genomics approach, Chan and colleagues have partitioned the genome to CTCF blocks (genomic regions between two CBEs), and demonstrated that genes located within ERα-containing CTCF blocks are generally upregulated in response to estrogen while genes in blocks devoid of ERα not. In fact, the presence of an ERα binding site (EBS) within a CTCF block, and the distance to the nearest ERα, were shown to be the best predictive score for estrogen-responsive genes (37). Moreover, CTCF binding appeared not to only mark boundaries for accessibility and insulated transcriptional blocks, but also seemed to be required to shape ERα-mediated gene expression in breast cancer cells (15). However, the functional significance and the causal role of CTCF binding and associated gene expression changes to cancer progression are yet to be explored. Since CBEs and EBSs overlap in thousands of cases (15) and enhancer/target-gene interactions depend on proper chromatin interactions, we hypothesize here that CBEs can affect ERα function and induce tumor cell proliferation and resistance to anti-hormonal treatments. The CRISPR-Cas9 system has been recently harnessed to delineate regulatory gene expression mechanisms mediated through promoter (38,39) and enhancer regions (40). Though applied to specific CTCF loci of interest before (27,34), a genome-wide CRISPR-Cas9 screen systematically targeting CTCF sites was not yet implemented. In this study, we designed a customized CRISPR-Cas9 library to target individual CBEs that reside in close proximity to EBSs and performed a drop-out genetic screen to identify CBEs essential for ERα-driven cell proliferation. Altogether, we validated four key CBEs, of which we further functionally characterized the one residing in the vicinity of PREX1 (Phosphatidylinositol 3,4,5-trisphosphate-dependent Rac exchanger 1) promoter; a key driver of breast cancer cell proliferation.

MATERIALS AND METHODS

Cell lines and chemical reagents

HEK293-T, MDA-MB-231 and MCF-7 cells were cultured in Dulbecco's modified Eagle's medium (DMEM) medium (Gibco), supplemented with 10% Fetal Calf Serum (Hyclone) and 1% penicillin/streptomycin (Gibco). For the estrogen-stimulation experiments, MCF-7 cells were washed three times with phosphate-buffered saline and cultured in phenol red–free DMEM medium (Gibco) supplemented with 5% charcoal stripped serum (Gibco) for 72 h prior to E2 treatment (10−8 M). E2 (17β-estradiol) was purchased from Sigma. All cell lines were obtained from ATCC. Custom single guide RNAs (sgRNAs) designed for this study:

Pooled library cloning

Standard de-salted DNA oligonucleotides were synthesized and purchased from IDT (Integrated DNA Technologies) to construct sgRNA library for CTCF sites in the vicinity of ERα (1709 sgRNAs+82 control sgRNAs). Complementary single-stranded oligos were phosphorylated and annealed by combining 100 μM oligos, 1× T4 PNK Buffer, 1 mM adenosine triphosphate (ATP), 5U T4 PNK and incubating the reaction at 37°C/30 min, 95°C/5 min followed by ramp down to 25°C at 5°C/min. Annealed oligos were diluted at 1:1000 in sterile water, and ligated to plasmid vector lentiCRISPR v2 (gift from Feng Zhang (Addgene plasmid #52961)) using the following parameters: 50 ng BsmBI (Fermentas) digested plasmid, 1 μl diluted oligo duplex, 1× Ligation Buffer (Roche), 5U T4 DNA Ligase (Roche) incubated at RT/30 min. We did five independent ligation reactions per pool and used them to transform highly competent Escherichia coli cells (EletroSHOX - Bioline, BIO-85038) according to manufacturer's protocol. In order to assess the complexity of our libraries, we plated 1 μl of cell transformation mixture on LB agar plates containing Ampicillin, incubated them overnight at 37°C, and counted individual bacterial colonies after 16 h. At this point, we estimated that each individual sgRNA is covered >100×, ensuring that our libraries have high-complexity and are suitable for pooled screening. Transformation mixtures were combined, grew in liquid LB until OD600 = 0.8 was reached, and plasmid DNA was harvested using Genopure Plasmid Maxi kit (Roche).

Lentivirus production and purification

To produce lentivirus, 4 × 106 HEK293T cells per pool were seeded in five 100 mm dishes one day prior to transfection. For each dish, we diluted 10 μg of CRISPR-CTCFERα plasmid library, 3.5 μg of pVSV-G, 5 μg of pMDL RRE and 2.5 μg of pRSV-REV in 450 μl of 0.1× TE/H2O, added 50 μl of CaCl2 and incubated 5 min at RT. Plasmid DNA was precipitated by adding 500 μl 2× HEPES-buffered saline to the solution while vortexing at full-speed. The precipitate was added immediately to the plate and the cells were incubated for 14 h at 37°C, after which the medium was refreshed. Lentivirus-containing supernatants were collected 60 h post-transfection, filtered through a 0.45 μm membrane (Milipore Steriflip HV/PVDF) and stored at −80°C. All cell types and lentivirus batches tested were titrated in order to achieve a MOI of 0.4–0.5. Cell lines were infected with lentivirus supernatants supplemented with 8 μg/ml polybrene (Sigma). At 24 h post-infection, medium was replaced and cells were selected with 2 μg/ml puromycin (Gibco). Antibiotic selection was stopped as soon as no surviving cells remained in the no-transduction control plate. All experiments were performed within the first 15 days following the antibiotic selection.

CRISPR-Cas9 dropout screen in breast cancer cells

MCF-7 and MDA-MB-231 cells were infected with three independent pools of CRISPR-CTCFERα. We infected ∼45 000 cells per vector, to ensure that every sgRNA was present in the cell population at the start of the experiment. Following antibiotic selection, cells were allowed to proliferate for 48 h to clear potentially toxic sgRNAs from the population. At this time point, we harvested half of the cells infected with CRISPR-CTCFERα pools (T = 0). The remaining cells were placed in culture, allowed to proliferate for 20 days, and then harvested (T = 20). Cell pellets were stored at −80°C and processed later on for further analysis. Genomic DNA was isolated from the harvested library-transduced cells, and amplified for the integrated vectors by polymerase chain reaction (PCR), and sequenced by Next-generation sequencing (NGS) to quantify the abundance of integrated sgRNAs present in each population. Read counts in each sample were normalized to 1M reads (RPM) and enrichment (depletion) scores were calculated for each sgRNA vector, in each cell line (MCF-7 and MDA-MB-231) by comparing its normalized frequency at T = 20 and T = 0 pools. (Only sgRNAs covered by at least 50 reads were considered. To avoid inflation of FC for sgRNAs covered by low number of reads, RPMs below 5 were set to 5). These enrichment/depletion scores were then standardized to Z-scores (raw counts, RPM, enrichment scores and Z-scores are provided in Supplementary Table S1). These standardized enrichment scores showed high correlation between replicates from the same cell line (Supplementary Figure S1A) Difference between the mean standardized enrichment scores in MCF-7 and MDA-MB-231 cells (which serve as control in the screen as they are not dependent on ERα) were calculated for each sgRNA. For validation, we selected sgRNA vectors whose repressive effect on proliferation was at least 30% stronger in MCF-7 cells compared to MDA-MB-231 cells.

Genomic DNA sequencing to identify sgRNAs

Frozen cell pellets were thawed and genomic DNA (gDNA) was isolated with DNeasy Blood and Tissue kit (Qiagen). Identification of sgRNAs was done by PCR in two steps. For the first PCR, the amount of input gDNA was calculated to achieve ∼500 coverage (assuming that 106 cells contain 6.6 μg gDNA), which resulted in 5 μg for CRISPR-CTCFERα. For each sample, we performed two separate reactions (max. 1 μg gDNA per reaction) using Phusion DNA polymerase (Thermo Scientific) and combined the resulting amplicons. In the first PCR, we used the following primer sequences to amplify sgRNAs: PCR1_F1 ACACTCTTTCCCTACACGACGCTCTTCCGATCTXXXXXXGGCTTTATATATCTTGTGGAAAGGACG (XXXXXX represents a 6 bp barcode) PCR1_R1 GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTACTGACGGGCACCGGAGCCAATTCC A second PCR was performed to attach Illumina adaptors and index samples. The second PCR was done in 50 μl reaction volume, including 7 μl of the product from the first PCR and using the following primers: PCR2_P5 AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT PCR2_P7 CAAGCAGAAGACGGCATACGAGATXXXXXXGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT (XXXXXX represents a 6 bp index) Amplification was carried out with 15 cycles for both first and second PCR. After the second PCR, resulting amplicons were purified using Agencourt AMPure XP beads (Beckman Coulter), quantified in a Bioanalyzer 2100 (Agilent), mixed and sequenced in a HiSeq 2500 (Illumina).

Genomic DNA sequencing to identify CRISPR-induced mutations

Cell pellets were collected and gDNA was isolated with DNeasy Blood and Tissue kit (Qiagen). Amplification of target regions for sequencing was done by PCR in two steps. For each sample, we used 500 ng of gDNA as input for the first PCR (done in duplicate). Resulting amplicons were combined and we used 10 μl as input for the second PCR. Amplification was carried out with 20 cycles for both first and second PCR. After the second PCR, amplicons were purified using Agencourt AMPure XP beads (Beckman Coulter), quantified in a Bioanalyzer 2100 (Agilent), mixed and sequenced in a HiSeq 2500 (Illumina). List of PCR primers:

Competitive proliferation assay

MCF-7 and MDA-MB-231 cells were infected with indicated sgRNAs to validate the results of the CRISPR-CTCFERα screen. Separately, we generated polyclonal MCF-7 and MDA-MB-231 cells stably expressing GFP using pLKO.1-GFP vector. GFP expressing cells were mixed in a 1:3 ratio with cells containing individual sgRNAs. The percentage of GFP expressing cells was assessed by flow cytometry at the beginning of the experiment (T = 0) and every 72 h onward (T = 3 days and T = 6 days). For every condition, 10 000 events were recorded and the data was analyzed using FlowJo software.

RNA isolation, reverse-transcription and quantitative real-time PCR (qPCR)

After the antibiotic selection was completed, cells were kept in cell culture at least 2–3 days in order to eliminate the adverse effect of antibiotic on cells. Total RNA was extracted using TRIsure (Bioline) reagent and following the manufacturers protocol. cDNA was produced with either SuperScript III (Invitrogen) or High-Capacity cDNA Reverse Transcription Kit (Applied Biosystem) using 2–4 μg of total RNA per reaction. qPCR reaction was performed with SYBR green I Master mix in a LightCycler 480 (Roche). TATA-binding protein (TBP) was used as an internal control. List of qPCR primers:

Western blot analysis

Whole-cell lysates were prepared as previously described (41). Membranes were immunoblotted with the following antibodies: PREX1 (ab183643, Abcam; 1:500), and beta-Actin (A2228, Sigma; 1:5000). Protein bands were visualized using corresponding secondary antibodies (Dako) and ECL reagent (GE Healthcare).

GRO-seq

GRO-seq was performed as described before with minor modifications. Briefly, 5 × 106 nuclei were isolated and incubated 5 min at 30°C with equal volume of reaction buffer (10 mM Tris–Cl pH 8.0, 5 mM MgCl2, 1 mM DTT, 300 mM KCL, 20 units of SUPERase In, 1% sarkosyl, 500 μM ATP, GTP and Br-UTP, 0.2 μM CTP+32P CTP) for the nuclear run-on. The reaction was stopped and total RNA was extracted with Trizol LS (Invitrogen) according to the manufacturer's instructions. RNA was fragmented using fragmentation reagents (Ambion) and the reaction was purified through p-30 RNAse-free spin column (BioRad). BrU-labeled RNA was immunoprecipitated with anti-BrU agarose beads (Santa Cruz), washed one time in binding buffer, one time in low salt buffer (0.2× SSPE, 1 mM ethylenediaminetetraacetic acid (EDTA), 0.05% Tween-20), one-time high salt buffer (0.25× SSPE, 1 mM EDTA, 0.05% Tween-20, 137.5 mM NaCl) and two times in TET buffer (TE with 0.05% Tween-20). RNA was eluted with elution buffer (20 mM DTT, 300 mM NaCl, 5 mM Tris–Cl pH 7.5, 1 mM EDTA and 0.1% sodium dodecyl sulphate) and isolated with Trizol LS. After the binding step, BrU-labbeled RNA was treated with tobacco acid pyrophosphatase (TAP, Epicenter) to remove 5′-methyl guanosine cap, followed by T4 polynucleotide kinase (PNK; NEB) to remove 3′-phosphate group. BrU-containing RNA was treated with T4 PNK again at high pH in the presence of ATP to add 5′-phosphate group. The reaction was stopped and RNA was extracted with Trizol LS. Sequencing libraries were prepared using TruSeq Small RNA kit (Illumina) following manufacturer's instructions. Briefly, end-repaired RNA was ligated to RNA 3′ and 5′ adapters, followed by RT-PCR amplification. cDNA was purified using Agencourt AMPure XP (Beckman Coulter) and amplified by PCR for 12 cycles. Finally, amplicons were cleaned and size-selected using Agencourt AMPure XP (Beckman Coulter), quantified in a Bioanalyzer 2100 (Agilent), and sequenced in a HiSeq 2500 (Illumina). Sequenced reads were aligned to the human genome (hg19) using bowtie2. Statistics on the number of sequenced and mapped reads is given in Supplementary Table S3.

RNA-seq

After the antibiotic selection was completed, cells were kept in cell culture until they reach 70–80% confluency. Total RNA was extracted using TRIsure (Bioline) reagent and following the manufacturers protocol. RNA-seq samples were processed with TruSeq RNA library prep kit v2 (Illumina) and sequenced in a HiSeq 2500 (Illumina). Sequenced reads were aligned to the human genome (hg19) using TopHat2 (42) and gene expression counts were calculated using HTseq (43) based on Ensembl's human gene annotations (v69) (44). Expression levels were normalized using quantile normalization. Statistics on the number of sequenced and mapped reads is given in Supplementary Table S3.

Chromatin immunoprecipitation (ChIP)

Chromatin immunoprecipitations were performed as previously described (16,45). Cells were fixed with formaldehyde (1%) for 10 min and subsequently quenched with glycine. Following that, samples were lysed as described (46) and sonicated for at least 11 cycles of 30 s on, 30 s off using a Diagenode Bioruptor Pico. We used antibodies against ERα (MS-315-P, Thermo Scientific), FOXA1 (ab5089, Abcam) and CTCF (07–729, Merck Millipore). For ERα/FOXA1 ChIPs, 5 μg of antibody was conjugated with 50 μl Protein G magnetic beads. On the other hand, for CTCF ChIP 7.5 μl of the antibody was conjugated with 50 μl Protein A magnetic beads. Immunoprecipitated DNA was processed for library preparation by using TruSeq ChIP Library Preparation Kit. Samples were sequenced using an Illumina Hiseq2500 genome analyzer (65 bp reads, single end), and aligned to the Human Reference Genome (hg19, February 2009). Reads were filtered based on MAPQ quality (quality ≥ 20) and duplicate reads were marked. Peak calling over input control was performed using DFilter (47) and MACS 1.4 (48) peak callers. MACS was run with the following parameters: bw 300 -m 10,30 -p 1.00e-7 –nomodel. DFilter was run with bs = 50, ks = 30, refine, nonzero. The peaks shared by both peak callers were used for analysis. For pair-wise comparison of ChIP-seq, the binding signals in the union of the binding sites from the data pair were quantified using count per million reads using pybedtools (49,50). The binding profiles are compared and visualized using seaborn v0.7.0 (51) where the sites overlap with the target of sgRNAs are highlighted in the scatter plot.

4C

4C-seq was performed as described previously (52). Briefly, after obtaining a single cell suspension, crosslinking of the nuclei, chromatin was digested with MboI and subsequently re-ligated. Proteinase K was used to de-crosslink the DNA o/n at 65°C. DNA was isolated by phenol/chloroform extraction and subsequently digested with a Csp6I. To create circular DNA molecules, digested DNA was ligated under diluted conditions (7 ml). DNA was precipitated with 1/10 volume 3M sodium acetate and 1 volume isopropanol. DNA was quantified using Qbit (Thermo Fisher). 4C PCR was done using Expand long template polymerase (Sigma-Aldrich) in five separate PCR reactions using 100 ng per PCR. After Illumina sequencing, reads were demultiplexed based on the primer sequence of the viewpoint (first 20 nt of the sequence read). Trimmed reads we mapped to the human genome (hg19) with BWA-MEM (53). Only reads whose start overlapped with an MboI restriction site were retained for further analysis. Read count files were normalized using peakC (54) before further analysis. CTCF motif positions are based on a previously done ChIPseq (15).

PREX1 expression data and patient survival

The performance of the PREX1 classifier for clinical outcome was assessed using the METABRIC patient series (55). Survival curves for PREX1 expression (lowest versus highest quartile) were constructed using the Kaplan–Meier method and compared using log-rank test. Adjusted Cox proportional hazard regression analyses were performed with the following covariates: PREX1 expression, age, tumor grade (grade 1 versus grade 2 versus grade 3), tumor size, tumor stage (1 versus 2 versus 3 versus 4 versus 5) and lymph node status (negative versus positive). When comparing the performance of the PREX1 classifier in the hormonal treatment setting, the group of patients receiving hormonal treatment was down sampled to match the patient numbers of the other group. Pearson's correlation coefficient was used to assess the correlation between PREX1 and ESR1 expression. A P-value < 0.05 was considered as a significant result.

Analysis of PREX1 expression and ESR1 KD sensitivity in breast cancer cell

We compared the expression levels of PREX1 (as measured by RNA-seq across the breast cancer cell lines of the Cancer Cell Line Encyclopedia) with the Ataris and Demeter scores for ESR1 knockdown previously reported by the projects DRIVE (56) and Achilles (57), respectively. The sensitivity scores of ESR1 knockdown correspond to an aggregate of 17 and 10 shRNAs for DRIVE and Achilles, respectively.

Analysis of PREX1 and ESR1 expression in tumor samples

The expression levels of PREX1 and ESR1 (log2 RNA-seq) were analyzed in 1095 breast cancer samples. The data was obtained from TCGA.

RESULTS

A functional genetic screen identifies CBEs essential for ERα-mediated mitogenic signaling

We hypothesized that correct chromatin structure is required to facilitate ERα-driven breast cell proliferation. As CTCF is one of the major determinants of chromatin structure, we setup a functional genetic screen in breast cancer cell lines to uncover CTCF-mediated DNA loops whose knockout attenuates ERα-mediated cell proliferation. We first constructed a CRISPR-Cas9 single guide RNA (sgRNA) library to target CBEs that are both engaged in DNA loop formation (anchor CTCF sites) and located in the vicinity of ERα-bound enhancers (Figure 1A). Initially, anchor CBEs were bioinformatically mapped using ENCODE’s Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) data in MCF-7 cells (4). Then, we searched for evidence of direct CTCF-binding by crossing this result with ENCODE CTCF ChIP-seq data in MCF-7 and the presence of a CTCF-motif in each region (CTCF motif was derived by MEME and then used by FIMO (using cutoff of P < 10–5) to scan the peaks for its hits). This set of CTCF anchor sites was intersected with a map of ERα-bound active enhancers (identified by the overlap of ERα binding sites (5) and ENCODE p300 peaks in MCF-7) to select those that reside within a 100 Kb distance from any ERα-bound enhancer. This pipeline resulted in 668 CBEs to which we generated 1,709 targeting sgRNA vectors. We also added 82 non-targeting sgRNA vectors (negative controls) and an sgRNA vector targeting ERα-bound enhancer (ERα1830) that we previously showed its requirement for mediating ERα-driven cell proliferation (positive control (40,58)). A lentiviral library was generated and named as CRISPR-CTCFERα.
Figure 1.

A focused CRISPR-Cas9 genetic screen identifies CBEs required for mitogenic function of ERα. (A) The selection criteria and a summary of the pipeline used to generate a customized CRISPR-Cas9-based library (named as CRISPR-CTCFERα). The number in brackets indicates remaining CTCF binding sites. (B) Screening strategy based on negative selection of sgRNAs. (C) Distribution of sgRNAs’ standardized enrichment scores (Z-scores) in only MCF-7 (left) and in MCF-7 relative to MDA-MB-231 cells (right). A green dot represents the positive control sgRNA targeting enhancer1830 (labeled as enh1830) (40). (D) Validation of candidate hits by a competitive proliferation assay in MCF-7 and MDA-MB-231 cells. CNT indicates a non-targeting sgRNA. sgRNA targeting enhancer1830 was used as a positive control (40). GFP positive cells were determined by flow cytometry. For complete plots see Figure S1A. (E) RNA-seq showed a local downregulation of the genes residing within 400 kbps away from each target site. CNT indicates a nontargeting sgRNA. Heatmap represents Z-score of expression fold-change (between the test and control sgRNAs). Mutation profile of each CBE target site was determined by DNA-seq experiment, and is presented at the left-hand side. In all samples the frequency of CRIPSR-Cas9 meditated mutations at the target CBE was more than 50%.

A focused CRISPR-Cas9 genetic screen identifies CBEs required for mitogenic function of ERα. (A) The selection criteria and a summary of the pipeline used to generate a customized CRISPR-Cas9-based library (named as CRISPR-CTCFERα). The number in brackets indicates remaining CTCF binding sites. (B) Screening strategy based on negative selection of sgRNAs. (C) Distribution of sgRNAs’ standardized enrichment scores (Z-scores) in only MCF-7 (left) and in MCF-7 relative to MDA-MB-231 cells (right). A green dot represents the positive control sgRNA targeting enhancer1830 (labeled as enh1830) (40). (D) Validation of candidate hits by a competitive proliferation assay in MCF-7 and MDA-MB-231 cells. CNT indicates a non-targeting sgRNA. sgRNA targeting enhancer1830 was used as a positive control (40). GFP positive cells were determined by flow cytometry. For complete plots see Figure S1A. (E) RNA-seq showed a local downregulation of the genes residing within 400 kbps away from each target site. CNT indicates a nontargeting sgRNA. Heatmap represents Z-score of expression fold-change (between the test and control sgRNAs). Mutation profile of each CBE target site was determined by DNA-seq experiment, and is presented at the left-hand side. In all samples the frequency of CRIPSR-Cas9 meditated mutations at the target CBE was more than 50%. Next, we transduced MCF-7 (ERα-positive) and MDA-MB-231 (ERα-negative) breast cancer cell lines with the CRISPR-CTCFERα library (three biological replicates, 0.3 MOI, 500× coverage). Depletion scores were calculated for each sgRNA vector in each cell line (MCF-7 and MDA-MB-231) by comparing its normalized abundance in the T = 20 and T = 0 pools. These scores showed high correlation between the biological replicates of the same cell line (Figure 1B, Supplementary Figure S1A and Table S1 and 4). We aimed to find sgRNAs whose abundance was reduced specially in MCF-7 without observing any significant change in MDA-MB-231 cells. As expected, none of the negative controls showed any significant reduction in abundance (Supplementary Figure S1B), while the abundance of sgRNA-ERα1830 was highly reduced only in MCF-7 cells (Figure 1C). Importantly, this experiment identified four CBEs (three of which were targeted by two independent sgRNAs) with a marked negative effect on the proliferation of MCF-7 but no significant effect on MDA-MB-231 cells (Figure 1C and Supplementary Figure S1B). Single transductions of these hits confirmed the functionality of the four CBEs by growth competition assays (Figure 1D; Supplementary Figure S2A and B). Following 9 days of culturing, each of the identified sgRNAs attenuated cell proliferation to a similar extent as observed with the positive control sgRNA-ERα1830 (40,58). Thus, our functional genetic screen approach has yielded four CBEs that affect the proliferation of MCF-7 while having no effect on MDA-MB-231 cells. Interestingly, examination of a publicly available dataset that profiled transcriptional responses to estradiol treatment in MCF-7 cells (using the GRO-seq technique) (59), showed that for all the four validated CBEs, the nearest gene was markedly responsive (induction above 2-fold) to the treatment (P = 0.0003; binomial test using the overall proportion of genes that were induced more than 2-fold in the dataset): ESRP2, AKAP1, BICD2 and PREX1 located, respectively, 7k, 35k, 48k and 36k bps from the CBEs targeted by sgRNA680, sgRNA810, sgRNA1659 and sgRNA1118 (Supplementary Table S2).

CBEs are required for local gene expression

We studied in a greater detail the impact of a subset of the validated sgRNAs (one from each of the four CBEs detected by the screen) on genome integrity and gene expression. First, we interrogated the level of DNA editing at the target sites by next-generation sequencing and found the expected >50% mutant allele frequencies, the majority of DNA alterations occurring within 15 nt from the planned cleavage sites (Figure 1E, left panel and Supplementary Figure S4A and B). Second, using mRNA sequencing, we assessed changes in gene expression between our hits and a non-targeting sgRNA. Notably, in all cases, this analysis detected a mild (∼10–50% reduction) but consistent negative impact on the expression of a handful of genes in the vicinity of the targeted regions (Figure 1E, right panel and Supplementary Figure S2C). Moreover, we used two publicly available GRO-seq datasets that profiled transcriptional responses to E2 in MCF-7 cells to examine the response of genes in the vicinity of the four validated CBEs (±400 kbp; Figure 1E) to estrogen (5,59). In both cases, nearby genes showed significant induction upon E2 treatment (Supplementary Figure S2D), suggesting that the selected CBEs are required for optimal local gene expression (27).

A key role for CBEs in controlling ERα-mediated PREX1 transcript activation

Next, we focused on the CBE region targeted by sgRNA1117 and sgRNA1118 (named as CBE1). This CBE has a strong effect on PREX1 expression, a target gene that shows a robust transcriptional activation following estradiol treatment in MCF-7 cells (∼2-fold, Figure 2E). Expression analysis of all genes residing within a 2 MB window around the target site (CBE1) indicated a significant effect on the mRNA level of PREX1 (Figures 1E and 2A). PREX1 is a Rac guanine nucleotide exchange factor, whose activity promotes breast cancer cell proliferation through the activation of the extracellular signal-regulated kinases (ERK1/2) (60). The PREX1 locus is frequently amplified in various cancer types (61,62), especially in breast cancer tumors (up to 30%) with a preference for ERα-positive cases (63), suggesting an oncogenic potential within the ERα pathway. Using qRT-PCR, we validated the changes in PREX1 gene expression induced by both CBE1 sgRNAs (Figure 2B). Notably, CRISPR-mediated targeting of PREX1 exons (Figure 2D and Supplementary Figure S2E), but not introns (Figure 2C and Supplementary Figure S4D), resembled the effect of targeting CBE1 in MCF-7 cells (Figure 2D). Intriguingly, in the same sgRNA1117 or sgRNA1118-transduced MCF-7 cell populations, we observed marked reduction in the protein levels of PREX1 isoforms (Supplementary Figure S3). It should be noted that due to the amplification of PREX1 locus in MCF-7 cells, a subtle effect of CRISPR-Cas9 targeting was observed when introns were targeted (64). In contrast to the effect in MCF-7 cells, targeting PREX1 exons did not affect the proliferation capacity of MDA-MB-231 cells (Figure 2D)—though PREX1 locus is amplified in MDA-MB-231 too.
Figure 2.

A key role for CBE1 in ERα-mediated stimulation of PREX1 activity. (A) Gene expression fold-change (log2) between sgRNA1118 (targeting CBE1) and non-targeting control depicted for genes in chr20 (the location of sg1118 CRISPR-Cas9 cut site is set to 0). The red line is a lowess (locally weighted scatterplot smoothing) regression curve fitted to the data. (B and C) Relative expression levels of PREX1 was measured by qRT-PCR in sgRNA1117 and sgRNA1118 (targeting CBE1) and intron targeted MCF-7 cells, compared with control non-targeting sgRNA (CNT). Gene expression levels were normalized to TBP. n = 3 ***P < 0.005, *P < 0.05, two-tailed Student's t-test. (D) Competitive proliferation assays in MCF-7 and MDA-MB-231 after transduction with individual sgRNAs targeting PREX1 gene. Values on day 6 (T = 6) and day 9 (T = 9) were normalized to day 0 (T = 0). ***P < 0.005, **P < 0.01, *P < 0.05, two-tailed Student's t-test. n.s., not significant.

A key role for CBE1 in ERα-mediated stimulation of PREX1 activity. (A) Gene expression fold-change (log2) between sgRNA1118 (targeting CBE1) and non-targeting control depicted for genes in chr20 (the location of sg1118 CRISPR-Cas9 cut site is set to 0). The red line is a lowess (locally weighted scatterplot smoothing) regression curve fitted to the data. (B and C) Relative expression levels of PREX1 was measured by qRT-PCR in sgRNA1117 and sgRNA1118 (targeting CBE1) and intron targeted MCF-7 cells, compared with control non-targeting sgRNA (CNT). Gene expression levels were normalized to TBP. n = 3 ***P < 0.005, *P < 0.05, two-tailed Student's t-test. (D) Competitive proliferation assays in MCF-7 and MDA-MB-231 after transduction with individual sgRNAs targeting PREX1 gene. Values on day 6 (T = 6) and day 9 (T = 9) were normalized to day 0 (T = 0). ***P < 0.005, **P < 0.01, *P < 0.05, two-tailed Student's t-test. n.s., not significant. To examine the estradiol (E2) responsiveness, we cultured MCF-7 cells (wildtype and CBE1 knockout) in hormone-deprived medium and subsequently treated with E2 or DMSO control for 30 min, 60 min and 6 h and measured PREX1 mRNA expression. Indeed, PREX1 expression was significantly increased after E2 treatment (Figure 2E and Supplementary Figure S2F), indicating that PREX1 is a primary target gene of ERα. Furthermore, knocking out CBE1 significantly compromised the induction of PREX1 expression by E2 (Figure 2E and Supplementary Figure S2F). Moreover, examining genes in the vicinity of PREX1 gene indicated that regardless of their induction by E2, CBE1 compromised their expression, substantiating its impact on local gene expression (Supplementary Figure S2G). Altogether, our experiments revealed a key role for CBE1 in ERα-mediated stimulation of PREX1 levels.

An interplay between several CBEs regulate PREX1 expression

To further assess the binding of CTCF to CBE1 upon sgRNA introduction, we performed CTCF chromatin-immunoprecipitation and sequencing (ChIP-seq) experiments. This analysis revealed a moderate binding of CTCF to CBE1, which was significantly reduced in cells transduced with either sgRNA1117 or 1118, compared to control (Figure 3A, left and middle panels). This result validates CBE1 as a binding site of CTCF, and more importantly demonstrates a strong and specific effect of CBE1-targeting vectors. Then, we examined the chromatin architecture surrounding CBE1, and designed circularized chromosome conformation capture (4C-seq) experiments with CBE1 as a viewpoint (52). This analysis uncovered an interaction of CBE1 with an upstream CBE (termed here as CBE2) and the promoter of PREX1 (Figure 3B and Supplementary Figure S5A). Interaction between CBE1 and CBE2 is in the expected convergent orientation (31). In CBE1-targeted cells, we detected a mild reduction in the interaction frequency with CBE2. To confirm the observed interaction patterns, we included two additional 4C viewpoints; CBE2 (Figure 3B and Supplementary Figure S5B) and the PREX1 promoter (Figure 3B and Supplementary Figure S5D). In the control condition, CBE2 interacts with another downstream CTCF site (termed here CBE3) (Figure 3B and Supplementary Figure S5C). However, CBE1 knockout induced a strong interaction frequency between CBE2 and the PREX1 promoter, suggesting that CBE1 is important to keep the promoter region in a more accessible chromatin configuration for transcriptional activation by impeding the interaction between CBE2 and PREX1 promoter and by favoring interaction between CBE2 and CBE3. Therefore, we targeted CBE2 using two different sgRNAs and examined cell proliferation and interaction profile of each CBE. Similar to targeting CBE1 and PREX1, targeting CBE2 markedly reduced cell proliferation in MCF-7, but not in MDA-MB-231 cells (Figure 3C and D). Interestingly, targeting CBE2 triggered an interaction between CBE3 and CBE1, which are in a non-convergent orientation (65), while decreasing the interaction frequency between CBE2 and CBE3 (Supplementary Figure S5). The interaction frequency between CBE2 and PREX1 promoter was preserved after targeting CBE2, suggesting that this is CTCF independent (Supplementary Figure S5). Thus, our analysis indicates an intricate interplay between several CBEs for maintaining a chromatin structure in the PREX1 locus that supports its induction and enhance stimulation of cell proliferation by ERα.
Figure 3.

Loss of CTCF binding to CBE1 modulates local chromatin interactions with PREX1 promoter. (A) MCF-7 cells stably expressing nontargeting sgRNA (CNT), sgRNA1117 and sgRNA1118 (targeting CBE1), and sgRNA-CBE2 were examined by CTCF-ChIP-seq. Red arrow for CBE1, blue arrow for CBE2 and purple arrow for ERα binding site 24365 (EBE24365). (B) Relative chromatin interaction intensity of PREX1 locus is indicated with viewpoints from CBE1, CBE2, CBE3 and promoter by 4C-seq data aligned with CTCF ChIP-seq. RefSeq gene panel was shown at the bottom of the panel. CNT indicates a nontargeting sgRNA. For complete data see Supplementary Figure S3. (C) Genomic coordinates of CBE1 and CBE2. (D) Competitive proliferation assays of MCF-7 cells transduced with individual sgRNAs targeting CBE2. Values on day 6 (T = 6) and day 9 (T = 9) normalized to day 0 (T = 0). ***P < 0.005, **P < 0.01, *P < 0.05, two-tailed Student's t-test. n.s., not significant.

Loss of CTCF binding to CBE1 modulates local chromatin interactions with PREX1 promoter. (A) MCF-7 cells stably expressing nontargeting sgRNA (CNT), sgRNA1117 and sgRNA1118 (targeting CBE1), and sgRNA-CBE2 were examined by CTCF-ChIP-seq. Red arrow for CBE1, blue arrow for CBE2 and purple arrow for ERα binding site 24365 (EBE24365). (B) Relative chromatin interaction intensity of PREX1 locus is indicated with viewpoints from CBE1, CBE2, CBE3 and promoter by 4C-seq data aligned with CTCF ChIP-seq. RefSeq gene panel was shown at the bottom of the panel. CNT indicates a nontargeting sgRNA. For complete data see Supplementary Figure S3. (C) Genomic coordinates of CBE1 and CBE2. (D) Competitive proliferation assays of MCF-7 cells transduced with individual sgRNAs targeting CBE2. Values on day 6 (T = 6) and day 9 (T = 9) normalized to day 0 (T = 0). ***P < 0.005, **P < 0.01, *P < 0.05, two-tailed Student's t-test. n.s., not significant.

A chromatin conformation structure around PREX1 promoter allows efficient transcriptional activation by ERα

As our results demonstrate that CBE1 is required for PREX1 expression, our next goal was to identify enhancers whose activity is attenuated upon targeting CBE1. Building on eRNAs as a quantitative measure of enhancer activity (66), we used Global Run-On sequencing (GRO-seq) (67) to measure changes in nascent transcription in CBE1-knockout cells compared to control. The dREG (discriminative Regulatory Element detection from GRO-seq (dREG)) (68) tool detected 25 putative transcriptional regulatory elements (TREs) in the PREX1 locus. Notably, the majority of these elements showed reduced transcriptional activity throughout the whole PREX1 locus in CBE-targeted cells (Figure 4A; Supplementary Figure S6A and B). Intersection with publicly available ERα ChIP-seq data showed that one of the elements that showed the most pronounced attenuation of transcription (interval number 24365 (Figure 4A), which resides between PREX1 promoter and CBE2) has a strong ERα binding site (named here EBE24365) (Supplementary Figure S4C), and contains key marks of a transcriptionally active ERα-bound enhancer (p300 binding and H3K27Ac; Supplementary Figure S6C) (40). We validated reduced transcriptional activity by measuring the expression level of enhancer-associated RNAs (eRNAs) in CBE1-targeted and control cells using qRT-PCR (Figure 4B). This effect was specific, since eRNA levels of ERα-bound enhancer 1830 (40) showed no change after targeting CBE1 (Figure 4B). Moreover, CRISPR-targeting of EBE24365 resulted in ∼2-fold reduction in PREX1 expression (Supplementary Figure S6D and E), and attenuated cell cycle progression of MCF-7 cells, while having a much smaller impact on the proliferation of MDA-MB-231 cells (Figure 4C). This indicates that this regulatory element (transcription interval 24365) is a functional unit linking ERα activity to PREX1 expression. Therefore, this result also suggests that CBE1 is required for optimal PREX1 expression at least in part through EBE24365.
Figure 4.

Enhancer activity but not ERα binding is affected by the local chromatin structure at PREX1 locus. (A) Transcriptional activity measured by GRO-seq in MCF-7 cells transduced with sgRNA1118 (targeting CBE1), sgRNA1830 here serving as a negative control or nontargeting sgRNA for normalization. Heat-blot was generated according to the Z-score. The heatmap shows the change (Z-score of fold-change relative to nontargeting control) in the activity of the putative TREs (intervals showing bi-directional transcription) that were identified by dREG in the PREX1 locus (see also Figure S5B). (B) Relative expression level of eRNA at EBE24365 site was measured by qRT-PCR in MCF-7 cells. Gene expression levels were normalized to TBP. n = 3 ***P < 0.005, *P < 0.05, two-tailed Student's t-test. n.s., not significant. CNT indicates a non-targeting sgRNA. (C) MCF-7 cells stably expressing nontargeting sgRNA (CNT), sgRNA1117 and sgRNA1118 (targeting CBE1), and sgRNA-EBE24365 were examined by ChIP-seq. (D) MCF-7 cells were assayed by competitive proliferation assay after transduced with individual sgRNAs targeting EBE24365. Values on day 6 (T = 6) and day 9 (T = 9) normalized to day 0 (T = 0). ***P < 0.005, **P < 0.01, *P < 0.05, two-tailed Student's t-test. n.s., not significant. (E) Model summarizes the role of chromatin conformation on PREX1 gene regulation.

Enhancer activity but not ERα binding is affected by the local chromatin structure at PREX1 locus. (A) Transcriptional activity measured by GRO-seq in MCF-7 cells transduced with sgRNA1118 (targeting CBE1), sgRNA1830 here serving as a negative control or nontargeting sgRNA for normalization. Heat-blot was generated according to the Z-score. The heatmap shows the change (Z-score of fold-change relative to nontargeting control) in the activity of the putative TREs (intervals showing bi-directional transcription) that were identified by dREG in the PREX1 locus (see also Figure S5B). (B) Relative expression level of eRNA at EBE24365 site was measured by qRT-PCR in MCF-7 cells. Gene expression levels were normalized to TBP. n = 3 ***P < 0.005, *P < 0.05, two-tailed Student's t-test. n.s., not significant. CNT indicates a non-targeting sgRNA. (C) MCF-7 cells stably expressing nontargeting sgRNA (CNT), sgRNA1117 and sgRNA1118 (targeting CBE1), and sgRNA-EBE24365 were examined by ChIP-seq. (D) MCF-7 cells were assayed by competitive proliferation assay after transduced with individual sgRNAs targeting EBE24365. Values on day 6 (T = 6) and day 9 (T = 9) normalized to day 0 (T = 0). ***P < 0.005, **P < 0.01, *P < 0.05, two-tailed Student's t-test. n.s., not significant. (E) Model summarizes the role of chromatin conformation on PREX1 gene regulation. Next, we investigated whether chromatin conformation changes induced by CBE1 are required for optimal activation of PREX1 transcription by EBE24365. Therefore, we first checked the recruitment of ERα to EBE24365 in the presence or absence of CBE1 by ChIP-seq analysis. Surprisingly, we found no clear difference in ERα binding between CBE1-knockout cells compared to control (sgRNA1117 and 1118 induced cells, Figure 4D left and middle panel), suggesting that CBE1 is required for transcriptional activity rather than ERα binding. As a positive control, we directly targeted EBE24365 and obtained the expected significant reduction in ERα binding and specific inhibition of MCF-7 cell proliferation (Figure 4D, right panel, and C). Additionally, targeting CBE2 did not affect ERα binding to EBE24365 (Supplementary Figure S7B). Altogether, our results indicate that CTCF binding to its cognate elements establishes a chromatin conformation structure around the PREX1 promoter that allows efficient transcriptional activation by ERα (Figure 4E).

PREX1 expression is a bona-fide marker of ER-dependency and sensitivity to anti-hormonal treatment of breast cancers

At last, we examined whether PREX1 expression can serve as a biomarker and indicate sensitivity to anti-hormonal treatment of breast cancer. To explore this, we initially examined the expression of PREX1 in the METABRIC cohort (containing 2000 breast cancer cases (55)), and in the TCGA dataset (1095 breast cancer samples). This analysis demonstrated a positive correlation between PREX1 and ESR1 in both datasets (Figure 5A and B). Second, we interrogated DRIVE (56) and Achilles (57) datasets and found out that PREX1 expression levels are associated with ERα-dependency in breast cancer cell lines (Figure 5C). At last, we examined the correlation between PREX1 and disease-free survival of breast cancer patients in the METABRIC dataset. This analysis revealed that PREX1 expression correlates significantly with disease-free survival of ERα-positive breast cancer patients, with low PREX1 levels indicating poor prognosis. (Figure 5D). Moreover, higher levels of PREX1 were associated with better outcome only in endocrine therapy treated cases, and not for patients who did not receive any adjuvant endocrine therapeutics (Figure 5E and F), indicating that PREX1 could be a predictive biomarker for the efficacy of adjuvant endocrine therapy response.
Figure 5.

Clinical parameters associated with PREX1 gene in the METABRIC cohort of breast cancer patients. (A) Correlation plot of PREX1 versus ESR1 expression data. (B) Scatter plot representing the expression of ESR1 and PREX1 (log2 RNA-seq) in 1095 primary breast tumor samples from TCGA. R2, correlation coefficient. (C) Dot plot representing the expression levels of PREX1 (X-axys, log2 RNA-seq) compared to the sensitivity to ESR1 knockdown according to the DRIVE (left panel) and Achilles (right panel) datasets. Each dot represents a breast cancer cell line and they are color-coded based on the expression level of ESR1 (red = maximum, white = median and blue = minimum). (D) Disease-specific survival of ERα-positive breast cancer patients from the METABRIC cohort categorized according to PREX1 expression levels. Log-rank and adjusted Cox regression is shown. (E and F) As in (D), but now patients were additionally stratified for their hormonal treatment status.

Clinical parameters associated with PREX1 gene in the METABRIC cohort of breast cancer patients. (A) Correlation plot of PREX1 versus ESR1 expression data. (B) Scatter plot representing the expression of ESR1 and PREX1 (log2 RNA-seq) in 1095 primary breast tumor samples from TCGA. R2, correlation coefficient. (C) Dot plot representing the expression levels of PREX1 (X-axys, log2 RNA-seq) compared to the sensitivity to ESR1 knockdown according to the DRIVE (left panel) and Achilles (right panel) datasets. Each dot represents a breast cancer cell line and they are color-coded based on the expression level of ESR1 (red = maximum, white = median and blue = minimum). (D) Disease-specific survival of ERα-positive breast cancer patients from the METABRIC cohort categorized according to PREX1 expression levels. Log-rank and adjusted Cox regression is shown. (E and F) As in (D), but now patients were additionally stratified for their hormonal treatment status.

DISCUSSION

In summary, we present here an approach to uncover and characterize the functional role of genetic elements important for chromatin structure in human breast cancer cells. We focused on ERα signaling, being the key-driver in the majority of breast cancers (69,70) and because ERα mainly binds to distal transcriptional enhancers whose activity is likely to be dependent on local chromatin conformation for proper activation of its target genes (15,71). Although genome-wide chromatin conformation capture (Hi-C) (72) and ChIA-PET (4) data support the chromatin interactions mediated by ERα, causal evidence for it is lacking. Therefore, we aimed to target CTCF binding sites in the vicinity of ERα-bound enhancers by using a CRISPR-Cas9-based screening approach. This allowed us to identify four functional CBEs, each of which is required for local transcriptional activity, suggesting that one or more genes in proximity of the target sites can be vital for ERα function. Though our study was limited to two cell lines, it demonstrates the robustness of functional genetic screens in uncovering the functional chromatin structure elements in ERα pathway. Our results pinpoint a key role of CBEs in mediating chromatin interactions that established robust expression of PREX1 by ERα. PREX1 is located at chr20q.13 which is amplified in 8–29% of breast tumors (63). While elevated PREX1 expression levels are observed in luminal type breast cancer cell lines and tumors, basal type cancer cell and tumors, as well as normal breast tissue, have either lower or no PREX1 expression at all (63). PREX1 functions as a Rac1 activator in cancer cells and downregulation of PREX1 levels repressed breast cancer cell proliferation and tumorigenesis (63). It has also been documented that PREX1 levels are inversely correlated with PI3K activation, and positively correlated with ERα expression in human breast tumors (73). Here, we validated the expression correlation of PREX1 and ERα, show that PREX1 is highly expressed in ERα−driven breast cancer cell lines, and correlates with patient survival and sensitivity to anti-hormonal treatment of breast cancer. Although a link between ERα and PREX1 was described before, the exact mechanism how ERα regulates PREX1 remained elusive. For example, Barrio-Real et al. have suggested that the methylation status of PREX1 promoter can determine responsiveness to ERα activity (74). Nevertheless, our study provides a mechanistic explanation to the PREX1 and ERα link in breast cancer, and proposes a key role for CTCF and chromatin conformation in the regulation of PREX1 expression in breast cancer. Since the PREX1 locus is amplified in MCF-7 cells (seven copies; (75)), loss of genomic DNA by generating two simultaneous cuts emerges as a possibility. However, CRISPR-mediated targeting of CBE1 did not affect neither CTCF binding to CBE2 nor ERα-binding to EBE24365 (Figure 3A and 4D). Moreover, targeting CBE1 did not change the interaction frequency between CBE2 and CBE3 (Supplementary Figure S4). In addition, PREX1 locus is also amplified in MDA-MB-231 cells (four copies; (75)) where we do not observe any effect on proliferation upon targeting CBE1 (Figure 1D and Supplementary Figure S2B). Therefore, we conclude that the proliferation effect seen in MCF-7 cells upon CBE1 targeting is due to the interference with CTCF binding. CTCF has a ubiquitous origin in its function. Its contribution to gene expression, heterochromatin spread and higher-order genomic compartmentalization is still under investigation. Nora et al. used an auxin-inducible degron system to elaborate the role of CTCF in gene regulation and organization of chromatin structure by triggering acute CTCF degradation (76). Their study revealed the involvement of CTCF in insulation of TADs and in the regulation of gene expression, while higher-order genomic compartmentalization seemed like a CTCF-independent event. Focusing on CTCF-mediated long-range enhancer-promoter interactions, Ren et al., used RNAi-mediated CTCF downregulation to demonstrate the effect of CTCF on cell type specific genes (77). Moreover, a CRISPR-mediated deletion of CTCF at the TFF1 locus clearly supported the notion that CTCF has a critical role in enhancer–promoter interaction of cell type specific genes (77). Our results here support a critical role of CTCF in ERα-mediated regulation of gene expression by shaping enhancer–promoter interactions in breast cancer cells. Our results indicate the importance of both CTCF and ERα binding for achieving an optimal transcription activation of PREX1. We observed that PREX1 expression level was attenuated in the absence of estrogen signaling (Figure 2E), but more severely affected estrogen deprivation was combined with loss of CTCF binding to CBE1. Moreover, even 6 h of estrogen signaling was not sufficient to stimulate the PREX1 transcription level in the absence CTCF-binding to CBE1 (Figure 2E). Therefore, CTCF-mediated chromatin conformation seems crucial to maintain ERα-mediated gene expression in the PREX1 locus. This conclusion is supported by the data obtained by Zhang et al. who also observed that CTCF binding is required for ERα-mediated transcription of the estrogen responsive gene TFF1 (78). In their work, ERα chromatin binding was blocked upon CTCF knockdown. Differently, we observed that while eRNA expression of the most prominent ERα-bound enhancer (EBE24365) in the PREX1 locus was compromised, ERα-binding to this site was not altered (Figure 4A, B and D). It is known that tightly enclosed chromatin interactions are required to maintain optimal transcriptional levels by inducing efficient cycling of the transcriptional machinery (4,79,80). Additionally, CTCF interacts directly with the large subunit of RNAPII (81) to modulate transcription, allow accurate transcription initiation, and induce transcription initiation pausing. Thus, ERα binding at the PREX1 locus requires CTCF binding to effectively activate transcription. Whether transcriptional initiation, or elongation, or both is affected by CTCF remains to be elucidated. The most prominent structural interaction in the PREX1 locus is observed between CBE2 and CBE3 (see Figure 4E). An additional interaction with lower frequency is observed between CBE1 and CBE2. CRISPR-Cas9 mediated mutations at CBE1 stimulated the interaction frequency between CBE2 and PREX1 promoter, and led to downregulation of PREX1 transcription (Figure 2A). We therefore speculate that the increased CBE2- PREX1 promoter interaction frequency following CBE1 targeting is inhibitory to PREX1 transcriptional activity either because of reduced accessibility of transcription factors to the promoter region, reduced binding of RNAPII or direct inhibition. Interestingly, also preventing CTCF binding to CBE2 (Figure 3A) caused a strong downregulation of PREX1 expression (Supplementary Figure S7A). This effect could not be explained by changes in ERα binding to sites in the region (Supplementary Figure S7B). In contrast, we did observe changes in chromatin conformation which can explain the requirement of CTCF binding to CBE2 for PREX1 expression (Figure 4D). Two interactions are favored as a result of targeting CBE2: between CBE2 and PREX1 promoter and between CBE3 and CBE1 (Supplementary Figure S7C). We speculate that both of these interactions are inhibitory. While CBE3-CBE1 interaction may affect progression of transcription CBE2-PREX1 promoter interaction may affect initiation leading to a greater effect on PREX1 gene expression than targeting CBE1 or EBE24365. When we examined T47D, another ERα positive breast cancer cell line; we found that CTCF did not bound CBE1 in this cell line (Supplementary Figure S8A). In line with this, when CBE1 was targeted either by sgRNA1117 or sgRNA118, no change in PREX1 expression was detected (Supplementary Figure S8B). Also, ERα binding pattern and eRNA activity show differences between MCF-7 and T47D (Supplementary Figure S8C). Therefore, as noted before (15), cell type specific patterns of CTCF binding can shape gene expression changes in response to E2. Our detailed biochemical analysis of PREX1 locus indicated that a correct chromatin structure is required for transcriptional activity of a nearby key ERα-bound enhancer (as determined by eRNA expression), but it is dispensable for ERα binding. This can be explained by reduced interference with promoter activity through a competitive mechanism between an upstream and a downstream CTCF sites. Recently, a similar mechanism was observed for c-MYC, where the promoter of PVT1 competes with the promoter of c-MYC for engagement with nearby enhancer elements, pinpointing PVT1 promoter as a DNA element with potential tumor suppressing activity (82). This suggests that competitive chromatin regulatory mechanisms are common and are employed by cancer cells to stimulate tumor proliferation. Altogether, our study shows that the CRISPR-Cas9 technology is a robust tool to identify and characterize functional regulatory nodes of chromatin conformation in an unbiased fashion. Furthermore, our study adds more evidence for the importance of CTCF-mediated chromatin conformation for gene expression and oncogenic signaling. As our approach here was based on a selected group of CTCF sites residing in the vicinity of ERα-enhancers, it is possible that some ERα-binding sites function to regulate target gene expression independently of CTCF-mediated chromatin conformation. To address this issue, a genome wide CTCF-binding site screen is required. Indeed, it is expected that the approach we presented here will be scaled up and be used to uncover novel functions of chromatin regulatory elements of the human genome under normal and pathological conditions. Nevertheless, our study highlights the importance of chromatin conformation in gene regulation and supports the notion that mutations in these regions can influence local gene expression and cellular phenotypes such as cancer progression.

DATA AVAILABILITY

The sequencing results of GRO-seq (GSE124572), RNA-seq (GSE124573) and ChIP-seq (GSE124667) on MCF-7 cells induced with sgRNA vectors targeting CTCF binding elements are available in the NCBI GEO database. Click here for additional data file.
NameOligo1Oligo2
CTCF_ERa_MCF7_sgRNA680 CACCGTGGGCATAAGGAGTAACAGA AAACTCTGTTACTCCTTATGCCCAC
CTCF_ERa_MCF7_sgRNA1117 CACCGCAATGCCAGGCTCTGCCATC AAACGATGGCAGAGCCTGGCATTGC
CTCF_ERa_MCF7_sgRNA1118 CACCGTTCGCGCTGCTGCTGCCAGA AAACTCTGGCAGCAGCAGCGCGAAC
CTCF_ERa_MCF7_sgRNA1659 CACCGAGACAGCGGTACCTGTCAGG AAACCCTGACAGGTACCGCTGTCTC
CTCF_ERa_MCF7_sgRNA1661 CACCGAAAGACAGCGGTACCTGTCA AAACTGACAGGTACCGCTGTCTTTC
CTCF_ERa_MCF7_sgRNA810 CACCGGACGCTTGGCTGCCACCTGG AAACCCAGGTGGCAGCCAAGCGTCC
CTCF_ERa_MCF7_sgRNA811 CACCGGAGGACGCTTGGCTGCCACC AAACGGTGGCAGCCAAGCGTCCTCC
CBE2 #1 CACCGGCTCGAGCATTGGTGCTCTC AAACGAGAGCACCAATGCTCGAGCC
CBE2 #2 CACCGCGAGCATTGGTGCTCTCTGG AAACCCAGAGAGCACCAATGCTCGC
EBE24365 #1 CACCGAGGAGCAGGCTGACCTGACC AAACGGTCAGGTCAGCCTGCTCCTC
EBE24365 #2 CACCGGTGACTCTGTTCTCCTGGTC AAACGACCAGGAGAACAGAGTCACC
intron #1 CACCGGATGGTGTCTGCGAACAGCG AAACCGCTGTTCGCAGACACCATCC
intron #2 CACCGATTTTAGGGTTGTCCCCACG AAACCGTGGGGACAACCCTAAAATC
PREX1 exon #1 CACCGGCGCGGCCGCGCTAGAATGG AAACCCATTCTAGCGCGGCCGCGCC
PREX1 exon #2 CACCGTGGGCGCCTCCATTCTAGCG AAACCGCTAGAATGGAGGCGCCCAC
PREX1 exon #3 CACCGGCAGAACGTGGCCGACTCAG AAACCTGAGTCGGCCACGTTCTGCC
PREX1 exon #4 CACCGGCATCGCATCCGGCAGAACG AAACCGTTCTGCCGGATGCGATGCC
NamePrimers
sgRNA680_F CTTGACTGGCACCTGGAGAG
sgRNA680_R ATTCCATGGTGTCTGCTGGG
sgRNA680_P5_SeqF ACACTCTTTCCCTACACGACGCTCTTCCGATCT CGTGAT AGAGACCATGTTGTGCCAGG
sgRNA680_P7_SeqR GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTGTTTTGTGTGGGGGCCTACT
sgRNA1118_F GTGAGAGGGTGAGAGTGTGC
sgRNA1118_R CTGGGGAGTTCTGGCTCTTG
sgRNA1118_P5_SeqF ACACTCTTTCCCTACACGACGCTCTTCCGATCT ACATCG CATGCACACACACACACACA
sgRNA1118_P7_SeqR GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTAGCTCAGTCATGTCAGCCAC
sgRNA1659_F CACCAGGCTTGAGAGAAGCA
sgRNA1659_R CATAGCCCTGTACCTCTGCG
sgRNA1659_P5_SeqF ACACTCTTTCCCTACACGACGCTCTTCCGATCT GCCTAA TAGGTTATCTGCTGGGCCCA
sgRNA1659_P7_SeqR GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTGGGAATCGAGGCTCCCTAAG
sgRNA1661_P5_SeqF ACACTCTTTCCCTACACGACGCTCTTCCGATCT TGGTCA TAGGTTATCTGCTGGGCCCA
sgRNA810_F GTTGGAACTGGGAGGAGCTC
sgRNA810_R GACTGGCTTCAGCATTGTGC
sgRNA810_P5_SeqF ACACTCTTTCCCTACACGACGCTCTTCCGATCT CACTGT CAACCCGTTTTGGAAGCCAG
sgRNA810_P7_SeqR GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTGAGTGTGGCGAGCTGGATC
sgRNA811_P5_SeqF ACACTCTTTCCCTACACGACGCTCTTCCGATCT ATTGGC CAACCCGTTTTGGAAGCCAG
ForwardReverse
PREX 1 TGGGACTCATCCCTGAACTC CTGTGCAACAATGGCTTCAT
TBP GGAGAGTTCTGGGATTGTAC CTTATCCTCATGATTACCGCAG
  80 in total

1.  Robust 4C-seq data analysis to screen for regulatory DNA interactions.

Authors:  Harmen J G van de Werken; Gilad Landan; Sjoerd J B Holwerda; Michael Hoichman; Petra Klous; Ran Chachik; Erik Splinter; Christian Valdes-Quezada; Yuva Oz; Britta A M Bouwman; Marjon J A M Verstegen; Elzo de Wit; Amos Tanay; Wouter de Laat
Journal:  Nat Methods       Date:  2012-09-09       Impact factor: 28.547

2.  Functional genetic screens for enhancer elements in the human genome using CRISPR-Cas9.

Authors:  Gozde Korkmaz; Rui Lopes; Alejandro P Ugalde; Ekaterina Nevedomskaya; Ruiqi Han; Ksenia Myacheva; Wilbert Zwart; Ran Elkon; Reuven Agami
Journal:  Nat Biotechnol       Date:  2016-01-11       Impact factor: 54.908

3.  Identification of the Rac-GEF P-Rex1 as an essential mediator of ErbB signaling in breast cancer.

Authors:  Maria Soledad Sosa; Cynthia Lopez-Haber; Chengfeng Yang; Hongbin Wang; Mark A Lemmon; John M Busillo; Jiansong Luo; Jeffrey L Benovic; Andres Klein-Szanto; Hiroshi Yagi; J Silvio Gutkind; Ramon E Parsons; Marcelo G Kazanietz
Journal:  Mol Cell       Date:  2010-12-22       Impact factor: 17.970

Review 4.  The loss of estrogen and progesterone receptor gene expression in human breast cancer.

Authors:  R G Lapidus; S J Nass; N E Davidson
Journal:  J Mammary Gland Biol Neoplasia       Date:  1998-01       Impact factor: 2.673

Review 5.  The hardwiring of development: organization and function of genomic regulatory systems.

Authors:  M I Arnone; E H Davidson
Journal:  Development       Date:  1997-05       Impact factor: 6.868

6.  CCCTC-binding factor confines the distal action of estrogen receptor.

Authors:  Chang S Chan; Jun S Song
Journal:  Cancer Res       Date:  2008-11-01       Impact factor: 12.701

7.  Ensembl 2015.

Authors:  Fiona Cunningham; M Ridwan Amode; Daniel Barrell; Kathryn Beal; Konstantinos Billis; Simon Brent; Denise Carvalho-Silva; Peter Clapham; Guy Coates; Stephen Fitzgerald; Laurent Gil; Carlos García Girón; Leo Gordon; Thibaut Hourlier; Sarah E Hunt; Sophie H Janacek; Nathan Johnson; Thomas Juettemann; Andreas K Kähäri; Stephen Keenan; Fergal J Martin; Thomas Maurel; William McLaren; Daniel N Murphy; Rishi Nag; Bert Overduin; Anne Parker; Mateus Patricio; Emily Perry; Miguel Pignatelli; Harpreet Singh Riat; Daniel Sheppard; Kieron Taylor; Anja Thormann; Alessandro Vullo; Steven P Wilder; Amonida Zadissa; Bronwen L Aken; Ewan Birney; Jennifer Harrow; Rhoda Kinsella; Matthieu Muffato; Magali Ruffier; Stephen M J Searle; Giulietta Spudich; Stephen J Trevanion; Andy Yates; Daniel R Zerbino; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

8.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

9.  Enhancer transcripts mark active estrogen receptor binding sites.

Authors:  Nasun Hah; Shino Murakami; Anusha Nagari; Charles G Danko; W Lee Kraus
Journal:  Genome Res       Date:  2013-05-01       Impact factor: 9.043

10.  P-REX1 creates a positive feedback loop to activate growth factor receptor, PI3K/AKT and MEK/ERK signaling in breast cancer.

Authors:  L M Dillon; J R Bean; W Yang; K Shee; L K Symonds; J M Balko; W H McDonald; S Liu; A M Gonzalez-Angulo; G B Mills; C L Arteaga; T W Miller
Journal:  Oncogene       Date:  2014-10-06       Impact factor: 9.867

View more
  9 in total

Review 1.  Intrinsic and Extrinsic Factors Governing the Transcriptional Regulation of ESR1.

Authors:  David K Lung; Rebecca M Reese; Elaine T Alarid
Journal:  Horm Cancer       Date:  2020-06-26       Impact factor: 3.869

2.  PtdIns(3,4,5)P3-dependent Rac exchanger 1 (P-Rex1) promotes mammary tumor initiation and metastasis.

Authors:  Nuthasuda Srijakotre; Heng-Jia Liu; Max Nobis; Joey Man; Hon Yan Kelvin Yip; Antonella Papa; Helen E Abud; Kurt I Anderson; Heidi C E Welch; Tony Tiganis; Paul Timpson; Catriona A McLean; Lisa M Ooms; Christina A Mitchell
Journal:  Proc Natl Acad Sci U S A       Date:  2020-10-23       Impact factor: 11.205

Review 3.  Tales from topographic oceans: topologically associated domains and cancer.

Authors:  Moray J Campbell
Journal:  Endocr Relat Cancer       Date:  2019-11       Impact factor: 5.678

Review 4.  The double dealing of cyclin D1.

Authors:  Guergana Tchakarska; Brigitte Sola
Journal:  Cell Cycle       Date:  2019-12-29       Impact factor: 4.534

Review 5.  Using CRISPR to understand and manipulate gene regulation.

Authors:  Ersin Akinci; Marisa C Hamilton; Benyapa Khowpinitchai; Richard I Sherwood
Journal:  Development       Date:  2021-04-29       Impact factor: 6.868

Review 6.  Genome-Wide Estrogen Receptor Activity in Breast Cancer.

Authors:  Anca M Farcas; Sankari Nagarajan; Sabina Cosulich; Jason S Carroll
Journal:  Endocrinology       Date:  2021-02-01       Impact factor: 4.736

Review 7.  Gene editing and its applications in biomedicine.

Authors:  Guanglei Li; Xiangyang Li; Songkuan Zhuang; Liren Wang; Yifan Zhu; Yangcan Chen; Wen Sun; Zeguang Wu; Zhuo Zhou; Jia Chen; Xingxu Huang; Jin Wang; Dali Li; Wei Li; Haoyi Wang; Wensheng Wei
Journal:  Sci China Life Sci       Date:  2022-02-18       Impact factor: 10.372

Review 8.  Strategies to overcome the main challenges of the use of CRISPR/Cas9 as a replacement for cancer therapy.

Authors:  Mohammed Fatih Rasul; Bashdar Mahmud Hussen; Abbas Salihi; Bnar Saleh Ismael; Paywast Jamal Jalal; Anna Zanichelli; Elena Jamali; Aria Baniahmad; Soudeh Ghafouri-Fard; Abbas Basiri; Mohammad Taheri
Journal:  Mol Cancer       Date:  2022-03-03       Impact factor: 27.401

9.  Comparison and Characterization of a Cell Wall Invertase Promoter from Cu-Tolerant and Non-Tolerant Populations of Elsholtzia haichowensis.

Authors:  Rongxiang Liu; Jing Zhao; Zhongrui Xu; Zhiting Xiong
Journal:  Int J Mol Sci       Date:  2021-05-18       Impact factor: 5.923

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.