| Literature DB >> 34146516 |
Joseph S Baxter1, Nichola Johnson2, Katarzyna Tomczyk2, Andrea Gillespie2, Sarah Maguire3, Rachel Brough4, Laura Fachal5, Kyriaki Michailidou6, Manjeet K Bolla7, Qin Wang7, Joe Dennis7, Thomas U Ahearn8, Irene L Andrulis9, Hoda Anton-Culver10, Natalia N Antonenkova11, Volker Arndt12, Kristan J Aronson13, Annelie Augustinsson14, Heiko Becher15, Matthias W Beckmann16, Sabine Behrens17, Javier Benitez18, Marina Bermisheva19, Natalia V Bogdanova20, Stig E Bojesen21, Hermann Brenner22, Sara Y Brucker23, Qiuyin Cai24, Daniele Campa25, Federico Canzian26, Jose E Castelao27, Tsun L Chan28, Jenny Chang-Claude29, Stephen J Chanock8, Georgia Chenevix-Trench30, Ji-Yeob Choi31, Christine L Clarke32, Sarah Colonna33, Don M Conroy5, Fergus J Couch34, Angela Cox35, Simon S Cross36, Kamila Czene37, Mary B Daly38, Peter Devilee39, Thilo Dörk40, Laure Dossus41, Miriam Dwek42, Diana M Eccles43, Arif B Ekici44, A Heather Eliassen45, Christoph Engel46, Peter A Fasching47, Jonine Figueroa48, Henrik Flyger49, Manuela Gago-Dominguez50, Chi Gao51, Montserrat García-Closas8, José A García-Sáenz52, Maya Ghoussaini53, Graham G Giles54, Mark S Goldberg55, Anna González-Neira56, Pascal Guénel57, Melanie Gündert58, Lothar Haeberle16, Eric Hahnen59, Christopher A Haiman60, Per Hall61, Ute Hamann62, Mikael Hartman63, Sigrid Hatse64, Jan Hauke65, Antoinette Hollestelle66, Reiner Hoppe67, John L Hopper68, Ming-Feng Hou69, Hidemi Ito70, Motoki Iwasaki71, Agnes Jager66, Anna Jakubowska72, Wolfgang Janni73, Esther M John74, Vijai Joseph75, Audrey Jung17, Rudolf Kaaks17, Daehee Kang76, Renske Keeman77, Elza Khusnutdinova78, Sung-Won Kim79, Veli-Matti Kosma80, Peter Kraft51, Vessela N Kristensen81, Katerina Kubelka-Sabit82, Allison W Kurian74, Ava Kwong83, James V Lacey84, Diether Lambrechts85, Nicole L Larson86, Susanna C Larsson87, Loic Le Marchand88, Flavio Lejbkowicz89, Jingmei Li90, Jirong Long24, Artitaya Lophatananon91, Jan Lubiński92, Arto Mannermaa80, Mehdi Manoochehri62, Siranoush Manoukian93, Sara Margolin94, Keitaro Matsuo70, Dimitrios Mavroudis95, Rebecca Mayes5, Usha Menon96, Roger L Milne54, Nur Aishah Mohd Taib97, Kenneth Muir91, Taru A Muranen98, Rachel A Murphy99, Heli Nevanlinna98, Katie M O'Brien100, Kenneth Offit101, Janet E Olson86, Håkan Olsson14, Sue K Park102, Tjoung-Won Park-Simon40, Alpa V Patel103, Paolo Peterlongo104, Julian Peto105, Dijana Plaseska-Karanfilska106, Nadege Presneau42, Katri Pylkäs107, Brigitte Rack73, Gad Rennert89, Atocha Romero108, Matthias Ruebner16, Thomas Rüdiger109, Emmanouil Saloustros110, Dale P Sandler100, Elinor J Sawyer111, Marjanka K Schmidt112, Rita K Schmutzler65, Andreas Schneeweiss113, Minouk J Schoemaker114, Mitul Shah5, Chen-Yang Shen115, Xiao-Ou Shu24, Jacques Simard116, Melissa C Southey117, Jennifer Stone118, Harald Surowy119, Anthony J Swerdlow120, Rulla M Tamimi121, William J Tapper43, Jack A Taylor122, Soo Hwang Teo123, Lauren R Teras103, Mary Beth Terry124, Amanda E Toland125, Ian Tomlinson126, Thérèse Truong57, Chiu-Chen Tseng60, Michael Untch127, Celine M Vachon128, Ans M W van den Ouweland129, Sophia S Wang84, Clarice R Weinberg130, Camilla Wendt131, Stacey J Winham132, Robert Winqvist107, Alicja Wolk87, Anna H Wu60, Taiki Yamaji71, Wei Zheng24, Argyrios Ziogas10, Paul D P Pharoah133, Alison M Dunning5, Douglas F Easton133, Stephen J Pettitt4, Christopher J Lord4, Syed Haider2, Nick Orr3, Olivia Fletcher134.
Abstract
A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74-0.81, p = 3.1 × 10-31).Entities:
Keywords: breast cancer risk; functional annotation; risk locus
Mesh:
Substances:
Year: 2021 PMID: 34146516 PMCID: PMC8322933 DOI: 10.1016/j.ajhg.2021.05.013
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025
Figure 2Luciferase reporter assays following transient transfection of PRE2 and PRE3, REF and ALT constructs, into MCF-7, T-47D, and HepG2 cells
The PRE containing the reference (REF) allele at each SNP was cloned downstream of the IGFBP5 promoter to generate reference (REF) luciferase constructs. Alternative (ALT) alleles were generated by site-directed mutagenesis. Coordinates of the PREs are given in Table S2, diagrams are in Figure S1. Error bars denote standard deviations based on three independent experiments each done in triplicate. p values were determined by t tests and a Bonferroni correction was applied to account for multiple testing. Comparing each PRE containing construct to IGFBP5-PROM, ∗p < 0.0056, ∗∗p ≤ 0.00056; comparing ALT to REF constructs #p < 0.0056.
Figure 3Luciferase reporter assays following transient transfection of constructs with allelic variants at PRE2B and PRE2DEL into MCF-7 and T-47D cells
Reporter gene constructs with all possible combinations of rs199804270 and rs72951831 and esv3594306 were generated by site-directed mutagenesis of the naturally occurring haplotypes at PRE2B and PRE2DEL (Material and methods) into MCF-7 (A) and T-47D (B) cells. Coordinates of the PREs are given in Table S2, diagrams are in Figure S1. Error bars denote standard deviations based on three independent experiments each done in triplicate. 3-way ANOVA was used to compare each variant, adjusted for the other two variants, a Bonferroni correction was applied to account for multiple testing. ∗p < 0.017, ∗∗p ≤ 0.0017.
Figure 4Systematic CRISPRa analysis of 2q35 putative regulatory elements
MCF-7 cells expressing dCas9-VPR were transduced with CRISPR sgRNAs targeting: (A) the PRE1 tag SNP rs13387042 (negative control), the IGFBP5 promoter and the PRE1 causal variant rs4442975 (positive control), and (B and C) a series of sites mapping across PRE2 (Figure 1B). Relative gene expression (compared to vector alone) was calculated using the ΔΔCT method. Full details of guide RNAs are listed in Table S3. Error bars denote standard deviations based on three independent experiments each done in triplicate. p values were determined by t tests and a Bonferroni correction was applied to account for multiple testing; (A) ∗p < 0.017, ∗∗p < 0.0017, ∗∗∗p < 0.00017; (B and C) ∗p < 0.0056, ∗∗p ≤ 0.00056.
Figure 12q35 breast cancer risk locus
(A) Fine-scale mapping at 2q35 identified three high-confidence (p < 1 × 10−6) signals annotated by rs4442975 (signal 1), rs138522813 (signal 2), and rs5838651 (signal 3). The putative target gene (IGFBP5) maps 360 kb, 399 kb, and 703 kb from signals 1, 2, and 3, respectively. All coordinates are based on GRCh37/hg19.
(B) Putative regulatory element 2 (PRE2; chr2:217,955,458–217,957,767) at signal 2 colocalizes with four highly correlated variants: three single-nucleotide polymorphisms (SNPs; rs572022984, rs199804270, and rs72951831) and a 1.4 kb insertion/deletion variant (esv3594306; indicated by a black bar). A fourth SNP (rs138522813) maps outside the proposed boundaries of PRE2. Regions of open chromatin (DNase I) and ChIP-seq binding peaks for transcription factors are shown as gray bars where the shade of gray indicates the strength of the ChIP-seq peak (light gray, weak binding; dark gray, strong binding). Also shown (yellow bars) are the coordinates of three reporter gene constructs (PRE2A, PRE2B, and PRE2DEL) and the locations of sequences targeted by nine small guide (sg)RNAs.
(C) PRE3 (chr2:218,305,944–218,306,443) indicated by a blue bar colocalizes with two SNPs (rs12694417 and rs12988242). Regions of open chromatin and ChIP-seq binding peaks are as in (B).
Association of rs4442975, rs138522813 and rs5838651 among women of European and Asian ancestry
| rs4442975 | 0.49 | 36,471 | 37,251 | 0.88 | 0.86–0.89 | 4.9 × 10−35 | 0.48 | 57,920 | 46,226 | 0.88 | 0.87–0.90 | 1.7 × 10−42 | 94,391 | 83,477 | 0.88 | 0.87–0.89 | 1.3 × 10−75 | 0.46 | 0.49 |
| rs138522813 | 0.035 | – | – | 0.81 | 0.76–0.86 | 2.2 × 10−12 | 0.03 | – | – | 0.79 | 0.75–0.83 | 3.0 × 10−21 | – | – | 0.80 | 0.77–0.83 | 5.5 × 10−32 | 0.62 | 0.035 |
| rs5838651 | 0.3 | – | – | 1.07 | 1.05–1.10 | 4.2 × 10−9 | 0.3 | – | – | 1.06 | 1.04–1.08 | 4.6 × 10−9 | – | – | 1.07 | 1.05–1.08 | 1.5 × 10−16 | 0.40 | 0.3 |
| rs4442975 | 0.87 | 4,994 | 5,866 | 0.96 | 0.88–1.04 | 0.29 | 0.88 | 7,487 | 6,892 | 0.93 | 0.87–1.01 | 0.07 | 12,481 | 12,758 | 0.94 | 0.89–1.00 | 0.04 | 0.68 | 0.02 |
| rs138522813 | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – |
| rs5838651 | 0.61 | – | – | 1.03 | 0.97–1.10 | 0.29 | 0.62 | – | – | 1.09 | 1.04–1.14 | 0.0005 | – | – | 1.07 | 1.03–1.11 | 0.0008 | 0.18 | 0.95 |
MAF, minor allele frequency
OR, per allele odds ratio
P1, test of H0 no association between SNP and breast cancer risk
Phet1, test of H0 no difference between iCOGS and OncoArray data
Phet2, test of H0 no difference between European and Asian data
rs138522813-Del allele is extremely rare in Asians (MAF ~0.05%) and was not analyzed in Asian data
Figure 5Increasing the local density of activator TF domains with dCas9-VPR or by juxtaposition of two ChIP-seq peaks is associated with increased expression of IGFBP5
(A) Introducing dCas9 fused to a VPR activator domain at the ERα, FOXA1, GATA3 ChIP-seq peak at the centromeric end of the deletion breakpoint (PRE2-1 and PRE2-2), proximal to, or at, the ERα, FOXA1, GATA3 ChIP-seq peak at the telomeric end of the deletion breakpoint (PRE2-5 and PRE2-8, respectively) increases expression of IGFBP5 in MCF-7 cells.
(B) Deletion of 1.4 kb on the ALT allele of esv3594306 juxtaposes these two ERα, FOXA1, GATA3 ChIP-seq peaks.
In each case (A and B) this increases the density of activating TF domains in the region and is associated with increased expression of IGFBP5.