| Literature DB >> 31910858 |
Jonathan Beesley1, Haran Sivakumaran1, Mahdi Moradi Marjaneh1,2, Luize G Lima1, Kristine M Hillman1, Susanne Kaufmann1, Natasha Tuano3, Nehal Hussein1,4, Sunyoung Ham1, Pamela Mukhopadhyay1, Stephen Kazakoff1, Jason S Lee1, Kyriaki Michailidou5,6, Daniel R Barnes5, Antonis C Antoniou5, Laura Fachal7, Alison M Dunning7, Douglas F Easton5,7, Nicola Waddell1, Joseph Rosenbluh3, Andreas Möller1, Georgia Chenevix-Trench1, Juliet D French8, Stacey L Edwards9.
Abstract
BACKGROUND: Genome-wide association studies have identified 196 high confidence independent signals associated with breast cancer susceptibility. Variants within these signals frequently fall in distal regulatory DNA elements that control gene expression.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31910858 PMCID: PMC6947858 DOI: 10.1186/s13059-019-1877-y
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1VCHi-C and PCHi-C in human breast cell lines. a Schematic of a hypothetical breast cancer risk signal and plausible chromatin interactions. Chromatin interactions are shown as blue arcs. Genes are depicted as black arrows. CHi-C baits are depicted as gray boxes. CCVs are shown as red vertical lines. The colored boxes illustrate variant-interacting regions (VIRs) or promoter-interacting regions (PIRs). b Principle component analysis of the CHiCAGO-scored interactions in VCHi-C or PCHi-C biological replicates. c Distribution of the CHiCAGO-scored interaction number per bait per cell line (combined biological replicates). d Agglomerative hierarchical clustering for the VCHi-C and PCHi-C in six breast cell lines
Fig. 2Promoter-interacting regions (PIRs) are enriched for regulatory features. Heatmaps showing PIR enrichment for a ATAC-seq peaks in breast cell lines, b histone marks by ChIP-seq in available breast cell lines, and c transcription factor binding in available breast cell lines, expressed as z-scores. The red outlines highlight key enrichment signals. d The number of interactions between expression single nucleotide polymorphisms (eSNPs) and associated target genes (observed) compared to randomly assigned interactions (random), binned by interaction distance. Asterisks represent the significance of enrichment of observed versus randomized interacting regions (permutation test *p < 0.05, **p < 0.01, ***p < 0.001)
Fig. 3PCHi-C Peaky fine-mapping prioritizes CCVs at 10q14 and 6p22.3. a Chromatin interactions at 9q33.1 in MCF7 breast cancer cells. Topologically associating domains (TADs; Additional file 2: Table S16) are shown as horizontal gray bars above GENCODE-annotated coding (blue) and non-coding (green) genes. The PCHi-C bait is depicted as a black box. CCVs are shown as red vertical lines. The ATAC-seq track is shown as a dark blue histogram. Peaky-defined MPPC values (from PCHi-C BaitID: 479054) are plotted with the prioritized CCV overlaid as a red vertical line. CHiCAGO-scored interactions are shown as black arcs. The dashed red outline highlights the prioritized CCV rs811688 and the dashed gray outline the target gene (PAPPA). b Chromatin interactions at 10q14 in T47D breast cancer cells. Topologically associating domains (TADs) are shown as horizontal gray bars above GENCODE-annotated coding (blue) and non-coding (green) genes. The PCHi-C bait is depicted as a black box. CCVs are shown as red vertical lines. The ATAC-seq track is shown as a dark blue histogram. Peaky-defined MPPC values (from PCHi-C BaitID: 486406) are plotted with the prioritized CCVs overlaid as red vertical lines. CHiCAGO-scored interactions are shown as black arcs. The dashed red outline highlights the prioritized CCVs and the dashed gray outline the target gene (GATA3). c Zoomed in view of prioritized CCVs at 10q14. HindIII fragments are shown as gray bars with their fragment IDs. CCVs are shown as red vertical lines. Black histograms denote ChIP-seq data from T47D cells for H3K27ac, GATA3, and estrogen receptor (ER; cells treated with DMSO or 17 beta-estradiol (EST)). The dashed gray outline highlights CCV rs12765282. d Position weight matrix of the GATA3 binding site from JASPAR (red arrowhead indicates the CCV position in the motif), with homology to the risk (t) and protective (g) alleles of rs12765282 colored below
Fig. 4Sequential CHiCAGO and Peaky fine-mapping prioritizes CCVs and target genes. a Chromatin interactions at 1p22.3 in Hs578T breast cancer cells. Topologically associating domains (TADs) are shown as horizontal gray bars above GENCODE-annotated coding (blue) and non-coding (green) genes. The VCHi-C or PCHi-C baits are depicted as black boxes. Risk signals 1 and 2 are numbered, and the CCVs within each signal are shown as colored vertical lines. The ATAC-seq track is shown as a dark blue histogram. Peaky-defined MPPC values (from specified BaitIDs) are plotted with the prioritized gene overlaid as a dark blue vertical line or prioritized CCVs overlaid as royal blue vertical lines. CHiCAGO-scored interactions for specified BaitIDs are shown as black arcs. The dashed red outline highlights the prioritized CCVs and the dashed gray outline the target gene (LMO4). b Zoomed in view of prioritized signal 2 CCVs at 1p22.3. VCHi-C baits are shown as gray bars with their fragment IDs. CCVs are shown as blue vertical lines. Black histograms denote DNase I hypersensitivity sites or ChIP-seq data for H3K4me1 and CTCF binding from HMEC cells. The dashed gray outline highlights CCV rs3008455. c Position weight matrix of the CTCF binding site from JASPAR (red arrowhead indicates the CCV position in the motif), with homology to the risk (g) and protective (a) alleles of rs3008455 colored below. d Chromatin interactions at 16q24.2 in B80T5 normal breast cells. Topologically associating domains (TADs) are shown as horizontal gray bars above GENCODE-annotated coding (blue) and non-coding (green) genes. The VCHi-C or PCHi-C baits are depicted as black boxes. CCVs are shown as red vertical lines. The ATAC-seq track is shown as a dark blue histogram. Peaky-defined MPPC values (from specified BaitIDs) are plotted with the prioritized genes overlaid as dark blue or green vertical lines and prioritized CCVs overlaid as red vertical lines. CHiCAGO-scored interactions for specified BaitIDs are shown as black arcs. The dashed red outline highlights the prioritized CCVs and the dashed gray outline the prioritized target genes (MTHFSD, FOXC2, FOXL1, and FOXC2-AS1)
Fig. 5Molecular analysis of signal 1 CCVs at 12q24. a Chromatin interactions in MCF7 cells. Topologically associating domains (TADs) are shown as horizontal gray bars above GENCODE-annotated coding (blue) and non-coding (green) genes. The PCHi-C baits are depicted as black boxes. Risk signals 1–4 are numbered, and the CCVs within each signal are shown as colored vertical lines. ENCODE ChIP-seq data for available histone marks are depicted as gray boxes. The ATAC-seq track is shown as a dark blue histogram. ESR1, GATA3, and FOXA1 binding are shown as black histograms. Peaky defined MPPC values (from PCHi-C BaitID: 596031) are plotted with the prioritized CCVs overlaid as red vertical lines. CHiCAGO-scored interactions are shown as black arcs. The dashed red outline highlights signal 1 CCVs and the dashed gray outline the target gene (TBX3). b The 12q24 enhancer was repressed by targeting dCas9-KRAB to the enhancer in MCF7 cells with two different CRISPRi single-guide (sg) RNAs (SgEnh1 and SgEnh2). PgCON contains a non-targeting control sgRNA. Gene expression of TBX3, TBX5, and MED13L was measured by qPCR and normalized to GUSB. Error bars represent the SEM (n = 3). p values were determined by two-way ANOVA followed by Dunnett’s multiple-comparison test (**p < 0.01). c Luciferase reporter assays following transient transfection of MCF7 cells. The 12q24 enhancer containing either the risk or protective (Prot.) haplotype was cloned into TBX3 promoter-driven luciferase constructs (TBX3 prom). Error bars represent the SEM (n = 3). p values were determined by two-way ANOVA followed by Dunnett’s multiple-comparison test (**p < 0.01). d Allele-specific DNase I hypersensitivity at CCV rs1391721 in heterozygous MCF7 cells. The depth of reads containing the risk (red) and protective (blue) alleles are shown. e EMSAs for signal 1 CCVs to detect allele-specific binding of nuclear proteins. Labeled oligonucleotide duplexes were incubated with MCF7 nuclear extract. Red arrowheads show the bands of different mobility detected between risk (R) and protective (P) alleles. f Position weight matrix of the GATA3 binding site from JASPAR, with homology to the risk (a) and protective (g) alleles of rs1391721 colored below. g Allele-specific GATA3 ChIP-PCR results assessed at CCV rs1391721 in heterozygous MCF7 cells. Error bars represent the SEM (n = 3). p values were determined by a two-tailed Student’s t test (**p < 0.01). h Allelic discrimination plot of the GATA3 ChIP in MCF7 cells. Genomic DNA extracted from homozygous T47D and Hs578T breast cancer cells were used as controls
Fig. 6Altered TBX3 levels affect breast cell growth and tumor formation. a Representative images of colonies grown in soft agar for HMLE-control (GFP CON) and HMLE-TBX3-overexpressing cells (TBX3 ORF). The graph depicts the total number of HMLE colonies formed. Error bars represent SEM (n = 2). b Representative images of colonies grown in soft agar for HMLE-control (PgCON) and HMLE-CRISPR/Cas9 TBX3-edited cells (SgTBX3-C1/C2). The graph depicts the total number of HMLE colonies formed. Error bars represent SEM (n = 2). c Representative images of colonies grown in soft agar for MCF7-control (GFP CON) and MCF7-TBX3-overexpressing cells (TBX3 ORF). The graph depicts the total number of MCF7 colonies formed. Error bars represent SEM (n = 4). d Representative images of colonies grown in soft agar for MCF7-control (PgCON) and MCF7-dCas9-KRAB TBX3 repressed cells (SgTBX3-P1/P2). The graph depicts the total number of MCF7 colonies formed. Error bars represent SEM (n = 4). a–d p values were determined by two-way ANOVA followed by Dunnett’s multiple-comparison test (**p < 0.01, ***p < 0.001, ****p < 0.0001). e MCF7-control (PgCON) or MCF7-dCas9-KRAB TBX3-repressed cells (SgTBX3-P1/P2) were injected into the mammary fat pads of nude mice. Tumor growth curves for each group are shown. Values are shown as average tumor volumes at each time point. Error bars represent SEM (n = 8–9 mice per group). f Tumors of individual mice were dissected at day 44 post-injection. The five largest tumors of each group are shown. The scale bar represents 1 cm. g Plot of the individual weights of tumors with mean and SEM shown by cross-bar and errors. e, g Mann-Whitney U test was used to compare the differences between the groups (*p < 0.05, ****p < 0.0001)
Independent breast cancer risk signals with ≤ 2 candidate protein-coding genes
| Cytoband | Locus | Signal | Target gene/s | Cytoband | Locus | Signal | Target gene/s |
|---|---|---|---|---|---|---|---|
| 1p22.3 | chr1:87656923_88656923 | 2 | 8q24.21 | chr8:127424659_130041931 | 3 | ||
| 1q32.1 | chr1:200937832_201937832 | 1 | 9q31.2 | chr9:109803808_111395353 | 1 | ||
| 2p23.3 | chr2:28670676_29670676 | 1 | 9q33.1 | chr9:118813486_119813486 | 1 | ||
| 2p24.1 | chr2:18815791_19820803 | 1 | 10p14 | chr10:8588113_9588113 | 1 | ||
| 2q35 | chr2:217405832_218796508 | 1 | 10q25.2 | chr10:114273927_115286154 | 1 | ||
| 2q35 | chr2:217405832_218796508 | 3 | 10q26.12 | chr10:122593901_123849324 | 2 | ||
| 3p24.1 | chr3:26827965_28285247 | 2 | 11p15.5 | chr11:1398664_2442575 | 1 | ||
| 3q23 | chr3:140612859_141612859 | 1 | 11q13.3 | chr11:68831418_69879161 | 1 | ||
| 4p14 | chr4:38312876_39312876 | 1 | 11q13.3 | chr11:68831418_69879161 | 2 | ||
| 4q24 | chr4:105569013_106856761 | 1 | 11q13.3 | chr11:68831418_69879161 | 3 | ||
| 5p13.3 | chr5:32067732_33067732 | 1 | 11q24.3 | chr11:128952507_129961171 | 1 | ||
| 5p15.33 | chr5:779790_1797488 | 1 | 12p11.22 | chr12:27639846_29034415 | 1 | ||
| 5p15.33 | chr5:779790_1797488 | 2 | 12p13.1 | chr12:13913931_14913931 | 1 | ||
| 5q11.1 | chr5:49141645_50695093 | 2 | 12q22 | chr12:95527759_96527759 | 1 | ||
| 5q11.2 | chr5:55531884_56587883 | 1 | 12q24.21 | chr12:115336522_116336522 | 1 | ||
| 5q11.2 | chr5:55531884_56587883 | 4 | 12q24.21 | chr12:115336522_116336522 | 2 | ||
| 5q11.2 | chr5:55531884_56587883 | 5 | 12q24.21 | chr12:115336522_116336522 | 3 | ||
| 5q11.2 | chr5:57684061_58865569 | 1 | 13q13.1 | chr13:32468810_33472626 | 1 | ||
| 5q11.2 | chr5:57684061_58865569 | 2 | 13q22.1 | chr13:73464519_74464519 | 1 | ||
| 5q33.3 | chr5:157730013_158744083 | 1 | 14q13.3 | chr14:36632769_37635752 | 1 | ||
| 6p22.3 | chr6:15899557_16899557 | 1 | 14q24.1 | chr14:68117194_69534682 | 1 | ||
| 6q14.1 | chr6:81628386_82795951 | 1 | 14q24.1 | chr14:68117194_69534682 | 2 | ||
| 6q23.1 | chr6:129849119_130849119 | 1 | 16q12.2 | chr16:53300954_54355291 | 2 | ||
| 6q25 | chr6:151418856_152937016 | 2 | 16q23.2 | chr16:80148327_81150805 | 1 | ||
| 6q25 | chr6:151418856_152937016 | 3 | 18q11.2 | chr18:23832476_25075396 | 1 | ||
| 6q25 | chr6:151418856_152937016 | 5 | 19q12 | chr19:29777729_30777729 | 1 | ||
| 6q25.1 | chr6:149086328_150086328 | 1 | 19q13.31 | chr19:43783447_44786513 | 1 | ||
| 7q22.1 | chr7:101054599_102054599 | 1 | 20p12.3 | chr20:5448227_6448227 | 1 | ||
| 7q34 | chr7:139442304_140442304 | 1 | 21q21.1 | chr21:16073983_17073983 | 1 | ||
| 8p12 | chr8:29009616_30009616 | 1 | 21q21.1 | chr21:16073983_17073983 | 2 | ||
| 8q21.11 | chr8:75730301_76917937 | 2 | 22q13.31 | chr22:45783297_46783297 | 1 | ||
| 8q23.3 | chr8:116709548_117709548 | 1 | 22q13.31 | chr22:45783297_46783297 | 2 | ||
| 8q24.21 | chr8:127424659_130041931 | 2 |