| Literature DB >> 27600471 |
Hatef Darabi1, Jonathan Beesley2, Arnaud Droit3, Siddhartha Kar4, Silje Nord5, Mahdi Moradi Marjaneh2, Penny Soucy6, Kyriaki Michailidou7,8, Maya Ghoussaini4, Hanna Fues Wahl1, Manjeet K Bolla7, Qin Wang7, Joe Dennis7, M Rosario Alonso9, Irene L Andrulis10,11, Hoda Anton-Culver12, Volker Arndt13, Matthias W Beckmann14, Javier Benitez15,16, Natalia V Bogdanova17, Stig E Bojesen18,19,20, Hiltrud Brauch21,22,23, Hermann Brenner13,23,24, Annegien Broeks25, Thomas Brüning26, Barbara Burwinkel27,28, Jenny Chang-Claude29,30, Ji-Yeob Choi31,32, Don M Conroy4, Fergus J Couch33, Angela Cox34, Simon S Cross35, Kamila Czene1, Peter Devilee36,37, Thilo Dörk38, Douglas F Easton4,7, Peter A Fasching14,39, Jonine Figueroa40,41, Olivia Fletcher42,43, Henrik Flyger44, Eva Galle45,46, Montserrat García-Closas41, Graham G Giles47,48, Mark S Goldberg49,50, Anna González-Neira15, Pascal Guénel51, Christopher A Haiman52, Emily Hallberg53, Ute Hamann54, Mikael Hartman55,56, Antoinette Hollestelle57, John L Hopper48, Hidemi Ito58, Anna Jakubowska59, Nichola Johnson42,43, Daehee Kang31,32,60, Sofia Khan61, Veli-Matti Kosma62,63,64, Mieke Kriege57, Vessela Kristensen5,65,66, Diether Lambrechts45,46, Loic Le Marchand67, Soo Chin Lee68,69, Annika Lindblom70, Artitaya Lophatananon71, Jan Lubinski59, Arto Mannermaa62,63,64, Siranoush Manoukian72, Sara Margolin73, Keitaro Matsuo74, Rebecca Mayes4, James McKay75, Alfons Meindl76, Roger L Milne47,48, Kenneth Muir71,77, Susan L Neuhausen78, Heli Nevanlinna61, Curtis Olswold53, Nick Orr42, Paolo Peterlongo79, Guillermo Pita9, Katri Pylkäs80,81, Anja Rudolph29, Suleeporn Sangrajrang82, Elinor J Sawyer83, Marjanka K Schmidt25, Rita K Schmutzler84,85,86, Caroline Seynaeve57, Mitul Shah4, Chen-Yang Shen87,88, Xiao-Ou Shu89, Melissa C Southey90, Daniel O Stram52, Harald Surowy27,28, Anthony Swerdlow43,91, Soo H Teo92,93, Daniel C Tessier94, Ian Tomlinson95, Diana Torres54,96, Thérèse Truong51, Celine M Vachon53, Daniel Vincent94, Robert Winqvist80,81, Anna H Wu52, Pei-Ei Wu88, Cheng Har Yip93, Wei Zheng89, Paul D P Pharoah4,7, Per Hall1, Stacey L Edwards2, Jacques Simard6, Juliet D French2, Georgia Chenevix-Trench2, Alison M Dunning4.
Abstract
Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10(-15))) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10(-09), r(2) = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10(-11), r(2) = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27600471 PMCID: PMC5013272 DOI: 10.1038/srep32512
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Association Results for Overall Breast Cancer Risk.
Directly genotyped SNPs are shown as filled black circles, and imputed SNPs (r2 > 0.3, MAF > 0.02) are shown as open red circles, plotted as the negative log of the P value against relative position across the locus. A schematic of the gene structures is shown. Signals, encompassing all SNPs with a likelihood ratio of <1:100 compared with the most significant SNP, are labelled and are shown as grey regions. The dashed purple line represents genome-wide significance (P < 5 × 10−08).
Association result of independent signal among European and the set of highly correlated common variants.
| Population | European | Candidate Causal SNPs | Position | r2 | RL | Info | P-valuee | OR (95% CI) | |
|---|---|---|---|---|---|---|---|---|---|
| rs2787497 | 53176211 | 0.89 | 0.01 | 1 | 9.27E-13 | 0.93 (0.91, 0.95) | |||
|
| rs2787486 | rs8082622 | 53177567 | 0.90 | 0.01 | 1 | 7.06E-13 | 0.93 (0.91, 0.94) | |
| rs2529510 | 53182046 | 0.84 | 0.13 | 0.91 | 6.92E-14 | 0.92 (0.90, 0.94) | |||
|
| A/C (0.28) | rs244373 | 53184949 | 0.99 | 0.13 | 0.98 | 7.30E-14 | 0.92 (0.90, 0.94) | |
| rs244358 | 53192231 | 0.89 | 0.01 | 1 | 9.68E-13 | 0.93 (0.91, 0.95) | |||
|
|
| 8.96E-15 | rs187242 | 53192946 | 0.98 | 0.01 | 0.98 | 8.17E-13 | 0.93 (0.91, 0.95) |
|
| 0.92 (0.90, 0.94) | rs244353 | 53194769 | 0.99 | 0.16 | 1 | 5.75E-14 | 0.92 (0.90, 0.94) | |
| rs244348 | 53196291 | 0.90 | 0.01 | 1 | 7.08E-13 | 0.93 (0.91, 0.94) | |||
|
|
| 1.39E-14 | rs244342 | 53198407 | 0.99 | 0.14 | 1 | 6.73E-14 | 0.92 (0.90, 0.94) |
|
| 0.91 (0.88, 0.93) | rs244338 | 53200418 | 0.99 | 0.15 | 1 | 6.21E-14 | 0.92 (0.90, 0.94) | |
| rs244337 | 53201124 | 0.99 | 0.14 | 0.99 | 6.60E-14 | 0.92 (0.90, 0.94) | |||
|
|
| 0.0177 | rs244336 | 53201502 | 0.90 | 0.01 | 1 | 8.80E-13 | 0.93 (0.91, 0.95) |
|
| 0.95 (0.91, 0.99) | chr17:53205761:D | 53205761 | 0.90 | 0.01 | 0.99 | 9.42E-13 | 0.93 (0.91, 0.95) | |
| rs2628321 | 53205917 | 0.99 | 0.16 | 1 | 5.88E-14 | 0.92 (0.90, 0.94) | |||
| rs2787486 | 53209774 | 0.96 | 8.96E-15 | 0.92 (0.90, 0.94) | |||||
| rs2787481 | 53211110 | 0.90 | 0.01 | 1 | 6.44E-13 | 0.92 (0.91, 0.94) | |||
| rs244315 | 53214654 | 0.90 | 0.02 | 1 | 4.72E-13 | 0.92 (0.91, 0.94) | |||
| rs244317 | 53216985 | 0.90 | 0.02 | 1 | 5.79E-13 | 0.92 (0.91, 0.94) | |||
| rs2628316 | 53219837 | 0.99 | 0.15 | 1 | 6.00E-14 | 0.92 (0.90, 0.94) | |||
| chr17:53221365:I | 53221365 | 0.99 | 0.10 | 0.98 | 9.68E-14 | 0.92 (0.90, 0.94) | |||
| rs244318 | 53221368 | 0.97 | 0.02 | 0.98 | 4.19E-13 | 0.93 (0.91, 0.95) | |||
| rs244319 | 53222874 | 0.98 | 0.04 | 1 | 2.19E-13 | 0.93 (0.91, 0.94) | |||
| rs244320 | 53222920 | 0.98 | 0.04 | 1 | 2.19E-13 | 0.93 (0.91, 0.94) | |||
| rs244321 | 53223998 | 0.98 | 0.05 | 0.99 | 1.72E-13 | 0.92 (0.91, 0.94) | |||
| rs10432032 | 53224088 | 0.98 | 0.04 | 1 | 2.47E-13 | 0.93 (0.91, 0.94) | |||
| rs244322 | 53224759 | 0.98 | 0.05 | 0.99 | 2.03E-13 | 0.93 (0.91, 0.94) | |||
| rs2628315 | 53226622 | 0.99 | 0.10 | 1 | 9.02E-14 | 0.92 (0.90, 0.94) | |||
| rs244294 | 53228543 | 0.98 | 0.01 | 0.99 | 9.28E-13 | 0.93 (0.91, 0.95) |
aP-value, Odds ratio (OR) and 95% Confidence Interval (CI) for association with overall breast cancer risk, Estrogen Positive (ER+) and Estrogen Negative (ER−) disease.
bBuild 37 coordinates on chromosome 17.
cCorrelation (r2) and Relative Likelihood ratio (RL) with respect to lead SNP rs2787486
dIMPUTE2 info score.
eP-value, Odds ratio (OR) and 95% Confidence Interval (CI) for association with overall breast cancer risk.
§/§§Genotyped/Imputed marker.
*Lead SNP.
Figure 2In silico analysis of the 17q22 locus.
(A) Functional annotations using data from the ENCODE project. From top to bottom, epigenetic signals evaluated included DNase clusters, and ENCODE chromatin states (ChromHMM) and histone modifications in HMEC and MCF7 cell lines. The detailed colour scheme in chromatin states is described in the UCSC browser. All tracks were generated by the UCSC genome browser (hg 19). (B) Long-range chromatin interactions. From top to bottom, ChIA-PET data for Pol2 and CTCF in MCF7 cell lines. The ChIA-PET raw data available on GEO under the following accession (GSE63525.K56, GSE33664, GSE39495) were processed with the GenomicRanges package. -RNAseq data from MCF7 and HMEC cell lines. The value of the RNAseq analysis corresponds to the mean RPM value for COX11, TOM1L1 and STXBP4 from 4 HMEC and 19 MCF7 datasets, respectively. The annotation has been obtained through the Bioconductor annotation package TxDb.Hsapiens.UCSC.hg19.knownGene. The tracks have been generated using ggplot2 and ggbio library in R.
Figure 3Functional annotation of the 17q22 locus.
Positions of candidate causal variants are shown as black tick marks in relation to local genes. Epigenomic marks associated with enhancers are shown as described by FANTOM, ENCODE, PreSTIGE and Hnisz et al.18. Enhancers which overlap candidate SNPs are coded in color to match their predicted target gene. The grey shaded region area depicts the region bound by un-excluded variants.