| Literature DB >> 22258532 |
Yadav Sapkota1, Paula Robson, Raymond Lai, Carol E Cass, John R Mackey, Sambasivarao Damaraju.
Abstract
Genome-wide association studies for breast cancer have identified over 40 single-nucleotide polymorphisms (SNPs), a subset of which remains statistically significant after genome-wide correction. Improved strategies for mining of genome-wide association data have been suggested to address heritable component of genetic risk in breast cancer. In this study, we attempted a two-stage association design using markers from a genome-wide study (stage 1, Affymetrix Human SNP 6.0 array, cases=302, controls=321). We restricted our analysis to DNA repair/modifications/metabolism pathway related gene polymorphisms for their obvious role in carcinogenesis in general and for their known protein-protein interactions vis-à-vis, potential epistatic effects. We selected 22 SNPs based on linkage disequilibrium patterns and high statistical significance. Genotyping assays in an independent replication study of 1178 cases and 1314 controls were attempted using Sequenom iPLEX Gold platform (stage 2). Six SNPs (rs8094493, rs4041245, rs7614, rs13250873, rs1556459 and rs2297381) showed consistent and statistically significant associations with breast cancer risk in both stages, with allelic odds ratios (and P-values) of 0.85 (0.0021), 0.86 (0.0026), 0.86 (0.0041), 1.17 (0.0043), 1.20 (0.0103) and 1.13 (0.0154), respectively, in combined analysis (N=3115). Of these, three polymorphisms were located in methyl-CpG-binding domain protein 2 gene regions and were in strong linkage disequilibrium. The remaining three SNPs were in proximity to RAD21 homolog (S. pombe), O-6-methylguanine-DNA methyltransferase and RNA polymerase II-associated protein 1. The identified markers may be relevant to breast cancer susceptibility in populations if these findings are confirmed in independent cohorts.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22258532 PMCID: PMC3355265 DOI: 10.1038/ejhg.2011.273
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Characteristics of the SNPs used in the study
| rs17622933 (4p15.2) | 24 790 680 | Upstream | 204 496 | T | 0.30 | 0.06 | 1.00 | 0.31 | 0.52 | 1.00 | 0.31 | 0.89 | 1.00 | |
| rs7200108 (16p13.12) | 13 438 884 | Upstream | 575 129 | G | 0.12 | 0.61 | 1.00 | 0.12 | 0.94 | 1.00 | 0.12 | 0.66 | 1.00 | |
| rs7317643 (13q33.3) | 108 536 028 | Downstream | 323 765 | A | 0.12 | 0.95 | 1.00 | 0.14 | 0.53 | 1.00 | 0.13 | 0.52 | 1.00 | |
| rs1646807 (18q21.2) | 51 388 197 | Downstream | 289 773 | T | 0.17 | 0.80 | 0.99 | 0.19 | 1.00 | 1.00 | 0.18 | 0.91 | 1.00 | |
| rs4041245 (18q21.2) | 51 685 525 | Intron | 0 | G | 0.42 | 0.68 | 1.00 | 0.41 | 0.01 | 1.00 | 0.41 | 0.04 | 1.00 | |
| rs656923 (18q21.2) | 51 701 796 | Intron | 0 | G | 0.19 | 0.71 | 1.00 | 0.20 | 0.74 | 1.00 | 0.20 | 0.60 | 1.00 | |
| rs7239408 (18q21.2) | 51 432 800 | Downstream | 245 170 | A | 0.23 | 0.61 | 1.00 | 0.24 | 0.51 | 1.00 | 0.24 | 0.43 | 1.00 | |
| rs7614 (18q21.2) | 51 681 244 | 3′ UTR | 0 | C | 0.42 | 0.27 | 1.00 | 0.41 | 0.03 | 1.00 | 0.41 | 0.17 | 1.00 | |
| rs8094493 (18q21.2) | 51 700 391 | Intron | 0 | G | 0.42 | 0.68 | 1.00 | 0.41 | 0.02 | 1.00 | 0.41 | 0.05 | 1.00 | |
| rs904276 (18q21.2) | 51 434 340 | Downstream | 243 630 | C | 0.22 | 0.47 | 1.00 | 0.23 | 0.39 | 1.00 | 0.23 | 0.29 | 1.00 | |
| rs2044760 (2q23.1) | 148 989 731 | Intron | 0 | T | 0.39 | 0.23 | 1.00 | 0.35 | 0.21 | 1.00 | 0.36 | 0.51 | 1.00 | |
| rs1556459 (10q26.3) | 130 810 711 | Upstream | 454 766 | C | 0.16 | 0.80 | 1.00 | 0.15 | 0.07 | 1.00 | 0.15 | 0.12 | 1.00 | |
| rs3996018 (3q13.13) | 108 163 202 | Intron | 0 | G | 0.24 | 0.43 | 1.00 | 0.23 | 0.28 | 1.00 | 0.23 | 0.18 | 1.00 | |
| rs13250873 (8q24.11) | 117 806 169 | Downstream | 52 004 | G | 0.31 | 0.52 | 1.00 | 0.32 | 0.27 | 1.00 | 0.32 | 0.20 | 1.00 | |
| rs2297381 (15q15.1) | 41 827 655 | Intron | 0 | G | 0.50 | 0.84 | 1.00 | 0.48 | 0.67 | 1.00 | 0.48 | 0.63 | 1.00 | |
| rs6893184 (5q23.1) | 118 730 867 | Downstream | 574 | G | 0.34 | 0.63 | 1.00 | 0.33 | 0.0017 | 1.00 | 0.33 | 0.00240 | 1.00 | |
| rs7721752 (5q23.1) | 118 746 110 | Downstream | 15 817 | G | 0.33 | 0.77 | 1.00 | 0.30 | 0.01 | 1.00 | 0.31 | 0.04 | 1.00 | |
| rs6795465 (3q28) | 189 537 521 | Intron | 0 | C | 0.12 | 0.42 | 1.00 | 0.13 | 0.27 | 0.99 | 0.13 | 0.17 | 0.99 | |
| rs7636114 (3q28) | 189 088 705 | Downstream | 45 612 | C | 0.11 | 0.80 | 1.00 | 0.11 | 0.14 | 1.00 | 0.11 | 0.21 | 1.00 | |
| rs7700025 (4q34.3) | 177 814 863 | Upstream | 100 968 | G | 0.28 | 0.50 | 1.00 | 0.30 | 0.79 | 1.00 | 0.30 | 0.91 | 1.00 | |
| rs9992272 (4q34.3) | 177 737 136 | Upstream | 23 241 | C | 0.16 | 0.71 | 1.00 | 0.17 | 0.99 | 0.98 | 0.16 | 0.93 | 0.99 | |
Abbreviation: Chr., chromosome.
From NCBI human genome build 37; DHX15, DEAH (Asp–Glu–Ala–His) box polypeptide 15; ERCC4, excision repair cross-complementing rodent repair deficiency, complementation group 4; LIG4, ligase IV, DNA, ATP-dependent; MBD2, methyl-CpG-binding domain protein 2; MBD5, methyl-CpG-binding domain protein 5; MGMT, O-6-methylguanine-DNA methyltransferase; MYH15, myosin, heavy chain 15; RAD21, RAD21 homolog (S. pombe); RPAP1, RNA polymerase II-associated protein 1; TNFAIP8, tumor necrosis factor, alpha-induced protein 8; TP63, tumor protein 63; TPRG1, tumor protein p63 regulated 1; VEGFC, vascular endothelial growth factor C.
302 cases and 321 controls.
1178 cases and 1314 controls.
1480 cases and 1635 controls; MAF, combined minor allele frequency in both cases and controls.
Combined SNP call rate in both cases and controls.
Six SNPs with the strongest and consistent associations with breast cancer susceptibility across stages 1, 2 and in combined analysis
| P | P | P | |||||||
|---|---|---|---|---|---|---|---|---|---|
| rs8094493 | G (minor allele) | 0.68 (0.54, 0.85) | 0.0009 | 0.90 (0.81, 1.01) | 0.0773 | 0.85 (0.77, 0.94) | 0.045 | 0.038 | |
| GT | 0.66 (0.46, 0.94) | 0.0044 | 0.80 (0.67, 0.95) | 0.0410 | 0.77 (0.66, 0.90) | 0.0019 | ND | ND | |
| GG | 0.48 (0.31, 0.77) | 0.86 (0.68, 1.09) | 0.76 (0.61, 0.94) | ND | ND | ||||
| rs4041245 | G (minor allele) | 0.68 (0.54, 0.85) | 0.0009 | 0.91 (0.81, 1.02) | 0.0893 | 0.86 (0.77, 0.95) | 0.027 | 0.048 | |
| GA | 0.66 (0.46, 0.94) | 0.0044 | 0.79 (0.67, 0.94) | 0.0340 | 0.76 (0.65, 0.89) | 0.0018 | ND | ND | |
| GG | 0.48 (0.31, 0.77) | 0.87 (0.69, 1.10) | 0.77 (0.62, 0.95) | ND | ND | ||||
| rs7614 | C (minor allele) | 0.67 (0.54, 0.84) | 0.0006 | 0.92 (0.82, 1.03) | 0.1356 | 0.86 (0.78, 0.95) | 0.029 | 0.069 | |
| CT | 0.70 (0.49, 0.99) | 0.0038 | 0.81 (0.68, 0.97) | 0.0690 | 0.79 (0.67, 0.92) | 0.0053 | ND | ND | |
| CC | 0.47 (0.30, 0.74) | 0.89 (0.70, 1.12) | 0.77 (0.63, 0.95) | ND | ND | ||||
| rs13250873 | G (minor allele) | 1.29 (1.01, 1.64) | 0.0383 | 1.14 (1.01, 1.28) | 0.0306 | 1.17 (1.05, 1.30) | 0.023 | 0.07 | |
| GA | 1.34 (0.96, 1.87) | 0.1100 | 1.10 (0.93, 1.30) | 0.0910 | 1.14 (0.98, 1.33) | 0.0190 | ND | ND | |
| GG | 1.56 (0.91, 2.68) | 1.33 (1.02, 1.73) | 1.37 (1.08, 1.74) | ND | ND | ||||
| rs1556459 | C (minor allele) | 1.50 (1.10, 2.04) | 0.0102 | 1.13 (0.97, 1.32) | 0.1151 | 1.20 (1.04, 1.37) | 0.043 | 0.161 | |
| CT | 1.49 (1.03, 2.14) | 0.0390 | 1.05 (0.88, 1.26) | 0.0740 | 1.13 (0.96, 1.32) | 0.0120 | ND | ND | |
| CC | 2.21 (0.80, 6.07) | 1.89 (1.07, 3.32) | 1.96 (1.20, 3.20) | ND | ND | ||||
| rs2297381 | G (minor allele) | 1.27 (1.01, 1.58) | 0.0368 | 1.10 (0.98, 1.23) | 0.0986 | 1.13 (1.02, 1.25) | 0.054 | 0.234 | |
| GA | 1.28 (0.87, 1.89) | 0.1100 | 1.02 (0.85, 1.24) | 0.1800 | 1.07 (0.90, 1.27) | 0.0430 | ND | ND | |
| GG | 1.60 (1.03, 2.50) | 1.21 (0.97, 1.50) | 1.28 (1.05, 1.55) | ND | ND | ||||
Abbreviations: CI, confidence interval; FDR, false discovery rate; ND, not determined; OR, odds ratio.
302 cases and 321 controls.
1178 cases and 1314 controls.
1480 cases and 1635 controls.
Individual P-values across stages and combined analysis are indicated. P-values for minor allele were calculated using the correlation/trend test with one df, whereas the P-values for the heterozygote and variant genotypes (codominant genotypic model) were calculated using unconditional logistic regression with two df; FDR for observed associations in combined analysis using allelic association tests.
1000-times permutation P-value for observed associations in combined analysis using allelic association tests; P-values shown in bold indicate the combined analysis from allelic association tests.
Subgroup analyses based on family history of breast cancer, menopausal status and luminal A tumors
| SNPs | P | P | P | P | P | P | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs8094493 | 0.84 (0.74, 0.96) | 0.011 | 0.86 (0.76, 0.97) | 0.016 | 0.86 (0.75, 0.99) | 0.031 | 0.85 (0.75, 0.96) | 0.009 | 0.88 (0.78, 0.99) | 0.039 | 0.80 (0.68, 0.94) | 0.006 |
| rs4041245 | 0.85 (0.74, 0.97) | 0.015 | 0.86 (0.76, 0.97) | 0.016 | 0.87 (0.76, 0.99) | 0.041 | 0.85 (0.75, 0.96) | 0.009 | 0.88 (0.78, 1.00) | 0.043 | 0.80 (0.68, 0.94) | 0.006 |
| rs7614 | 0.85 (0.75, 0.97) | 0.018 | 0.87 (0.77, 0.98) | 0.027 | 0.88 (0.77, 1.01) | 0.079 | 0.85 (0.75, 0.96) | 0.010 | 0.89 (0.79, 1.01) | 0.064 | 0.80 (0.68, 0.94) | 0.007 |
| rs13250873 | 1.22 (1.06, 1.40) | 0.005 | 1.12 (0.98, 1.27) | 0.085 | 1.18 (1.02, 1.36) | 0.024 | 1.15 (1.01, 1.31) | 0.029 | 1.11 (0.97, 1.26) | 0.117 | 1.20 (1.02, 1.41) | 0.030 |
| rs1556459 | 1.27 (1.07, 1.51) | 0.008 | 1.16 (0.99, 1.37) | 0.074 | 1.20 (1.00, 1.45) | 0.047 | 1.19 (1.01, 1.40) | 0.037 | 1.17 (0.99, 1.39) | 0.062 | 1.20 (0.97, 1.48) | 0.092 |
| rs2297381 | 1.22 (1.07, 1.39) | 0.003 | 1.09 (0.97, 1.22) | 0.165 | 1.19 (1.04, 1.36) | 0.012 | 1.13 (1.00, 1.27) | 0.044 | 1.15 (1.02, 1.30) | 0.024 | 1.17 (1.00, 1.36) | 0.050 |
Abbreviations: CI, confidence interval; OR, odds ratio.
623 premenopausal women.
829 postmenopausal women.
575 cases with family history of breast cancer in their first- or second-degree relatives.
808 cases without family history of breast cancer.
761 cases with luminal A tumors.
397 cases with non-luminal A tumors; associations of SNPs with each of these subgroups were assessed using common 1635 controls and expressed in terms of OR and 95% CI; the P-values were obtained from a correlation/trend test with one df
Figure 1Pairwise LD profiles between SNPs from MBD2 gene region. (a) LD profile of whole MBD2 isoform 1 spanning ∼70.58 kbp. The gene is in reverse orientation (3′–5′) on chromosome 18q arm. Five SNPs (three from our study that shown in black and two from Zhu et al[35] that are shown in red) in MBD2 gene regions are shown based on their relative position on HapMap CEU data set (phase 1 and 2-full data set). LD blocks were defined using ‘CI' method as explained by Gabriel et al.[42] D′ values are given for LD between the markers. The darker the cell, the greater the D′ value between the SNPs. (b) LD profile for three MBD2 SNPs from our study based on our study population.
Haplotypes for three MBD2 SNPs and their associations with breast cancer risk
| P | ||||||||
|---|---|---|---|---|---|---|---|---|
| T | A | T | 0.601 | 0.5624 | 0.5823 | 1 | — | — |
| C | G | G | 0.385 | 0.4232 | 0.4053 | 0.86 | 0.77–0.95 | |
| * | * | * | Rare (<1%) | Rare (<1%) | 0.0124 | 1.11 | 0.70–1.78 | 0.66 |
Denotes alleles forming the rare haplotypes (<1% frequency).
P-value shown in bold indicates statistically significant association at alpha, 0.05.