| Literature DB >> 20585626 |
Jirong Long1, Qiuyin Cai, Xiao-Ou Shu, Shimian Qu, Chun Li, Ying Zheng, Kai Gu, Wenjing Wang, Yong-Bing Xiang, Jiarong Cheng, Kexin Chen, Lina Zhang, Hong Zheng, Chen-Yang Shen, Chiun-Sheng Huang, Ming-Feng Hou, Hongbing Shen, Zhibin Hu, Furu Wang, Sandra L Deming, Mark C Kelley, Martha J Shrubsole, Ui Soon Khoo, Kelvin Y K Chan, Sum Yin Chan, Christopher A Haiman, Brian E Henderson, Loic Le Marchand, Motoki Iwasaki, Yoshio Kasuga, Shoichiro Tsugane, Keitaro Matsuo, Kazuo Tajima, Hiroji Iwata, Bo Huang, Jiajun Shi, Guoliang Li, Wanqing Wen, Yu-Tang Gao, Wei Lu, Wei Zheng.
Abstract
Genetic factors play an important role in the etiology of breast cancer. We carried out a multi-stage genome-wide association (GWA) study in over 28,000 cases and controls recruited from 12 studies conducted in Asian and European American women to identify genetic susceptibility loci for breast cancer. After analyzing 684,457 SNPs in 2,073 cases and 2,084 controls in Chinese women, we evaluated 53 SNPs for fast-track replication in an independent set of 4,425 cases and 1,915 controls of Chinese origin. Four replicated SNPs were further investigated in an independent set of 6,173 cases and 6,340 controls from seven other studies conducted in Asian women. SNP rs4784227 was consistently associated with breast cancer risk across all studies with adjusted odds ratios (95% confidence intervals) of 1.25 (1.20-1.31) per allele (P = 3.2 x 10(-25)) in the pooled analysis of samples from all Asian samples. This SNP was also associated with breast cancer risk among European Americans (per allele OR = 1.19, 95% CI = 1.09-1.31, P = 1.3 x 10(-4), 2,797 cases and 2,662 controls). SNP rs4784227 is located at 16q12.1, a region identified previously for breast cancer risk among Europeans. The association of this SNP with breast cancer risk remained highly statistically significant in Asians after adjusting for previously-reported SNPs in this region. In vitro experiments using both luciferase reporter and electrophoretic mobility shift assays demonstrated functional significance of this SNP. These results provide strong evidence implicating rs4784227 as a functional causal variant for breast cancer in the locus 16q12.1 and demonstrate the utility of conducting genetic association studies in populations with different genetic architectures.Entities:
Mesh:
Year: 2010 PMID: 20585626 PMCID: PMC2891809 DOI: 10.1371/journal.pgen.1001002
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Characteristics of study participants and number of SNPs analyzed in each stage.
| Cases | Controls | |||
| Study population | N | Age | N | Age |
| Stage I (684,457 SNPs) | ||||
| Shanghai – SBCS (Chinese) | 2,073 | 49.3±8.3 | 2,084 | 49.4±8.5 |
| Stage II (53 SNPs) | ||||
| Stage II total | 4,425 | 53.9±10.2 | 1,915 | 52.8±9.2 |
| Shanghai -SBCS (Chinese) | 972 | 50.4±8.3 | 1,001 | 50.9±9.3 |
| Shanghai - SBCSS (Chinese) | 3,453 | 54.9±10.5 | ||
| Shanghai - SECS (Chinese) | 914 | 54.9±8.5 | ||
| Stage III (4 SNPs) | ||||
| Stage III total | 6,173 | 52.6±11.5 | 6,340 | 51.2±10.9 |
| Tianjin (Chinese) | 1,532 | 51.7±11.4 | 1,583 | 51.9±10.5 |
| Nanjing (Chinese) | 1,446 | 51.5±11.4 | 1,439 | 51.3±11.2 |
| Taiwan (Chinese) | 1,066 | 51.5±10.7 | 1,065 | 47.5±10.1 |
| Hong Kong (Chinese) | 517 | 45.8±9.5 | 651 | 45.6±10.3 |
| Nagoya - Japan (Japanese) | 644 | 51.4±11.0 | 644 | 51.1±10.9 |
| Nagano - Japan (Japanese) | 403 | 53.7±10.5 | 403 | 53.9±10.2 |
| Hawaii - MEC (Japanese) | 565 | 65.2±8.4 | 555 | 60.4±8.4 |
| European Americans | ||||
| NBHS | 1,652 | 54.9±10.2 | 1,520 | 52.2±11.0 |
| CGEMS | 1,145 | 1,142 | ||
| Total | 15,468 | 52.5±10.7 | 13,001 | 51.2±10.2 |
| Chinese | 11,059 | 51.8±10.3 | 8,737 | 50.4±10.1 |
| Japanese | 1,612 | 56.8±11.8 | 1,602 | 55.0±10.7 |
| European Americans | 2,797 | 2,662 | ||
a Selected from SNPs included in the Affymetrix 6.0 SNP array with MAF≥1%, call rate≥95%, and QC consistency.
b With the exception of studies in Nanjing (2 SNPs), Tianjin (3 SNPs) and Nagoya Japan (1 SNP), NBHS (3 SNPs), CGEMS (2 SNPs genotyped and 2 SNPs imputed), four SNPs were analyzed in all other studies.
c Mean ± SD.
Results from Stages I to III for rs4784227 and Stages I and II for other three SNPs that showed promising associations with breast cancer risk in Stage II.
| OR(95%CI) | |||||||||
| SNP (chr) | Allele | Stage | No of cases | No of controls | Frequency | Heter | Homo | Per allele | P for trend |
| rs10479046 (5q31.1) | C/T | I | 2,071 | 2,083 | 0.73/0.70 | 1.22(0.96–1.55) | 1.42(1.12–1.80) | 1.18(1.07–1.30) | 7.2×10−4 |
| II | 4,320 | 1,854 | 0.71/0.70 | 1.07(0.87–1.30) | 1.15(0.94–1.40) | 1.07(0.99–1.17) | 0.10 | ||
| I & II | 6,391 | 3,937 | 0.72/0.70 | 1.09(0.94–1.27) | 1.22(1.06–1.42) | 1.11(1.05–1.18) | 7.5×10−4 | ||
| rs3829849 (9q33.3) | T/C | I | 2,067 | 2,081 | 0.11/0.09 | 1.31(1.12–1.54) | 1.15(0.66–1.99) | 1.25(1.08–1.44) | 0.002 |
| II | 4,331 | 1,871 | 0.10/0.09 | 1.19(1.03–1.38) | 2.10(1.06–4.18) | 1.23(1.08–1.41) | 0.003 | ||
| I & II | 6,398 | 3,952 | 0.11/0.09 | 1.24(1.11–1.38) | 1.34(0.89–2.01) | 1.22(1.11–1.34) | 4.4×10−5 | ||
| rs7966820 (12q24.1) | T/C | I | 2,063 | 2,074 | 0.15/0.13 | 1.28(1.10–1.48) | 1.33(0.86–2.05) | 1.24(1.09–1.40) | 7.8×10−4 |
| II | 4,351 | 1,856 | 0.15/0.14 | 1.00(0.88–1.14) | 2.00(1.28–3.12) | 1.10(0.98–1.23) | 0.10 | ||
| I & II | 6,414 | 3,930 | 0.15/0.13 | 1.12(1.02–1.23) | 1.60(1.19–2.16) | 1.16(1.07–1.26) | 4.0×10−4 | ||
| rs4784227 (16q12.1) | T/C | I | 2,066 | 2,078 | 0.29/0.25 | 1.18(1.04–1.34) | 1.43(1.13–1.82) | 1.19(1.08–1.31) | 4.2×10−4 |
| II | 4,280 | 1,843 | 0.28/0.23 | 1.32(1.18–1.48) | 1.62(1.29–2.05) | 1.30(1.19–1.42) | 1.3×10−8 | ||
| I & II | 6,346 | 3,921 | 0.28/0.24 | 1.24(1.14–1.35) | 1.50(1.28–1.77) | 1.23(1.16–1.31) | 2.1×10−10 | ||
| I, II & III | 12,336 | 10,140 | 0.29/0.24 | 1.28(1.21–1.35) | 1.52(1.37–1.69) | 1.25(1.20–1.31) | 3.2×10−25 | ||
a Risk allele/reference allele.
b Risk allele frequency in cases/controls.
c In Stage I, adjusted for age and the first two principal components for population structure, in Stage II adjusted for age.
d SNP rs4784227 was replicated in Stage III, and combined results from all three stages are presented.
Figure 1Per-allele OR for rs4784227 in association with breast cancer risk by study and ethnic groups.
Figure 2SNP association at 16q12 with breast cancer risk.
In SBCS (A) and CGEMS (B). Top panel: Results (-log10P) are shown for the associations of breast cancer risk with directly genotyped (diamonds) and imputed (circles) SNPs located in the TOX3 and LOC643714 genes. P-values for rs4784227 are shown in red for Stage I data and in blue for the combined data. The three previously-reported SNPs (rs8051542, rs12443621, and rs3803662) are highlighted in black. Middle panel: Genomic view at 16q12.1. Gene locations are from the March 2006 UCSC genome browser assembly. Bottom panel: Estimates of pairwise LD (r2) for common SNPs (with MAF≥5%) from HapMap release 23a in the region from 10 kb downstream of rs8051542 to 10 kb upstream of rs4784227.
Linkage disequilibrium patterns among rs4784227 and the three previously-reported SNPs in 16q12.1.
| SNPs | rs8051542 | rs12443621 | rs3803662 | rs4784227 | |
| Chinese | rs8051542 | 0.1 | 0.08 | 0.37 | |
| rs12443621 | 0.77 | 0.04 | 0.07 | ||
| rs3803662 | 0.81 | 0.25 | 0.14 | ||
| rs4784227 | 0.76 | 0.52 | 0.87 | ||
| Caucasians | |||||
| rs8051542 | 0.01 | 0.13 | 0.12 | ||
| rs12443621 | 0.13 | 0.3 | 0.3 | ||
| rs3803662 | 0.52 | 0.92 | 0.86 | ||
| rs4784227 | 0.54 | 1 | 1 | ||
D′: Lower left triangle.
r2: Upper right triangle.
Caucasians: European samples included in the HapMap project.
Chinese: subjects in Stages I and II.
Association of rs4784227 and the three previously-reported SNPs at 16q12with breast cancer risk among Chinese women in Stages I and II.
| OR(95%CI) | ||||||||
| Tested SNP | Adjusted SNPs | No of cases | No of controls | Frequency | Heter | Homo | Per allele | P for trend |
| rs4784227 | None | 6,346 | 3,921 | 0.28/0.24 | 1.24(1.14–1.35) | 1.50(1.28–1.77) | 1.23(1.16–1.31) | 2.1×10−10 |
| rs8051542 | None | 6,158 | 3,658 | 0.20/0.18 | 1.10(1.01–1.21) | 1.36(1.09–1.69) | 1.13(1.05–1.21) | 0.002 |
| rs12443621 | None | 2,954 | 2,997 | 0.57/0.57 | 0.99(0.86–1.14) | 1.02(0.88–1.18) | 1.01(0.94–1.09) | 0.78 |
| rs3803662 | None | 6,345 | 3,795 | 0.68/0.65 | 1.26(1.10–1.44) | 1.34(1.17–1.53) | 1.12(1.06–1.19) | 1.7×10−4 |
| rs4784227 | rs8051542 | 6,082 | 3,614 | 0.28/0.24 | 1.27(1.14–1.40) | 1.57(1.29–1.91) | 1.26(1.16–1.37) | 9.5×10−8 |
| rs4784227 | rs12443621 | 2,920 | 2,961 | 0.29/0.24 | 1.32(1.18–1.47) | 1.63(1.32–2.01) | 1.29(1.19–1.41) | 3.1×10−9 |
| rs4784227 | rs3803662 | 6,221 | 3,749 | 0.28/0.25 | 1.20(1.10–1.32) | 1.41(1.19–1.68) | 1.20(1.11–1.28) | 7.0×10−7 |
| rs4784227 | all 3 SNPs | 2,702 | 2,646 | 0.29/0.24 | 1.31(1.14–1.50) | 1.60(1.23–2.08) | 1.28(1.15–1.44) | 1.6×10−5 |
| rs8051542 | rs4784227 | 6,082 | 3,614 | 0.20/0.18 | 0.93(0.84–1.04) | 0.97(0.76–1.25) | 0.95(0.87–1.05) | 0.33 |
| rs12443621 | rs4784227 | 2,920 | 2,961 | 0.57/0.57 | 0.92(0.80–1.07) | 0.89(0.77–1.04) | 0.95(0.88–1.02) | 0.16 |
| rs3803662 | rs4784227 | 6,221 | 3,749 | 0.68/0.65 | 1.17(1.03–1.35) | 1.17(1.01–1.35) | 1.05(0.99–1.13) | 0.12 |
a Risk allele frequency in cases/controls. Risk/reference alleles (based on forward strand) are T/C for rs8051542 and rs4784227 and A/G for rs12443621 and rs3803662.
Association of rs4784227 and the three previously-reported SNPs at 16q12with breast cancer risk among European American women.
| Tested SNP | Study | SNPs Adjusted | No of cases | No of controls | Frequency | OR (95% CI) | P for trend |
| rs4784227 | CGEMS | None | 1145 | 1142 | 0.29/0.25 | 1.17(1.03–1.34) | 0.016 |
| NBHS | None | 1357 | 1148 | 0.28/0.25 | 1.21(1.07–1.37) | 0.003 | |
| rs3803662 | CGEMS | None | 1145 | 1142 | 0.3/0.27 | 1.13(0.99–1.28) | 0.06 |
| NBHS | None | 1615 | 1467 | 0.31/0.27 | 1.22(1.09–1.36) | 4.0×10−4 | |
| rs8051542 | NBHS | None | 1587 | 1439 | 0.46/0.44 | 1.07(0.97–1.19) | 0.18 |
| rs12443621 | CGEMS | None | 1145 | 1142 | 0.50/0.49 | 1.02(0.91–1.15) | 0.70 |
| rs4784227 | NBHS | rs8051542 | 1263 | 1080 | 0.28/0.25 | 1.22(1.06–1.40) | 0.007 |
| rs4784227 | CGEMS | rs12443621 | 1145 | 1142 | 0.29/0.25 | 1.25(1.06–1.47) | 0.007 |
| rs4784227 | Both | rs3803662 | 2431 | 2243 | 0.29/0.25 | 1.22(0.94–1.59) | 0.13 |
| rs8051542 | NBHS | rs4784227 | 1263 | 1080 | 0.45/0.44 | 0.98(0.86–1.11) | 0.74 |
| rs12443621 | CGEMS | rs4784227 | 1145 | 1142 | 0.50/0.49 | 0.91(0.79–1.05) | 0.19 |
| rs3803662 | Both | rs4784227 | 2431 | 2243 | 0.31/0.27 | 0.97(0.75–1.25) | 0.82 |
a In CGEMS, data on rs8051542 was not available, and In NBHS data on for rs12443621 was not available.
b Risk allele frequency in cases/controls. Risk/reference alleles (based on forward strand) are T/C for rs8051542 and rs4784227 and A/G for rs12443621 and rs3803662.
Figure 3In vitro functional characterization of SNP rs4784227.
(A) Effect of rs4784227 on luciferase reporter activity. HEK 293, MCF10A, MCF-7, and MDA-231 cells were transiently transfected with pGL3 promoter vector and the constructs carrying the reference allele (C) and risk allele (T) of rs4784227, respectively. Relative luciferase activities are shown as mean ± SD of four experiments (relative to empty vector). Statistical analysis was done using Student's t-test comparing C and T alleles (* P<0.05, ** P<0.01, n = 4). (B) Electrophoretic mobility shift assays. Nuclear protein extracts from MCF-7 (top panel), MCF10A (middle panel), and MDA-231 (bottom panel) cells were incubated with biotin-labeled probes corresponding to reference allele C (lanes 1–5) or the risk allele T (lanes 6–10) of rs4784227 in the absence or presence of competitors. Lanes 1 and 6, no nuclear extracts; lanes 2 and 7, unlabeled competitor in 200-fold molar excess; lanes 3 and 8 (5 mM MgCl2), lanes 4 and 9 (2.5 mM MgCl2), and lanes 5 and 10 (1.25 mM MgCl2), no competitor. I: nonspecific DNA-protein complex bands from MCF-7, MCF10A, and MDA-231 cells; II: specific DNA-protein complex bands; III: free biotin-labeled probes.