| Literature DB >> 25327703 |
Laura Fejerman1, Nasim Ahmadiyeh2, Donglei Hu1, Scott Huntsman1, Kenneth B Beckman3, Jennifer L Caswell1, Karen Tsung2, Esther M John4, Gabriela Torres-Mejia5, Luis Carvajal-Carmona6, María Magdalena Echeverry7, Anna Marie D Tuazon7, Carolina Ramirez8, Christopher R Gignoux9, Celeste Eng10, Esteban Gonzalez-Burchard10, Brian Henderson11, Loic Le Marchand12, Charles Kooperberg13, Lifang Hou14, Ilir Agalliu15, Peter Kraft16, Sara Lindström16, Eliseo J Perez-Stable1, Christopher A Haiman11, Elad Ziv1.
Abstract
The genetic contributions to breast cancer development among Latinas are not well understood. Here we carry out a genome-wide association study of breast cancer in Latinas and identify a genome-wide significant risk variant, located 5' of the Estrogen Receptor 1 gene (ESR1; 6q25 region). The minor allele for this variant is strongly protective (rs140068132: odds ratio (OR) 0.60, 95% confidence interval (CI) 0.53-0.67, P=9 × 10(-18)), originates from Indigenous Americans and is uncorrelated with previously reported risk variants at 6q25. The association is stronger for oestrogen receptor-negative disease (OR 0.34, 95% CI 0.21-0.54) than oestrogen receptor-positive disease (OR 0.63, 95% CI 0.49-0.80; P heterogeneity=0.01) and is also associated with mammographic breast density, a strong risk factor for breast cancer (P=0.001). rs140068132 is located within several transcription factor-binding sites and electrophoretic mobility shift assays with MCF-7 nuclear protein demonstrate differential binding of the G/A alleles at this locus. These results highlight the importance of conducting research in diverse populations.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25327703 PMCID: PMC4204111 DOI: 10.1038/ncomms6260
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 2Quantile–quantile plot for GWAS of breast cancer in Latinas.
The gray line represents a perfect match between the expected distribution of –log10 P under the uniform and those observed in the present analysis.
Discovery and replication of newly discovered protective variants at 6q25 related to breast cancer risk in Latinas.
| rs140068132 | A/G | 0.60 | 0.49–0.72 | 3 × 10−7 | 9% |
| rs147157845 | C/A | 0.59 | 0.48–0.72 | 1 × 10−7 | 9% |
| rs140068132 | 0.63 | 0.53–0.75 | 3 × 10−7 | 15% | |
| rs147157845 | 0.66 | 0.55–0.78 | 3 × 10−6 | 15% | |
| rs140068132 | 0.54 | 0.41–0.71 | 1 × 10−5 | 10% | |
| rs147157845 | 0.55 | 0.42–0.72 | 2 × 10−5 | 10% | |
| rs140068132 | 0.61 | 0.31–1.22 | 0.16 | 7% | |
| rs147157845 | 0.60 | 0.30–1.19 | 0.15 | 7% | |
| rs140068132 | 0.60 | 0.53–0.67 | 9 × 10−18 | ||
| rs147157845 | 0.61 | 0.54–0.68 | 2 × 10−16 | ||
CI, confidence interval; MAF, minor allele frequency; OR, odds ratio; WHI, Women’s Health Initiative.
*Reference allele/tested allele.
†MAF: tested allele.
Figure 3A regional plot of the −log10 P values for SNPs at 6q25.1.
The SNP with the highest −log10 P value is coloured purple and identified by its rs no. on top of the graph. The two SNPs that were replicated in multiple independent samples are circle-shaped, updated −log10 P values after meta-analysis are X-shaped, and previously reported risk SNPs are triangle-shaped. All other SNPs are represented by crosses and the colours reflect the level of correlation with the SNP with highest −log10 P value. The LD is estimated using data from 1,000 Genomes Project Amerindian populations. In addition, shown are the SNP Build 37 coordinates in megabases (Mb), recombination rates in centimorgans (cM) per megabase (Mb) and the name and location of genes in the UCSC Genome Browser (below).
Association between breast cancer, global Indigenous American (IA) ancestry, local IA ancestry and rs140068132/rs147157845*.
| Global IA ancestry | 0.31 | 0.18–0.54 | 3.47 × 10−5 |
| Global IA ancestry | 0.55 | 0.29–1.01 | 0.056 |
| Local IA at 6q25 | 0.55 | 0.42–0.72 | 1.87 × 10−5 |
| Global IA ancestry | 0.55 | 0.30–1.02 | 0.06 |
| Local IA at 6q25 | 0.68 | 0.50–0.91 | 0.009 |
| rs140068132 | 0.63 | 0.50–0.80 | 1.6 × 10−4 |
CI, confidence interval; IA, Indigenous American; MEC, Multiethnic Cohort; NC-BCFR, Northern California site of the Breast Cancer Family Registry; OR, odds ratio; SFBCS, San Francisco Bay Area Breast Cancer Study.
*These analyses included 1,476 US Latina cases and 1,131 controls from the SFBCS, the NC-BCFR and the MEC with available locus specific ancestry estimates from a previous study.
Association between rs140068132/rs147157845 and breast cancer risk by ER status in US Latinas.
| ER+ | 0.63 | 0.49–0.80 | 1.9 × 10−4 |
| ER− | 0.34 | 0.21–0.54 | 4.7 × 10−6 |
| ER− versus ER+ | 0.53 | 0.32–0.88 | 0.014 |
| ER+ | 0.62 | 0.48–0.79 | 9.3 × 10−5 |
| ER− | 0.34 | 0.21–0.54 | 3.8 × 10−6 |
| ER− versus ER+ | 0.55 | 0.34–0.91 | 0.019 |
CI, confidence interval; ER, oestrogen receptor; ER+, oestrogen receptor-positive; ER−, oestrogen receptor-negative; OR, odds ratio.
*ORs are for the minor alleles (rs140068132 allele G and rs147157845 allele A).
Figure 4Box plot of percent mammographic breast density by genotypes for SNP rs140068132 at 6q25 in 1,113 women (304 cases and 809 controls) from the Mexican study.
The boxes represent the median (black middle line) limited by the 25th (Q1) and 75th (Q3) percentiles. The whiskers are the upper and lower adjacent values, which are the most extreme values within Q3+1.5(Q3−Q1) and Q1−1.5(Q3−Q1), respectively. The black dots represent outliers. N defines the number of individuals within each genotype category.
Figure 5Replication of previously reported associations in Latinas.
Scatter plot of odds ratios previously published (x axis) and in Latinas (y axis). Red dots represent SNPs that were associated at ≤0.05 level of significance in Latinas.