Literature DB >> 34069208

Gene-Environment Interactions Relevant to Estrogen and Risk of Breast Cancer: Can Gene-Environment Interactions Be Detected Only among Candidate SNPs from Genome-Wide Association Studies?

JooYong Park1,2, Ji-Yeob Choi1,2,3,4, Jaesung Choi3, Seokang Chung1, Nan Song5, Sue K Park1,4,6, Wonshik Han4,7, Dong-Young Noh4,7, Sei-Hyun Ahn8, Jong Won Lee8, Mi Kyung Kim9, Sun Ha Jee10, Wanqing Wen11, Manjeet K Bolla12, Qin Wang12, Joe Dennis12, Kyriaki Michailidou12,13,14, Mitul Shah15, Don M Conroy15, Patricia A Harrington15, Rebecca Mayes15, Kamila Czene16, Per Hall16,17, Lauren R Teras18, Alpa V Patel19, Fergus J Couch19, Janet E Olson20, Elinor J Sawyer21, Rebecca Roylance22, Stig E Bojesen23,24,25, Henrik Flyger26, Diether Lambrechts27,28, Adinda Baten29, Keitaro Matsuo30,31, Hidemi Ito31, Pascal Guénel32, Thérèse Truong32, Renske Keeman33, Marjanka K Schmidt33,34, Anna H Wu35, Chiu-Chen Tseng35, Angela Cox36, Simon S Cross37, Irene L Andrulis38,39, John L Hopper40, Melissa C Southey41,42,43, Pei-Ei Wu44, Chen-Yang Shen45,46, Peter A Fasching47,48, Arif B Ekici49, Kenneth Muir50, Artitaya Lophatananon50, Hermann Brenner51,52,53, Volker Arndt51, Michael E Jones54, Anthony J Swerdlow54,55, Reiner Hoppe56,57, Yon-Dschun Ko58, Mikael Hartman59,60,61, Jingmei Li62, Arto Mannermaa63,64,65, Jaana M Hartikainen63,64, Javier Benitez66,67, Anna González-Neira67, Christopher A Haiman35, Thilo Dörk68, Natalia V Bogdanova68,69,70, Soo Hwang Teo71,72, Nur Aishah Mohd Taib73, Olivia Fletcher74, Nichola Johnson74, Mervi Grip75, Robert Winqvist76,77, Carl Blomqvist78,79, Heli Nevanlinna80, Annika Lindblom81,82, Camilla Wendt83, Vessela N Kristensen84,85, Rob A E M Tollenaar86, Bernadette A M Heemskerk-Gerritsen87, Paolo Radice88, Bernardo Bonanni89, Ute Hamann90, Mehdi Manoochehri90, James V Lacey91,92, Maria Elena Martinez93,94, Alison M Dunning15, Paul D P Pharoah12,15, Douglas F Easton12,15, Keun-Young Yoo6, Daehee Kang1,4,6.   

Abstract

In this study we aim to examine gene-environment interactions (GxEs) between genes involved with estrogen metabolism and environmental factors related to estrogen exposure. GxE analyses were conducted with 1970 Korean breast cancer cases and 2052 controls in the case-control study, the Seoul Breast Cancer Study (SEBCS). A total of 11,555 SNPs from the 137 candidate genes were included in the GxE analyses with eight established environmental factors. A replication test was conducted by using an independent population from the Breast Cancer Association Consortium (BCAC), with 62,485 Europeans and 9047 Asians. The GxE tests were performed by using two-step methods in GxEScan software. Two interactions were found in the SEBCS. The first interaction was shown between rs13035764 of NCOA1 and age at menarche in the GE|2df model (p-2df = 1.2 × 10-3). The age at menarche before 14 years old was associated with the high risk of breast cancer, and the risk was higher when subjects had homozygous minor allele G. The second GxE was shown between rs851998 near ESR1 and height in the GE|2df model (p-2df = 1.1 × 10-4). Height taller than 160 cm was associated with a high risk of breast cancer, and the risk increased when the minor allele was added. The findings were not replicated in the BCAC. These results would suggest specificity in Koreans for breast cancer risk.

Entities:  

Keywords:  breast cancer; estrogen; gene-environment interaction

Year:  2021        PMID: 34069208      PMCID: PMC8156547          DOI: 10.3390/cancers13102370

Source DB:  PubMed          Journal:  Cancers (Basel)        ISSN: 2072-6694            Impact factor:   6.639


1. Introduction

Breast cancer is the most common cancer in females worldwide, with an estimated 2.3 million incident cases globally in 2020. Moreover, breast cancer ranks as the most common cancer for women in 159 countries, and the most common cause of cancer deaths in 110 countries [1]. In Korea, the incidence rate of breast cancer, as well as breast cancer mortality, has been persistently increasing [2]. Numerous epidemiological studies have been performed to identify risk factors for breast cancer and many non-genetic factors, referred to as environmental factors, have been established [3,4,5]. Furthermore, many genome-wide association studies (GWAS) have been conducted, which have provided significant opportunities to discover the potential effects of common genetic factors on complex diseases [6]. To date, more than 200 common susceptibility loci for breast cancer have been identified through GWAS [7,8,9,10]. Although BRCA1 and BRCA2 are the most well-known risk genes associated with about a 20-fold increased breast cancer risk, they only account for 20% of familial breast cancer due to the low frequency of mutations [11,12]. Even if all the common genetic factors are taken together, they are estimated to explain only about 30% of the familial risk [7]. Consequently, further research is needed to identify the missing heritability [13], and to better understand how genetic factors combine with environmental factors to influence risk [14]. Currently, a few alternatives have been proposed, and one of them is a gene-environment interaction (GxE) analysis. By accounting for interactions between genetic and environmental factors, better estimates of the population-attributable risk or effects in specific subgroups exposed by certain environments can be obtained [15,16]. To date, many GxE studies for breast cancer risk have been conducted, but few studies have shown significant results after accounting for multiple testing [14,17,18,19,20,21,22,23,24]. Most of these studies used the standard case-control analysis of GxE interaction based on logistic regression and added interaction terms and tended to have poor statistical power. Some methods have been developed and proposed [25], and two-step approaches have been reported to have greater power than other approaches [13]. However, previous studies have not found, or have only found a few interactions by using SNPs which were discovered from GWAS and have been conducted, for the most part, within European populations [14,17,18,19,20,21,22,23,24]. In this study we focused on estrogen-related lifestyle factors that have been identified for breast cancer, including several well-established reproductive factors that are mediated by hormonal mechanisms. Endogenous estrogen levels are likely controlled by genetic factors of the estrogen metabolism pathway and, in turn, play an integral part as determinants of breast cancer risk [26,27,28]. Thus, we aimed to examine whether there are any interactions between genes and environmental factors related to estrogen exposure or metabolism by using a candidate approach in Korean women.

2. Results

Table 1 shows the distribution of demographic and reproductive factors and their association with breast cancer risk in the Seoul Breast Cancer Study (SEBCS) population. Controls were generally older than cases, and most of the environmental factors were significantly different between cases and controls. Taller height, lower BMI, younger age at menarche, older age at first full term pregnancy (FFTP), no breastfeeding experience and shorter duration of breastfeeding, and longer duration of estrogen exposure before FFTP (EEBF) were associated with increased risk for breast cancer among the eight candidate environmental factors.
Table 1

Distributions of demographic and reproductive factors and associations with the risk of breast cancer, Seoul Breast Cancer Study.

Demographic and Reproductive FactorsCases N = 1970Controls N = 2052OR c95% CI
Mean ± SD%Mean ± SD%
Age49.0 ± 8.26 51.4 ± 7.75 0.990.98–1.01
Family history. yes 4.8 1.82.641.77–3.94
Height157.9 ± 5.04 155.9 ± 5.00 1.071.06–1.08
  <160 cm 59.6 75.71.00reference
  ≥160 cm 40.4 24.31.921.67–2.20
BMI (kg/m2)23.2 ± 2.94 23.7 ± 2.96 0.970.95–1.00
  <25 76.8 71.21.00reference
  ≥25 23.2 28.90.810.69–0.93
Menarche age14.8 ± 1.65 15.2 ± 1.77 0.900.86–0.93
  ≥14 years 78.9 85.01.00reference
  <14 years 21.1 15.01.271.07–1.50
Ever pregnancy 93.5 92.80.260.14–0.46
Age at FFTP a25.9 ±3.48 25.1 ± 3.36 1.051.03–1.07
  <27 years 55.6 66.41.00reference
  Nulliparity or ≥27 years 44.4 33.61.491.30–1.70
No. of children2.2 ± 0.85 2.3 ± 0.91 1.060.97–1.16
  ≥2 85.2 88.21.00reference
  <2 14.8 11.81.110.91–1.36
Never breastfeeding 22.3 14.51.31 d1.10–1.57
Breastfeeding duration (months)18.0 ± 22.59 24.0 ± 21.53 0.99 d0.99–1.00
Breastfeeding duration per child (months)7.3 ± 8.01 9.9 ± 7.27 0.97 d0.96–0.98
  ≥2.5 month 45.1 68.31.00reference
  <2.5 month 54.9 31.73.81 d3.16–4.59
Duration of EEBF b12.5 ± 6.78 11.7 ± 8.00 1.04 e1.02–1.05
  <13 years 61.1 76.21.00reference
  ≥13 years 38.9 23.81.75 e1.51–2.03
Postmenopausal women 38.2 54.80.600.50–0.72
Age at menopause48.5 ± 5.40 49.2 ± 4.55 0.960.94–0.98

a FFTP first full-term pregnancy (years); b EEBF estrogen exposure before the first full-term pregnancy (years); c ORs analyzed by logistic regression adjusting for age, family history, categorical FFTP, age at menarche, and menopausal status; d ORs analyzed by logistic regression adjusting for age, family history, categorical FFTP, age at menarche, menopausal status, and the number of children; e ORs analyzed by logistic regression adjusting for age, family history, ever pregnancy, and menopausal status.

We conducted the replication using both a European population and an Asian population in the Breast Cancer Association Consortium (BCAC) (Table S3), and their distribution of demographic and other factors are shown in Tables S4 and S5, as well as associations with breast cancer. The associations with the risk of breast cancer were heterogeneous by ethnicity in height, age at menarche, age at FFTP, and menopause status. We found two interactions through the GxE analysis (Table 2). High LDs were observed among the three SNPs (i.e., rs13035764, rs11125629, and rs11688818) in NCOA1 on chromosome 2.
Table 2

Top two SNPs from the gene and environment interaction analysis between 11,555 SNPs involved in estrogen metabolism and 8 environmental factors related to estrogen, Seoul Breast Cancer Study.

SNP Chromosome Position_b36 Gene Minor/Major_Allele MAF
rs13035764224571432 NCOA1 G/C0.2971
N(Cases/Controls) OR a (95%CI) Analysis methods Environment p-2df Threshold p-GxE
3925(1949/1976)0.84 (0.76–0.93)Weighted GE|2dfAge at menarche1.2 × 10−30.005 b0.0524
SNP Chromosome Position_b36 Gene Minor/Major_Allele MAF
rs8519986152025031 ESR1 A/G0.4426
N(Cases/Controls) OR a (95%CI) Analysis methods Environment p-2df Threshold p-GxE
3940(1955/1985)0.82 (0.75–0.90)Subset GE|2dfheight6.8 × 10−50.000110 c0.0471

a risk of SNP for breast cancer; b threshold of bin 1 in the weighted model; c 0.05 divided by 456 SNPs which were tested in step 2.

These three SNPs, i.e., rs13035764, rs11125629, and rs11688818, showed interactions with age at menarche by the weighted GE|2df method (p2df = 0.0012, 0.0018, 0.0027 respectively, significant threshold p = 0.005, bin1) (Figure S1). Age at menarche before 14 years old (<25% quartile) was associated with increased risk of breast cancer (OR = 1.27, 95% CI 1.07–1.50); this association was much stronger for subjects who had a homozygous minor allele G (ORage at menarche < 14 years|rs13035764_GG = 2.85, 95% CI 1.43–5.67). When we examined the associations of age at menarche with breast cancer risk in 44 studies, eight studies (seven European and one Asian) showed consistent effects with the SEBCS (Figure S3), however, interaction with rs13035764 was not observed in the replication set from either theEuropean or Asian populations (Table 3 and Table S6).
Table 3

Association between an environmental factor and breast cancer by genotypes.

StudyExposureCategoryN(Cases/Controls)OR95% CIp-GxE
SEBCSAge at menarche < 14 years old aEffect of E in overall4022(1970/2052)1.271.07–1.500.0524
E|rs13035764_CC1963(1007/956)1.190.95–1.51
E|rs13035764_CG1706(810/896)1.170.89–1.53
E|rs13035764_GG337(147/190)2.851.43–5.67
Height ≥ 160 cm bEffect of E in overall4022(1970/2052)1.921.67–2.200.0471
E|rs851998_GG1281(677/604)1.531.20–1.95
E|rs851998_GA1921(938/983)2.031.65–2.49
E|rs851998_AA819(355/464)2.391.74–3.29
BCAC-EuropeanAge at menarche < 12 years old aEffect of E in overall62,485(36,894/25,591)1.030.99–1.080.5314
E|rs13035764_CC9953(5921/4032)1.020.91–1.14
E|rs13035764_CG34,124(20,140/13,984)1.030.97–1.09
E|rs13035764_GG18,408(10,833/7575)1.050.97–1.14
Height ≥ 169 cm bEffect of E in overall56,363(31,994/24,369)0.970.93–1.010.2425
E|rs851998_GG27,224(15,513/11,711)0.930.88–0.99
E|rs851998_GA23,843(13,459/10,384)0.990.93–1.05
E|rs851998_AA5296(3022/2274)1.030.90–1.17
BCAC-AsianAge at menarche < 12 years old aEffect of E in overall9047(4392/4655)0.970.82–1.140.7817
E|rs13035764_CC4867(2407/2460)0.950.75–1.21
E|rs13035764_CG3504(1666/1838)0.990.76–1.28
E|rs13035764_GG676(319/357)1.040.57–1.91
Height ≥ 160 cm bEffect of E in overall5789(2879/2910)1.221.08–1.390.4642
E|rs851998_GG1751(899/852)1.200.95–1.52
E|rs851998_GA2846(1422/1424)1.190.99–1.42
E|rs851998_AA1192(558/634)1.381.04–1.82

ORs analyzed by logistic regression adjusting for age, family history, categorical FFTP, age at menarche, and menopausal status. a age at menarche <25% quartile; b height ≥75% quartile.

Figure S2 shows the results of the interaction between height and three SNPs (rs851998, rs851971, and rs851967) located near the ESR1 gene in chromosome 6 by the subset GE|2df method (p2df = 0.000068, 0.000094 and 0.0001, respectively, the significance threshold was 0.000110 because the number of SNPs tested in step 2 was 456). Height taller than 160 cm (≥75% quartile) was associated with increased risk of breast cancer (OR = 1.92, 95% CI 1.67–2.20). The association of height with breast cancer was stronger when the minor allele was considered (ORheight ≥ 160 cm|rs851998_GG = 1.53, 95% CI 1.20–1.95; ORheight ≥ 160 cm|rs851998_GA = 2.03, 95% CI 1.65–2.49; and ORheight ≥ 160 cm|rs851998_AA = 2.39, 95% CI 1.74–3.29) (Table 3). This association was observed among Asian studies of the BCAC. The risk of breast cancer associated with greater height was enhanced among those who had minor alleles (ORheight ≥ 160 cm|rs851998_AA = 1.38, 95% CI 1.04–1.82) but the interaction test was not significant (p-GxE = 0.4642) (Table 3). Although there were ten studies (nine European and one Asian) that showed the consistent effects of height with SEBCS (Figure S4), a significant interaction with rs851998 was not found, even in the combined population, which showed a consistent effect with the SEBCS (Table S7). The associations between each environmental factor and the risk of breast cancer by genotype were not different according to the estrogen receptor (ER) or progesterone receptor (PR) status in the SEBCS population (Table S8).

3. Discussion

We performed GxE analyses between 11,555 SNPs in and near the promoters of the 136 genes involved with estrogen metabolism and eight environmental factors related to estrogen exposure by using GxEScan. Various results from the two-step methods were obtained, and two interactions involving NCOA1 with age at menarche and ESR1 with height were found. The minor allele of rs13035764 in the NCOA1 was associated with an enhanced risk of breast cancer for younger age at menarche in the SEBCS. Our study suggested that women whose menstruation was late, possibly had an increased protective effect with this variant. NCOA1 is known as the steroid receptor coactivator 1 (SRC-1) encoding a protein that acts as a transcriptional coactivator for steroid and nuclear hormone receptors. Including SRC-1, SRC coactivators have important roles in development, growth, reproduction, and even in cancer [29]. It has been shown that SRC-1 proteins were overexpressed from 19% to 29% of human breast cancers, and this overexpression has been associated with large and high-grade tumors [30,31,32]. Although there was no epidemiological study that examined the association between SRC-1 and age at menarche directly, previous reports have indicated the regulatory role of SRC-1 during a specific phase in the menstrual cycle [33,34]. Thus, our findings could suggest that the risk for breast cancer decreases in the late menarche group as they were exposed not only to the hormone but also to the SRC-1 effects later. We also found that the minor allele of rs851998, which is located near the ESR1 gene, was associated with an increased risk of breast cancer for females who were taller than 160 cm. Previously, numerous studies have suggested the association between the ESR1 gene and breast cancer [35,36,37,38], and have reported the role of ESR1 mutations for breast cancer [39,40,41]. In addition, adult height has been linked with breast cancer risk from many epidemiological studies [42,43,44,45,46]. As a key factor for the development of female sex organs and sex characteristics, estrogen also plays a role in height as a part of the growth process [47,48]. Although which part of estrogen metabolism and how it acts on the growth in vivo is unclear [49], many studies have supported that because estrogen synthesis and secretion are increased during puberty, these factors could lead to epiphyseal fusion, termination of linear growth, and determination of final height [50,51,52,53]. With several reports suggesting the association between ESR1 and height [54,55,56], we think that the ESR1 gene and height possibly have an association with breast cancer risk, since significant interaction results were found in our results. This study was conducted by the candidate gene approach with a hypothesis for statistical efficiency and biological plausibility. There have been a considerable number of criticisms [57,58,59]. The main concerns have included inadequate analytic procedures during the selection of genetic or environmental factors and using statistical methods with small sample size and lower power, as well as publication bias. However, an opposite commentary has refuted that the candidate GxE approach remains to be the most commonly studied despite its limitations, and the goal of a candidate gene-environment interaction study is distinct from that of a GWAS-based interaction study [60]. GxE based on GWAS focuses on gene-environmental correlations, while candidate GxE with a hypothesis attempts to discover causes. Moreover, GxE based on GWAS mainly aims to overcome missing heritability and to examine the modified effect of genetic factors by environmental factors. There are other approaches to discover novel genetic factors, which have interactions with established environmental factors from genome-wide ranges; however, these approaches could not report significant interactions, possibly due to the burden of multiple testing [20,21]. Meanwhile, the candidate GxE approach, which aimed to detect genetic factors that have interactions with established environmental factors from candidate genes and to examine the modified effects of environmental factors by genotype, could decrease the statistical burden and interpret easily due to biological plausibility. There are several limitations to this study. First, we dichotomized the continuous variables when the environmental factors were included in the GxE analysis. Although this approach could have some statistical problems, such as losing information, underestimating the extent of variation, and concealing nonlinearity, binary variables make interpretation easy with possible mechanistic conclusions [61] and facilitate testing for multi-factor interactions [62]. Second, our statistically significant results were obtained from the EG|2df method rather than from the EDGxE, which demonstrated the best power and efficiency. Furthermore, the 2df test is called the joint gene and GxE test, in which the hypothesis is βg = βgxe = 0. Thus, a polymorphism can be detected even when there is no interaction, but it has a marginal effect. However, our results from the GE|2df showed that the effect of environmental factors differed according to the number of minor alleles (Table 3). Thus, they might not be false-positive results. Third, the interactions found from the SEBCS were not replicated in the independent large consortium data, even in the replication test using each ethnicity population and combined population which showed consistent effects with the SEBCS. To date, numerous studies have performed gene-environment interaction tests for breast cancer risk, however, the results have rarely been replicated [63,64,65]. Most of the studies have been conducted in the European population and all replicated results have also been found in the European descent study population. This could be explained by racial differences impacting environmental factors and also genetic backgrounds [66,67,68,69,70] (as shown the Figures S3 and S4 and Tables S6 and S7). A larger study population or consortium data including Asian descent are demanded to replicate the findings successfully. Fourth, the sample size of this study was not large relative to other consortium studies; however, two-step methods generally require a smaller sample size to achieve enough power for the GxE test as compared with the standard GxE test, and a previous study used only 2382 subjects and found novel genes by using GxEScan [13]. Lastly, breast cancer was considered as a whole entity rather than subtypes of breast cancers such as pre-menopausal and post-menopausal breast cancer due to the limitation of sample size. Further studies with a larger sample size would find that differences in the GxE depend on menopause status. Although breast cancer is the most common cancer in women worldwide, the incidence or mortality of breast cancer differs from country to country [71]. According to GLOBCAN 2018 and the Korean Breast Cancer Society, the incidence rate of most of these countries is decreasing, while it is still increasing in Korea. The mortality of breast cancer is also dramatically increasing [72,73]. Korean females have shown a decreasing trend of age at menarche and an increasing trend of height [74,75,76,77]. These aspects possibly explain why the incidence rate of breast cancer is increasing in Korea, and therefore there is a need for research on breast cancer specific to Korean women.

4. Methods

4.1. Study Population of the Gene-Environment Interaction (GxE) Study

The Seoul Breast Cancer Study (SEBCS) subjects were used in this study. As a multicenter case-control study, 4040 women with histologically confirmed breast cancer were recruited between 2001 and 2007 from the Seoul National University Hospital and the Asan Medical Center. During the same period, 1818 non-cancer patients aged 32 to 75 years were enrolled as hospital-based controls in the same hospital, and 2052 healthy women aged 39 to 71 years who were participants in a community health screening program from 2002 to 2007 were also included in the control group [78]. A questionnaire survey was administered by trained interviewers using in-person interviews for all subjects. Demographic information, family history, behavioral and dietary habits, and reproductive information were collected by this survey. Among the participants, we excluded those who had previous histories of cancer, hysterectomy, or oophorectomy, and only included those who were born from 1930 to 1969, following the same criteria as a previous study [78]. From these exclusion criteria, 3332 cases and 3620 controls were selected with questionnaire data. All subjects in the SEBCS had written informed consent, and the study design was approved by the Committee on Human Research of the Seoul National University Hospital (IRB no. H-0503-144-004).

4.2. Study Population and Data Used in the Replication

A large dataset pooled from the Breast Cancer Association Consortium (BCAC) was used for the replication. Participants who were not of European or Asian descent and had missing data regarding sex, age, and genotype information of SNPs were excluded. A replication dataset for interaction between age at menarche and an SNP that had shown interaction with age at menarche from the SEBCS was made after excluding subjects who had missing data on age at menarche. Likewise, a replication dataset for interaction between height and an SNP that had shown an interaction with height from the SEBCS was created after excluding missing data in height. Missing values of covariates including family history, age at first full term pregnancy, and menopause status were categorized separately. In total, 62,485 women of European descent (36,894 cases and 25,591 controls in 37 studies) and 9047 women of Asian descent (4392 cases and 4655 controls in 7 studies) were included in the replication dataset to examine the interaction between age at menarche and an SNP in NCOA1, and 56,363 women of European descent (31,994 cases and 24,369 controls in 29 studies) and 5789 women of Asian descent (2879 cases and 2910 controls in 4 studies) were included in a replication dataset to assess the interaction between height and an SNP in ESR1 (Table S3).

4.3. GWAS Data

The SEBCS also performed GWAS by using the Affymetrix Genome-Wide Human SNP Array 6.0 chip (Affymetrix, Inc., Santa Clara, CA, USA). A total of 4394 women (2342 cases and 2052 controls) were scanned, and we compiled a final dataset that included 555,117 SNPs after quality control exclusions. Detailed descriptions of genotyping and quality control procedures can be found in the previous literature [79,80]. To conduct gene-environment interaction analyses, we included as eligible subjects a total 4022 women (1970 cases and 2052 controls) who had both questionnaire and GWAS data.

4.4. Candidate Gene Selection

We used the search term “estrogen metabolism” in several databases to identify candidate genes involved with estrogen metabolism. Ninety-one genes were identified using the Ensembl genome browser 84 (www.ensembl.org/ (accessed on 30 May 2016)), and 89 genes were selected except for two genes located on the X chromosome. Twenty-eight genes were searched using AmiGO (http://amigo1.geneontology.org/ (accessed on 30 May 2016)), but all genes overlapped with the previous selection. Twenty-nine genes were searched using AmiGO2 (http://amigo.geneontology.org/amigo (accessed on 30 May 2016)), and two genes, which did not overlap with the previous selection, were added. We also searched the literature to find more genes related to estrogen metabolism. Thirty-seven genes were added among 199 SNPs, which showed significant results from the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis [81], and nine genes were further added from previous studies [82,83,84,85,86,87,88,89,90,91,92,93,94]. A total of 137 candidate genes were on our list.

4.5. SNPs Extraction and Imputation

The location and range of candidate genes data were searched using Ensembl genome browser 84 and the National Center for Biotechnology Information (NCBI) website (GRCh38/hg38 ver.). To cover the promoter regions of most candidate genes, we included SNPs from candidate genes that were within 20 Kb from the start and the end positions of each range [95]. Of the 555,117 SNPs mentioned in the above GWAS data, 2482 SNPs were extracted from 136 genes and one gene had an empty extraction result. After the Hardy–Weinberg equilibrium (HWE) test, 2472 SNPs were selected with HWE > 0.001. We also performed an SNP imputation based on HapMap 2.0 (release 22/hg18) with the same range of candidate genes. In total, 9750 SNPs with quality scores > 0.3 were obtained by imputation, and 9083 SNPs with minor allele frequency (MAF) > 0.01 were included in the GxE analysis. In total, 11,555 SNPs were tested in the GxE analysis.

4.6. Environmental Factors Related to Estrogen Exposure

We selected seven variables related to estrogen exposure in this analysis including body mass index (BMI), height, age at menarche, age at first full term pregnancy (FFTP), number of children, breastfeeding, and total duration of breastfeeding. All exposures were continuous variables, except for breastfeeding which was collected as yes/no, and most of the continuous variables had missing values. We performed an imputation for missing data to the median value of each variable in the stratified birth year groups (1930–1950, 1950–1959, and 1960–1969), and therefore the effect of birth cohort could be considered without reducing power with the sample size (Table 1 and Table S1). To assess the adjusted effect of breastfeeding duration, breastfeeding duration per child was calculated by dividing the total duration of breastfeeding by the number of children. We also made a new variable to examine the combined effects of age at menarche and age at FFTP. The duration of estrogen exposure before FFTP (EEBF) was calculated by subtracting the age at menarche from the age at FFTP for parous women. For nulliparous women, EEBF was calculated by subtracting the age at menarche from the age at interview for pre-menopause women, or from the age at menopause for post-menopause women. To reduce computational burden and to facilitate interpretation, all continuous variables were dichotomized by quartile of risk direction. This resulted in eight environmental factors in the GxE analysis that were categorized as follows: age at menarche (≥14 years vs. <14 years, 25% quartile); age at FFTP (<27 years vs. nulliparity or ≥27 years, 75% quartile); height (<160 cm vs. ≥160 cm. 75% quartile); BMI (<25 kg/m2 vs. ≥25 kg/m2 Korea criteria of obesity); number of children (≥2 vs. <2, 25% quartile); breastfeeding (often vs. never); breastfeeding duration per child (≥2.5 month vs. <2.5 month, 25% quartile); and EEBF (<13 years vs. ≥13 years, 75% quartile). Likewise, continuous variables from the BCAC were dichotomized by quartile of risk direction in the European and Asian populations, respectively, as follows: age at menarche (≥12 years vs. <12 years, 25% quartile in both European and Asian populations); height (<169 cm vs. ≥169 cm, 75% quartile in European population and <160 cm vs. ≥160 cm, 75% quartile in Asian population); and age at FFTP (<28 years vs. nulliparity or ≥28 years, 75% quartile in both European and Asian populations). The estrogen receptor (ER) or progesterone receptor (PR) status was collected by the medical and pathological records and was determined when the tumor cells showed 10% or more positive nuclear staining from an immunohistochemistry assay [78].

4.7. Statistical Analysis

To estimate the effect of each environmental factor for breast cancer with odds ratios (OR) and 95% confidence intervals (CI), age, family history, parity, age at FFTP, age at menarche, and menopausal status were commonly included in the model. For variables regarding parity such as the number of children, breastfeeding history, and breastfeeding duration per child, further adjustments were performed by adding the number of children to the model. As an exception, age, family history, parity, and menopausal status were included in the model when the analysis was performed for EEBF. These models were consistently used in the GxE analysis, and in the logistic regression to estimate genetic risk factors, which showed significant interactions in the GxE analysis. GxEScan (ver. Beta 0.4.0; http://biostats.usc.edu/software (accessed on 1 June 2016)) was mainly used in this study to conduct the GxE analysis. This software was developed by Gauderman et al. [13] and was designed to conduct GxE analysis in case-control study designs using the traditional GxE test and also a few two-step methods. Especially, the two-step methods have generally been proposed to provide greater statistical power than the traditional case-control GxE test while preserving the type I error rate. Four two-step methods were provided by GxEScan, i.e., DG|GxE, GE|GxE, EDGxE, and GE|2df and we mainly observed GxE results from these methods. There were two hypotheses testing approaches in the two-step methods, ‘subset testing’ and ‘weighted testing’. Detailed descriptions of these hypotheses can be found elsewhere [13,96] and both were used in this study. As an initial step, we carried out GxE analyses by birth year groups (1930–1950, 1950–1959, and 1960–1969) because each birth year group had a different distribution of environmental factors (Table S2). The results of GxE analyses of each birth year group were inconsistent and were not statistically significant, possibly due to the modest sample size. For these reasons, we performed a meta-analysis of the three sets of results obtained from each birth year group by using METAL software [97], and then compared our results with the GxE result from the overall pooled data. There was a negligible difference between the two results; we presented the result from the pooled data analysis. PLINK (ver. 1.07; http://pngu.mgh.harvard.edu/purcell/plink/ (accessed on 7 March 2016)) was used to examine GWAS data and for extracting candidate SNPs. Overall management of questionnaire data and statistical analyses were performed by SAS (ver. 9.4, SAS Institute Inc., Cary, NC, USA). Replication analyses were performed in the two datasets for gene x height and for gene x age at menarche, separately. Interactions were tested by ethnicity first, and then in each study that showed the consistent effects of exposures with SEBCS. SAS (ver. 9.4; SAS Institute Inc., Cary, NC, USA) was used to test replication.

5. Conclusions

Through the GxE test between 136 candidate genes involved in estrogen metabolism and eight environmental factors related to estrogen exposure, we found two interactions in middle-aged Korean females. Although the results of our study were not replicated, they do suggest specificity in Koreans for breast cancer risk. The explainable interaction might be a better method to interpret and apply the results than deep medicine or artificial intelligence [98]. We expect to make a step forward in predicting and preventing breast cancer, as more interactions are to be investigated.
  96 in total

Review 1.  Health disparities in breast cancer: biology meets socioeconomic status.

Authors:  Barbara K Dunn; Tanya Agurs-Collins; Doris Browne; Ronald Lubet; Karen A Johnson
Journal:  Breast Cancer Res Treat       Date:  2010-05-01       Impact factor: 4.872

Review 2.  Candidate gene-environment interaction research: reflections and recommendations.

Authors:  Danielle M Dick; Arpana Agrawal; Matthew C Keller; Amy Adkins; Fazil Aliev; Scott Monroe; John K Hewitt; Kenneth S Kendler; Kenneth J Sher
Journal:  Perspect Psychol Sci       Date:  2015-01

Review 3.  Gene--environment-wide association studies: emerging approaches.

Authors:  Duncan Thomas
Journal:  Nat Rev Genet       Date:  2010-04       Impact factor: 53.242

Review 4.  ESR1 mutations—a mechanism for acquired endocrine resistance in breast cancer.

Authors:  Rinath Jeselsohn; Gilles Buchwalter; Carmine De Angelis; Myles Brown; Rachel Schiff
Journal:  Nat Rev Clin Oncol       Date:  2015-06-30       Impact factor: 66.675

5.  Cyclic changes in the expression of steroid receptor coactivators and corepressors in the normal human endometrium.

Authors:  Tanri Shiozawa; Hsien-Chang Shih; Tsutomu Miyamoto; Yu-Zhen Feng; Junko Uchikawa; Kazuko Itoh; Ikuo Konishi
Journal:  J Clin Endocrinol Metab       Date:  2003-02       Impact factor: 5.958

6.  Joint effects of the CYP1A1 MspI, ERalpha PvuII, and ERalpha XbaI polymorphisms on the risk of breast cancer: results from a population-based case-control study in Shanghai, China.

Authors:  Yueping Shen; De-Kun Li; Junqing Wu; Zibao Zhang; Ersheng Gao
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2006-02       Impact factor: 4.254

Review 7.  Estrogens and growth: review.

Authors:  Andrei S Chagin; Lars Sävendahl
Journal:  Pediatr Endocrinol Rev       Date:  2007-06

8.  A genome-wide association study identifies a breast cancer risk variant in ERBB4 at 2q34: results from the Seoul Breast Cancer Study.

Authors:  Hyung-cheol Kim; Ji-Young Lee; Hyuna Sung; Ji-Yeob Choi; Sue K Park; Kyoung-Mu Lee; Young Jin Kim; Min Jin Go; Lian Li; Yoon Shin Cho; Miey Park; Dong-Joon Kim; Ji Hee Oh; Jun-Woo Kim; Jae-Pil Jeon; Soon-Young Jeon; Haesook Min; Hyo Mi Kim; Jaekyung Park; Keun-Young Yoo; Dong-Young Noh; Sei-Hyun Ahn; Min Hyuk Lee; Sung-Won Kim; Jong Won Lee; Byeong-Woo Park; Woong-Yang Park; Eun-Hye Kim; Mi Kyung Kim; Wonshik Han; Sang-Ah Lee; Keitaro Matsuo; Chen-Yang Shen; Pei-Ei Wu; Chia-Ni Hsiung; Jong-Young Lee; Hyung-Lae Kim; Bok-Ghee Han; Daehee Kang
Journal:  Breast Cancer Res       Date:  2012-03-27       Impact factor: 6.466

Review 9.  Oestrogen exposure and breast cancer risk.

Authors:  Ruth C Travis; Timothy J Key
Journal:  Breast Cancer Res       Date:  2003-07-28       Impact factor: 6.466

10.  Genetic variation at CYP3A is associated with age at menarche and breast cancer risk: a case-control study.

Authors:  Nichola Johnson; Frank Dudbridge; Nick Orr; Lorna Gibson; Michael E Jones; Minouk J Schoemaker; Elizabeth J Folkerd; Ben P Haynes; John L Hopper; Melissa C Southey; Gillian S Dite; Carmel Apicella; Marjanka K Schmidt; Annegien Broeks; Laura J Van't Veer; Femke Atsma; Kenneth Muir; Artitaya Lophatananon; Peter A Fasching; Matthias W Beckmann; Arif B Ekici; Stefan P Renner; Elinor Sawyer; Ian Tomlinson; Michael Kerin; Nicola Miller; Barbara Burwinkel; Frederik Marme; Andreas Schneeweiss; Christof Sohn; Pascal Guénel; Therese Truong; Emilie Cordina; Florence Menegaux; Stig E Bojesen; Børge G Nordestgaard; Henrik Flyger; Roger Milne; M Pilar Zamora; Jose Ignacio Arias Perez; Javier Benitez; Leslie Bernstein; Hoda Anton-Culver; Argyrios Ziogas; Christina Clarke Dur; Hermann Brenner; Heiko Müller; Volker Arndt; Aida Karina Dieffenbach; Alfons Meindl; Joerg Heil; Claus R Bartram; Rita K Schmutzler; Hiltrud Brauch; Christina Justenhoven; Yon-Dschun Ko; Heli Nevanlinna; Taru A Muranen; Kristiina Aittomäki; Carl Blomqvist; Keitaro Matsuo; Thilo Dörk; Natalia V Bogdanova; Natalia N Antonenkova; Annika Lindblom; Arto Mannermaa; Vesa Kataja; Veli-Matti Kosma; Jaana M Hartikainen; Georgia Chenevix-Trench; Jonathan Beesley; Anna H Wu; David Van den Berg; Chiu-Chen Tseng; Diether Lambrechts; Dominiek Smeets; Patrick Neven; Hans Wildiers; Jenny Chang-Claude; Anja Rudolph; Stefan Nickels; Dieter Flesch-Janys; Paolo Radice; Paolo Peterlongo; Bernardo Bonanni; Valeria Pensotti; Fergus J Couch; Janet E Olson; Xianshu Wang; Zachary Fredericksen; Vernon S Pankratz; Graham G Giles; Gianluca Severi; Laura Baglietto; Chris Haiman; Jacques Simard; Mark S Goldberg; France Labrèche; Martine Dumont; Penny Soucy; Soo Teo; Cheng Har Yip; Sze Yee Phuah; Belinda K Cornes; Vessela N Kristensen; Grethe Grenaker Alnæs; Anne-Lise Børresen-Dale; Wei Zheng; Robert Winqvist; Katri Pylkäs; Arja Jukkola-Vuorinen; Mervi Grip; Irene L Andrulis; Julia A Knight; Gord Glendon; Anna Marie Mulligan; Peter Devillee; Jonine Figueroa; Stephen J Chanock; Jolanta Lissowska; Mark E Sherman; Per Hall; Nils Schoof; Maartje Hooning; Antoinette Hollestelle; Rogier A Oldenburg; Madeleine Tilanus-Linthorst; Jianjun Liu; Angie Cox; Ian W Brock; Malcolm W R Reed; Simon S Cross; William Blot; Lisa B Signorello; Paul D P Pharoah; Alison M Dunning; Mitul Shah; Daehee Kang; Dong-Young Noh; Sue K Park; Ji-Yeob Choi; Mikael Hartman; Hui Miao; Wei Yen Lim; Anthony Tang; Ute Hamann; Asta Försti; Thomas Rüdiger; Hans Ulrich Ulmer; Anna Jakubowska; Jan Lubinski; Katarzyna Jaworska-Bieniek; Katarzyna Durda; Suleeporn Sangrajrang; Valerie Gaborieau; Paul Brennan; James McKay; Susan Slager; Amanda E Toland; Celine Vachon; Drakoulis Yannoukakos; Chen-Yang Shen; Jyh-Cherng Yu; Chiun-Sheng Huang; Ming-Feng Hou; Anna González-Neira; Daniel C Tessier; Daniel Vincent; Francois Bacot; Craig Luccarini; Joe Dennis; Kyriaki Michailidou; Manjeet K Bolla; Jean Wang; Douglas F Easton; Montserrat García-Closas; Mitch Dowsett; Alan Ashworth; Anthony J Swerdlow; Julian Peto; Isabel dos Santos Silva; Olivia Fletcher
Journal:  Breast Cancer Res       Date:  2014-05-26       Impact factor: 6.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.