| Literature DB >> 31039828 |
Annelie Johansson1, Domenico Palli2, Giovanna Masala2, Sara Grioni3, Claudia Agnoli3, Rosario Tumino4, Maria Concetta Giurdanella4, Francesca Fasanelli5, Carlotta Sacerdote5, Salvatore Panico6, Amalia Mattiello6, Silvia Polidoro7, Michael E Jones8, Minouk J Schoemaker8, Nick Orr9,10, Katarzyna Tomczyk10, Nichola Johnson10, Olivia Fletcher10, Vittorio Perduca11, Laura Baglietto12, Pierre-Antoine Dugué13,14,15, Melissa C Southey15,16, Graham G Giles13,14, Dallas R English13,14, Roger L Milne13,14,16, Gianluca Severi13,14,17, Srikant Ambatipudi18,19, Cyrille Cuenin18, Veronique Chajès18, Isabelle Romieu18, Zdenko Herceg18, Anthony J Swerdlow8,20, Paolo Vineis7,21, James M Flanagan22.
Abstract
BACKGROUND: It is well established that estrogens and other hormonal factors influence breast cancer susceptibility. We hypothesized that a woman's total lifetime estrogen exposure accumulates changes in DNA methylation, detectable in the blood, which could be used in risk assessment for breast cancer.Entities:
Keywords: Biomarker; Breast cancer; Cancer risk; DNA methylation; EWAS; Epigenetics; Estrogen exposure; Hormonal exposures
Mesh:
Substances:
Year: 2019 PMID: 31039828 PMCID: PMC6492393 DOI: 10.1186/s13148-019-0664-7
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1EWAS identifies CpG sites significantly associated with ELEE in EPIC-Italy. An EWAS with DNA methylation (HM450K beta values) as outcome and ELEE as exposure was conducted in EPIC-Italy (n = 216) using a beta regression model adjusted for age, BMI, alcohol consumption, and smoking duration (all variables reported at recruitment), and batch, position on batch, and WBC composition. a Manhattan plot of minus log10Q values for FDR-corrected P values from the EWAS of all 404,596 probes across the genome. Blue line indicates FDR Q value threshold of 0.05. b Volcano plot of the regression coefficients (estimates) from the beta regression model, showing significant hypomethylated (in blue) and hypermethylated probes (in purple)
List of the 42 target CpG sites included in the targeted bisulfite sequencing
| HM450K probe | Chr | Position | Nearest gene | Distance to gene | EWAS of ELEE in EPIC-Italyb | |||
|---|---|---|---|---|---|---|---|---|
| Estimate | SE | |||||||
| cg01893629 | chr12 | 34494825 | ALG10 | 313588 | − 0.14 | 0.03 | 8.15E−05 | 4.96E−02 |
| cg08254089a | chr20 | 36933189 | BPI | 0 | − 0.28 | 0.08 | 7.04E−05 | 4.78E−02 |
| cg21590238 | chr12 | 121454837 | C12orf43 | 536 | − 0.29 | 0.08 | 2.67E−06 | 1.44E−02 |
| cg21153102 | chr15 | 41252147 | CHAC1 | 3429 | − 0.43 | 0.14 | 6.16E−05 | 4.58E−02 |
| cg03340215a | chr15 | 83315615 | CPEB1 | 0 | − 0.31 | 0.08 | 4.78E−05 | 4.21E−02 |
| cg06968859 | chr2 | 80724209 | CTNNA2 | 0 | − 0.26 | 0.05 | 2.52E−09 | 1.02E−03 |
| cg12105860a | chr12 | 31742801 | DENND5B | 0 | 0.14 | 0.03 | 1.58E−06 | 1.21E−02 |
| cg16840364a | chr4 | 84539569 | GPAT3 | 12541 | − 0.14 | 0.06 | 4.77E−05 | 4.21E−02 |
| cg08835688 | chr7 | 50849931 | GRB10 | 0 | − 0.14 | 0.03 | 4.50E−05 | 4.11E−02 |
| cg08349826 | chr16 | 10346403 | GRIN2A | 69791 | − 0.11 | 0.04 | 4.19E−05 | 4.02E−02 |
| cg23681866 | chr6 | 29895175 | HLA-J | 0 | − 1.33 | 0.31 | 2.17E−05 | 3.13E−02 |
| cg22968966 | chr16 | 22959875 | HS3ST2 | 32215 | − 0.40 | 0.12 | 7.95E−05 | 4.92E−02 |
| cg15127563 | chr2 | 231729487 | ITM2C | 132 | 0.36 | 0.09 | 1.77E−05 | 2.95E−02 |
| cg20020161 | chr2 | 231732669 | ITM2C | 0 | − 0.14 | 0.04 | 4.49E−05 | 4.11E−02 |
| cg22097768 | chr17 | 61615913 | KCNH6 | 0 | − 0.20 | 0.05 | 4.88E−05 | 4.25E−02 |
| cg17969123 | chr19 | 18745971 | KLHL26 | 1865 | − 0.15 | 0.04 | 3.18E−05 | 3.65E−02 |
| cg05422360 | chrX | 75648455 | MAGEE1 | 0 | − 0.44 | 0.12 | 1.72E−07 | 5.26E−03 |
| cg01768446 | chr16 | 89982419 | MC1R | 1866 | − 0.13 | 0.03 | 4.57E−05 | 4.13E−02 |
| cg25372296 | chr1 | 98510328 | MIR137HG | 0 | 0.33 | 0.09 | 3.83E−06 | 1.65E−02 |
| cg04519403a | chr5 | 79298951 | MTX3 | 11862 | − 0.24 | 0.05 | 7.51E−06 | 2.20E−02 |
| cg12091786 | chr20 | 61877942 | NKAIN4 | 0 | − 0.37 | 0.10 | 2.07E−05 | 3.06E−02 |
| cg25279613a | chr7 | 24956523 | OSBPL3 | 0 | 0.20 | 0.06 | 4.95E−06 | 1.96E−02 |
| cg24536703 | chr11 | 77183438 | PAK1 | 0 | − 0.32 | 0.09 | 2.71E−05 | 3.43E−02 |
| cg24036523 | chr14 | 73712256 | PAPLN | 0 | − 0.45 | 0.11 | 3.05E−05 | 3.65E−02 |
| cg16720405 | chr3 | 122790178 | PDIA5 | 0 | − 0.17 | 0.05 | 5.17E−05 | 4.33E−02 |
| cg13674411 | chr1 | 204232677 | PLEKHA6 | 0 | 0.11 | 0.03 | 3.50E−05 | 3.76E−02 |
| cg20684174 | chr11 | 7541255 | PPFIBP2 | 0 | − 0.13 | 0.03 | 1.23E−05 | 2.67E−02 |
| cg01430588a | chr17 | 56769767 | RAD51C | 194 | − 0.32 | 0.09 | 6.04E−05 | 4.51E−02 |
| cg22273487 | chr20 | 32580931 | RALY | 525 | 0.19 | 0.05 | 1.60E−05 | 2.84E−02 |
| cg22343083 | chr8 | 54786401 | RGS20 | 0 | − 0.30 | 0.07 | 1.92E−06 | 1.26E−02 |
| cg22758104 | chr17 | 50465 | RPH3AL | 11713 | − 0.23 | 0.06 | 4.62E−07 | 6.07E−03 |
| cg16733643a | chr1 | 41575522 | SCMH1 | 0 | − 0.42 | 0.11 | 5.00E−05 | 4.25E−02 |
| cg17588491 | chr22 | 25198892 | SGSM1 | 3242 | − 0.17 | 0.04 | 4.41E−05 | 4.06E−02 |
| cg13971030 | chr11 | 35366721 | SLC1A2 | 0 | − 0.49 | 0.14 | 9.19E−06 | 2.41E−02 |
| cg17567562a | chr3 | 47687980 | SMARCC1 | 0 | − 0.48 | 0.11 | 1.18E−05 | 2.66E−02 |
| cg10298859 | chr13 | 112883993 | SPACA7 | 146656 | − 0.20 | 0.04 | 2.26E−07 | 5.26E−03 |
| cg19216791 | chr19 | 5568216 | TINCR | 210 | − 0.25 | 0.07 | 4.13E−05 | 4.02E−02 |
| cg25936380 | chr2 | 120981591 | TMEM185B | 606 | − 0.29 | 0.07 | 2.22E−07 | 5.26E−03 |
| cg01824466a | chr8 | 95959531 | TP53INP1 | 0 | − 0.23 | 0.08 | 8.08E−05 | 4.96E−02 |
| cg26657235 | chr6 | 150378972 | ULBP3 | 4367 | − 0.12 | 0.03 | 1.32E−05 | 2.76E−02 |
| cg08551047 | chr15 | 91473569 | UNC45A | 0 | − 0.58 | 0.14 | 2.87E−06 | 1.46E−02 |
| cg20394620a | chrX | 48541924 | WAS | 260 | − 0.20 | 0.05 | 3.02E−06 | 1.46E−02 |
SE standard error
aMarked probes did not pass quality control in the targeted sequencing data and were not included in the analysis of the methylation index
bResult from the EWAS of ELEE in EPIC-Italy (n = 216, dataset 2). The estimates correspond to regression coefficients from a mixed-effects linear regression model (percentage change in DNA methylation per unit longer ELEE), and P values from the beta regression model, which have been corrected for multiple testing using FDR (Q values)
Table of characteristics for case-control pairs in EPIC-Italy and the Generations Study
| EPIC-Italy | The Generations Study | |||||
|---|---|---|---|---|---|---|
| Cases ( | Controls ( | Cases ( | Controls ( |
| ||
| Age | Mean (st.dev.), years | 52.9 (7.2) | 53.0 (7.1) | 53.9 (10.3) | 54.1 (10.4) | 0.054 |
| Time to diagnosis | Mean (st.dev.), years | 5.3 (4.4) | NA | 4.0 (2.4) | NA |
|
| Menopausal status | 0.690 | |||||
| Premenopausal | 52 (32.1%) | 49 (30.2%) | 135 (39.8%) | 127 (37.5%) | ||
| Postmenopausal | 85 (52.5%) | 87 (53.7%) | 204 (60.2%) | 212 (62.5%) | ||
| Age at menarche | Mean (st.dev.), years | 12.7 (1.4) | 12.7 (1.7) | 12.7 (1.4) | 12.7 (1.5) | 0.914 |
| Age at menopause | Mean (st.dev.), years | 50.2 (3.7) | 49.1 (3.8) | 50.3 (4.3) | 50.1 (4.5) | 0.126 |
| Number of pregnancies | Mean (st.dev.) | 1.6 (1.1) | 1.7 (1.0) | 1.9 (1.1) | 1.9 (1.2) |
|
| Ever breastfed | 103 (63.6%) | 112 (69.1%) | 265 (78.2%) | 274 (80.8%) | 0.179 | |
| Breastfeeding duration | Mean (st.dev.), years | 0.7 (0.6) | 0.8 (0.7) | 0.9 (1.0) | 0.8 (0.9) |
|
| BMI | Mean (st.dev.), kg/m2 | 25.8 (4.1) | 25.3 (4.3) | 25.7 (4.3) | 25.2 (4.3) | 0.626 |
| Alcohol consumptiona | Mean (st.dev.) | 5.5 (7.0) | 7.4 (9.9) | 15.8 (16.7) | 14.6 (15.5) |
|
| Smoking status |
| |||||
| Smoker | 31 (19.1%) | 36 (22.2%) | 25 (7.4%) | 23 (6.8%) | ||
| Former | 23 (14.2%) | 41 (25.3%) | 98 (28.9%) | 86 (25.4%) | ||
| Never | 106 (65.4%) | 85 (52.5%) | 216 (63.7%) | 230 (67.9%) | ||
| Smoking duration | Mean (st.dev.), years | 8.1 (12.6) | 10.6 (13.4) | 4.4 (9.0) | 3.9 (9.0) |
|
| OC ever | 59 (36.4 %) | 67 (41.4 %) | 253 (74.6%) | 256 (75.5%) |
| |
| OC duration | Mean (st.dev.), years | 4.1 (5.1) | 5.6 (5.8) | 8.1 (6.4) | 8.8 (6.8) |
|
| HRT ever | 23 (14.2 %) | 28 (17.3 %) | 239 (70.5%) | 241 (71.1%) |
| |
| HRT duration | Mean (st.dev.), years | 2.8 (2.7) | 2.6 (3.4) | 6.7 (5.7) | 5.4 (4.3) |
|
aAlcohol consumption in the Generations Study reported in average units per week, converted to average gram per day by multiplying with 8 (1 unit = 8 g alcohol) and dividing by 7
bP values indicate differences between all subjects from EPIC-Italy and all subjects from the Generations Study: t test for continuous variables, chi-squared test for categorical variables
Fig. 2The MI is associated with breast cancer risk. The MI was developed in combined HM450K data from EPIC-Italy (dataset 2, n = 237) and the Generations Study (dataset 3, n = 65) using ridge regression. The correlation between the MI and ELEE and the association between the MI and breast cancer risk were evaluated. a The correlations between the MI and ELEE in the development of HM450K data were as follows: r = 0.60 and P = 6 × 10−25 for EPIC-Italy and r = 0.27 and P = 0.027 for the Generations Study b The MI and ELEE were not correlated in the Generations Study targeted sequencing data (r =− 0.04, P = 0.340). c Density plot of the MI values in controls and cases in EPIC-Italy HM450K data. The MI was significantly associated with breast cancer risk in EPIC-Italy (n = 162 pairs, OR = 1.51, 95% CI 1.26–1.82, P = 1 × 10−5). d Density plot of the MI values in controls and cases in the Generations Study targeted sequencing data. The MI was significantly associated with breast cancer risk in the Generations Study (n = 339 pairs, OR = 1.04, 95% CI 1.01–1.08, P = 0.022). ORs were adjusted for age, BMI, alcohol consumption, and smoking duration (all variables reported at recruitment) and WBC composition
Fig. 3Meta-analysis of the association between MI and breast cancer risk. The association between MI and risk for breast cancer, as a continuous variable (a) or as a categorical variable (b), was estimated in the four studies included in the meta-analysis using conditional logistic regression adjusted for age, BMI, alcohol consumption, and smoking duration (all variables reported at recruitment) and WBC composition. The log odds ratios were combined in a meta-analysis using restricted-maximum likelihood model. The square boxes represent the odds ratios (ORs) and the lines the 95% confidence intervals (CIs). aEPIC-Italy corresponds to the new EPIC-Italy samples, not included in the development of the MI
Fig. 4Time to diagnosis and the association between the MI and breast cancer risk. Matched case-control pairs were stratified on median time to diagnosis in EPIC-Italy HM450K data (dataset 2) and in the four study cohorts included in the meta-analysis. The association between the MI and breast cancer risk was analyzed in the two groups. a The MI was significantly associated with breast cancer risk in EPIC-Italy pairs with a shorter time to diagnosis (n = 81 pairs, OR = 1.47, 95% CI 1.12–1.93, P = 0.005). b The MI was significantly associated with breast cancer risk in EPIC-Italy pairs with a longer time to diagnosis (n = 81 pairs, OR = 1.84, 95% CI 1.30–2.61, P = 0.001). c The combined meta-analysis including pairs with shorter time to diagnosis showed no significant association between the MI and breast cancer risk (OR = 1.03, 95% CI 0.98–1.08, P = 0.241). d The combined meta-analysis including pairs with shorter time to diagnosis showed no significant association between the MI and breast cancer risk (OR = 1.05, 95% CI 1.01–1.10, P = 0.021). The log odds ratios were combined in the meta-analyses using restricted-maximum likelihood model. ORs were adjusted for age, BMI, alcohol consumption, and smoking duration (all variables reported at recruitment) and WBC composition