| Literature DB >> 30157950 |
Erin W Hofstatter1, Steve Horvath2,3, Disha Dalela4, Piyush Gupta5, Anees B Chagpar6, Vikram B Wali7, Veerle Bossuyt8, Anna Maria Storniolo9, Christos Hatzis7, Gauri Patwardhan7, Marie-Kristin Von Wahlde7,10, Meghan Butler4, Lianne Epstein7, Karen Stavris4, Tracy Sturrock4, Alexander Au4,11, Stephanie Kwei4, Lajos Pusztai7.
Abstract
BACKGROUND: Age is one of the most important risk factors for developing breast cancer. However, age-related changes in normal breast tissue that potentially lead to breast cancer are incompletely understood. Quantifying tissue-level DNA methylation can contribute to understanding these processes. We hypothesized that occurrence of breast cancer should be associated with an acceleration of epigenetic aging in normal breast tissue.Entities:
Keywords: Biomarkers; Breast; Breast neoplasms; DNA methylation; Epigenetics; Epigenomics; Genome; Humans; Multivariate analysis; Smoking
Mesh:
Year: 2018 PMID: 30157950 PMCID: PMC6114717 DOI: 10.1186/s13148-018-0534-8
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Demographic variables of the cancer and control arms
| Variables | Breast cancer | Controls | |
|---|---|---|---|
| Total cohort samples | 40 | 53 | |
| Age (years) | 49.7 (46.32–53.02) | 45.9 (40.29–51.55) | 0.126 |
| Age category | 0.742 | ||
| < 50 years | 24 (60%) | 30 (57%) | |
| ≥ 50 years | 16 (40%) | 23 (43%) | |
| Ethnicity | 0.076 | ||
| White | 31(78%) | 49 (92%) | |
| African Americans | 4 (10%) | 3 (6%) | |
| Others | 5 (13%) | 1 (2%) | |
| Ashkenazi Jew | 6 (15%) | 3 (6%) | 0.162 |
| Height (in.) | 64.02 (63.10–64.94) | 63.96 (63.19–64.73) | 0.45 |
| Weight (lbs) | 157.22 (146.54–167.90) | 157.50 (148.59–166.41) | 0.483 |
| BMI (kg/m2) | 0.446 | ||
| < 18.5 | 0 (0%) | 1 (2%) | |
| 18.5–24.9 | 21 (53%) | 22 (42%) | |
| 25.0–29.9 | 8 (20%) | 17 (32%) | |
| > 30 | 11 (28%) | 13 (25%) | |
| Tobacco use | 0.696 | ||
| No | 25 (63%) | 31 (58%) | |
| Yes | 15 (38%) | 22 (42%) | |
| Smoking (pack years) | 3.73 (1.06–6.41) | 3.86 (1.06–6.59) | 0.475 |
| Current alcohol use | 0.019 | ||
| No | 20 (52%) | 15 (28%) | |
| Yes | 18 (47%) | 38 (72%) | |
| Positive family history of breast cancer | 18 (45%) | 11 (21%) | 0.012 |
| Age at menarche (years) | 12.37 (11.72–13.03) | 12.54 (12.20–12.87) | 0.671 |
| Menopausal status | 0.212 | ||
| Pre-menopausal | 27 (68%) | 29 (55%) | |
| Post-menopausal | 13 (33%) | 24 (45%) | |
| Ever pregnant | 0.266 | ||
| No | 8 (20%) | 16 (30%) | |
| Yes | 32 (80%) | 37 (70%) | |
| No. of times pregnant | 2.6 (1.94–3.25) | 1.94 (1.48–2.39) | 0.049 |
| Age at first childbirth (years) | 25.73 (23.24–28.99) | 25.61 (24.21–27.01) | 0.465 |
| Number of live births | 1.97 (1.54–2.40) | 1.57 (1.23–1.92) | 0.073 |
| Breastfeeding | 0.567 | ||
| No | 25 (63%) | 30 (57%) | |
| Yes | 15 (38%) | 23 (43%) | |
| ER/PR status | NA | ||
| ER+/PR+ | 38 (95%) | – | |
| ER+/PR- | 1 (2.5%) | – | |
| ER−/PR+ | 0(0) | ||
| ER−/PR− | 1(2.5%) | – | |
| Her2 status | NA | ||
| Not typed | 6 (15%) | – | |
| Her− | 31 (78%) | – | |
| Her+ | 3 (7.5%) | – |
Multivariate logistic regression predicting breast cancer
| Logistic regression | Number of obs | 57 | ||||
| LR chi2(9) | 25.9 | |||||
| Prob > | 0.0021 | |||||
| Log likelihood | − 25.488985 | Pseudo | 0.3369 | |||
| Breast cancer status | Odds ratio | Std. err. |
| [95% conf. | Interval] | |
| Age | 1.11 | 0.07 | 1.79 | 0.07 | 0.99 | 1.25 |
| Age of first live birth | 1.04 | 0.08 | 0.50 | 0.62 | 0.89 | 1.22 |
| Age of menarche | 1.33 | 0.38 | 0.99 | 0.32 | 0.76 | 2.34 |
| Current alcohol intake | 0.21 | 0.16 | − 2.06 | 0.04 | 0.05 | 0.93 |
| BMI | 0.95 | 0.07 | − 0.78 | 0.44 | 0.82 | 1.09 |
| Ever breast fed | 0.61 | 0.50 | − 0.60 | 0.55 | 0.12 | 3.06 |
| Family history | 2.37 | 1.91 | 1.07 | 0.28 | 0.49 | 11.51 |
| Post- vs pre-menopausal | 0.01 | 0.02 | − 2.60 | 0.01 | 0.00 | 0.32 |
| Smoking (py) | 0.89 | 0.08 | − 1.29 | 0.20 | 0.74 | 1.06 |
Fig. 1Tissue epigenetic age versus chronological age. DNA methylation age estimate based on 353 CpGs (y-axis) versus chronological age. All samples are normal breast tissue samples; normal tissue samples from cancer patients were obtained from mastectomy specimens > 3 cm from tumor margin. Samples (points) are colored by disease status of the donor: red = breast cancer and green = control. A linear regression line has been added. Age acceleration is defined as raw residual resulting from the regression model, i.e., the (signed) vertical distance to the line. Points above and below the line exhibit positive and negative epigenetic age acceleration, respectively. The high Pearson correlation coefficient r = 0.712, (p < 0.001) reflects the strong linear relationship between DNAmAge and chronological age at the time of breast sample collection
Univariate and multivariate analyses of factors affecting DNAmAge and age acceleration residuals
| Univariate analysis | Multivariate analysis | |||||
|---|---|---|---|---|---|---|
| Coef. | Std. err. | Coef. | Std. err. | |||
| DNAmAge | ||||||
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| BMI | 0.148 | 0.238 | 0.534 | 0.164 | 0.143 | 0.256 |
| Current alcohol use | − 2.833 | 2.904 | 0.332 | – | – | – |
| |
|
|
|
|
|
|
| Age at menarche | 0.322 | 0.959 | 0.738 | 0.948 | 0.49 | 0.057 |
| Age at first live birth | 0.03 | 0.251 | 0.905 | – | – | – |
| Count of live births | ||||||
| 1 | 2.292 | 5.494 | 0.678 | − 4.92 | 3.268 | 0.137 |
| 1+ | 14.395 | 2.887 | < 0.001 | − 1.536 | 2.138 | 0.475 |
| Breast fed | 5.286 | 2.83 | 0.065 | – | – | – |
| Post- vs pre-menopausal | 18.645 | 2.138 | < 0.001 | − 3.982 | 3.006 | 0.19 |
| Hispanic | − 5.924 | 5.018 | 0.241 | − 3.921 | 3.223 | 0.228 |
| Race | ||||||
| White | − 0.059 | 5.815 | 0.992 | 0.709 | 3.088 | 0.819 |
| African Americans | − 2.142 | 7.643 | 0.78 | 0.196 | 4.219 | 0.963 |
| Age Acc. Residuals | ||||||
| Age | 0.000 | 0.039 | 1 | 0.095 | 0.095 | 0.324 |
| |
|
|
|
|
|
|
| BMI | 0.117 | 0.11 | 0.288 | 0.164 | 0.143 | 0.256 |
| Current alcohol use | − 2.484 | 1.349 | 0.068 | – | – | – |
| |
|
|
|
|
|
|
| Age at menarche | 0.358 | 0.454 | 0.433 | 0.949 | 0.491 | 0.057 |
| Age at first live birth | − 0.169 | 0.153 | 0.274 | – | – | – |
| Count of live births | ||||||
| 1 | − 1.598 | 2.9 | 0.583 | − 4.92 | 3.268 | 0.137 |
| 1+ | 0.041 | 1.52 | 0.979 | − 1.536 | 2.138 | 0.475 |
| Breast fed | − 2.28 | 1.315 | 0.086 | – | – | – |
| Post vs pre-menopausal | − 1.351 | 1.335 | 0.314 | − 3.982 | 3.006 | 0.19 |
| Hispanic | 0.636 | 2.342 | 0.786 | − 3.921 | 3.223 | 0.228 |
| Race | ||||||
| White | − 1.063 | 2.685 | 0.693 | 0.709 | 3.088 | 0.819 |
| African Americans | 1.082 | 3.529 | 0.76 | 0.196 | 4.219 | 0.963 |
Variables in italics are those which reached statistical significance
Fig. 2Comparison of epigenetic variables between cases and controls. Bar plots depict mean epigenetic age and age acceleration (y-axis) versus disease status. a Mean methylation levels of each cohort. b DNAmAge of each cohort. c Age acceleration differences. d Age acceleration residuals. The cancer cohort exhibits a significant positive age acceleration (positive residual coefficient) correlation compared to controls
Fig. 3Correlation of tobacco use variables between cases and controls. DNA methylation age acceleration estimates (y-axis) are depicted for specific tobacco use variables, including total pack years, total number of years smoking, and cigarettes per day. a–c Analyses for these variables using the combination of both study cohorts. d–f The same specific variable analyses for the control cohort. g–i Analyses for the cancer cohort. There is a statistically significant positive correlation between the tobacco variables and the complete cohort, and in the control cases. Though a positive correlation is noted in breast cancer cases as well, it did not reach a statistical significance
Regression adjustment model with inverse showing average treatment effects (ATE) and potential-outcome mean (POmean)
| DNAmAge | Groups | Coef. | Bootstrap |
| [95% conf. interval] | ||
| Std. err. | |||||||
| ATE | Cancer vs control group | 3.98337 | 1.333459 | 2.99 | 0.003 | 1.369837 | 6.596902 |
| POmean | Control | 55.60416 | 1.642661 | 33.85 | 0 | 52.38461 | 58.82372 |
Treatment-effects estimation: Number of obs = 85; Estimator: IPW regression adjustment; Outcome model: linear; Treatment model: logit; Bootstrap Iterations: 500
Fig. 4Receiver operating characteristic for all epigenetic variables. Receiver operating characteristic (ROC) were plotted for breast cancer status as the reference variable and age, DNAmAge, mean methylation by sample, age acceleration difference, and age acceleration residuals as classification variables. DeLong method was used to calculate the standard errors, and binomial confidence intervals were calculated. The ROC curves were plotted based on the binomial fit models, and the AUC was calculated. The sensitivity and specificity of the most predictive epigenetic variable was then calculated based on the ROC curve