| Literature DB >> 34411412 |
Jacob K Kresovich1, Zongli Xu1, Katie M O'Brien1, Min Shi2, Clarice R Weinberg2, Dale P Sandler1, Jack A Taylor1,3.
Abstract
Although blood DNA methylation (DNAm) profiles are reported to be associated with breast cancer incidence, they have not been widely used in breast cancer risk assessment. Among a breast cancer case-cohort of 2774 women (1551 cases) in the Sister Study, we used candidate CpGs and DNAm estimators of physiologic characteristics to derive a methylation-based breast cancer risk score, mBCRS. Overall, 19 CpGs and five DNAm estimators were selected using elastic net regularization to comprise mBCRS. In a test set, higher mBCRS was positively associated with breast cancer incidence, showing similar strength to the polygenic risk score (PRS) based on 313 single nucleotide polymorphisms (313 SNPs). Area under the curve for breast cancer prediction was 0.60 for self-reported risk factors (RFs), 0.63 for PRS, and 0.63 for mBCRS. Adding mBCRS to PRS and RFs improved breast cancer prediction from 0.66 to 0.71. mBCRS findings were replicated in a nested case-control study within the EPIC-Italy cohort. These results suggest that mBCRS, a risk score derived using blood DNAm, can be used to enhance breast cancer prediction. Published 2021. This article is a U.S. Government work and is in the public domain in the USA. Molecular Oncology published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.Entities:
Keywords: DNA methylation; breast cancer; breast cancer prediction; risk score
Mesh:
Year: 2021 PMID: 34411412 PMCID: PMC8732352 DOI: 10.1002/1878-0261.13087
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 7.449
Sister Study methylation case‐cohort sample characteristics at study enrollment overall and by training/testing set assignment.
| Overall | Training set | Testing set |
| |
|---|---|---|---|---|
| Total participants, | 2774 | 1941 | 833 | |
| Incident breast cancers, | 1551 (100) | 1090 (100) | 461 (100) | |
| Invasive | 1218 (79) | 877 (80) | 341 (74) | 0.01 |
| DCIS | 333 (21) | 213 (20) | 120 (26) | |
| Age, mean years (SD) | 57.0 (9) | 57.3 (9) | 56.5 (9) | 0.03 |
| Body mass index, mean kg/m2 (SD) | 27.6 (6) | 27.6 (6) | 27.7 (6) | 0.53 |
| Physical activity, mean METs/week (SD) | 50.8 (31) | 50.6 (31) | 51.3 (32) | 0.60 |
| Alcohol consumption, mean drinks/week (SD) | 3.1 (5) | 3.1 (5) | 3.2 (5) | 0.60 |
| Age at menarche, mean years (SD) | 12.6 (1) | 12.6 (1) | 12.6 (2) | 0.58 |
| Live births, mean total (SD) | 1.9 (1) | 2.0 (1) | 1.9 (1) | 0.13 |
| Age at first birth, mean years (SD) | 24.9 (5) | 24.9 (5) | 24.9 (5) | 0.86 |
| Breastfeed duration, mean weeks (SD) | 35.2 (56) | 35.4 (54) | 34.6 (61) | 0.73 |
| Postmenopausal hormone use | 5.9 (7) | 5.9 (7) | 5.6 (7) | 0.98 |
| Number of affected sisters, count (SD) | 1.1 (0.4) | 1.1 (0.4) | 1.1 (0.4) | 0.51 |
| Youngest proband sister age at diagnosis, mean years (SD) | 48.9 (10) | 49.0 (10) | 48.6 (10) | 0.27 |
| Polygenic risk score, mean (SD) | −0.17 (0.6) | −0.16 (0.7) | −0.19 (0.6) | 0.18 |
| Educational attainment, | ||||
| Less than HS/HS degree | 430 (16) | 311 (16) | 119 (14) | 0.40 |
| Attended college/college degree | 1641 (59) | 1134 (58) | 507 (61) | |
| Advanced degree | 703 (25) | 496 (26) | 207 (25) | |
| Menopause status, | ||||
| Premenopausal | 826 (30) | 574 (30) | 252 (30) | 0.73 |
| Postmenopausal | 1947 (70) | 1366 (70) | 581 (70) | |
| Previous number of breast biopsies, | ||||
| Zero | 1875 (68) | 1312 (68) | 563 (68) | 0.25 |
| One | 448 (17) | 330 (17) | 158 (19) | |
| Two or more | 411 (15) | 299 (15) | 112 (13) | |
METs, metabolic equivalent tasks. Missing covariates: body mass index, 2; sister age at diagnosis, 3; polygenic risk score, 102; menopause status, 1; postmenopausal hormone use, 6. P‐diff calculated using the t‐tests for continuous variables and χ2 tests for categorical variables.
Among postmenopausal women (n = 1947).
Fig. 1mBCRS correlations with 313 SNP PRS and age and mBCRS distributions in the Sister Study testing set. Among women sampled as part of the random subcohort and selected into the testing set (n = 375), scatterplot and fit line between mBCRS values and the 313 SNP PRS (Pearson’s correlation: 0.03, top left panel); and scatterplot and fit line between mBCRS and age at blood draw (Pearson’s correlation: 0.09, top right panel). The distribution of mBCRS on the original scale ranged from 44.28 to 45.96, with a mean of 45.20 (SD = 0.19); women who remained cancer‐free had a mean of 45.14 (SD = 0.17), those who developed breast cancer had a mean of 45.24 (SD = 0.20) (bottom left panel). The standardized residuals for mBCRS ranged from −5.30 to 5.20, with a mean of 0.30 (SD = 1.19); women who remained cancer‐free had a mean of −0.03 (SD = 0.99), and those who developed breast cancer had a mean of 0.56 (SD = 1.27) (bottom right panel).
Test set‐based hazard ratios per covariate‐adjusted standard deviation (SD) from univariable and multivariable analyses of mBCRS and polygenic risk score (n = 794).
| Full follow‐up | Excluding first 2 years | |||||
|---|---|---|---|---|---|---|
| HR (95% CI) |
|
| HR (95% CI) |
|
| |
| Univariate models | ||||||
| mBCRS | 1.78 (1.51, 2.11) | 6.70 | 2.1 × 10−11 | 1.83 (1.52, 2.19) | 6.45 | 1.1 × 10−10 |
| PRS | 1.57 (1.35, 1.83) | 5.84 | 5.1 × 10−9 | 1.62 (1.37, 1.92) | 5.69 | 1.3 × 10−8 |
| Multivariable model | ||||||
| mBCRS | 1.74 (1.46, 2.06) | 6.23 | 4.6 × 10−10 | 1.76 (1.47, 2.12) | 6.03 | 1.6 × 10−9 |
| PRS | 1.55 (1.31, 1.83) | 5.14 | 2.8 × 10−7 | 1.59 (1.33, 1.92) | 4.98 | 6.3 × 10−7 |
Sample sizes: full follow‐up, n = 794 with 443 events; excluding first 2 years, n = 673 with 326 events.
Full follow‐up length: mean = 5.2 years, SD = 2.6.
Per covariate‐adjusted standard deviation increase in mBCRS score (or PRS), accounting for: age at blood draw, menopause status, body mass index, interaction term for BMI and menopause, physical activity, alcohol consumption, age at first birth (among parous), total number of births, age at menarche, menopause age (among postmenopausal), smoking pack‐years, previous number of breast biopsies, family history of breast cancer (number of affected sisters, youngest age of sister’s diagnosis), educational attainment, and durations of postmenopausal hormone use and breastfeeding, and standardized to the distribution of the noncases.
Results displayed are from two separate models including either mBCRS or PRS.
Results displayed are for both mBCRS and PRS that were included as covariates in a single model.
Test set‐based OPERA (95% CI) estimated odds ratios per covariate‐adjusted standard deviation from univariable and multivariable analyses of mBCRS, polygenic risk score, and age (n = 794).
| Univariable | Multivariable | ||||||
|---|---|---|---|---|---|---|---|
| mBCRS only | PRS only | Age only | mBCRS + PRS | mBCRS + age | PRS + age | All three combined | |
| mBCRS | 1.58 (1.38, 1.81) | – | – | 1.58 (1.37, 1.81) | 1.56 (1.36, 1.79) | – | 1.56 (1.35, 1.79) |
| Polygenic risk score | – | 1.58 (1.36, 1.83) | – | 1.57 (1.35, 1.84) | – | 1.61 (1.38, 1.87) | 1.60 (1.37, 1.87) |
| Age at blood draw | – | – | 1.35 (1.16, 1.57) | – | 1.31 (1.12, 1.53) | 1.38 (1.19, 1.62) | 1.35 (1.15, 1.58) |
| Log likelihood (null = −545.0) | −520.1 | −525.8 | −537.4 | −502.4 | −514.1 | −517.1 | −495.5 |
| LR χ2 DF | 1 | 1 | 1 | 2 | 2 | 2 | 3 |
| χ | 49.9 | 38.5 | 15.3 | 85.3 | 61.8 | 55.9 | 99.0 |
Breast cancer risk factors include the following: age at blood draw, menopause status, body mass index, interaction term for BMI and menopause, physical activity, alcohol consumption, age at first birth (among parous), total number of births, age at menarche, menopause age (among postmenopausal), smoking pack‐years, previous number of breast biopsies, family history of breast cancer (number of affected sisters, youngest age of sister’s diagnosis), educational attainment, and durations of postmenopausal hormone use and breastfeeding.
Residuals adjusted for all breast cancer risk factors, except for age at blood draw.
LR χ2 DF, likelihood ratio χ2 statistic degrees of freedom.
Fig. 2Predictive capability of breast cancer risk markers in Sister Study testing set. Predictive capability of the mBCRS, PRS, and questionnaire‐based breast cancer risk factors (RFs) using receiver operating characteristics analysis; predictive capability reported as AUC. Questionnaire‐based risk factors include the following: age at blood draw, menopause status, body mass index, physical activity, alcohol consumption, age at first birth (among parous), total number of births, age at menarche, menopause age (among postmenopausal), smoking pack‐years, previous number of breast biopsies, family history of breast cancer (number of affected sisters, youngest proband sister age at diagnosis), educational attainment, and durations of postmenopausal hormone use and breastfeeding.