| Literature DB >> 30018733 |
Mildred C Gonzales1, James Grayson2, Amanda Lie3, Chong Ho Yu4, Shyang-Yun Pamela K Shiao5.
Abstract
Breast cancer (BC) is the most common cancer in women worldwide and second leading cause of cancer-related death. Understanding gene-environment interactions could play a critical role for next stage of BC prevention efforts. Hence, the purpose of this study was to examine the key gene-environmental factors affecting the risks of BC in a diverse sample. Five genes in one-carbon metabolism pathway including MTHFR 677, MTHFR 1298, MTR 2756, MTRR 66, and DHFR 19bp together with demographics, lifestyle, and dietary intake factors were examined in association with BC risks. A total of 80 participants (40 BC cases and 40 family/friend controls) in southern California were interviewed and provided salivary samples for genotyping. We presented the first study utilizing both conventional and new analytics including ensemble method and predictive modeling based on smallest errors to predict BC risks. Predictive modeling of Generalized Regression Elastic Net Leave-One-Out demonstrated alcohol use (p = 0.0126) and age (p < 0.0001) as significant predictors; and significant interactions were noted between body mass index (BMI) and alcohol use (p = 0.0027), and between BMI and MTR 2756 polymorphisms (p = 0.0090). Our findings identified the modifiable lifestyle factors in gene-environment interactions that are valuable for BC prevention.Entities:
Keywords: breast cancer; gene-environment interaction; predictors
Year: 2018 PMID: 30018733 PMCID: PMC6044380 DOI: 10.18632/oncotarget.25520
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Comparisons on demographic and lifestyle factors between control and breast cancer groups
| Controls (N = 40) | Cases (N = 40) | ||
|---|---|---|---|
| Age in years ( | 44.8±15.89 | 61.7±8.87 | 0.001 |
| Ethnicity | |||
| Asian | 16 (40) | 16 (40) | 1.000 |
| Caucasian | 11 (27.5) | 11 (27.5) | |
| Hispanic | 10 (25) | 10 (25) | |
| African American | 3 (7.5) | 3 (7.5) | |
| BMI status | |||
| WNL | 19 (47.5) | 20 (50) | 0.8230 |
| Overweight and Obese | 21 (52.5) | 20 (50) | |
| Alcohol drinker | |||
| No | 20 (50) | 23 (57.5) | 0.5011 |
| Yes | 20 (50) | 17 (42.5) | |
| Smoking | |||
| No | 40 (100) | 38 (95) | 0.1521 |
| Yes | 0 (100) | 2(5) | |
Note: WNL: within normal limit (18.5-24.9).
Comparisons on gene polymorphisms between control and breast cancer groups
| Controls (N = 40) | Cases (N = 40) | ||
|---|---|---|---|
| 0 (CC) | 24 (60) | 25 (62.5) | 0.8185 |
| 1 (CT) | 11 (27.5) | 9 (22.5) | |
| 2 (TT) | 5 (12.5) | 6 (15) | |
| 0 (AA) | 22 (55) | 23 (57.5) | |
| 1 (AC) | 17 (42.5) | 13 (32.5) | |
| 2 (CC) | 1 (2.5) | 4 (10) | 0.8217 |
| 0% | 12 (30) | 10 (25) | 0.4925 |
| 15% | 11 (27.5) | 11 (27.5) | |
| 30% | 1 (2.5) | 4 (10) | |
| 35% | 5 (12.5) | 7 (17.5) | |
| 50% | 6 (15) | 2 (5) | |
| 70% | 5 (12.5) | 6 (15) | |
| 5.5 ± 24.28 | 26.25 ± 23.47 | ||
| ≥ 50% | 11 (27.5) | 8 (20) | |
| 0 (AA) | 30 (75) | 26 (65) | 0.3291 |
| 1 (AG) | 9 (22.5) | 11 (27.5) | |
| 2 (GG) | 1(2.5) | 3 (7.5) | |
| 0 (AA) | 15 (37.5) | 19 (47.5) | 0.3656 |
| 1 (AG) | 20 (50) | 16 (40) | |
| 2 (GG) | 5 (12.5) | 5 (12.5) | |
| 0 (Ins/Ins) | 11 (27.5) | 7 (17.5) | 0.2842 |
| 1 (Ins/Del) | 20 (50) | 25 (62.5) | |
| 2 (Del/Del) | 9 (22.5) | 8 (20) | |
| Total mutations (0-10) | |||
| ≥ 3 | 15 (18.75) | 16 (20) | 0.8185 |
| 2.97 ± 1.53 | 3.12 ± 1.34 | 0.6370 | |
0=Wild type, 1=heterozygote, 2=homozygote
Distribution of gene polymorphisms per control and breast cancer groups across race-ethnic groups
| Genotypes | Controls | Cases | ||||||
|---|---|---|---|---|---|---|---|---|
| 0 (%) | 1 (%) | 2 (%) | 0 (%) | 1 (%) | 2 (%) | |||
| CC | CT | TT | CC | CT | TT | |||
| Total | 24 (60) | 11 (27.5) | 5 (12.5) | NS | 25 (62.5) | 9 (22.5) | 6 (15) | 0.0081 |
| Asian | 14 (87.5) | 2 (12.5) | 0 (0) | NS | 13 (81.25) | 3 (18.75) | 0 (0) | NS |
| White | 6 (54.55) | 3 (27.27) | 2 (18.18) | NS | 6 (54.55) | 3 (27.27) | 2 (18.18) | NS |
| Hispanic | 2 (20) | 6 (60) | 2 (20) | NS | 3 (30) | 3 (30) | 4 (40) | NS |
| Black | 2 (66.67) | 0 (0) | 1 (33.33) | NS | 3 (100) | 0 (0) | 0 (0) | -- |
| AA | AC | CC | AA | AC | CC | |||
| Total | 22 (55) | 17 (42.5) | 1 (2.5) | NS | 23 (57.5) | 13 (32.5) | 4 (10) | NS |
| Asian | 7 (43.75) | 8 (50) | 1 (6.25) | NS | 6 (37.5) | 7 (43.75) | 3 (18.75) | NS |
| White | 6 (54.55) | 5 (45.45) | 0 (0) | NS | 6 (54.55) | 4 (36.36) | 1 (9.09) | NS |
| Hispanic | 7 (70) | 3 (30) | 0 (0) | NS | 9 (90) | 1 (10) | 0 (0) | NS |
| Black | 2 (66.67) | 1 (33.33) | 0 (0) | NS | 2 (66.67) | 1 (33.33) | 0 (0) | NS |
| AA | AG | GG | AA | AG | GG | |||
| Total | 30 (75) | 9 (22.5) | 1 (2.5) | NS | 26 (65) | 11 (27.5) | 3 (7.5) | NS |
| Asian | 13 (81.25) | 3 (18.75) | 0 (0) | NS | 11 (68.75) | 3 (18.75) | 2 (12.5) | NS |
| White | 6 (54.55) | 4 (36.36) | 1 (9.09) | NS | 7 (63.64) | 3 (27.27) | 1 (9.09) | NS |
| Hispanic | 10 (100) | 0 (0) | 0 (0) | -- | 9 (90) | 1 (10) | 0 (0) | NS |
| Black | 1 (33.33) | 2 (66.67) | 0 (0) | NS | 1 (33.33) | 2 (66.67) | 0 (0) | NS |
| AA | AG | GG | AA | AG | GG | |||
| Total | 15 (37.5) | 20 (50) | 5 (12.5) | NS | 19 (47.5) | 16 (40) | 5 (12.5) | NS |
| Asian | 9 (56.25) | 7 (43.75) | 0 (0) | NS | 7 (43.75) | 9 (56.25) | 0 (0) | NS |
| White | 3 (27.27) | 5 (45.45) | 3 (27.27) | NS | 2 (18.18) | 6 (54.55) | 3 (27.27) | NS |
| Hispanic | 3 (30) | 5 (50) | 2 (20) | NS | 7 (70) | 1 (10) | 2 (20) | NS |
| Black | 0 (0) | 3 (100) | 0 (0) | -- | 3 (100) | 0 (0) | 0 (0) | NS |
| II | ID | DD | II | ID | DD | |||
| Total | 9 (22.5) | 20 (50) | 11 (27.5) | .0104 | 8 (20) | 25 (62.5) | 7 (17.5) | 0.0016 |
| Asian | 4 (25) | 10 (62.5) | 2 (12.5) | NS | 3 (18.75) | 9 (56.25) | 4 (25) | NS |
| White | 4 (36.36) | 4 (36.36) | 3 (27.27) | NS | 3 (27.27) | 7 (63.64) | 1 (9.09) | NS |
| Hispanic | 2 (20) | 6 (60) | 2 (20) | NS | 1 (10) | 6 (60) | 3 (30) | NS |
| Black | 1 (33.33) | 0 (0) | 2 (66.67) | NS | 0 (0) | 3 (100) | 0 (0) | -- |
Note: HWE: Hardy-Weinberg Equilibrium, NS: Not significant, --: cannot be calculated; HWE Calculator: https://wpcalc.com/en/equilibrium-hardy-weinberg/
0=Wild type, 1=heterozygote, 2=homozygote
Selected predictors of breast cancer for gene-environment interactions
Figure 1Genes in prediction of breast cancer: (A) per single gene profiler, (B) examples on interaction profiles of genes and breast cancer.
Figure 2Gene-environment Interaction: (A) prediction profiler, (B) examples on interaction profiles.
Baseline logistic regression model and generalized regression Elastic Net model on the predictors of breast cancer from gene-environment interactions
| Logistic Regression Original Model with Validation | Generalized Regression Elastic | |||||
|---|---|---|---|---|---|---|
| With AICc Validation | With Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 0.0025 | 0.9986 | -0.2270 | 0.8445 | -0.5199 | 0.5019 |
| BMI * Alcohol | 2.5212 | 0.1176 | 2.8496 | 0.0119 | 2.8879 | 0.0027 |
| BMI *
| -2.3841 | 0.1659 | -1.9493 | 0.1314 | -2.4105 | 0.0090 |
| Alcohol | -1.9568 | 0.1834 | -2.0448 | 0.0306 | -2.1418 | 0.0126 |
| Saturated Fat | 0.6984 | 0.2954 | 0.9178 | 0.0868 | 0.9299 | 0.0868 |
| 1.6116 | 0.3091 | 1.1838 | 0.3044 | 1.4942 | 0.1146 | |
| 1.1764 | 0.3459 | 1.9493 | 0.0659 | 1.2469 | 0.2189 | |
| 0.0372 | 0.9675 | -0.7873 | 0.3131 | -0.2323 | 0.7576 | |
| -0.4192 | 0.6226 | -0.3404 | 0.6572 | -0.0551 | 0.9438 | |
| BMI | -0.3813 | 0.7847 | -0.2402 | 0.8473 | 0 | 1.0000 |
| Misclassification Rate | 0.3000 | 0.3125 | 0.2785 | |||
| AICc | 70.35 | 117.96 | n/a | |||
| Area under the curve | 0.7240 | 0.7469 | 0.7532 | |||
Note. AICc: Akaike’s information criterion with correction.
Figure 3Receiver operating characteristic curve and area under the curve (AUC) for baseline logistic regression model (left panel), Elastic Net with Akaike’s information criteria with correction validation model (middle) and Leave-One-Out validation model (right panel) on the predictors of breast cancer from gene-environment interactions
Baseline logistic regression model, generalized regression Elastic Net model (with AICc and Leave-One-Out-Validation) on the predictors of breast cancer from gene environment interactions including age as a factor
| Logistic Regression Original Model with Validation | Generalized Regression Elastic Net Model | |||||
|---|---|---|---|---|---|---|
| With AICc Validation | With Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 1.7735 | 0.2386 | 1.2899 | 0.1972 | 1.9324 | 0.0027 |
| Age | -2.8420 | 0.0001 | -2.2898 | <0.0001 | -2.5734 | <0.0001 |
| BMI*Alcohol | 2.5349 | 0.1595 | 1.4491 | 0.2790 | 2.1891 | 0.0152 |
| Alcohol | -2.2623 | 0.1675 | -1.0589 | 0.3814 | -1.8443 | 0.0461 |
| BMI* | -2.3623 | 0.2526 | -0.8735 | 0.1162 | -1.1353 | 0.0581 |
| 1.3848 | 0.4603 | 0 | 1.0000 | 0 | 1.0000 | |
| BMI | 0.1668 | 0.9215 | 0.2466 | 0.8302 | 0 | 1.0000 |
| Misclassification Rate | 0.3000 | 0.2375 | 0.2278 | |||
| AICc | 50.10 | 94.79 | n/a | |||
| Area under the curve | 0.7656 | 0.8313 | 0.8455 | |||
Note. AICc: Akaike’s information criterion with corrections.
Figure 4Receiver operating characteristic curve and area under the curve (AUC) for baseline logistic regression model (left panel), Elastic Net with Akaike’s information criteria with correction validation model (middle) and Leave-One-Out validation model (right panel) on the predictors of breast cancer from gene-environment interactions including age as a factor