| Literature DB >> 32878103 |
Archana J McEligot1, Valerie Poynor2, Rishabh Sharma3, Anand Panangadan4.
Abstract
A multitude of dietary factors from dietary fat to macro and micronutrients intakes have been associated with breast cancer, yet data are still equivocal. Therefore, utilizing data from the large, multi-year, cross-sectional National Health and Nutrition Examination Survey (NHANES), we applied a novel, modern statistical shrinkage technique, logistic least absolute shrinkage and selection operator (LASSO) regression, to examine the association between dietary intakes in women, ≥50 years, with self-reported breast cancer (n = 286) compared with women without self-reported breast cancer (1144) from the 1999-2010 NHANES cycle. Logistic LASSO regression was used to examine the relationship between twenty-nine variables, including dietary variables from food, as well as well-established/known breast cancer risk factors, and to subsequently identify the most relevant variables associated with self-reported breast cancer. We observed that as the penalty factor (λ) increased in the logistic LASSO regression, well-established breast cancer risk factors, including age (β = 0.83) and parity (β = -0.05) remained in the model. For dietary macro and micronutrient intakes, only vitamin B12 (β = 0.07) was positively associated with self-reported breast cancer. Caffeine (β = -0.01) and alcohol (β = 0.03) use also continued to remain in the model. These data suggest that a diet high in vitamin B12, as well as alcohol use may be associated with self-reported breast cancer. Nonetheless, additional prospective studies should apply more recent statistical techniques to dietary data and cancer outcomes to replicate and confirm the present findings.Entities:
Keywords: LASSO; NHANES; breast cancer; diet
Mesh:
Substances:
Year: 2020 PMID: 32878103 PMCID: PMC7551912 DOI: 10.3390/nu12092652
Source DB: PubMed Journal: Nutrients ISSN: 2072-6643 Impact factor: 5.717
Figure 1Flowchart of study participants.
Figure 2Cross validation plot for the penalty term.
Descriptive and other characteristics in participants with and without self-reported breast cancer.
| Descriptive Variable | Women with | Women without | |
|---|---|---|---|
| Mean age (± SD) | 68.46, (0.74) | 63.19, (0.36) | <0.001 |
| Parity, mean (± SD) | 2.49, (0.17) | 2.70, (0.07) | 0.15 |
| Age at first menarche, | 12.62, (0.13) | 12.89, (0.06) | 0.06 |
| Ethnicity, | |||
| Non-Hispanic White | 203, (88%) | 595, (77%) | <0.001 |
| Non-Hispanic Black | 44, (7.2%) | 219, (10.9%) | |
| Hispanic | 32, (2.6%) | 294, (8.1%) | |
| Unknown/Other | 5, (2.3%) | 36, (4.5%) | |
| BMI (kg/m2) 1, | 28.89, (0.55) | 29.38, (0.33) | 0.43 |
1 BMI variable, n = 279 with breast cancer, n = 1116 without breast cancer.
Dietary macro- and micronutrient intakes in women with and without self-reported breast cancer.
| Descriptive | Women with Self-Reported Breast Cancer | Women without Self-Reported Breast Cancer | 95% CI | |
|---|---|---|---|---|
| Energy (Kcal) | 1638 (43.60) | 1648 (28.23) | (−109.59, 91.01) | 0.46 |
| Carbohydrate (g) | 205.38 (6.46) | 204.75 (3.77) | (−14.34, 15.57) | 0.36 |
| Carbohydrate, % energy | 50.43 (0.86) | 50.44 (0.48) | (−2.00, 1.98) | 0.80 |
| Protein (g) | 64.20 (2.38) | 65.45 (1.31) | (−6.46, 3.97) | 0.68 |
| Protein, % energy | 15.89 (0.30) | 16.10 (0.18) | (−0.944, 0.520) | 0.72 |
| Total Fat (g) | 61.86 (2.05) | 63.99 (1.47) | (−6.83, 2.64) | 0.50 |
| Fat, % energy | 33.57 (0.60) | 33.97 (0.39) | (−1.88, 1.06) | 0.78 |
| Cholesterol (mg) | 213.80 (10.56) | 226.50 (7.70) | (−40.46, 15.05) | 0.40 |
| Fiber (g) | 15.23 (0.72) | 14.85 (0.34) | (−1.25, 2.02) | 0.38 |
| Folate (μg) | 353.49 (13.81) | 347.07 (8.40) | (−26.49, 39.34) | 0.38 |
| Vitamin B12 (μg) | 5.02 (0.75) | 4.17 (0.15) | (−0.632, 2.34) | 0.08 |
| Vitamin B6 (mg) | 1.61 (0.07) | 1.60 (0.04) | (−0.16, 0.19) | 0.45 |
| Thiamin (mg) | 1.36 (0.07) | 1.39 (0.04) | (−0.18, 0.15) | 0.73 |
| Riboflavin (mg) | 1.89 (0.07) | 1.88 (0.04) | (−0.164, 0.16) | 0.45 |
| Calcium (mg) | 772.74 (27.24) | 780.44 (20.44) | (−67.42, 52.03) | 0.21 |
| Phosphorous (mg) | 1082 (35.87) | 1096 (20.13) | (−91.45, 63.47) | 0.49 |
| Magnesium (mg) | 253.27 (9.13) | 256.33 (4.79) | (−24.00, 17.89) | 0.65 |
| Iron (mg) | 13.24 (0.56) | 12.84 (0.29) | (−0.93, 1.72) | 0.25 |
| Vitamin A (IU) | 685.55 (75.15) | 648.52 (18.85) | (−116.40, 190.45) | 0.19 |
| Vitamin C (mg) | 87.68 (4.14) | 92.04 (4.81) | (−16.66, 7.94) | 0.28 |
| Vitamin E (mg) | 6.66 (0.40) | 6.67 (0.19) | (−0.93, 0.91) | 0.52 |
| Zinc (mg) | 9.76 (0.35) | 9.61 (0.24) | (−0.61, 0.92) | 0.24 |
| Sodium (mg) | 2665 (84.90) | 2768 (59.30) | (−289.68, 81.87) | 0.80 |
| Potassium (mg) | 2452 (61.65) | 2476 (39.86) | (−158.56, 109.05) | 0.38 |
| Caffeine (mg) | 154.56 (14.42) | 174.94 (11.62) | (−57.97, 17.21) | 0.38 |
| Alcohol (g) | 5.31 (1.01) | 3.17 (0.49) | (0.08, 4.21) 2 | 0.19 |
1 Mean (SD) macro- and micronutrient, and 95% CI data shown are on the raw data; p-value data shown are on log transformed data; 2 Log transformed 95% CI = (−0.20, 1.03).
The estimated coefficients for logistic least absolute shrinkage and selection operator (LASSO) regression between dietary data, and well-established breast cancer risk factors with self-reported breast cancer.
| Variables | Coefficients |
|---|---|
| Well-established Variables | |
| Age (years) | 0.83 (0.41) |
| Parity (# live births) | −0.05 (0.03) |
| Age at first menstrual cycle | 0 |
| Alcohol (g) | 0.03 (0.02) |
| Other Variables | |
| Caffeine (mg) | −0.01 (0.02) |
| Mexican/Hispanic | 0 |
| Non-Hispanic Black | 0 |
| Other | 0 |
| Dietary Variables | |
| Energy (Kcal) | 0 |
| Carbohydrate, % energy | 0 |
| Protein, % energy | 0 |
| Fat, % energy | 0 |
| Cholesterol (mg) | 0 |
| Fiber (g) | 0 |
| Folate (μg) | 0 |
| Vitamin B12 (μg) | 0.07 (0.05) |
| Vitamin B6 (mg) | 0 |
| Thiamin (Vitamin B1) (mg) | 0 |
| Riboflavin (Vitamin B2) (mg) | 0 |
| Calcium (mg) | 0 |
| Phosphorous (mg) | 0 |
| Magnesium (mg) | 0 |
| Iron (mg) | 0 |
| Vitamin A (RE) | 0 |
| Vitamin C (mg) | 0 |
| Vitamin E (mg) | 0 |
| Zinc (mg) | 0 |
| Sodium (mg) | 0 |
| Potassium (mg) | 0 |
Figure 3Plots for LASSO regression coefficients over different values of the penalty parameter. In (a), data shown are the sixteen variables that remained in the model the longest as the penalty term increased; in (b), data shown are the remaining variables in the model.