| Literature DB >> 29861441 |
S Pamela K Shiao1, James Grayson2, Amanda Lie3, Chong Ho Yu4.
Abstract
For personalized nutrition in preparation for precision healthcare, we examined the predictors of healthy eating, using the healthy eating index (HEI) and glycemic index (GI), in family-based multi-ethnic colorectal cancer (CRC) families. A total of 106 participants, 53 CRC cases and 53 family members from multi-ethnic families participated in the study. Machine learning validation procedures, including the ensemble method and generalized regression prediction, Elastic Net with Akaike's Information Criterion with correction and Leave-One-Out cross validation methods, were applied to validate the results for enhanced prediction and reproducibility. Models were compared based on HEI scales for the scores of 77 versus 80 as the status of healthy eating, predicted from individual dietary parameters and health outcomes. Gender and CRC status were interactive as additional predictors of HEI based on the HEI score of 77. Predictors of HEI 80 as the criterion score of a good diet included five significant dietary parameters (with intake amount): whole fruit (1 cup), milk or milk alternative such as soy drinks (6 oz), whole grain (1 oz), saturated fat (15 g), and oil and nuts (1 oz). Compared to the GI models, HEI models presented more accurate and fitted models. Milk or a milk alternative such as soy drink (6 oz) is the common significant parameter across HEI and GI predictive models. These results point to the importance of healthy eating, with the appropriate amount of healthy foods, as modifiable factors for cancer prevention.Entities:
Keywords: colorectal cancer; diverse ethnic groups; generalized regression elastic net; glycemic index; healthy eating
Mesh:
Year: 2018 PMID: 29861441 PMCID: PMC6024360 DOI: 10.3390/nu10060674
Source DB: PubMed Journal: Nutrients ISSN: 2072-6643 Impact factor: 5.717
Demographic characteristics of the sample.
| Parameters | Total ( | |
|---|---|---|
| Gender | Male | 39 (37%) |
| Female | 67 (63%) | |
| Age, Years | M ± SD | 54 ± 16 |
| Ethnicity | Asian | 40 (38%) |
| Caucasian | 34 (32%) | |
| Hispanic | 23 (22%) | |
| African American | 9 (9%) | |
| BMI status | Obese | 26 (25%) |
| Alcohol drinker | Yes | 57 (54%) |
| Smoker | Yes | 9 (9%) |
Note: BMI: body mass index.
Healthy Eating Index and parameters for the sample (N = 106).
| Parameters (Amount, Maximum Score) | Intake M ± SD | Score M ± SD | Maximum Score |
|---|---|---|---|
| Calorie (per day) | 1600 ± 850 | -- | -- |
| Total Fruit (≥0.8 cup, 5 points) | 1.6 ± 1.5 | 4.1 ± 1.5 | 69 (65%) |
| Whole Fruit (≥0.4 cup, 5 points) | 1.2 ± 1.1 | 4.2 ± 1.5 | 78 (74%) |
| Vegetables (≥1.1 cup, 5 points) | 1.5 ± 1.2 | 4.1 ± 1.4 | 62 (59%) |
| Dark greens (≥0.4 cup, 5 points) | 0.9 ± 0.7 | 4.4 ± 1.2 | 77 (73%) |
| Total Grain (≥3 oz, 5 points) | 4.6 ± 3.0 | 4.3 ± 1.2 | 66 (62%) |
| Whole Grain (≥1.5 oz, 5 points) | 1.7 ± 1.7 | 3.4 ± 1.7 | 45 (43%) |
| Dairy (≥1.3 cup, 10 points) | 1.4 ± 3.2 | 5.4 ± 3.6 | 24 (23%) |
| Protein (≥2.5 oz, 10 points) | 5.8 ± 4.2 | 9.2 ± 2.0 | 86 (54%) |
| Oil and nuts (≥12 g, 10 points) | 36 ± 22 | 9.9 ± 0.4 | 103 (97%) |
| Saturated Fat (g, ≤8% energy) | 18.5 ± 11.4 | 7.0 ± 2.7 | 18 (17%) |
| Sodium (≤1.1 g, 10 points) | 3.2 ± 1.9 | 2.1 ± 3.5 | 6 (6%) |
| Empty Calorie (≤19% energy, 20 points) | 350 ± 230 | 17.6 ± 3.4 | 47 (44%) |
| Healthy Eating Index score (>80, good) | 76 ± 9 | -- | >80: 41 (39%) |
| ≥77 (median distribution) | ≥77: 54 (51%) | ||
| Glycemic Load | 96 ± 59 | -- | -- |
| Glycemic Index (≤55, low and good) | 54 ± 4.2 | -- | ≤55: 66 (62%) |
| ≤53.8 (median distribution) | ≤53.8: 53 (50%) |
Note: M: mean, SD: standard deviation, oz: ounce, g: gram.
Recommended dietary daily intake for the sample (N = 106).
| Parameters, Unit, RDI | M ± SD | Intake % | |
|---|---|---|---|
| Carbohydrates, g, 45–65% calorie | 200 ± 110 | ≥45% | 75 (71%) |
| Protein, g, 10–35% calorie | 77 ± 44 | ≥20% | 38 (36%) |
| Total Fat, g, 20–35% calorie | 370 ± 220 | <35% | 69 (65%) |
| Saturated Fat, g, <10% calorie | 19 ± 11 | <10% | 51 (48%) |
| Cholesterol, <300 mg | 260 ± 170 | <100% | 78 (74%) |
| Sodium, <2300 mg | 3000 ± 1700 | <100% | 40 (38%) |
| Fiber, ≥25 g | 19 ± 10 | ≥100% | 16 (15%) |
| Total Folate, 400 μg | 370 ± 220 | ≥100% | 34 (32%) |
| Vitamin B1 (Thiamine), 1.1 mg | 1.4 ± 0.8 | ≥100% | 65 (61%) |
| Vitamin B2 (Riboflavin), 1.1 mg | 1.9 ± 1.3 | ≥100% | 78 (74%) |
| Vitamin B6, 1.3 mg | 1.8 ± 1.0 | ≥100% | 68 (64%) |
| Vitamin B12, 2.4 μg | 6.1 ± 8.2 | <150% | 44 (42%) |
| Niacin, 14 mg | 21 ± 12 | ≥100% | 72 (68%) |
| Calcium, 1000 mg | 840 ± 620 | ≥75% | 46 (43%) |
| Magnesium, 320 mg | 300 ± 160 | ≥75% | 52 (49%) |
| Iron, 8 mg | 13 ± 7.6 | ≥100% | 44 (42%) |
| Zinc, 8 mg | 11 ± 6.9 | ≥100% | 53 (50%) |
| Methionine, 13 mg/Kg | 1.8 ± 1.0 | <150% | 45 (43%) |
Note: RDI: recommended daily intake, g: gram; mg: milligram, μg: microgram, Kg: Kilogram.
Predictors of Healthy Eating Index (80): Baseline logistic regression and generalized regression Elastic Net models.
| Logistic Regression with Validation | Generalized Regression Elastic Net | |||||
|---|---|---|---|---|---|---|
| AICc Validation | Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 2.64 | 0.002 | 1.61 | 0.001 | 1.58 | 0.002 |
| Whole Fruit 1 cup | −2.51 | 0.002 | −1.90 | 0.0004 | −1.86 | 0.0006 |
| Milk Soy 6 oz | −2.62 | 0.002 | −1.86 | 0.0002 | −1.84 | 0.0002 |
| Whole Grain 1 oz | −2.44 | 0.007 | −2.28 | 0.001 | −2.26 | 0.002 |
| Sat Fat 15 g | 3.81 | 0.008 | 2.31 | 0.010 | 2.55 | 0.010 |
| Oil Nut 1 oz | −2.57 | 0.02 | −1.29 | 0.12 | −1.56 | 0.09 |
| Misclassification Rate | 0.30 | 0.23 | 0.23 | |||
| AICc | 58 | 105 | n/a | |||
| Area under the curve | 0.83 | 0.87 | 0.87 | |||
Note. AICc: Akaike’s information criterion with corrections.
Figure 1Predictors of the Healthy Eating Index (80): Area under the receiver operating characteristic curve (AUC) for logistic regression (left), Elastic Net with Akaike’s information criteria with correction (AICc) validation (middle) and Leave-One-Out validation models (right).
Predictors of Healthy Eating Index (77): Baseline logistic regression and generalized regression Elastic Net models.
| Logistic Regression with Validation | Generalized Regression Elastic Net | |||||
|---|---|---|---|---|---|---|
| AICc Validation | Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 1.11 | 0.18 | 0.27 | 0.65 | 0.31 | 0.61 |
| Milk Soy 6 oz | −2.23 | 0.0008 | −1.82 | 0.0003 | −1.71 | 0.0006 |
| Whole Grain 1 oz | −1.23 | 0.10 | −1.30 | 0.02 | −1.37 | 0.01 |
| Empty Calories 300 | 1.21 | 0.11 | 1.21 | 0.03 | 1.10 | 0.048 |
| Fiber 19 g | 1.01 | 0.21 | 1.36 | 0.03 | 1.38 | 0.03 |
| Gender | −2.04 | 0.11 | −1.83 | 0.06 | −2.60 | 0.003 |
| GroupCa * Gender | 1.88 | 0.23 | 1.63 | 0.17 | 2.73 | 0.03 |
| GroupCa | −0.54 | 0.47 | 0.37 | 0.55 | 0.29 | 0.63 |
| Misclassification Rate | 0.27 | 0.25 | 0.23 | |||
| AICc | 63 | 122 | n/a | |||
| Area under the curve | 0.80 | 0.83 | 0.84 | |||
Note. AICc: Akaike’s information criterion with corrections * interaction.
Figure 2Predictors of the Healthy Eating Index (77): Area under the receiver operating characteristic curve (AUC) for baseline logistic regression (left), Elastic Net with Akaike’s information criteria with correction (AICc) validation (middle) and Leave-One-Out validation models (right).
Figure 3Prediction profiler (a) for significant predictors of health eating (score 77) and (b) interaction of gender with cancer/control group (non-parallel and crossing lines) when compared to another parameter (dairy or soy drink intake) without interaction (parallel lines).
Predictors of the Glycemic Index (55): Baseline logistic regression and generalized regression Elastic Net models.
| Logistic Regression l with Validation | Generalized Regression Elastic Net | |||||
|---|---|---|---|---|---|---|
| AICc Validation | Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 0.73 | 0.04 | 1.07 | 0.0005 | 1.01 | 0.0009 |
| Milk Soy 6 oz | −0.86 | 0.09 | −1.07 | 0.01 | −1.01 | 0.02 |
| Misclassification Rate | 0.35 | 0.37 | 0.38 | |||
| AICc | 49 | 139 | n/a | |||
| Area Under Curve | 0.67 | 0.62 | 0.63 | |||
Note. AICc: Akaike’s information criterion with corrections.
Figure 4Predictors of the Glycemic Index (55): Area under the receiver operating characteristic curve (AUC) for baseline logistic regression (left), Elastic Net with Akaike’s information criteria with correction (AICc) validation (middle) and Leave-One-Out validation models (right).
Predictors of Glycemic Index (53.8): Baseline logistic regression and generalized regression Elastic Net models.
| Logistic Regression with Validation | Generalized Regression Elastic Net | |||||
|---|---|---|---|---|---|---|
| AICc Validation | Leave-One-Out Validation | |||||
| Parameters | Estimate | Estimate | Estimate | |||
| (Intercept) | 0.51 | 0.34 | 0.79 | 0.04 | 0.87 | 0.03 |
| Milk Soy 6 oz | −1.36 | 0.02 | −1.29 | 0.003 | −1.41 | 0.002 |
| Empty Calories 300 | 1.37 | 0.02 | 0.62 | 0.16 | 0.74 | 0.10 |
| Dark Green 6 oz | −1.18 | 0.04 | −0.94 | 0.03 | −1.05 | 0.02 |
| Misclassification Rate | 0.38 | 0.34 | 0.33 | |||
| AICc | 62 | 141 | n/a | |||
| Area Under Curve | 0.58 | 0.70 | 0.72 | |||
Note. AICc: Akaike’s information criterion with corrections.
Figure 5Predictors of the Glycemic Index (53.8): Area under the receiver operating characteristic curve (AUC) for baseline logistic regression (left), Elastic Net with Akaike’s information criteria with correction (AICc) validation (middle) and Leave-One-Out validation models (right).