| Literature DB >> 32331378 |
Larissa C Hunt1, Hassan S Dashti2, Queenie Chan3, Rachel Gibson4, Céline Vetter1.
Abstract
We used data-driven approaches to identify independent diet exposures among 45 candidate variables, for which we then probed cross-sectional associations with cardiometabolic risk (CMR). We derived average daily caloric intake and macronutrient composition, daily meal frequencies, and irregularity of energy and macronutrient intake from 7-day food diaries in the Airwave Health Monitoring Study participants (N = 8090). We used K-means and hierarchical clustering to identify non-redundant diet exposures with representative exposures for each cluster chosen by silhouette value. We then used multi-variable adjusted logistic regression to estimate prevalence ratios (PR) and 95% confidence intervals (95%CI) for CMR (≥3 criteria: dyslipidemia, hypertension, central adiposity, inflammation and impaired glucose control) across diet exposure quartiles. We identified four clusters: i) fat intake, ii) carbohydrate intake, iii) protein intake and intake regularity, and iv) meal frequencies and energy intake. Of these clusters, higher carbohydrate intake was associated with lower likelihood of CMR (PR = 0.89, 95%CI = 0.81-0.98; ptrend = 0.02), as was higher fiber intake (PR = 0.76, 95%CI = 0.68-0.85; ptrend < 0.001). Higher meal frequency was also associated with lower likelihood of CMR (PR = 0.76, 95%CI = 0.68-0.85; ptrend < 0.001). Our results highlight a novel, data-driven approach to select non-redundant, minimally collinear, primary exposures across a host of potentially relevant exposures (including diet composition, temporal distribution, and regularity), as often encountered in nutritional epidemiology.Entities:
Keywords: Airwave Health Monitoring Study; body mass index; cluster; diet patterns; food intake regularity; meal frequency; nutrient intake
Year: 2020 PMID: 32331378 PMCID: PMC7230946 DOI: 10.3390/nu12041170
Source DB: PubMed Journal: Nutrients ISSN: 2072-6643 Impact factor: 5.717
Baseline characteristics stratified by sex in the Airwave Health Monitoring Study cohort (N = 8090). Values are presented as mean (standard deviation) or absolute counts and percentages.
| Overall | Male | Female | ||
|---|---|---|---|---|
| Age, years | 40.8 (9.1) | 41.9 (8.8) | 39.0 (9.4) | |
| Body mass index, kg/m2 | 27.0 (4.1) | 27.7 (3.6) | 25.7 (4.6) | |
| Ethnicity | White | 7874 (97.3%) | 96.9% | 98.0% |
| Other | 203 (2.5%) | 2.9% | 1.9% | |
| Missing | 13 (0.2%) | 0.2% | 0.2% | |
| Region of employment | England | 5822 (72.0% | 70.5% | 74.3% |
| Scotland | 1333 (16.5%) | 18.4% | 13.3% | |
| Wales | 792 (9.8%) | 9.5% | 10.3% | |
| Missing | 143 (1.8%) | 1.6% | 2.1% | |
| Educational level | A levels/Higher or equivalent | 2599 (32.1%) | 32.0% | 32.4% |
| Bachelor’s degree or higher | 2222 (27.5%) | 27.5% | 30.2% | |
| Other | 3268 (40.4%) | 42.3% | 37.3% | |
| Missing | 1 (0%) | 0% | 0% | |
| Annual household income | Less than £37,999 | 2123 (26.2%) | 19.8% | 36.6% |
| £38,000 to £77,999 | 5162 (63.8%) | 70.4% | 53.1% | |
| More than £78,000 | 804 (9.9%) | 9.7% | 10.3% | |
| Missing | 1 (0%) | 0% | 0% | |
| Relationship status | Married/cohabitating | 6411 (79.2%) | 85.8% | 68.8% |
| Single | 876 (10.8%) | 6.5% | 17.9% | |
| Divorced/separated | 619 (7.7%) | 6.4% | 9.7% | |
| Other | 183 (2.3%) | 1.4% | 3.6% | |
| Missing | 1 (0%) | 0% | 0% | |
| Work hours | Less than/equal to 40 hours/week | 3766 (46.6%) | 36.3% | 63.0% |
| Greater than 40 hours/week | 4324 (53.4%) | 63.7% | 37.0% | |
| Missing | 1 (0%) | 0% | 0% | |
| Physical activity * | Low | 1561 (19.3%) | 17.3% | 22.4% |
| Moderate | 5593 (69.1%) | 70.4% | 67.0% | |
| High | 936 (11.6%) | 12.2% | 10.5% | |
| Missing | 1 (0%) | 0% | 0% | |
| Smoker status | Current | 634 (7.8%) | 6.6% | 9.8% |
| Former | 1874 (23.2%) | 23.3% | 22.9% | |
| Never | 5582 (69.0%) | 70.1% | 67.3% | |
| Missing | 1 (0%) | 0% | 0% | |
| Alcohol consumption | Yes | 7479 (92.4%) | 93.8% | 90.2% |
| No | 611 (7.6%) | 6.2% | 9.8% | |
| Missing | 0 (0%) | 0% | 0% | |
| Sleep duration | Less than 7 hours | 2563 (31.7%) | 34.4% | 27.3% |
| 7–8 hours | 5262 (65.0%) | 63.1% | 68.2% | |
| 9 hours or more | 264 (3.3%) | 2.5% | 4.5% | |
| Missing | 1 (0%) | 0% | 0% |
* Scored using the IPAQ Scoring Guidelines for vigorous, moderate and low activity categories.
Figure 1K-means optimal cluster determination. The lowest Bayesian Information Criterion (BIC) indicates the best fit model and thus the optimal number of clusters for k-means clustering analysis.
Cross-sectional associations between k-means three-cluster solution top hit diet exposure quartiles with cardiometabolic risk prevalence (N = 8090). Values are presented as prevalence ratio (95% confidence interval). Significant p-values (p <0.05) are marked in bold.
| Cluster 1: Saturated Fat Intake (%kcal *) | |||||
|---|---|---|---|---|---|
| Quartile 1 | Quartile 2 | Quartile 3 | Quartile 4 |
| |
| Mean (SD †) | 8 (1) | 11 (0) | 12 (0) | 15 (2) | |
| Interquartile Range | 3–9 | 10–11 | 12–13 | 14–30 | |
| Prevalent Cases/N | 478/1443 | 702/2054 | 822/2233 | 910/2360 | |
| Model 1 | 1.00 (ref) | 1.01 (0.91; 1.11) | 1.06 (0.96; 1.17) | 1.06 (0.95; 1.17) | 0.27 |
| Model 2 | 1.00 (ref) | 1.01 (0.91; 1.11) | 1.05 (0.95; 1.16) | 1.05 (0.94; 1.16) | 0.37 |
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 39 (4) | 46 (1) | 49 (1) | 56 (4) | |
| Interquartile Range | 10–43 | 44–47 | 48–51 | 52–76 | |
| Prevalent Cases/N | 851/1955 | 668/1804 | 665/1918 | 728/2413 | |
| Model 1 | 1.00 (ref) | 0.88 (0.80; 0.95) | 0.85 (0.77; 0.92) | 0.79 (0.71; 0.86) |
|
| Model 2 | 1.00 (ref) | 0.88 (0.81; 0.96) | 0.85 (0.78; 0.93) | 0.79 (0.71; 0.86) |
|
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 13 (1) | 16 (0) | 17 (0) | 21 (3) | |
| Interquartile Range | 8–14 | 15–16 | 17–18 | 19–55 | |
| Prevalent Cases/N | 437/1344 | 771/2144 | 820/2162 | 884/2440 | |
| Model 1 | 1.00 (ref) | 1.05 (0.95; 1.16) | 1.07 (0.97; 1.18) | 1.02 (0.91; 1.13) | 0.83 |
| Model 2 | 1.00 (ref) | 1.05 (0.95; 1.16) | 1.07 (0.97; 1.18) | 1.02 (0.91; 1.13) | 0.82 |
Model 1: adjusted for age and sex, as well as for the quartiles of the most representative variable of the other clusters, respectively. Model 2: adjusted for covariates of Model 1, plus education level, region of employment, work hours and sleep duration. * kilocalorie, † standard deviation.
Figure 2Dendrogram and correlation matrix of diet exposures. Variables are ordered by best fit to clusters determined by hierarchical clustering. The four-cluster solution is outlined in dashed lines. Yellow indicates a positive correlation coefficient; blue indicates a negative correlation coefficient. Variable order from top to bottom and left to right: (1) average fat for breakfast; (2) average saturated fat for breakfast; (3) average fat for lunch; (4) average saturated fat for lunch; (5) average daily fat intake; (6) average daily saturated fat intake; (7) average fat for dinner; (8) average saturated fat for dinner; (9) average daily eating occasions; (10) dinner frequency; (11) breakfast frequency; (12) lunch frequency; (13) late night and early morning snack frequency; (14) average calories for snacks; (15) evening snack frequency; (16) average calories for breakfast; (17) average calories for dinner; (18) average calories for lunch; (19) average daily caloric intake; (20) average fiber for breakfast; (21) average fiber for dinner; (22) average fiber for lunch; (23) average daily fiber intake; (24) average carbohydrates for breakfast; (25) average sugar for breakfast; (26) average carbohydrates for lunch; (27) average sugar for lunch; (28) average daily carbohydrate intake; (29) average daily sugar intake; (30) average carbohydrates for dinner; (31) average sugar for dinner; (32) average protein for lunch; (33) average daily protein intake; (34) average protein for dinner; (35) average protein for breakfast; (36) saturated fat irregularity; (37) fat irregularity; (38) breakfast irregularity; (39) lunch irregularity; (40) dinner irregularity; (41) sugar irregularity; (42) carbohydrate irregularity; (43) protein irregularity; (44) fiber irregularity; (45) daily caloric irregularity.
Figure 3Correlation matrix of top two diet exposures for k-means 4- cluster solution. Yellow indicates a positive correlation coefficient; blue indicates a negative correlation coefficient.
Cross-sectional associations between k-means four-cluster solution top hit diet exposure quartiles with cardiometabolic risk prevalence (N = 8090). Values are presented as prevalence ratio (95% confidence interval). Significant p-values (p < 0.05) are marked in bold.
| Cluster 1: Saturated Fat Intake (%kcal *) | |||||
|---|---|---|---|---|---|
| Quartile 1 | Quartile 2 | Quartile 3 | Quartile 4 |
| |
| Mean (SD †) | 8 (1) | 11 (0) | 12 (0) | 15 (2) | |
| Interquartile Range | 3–9 | 10–11 | 12–13 | 14–30 | |
| Prevalent Cases/N | 478/1443 | 702/2054 | 822/2233 | 910/2360 | |
| Model 1 | 1.00 (ref) | 0.99 (0.89; 1.10) | 1.02 (0.92; 1.13) | 1.01 (0.90; 1.12) | 0.87 |
| Model 2 | 1.00 (ref) | 0.99 (0.89; 1.10) | 1.02 (0.92; 1.13) | 1.00 (0.89; 1.12) | 0.95 |
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 39 (4) | 46 (1) | 49 (1) | 56 (4) | |
| Interquartile Range | 10–43 | 44–47 | 48–51 | 52–76 | |
| Prevalent Cases/N | 851/1955 | 668/1804 | 665/1918 | 728/2413 | |
| Model 1 | 1.00 (ref) | 0.93 (0.85; 1.01) | 0.93 (0.84; 1.01) | 0.89 (0.81; 0.98) |
|
| Model 2 | 1.00 (ref) | 0.93 (0.85; 1.01) | 0.92 (0.84; 1.01) | 0.89 (0.80; 0.98) |
|
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 6.32 (0.92) | 8.33 (0.48) | 10.09 (0.57) | 13.30 (2.09) | |
| Interquartile Range | 2.54—7.50 | 7.51—9.16 | 9.17—11.14 | 11.15—35.62 | |
| Prevalent Cases/N | 806/2020 | 757/2024 | 688/2019 | 661/2027 | |
| Model 1 | 1.00 (ref) | 0.91 (0.84; 0.99) | 0.81 (0.73; 0.89) | 0.76 (0.68; 0.85) |
|
| Model 2 | 1.00 (ref) | 0.92 (0.84; 1.00) | 0.82 (0.74; 0.90) | 0.78 (0.70; 0.86) |
|
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 3.06 (0.35) | 3.8 (0.16) | 4.35 (0.16) | 5.17 (0.44) | |
| Interquartile Range | 1.71—3.43 | 3.57–4.00 | 4.14—4.57 | 4.71—7.71 | |
| Prevalent Cases/N | 649/1747 | 819/2150 | 755/2013 | 689/2162 | |
| Model 1 | 1.00 (ref) | 0.99 (0.90; 1.08) | 0.94 (0.85; 1.03) | 0.76 (0.68; 0.85) |
|
| Model 2 | 1.00 (ref) | 0.99 (0.90; 1.08) | 0.94 (0.85; 1.04) | 0.76 (0.68; 0.86) |
|
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 1374.52 (174.91) | 1759.66 (85.25) | 2056.47 (91.38) | 2596.07 (330.91) | |
| Interquartile Range | 627.33—1603.69 | 1603.91—1906.23 | 1906.24—2227.87 | 2227.97—4620.40 | |
| Prevalent Cases/N | 678/2023 | 716/2022 | 763/2022 | 755/2023 | |
| Model 1 | 1.00 (ref) | 0.96 (0.87; 1.06) | 0.96 (0.86; 1.06) | 0.95 (0.84; 1.07) | 0.49 |
| Model 2 | 1.00 (ref) | 0.97 (0.88; 1.07) | 0.97 (0.87; 1.07) | 0.96 (0.85; 1.08) | 0.64 |
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 13 (1) | 16 (0) | 17 (0) | 21 (3) | |
| Interquartile Range | 8–14 | 15–16 | 17–18 | 19–55 | |
| Prevalent Cases/N | 437/1344 | 771/2144 | 820/2162 | 884/2440 | |
| Model 1 | 1.00 (ref) | 1.06 (0.96; 1.17) | 1.08 (0.97; 1.19) | 1.02 (0.91; 1.14) | 0.86 |
| Model 2 | 1.00 (ref) | 1.06 (0.96; 1.17) | 1.08 (0.97; 1.19) | 1.02 (0.91; 1.14) | 0.88 |
|
| |||||
|
|
|
|
|
| |
| Mean (SD) | 7 (2) | 11 (1) | 15 (1) | 23 (5) | |
| Interquartile Range | 0–9 | 10 -12 | 13–17 | 18–79 | |
| Prevalent Cases/N | 573/1706 | 621/1782 | 919/2520 | 799/2082 | |
| Model 1 | 1.00 (ref) | 1.01 (0.92; 1.12) | 1.05 (0.95; 1.14) | 1.04 (0.94; 1.14) | 0.39 |
| Model 2 | 1.00 (ref) | 1.01 (0.91; 1.11) | 1.05 (0.95; 1.14) | 1.03 (0.93; 1.14) | 0.43 |
Model 1: adjusted for age and sex, as well as for the quartiles of the most representative variable of the other clusters, respectively. Model 2: adjusted for covariates of Model 1, plus education level, region of employment, work hours and sleep duration. *kilocalorie, †standard deviation.