| Literature DB >> 35875792 |
Mark Stecker1, Mona Stecker1, Allison B Reiss2, Lora Kasselman2.
Abstract
There is conflicting information on the relationship between diet and dementia. The purposes of this pilot study were twofold. First, to use publicly available data regarding food consumption (United Kingdom Family Food), dementia, risk and demographic factors to find relationships between the consumption of various foods to dementia prevalence. The second purpose was to identify elements of study design that had important effects on the results. Multiple analyses were performed on different data sets derived from the existing data. Statistical testing began with univariate correlation analyses corrected for multiple testing followed by global tests for significance. Subsequently, a number of multivariate techniques were applied including stepwise linear regression, cluster regression, regularized regression, and principal components analysis. Permutation tests and simulations highlighted the strength and weakness of each technique. The univariate analyses demonstrated that the consumption of certain foods was highly associated with the prevalence of dementia. However, because of the complexity of the data set and the high degree of correlation between variables, different multivariate analyses yielded different results, explainable by the correlations. Some factors identified as having potential associations were the consumption of rice, sugar, fruit, potatoes, meat products and fish. However, within a given dietary category there were often a number of different elements with different relations to dementia. This pilot study demonstrates some critical elements for a future study: (1) dietary factors must be very narrowly defined, (2) large numbers of cases are needed to support multivariable analyses. (3) Multiple statistical methods along with simulations must be used to confirm results.Entities:
Keywords: dementia; diet; multivariate analysis; prevalence; rice; risk factors; statistical simulations
Year: 2022 PMID: 35875792 PMCID: PMC9298542 DOI: 10.3389/fnagi.2022.606424
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.702
Comparison of the two studies performed in this paper on diet and dementia using the United Kingdom family food data.
| Element | Study 1 | Study 2 | Comment |
| United Kingdom Family Food: Years | 2016–2017 | 2011, 2012, 2013, 2014, 2015, 2015, 2016, 2017, 2018, 2018, and 2019 | Although data was available from 2001–2019 only the listed years had complete data on dementia prevalence |
| Dementia index | Fraction of the total population with dementia. | “Dementia QOF”-Fraction of patients seen by a general practitioner with a diagnosis of dementia | Dementia QOF available only for England. Dementia prevalence in other locations came from other data sets ( |
| Geography | (1) 9 Regions of England | (1) 9 Regions of England | In the second study data from the regions of England was reconstructed from data at the Clinical Commissioning Group or Count level ( |
| Comorbidities | Entered by hand from various sources ( | Downloaded and processed directly from NHS websites and ONS websites | For study 2, mappings between different geographies in England used to find to construct results that are valid for each region of England ( |
| Univariate | (1) Pearson Correlation | (1) Pearson Correlation | For a study with limited cases, univariate comparison are a critical primary result. |
| Partial Correlations for Food Variables | No | Yes | Removes effects of age, race and gender from the food variables. |
| Permutation tests for univariate measures | No | Yes | The use of permutation tests increases statistical certainty about the test results. |
| Exhaustive Regression | Limited | Yes | Time intensive but has low false positive rate. Arbitrary choice of what defines optimal models |
| Stepwise Multiple Regression | No | Yes | Forward very valuable with ranked variables to reduce false positives but reverse often eliminates likely important variables. |
| Cluster Regression | No | Yes | Limited value as clustering not biologically based. |
| Principal Components Analysis | Yes | Limited | Limited Value as clustering not biologically based. |
| Regularized Regression | No | Yes | |
| Construction of Single weighted food index | No | Yes | Weights chosen according to the partial correlation coefficients or through an iterative variance minimizing search. |
| Simulations | No | Yes | For a complex analysis, simulations allow a better picture of the true statistical nature of any process. |
FIGURE 1Illustration of the global tests for significance in the first study. (A) Histogram of the p-values generated by the Pearson correlation tests between each food variable and dementia in the first study. (B) The cumulative density function (CDF) describing the p-values. The bootstrap distribution is that derived by randomizing the dementia variable values and is linear as expected. The Kolmogorov-Smirnov test demonstrates that the observed distribution is unlikely to be uniform and hence that there is a significant relationship between the explanatory variables and the outcome variable.
FIGURE 2Illustration of the global tests for the second study. (A) Histogram of Spearman correlation p-values for the food variables with the “full” data set. (B) The CDF of the p-values showing a strongly non-uniform shape. (C) The CDF of Spearman correlation p-values in the second study with the “averaged” data set. (D) The CDF of the partial correlation p-values with the “full” data set. Although the shape is closer to that of a uniform distribution, the Kolmogorov-Smirnov test still shows a large statistical difference.
FIGURE 3Data from the ∑R2 test for global significance with (A) the Spearman correlation coefficients and (B) the partial correlation coefficients for each food variable. In each case the single bar at the right represents the actual sum of R2 over all food variables in the actual data set. The smaller bars to the left in each graph show the results of computing the sum of the values of R2 over 200 random permutations of the data in the dementia variable. These simulations essentially show the values of ∑R2 in the absence of any relationship-the null hypothesis. The inset at the lower right shows a cartoon of the findings expected when there is or is not a significant relationship between the explanatory variables and the outcome variable.
Univariate relationships between study variables and dementia in the first and second study.
| Second Study | First Study | |||||||
| Variable Name | p | R | Partial p | Partial R | Variable | R | p | p |
| Fraction 65 + (3) | 2.7E-17 | 0.80 | − | − |
| 0.99 | 4.9E-07 | 0.00024 |
| Other cereal convenience foods (250) | 1.6E-13 | 0.74 | 8.1E-06 | 0.50 | % Mixed Race in Population | −0.99 | 1.2E-06 | 0.0011 |
| Other cereal foods – frozen and not frozen (255) | 4.8E-13 | 0.73 | 7.2E-09 | 0.62 | % Black Race in Population | −0.98 | 6.5E-06 | 0.0037 |
| Other convenience meat products – frozen or not frozen (78) | 2.6E-12 | 0.71 | 0.00042 | 0.40 |
| −0.97 | 1.2E-05 | 0.00094 |
| 5.6E-12 | 0.70 | 6.2E-13 | 0.72 | % Other Race in Population | −0.97 | 1.7E-05 | 0.017 | |
| Stroke: QOF prevalence (all ages) (14) | 4.0E-09 | 0.63 | 0.019 | 0.28 |
| −0.97 | 1.9E-05 | 0.0025 |
| Meat based ready meals and convenience meat products (76) | 7.4E-09 | 0.62 | 0.016 | 0.28 | Death Rate | 0.97 | 2.4E-05 | 0.0016 |
| Hot dogs and sausage sandwiches (330) | 1.9E-08 | 0.60 | 1.61E-07 | 0.57 | Birth Rate | −0.97 | 2.6E-05 | 0.0016 |
| Fresh and processed potatoes (355) | 4.1E-08 | 0.59 | 0.0011 | 0.38 | % Asian Race in Population | −0.96 | 5.5E-05 | 0.042 |
| Frozen fruit and fruit products (194) | 4.5E-08 | 0.59 | 5.0E-05 | 0.46 | Mineral or Spring Waters | −0.95 | 1.2E-04 | 0.036 |
| Pizzas – frozen and not frozen (248) | 7.6E-08 | 0.58 | 0.044 | 0.24 | Population density | −0.94 | 1.6E-04 | 0.0096 |
| Other soft fruit, fresh (186) | 8.4E-08 | 0.58 | 0.013 | 0.29 |
| 0.94 | 1.9E-04 | 0.013 |
| Other sponge cakes or desserts (not cream cakes) (477) | 2.8E-07 | 0.56 | 8.0E-05 | 0.45 | % Other White Race in Population | −0.94 | 2.0E-04 | 0.049 |
| Champagne, sparkling wines and wine with mixer (301) | 3.9E-07 | 0.56 | 2.7E-06 | 0.52 | Prevalence of Hypertension | 0.93 | 2.3E-04 | 0.04 |
| White British (8) | 4.4E-07 | 0.55 | − | − | Bacon and Ham Uncooked | 0.93 | 3.4E-04 | 0.05 |
| Soft drinks, not concentrated, low calorie (287) | 4.5E-07 | 0.55 | 0.0031 | 0.34 | Salmon, fresh, chilled or frozen–total | −0.92 | 5.4E-04 | 0.0125 |
| Butter and margarine eaten out (429) | 4.5E-07 | 0.55 | 0.013 | 0.29 | Median Age | 0.91 | 5.6E-04 | 0.03 |
| Other root vegetables or tubers, e.g., turnip, parsnip, radish, beetroot (373) | 1.5E-06 | 0.53 | 0.00073 | 0.39 | Other Food And Drink | −0.91 | 7.1E-04 | 0.077* |
| Pizza (247) | 5.2E-06 | 0.51 | 2.3E-07 | 0.57 |
| 0.91 | 7.3E-04 | 0.00391 |
19 variables with the lowest p-values are shown. In the first study, ranking was done according to the Pearson correlation probability p
FIGURE 4Plot of the relationship between the food index derived by standardizing the food variables and multiplying each by their partial correlation coefficients and dementia.
Comparison of index elements in the DASH, Mediterranean and MIND diets (Morris et al., 2015) that are also in the current study.
| Prior study indices | Current target group | Current study group extremes | |||||||
|
|
|
| |||||||
| Index | DASH | MeDi | MIND | R Mean | R Std | R Min | Minimum element | R Max | Maximum element |
| Grains | − | − | − | –0.12 | 0.28 | –0.65 | Bread | 0.72 | Cooked rice |
| Vegetables | − | − | 0.031 | 0.22 | –0.51 | Fresh Green Vegetables | 0.39 | Other root vegetables | |
| Green Leafy Vegetables | − | –0.44 | 0.076 | –0.49 | Lettuce and Leafy Salads | –0.39 | Leafy Salad Fresh | ||
| Potatoes | − | –0.074 | 0.28 | –0.53 | Potatoes | 0.43 | Potatoes-Mashed | ||
| Fruits | − | − | –0.10 | 0.34 | –0.73 | Pure fruit Juice | 0.46 | Frozen Fruit and fruit products | |
| Dairy | − | + | –0.11 | 0.29 | –0.60 | Hard Cheese Cheddar type | 0.43 | Soft drinks including Milk | |
| Red meats | + | + | –0.08 | 0.19 | –0.30 | Beef Steak-More Expensive | 0.22 | Corned Meat | |
| Fish | − | − | 0.078 | 0.16 | –0.18 | White Fish Frozen | 0.38 | Other tinned or bottled fish | |
| Poultry | − | − | 0.10 | 0.23 | –0.27 | Other poultry | 0.52 | Chicken burger | |
| Nuts, seeds, and legumes | − | − | 0.065 | 0.075 | –0.046 | Nuts and Crisps | 0.11 | Nuts edible seeds and peanut butter | |
| Beans | − | –0.18 | 0.32 | –0.50 | Beans, fresh | 0.19 | Other canned beans and pulses | ||
| Nuts | − | 0.065 | 0.075 | –0.046 | Nuts and crisps | 0.11 | Nuts edible seeds and peanut butter | ||
| Total Fat | + | –0.19 | 0.26 | –0.60 | All other fats | 0.09 | Fats, preserves, sugar and custard | ||
| Olive oil | − | − | –0.4 | ||||||
| Butter margarine | + | 0.11 | 0.26 | –0.08 | Butter | 0.291 | Butter and margarine | ||
| Cheese | + | –0.026 | 0.30 | –0.60 | Hard cheese cheddar type | 0.37 | Cheese and egg dishes or pizza | ||
| Sweets | + | –0.085 | 0.28 | –0.40 | Chocolate coated bars and sweets | 0.33 | Boiled sweets | ||
| Pastries sweets | + | –0.30 | 0.27 | –0.52 | Cakes and pastries not frozen | 0.17 | Take away pastries | ||
| Sodium | + | –0.49 | |||||||
| Alcohol | + | + | 0.038 | 0.31 | –0.40 | Fortified wines | 0.52 | Champagne, sparkling wines and wine with mixer | |
In columns 2, 3, 4, a+ sign indicates that consuming more of the given food changes the index in such a way as to increase the risk of dementia. A – sign indicates the opposite. There are often many entries in the family food database that would be associated with the index element indicated in each diet. The values listed under current target group are the mean partial correlation coefficient and its standard deviation over all food elements that fit the Index element in the current study. To the right of this is the food variable in the group with the smallest R and the food variable with the largest R. If the results from