| Literature DB >> 26879185 |
Philippe J Giabbanelli1, Jean Adams1.
Abstract
OBJECTIVE: Many dietary assessment methods attempt to estimate total food and nutrient intake. If the intention is simply to determine whether participants achieve dietary recommendations, this leads to much redundant data. We used data mining techniques to explore the number of foods that intake information was required on to accurately predict achievement, or not, of key dietary recommendations.Entities:
Keywords: Data mining; Diet; Dietary assessment; Dietary pattern analysis; Nutrition
Mesh:
Year: 2016 PMID: 26879185 PMCID: PMC4873899 DOI: 10.1017/S1368980016000185
Source DB: PubMed Journal: Public Health Nutr ISSN: 1368-9800 Impact factor: 4.022
Fig. 1(colour online) Schematic illustration of a decision tree (a) and how this is formed through repeated ‘cuts’ of the data (b)
Prevalence of achieving and not achieving dietary recommendations and accuracy of decision trees to predict this, using data mining techniques on the nutritional intake of 4156 individuals (2967 individuals for fruit and vegetables) from the UK National Diet and Nutrition Survey (2008–12)
| Fruit & vegetables | Free sugars | Sodium | Fat | Saturated fat | |
|---|---|---|---|---|---|
| No. achieving recommendation without oversampling | 656 | 1472 | 2524 | 1045 | 795 |
| % | 22·1 | 35·4 | 60·7 | 25·1 | 19·1 |
| SMOTE oversampling % | 252 % (yes) | 85 % (yes) | 54 % (no) | 197 % (yes) | 322 % (yes) |
| No. achieving recommendation after oversampling | 2309 | 2679 | 2524 | 3103 | 3354 |
| No. not achieving recommendation after oversampling | 2311 | 2684 | 2513 | 3111 | 3361 |
| Decision tree with the best trade-off between accuracy and number of predictor variables | |||||
| Overall accuracy (%) | 83·1 | 76·5 | 75·9 | 72·4 | 79·7 |
| Sensitivity (%) | 82·5 | 76·1 | 81·9 | 66·3 | 75·8 |
| Specificity (%) | 83·8 | 76·9 | 69·8 | 78·4 | 83·6 |
| No. of predictor variables | 11 | 28 | 28 | 33 | 28 |
| % of all relevant food/nutrient (g) accounted for by predictor variables | 21·0 | 31·2 | 13·4 | 13·0 | 27·4 |
| Most accurate decision tree | |||||
| Overall accuracy (%) | 83·6 | 77·0 | 76·1 | 72·9 | 81·7 |
| Sensitivity (%) | 83·9 | 75·7 | 80·7 | 69·3 | 81·4 |
| Specificity (%) | 83·3 | 78·3 | 71·5 | 76·4 | 81·9 |
| No. of predictor variables | 50 | 64 | 49 | 123 | 156 |
| % of all relevant food/nutrient (g) accounted for by predictor variables | 30·8 | 38·6 | 25·4 | 29·5 | 42·7 |
SMOTE, Synthetic Minority Over-sampling TEchnique.
After oversampling using the SMOTE method (see online supplementary material).
Percentage of all fruit and vegetables (g) recorded, not just those contributing to 5-a-day portions (specifically, fruit juice can contribute a maximum of only one 5-a-day portion).
Comparison of the analytical sample with the UK population
| Adults aged 19 years or older | Children aged <19 years | |||||||
|---|---|---|---|---|---|---|---|---|
| Analytical sample ( | UK population | Analytical sample ( | UK population | |||||
| Variable |
| % |
| % |
| % |
| % |
| Female | 1182 | 56·8 | 25 198 773 | 51·5 | 1007 | 48·6 | 6 955 262 | 48·8 |
| Age (adults) | ||||||||
| 19–29 years | 296 | 14·2 | 9 447 071 | 19·3 | – | – | ||
| 30–39 years | 390 | 18·7 | 8 319 926 | 17·0 | – | – | ||
| 40–49 years | 425 | 20·4 | 9 268 735 | 18·9 | – | – | ||
| 50–59 years | 363 | 17·4 | 7 708 532 | 15·8 | – | – | ||
| 60–64 years | 181 | 8·7 | 3 807 975 | 7·8 | – | – | ||
| ≥65 years | 428 | 20·6 | 10 377 127 | 21·2 | – | – | ||
| Age (children) | ||||||||
| 0–4 years | – | – | 499 | 24·1 | 3 913 953 | 27·5 | ||
| 5–9 years | – | – | 583 | 26·4 | 3 516 615 | 24·7 | ||
| 10–14 years | – | – | 547 | 26·4 | 3 669 326 | 25·7 | ||
| 15–18 years | – | – | 444 | 21·4 | 3 152 919 | 22·1 | ||
Fig. 2(colour online) Overall accuracy (with 95 % confidence margins) of decision trees v. the number of predictor variables included, using data mining techniques on the nutritional intake of 4156 individuals (2967 individuals for fruit and vegetables) from the UK National Diet and Nutrition Survey (2008–12)
Predictor variables (individual foods, age and sex) included in decision trees for predicting achievement of five dietary recommendations, using data mining techniques on the nutritional intake of 4156 individuals (2967 individuals for fruit and vegetables) from the UK National Diet and Nutrition Survey (2008–12)
| Dietary recommendation outcome | |||||
|---|---|---|---|---|---|
| Fat | Free sugars | Fruit & vegetables | Sodium | Saturated fat | Food name |
| Yes | Yes | Yes | Age | ||
| Yes | Alcoholic soft drinks, spirit based | ||||
| Yes | Almonds, kernel only: ground almonds | ||||
| Yes | Apple juice, unsweetened, cartons, pasteurized | ||||
| Yes | Apple juice, unsweetened, UHT | ||||
| Yes | Apples, eating, raw, flesh & skin only | ||||
| Yes | Avocado pear, flesh only | ||||
| Yes | Bacon rashers, back, grilled, lean and fat | ||||
| Yes | Bacon rashers, back, not smoked, grilled, extra trim | ||||
| Yes | Baked beans in tomato sauce with pork sausages | ||||
| Yes | Yes | Bananas, raw, flesh only | |||
| Yes | Beefburger and onion, grilled | ||||
| Yes | Black pudding, fried | ||||
| Yes | Blackcurrant juice drink, ready to drink, not low calorie | ||||
| Yes | Boiled sweets, barley sugar, butterscotch, glacier mints, hard candy | ||||
| Yes | Bread, white, crusty | ||||
| Yes | Yes | Bread, white, toasted | |||
| Yes | Bread, 50 % white and 50 % wholemeal flours | ||||
| Yes | Bread, white sliced, not fortified | ||||
| Yes | Brown sauce, bottled | ||||
| Yes | Brussels sprouts, fresh, boiled | ||||
| Yes | Butter beans, dried, boiled | ||||
| Yes | Yes | Butter, salted | |||
| Yes | Butter, unsalted | ||||
| Yes | Carbonated beverages, no juice, not low calorie, canned | ||||
| Yes | Yes | Yes | Carbonated beverages, no juice, not low calorie, not canned | ||
| Yes | Celery, fresh, raw | ||||
| Yes | Chapatti, brown, no fat | ||||
| Yes | Yes | Cheese, cheddar, any other or for recipes | |||
| Yes | Cheese, cheddar, English | ||||
| Yes | Cheese, soft full fat, Philadelphia type | ||||
| Yes | Chicken fried in olive oil | ||||
| Yes | Children’s fromage frais fruit with added vitamin D | ||||
| Yes | Chocolate brownie, no nuts, purchased | ||||
| Yes | Chocolate-covered caramels, Cadbury Caramel | ||||
| Yes | Chocolate Swiss roll with butter cream, purchased | ||||
| Yes | Cola cherry cola, canned, not low calorie | ||||
| Yes | Cola, not canned, not low calorie, not caffeine free | ||||
| Yes | Coleslaw, purchased, not low calorie | ||||
| Yes | Cookies and biscuits with chocolate | ||||
| Yes | Cornetto type ice cream, chocolate or nut based | ||||
| Yes | Cranberry fruit juice drink, e.g. Ocean Spray | ||||
| Yes | Cream, double | ||||
| Yes | Cream egg | ||||
| Yes | Croissants, plain, not filled | ||||
| Yes | Drinking chocolate, instant, dry weight | ||||
| Yes | Fat spread (62–72 % fat), not polyunsaturated | ||||
| Yes | Fruit gums, wine gums | ||||
| Yes | Fruit juice drink, carbonated, not low calorie, not canned | ||||
| Yes | Fruit juice drink with 5 % fruit juice, ready to drink | ||||
| Yes | Fully coated chocolate biscuits with biscuit filling | ||||
| Yes | Garlic bread, lower fat | ||||
| Yes | Ham, unspecified, not smoked, not canned | ||||
| Yes | Hamburger, Big Mac, McDonalds | ||||
| Yes | High juice, ready to drink, not blackcurrant or low calorie | ||||
| Yes | Ice lollies | ||||
| Yes | Jaffa Cakes | ||||
| Yes | Kit Kat | ||||
| Yes | Lager, not canned, e.g. Heineken | ||||
| Yes | Lager, not canned, e.g. Skol | ||||
| Yes | Lamb scrag and neck, stewed, lean only | ||||
| Yes | Lemonade, not low calorie, not canned | ||||
| Yes | Light spreadable butter (60 % fat) | ||||
| Yes | Lucozade sport isotonic drink, not carbonated | ||||
| Yes | Yes | Mayonnaise (retail) | |||
| Yes | Yes | Milk chocolate bar | |||
| Yes | Milk shake, thick style, takeaway | ||||
| Yes | Milk, skimmed, after boiling | ||||
| Yes | Milk, whole pasteurized, winter | ||||
| Yes | Milk, whole pasteurized, summer | ||||
| Yes | Mushrooms fried in olive oil | ||||
| Yes | Naan bread, plain | ||||
| Yes | Oatcakes | ||||
| Yes | Olive oil | ||||
| Yes | Onions, boiled | ||||
| Yes | Orange juice, unsweetened, UHT | ||||
| Yes | Oven ready chips | ||||
| Yes | Papadums/poppadoms, fried in vegetable ghee | ||||
| Yes | Pasta noodles, boiled | ||||
| Yes | Pasta noodles, egg, boiled | ||||
| Yes | Pasta spaghetti, boiled, white | ||||
| Yes | Peanut butter, crunchy, not wholenut | ||||
| Yes | Pears, eating, raw, flesh & skin only, no core | ||||
| Yes | Pepperami | ||||
| Yes | Petit Filous fromage frais | ||||
| Yes | Potato cakes (scones), purchased | ||||
| Yes | Potatoes, new, boiled, skins eaten | ||||
| Yes | Potatoes, old, baked, flesh & skin | ||||
| Yes | Potatoes, old, mashed & butter | ||||
| Yes | Prawns, boiled, flesh only | ||||
| Yes | Reduced fat spread (41–62 %), not polyunsaturated | ||||
| Yes | Ribena, original blackcurrant drink, concentrate | ||||
| Yes | Robinsons Fruit Shoot | ||||
| Yes | Rolls, white, crusty | ||||
| Yes | Yes | Yes | Sausage roll, flaky pastry, purchased | ||
| Yes | Sausages, pork, grilled | ||||
| Yes | Sausages, premium pork, grilled | ||||
| Yes | Scrambled eggs with skimmed milk and no fat | ||||
| Yes | Semi-sweet biscuit | ||||
| Yes | Sex | ||||
| Yes | Soya alternative to milk, sweetened plain | ||||
| Yes | Spinach, fresh, raw | ||||
| Yes | Spreadable butter (75–80 % fat) | ||||
| Yes | Sugar, white | ||||
| Yes | Super Noodles, Batchelors, as served | ||||
| Yes | Swiss roll, individual, chocolate coated, purchased | ||||
| Yes | Tomatoes, raw | ||||
| Yes | Turkey slices, unsmoked, pre-pack or deli | ||||
| Yes | Water for concentrated soft drinks, not diet | ||||
| Yes | White chocolate buttons, mice | ||||
| Yes | Whole milk, after boiling | ||||
| Yes | Wine white, dry, not canned | ||||
| Yes | Yes | Yoghurt twin pot with cereal/crumble | |||
| Yes | Yoghurt, Greek style, cows, natural, whole milk | ||||
| Yes | Yorkshire pudding, frozen | ||||
UHT, ultra-heat treated.