Literature DB >> 28731557

Data mining: Potential applications in research on nutrition and health.

Marijka Batterham1, Elizabeth Neale2, Allison Martin2, Linda Tapsell2.   

Abstract

AIM: Data mining enables further insights from nutrition-related research, but caution is required. The aim of this analysis was to demonstrate and compare the utility of data mining methods in classifying a categorical outcome derived from a nutrition-related intervention.
METHODS: Baseline data (23 variables, 8 categorical) on participants (n = 295) in an intervention trial were used to classify participants in terms of meeting the criteria of achieving 10 000 steps per day. Results from classification and regression trees (CARTs), random forests, adaptive boosting, logistic regression, support vector machines and neural networks were compared using area under the curve (AUC) and error assessments.
RESULTS: The CART produced the best model when considering the AUC (0.703), overall error (18%) and within class error (28%). Logistic regression also performed reasonably well compared to the other models (AUC 0.675, overall error 23%, within class error 36%). All the methods gave different rankings of variables' importance. CART found that body fat, quality of life using the SF-12 Physical Component Summary (PCS) and the cholesterol: HDL ratio were the most important predictors of meeting the 10 000 steps criteria, while logistic regression showed the SF-12PCS, glucose levels and level of education to be the most significant predictors (P ≤ 0.01).
CONCLUSIONS: Differing outcomes suggest caution is required with a single data mining method, particularly in a dataset with nonlinear relationships and outliers and when exploring relationships that were not the primary outcomes of the research.
© 2017 Dietitians Association of Australia.

Entities:  

Keywords:  artificial neural network; classification and regression tree; data mining; support vector machine

Year:  2017        PMID: 28731557     DOI: 10.1111/1747-0080.12337

Source DB:  PubMed          Journal:  Nutr Diet        ISSN: 1446-6368            Impact factor:   2.333


  2 in total

Review 1.  Prospects and Pitfalls of Machine Learning in Nutritional Epidemiology.

Authors:  Stefania Russo; Stefano Bonassi
Journal:  Nutrients       Date:  2022-04-20       Impact factor: 6.706

2.  Total energy expenditure in adults aged 65 years and over measured using doubly-labelled water: international data availability and opportunities for data sharing.

Authors:  Judi Porter; Kay Nguo; Simone Gibson; Catherine E Huggins; Jorja Collins; Nicole J Kellow; Helen Truby
Journal:  Nutr J       Date:  2018-03-26       Impact factor: 3.271

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.