Literature DB >> 29082809

Suitability of random forest analysis for epidemiological research: Exploring sociodemographic and lifestyle-related risk factors of overweight in a cross-sectional design.

Noora Kanerva1,2, Jukka Kontto2, Maijaliisa Erkkola3, Jaakko Nevalainen4, Satu Männistö2.   

Abstract

AIMS: Factors that contribute to the development of overweight are numerous and form a complex structure with many unknown interactions and associations. We aimed to explore this structure (i.e. the mutual importance or hierarchy of sociodemographic and lifestyle-related risk factors of being overweight) using a machine-learning technique called random forest (RF). The results were compared with traditional logistic regression (LR) analysis.
METHODS: The cross-sectional FINRISK 2007 Study included 4757 Finns (aged 25-74 years). Information on participants' lifestyle and sociodemographic characteristics were collected with questionnaires. Diet was assessed, using a validated food-frequency questionnaire. Height and weight were measured. Participants with a body mass index (BMI) ≥25 kg/m2 were classified as overweight. R-statistical software was used to run RF analysis ('randomForest') to derive estimates for variable importance and out-of-bag error, which were compared to a LR model.
RESULTS: In total, 704 (32%) men and 1119 (44%) women had normal BMI, whereas 1502 (69%) men and 1432 (57%) women had BMI ≥25. Estimated error rates for the models were similar (RF vs. LR: 42% vs. 40% for men, 38% vs. 35% for women). Both models ranked age, education and physical activity as the most important risk factors for being overweight, but RF ranked macronutrients (carbohydrates and protein) as more important compared to LR.
CONCLUSIONS: RF did not demonstrate higher power in variable selection compared to LR in our study. The features of RF are more likely to appear beneficial in settings with a larger number of predictors.

Entities:  

Keywords:  Machine learning; mutual importance; obesity; random forest; risk factor

Mesh:

Year:  2017        PMID: 29082809     DOI: 10.1177/1403494817736944

Source DB:  PubMed          Journal:  Scand J Public Health        ISSN: 1403-4948            Impact factor:   3.021


  8 in total

1.  Classifying Non-Dementia and Alzheimer's Disease/Vascular Dementia Patients Using Kinematic, Time-Based, and Visuospatial Parameters: The Digital Clock Drawing Test.

Authors:  Anis Davoudi; Catherine Dion; Shawna Amini; Patrick J Tighe; Catherine C Price; David J Libon; Parisa Rashidi
Journal:  J Alzheimers Dis       Date:  2021       Impact factor: 4.472

2.  Machine learning as a strategy to account for dietary synergy: an illustration based on dietary intake and adverse pregnancy outcomes.

Authors:  Lisa M Bodnar; Abigail R Cartus; Sharon I Kirkpatrick; Katherine P Himes; Edward H Kennedy; Hyagriv N Simhan; William A Grobman; Jennifer Y Duffy; Robert M Silver; Samuel Parry; Ashley I Naimi
Journal:  Am J Clin Nutr       Date:  2020-06-01       Impact factor: 8.472

3.  Application of machine learning to understand child marriage in India.

Authors:  Anita Raj; Nabamallika Dehingia; Abhishek Singh; Lotus McDougal; Julian McAuley
Journal:  SSM Popul Health       Date:  2020-12-05

4.  Machine-Learning for Prescription Patterns: Random Forest in the Prediction of Dose and Number of Antipsychotics Prescribed to People with Schizophrenia.

Authors:  Mattia Marchi; Giacomo Galli; Gianluca Fiore; Andrew Mackinnon; Giorgio Mattei; Fabrizio Starace; Gian M Galeazzi
Journal:  Clin Psychopharmacol Neurosci       Date:  2022-08-31       Impact factor: 3.731

5.  Association of body composition with pubertal timing in children and adolescents from Guangzhou, China.

Authors:  Yijin Zheng; Jianping Liang; Ding Zeng; Weiqing Tan; Lun Yang; Shuang Lu; Wanwen Yao; Yi Yang; Li Liu
Journal:  Front Public Health       Date:  2022-08-17

Review 6.  Fertility is a key predictor of the double burden of malnutrition among women of child-bearing age in sub-Saharan Africa.

Authors:  Jason Mulimba Were; Saverio Stranges; Irena F Creed
Journal:  J Glob Health       Date:  2020-12       Impact factor: 4.413

7.  Quantitative methods for descriptive intersectional analysis with binary health outcomes.

Authors:  Mayuri Mahendran; Daniel Lizotte; Greta R Bauer
Journal:  SSM Popul Health       Date:  2022-01-22

Review 8.  A scoping review on the use of machine learning in research on social determinants of health: Trends and research prospects.

Authors:  Shiho Kino; Yu-Tien Hsu; Koichiro Shiba; Yung-Shin Chien; Carol Mita; Ichiro Kawachi; Adel Daoud
Journal:  SSM Popul Health       Date:  2021-06-05
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.