
Use of random forest to estimate population attributable fractions from a case-control study of Salmonella enterica serotype Enteritidis infections.

W Gu, A R Vieira, R M Hoekstra, P M Griffin, D Cole.

Abstract

To design effective food safety programmes we need to estimate, from case-control studies, how many sporadic foodborne illnesses are caused by specific food sources. Logistic regression has substantive limitations for analysing structured questionnaire data with numerous exposures and missing values. We adapted random forest to analyse data from a case-control study of Salmonella enterica serotype Enteritidis illness for source attribution. To estimate summary population attributable fractions (PAFs) of exposures grouped into transmission routes, we devised a counterfactual estimator that predicts the reduction in illness associated with removing grouped exposures. For comparison, we fitted the data using logistic regression models with stepwise forward and backward variable selection. Our results show that forward and backward variable selection in the logistic regression models gave inconsistent parameter estimates and identified different significant exposures. By contrast, the random forest model produced estimated PAFs of grouped exposures consistent in rank order with results obtained from outbreak data, with egg-related exposures having the highest estimated PAF (22·1%, 95% confidence interval 8·5-31·8). Random forest might be structurally more coherent and efficient than logistic regression models for attributing Salmonella illnesses to sources involving many causal pathways.
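The counterfactual idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' estimator: the function name `counterfactual_paf`, the stand-in model `toy_risk`, the exposure names and the four records are all hypothetical and chosen only for illustration. The sketch compares the total predicted illness under the observed exposures with the total predicted after every exposure in a group is set to "unexposed". In a real analysis `predict_risk` would presumably be the predicted probability from a fitted random forest, and the case-control sampling design would need to be accounted for, which this sketch does not attempt.

```python
from typing import Callable, Dict, Iterable, List

Record = Dict[str, int]  # exposure name -> 1 (exposed) or 0 (unexposed)


def counterfactual_paf(
    records: List[Record],
    predict_risk: Callable[[Record], float],
    exposure_group: Iterable[str],
) -> float:
    """Fraction of predicted illness removed when a group of exposures is switched off."""
    group = list(exposure_group)
    observed = 0.0
    counterfactual = 0.0
    for rec in records:
        # Predicted risk with exposures as reported.
        observed += predict_risk(rec)
        # Predicted risk for the same person with the grouped exposures removed.
        unexposed = dict(rec)
        for name in group:
            unexposed[name] = 0
        counterfactual += predict_risk(unexposed)
    if observed == 0:
        raise ValueError("total predicted risk is zero; PAF is undefined")
    return (observed - counterfactual) / observed


def toy_risk(rec: Record) -> float:
    """Hypothetical additive risk model standing in for a fitted classifier."""
    risk = 0.1  # assumed baseline risk
    if rec.get("raw_eggs"):
        risk += 0.2
    if rec.get("eggs_outside_home"):
        risk += 0.1
    if rec.get("chicken"):
        risk += 0.1
    return risk


# Made-up records, not data from the study.
records = [
    {"raw_eggs": 1, "eggs_outside_home": 0, "chicken": 1},
    {"raw_eggs": 0, "eggs_outside_home": 1, "chicken": 0},
    {"raw_eggs": 0, "eggs_outside_home": 0, "chicken": 1},
    {"raw_eggs": 0, "eggs_outside_home": 0, "chicken": 0},
]

egg_paf = counterfactual_paf(records, toy_risk, ["raw_eggs", "eggs_outside_home"])
chicken_paf = counterfactual_paf(records, toy_risk, ["chicken"])
print(f"egg-related PAF: {egg_paf:.3f}")
print(f"chicken PAF: {chicken_paf:.3f}")
```

Grouping the exposures before removal, rather than summing single-exposure PAFs, avoids double counting when one person reports several exposures on the same transmission route. A confidence interval such as the one quoted in the abstract could be approximated by bootstrapping the records, although the abstract does not say how the authors obtained theirs.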

Keywords:  Causality; counterfactual; foodborne diseases; logistic regression; machine learning

Year: 2015; PMID: 25672399; PMCID: PMC9151037; DOI: 10.1017/S095026881500014X

Source DB: PubMed; Journal: Epidemiol Infect; ISSN: 0950-2688; Impact factor: 4.434


