OBJECTIVES: Evaluating occupational exposures in population-based case-control studies often requires exposure assessors to review each study participant's reported occupational information job-by-job to derive exposure estimates. Although such assessments likely have underlying decision rules, they usually lack transparency, are time consuming and have uncertain reliability and validity. We aimed to identify the underlying rules to enable documentation, review and future use of these expert-based exposure decisions. METHODS: Classification and regression trees (CART, predictions from a single tree) and random forests (predictions from many trees) were used to identify the underlying rules from the questionnaire responses, and an expert's exposure assignments for occupational diesel exhaust exposure for several metrics: binary exposure probability and ordinal exposure probability, intensity and frequency. Data were split into training (n=10 488 jobs), testing (n=2247) and validation (n=2248) datasets. RESULTS: The CART and random forest models' predictions agreed with 92-94% of the expert's binary probability assignments. For ordinal probability, intensity and frequency metrics, the two models extracted decision rules more successfully for unexposed and highly exposed jobs (86-90% and 57-85%, respectively) than for low or medium exposed jobs (7-71%). CONCLUSIONS: CART and random forest models extracted decision rules and accurately predicted an expert's exposure decisions for the majority of jobs, and identified questionnaire response patterns that would require further expert review if the rules were applied to other jobs in the same or different study. This approach makes the exposure assessment process in case-control studies more transparent, and creates a mechanism to efficiently replicate exposure decisions in future studies.
OBJECTIVES: Evaluating occupational exposures in population-based case-control studies often requires exposure assessors to review each study participant's reported occupational information job-by-job to derive exposure estimates. Although such assessments likely have underlying decision rules, they usually lack transparency, are time consuming and have uncertain reliability and validity. We aimed to identify the underlying rules to enable documentation, review and future use of these expert-based exposure decisions. METHODS: Classification and regression trees (CART, predictions from a single tree) and random forests (predictions from many trees) were used to identify the underlying rules from the questionnaire responses, and an expert's exposure assignments for occupational diesel exhaust exposure for several metrics: binary exposure probability and ordinal exposure probability, intensity and frequency. Data were split into training (n=10 488 jobs), testing (n=2247) and validation (n=2248) datasets. RESULTS: The CART and random forest models' predictions agreed with 92-94% of the expert's binary probability assignments. For ordinal probability, intensity and frequency metrics, the two models extracted decision rules more successfully for unexposed and highly exposed jobs (86-90% and 57-85%, respectively) than for low or medium exposed jobs (7-71%). CONCLUSIONS:CART and random forest models extracted decision rules and accurately predicted an expert's exposure decisions for the majority of jobs, and identified questionnaire response patterns that would require further expert review if the rules were applied to other jobs in the same or different study. This approach makes the exposure assessment process in case-control studies more transparent, and creates a mechanism to efficiently replicate exposure decisions in future studies.
Authors: Joanne S Colt; Margaret R Karagas; Molly Schwenn; Dalsu Baris; Alison Johnson; Patricia Stewart; Castine Verrill; Lee E Moore; Jay Lubin; Mary H Ward; Claudine Samanic; Nathaniel Rothman; Kenneth P Cantor; Laura E Beane Freeman; Alan Schned; Sai Cherala; Debra T Silverman Journal: Occup Environ Med Date: 2010-09-23 Impact factor: 4.402
Authors: Melissa C Friesen; Anjoeka Pronk; David C Wheeler; Yu-Cheng Chen; Sarah J Locke; Dennis D Zaebst; Molly Schwenn; Alison Johnson; Richard Waddell; Dalsu Baris; Joanne S Colt; Debra T Silverman; Patricia A Stewart; Hormuzd A Katki Journal: Ann Occup Hyg Date: 2012-11-25
Authors: Anjoeka Pronk; Patricia A Stewart; Joseph B Coble; Hormuzd A Katki; David C Wheeler; Joanne S Colt; Dalsu Baris; Molly Schwenn; Margaret R Karagas; Alison Johnson; Richard Waddell; Castine Verrill; Sai Cherala; Debra T Silverman; Melissa C Friesen Journal: Occup Environ Med Date: 2012-07-27 Impact factor: 4.402
Authors: Lin Fritschi; Melissa C Friesen; Deborah Glass; Geza Benke; Jennifer Girschik; Troy Sadkowsky Journal: J Environ Public Health Date: 2009-10-15
Authors: Melissa C Friesen; David C Wheeler; Roel Vermeulen; Sarah J Locke; Dennis D Zaebst; Stella Koutros; Anjoeka Pronk; Joanne S Colt; Dalsu Baris; Margaret R Karagas; Nuria Malats; Molly Schwenn; Alison Johnson; Karla R Armenti; Nathanial Rothman; Patricia A Stewart; Manolis Kogevinas; Debra T Silverman Journal: Ann Occup Hyg Date: 2016-01-04
Authors: Melissa C Friesen; Susan M Shortreed; David C Wheeler; Igor Burstyn; Roel Vermeulen; Anjoeka Pronk; Joanne S Colt; Dalsu Baris; Margaret R Karagas; Molly Schwenn; Alison Johnson; Karla R Armenti; Debra T Silverman; Kai Yu Journal: Ann Occup Hyg Date: 2014-12-03
Authors: Heidi J Fischer; Ximena P Vergara; Michael Yost; Michael Silva; David A Lombardi; Leeka Kheifets Journal: J Expo Sci Environ Epidemiol Date: 2015-05-13 Impact factor: 5.563
Authors: David C Wheeler; Kellie J Archer; Igor Burstyn; Kai Yu; Patricia A Stewart; Joanne S Colt; Dalsu Baris; Margaret R Karagas; Molly Schwenn; Alison Johnson; Karla Armenti; Debra T Silverman; Melissa C Friesen Journal: Ann Occup Hyg Date: 2014-11-27
Authors: L T Stayner; J J Collins; Y L Guo; D Heederik; M Kogevinas; K Steenland; C Wesseling; P A Demers Journal: Curr Environ Health Rep Date: 2017-09
Authors: Calvin B Ge; Melissa C Friesen; Hans Kromhout; Susan Peters; Nathaniel Rothman; Qing Lan; Roel Vermeulen Journal: Ann Work Expo Health Date: 2018-11-12 Impact factor: 2.179
Authors: Melissa C Friesen; Anjoeka Pronk; David C Wheeler; Yu-Cheng Chen; Sarah J Locke; Dennis D Zaebst; Molly Schwenn; Alison Johnson; Richard Waddell; Dalsu Baris; Joanne S Colt; Debra T Silverman; Patricia A Stewart; Hormuzd A Katki Journal: Ann Occup Hyg Date: 2012-11-25
Authors: Melissa C Friesen; Sarah J Locke; Carina Tornow; Yu-Cheng Chen; Dong-Hee Koh; Patricia A Stewart; Mark Purdue; Joanne S Colt Journal: Ann Occup Hyg Date: 2014-03-03
Authors: Anila Bello; Susan R Woskie; Rebecca Gore; Dale P Sandler; Silke Schmidt; Freya Kamel Journal: Ann Work Expo Health Date: 2017-04-01 Impact factor: 2.179