Melissa C Friesen1, Qing Lan2, Calvin Ge3, Sarah J Locke2, Dean Hosgood4, Lin Fritschi5, Troy Sadkowsky6, Yu-Cheng Chen7, Hu Wei2, Jun Xu8, Tai Hing Lam9, Yok Lam Kwong10, Kexin Chen11, Caigang Xu12, Yu-Chieh Su13, Brian C H Chiu14, Kai Ming Dennis Ip9, Mark P Purdue2, Bryan A Bassig2, Nat Rothman2, Roel Vermeulen3. 1. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, North Bethesda, MD 20980, USA; friesenmc@mail.nih.gov. 2. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, North Bethesda, MD 20980, USA; 3. 2.University of Utrecht, Utrecht, The Netherlands; 4. 3.Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA; 5. 4.School of Public Health, Curtin University, Perth, Australia; 6. 5.Data Scientists Pty Ltd, Sunshine Coast, Queensland, Australia; 7. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, North Bethesda, MD 20980, USA; 6.Environmental Health Research Center, National Health Research Institutes, Zhunan, Taiwan; 8. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, North Bethesda, MD 20980, USA; 7.Division of Community Medicine and Public Health Practice, School of Public Health, The University of Hong Kong, Hong Kong; 9. 7.Division of Community Medicine and Public Health Practice, School of Public Health, The University of Hong Kong, Hong Kong; 10. 8.Bone Marrow Transplant Unit, Queen Mary Hospital, Hong Kong; 9.Division of Haematology, Oncology and Bone Marrow Transplantation, Department of Medicine, The University of Hong Kong, Hong Kong; 11. 10.Department of Epidemiology and Biostatistics, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China; 12. 11.Department of Hematology, Hematology Research Laboratory and Pathology, West China Hospital of Sichuan University, Chengdu, Sichuan, China; 13. 12.Division of Hematology-Oncology, Department of Internal Medicine, Buddhist Dalin Tzu Chi General Hospital, Chiayi, Taiwan; 13.School of Medicine, Tzu Chi University, Hualian, Taiwan; 14. 14.Department of Public Health Sciences, University of Chicago, Chicago, IL, USA.
Abstract
OBJECTIVE: In community-based epidemiological studies, job- and industry-specific 'modules' are often used to systematically obtain details about the subject's work tasks. The module assignment is often made by the interviewer, who may have insufficient occupational hygiene knowledge to assign the correct module. We evaluated, in the context of a case-control study of lymphoid neoplasms in Asia ('AsiaLymph'), the performance of an algorithm that provided automatic, real-time module assignment during a computer-assisted personal interview. METHODS: AsiaLymph's occupational component began with a lifetime occupational history questionnaire with free-text responses and three solvent exposure screening questions. To assign each job to one of 23 study-specific modules, an algorithm automatically searched the free-text responses to the questions 'job title' and 'product made or services provided by employer' using a list of module-specific keywords, comprising over 5800 keywords in English, Traditional and Simplified Chinese. Hierarchical decision rules were used when the keyword match triggered multiple modules. If no keyword match was identified, a generic solvent module was assigned if the subject responded 'yes' to any of the three solvent screening questions. If these question responses were all 'no', a work location module was assigned, which redirected the subject to the farming, teaching, health professional, solvent, or industry solvent modules or ended the questions for that job, depending on the location response. We conducted a reliability assessment that compared the algorithm-assigned modules to consensus module assignments made by two industrial hygienists for a subset of 1251 (of 11409) jobs selected using a stratified random selection procedure using module-specific strata. Discordant assignments between the algorithm and consensus assignments (483 jobs) were qualitatively reviewed by the hygienists to evaluate the potential information lost from missed questions with using the algorithm-assigned module (none, low, medium, high). RESULTS: The most frequently assigned modules were the work location (33%), solvent (20%), farming and food industry (19%), and dry cleaning and textile industry (6.4%) modules. In the reliability subset, the algorithm assignment had an exact match to the expert consensus-assigned module for 722 (57.7%) of the 1251 jobs. Overall, adjusted for the proportion of jobs in each stratum, we estimated that 86% of the algorithm-assigned modules would result in no information loss, 2% would have low information loss, and 12% would have medium to high information loss. Medium to high information loss occurred for <10% of the jobs assigned the generic solvent module and for 21, 32, and 31% of the jobs assigned the work location module with location responses of 'someplace else', 'factory', and 'don't know', respectively. Other work location responses had ≤8% with medium to high information loss because of redirections to other modules. Medium to high information loss occurred more frequently when a job description matched with multiple keywords pointing to different modules (29-69%, depending on the triggered assignment rule). CONCLUSIONS: These evaluations demonstrated that automatically assigned modules can reliably reproduce an expert's module assignment without the direct involvement of an industrial hygienist or interviewer. The feasibility of adapting this framework to other studies will be language- and exposure-specific. Published by Oxford University Press on behalf of the British Occupational Hygiene Society 2016.
OBJECTIVE: In community-based epidemiological studies, job- and industry-specific 'modules' are often used to systematically obtain details about the subject's work tasks. The module assignment is often made by the interviewer, who may have insufficient occupational hygiene knowledge to assign the correct module. We evaluated, in the context of a case-control study of lymphoid neoplasms in Asia ('AsiaLymph'), the performance of an algorithm that provided automatic, real-time module assignment during a computer-assisted personal interview. METHODS: AsiaLymph's occupational component began with a lifetime occupational history questionnaire with free-text responses and three solvent exposure screening questions. To assign each job to one of 23 study-specific modules, an algorithm automatically searched the free-text responses to the questions 'job title' and 'product made or services provided by employer' using a list of module-specific keywords, comprising over 5800 keywords in English, Traditional and Simplified Chinese. Hierarchical decision rules were used when the keyword match triggered multiple modules. If no keyword match was identified, a generic solvent module was assigned if the subject responded 'yes' to any of the three solvent screening questions. If these question responses were all 'no', a work location module was assigned, which redirected the subject to the farming, teaching, health professional, solvent, or industry solvent modules or ended the questions for that job, depending on the location response. We conducted a reliability assessment that compared the algorithm-assigned modules to consensus module assignments made by two industrial hygienists for a subset of 1251 (of 11409) jobs selected using a stratified random selection procedure using module-specific strata. Discordant assignments between the algorithm and consensus assignments (483 jobs) were qualitatively reviewed by the hygienists to evaluate the potential information lost from missed questions with using the algorithm-assigned module (none, low, medium, high). RESULTS: The most frequently assigned modules were the work location (33%), solvent (20%), farming and food industry (19%), and dry cleaning and textile industry (6.4%) modules. In the reliability subset, the algorithm assignment had an exact match to the expert consensus-assigned module for 722 (57.7%) of the 1251 jobs. Overall, adjusted for the proportion of jobs in each stratum, we estimated that 86% of the algorithm-assigned modules would result in no information loss, 2% would have low information loss, and 12% would have medium to high information loss. Medium to high information loss occurred for <10% of the jobs assigned the generic solvent module and for 21, 32, and 31% of the jobs assigned the work location module with location responses of 'someplace else', 'factory', and 'don't know', respectively. Other work location responses had ≤8% with medium to high information loss because of redirections to other modules. Medium to high information loss occurred more frequently when a job description matched with multiple keywords pointing to different modules (29-69%, depending on the triggered assignment rule). CONCLUSIONS: These evaluations demonstrated that automatically assigned modules can reliably reproduce an expert's module assignment without the direct involvement of an industrial hygienist or interviewer. The feasibility of adapting this framework to other studies will be language- and exposure-specific. Published by Oxford University Press on behalf of the British Occupational Hygiene Society 2016.
Authors: Joanne S Colt; Margaret R Karagas; Molly Schwenn; Dalsu Baris; Alison Johnson; Patricia Stewart; Castine Verrill; Lee E Moore; Jay Lubin; Mary H Ward; Claudine Samanic; Nathaniel Rothman; Kenneth P Cantor; Laura E Beane Freeman; Alan Schned; Sai Cherala; Debra T Silverman Journal: Occup Environ Med Date: 2010-09-23 Impact factor: 4.402
Authors: Melissa C Friesen; Anjoeka Pronk; David C Wheeler; Yu-Cheng Chen; Sarah J Locke; Dennis D Zaebst; Molly Schwenn; Alison Johnson; Richard Waddell; Dalsu Baris; Joanne S Colt; Debra T Silverman; Patricia A Stewart; Hormuzd A Katki Journal: Ann Occup Hyg Date: 2012-11-25
Authors: Renee N Carey; Timothy R Driscoll; Susan Peters; Deborah C Glass; Alison Reid; Geza Benke; Lin Fritschi Journal: Occup Environ Med Date: 2013-10-24 Impact factor: 4.402
Authors: Lin Fritschi; Geza Benke; Ann M Hughes; Anne Kricker; Claire M Vajdic; Andrew Grulich; Jennifer Turner; Samuel Milliken; John Kaldor; Bruce K Armstrong Journal: Cancer Causes Control Date: 2005-06 Impact factor: 2.506
Authors: Anjoeka Pronk; Patricia A Stewart; Joseph B Coble; Hormuzd A Katki; David C Wheeler; Joanne S Colt; Dalsu Baris; Molly Schwenn; Margaret R Karagas; Alison Johnson; Richard Waddell; Castine Verrill; Sai Cherala; Debra T Silverman; Melissa C Friesen Journal: Occup Environ Med Date: 2012-07-27 Impact factor: 4.402
Authors: Lin Fritschi; Melissa C Friesen; Deborah Glass; Geza Benke; Jennifer Girschik; Troy Sadkowsky Journal: J Environ Public Health Date: 2009-10-15
Authors: Calvin B Ge; Melissa C Friesen; Hans Kromhout; Susan Peters; Nathaniel Rothman; Qing Lan; Roel Vermeulen Journal: Ann Work Expo Health Date: 2018-11-12 Impact factor: 2.179
Authors: Zhi-Ming Mai; Jia-Huang Lin; Roger Kai-Cheong Ngan; Dora Lai-Wan Kwong; Wai-Tong Ng; Alice Wan-Ying Ng; Kai-Ming Ip; Yap-Hang Chan; Anne Wing-Mui Lee; Sai-Yin Ho; Maria Li Lung; Tai-Hing Lam Journal: Open Forum Infect Dis Date: 2020-09-12 Impact factor: 3.835