Melissa C Friesen1, Susan M Shortreed2, David C Wheeler3, Igor Burstyn4, Roel Vermeulen5, Anjoeka Pronk6, Joanne S Colt7, Dalsu Baris7, Margaret R Karagas8, Molly Schwenn9, Alison Johnson10, Karla R Armenti11, Debra T Silverman7, Kai Yu12. 1. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA friesenmc@mail.nih.gov. 2. 2.Biostatistics, Group Health Research Institute, Seattle, WA 98101-1448, USA. 3. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA 3.Department of Biostatistics, Virginia Commonwealth University, Richmond, VA 23298, USA. 4. 4.Department of Environmental and Occupational Health, Drexel University, Philadelphia, PA 19104, USA. 5. 5.Utrecht University, Utrecht, The Netherlands. 6. 6.TNO, Utrecht, The Netherlands. 7. 1.Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA. 8. 7.Geisel School of Medicine at Dartmouth, Hanover, NH 03756, USA. 9. 8.Maine Cancer Registry, Augusta, ME 04333-0011, USA. 10. 9.Vermont Cancer Registry, Burlington, VT 05402-0070, USA. 11. 10.New Hampshire Department of Health and Human Services, Division of Public Health Services, Bureau of Public Health Statistics and Informatics, Concord, NH 03301, USA. 12. 11.Biostatistics, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA.
Abstract
OBJECTIVES: Rule-based expert exposure assessment based on questionnaire response patterns in population-based studies improves the transparency of the decisions. The number of unique response patterns, however, can be nearly equal to the number of jobs. An expert may reduce the number of patterns that need assessment using expert opinion, but each expert may identify different patterns of responses that identify an exposure scenario. Here, hierarchical clustering methods are proposed as a systematic data reduction step to reproducibly identify similar questionnaire response patterns prior to obtaining expert estimates. As a proof-of-concept, we used hierarchical clustering methods to identify groups of jobs (clusters) with similar responses to diesel exhaust-related questions and then evaluated whether the jobs within a cluster had similar (previously assessed) estimates of occupational diesel exhaust exposure. METHODS: Using the New England Bladder Cancer Study as a case study, we applied hierarchical cluster models to the diesel-related variables extracted from the occupational history and job- and industry-specific questionnaires (modules). Cluster models were separately developed for two subsets: (i) 5395 jobs with ≥1 variable extracted from the occupational history indicating a potential diesel exposure scenario, but without a module with diesel-related questions; and (ii) 5929 jobs with both occupational history and module responses to diesel-relevant questions. For each subset, we varied the numbers of clusters extracted from the cluster tree developed for each model from 100 to 1000 groups of jobs. Using previously made estimates of the probability (ordinal), intensity (µg m(-3) respirable elemental carbon), and frequency (hours per week) of occupational exposure to diesel exhaust, we examined the similarity of the exposure estimates for jobs within the same cluster in two ways. First, the clusters' homogeneity (defined as >75% with the same estimate) was examined compared to a dichotomized probability estimate (<5 versus ≥5%; <50 versus ≥50%). Second, for the ordinal probability metric and continuous intensity and frequency metrics, we calculated the intraclass correlation coefficients (ICCs) between each job's estimate and the mean estimate for all jobs within the cluster. RESULTS: Within-cluster homogeneity increased when more clusters were used. For example, ≥80% of the clusters were homogeneous when 500 clusters were used. Similarly, ICCs were generally above 0.7 when ≥200 clusters were used, indicating minimal within-cluster variability. The most within-cluster variability was observed for the frequency metric (ICCs from 0.4 to 0.8). We estimated that using an expert to assign exposure at the cluster-level assignment and then to review each job in non-homogeneous clusters would require ~2000 decisions per expert, in contrast to evaluating 4255 unique questionnaire patterns or 14983 individual jobs. CONCLUSIONS: This proof-of-concept shows that using cluster models as a data reduction step to identify jobs with similar response patterns prior to obtaining expert ratings has the potential to aid rule-based assessment by systematically reducing the number of exposure decisions needed. While promising, additional research is needed to quantify the actual reduction in exposure decisions and the resulting homogeneity of exposure estimates within clusters for an exposure assessment effort that obtains cluster-level expert assessments as part of the assessment process. Published by Oxford University Press on behalf of the British Occupational Hygiene Society 2014.
OBJECTIVES: Rule-based expert exposure assessment based on questionnaire response patterns in population-based studies improves the transparency of the decisions. The number of unique response patterns, however, can be nearly equal to the number of jobs. An expert may reduce the number of patterns that need assessment using expert opinion, but each expert may identify different patterns of responses that identify an exposure scenario. Here, hierarchical clustering methods are proposed as a systematic data reduction step to reproducibly identify similar questionnaire response patterns prior to obtaining expert estimates. As a proof-of-concept, we used hierarchical clustering methods to identify groups of jobs (clusters) with similar responses to diesel exhaust-related questions and then evaluated whether the jobs within a cluster had similar (previously assessed) estimates of occupational diesel exhaust exposure. METHODS: Using the New England Bladder Cancer Study as a case study, we applied hierarchical cluster models to the diesel-related variables extracted from the occupational history and job- and industry-specific questionnaires (modules). Cluster models were separately developed for two subsets: (i) 5395 jobs with ≥1 variable extracted from the occupational history indicating a potential diesel exposure scenario, but without a module with diesel-related questions; and (ii) 5929 jobs with both occupational history and module responses to diesel-relevant questions. For each subset, we varied the numbers of clusters extracted from the cluster tree developed for each model from 100 to 1000 groups of jobs. Using previously made estimates of the probability (ordinal), intensity (µg m(-3) respirable elemental carbon), and frequency (hours per week) of occupational exposure to diesel exhaust, we examined the similarity of the exposure estimates for jobs within the same cluster in two ways. First, the clusters' homogeneity (defined as >75% with the same estimate) was examined compared to a dichotomized probability estimate (<5 versus ≥5%; <50 versus ≥50%). Second, for the ordinal probability metric and continuous intensity and frequency metrics, we calculated the intraclass correlation coefficients (ICCs) between each job's estimate and the mean estimate for all jobs within the cluster. RESULTS: Within-cluster homogeneity increased when more clusters were used. For example, ≥80% of the clusters were homogeneous when 500 clusters were used. Similarly, ICCs were generally above 0.7 when ≥200 clusters were used, indicating minimal within-cluster variability. The most within-cluster variability was observed for the frequency metric (ICCs from 0.4 to 0.8). We estimated that using an expert to assign exposure at the cluster-level assignment and then to review each job in non-homogeneous clusters would require ~2000 decisions per expert, in contrast to evaluating 4255 unique questionnaire patterns or 14983 individual jobs. CONCLUSIONS: This proof-of-concept shows that using cluster models as a data reduction step to identify jobs with similar response patterns prior to obtaining expert ratings has the potential to aid rule-based assessment by systematically reducing the number of exposure decisions needed. While promising, additional research is needed to quantify the actual reduction in exposure decisions and the resulting homogeneity of exposure estimates within clusters for an exposure assessment effort that obtains cluster-level expert assessments as part of the assessment process. Published by Oxford University Press on behalf of the British Occupational Hygiene Society 2014.
Authors: Joanne S Colt; Margaret R Karagas; Molly Schwenn; Dalsu Baris; Alison Johnson; Patricia Stewart; Castine Verrill; Lee E Moore; Jay Lubin; Mary H Ward; Claudine Samanic; Nathaniel Rothman; Kenneth P Cantor; Laura E Beane Freeman; Alan Schned; Sai Cherala; Debra T Silverman Journal: Occup Environ Med Date: 2010-09-23 Impact factor: 4.402
Authors: Melissa C Friesen; Dong-Uk Park; Joanne S Colt; Dalsu Baris; Molly Schwenn; Margaret R Karagas; Karla R Armenti; Alison Johnson; Debra T Silverman; Patricia A Stewart Journal: Am J Ind Med Date: 2014-08 Impact factor: 2.214
Authors: Melissa C Friesen; Anjoeka Pronk; David C Wheeler; Yu-Cheng Chen; Sarah J Locke; Dennis D Zaebst; Molly Schwenn; Alison Johnson; Richard Waddell; Dalsu Baris; Joanne S Colt; Debra T Silverman; Patricia A Stewart; Hormuzd A Katki Journal: Ann Occup Hyg Date: 2012-11-25
Authors: Judith Garcia-Aymerich; Federico P Gómez; Marta Benet; Eva Farrero; Xavier Basagaña; Àngel Gayete; Carles Paré; Xavier Freixa; Jaume Ferrer; Antoni Ferrer; Josep Roca; Juan B Gáldiz; Jaume Sauleda; Eduard Monsó; Joaquim Gea; Joan A Barberà; Àlvar Agustí; Josep M Antó Journal: Thorax Date: 2010-12-21 Impact factor: 9.139
Authors: Renee N Carey; Timothy R Driscoll; Susan Peters; Deborah C Glass; Alison Reid; Geza Benke; Lin Fritschi Journal: Occup Environ Med Date: 2013-10-24 Impact factor: 4.402
Authors: David C Wheeler; Igor Burstyn; Roel Vermeulen; Kai Yu; Susan M Shortreed; Anjoeka Pronk; Patricia A Stewart; Joanne S Colt; Dalsu Baris; Margaret R Karagas; Molly Schwenn; Alison Johnson; Debra T Silverman; Melissa C Friesen Journal: Occup Environ Med Date: 2012-11-15 Impact factor: 4.402
Authors: Feng-Chiao Su; Melissa C Friesen; Aleksandr B Stefaniak; Paul K Henneberger; Ryan F LeBouf; Marcia L Stanton; Xiaoming Liang; Michael Humann; M Abbas Virji Journal: Ann Work Expo Health Date: 2018-08-13 Impact factor: 2.179
Authors: Calvin B Ge; Melissa C Friesen; Hans Kromhout; Susan Peters; Nathaniel Rothman; Qing Lan; Roel Vermeulen Journal: Ann Work Expo Health Date: 2018-11-12 Impact factor: 2.179
Authors: Laura Kurth; Mohammed Abbas Virji; Eileen Storey; Susan Framberg; Christa Kallio; Jordan Fink; Anthony Scott Laney Journal: Int J Hyg Environ Health Date: 2017-09-05 Impact factor: 5.840
Authors: M Abbas Virji; Christine R Schuler; Jean Cox-Ganser; Marcia L Stanton; Michael S Kent; Kathleen Kreiss; Aleksandr B Stefaniak Journal: Ann Work Expo Health Date: 2019-10-11 Impact factor: 2.179
Authors: Albeliz Santiago-Colón; Carissa M Rocheleau; Stephen Bertke; Annette Christianson; Devon T Collins; Emma Trester-Wilson; Wayne Sanderson; Martha A Waters; Jennita Reefhuis Journal: Ann Work Expo Health Date: 2021-07-03 Impact factor: 2.179