Literature DB >> 26412196

A practical tool for public health surveillance: Semi-automated coding of short injury narratives from large administrative databases using Naïve Bayes algorithms.

Helen R Marucci-Wellman1, Mark R Lehto2, Helen L Corns3.   

Abstract

Public health surveillance programs in the U.S. are undergoing landmark changes with the availability of electronic health records and advancements in information technology. Injury narratives gathered from hospital records, workers compensation claims or national surveys can be very useful for identifying antecedents to injury or emerging risks. However, classifying narratives manually can become prohibitive for large datasets. The purpose of this study was to develop a human-machine system that could be relatively easily tailored to routinely and accurately classify injury narratives from large administrative databases such as workers compensation. We used a semi-automated approach based on two Naïve Bayesian algorithms to classify 15,000 workers compensation narratives into two-digit Bureau of Labor Statistics (BLS) event (leading to injury) codes. Narratives were filtered out for manual review if the algorithms disagreed or made weak predictions. This approach resulted in an overall accuracy of 87%, with consistently high positive predictive values across all two-digit BLS event categories including the very small categories (e.g., exposure to noise, needle sticks). The Naïve Bayes algorithms were able to identify and accurately machine code most narratives leaving only 32% (4853) for manual review. This strategy substantially reduces the need for resources compared with manual review alone.
Copyright © 2015. Published by Elsevier Ltd.

Entities:  

Keywords:  Injury; Narrative text; Naïve Bayes; Occupational; Public health; Surveillance

Mesh:

Year:  2015        PMID: 26412196     DOI: 10.1016/j.aap.2015.06.014

Source DB:  PubMed          Journal:  Accid Anal Prev        ISSN: 0001-4575


  8 in total

1.  Harnessing information from injury narratives in the 'big data' era: understanding and applying machine learning for injury surveillance.

Authors:  Kirsten Vallmuur; Helen R Marucci-Wellman; Jennifer A Taylor; Mark Lehto; Helen L Corns; Gordon S Smith
Journal:  Inj Prev       Date:  2016-01-04       Impact factor: 2.399

2.  Workers' compensation claim counts and rates by injury event/exposure among state-insured private employers in Ohio, 2007-2017.

Authors:  Steven J Wurzelbacher; Alysha R Meyers; Michael P Lampl; P Timothy Bushnell; Stephen J Bertke; David C Robins; Chih-Yu Tseng; Steven J Naber
Journal:  J Safety Res       Date:  2021-09-17

3.  The development of a machine learning algorithm to identify occupational injuries in agriculture using pre-hospital care reports.

Authors:  Erika Scott; Liane Hirabayashi; Alex Levenstein; Nicole Krupa; Paul Jenkins
Journal:  Health Inf Sci Syst       Date:  2021-07-29

4.  Comparison of methods for auto-coding causation of injury narratives.

Authors:  S J Bertke; A R Meyers; S J Wurzelbacher; A Measure; M P Lampl; D Robins
Journal:  Accid Anal Prev       Date:  2015-12-30

5.  Testing and Validating Semi-automated Approaches to the Occupational Exposure Assessment of Polycyclic Aromatic Hydrocarbons.

Authors:  Albeliz Santiago-Colón; Carissa M Rocheleau; Stephen Bertke; Annette Christianson; Devon T Collins; Emma Trester-Wilson; Wayne Sanderson; Martha A Waters; Jennita Reefhuis
Journal:  Ann Work Expo Health       Date:  2021-07-03       Impact factor: 2.179

6.  Mortality prediction in patients with isolated moderate and severe traumatic brain injury using machine learning models.

Authors:  Cheng-Shyuan Rau; Pao-Jen Kuo; Peng-Chen Chien; Chun-Ying Huang; Hsiao-Yun Hsieh; Ching-Hua Hsieh
Journal:  PLoS One       Date:  2018-11-09       Impact factor: 3.240

7.  Mortality, morbidity and health in developed societies: a review of data sources.

Authors:  Guillaume Wunsch; Catherine Gourbin
Journal:  Genus       Date:  2018-01-29

8.  Prediction of postoperative complications of pediatric cataract patients using data mining.

Authors:  Kai Zhang; Xiyang Liu; Jiewei Jiang; Wangting Li; Shuai Wang; Lin Liu; Xiaojing Zhou; Liming Wang
Journal:  J Transl Med       Date:  2019-01-03       Impact factor: 5.531

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.