Literature DB >> 21482563

A combined Fuzzy and Naive Bayesian strategy can be used to assign event codes to injury narratives.

H Marucci-Wellman1, M Lehto, H Corns.   

Abstract

BACKGROUND: Bayesian methods show promise for classifying injury narratives from large administrative datasets into cause groups. This study examined a combined approach where two Bayesian models (Fuzzy and Naïve) were used to either classify a narrative or select it for manual review.
METHODS: Injury narratives were extracted from claims filed with a worker's compensation insurance provider between January 2002 and December 2004. Narratives were separated into a training set (n=11,000) and prediction set (n=3,000). Expert coders assigned two-digit Bureau of Labor Statistics Occupational Injury and Illness Classification event codes to each narrative. Fuzzy and Naïve Bayesian models were developed using manually classified cases in the training set. Two semi-automatic machine coding strategies were evaluated. The first strategy assigned cases for manual review if the Fuzzy and Naïve models disagreed on the classification. The second strategy selected additional cases for manual review from the Agree dataset using prediction strength to reach a level of 50% computer coding and 50% manual coding.
RESULTS: When agreement alone was used as the filtering strategy, the majority were coded by the computer (n=1,928, 64%) leaving 36% for manual review. The overall combined (human plus computer) sensitivity was 0.90 and positive predictive value (PPV) was >0.90 for 11 of 18 2-digit event categories. Implementing the 2nd strategy improved results with an overall sensitivity of 0.95 and PPV >0.90 for 17 of 18 categories.
CONCLUSIONS: A combined Naïve-Fuzzy Bayesian approach can classify some narratives with high accuracy and identify others most beneficial for manual review, reducing the burden on human coders.

Entities:  

Mesh:

Year:  2011        PMID: 21482563     DOI: 10.1136/ip.2010.030593

Source DB:  PubMed          Journal:  Inj Prev        ISSN: 1353-8047            Impact factor:   2.399


  6 in total

1.  Development and evaluation of a Naïve Bayesian model for coding causation of workers' compensation claims.

Authors:  S J Bertke; A R Meyers; S J Wurzelbacher; J Bell; M L Lampl; D Robins
Journal:  J Safety Res       Date:  2012-11-01

2.  Applying Machine Learning to Workers' Compensation Data to Identify Industry-Specific Ergonomic and Safety Prevention Priorities: Ohio, 2001 to 2011.

Authors:  Alysha R Meyers; Ibraheem S Al-Tarawneh; Steven J Wurzelbacher; P Timothy Bushnell; Michael P Lampl; Jennifer L Bell; Stephen J Bertke; David C Robins; Chih-Yu Tseng; Chia Wei; Jill A Raudabaugh; Teresa M Schnorr
Journal:  J Occup Environ Med       Date:  2018-01       Impact factor: 2.162

3.  Harnessing information from injury narratives in the 'big data' era: understanding and applying machine learning for injury surveillance.

Authors:  Kirsten Vallmuur; Helen R Marucci-Wellman; Jennifer A Taylor; Mark Lehto; Helen L Corns; Gordon S Smith
Journal:  Inj Prev       Date:  2016-01-04       Impact factor: 2.399

4.  Workers' compensation claim counts and rates by injury event/exposure among state-insured private employers in Ohio, 2007-2017.

Authors:  Steven J Wurzelbacher; Alysha R Meyers; Michael P Lampl; P Timothy Bushnell; Stephen J Bertke; David C Robins; Chih-Yu Tseng; Steven J Naber
Journal:  J Safety Res       Date:  2021-09-17

5.  Comparison of methods for auto-coding causation of injury narratives.

Authors:  S J Bertke; A R Meyers; S J Wurzelbacher; A Measure; M P Lampl; D Robins
Journal:  Accid Anal Prev       Date:  2015-12-30

6.  Injury narrative text classification using factorization model.

Authors:  Lin Chen; Kirsten Vallmuur; Richi Nayak
Journal:  BMC Med Inform Decis Mak       Date:  2015-05-20       Impact factor: 2.796

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.