Literature DB >> 28455150

Evaluation of a rule-based method for epidemiological document classification towards the automation of systematic reviews.

George Karystianis1, Kristina Thayer2, Mary Wolfe2, Guy Tsafnat3.   

Abstract

INTRODUCTION: Most data extraction efforts in epidemiology are focused on obtaining targeted information from clinical trials. In contrast, limited research has been conducted on the identification of information from observational studies, a major source for human evidence in many fields, including environmental health. The recognition of key epidemiological information (e.g., exposures) through text mining techniques can assist in the automation of systematic reviews and other evidence summaries.
METHOD: We designed and applied a knowledge-driven, rule-based approach to identify targeted information (study design, participant population, exposure, outcome, confounding factors, and the country where the study was conducted) from abstracts of epidemiological studies included in several systematic reviews of environmental health exposures. The rules were based on common syntactical patterns observed in text and are thus not specific to any systematic review. To validate the general applicability of our approach, we compared the data extracted using our approach versus hand curation for 35 epidemiological study abstracts manually selected for inclusion in two systematic reviews.
RESULTS: The returned F-score, precision, and recall ranged from 70% to 98%, 81% to 100%, and 54% to 97%, respectively. The highest precision was observed for exposure, outcome and population (100%) while recall was best for exposure and study design with 97% and 89%, respectively. The lowest recall was observed for the population (54%), which also had the lowest F-score (70%).
CONCLUSION: The generated performance of our text-mining approach demonstrated encouraging results for the identification of targeted information from observational epidemiological study abstracts related to environmental exposures. We have demonstrated that rules based on generic syntactic patterns in one corpus can be applied to other observational study design by simple interchanging the dictionaries aiming to identify certain characteristics (i.e., outcomes, exposures). At the document level, the recognised information can assist in the selection and categorization of studies included in a systematic review.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Automation of systematic reviews; Dictionaries; Environmental health studies; Rule-based modelling; Text mining

Mesh:

Year:  2017        PMID: 28455150     DOI: 10.1016/j.jbi.2017.04.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  Data extraction methods for systematic review (semi)automation: A living systematic review.

Authors:  Lena Schmidt; Babatunde K Olorisade; Luke A McGuinness; James Thomas; Julian P T Higgins
Journal:  F1000Res       Date:  2021-05-19

2.  Improving reference prioritisation with PICO recognition.

Authors:  Austin J Brockmeier; Meizhi Ju; Piotr Przybyła; Sophia Ananiadou
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-05       Impact factor: 2.796

Review 3.  A Narrative Literature Review of Natural Language Processing Applied to the Occupational Exposome.

Authors:  Annika M Schoene; Ioannis Basinas; Martie van Tongeren; Sophia Ananiadou
Journal:  Int J Environ Res Public Health       Date:  2022-07-13       Impact factor: 4.614

4.  Automated screening of research studies for systematic reviews using study characteristics.

Authors:  Guy Tsafnat; Paul Glasziou; George Karystianis; Enrico Coiera
Journal:  Syst Rev       Date:  2018-04-25
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.