Hui Yang1. 1. Department of Computing, Open University, Milton Keynes, UK. h.yang@open.ac.uk
Abstract
OBJECTIVE: This article describes a system developed for the 2009 i2b2 Medication Extraction Challenge. The purpose of this challenge is to extract medication information from hospital discharge summaries. DESIGN: The system explored several linguistic natural language processing techniques (eg, term-based and token-based rule matching) to identify medication-related information in the narrative text. A number of lexical resources was constructed to profile lexical or morphological features for different categories of medication constituents. MEASUREMENTS: Performance was evaluated in terms of the micro-averaged F-measure at the horizontal system level. RESULTS: The automated system performed well, and achieved an F-micro of 80% for the term-level results and 81% for the token-level results, placing it sixth in exact matches and fourth in inexact matches in the i2b2 competition. CONCLUSION: The overall results show that this relatively simple rule-based approach is capable of tackling multiple entity identification tasks such as medication extraction under situations in which few training documents are annotated for machine learning approaches, and the entity information can be characterized with a set of feature tokens.
OBJECTIVE: This article describes a system developed for the 2009 i2b2 Medication Extraction Challenge. The purpose of this challenge is to extract medication information from hospital discharge summaries. DESIGN: The system explored several linguistic natural language processing techniques (eg, term-based and token-based rule matching) to identify medication-related information in the narrative text. A number of lexical resources was constructed to profile lexical or morphological features for different categories of medication constituents. MEASUREMENTS: Performance was evaluated in terms of the micro-averaged F-measure at the horizontal system level. RESULTS: The automated system performed well, and achieved an F-micro of 80% for the term-level results and 81% for the token-level results, placing it sixth in exact matches and fourth in inexact matches in the i2b2 competition. CONCLUSION: The overall results show that this relatively simple rule-based approach is capable of tackling multiple entity identification tasks such as medication extraction under situations in which few training documents are annotated for machine learning approaches, and the entity information can be characterized with a set of feature tokens.
Authors: Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Christopher G Chute; Hongfang Liu Journal: J Am Med Inform Assoc Date: 2014-03-17 Impact factor: 4.497
Authors: George Karystianis; Azad Dehghan; Aleksandar Kovacevic; John A Keane; Goran Nenadic Journal: J Biomed Inform Date: 2015-06-29 Impact factor: 6.317
Authors: Miguel García-Remesal; Alejandro García-Ruiz; David Pérez-Rey; Diana de la Iglesia; Víctor Maojo Journal: Biomed Res Int Date: 2012-12-27 Impact factor: 3.411