Literature DB >> 12957786

Using lexical disambiguation and named-entity recognition to improve spelling correction in the electronic patient record.

Patrick Ruch1, Robert Baud, Antoine Geissbühler.   

Abstract

In this article, we show how a set of natural language processing (NLP) tools can be combined to improve the processing of clinical records. The study concentrates on improving spelling correction, which is of major importance for quality control in the electronic patient record (EPR). As first task, we report on the design of an improved interactive tool for correcting spelling errors. Unlike traditional systems, the linguistic context (both semantic and syntactic) is used to improve the correction strategy. The system is organized along three modules. Module 1 is based on a classical spelling checker, it means that it is context-independent and simply measures a string-edit-distance between a misspelled word and a list of well-formed words. Module 2 attempts to rank more relevantly the set of candidates provided by the first module using morpho-syntactic disambiguation tools. Module 3 processes words with the same part-of-speech (POS) and apply word-sense (WS) disambiguation in order to rerank the set of candidates. As second task, we show how this improved interactive spell checker can be cast as a fully automatic system by adjunction of another NLP module: a named-entity (NE) extractor, i.e. a tool able to identify words as such patient and physician names. This module is used to avoid replacement of named-entities when the system is not used in an interactive mode. Results confirm that using the linguistic context can improve interactive spelling correction, and justify the use of named-entity recognizer to conduct fully automatic spelling correction. It is concluded that NLP is mature enough to help information processing in EPR.

Entities:  

Mesh:

Year:  2003        PMID: 12957786     DOI: 10.1016/s0933-3657(03)00052-6

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  11 in total

1.  GeneRIF quality assurance as summary revision.

Authors:  Zhiyong Lu; K Bretonnel Cohen; Lawrence Hunter
Journal:  Pac Symp Biocomput       Date:  2007

2.  Data from clinical notes: a perspective on the tension between structure and flexible documentation.

Authors:  S Trent Rosenbloom; Joshua C Denny; Hua Xu; Nancy Lorenzi; William W Stead; Kevin B Johnson
Journal:  J Am Med Inform Assoc       Date:  2011-01-12       Impact factor: 4.497

3.  An Ensemble Method for Spelling Correction in Consumer Health Questions.

Authors:  Halil Kilicoglu; Marcelo Fiszman; Kirk Roberts; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

4.  Automated Misspelling Detection and Correction in Persian Clinical Text.

Authors:  Azita Yazdani; Marjan Ghazisaeedi; Nasrin Ahmadinejad; Masoumeh Giti; Habibe Amjadi; Azin Nahvijou
Journal:  J Digit Imaging       Date:  2020-06       Impact factor: 4.056

5.  Automated identification of drug and food allergies entered using non-standard terminology.

Authors:  Richard H Epstein; Paul St Jacques; Michael Stockin; Brian Rothman; Jesse M Ehrenfeld; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-06-07       Impact factor: 4.497

Review 6.  What can natural language processing do for clinical decision support?

Authors:  Dina Demner-Fushman; Wendy W Chapman; Clement J McDonald
Journal:  J Biomed Inform       Date:  2009-08-13       Impact factor: 6.317

7.  Data-poor categorization and passage retrieval for gene ontology annotation in Swiss-Prot.

Authors:  Frédéric Ehrler; Antoine Geissbühler; Antonio Jimeno; Patrick Ruch
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

8.  Complexities, variations, and errors of numbering within clinical notes: the potential impact on information extraction and cohort-identification.

Authors:  David A Hanauer; Qiaozhu Mei; V G Vinod Vydiswaran; Karandeep Singh; Zach Landis-Lewis; Chunhua Weng
Journal:  BMC Med Inform Decis Mak       Date:  2019-04-04       Impact factor: 2.796

9.  Context-Sensitive Spelling Correction of Consumer-Generated Content on Health Care.

Authors:  Xiaofang Zhou; An Zheng; Jiaheng Yin; Rudan Chen; Xianyang Zhao; Wei Xu; Wenqing Cheng; Tian Xia; Simon Lin
Journal:  JMIR Med Inform       Date:  2015-07-31

10.  A Natural Language Processing Tool for Large-Scale Data Extraction from Echocardiography Reports.

Authors:  Chinmoy Nath; Mazen S Albaghdadi; Siddhartha R Jonnalagadda
Journal:  PLoS One       Date:  2016-04-28       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.