Literature DB >> 11687566

Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS.

P G Mutalik1, A Deshpande, P M Nadkarni.   

Abstract

OBJECTIVES: To test the hypothesis that most instances of negated concepts in dictated medical documents can be detected by a strategy that relies on tools developed for the parsing of formal (computer) languages-specifically, a lexical scanner ("lexer") that uses regular expressions to generate a finite state machine, and a parser that relies on a restricted subset of context-free grammars, known as LALR(1) grammars.
METHODS: A diverse training set of 40 medical documents from a variety of specialties was manually inspected and used to develop a program (Negfinder) that contained rules to recognize a large set of negated patterns occurring in the text. Negfinder's lexer and parser were developed using tools normally used to generate programming language compilers. The input to Negfinder consisted of medical narrative that was preprocessed to recognize UMLS concepts: the text of a recognized concept had been replaced with a coded representation that included its UMLS concept ID. The program generated an index with one entry per instance of a concept in the document, where the presence or absence of negation of that concept was recorded. This information was used to mark up the text of each document by color-coding it to make it easier to inspect. The parser was then evaluated in two ways: 1) a test set of 60 documents (30 discharge summaries, 30 surgical notes) marked-up by Negfinder was inspected visually to quantify false-positive and false-negative results; and 2) a different test set of 10 documents was independently examined for negatives by a human observer and by Negfinder, and the results were compared.
RESULTS: In the first evaluation using marked-up documents, 8,358 instances of UMLS concepts were detected in the 60 documents, of which 544 were negations detected by the program and verified by human observation (true-positive results, or TPs). Thirteen instances were wrongly flagged as negated (false-positive results, or FPs), and the program missed 27 instances of negation (false-negative results, or FNs), yielding a sensitivity of 95.3 percent and a specificity of 97.7 percent. In the second evaluation using independent negation detection, 1,869 concepts were detected in 10 documents, with 135 TPs, 12 FPs, and 6 FNs, yielding a sensitivity of 95.7 percent and a specificity of 91.8 percent. One of the words "no," "denies/denied," "not," or "without" was present in 92.5 percent of all negations.
CONCLUSIONS: Negation of most concepts in medical narrative can be reliably detected by a simple strategy. The reliability of detection depends on several factors, the most important being the accuracy of concept matching.

Entities:  

Mesh:

Year:  2001        PMID: 11687566      PMCID: PMC130070          DOI: 10.1136/jamia.2001.0080598

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  10 in total

1.  Ad hoc classification of radiology reports.

Authors:  D B Aronow; F Fangfang; W B Croft
Journal:  J Am Med Inform Assoc       Date:  1999 Sep-Oct       Impact factor: 4.497

2.  UMLS concept indexing for production databases: a feasibility study.

Authors:  P Nadkarni; R Chen; C Brandt
Journal:  J Am Med Inform Assoc       Date:  2001 Jan-Feb       Impact factor: 4.497

3.  An automatic indexing method for medical documents.

Authors:  M M Wagner
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1991

4.  Representing information in patient reports using natural language processing and the extensible markup language.

Authors:  C Friedman; G Hripcsak; L Shagina; H Liu
Journal:  J Am Med Inform Assoc       Date:  1999 Jan-Feb       Impact factor: 4.497

5.  Validation of clinical problems using a UMLS-based semantic parser.

Authors:  H S Goldberg; C Hsu; V Law; C Safran
Journal:  Proc AMIA Symp       Date:  1998

6.  An analysis of statistical term strength and its use in the indexing and retrieval of molecular biology texts.

Authors:  W J Wilbur; Y Yang
Journal:  Comput Biol Med       Date:  1996-05       Impact factor: 4.589

7.  A general natural-language text processor for clinical radiology.

Authors:  C Friedman; P O Alderson; J H Austin; J J Cimino; S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

8.  UMLS knowledge for biomedical language processing.

Authors:  A T McCray; A R Aronson; A C Browne; T C Rindflesch; A Razi; S Srinivasan
Journal:  Bull Med Libr Assoc       Date:  1993-04

9.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

10.  Ambiguity resolution while mapping free text to the UMLS Metathesaurus.

Authors:  T C Rindflesch; A R Aronson
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1994
  10 in total
  58 in total

Review 1.  An introduction to information retrieval: applications in genomics.

Authors:  P M Nadkarni
Journal:  Pharmacogenomics J       Date:  2002       Impact factor: 3.550

2.  Electronically screening discharge summaries for adverse medical events.

Authors:  Harvey J Murff; Alan J Forster; Josh F Peterson; Julie M Fiskio; Heather L Heiman; David W Bates
Journal:  J Am Med Inform Assoc       Date:  2003-03-28       Impact factor: 4.497

3.  MITRE system for clinical assertion status classification.

Authors:  Cheryl Clark; John Aberdeen; Matt Coarr; David Tresner-Kirsch; Ben Wellner; Alexander Yeh; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2011-04-22       Impact factor: 4.497

4.  The challenge of negation in health care searches and queries.

Authors:  Valerie J Harvey; Constance M Ruzich; Jeanne M Baugh; Bruce Johnston; Arthur J Grant
Journal:  AMIA Annu Symp Proc       Date:  2003

5.  Extracting structured information from free text pathology reports.

Authors:  Gunther Schadow; Clement J McDonald
Journal:  AMIA Annu Symp Proc       Date:  2003

6.  Integrating query of relational and textual data in clinical databases: a case study.

Authors:  John M Fisk; Pradeep Mutalik; Forrest W Levin; Joseph Erdos; Caroline Taylor; Prakash Nadkarni
Journal:  J Am Med Inform Assoc       Date:  2003 Jan-Feb       Impact factor: 4.497

7.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

8.  Text mining neuroscience journal articles to populate neuroscience databases.

Authors:  Chiquito J Crasto; Luis N Marenco; Michele Migliore; Buqing Mao; Prakash M Nadkarni; Perry Miller; Gordon M Shepherd
Journal:  Neuroinformatics       Date:  2003

9.  Development of automated detection of radiology reports citing adrenal findings.

Authors:  Jason J Zopf; Jessica M Langer; William W Boonn; Woojin Kim; Hanna M Zafar
Journal:  J Digit Imaging       Date:  2012-02       Impact factor: 4.056

10.  Biomedical negation scope detection with conditional random fields.

Authors:  Shashank Agarwal; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.