Literature DB >> 21856440

Applying semantic-based probabilistic context-free grammar to medical language processing--a preliminary study on parsing medication sentences.

Hua Xu1, Samir AbdelRahman, Yanxin Lu, Joshua C Denny, Son Doan.   

Abstract

Semantic-based sublanguage grammars have been shown to be an efficient method for medical language processing. However, given the complexity of the medical domain, parsers using such grammars inevitably encounter ambiguous sentences, which could be interpreted by different groups of production rules and consequently result in two or more parse trees. One possible solution, which has not been extensively explored previously, is to augment productions in medical sublanguage grammars with probabilities to resolve the ambiguity. In this study, we associated probabilities with production rules in a semantic-based grammar for medication findings and evaluated its performance on reducing parsing ambiguity. Using the existing data set from 2009 i2b2 NLP (Natural Language Processing) challenge for medication extraction, we developed a semantic-based CFG (Context Free Grammar) for parsing medication sentences and manually created a Treebank of 4564 medication sentences from discharge summaries. Using the Treebank, we derived a semantic-based PCFG (Probabilistic Context Free Grammar) for parsing medication sentences. Our evaluation using a 10-fold cross validation showed that the PCFG parser dramatically improved parsing performance when compared to the CFG parser.
Copyright © 2011 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21856440      PMCID: PMC3226929          DOI: 10.1016/j.jbi.2011.08.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  30 in total

1.  Mining free-text medical records.

Authors:  D T Heinze; M L Morsch; J Holbrook
Journal:  Proc AMIA Symp       Date:  2001

2.  "Understanding" medical school curriculum content using KnowledgeMap.

Authors:  Joshua C Denny; Jeffrey D Smithers; Randolph A Miller; Anderson Spickard
Journal:  J Am Med Inform Assoc       Date:  2003-03-28       Impact factor: 4.497

Review 3.  Two biomedical sublanguages: a description based on the theories of Zellig Harris.

Authors:  Carol Friedman; Pauline Kra; Andrey Rzhetsky
Journal:  J Biomed Inform       Date:  2002-08       Impact factor: 6.317

4.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

5.  A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors:  W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal:  J Biomed Inform       Date:  2001-10       Impact factor: 6.317

6.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

7.  A schema for representing medical language applied to clinical radiology.

Authors:  C Friedman; J J Cimino; S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1994 May-Jun       Impact factor: 4.497

Review 8.  Natural language processing and the representation of clinical data.

Authors:  N Sager; M Lyman; C Bucknall; N Nhan; L J Tick
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

9.  A general natural-language text processor for clinical radiology.

Authors:  C Friedman; P O Alderson; J H Austin; J J Cimino; S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

10.  Facilitating cancer research using natural language processing of pathology reports.

Authors:  Hua Xu; Kristin Anderson; Victor R Grann; Carol Friedman
Journal:  Stud Health Technol Inform       Date:  2004
View more
  2 in total

1.  Mission and Sustainability of Informatics for Integrating Biology and the Bedside (i2b2).

Authors:  Shawn Murphy; Adam Wilcox
Journal:  EGEMS (Wash DC)       Date:  2014-09-11

2.  Analysis of cross-institutional medication description patterns in clinical narratives.

Authors:  Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Siddhartha R Jonnalagadda; Kavishwar B Wagholikar; Stephen T Wu; Christopher G Chute; Hongfang Liu
Journal:  Biomed Inform Insights       Date:  2013-06-24
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.