Literature DB >> 22449720

Lexical patterns, features and knowledge resources for coreference resolution in clinical notes.

Phil Gooch1, Abdul Roudsari.   

Abstract

Generation of entity coreference chains provides a means to extract linked narrative events from clinical notes, but despite being a well-researched topic in natural language processing, general-purpose coreference tools perform poorly on clinical texts. This paper presents a knowledge-centric and pattern-based approach to resolving coreference across a wide variety of clinical records from two corpora (Ontology Development and Information Extraction (ODIE) and i2b2/VA), and describes a method for generating coreference chains using progressively pruned linked lists that reduces the search space and facilitates evaluation by a number of metrics. Independent evaluation results give an F-measure for each corpus of 79.2% and 87.5%, respectively. A baseline of blind coreference of mentions of the same class gives F-measures of 65.3% and 51.9% respectively. For the ODIE corpus, recall is significantly improved over the baseline (p<0.05) but overall there was no statistically significant improvement in F-measure (p>0.05). For the i2b2/VA corpus, recall, precision, and F-measure are significantly improved over the baseline (p<0.05). Overall, our approach offers performance at least as good as human annotators and greatly increased performance over general-purpose tools. The system uses a number of open-source components that are available to download.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 22449720     DOI: 10.1016/j.jbi.2012.02.012

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  1 in total

1.  An Infinite Mixture Model for Coreference Resolution in Clinical Notes.

Authors:  Sijia Liu; Hongfang Liu; Vipin Chaudhary; Dingcheng Li
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2016-07-22
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.