A system for coreference resolution for the clinical narrative.

Jiaping Zheng, Wendy W Chapman, Timothy A Miller, Chen Lin, Rebecca S Crowley, Guergana K Savova.

Abstract

OBJECTIVE: To research computational methods for coreference resolution in the clinical narrative and build a system implementing the best methods.
METHODS: The Ontology Development and Information Extraction corpus annotated for coreference relations consists of 7214 coreferential markables, forming 5992 pairs and 1304 chains. We trained classifiers with semantic, syntactic, and surface features pruned by feature selection. For the three system components--for the resolution of relative pronouns, personal pronouns, and noun phrases--we experimented with support vector machines with linear and radial basis function (RBF) kernels, decision trees, and perceptrons. Evaluation of algorithms and varied feature sets was performed using standard metrics.
RESULTS: The best performing combination is support vector machines with an RBF kernel and all features (MUC score=0.352, B³=0.690, CEAF=0.486, BLANC=0.596), outperforming a traditional decision tree baseline.
DISCUSSION: The application showed good performance similar to performance on general English text. The main error source was sentence distances exceeding a window of 10 sentences between markables. A possible solution to this problem is hinted at by the fact that coreferent markables sometimes occurred in predictable (although distant) note sections. Another system limitation is failure to fully utilize synonymy and ontological knowledge. Future work will investigate additional ways to incorporate syntactic features into the coreference problem.
CONCLUSION: We investigated computational methods for coreference resolution in the clinical narrative. The best methods are released as modules of the open source Clinical Text Analysis and Knowledge Extraction System and Ontology Development and Information Extraction platforms.
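
The abstract describes pairwise coreference decisions that form chains, scored with standard metrics such as B³. The following is a minimal sketch of that idea, not the authors' implementation: the function names `chains_from_pairs` and `b_cubed` are hypothetical, the toy mentions are invented for illustration, and treating a mention missing from one partition as a singleton is an assumption about the scorer variant, which the abstract does not specify.

```python
from collections import defaultdict


def chains_from_pairs(pairs):
    """Merge pairwise coreference links into chains (transitive closure) via union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        parent[find(a)] = find(b)

    groups = defaultdict(set)
    for mention in list(parent):
        groups[find(mention)].add(mention)
    # Sort chains by their smallest mention so the output order is deterministic.
    return sorted(groups.values(), key=min)


def b_cubed(gold_chains, system_chains):
    """Return B-cubed (precision, recall, F1) averaged over all mentions.

    Assumption: a mention absent from one partition is treated as a singleton there.
    """
    gold = {m: set(c) for c in gold_chains for m in c}
    system = {m: set(c) for c in system_chains for m in c}
    mentions = set(gold) | set(system)
    if not mentions:
        return 0.0, 0.0, 0.0

    precision = recall = 0.0
    for m in mentions:
        g = gold.get(m, {m})
        s = system.get(m, {m})
        overlap = len(g & s)
        precision += overlap / len(s)
        recall += overlap / len(g)

    precision /= len(mentions)
    recall /= len(mentions)
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy example (invented data): the system wrongly links "c" to the second chain.
gold_chains = [{"a", "b", "c"}, {"d", "e"}]
system_chains = chains_from_pairs([("a", "b"), ("c", "d"), ("d", "e")])
precision, recall, f1 = b_cubed(gold_chains, system_chains)
```

In this toy case the system produces the chains {a, b} and {c, d, e}, so a single misplaced mention lowers both precision and recall. This is why a mention-level metric such as B³ can diverge considerably from a link-based metric such as MUC on the same output.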

Year:  2012        PMID: 22298565      PMCID: PMC3384116          DOI: 10.1136/amiajnl-2011-000599

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497

