Literature DB >> 21459927

Anaphoric relations in the clinical narrative: corpus creation.

Guergana K Savova1, Wendy W Chapman, Jiaping Zheng, Rebecca S Crowley.   

Abstract

OBJECTIVE: The long-term goal of this work is the automated discovery of anaphoric relations from the clinical narrative. The creation of a gold standard set from a cross-institutional corpus of clinical notes and high-level characteristics of that gold standard are described.
METHODS: A standard methodology for annotation guideline development, gold standard annotations, and inter-annotator agreement (IAA) was used.
RESULTS: The gold standard annotations resulted in 7214 markables, 5992 pairs, and 1304 chains. Each report averaged 40 anaphoric markables, 33 pairs, and seven chains. The overall IAA is high on the Mayo dataset (0.6607), and moderate on the University of Pittsburgh Medical Center (UPMC) dataset (0.4072). The IAA between each annotator and the gold standard is high (Mayo: 0.7669, 0.7697, and 0.9021; UPMC: 0.6753 and 0.7138). These results imply a quality corpus feasible for system development. They also suggest the complementary nature of the annotations performed by the experts and the importance of an annotator team with diverse knowledge backgrounds. LIMITATIONS: Only one of the annotators had the linguistic background necessary for annotation of the linguistic attributes. The overall generalizability of the guidelines will be further strengthened by annotations of data from additional sites. This will increase the overall corpus size and the representation of each relation type.
CONCLUSION: The first step toward the development of an anaphoric relation resolver as part of a comprehensive natural language processing system geared specifically for the clinical narrative in the electronic medical record is described. The deidentified annotated corpus will be available to researchers.

Entities:  

Mesh:

Year:  2011        PMID: 21459927      PMCID: PMC3128403          DOI: 10.1136/amiajnl-2011-000108

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  14 in total

1.  A broad-coverage natural language processing system.

Authors:  C Friedman
Journal:  Proc AMIA Symp       Date:  2000

2.  Automatic detection of acute bacterial pneumonia from chest X-ray reports.

Authors:  M Fiszman; W W Chapman; D Aronsky; R S Evans; P J Haug
Journal:  J Am Med Inform Assoc       Date:  2000 Nov-Dec       Impact factor: 4.497

3.  Exploring semantic groups through visual approaches.

Authors:  Olivier Bodenreider; Alexa T McCray
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

4.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

5.  Agreement, the f-measure, and reliability in information retrieval.

Authors:  George Hripcsak; Adam S Rothschild
Journal:  J Am Med Inform Assoc       Date:  2005-01-31       Impact factor: 4.497

6.  Automatic extraction of PIOPED interpretations from ventilation/perfusion lung scan reports.

Authors:  M Fiszman; P J Haug; P R Frederick
Journal:  Proc AMIA Symp       Date:  1998

7.  Towards a comprehensive medical language processing system: methods and issues.

Authors:  C Friedman
Journal:  Proc AMIA Annu Fall Symp       Date:  1997

8.  Experience with a mixed semantic/syntactic parser.

Authors:  P J Haug; S Koehler; L M Lau; P Wang; R Rocha; S M Huff
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1995

9.  Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model.

Authors:  Anni Coden; Guergana Savova; Igor Sominsky; Michael Tanenblatt; James Masanz; Karin Schuler; James Cooper; Wei Guan; Piet C de Groen
Journal:  J Biomed Inform       Date:  2008-12-27       Impact factor: 6.317

10.  Electronic interpretation of chest radiograph reports to detect central venous catheters.

Authors:  William E Trick; Wendy W Chapman; Mary F Wisniewski; Brian J Peterson; Steven L Solomon; Robert A Weinstein
Journal:  Infect Control Hosp Epidemiol       Date:  2003-12       Impact factor: 3.254

View more
  24 in total

1.  It's about this and that: a description of anaphoric expressions in clinical text.

Authors:  Yan Wang; Genevieve B Melton; Serguei Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  A system for coreference resolution for the clinical narrative.

Authors:  Jiaping Zheng; Wendy W Chapman; Timothy A Miller; Chen Lin; Rebecca S Crowley; Guergana K Savova
Journal:  J Am Med Inform Assoc       Date:  2012-01-31       Impact factor: 4.497

Review 3.  Evaluating the state of the art in coreference resolution for electronic medical records.

Authors:  Ozlem Uzuner; Andreea Bodnari; Shuying Shen; Tyler Forbush; John Pestian; Brett R South
Journal:  J Am Med Inform Assoc       Date:  2012-02-24       Impact factor: 4.497

4.  Named entity recognition of follow-up and time information in 20,000 radiology reports.

Authors:  Yan Xu; Junichi Tsujii; Eric I-Chao Chang
Journal:  J Am Med Inform Assoc       Date:  2012-07-06       Impact factor: 4.497

5.  Automatic discourse connective detection in biomedical text.

Authors:  Balaji Polepalli Ramesh; Rashmi Prasad; Tim Miller; Brian Harrington; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2012-06-28       Impact factor: 4.497

6.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.

Authors:  Jyotishman Pathak; Abel N Kho; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-12       Impact factor: 4.497

7.  Automatically correlating clinical findings and body locations in radiology reports using MedLEE.

Authors:  Merlijn Sevenster; Rob van Ommering; Yuechen Qian
Journal:  J Digit Imaging       Date:  2012-04       Impact factor: 4.056

Review 8.  Natural language processing: an introduction.

Authors:  Prakash M Nadkarni; Lucila Ohno-Machado; Wendy W Chapman
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

9.  Inter-annotator reliability of medical events, coreferences and temporal relations in clinical narratives by annotators with varying levels of clinical expertise.

Authors:  Preethi Raghavan; Eric Fosler-Lussier; Albert M Lai
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

10.  Towards generalizable entity-centric clinical coreference resolution.

Authors:  Timothy Miller; Dmitriy Dligach; Steven Bethard; Chen Lin; Guergana Savova
Journal:  J Biomed Inform       Date:  2017-04-21       Impact factor: 6.317

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.