
Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.

Anne-Lyse Minard, Anne-Laure Ligozat, Asma Ben Abacha, Delphine Bernhard, Bruno Cartoni, Louise Deléger, Brigitte Grau, Sophie Rosset, Pierre Zweigenbaum, Cyril Grouin.

Abstract

OBJECTIVE: This paper describes the approaches the authors developed while participating in the i2b2/VA 2010 challenge to automatically extract medical concepts and annotate assertions on concepts and relations between concepts.
DESIGN: The authors' approaches rely on both rule-based and machine-learning methods. Natural language processing is used to extract features from the input texts; these features are then used in the authors' machine-learning approaches. The authors used Conditional Random Fields for concept extraction, and Support Vector Machines for assertion and relation annotation. Depending on the task, the authors tested various combinations of rule-based and machine-learning methods.
RESULTS: The authors' assertion annotation system obtained an F-measure of 0.931, ranking fifth out of 21 participants at the i2b2/VA 2010 challenge. The authors' relation annotation system ranked third out of 16 participants with a 0.709 F-measure. The 0.773 F-measure the authors obtained on concept extraction did not place in the top 10.
CONCLUSION: On the one hand, the authors confirm that the use of only machine-learning methods is highly dependent on the annotated training data, and thus obtained better results for well-represented classes. On the other hand, the use of only a rule-based method was not sufficient to deal with new types of data. Finally, the use of hybrid approaches combining machine-learning and rule-based approaches yielded higher scores.
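
ILLUSTRATION: The abstract does not say how the rule-based and machine-learning components were combined, so the following is only a minimal sketch of one possible arrangement for assertion annotation: a rule layer answers when a trigger phrase fires, and a learned classifier handles everything else. The trigger list, the function names, and the stand-in classifier are all hypothetical and are not taken from the authors' system; a trained Support Vector Machine would take the place of the stub. The last function computes a balanced F-measure for one class, assuming the usual harmonic mean of precision and recall.

```python
import re
from typing import Callable, Optional, Sequence

# Hypothetical trigger phrases, for illustration only (not the authors' rules).
ABSENT_TRIGGERS = re.compile(r"\b(no|denies|without|negative for)\b", re.IGNORECASE)


def rule_based_assertion(sentence: str, concept: str) -> Optional[str]:
    """Return 'absent' if a trigger phrase precedes the concept, else None."""
    position = sentence.lower().find(concept.lower())
    if position == -1:
        return None  # concept not found in the sentence: the rules abstain
    if ABSENT_TRIGGERS.search(sentence[:position]):
        return "absent"
    return None


def hybrid_assertion(
    sentence: str,
    concept: str,
    ml_classifier: Callable[[str, str], str],
) -> str:
    """Use the rule layer when it fires; otherwise defer to the classifier."""
    label = rule_based_assertion(sentence, concept)
    if label is not None:
        return label
    return ml_classifier(sentence, concept)


def f_measure(gold: Sequence[str], predicted: Sequence[str], label: str) -> float:
    """Balanced F-measure (harmonic mean of precision and recall) for one class."""
    pairs = list(zip(gold, predicted))
    true_pos = sum(1 for g, p in pairs if g == label and p == label)
    false_pos = sum(1 for g, p in pairs if g != label and p == label)
    false_neg = sum(1 for g, p in pairs if g == label and p != label)
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)


def always_present(sentence: str, concept: str) -> str:
    """Stand-in for a trained classifier; always predicts 'present'."""
    return "present"
```

With the stub classifier, `hybrid_assertion("The patient denies chest pain.", "chest pain", always_present)` is decided by the rule layer, while a sentence with no trigger before the concept falls through to the classifier. Putting the rules first is a design choice made for this sketch; the reverse order, or using rule outputs as classifier features, would be equally plausible readings of "hybrid".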


Year: 2011; PMID: 21597105; PMCID: PMC3168313; DOI: 10.1136/amiajnl-2011-000154

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497


