A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Min Jiang, Yukun Chen, Mei Liu, S Trent Rosenbloom, Subramani Mani, Joshua C Denny, Hua Xu.

Abstract

OBJECTIVE: The authors' goal was to develop and evaluate machine-learning-based approaches to extracting clinical entities (including medical problems, tests, and treatments, as well as their asserted status) from hospital discharge summaries written in natural language. This project was part of the 2010 Center of Informatics for Integrating Biology and the Bedside/Veterans Affairs (VA) natural-language-processing challenge.
DESIGN: The authors implemented a machine-learning-based named entity recognition system for clinical text and systematically evaluated the contributions of different types of features and machine-learning (ML) algorithms, using a training corpus of 349 annotated notes. Based on the results from the training data, the authors developed a novel hybrid clinical entity extraction system, which integrated heuristic rule-based modules with the ML-based named entity recognition module. The authors applied the hybrid system to the concept extraction and assertion classification tasks in the challenge and evaluated its performance using a test data set of 477 annotated notes.
MEASUREMENTS: Standard measures including precision, recall, and F-measure were calculated using the evaluation script provided by the Center of Informatics for Integrating Biology and the Bedside/VA challenge organizers. The overall performance for all three types of clinical entities and all six types of assertions across the 477 annotated notes was considered the primary metric in the challenge.
RESULTS AND DISCUSSION: Systematic evaluation on the training set showed that Conditional Random Fields outperformed Support Vector Machines, and that semantic information from existing natural-language-processing systems largely improved performance, although contributions from different types of features varied. The authors' hybrid entity extraction system achieved a maximum overall F-score of 0.8391 for concept extraction (ranked second) and 0.9313 for assertion classification (ranked fourth, but not statistically different from the first three systems) on the test data set in the challenge.
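
The precision, recall, and F-measure named under MEASUREMENTS can be illustrated with a minimal sketch. This is not the challenge organizers' evaluation script: the `(start, end, type)` span representation, the exact-match criterion, the function name `precision_recall_f1`, and the toy data are all assumptions made for illustration only.

```python
from typing import Set, Tuple

# Hypothetical entity representation: (start_token, end_token, entity_type).
Entity = Tuple[int, int, str]


def precision_recall_f1(gold: Set[Entity], predicted: Set[Entity]) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and balanced F-measure over entity spans."""
    # A prediction counts as correct only if span boundaries and type both match.
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data (invented for this sketch, not taken from the challenge corpus).
gold = {(0, 1, "problem"), (5, 5, "test"), (9, 10, "treatment")}
predicted = {
    (0, 1, "problem"),      # correct
    (5, 5, "treatment"),    # right span, wrong type: counted as an error
    (9, 10, "treatment"),   # correct
    (12, 12, "test"),       # spurious entity
}

p, r, f = precision_recall_f1(gold, predicted)
print(f"precision={p:.4f} recall={r:.4f} F={f:.4f}")
```

With two of four predictions correct and two of three gold entities found, precision is 0.5, recall is about 0.667, and the F-measure is 4/7, about 0.571. An evaluation that also credits partially overlapping spans would need a different matching rule than the set intersection used here.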

Year:  2011        PMID: 21508414      PMCID: PMC3168315          DOI: 10.1136/amiajnl-2011-000163

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497

