Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Literature DB >> 21508414

A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Min Jiang¹, Yukun Chen, Mei Liu, S Trent Rosenbloom, Subramani Mani, Joshua C Denny, Hua Xu.

Abstract

OBJECTIVE: The authors' goal was to develop and evaluate machine-learning-based approaches to extracting clinical entities-including medical problems, tests, and treatments, as well as their asserted status-from hospital discharge summaries written using natural language. This project was part of the 2010 Center of Informatics for Integrating Biology and the Bedside/Veterans Affairs (VA) natural-language-processing challenge.
DESIGN: The authors implemented a machine-learning-based named entity recognition system for clinical text and systematically evaluated the contributions of different types of features and ML algorithms, using a training corpus of 349 annotated notes. Based on the results from training data, the authors developed a novel hybrid clinical entity extraction system, which integrated heuristic rule-based modules with the ML-base named entity recognition module. The authors applied the hybrid system to the concept extraction and assertion classification tasks in the challenge and evaluated its performance using a test data set with 477 annotated notes. MEASUREMENTS: Standard measures including precision, recall, and F-measure were calculated using the evaluation script provided by the Center of Informatics for Integrating Biology and the Bedside/VA challenge organizers. The overall performance for all three types of clinical entities and all six types of assertions across 477 annotated notes were considered as the primary metric in the challenge. RESULTS AND DISCUSSION: Systematic evaluation on the training set showed that Conditional Random Fields outperformed Support Vector Machines, and semantic information from existing natural-language-processing systems largely improved performance, although contributions from different types of features varied. The authors' hybrid entity extraction system achieved a maximum overall F-score of 0.8391 for concept extraction (ranked second) and 0.9313 for assertion classification (ranked fourth, but not statistically different than the first three systems) on the test data set in the challenge.

Entities: Disease

Mesh：

Year: 2011 PMID： 21508414 PMCID： PMC3168315 DOI： 10.1136/amiajnl-2011-000163

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN： 1067-5027 Impact factor: 4.497

28 in total

1. Automatic identification of pneumonia related concepts on chest x-ray reports.

Authors: M Fiszman; W W Chapman; S R Evans; P J Haug
Journal: Proc AMIA Symp Date: 1999

2. Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS.

Authors: P G Mutalik; A Deshpande; P M Nadkarni
Journal: J Am Med Inform Assoc Date: 2001 Nov-Dec Impact factor: 4.497

3. High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge.

Authors: Jon Patrick; Min Li
Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497

4. Lancet: a high precision medication event extraction system for clinical text.

Authors: Zuofeng Li; Feifan Liu; Lamont Antieau; Yonggang Cao; Hong Yu
Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497

5. Extracting medication information from clinical text.

Authors: Ozlem Uzuner; Imre Solti; Eithon Cadag
Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497

6. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors: Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497

7. An overview of MetaMap: historical perspective and recent advances.

Authors: Alan R Aronson; François-Michel Lang
Journal: J Am Med Inform Assoc Date: 2010 May-Jun Impact factor: 4.497

8. BioTagger-GM: a gene/protein name recognition system.

Authors: Manabu Torii; Zhangzhi Hu; Cathy H Wu; Hongfang Liu
Journal: J Am Med Inform Assoc Date: 2008-12-11 Impact factor: 4.497

9. Development and evaluation of a clinical note section header terminology.

Authors: Joshua C Denny; Randolph A Miller; Kevin B Johnson; Anderson Spickard
Journal: AMIA Annu Symp Proc Date: 2008-11-06

10. MedEx: a medication information extraction system for clinical narratives.

Authors: Hua Xu; Shane P Stenner; Son Doan; Kevin B Johnson; Lemuel R Waitman; Joshua C Denny
Journal: J Am Med Inform Assoc Date: 2010 Jan-Feb Impact factor: 4.497

70 in total

1. Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text.

Authors: David Carrell; Bradley Malin; John Aberdeen; Samuel Bayer; Cheryl Clark; Ben Wellner; Lynette Hirschman
Journal: J Am Med Inform Assoc Date: 2012-07-06 Impact factor: 4.497

A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

1. Automatic identification of pneumonia related concepts on chest x-ray reports.

2. Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the UMLS.

3. High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge.

4. Lancet: a high precision medication event extraction system for clinical text.

5. Extracting medication information from clinical text.

6. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

7. An overview of MetaMap: historical perspective and recent advances.

8. BioTagger-GM: a gene/protein name recognition system.

9. Development and evaluation of a clinical note section header terminology.

10. MedEx: a medication information extraction system for clinical narratives.

1. Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text.

2. A comprehensive study of named entity recognition in Chinese clinical text.

3. Chronology of your health events: approaches to extracting temporal relations from medical narratives.

4. Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.

5. Interactive Cohort Identification of Sleep Disorder Patients Using Natural Language Processing and i2b2.

6. Automated Assessment of Medical Students' Clinical Exposures according to AAMC Geriatric Competencies.

7. Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.

8. The Sublanguage of Clinical Problem Lists: A Corpus Analysis.

9. Automating Clinical Score Calculation within the Electronic Health Record. A Feasibility Assessment.

10. A study of active learning methods for named entity recognition in clinical text.