Literature DB >> 22239956

Using an ensemble system to improve concept extraction from clinical records.

Ning Kang1, Zubair Afzal, Bharat Singh, Erik M van Mulligen, Jan A Kors.   

Abstract

Recognition of medical concepts is a basic step in information extraction from clinical records. We wished to improve on the performance of a variety of concept recognition systems by combining their individual results. We selected two dictionary-based systems and five statistical-based systems that were trained to annotate medical problems, tests, and treatments in clinical records. Manually annotated clinical records for training and testing were made available through the 2010 i2b2/VA (Informatics for Integrating Biology and the Bedside) challenge. Results of individual systems were combined by a simple voting scheme. The statistical systems were trained on a set of 349 records. Performance (precision, recall, F-score) was assessed on a test set of 477 records, using varying voting thresholds. The combined annotation system achieved a best F-score of 82.2% (recall 81.2%, precision 83.3%) on the test set, a score that ranks third among 22 participants in the i2b2/VA concept annotation task. The ensemble system had better precision and recall than any of the individual systems, yielding an F-score that is 4.6% point higher than the best single system. Changing the voting threshold offered a simple way to obtain a system with high precision (and moderate recall) or one with high recall (and moderate precision). The ensemble-based approach is straightforward and allows the balancing of precision versus recall of the combined system. The ensemble system is freely available and can easily be extended, integrated in other systems, and retrained.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 22239956     DOI: 10.1016/j.jbi.2011.12.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  12 in total

1.  Ensembles of NLP Tools for Data Element Extraction from Clinical Notes.

Authors:  Tsung-Ting Kuo; Pallavi Rao; Cleo Maehara; Son Doan; Juan D Chaparro; Michele E Day; Claudiu Farcas; Lucila Ohno-Machado; Chun-Nan Hsu
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  A Study of Concept Extraction Across Different Types of Clinical Notes.

Authors:  Youngjun Kim; Ellen Riloff; John F Hurdle
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

3.  Unsupervised biomedical named entity recognition: experiments with clinical and biological texts.

Authors:  Shaodian Zhang; Noémie Elhadad
Journal:  J Biomed Inform       Date:  2013-08-15       Impact factor: 6.317

4.  Automatically Detecting Acute Myocardial Infarction Events from EHR Text: A Preliminary Study.

Authors:  Jiaping Zheng; Jorge Yarzebski; Balaji Polepalli Ramesh; Robert J Goldberg; Hong Yu
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

5.  Ensembles of natural language processing systems for portable phenotyping solutions.

Authors:  Cong Liu; Casey N Ta; James R Rogers; Ziran Li; Junghwan Lee; Alex M Butler; Ning Shang; Fabricio Sampaio Peres Kury; Liwei Wang; Feichen Shen; Hongfang Liu; Lyudmila Ena; Carol Friedman; Chunhua Weng
Journal:  J Biomed Inform       Date:  2019-10-23       Impact factor: 6.317

6.  Combining rules and machine learning for extraction of temporal expressions and events from clinical narratives.

Authors:  Aleksandar Kovacevic; Azad Dehghan; Michele Filannino; John A Keane; Goran Nenadic
Journal:  J Am Med Inform Assoc       Date:  2013-04-20       Impact factor: 4.497

7.  Concept selection for phenotypes and diseases using learn to rank.

Authors:  Nigel Collier; Anika Oellrich; Tudor Groza
Journal:  J Biomed Semantics       Date:  2015-06-01

8.  Biomedical literature classification using encyclopedic knowledge: a Wikipedia-based bag-of-concepts approach.

Authors:  Marcos Antonio Mouriño García; Roberto Pérez Rodríguez; Luis E Anido Rifón
Journal:  PeerJ       Date:  2015-09-29       Impact factor: 2.984

9.  Mission and Sustainability of Informatics for Integrating Biology and the Bedside (i2b2).

Authors:  Shawn Murphy; Adam Wilcox
Journal:  EGEMS (Wash DC)       Date:  2014-09-11

10.  Biomolecular Relationships Discovered from Biological Labyrinth and Lost in Ocean of Literature: Community Efforts Can Rescue Until Automated Artificial Intelligence Takes Over.

Authors:  Rajinder Gupta; Shrikant S Mantri
Journal:  Front Genet       Date:  2016-03-31       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.