Literature DB >> 22127105

Applying active learning to assertion classification of concepts in clinical text.

Yukun Chen1, Subramani Mani, Hua Xu.   

Abstract

Supervised machine learning methods for clinical natural language processing (NLP) research require a large number of annotated samples, which are very expensive to build because of the involvement of physicians. Active learning, an approach that actively samples from a large pool, provides an alternative solution. Its major goal in classification is to reduce the annotation effort while maintaining the quality of the predictive model. However, few studies have investigated its uses in clinical NLP. This paper reports an application of active learning to a clinical text classification task: to determine the assertion status of clinical concepts. The annotated corpus for the assertion classification task in the 2010 i2b2/VA Clinical NLP Challenge was used in this study. We implemented several existing and newly developed active learning algorithms and assessed their uses. The outcome is reported in the global ALC score, based on the Area under the average Learning Curve of the AUC (Area Under the Curve) score. Results showed that when the same number of annotated samples was used, active learning strategies could generate better classification models (best ALC-0.7715) than the passive learning method (random sampling) (ALC-0.7411). Moreover, to achieve the same classification performance, active learning strategies required fewer samples than the random sampling method. For example, to achieve an AUC of 0.79, the random sampling method used 32 samples, while our best active learning algorithm required only 12 samples, a reduction of 62.5% in manual annotation effort. Copyright Â
© 2011 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 22127105      PMCID: PMC3306548          DOI: 10.1016/j.jbi.2011.11.003

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  6 in total

1.  Active learning with support vector machine applied to gene expression data for cancer classification.

Authors:  Ying Liu
Journal:  J Chem Inf Comput Sci       Date:  2004 Nov-Dec

Review 2.  Extracting information from textual documents in the electronic health record: a review of recent research.

Authors:  S M Meystre; G K Savova; K C Kipper-Schuler; J F Hurdle
Journal:  Yearb Med Inform       Date:  2008

3.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

4.  A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Authors:  Min Jiang; Yukun Chen; Mei Liu; S Trent Rosenbloom; Subramani Mani; Joshua C Denny; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2011-04-20       Impact factor: 4.497

5.  Community annotation experiment for ground truth generation for the i2b2 medication challenge.

Authors:  Ozlem Uzuner; Imre Solti; Fei Xia; Eithon Cadag
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

6.  A general natural-language text processor for clinical radiology.

Authors:  C Friedman; P O Alderson; J H Austin; J J Cimino; S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1994 Mar-Apr       Impact factor: 4.497

  6 in total
  14 in total

1.  Active Learning-based corpus annotation--the PathoJen experience.

Authors:  Udo Hahn; Elena Beisswanger; Ekaterina Buyko; Erik Faessler
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

2.  Applying active learning to supervised word sense disambiguation in MEDLINE.

Authors:  Yukun Chen; Hongxin Cao; Qiaozhu Mei; Kai Zheng; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2013-01-30       Impact factor: 4.497

3.  Assisted annotation of medical free text using RapTAT.

Authors:  Glenn T Gobbel; Jennifer Garvin; Ruth Reeves; Robert M Cronin; Julia Heavirland; Jenifer Williams; Allison Weaver; Shrimalini Jayaramaraja; Dario Giuse; Theodore Speroff; Steven H Brown; Hua Xu; Michael E Matheny
Journal:  J Am Med Inform Assoc       Date:  2014-01-15       Impact factor: 4.497

4.  A study of active learning methods for named entity recognition in clinical text.

Authors:  Yukun Chen; Thomas A Lasko; Qiaozhu Mei; Joshua C Denny; Hua Xu
Journal:  J Biomed Inform       Date:  2015-09-15       Impact factor: 6.317

5.  Active learning: a step towards automating medical concept extraction.

Authors:  Mahnoosh Kholghi; Laurianne Sitbon; Guido Zuccon; Anthony Nguyen
Journal:  J Am Med Inform Assoc       Date:  2015-08-07       Impact factor: 4.497

6.  Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management.

Authors:  Vijay Garla; Caroline Taylor; Cynthia Brandt
Journal:  J Biomed Inform       Date:  2013-07-08       Impact factor: 6.317

7.  An active learning-enabled annotation system for clinical named entity recognition.

Authors:  Yukun Chen; Thomas A Lask; Qiaozhu Mei; Qingxia Chen; Sungrim Moon; Jingqi Wang; Ky Nguyen; Tolulola Dawodu; Trevor Cohen; Joshua C Denny; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2017-07-05       Impact factor: 2.796

8.  Using multiclass classification to automate the identification of patient safety incident reports by type and severity.

Authors:  Ying Wang; Enrico Coiera; William Runciman; Farah Magrabi
Journal:  BMC Med Inform Decis Mak       Date:  2017-06-12       Impact factor: 2.796

9.  Adaptive Dimensionality Reduction with Semi-Supervision (AdDReSS): Classifying Multi-Attribute Biomedical Data.

Authors:  George Lee; David Edmundo Romo Bucheli; Anant Madabhushi
Journal:  PLoS One       Date:  2016-07-15       Impact factor: 3.240

10.  Large-Scale Discovery of Disease-Disease and Disease-Gene Associations.

Authors:  Djordje Gligorijevic; Jelena Stojanovic; Nemanja Djuric; Vladan Radosavljevic; Mihajlo Grbovic; Rob J Kulathinal; Zoran Obradovic
Journal:  Sci Rep       Date:  2016-08-31       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.