Literature DB >> 26253132

Active learning: a step towards automating medical concept extraction.

Mahnoosh Kholghi1, Laurianne Sitbon2, Guido Zuccon2, Anthony Nguyen3.   

Abstract

OBJECTIVE: This paper presents an automatic, active learning-based system for the extraction of medical concepts from clinical free-text reports. Specifically, (1) the contribution of active learning in reducing the annotation effort and (2) the robustness of incremental active learning framework across different selection criteria and data sets are determined.
MATERIALS AND METHODS: The comparative performance of an active learning framework and a fully supervised approach were investigated to study how active learning reduces the annotation effort while achieving the same effectiveness as a supervised approach. Conditional random fields as the supervised method, and least confidence and information density as 2 selection criteria for active learning framework were used. The effect of incremental learning vs standard learning on the robustness of the models within the active learning framework with different selection criteria was also investigated. The following 2 clinical data sets were used for evaluation: the Informatics for Integrating Biology and the Bedside/Veteran Affairs (i2b2/VA) 2010 natural language processing challenge and the Shared Annotated Resources/Conference and Labs of the Evaluation Forum (ShARe/CLEF) 2013 eHealth Evaluation Lab.
RESULTS: The annotation effort saved by active learning to achieve the same effectiveness as supervised learning is up to 77%, 57%, and 46% of the total number of sequences, tokens, and concepts, respectively. Compared with the random sampling baseline, the saving is at least doubled.
CONCLUSION: Incremental active learning is a promising approach for building effective and robust medical concept extraction models while significantly reducing the burden of manual annotation.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  active learning; clinical free text; conditional random fields; medical concept extraction; robustness analysis

Mesh:

Year:  2015        PMID: 26253132      PMCID: PMC7784313          DOI: 10.1093/jamia/ocv069

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  7 in total

1.  Natural language processing: algorithms and tools to extract computable information from EHRs and from the biomedical literature.

Authors:  Lucila Ohno-Machado; Prakash Nadkarni; Kevin Johnson
Journal:  J Am Med Inform Assoc       Date:  2013 Sep-Oct       Impact factor: 4.497

2.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

Review 3.  Natural language processing: an introduction.

Authors:  Prakash M Nadkarni; Lucila Ohno-Machado; Wendy W Chapman
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

4.  Automatic extraction of cancer characteristics from free-text pathology reports for cancer notifications.

Authors:  Anthony Nguyen; Julie Moore; Michael Lawley; David Hansen; Shoni Colquist
Journal:  Stud Health Technol Inform       Date:  2011

5.  Active learning for clinical text classification: is it better than random sampling?

Authors:  Rosa L Figueroa; Qing Zeng-Treitler; Long H Ngo; Sergey Goryachev; Eduardo P Wiechmann
Journal:  J Am Med Inform Assoc       Date:  2012-06-15       Impact factor: 4.497

6.  Applying active learning to assertion classification of concepts in clinical text.

Authors:  Yukun Chen; Subramani Mani; Hua Xu
Journal:  J Biomed Inform       Date:  2011-11-22       Impact factor: 6.317

7.  Automatic Classification of Free-Text Radiology Reports to Identify Limb Fractures using Machine Learning and the SNOMED CT Ontology.

Authors:  Guido Zuccon; Amol S Wagholikar; Anthony N Nguyen; Luke Butt; Kevin Chu; Shane Martin; Jaimi Greenslade
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2013-03-18
  7 in total
  7 in total

1.  Using Active Learning to Identify Health Information Technology Related Patient Safety Events.

Authors:  Allan Fong; Jessica L Howe; Katharine T Adams; Raj M Ratwani
Journal:  Appl Clin Inform       Date:  2017-01-18       Impact factor: 2.342

2.  Clinical Document Classification Using Labeled and Unlabeled Data Across Hospitals.

Authors:  Hamed Hassanzadeh; Mahnoosh Kholghi; Anthony Nguyen; Kevin Chu
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

3.  Evaluating active learning methods for annotating semantic predications.

Authors:  Jake Vasilakes; Rubina Rizvi; Genevieve B Melton; Serguei Pakhomov; Rui Zhang
Journal:  JAMIA Open       Date:  2018-06-27

4.  Cost-aware active learning for named entity recognition in clinical text.

Authors:  Qiang Wei; Yukun Chen; Mandana Salimi; Joshua C Denny; Qiaozhu Mei; Thomas A Lasko; Qingxia Chen; Stephen Wu; Amy Franklin; Trevor Cohen; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

5.  Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

Authors:  Wangjin Lee; Jinwook Choi
Journal:  BMC Med Inform Decis Mak       Date:  2019-07-15       Impact factor: 2.796

6.  Deep active learning for classifying cancer pathology reports.

Authors:  Kevin De Angeli; Shang Gao; Mohammed Alawad; Hong-Jun Yoon; Noah Schaefferkoetter; Xiao-Cheng Wu; Eric B Durbin; Jennifer Doherty; Antoinette Stroup; Linda Coyle; Lynne Penberthy; Georgia Tourassi
Journal:  BMC Bioinformatics       Date:  2021-03-09       Impact factor: 3.169

7.  Strategies to Address the Lack of Labeled Data for Supervised Machine Learning Training With Electronic Health Records: Case Study for the Extraction of Symptoms From Clinical Notes.

Authors:  Marie Humbert-Droz; Pritam Mukherjee; Olivier Gevaert
Journal:  JMIR Med Inform       Date:  2022-03-14
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.