Literature DB >> 28870380

Active learning reduces annotation time for clinical concept extraction.

Mahnoosh Kholghi1, Laurianne Sitbon2, Guido Zuccon3, Anthony Nguyen4.   

Abstract

OBJECTIVE: To investigate: (1) the annotation time savings by various active learning query strategies compared to supervised learning and a random sampling baseline, and (2) the benefits of active learning-assisted pre-annotations in accelerating the manual annotation process compared to de novo annotation.
MATERIALS AND METHODS: There are 73 and 120 discharge summary reports provided by Beth Israel institute in the train and test sets of the concept extraction task in the i2b2/VA 2010 challenge, respectively. The 73 reports were used in user study experiments for manual annotation. First, all sequences within the 73 reports were manually annotated from scratch. Next, active learning models were built to generate pre-annotations for the sequences selected by a query strategy. The annotation/reviewing time per sequence was recorded. The 120 test reports were used to measure the effectiveness of the active learning models.
RESULTS: When annotating from scratch, active learning reduced the annotation time up to 35% and 28% compared to a fully supervised approach and a random sampling baseline, respectively. Reviewing active learning-assisted pre-annotations resulted in 20% further reduction of the annotation time when compared to de novo annotation. DISCUSSION: The number of concepts that require manual annotation is a good indicator of the annotation time for various active learning approaches as demonstrated by high correlation between time rate and concept annotation rate.
CONCLUSION: Active learning has a key role in reducing the time required to manually annotate domain concepts from clinical free text, either when annotating from scratch or reviewing active learning-assisted pre-annotations.
Copyright © 2017 Elsevier B.V. All rights reserved.

Keywords:  Active learning; Annotation time; Clinical free text; Concept extraction; Machine-assisted pre-annotation

Mesh:

Year:  2017        PMID: 28870380     DOI: 10.1016/j.ijmedinf.2017.08.001

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  1 in total

1.  Cost-aware active learning for named entity recognition in clinical text.

Authors:  Qiang Wei; Yukun Chen; Mandana Salimi; Joshua C Denny; Qiaozhu Mei; Thomas A Lasko; Qingxia Chen; Stephen Wu; Amy Franklin; Trevor Cohen; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.