Literature DB >> 24859154

Transfer learning based clinical concept extraction on data from multiple sources.

Xinbo Lv1, Yi Guan2, Benyang Deng1.   

Abstract

Machine learning methods usually assume that training data and test data are drawn from the same distribution. However, this assumption often cannot be satisfied in the task of clinical concept extraction. The main aim of this paper was to use training data from one institution to build a concept extraction model for data from another institution with a different distribution. An instance-based transfer learning method, TrAdaBoost, was applied in this work. To prevent the occurrence of a negative transfer phenomenon with TrAdaBoost, we integrated it with Bagging, which provides a "softer" weights update mechanism with only a tiny amount of training data from the target domain. Two data sets named BETH and PARTNERS from the 2010 i2b2/VA challenge as well as BETHBIO, a data set we constructed ourselves, were employed to show the effectiveness of our work's transfer ability. Our method outperforms the baseline model by 2.3% and 4.4% when the baseline model is trained by training data that are combined from the source domain and the target domain in two experiments of BETH vs. PARTNERS and BETHBIO vs. PARTNERS, respectively. Additionally, confidence intervals for the performance metrics suggest that our method's results have statistical significance. Moreover, we explore the applicability of our method for further experiments. With our method, only a tiny amount of labeled data from the target domain is required to build a concept extraction model that produces better performance.
Copyright © 2014 Elsevier Inc. All rights reserved.

Keywords:  Bagging; Clinical concept extraction; Machine learning; TrAdaBoost; Transfer learning

Mesh:

Year:  2014        PMID: 24859154     DOI: 10.1016/j.jbi.2014.05.006

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  5 in total

1.  CRFs based de-identification of medical records.

Authors:  Bin He; Yi Guan; Jianyi Cheng; Keting Cen; Wenlan Hua
Journal:  J Biomed Inform       Date:  2015-08-24       Impact factor: 6.317

2.  Clinical Document Classification Using Labeled and Unlabeled Data Across Hospitals.

Authors:  Hamed Hassanzadeh; Mahnoosh Kholghi; Anthony Nguyen; Kevin Chu
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

3.  Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes.

Authors:  Rui Zhang; Jialin Liu; Yong Huang; Miye Wang; Qingke Shi; Jun Chen; Zhi Zeng
Journal:  BMC Med Inform Decis Mak       Date:  2017-05-02       Impact factor: 2.796

4.  Forecasting adverse surgical events using self-supervised transfer learning for physiological signals.

Authors:  Hugh Chen; Scott M Lundberg; Gabriel Erion; Jerry H Kim; Su-In Lee
Journal:  NPJ Digit Med       Date:  2021-12-08

5.  Federated Learning Approach with Pre-Trained Deep Learning Models for COVID-19 Detection from Unsegmented CT images.

Authors:  Lucian Mihai Florescu; Costin Teodor Streba; Mircea-Sebastian Şerbănescu; Mădălin Mămuleanu; Dan Nicolae Florescu; Rossy Vlăduţ Teică; Raluca Elena Nica; Ioana Andreea Gheonea
Journal:  Life (Basel)       Date:  2022-06-26
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.