Literature DB >> 33936439

Comparative Study of Various Approaches for Ensemble-based De-identification of Electronic Health Record Narratives.

Youngjun Kim1, Paul M Heider1, Stéphane M Meystre1,2.   

Abstract

De-identification of electric health record narratives is a fundamental task applying natural language processing to better protect patient information privacy. We explore different types of ensemble learning methods to improve clinical text de-identification. We present two ensemble-based approaches for combining multiple predictive models. The first method selects an optimal subset of de-identification models by greedy exclusion. This ensemble pruning allows one to save computational time or physical resources while achieving similar or better performance than the ensemble of all members. The second method uses a sequence of words to train a sequential model. For this sequence labelling-based stacked ensemble, we employ search-based structured prediction and bidirectional long short-term memory algorithms. We create ensembles consisting of de-identification models trained on two clinical text corpora. Experimental results show that our ensemble systems can effectively integrate predictions from individual models and offer better generalization across two different corpora. ©2020 AMIA - All rights reserved.

Entities:  

Mesh:

Year:  2021        PMID: 33936439      PMCID: PMC8075417     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  17 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Evaluating the state-of-the-art in automatic de-identification.

Authors:  Ozlem Uzuner; Yuan Luo; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

Review 3.  De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.

Authors:  Amber Stubbs; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-11       Impact factor: 6.317

4.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

5.  A Study of Medical Problem Extraction for Better Disease Management.

Authors:  Youngjun Kim; Stéphane M Meystre
Journal:  Stud Health Technol Inform       Date:  2019-08-21

6.  An Empirical Test of GRUs and Deep Contextualized Word Representations on De-Identification.

Authors:  Kahyun Lee; Michele Filannino; Özlem Uzuner
Journal:  Stud Health Technol Inform       Date:  2019-08-21

7.  A Study of Concept Extraction Across Different Types of Clinical Notes.

Authors:  Youngjun Kim; Ellen Riloff; John F Hurdle
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

Review 8.  Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1.

Authors:  Amber Stubbs; Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-07-28       Impact factor: 6.317

9.  Automated de-identification of free-text medical records.

Authors:  Ishna Neamatullah; Margaret M Douglass; Li-wei H Lehman; Andrew Reisner; Mauricio Villarroel; William J Long; Peter Szolovits; George B Moody; Roger G Mark; Gari D Clifford
Journal:  BMC Med Inform Decis Mak       Date:  2008-07-24       Impact factor: 2.796

10.  Automatic detection of protected health information from clinic narratives.

Authors:  Hui Yang; Jonathan M Garibaldi
Journal:  J Biomed Inform       Date:  2015-07-29       Impact factor: 6.317

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.