Literature DB >> 32308882

De-identification of Clinical Text via Bi-LSTM-CRF with Neural Language Models.

Buzhou Tang1,2, Dehuan Jiang1, Qingcai Chen1, Xiaolong Wang1, Jun Yan3, Ying Shen4.   

Abstract

De-identification of clinical text, the prerequisite of electronic clinical data reuse, is a typical named entity recogni tion (NER) problem. A number of state-of-the-art deep learning methods for NER, such as Bi-LSTM-CRF (bidirec tional long-short-term-memory conditional random fields), have been applied for de-identification. Neural language models used for language representation bring great improvement in lots of NLP tasks when they are integrated with other deep learning methods. In this paper, we introduce Bi-LSTM-CRF with neural language models for de- identification of clinical text, and evaluate it on the de-identification datasets of the i2b2 2014 and the CEGS N- GRID 2016 challenges. Four neural language models of three types individually integrated with Bi-LSTM-CRF are compared in this study. Bi-LSTM-CRF with neural language models achieves the highest "strict" micro-averaged F1-score of 95.50% on the i2b2 2014 dataset and 91.82% on the CEGS N-GRID 2016 dataset, becoming new benchmark results on these two datasets respectively Keywords: De-identification, Named entity recognition, Bidirectional long-short-term-memory, Conditional ran dom fields, Neural language models. ©2019 AMIA - All rights reserved.

Year:  2020        PMID: 32308882      PMCID: PMC7153082     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  10 in total

1.  Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.

Authors:  Amber Stubbs; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-08-28       Impact factor: 6.317

2.  Health Insurance Portability and Accountability Act of 1996. Public Law 104-191.

Authors: 
Journal:  US Statut Large       Date:  1996-08-21

Review 3.  De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.

Authors:  Amber Stubbs; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-11       Impact factor: 6.317

4.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

5.  Ensemble-based Methods to Improve De-identification of Electronic Health Record Narratives.

Authors:  Youngjun Kim; Paul Heider; Stéphane Meystre
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

6.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks.

Authors:  Özlem Uzuner; Amber Stubbs
Journal:  J Biomed Inform       Date:  2015-10-24       Impact factor: 6.317

7.  De-identification of clinical notes via recurrent neural network and conditional random field.

Authors:  Zengjian Liu; Buzhou Tang; Xiaolong Wang; Qingcai Chen
Journal:  J Biomed Inform       Date:  2017-06-01       Impact factor: 6.317

8.  De-identification of patient notes with recurrent neural networks.

Authors:  Franck Dernoncourt; Ji Young Lee; Ozlem Uzuner; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

9.  Entity recognition from clinical texts via recurrent neural network.

Authors:  Zengjian Liu; Ming Yang; Xiaolong Wang; Qingcai Chen; Buzhou Tang; Zhe Wang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2017-07-05       Impact factor: 2.796

10.  Leveraging text skeleton for de-identification of electronic medical records.

Authors:  Yue-Shu Zhao; Kun-Li Zhang; Hong-Chao Ma; Kun Li
Journal:  BMC Med Inform Decis Mak       Date:  2018-03-22       Impact factor: 2.796

  10 in total
  2 in total

1.  Improving domain adaptation in de-identification of electronic health records through self-training.

Authors:  Shun Liao; Jamie Kiros; Jiyang Chen; Zhaolei Zhang; Ting Chen
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

2.  An Efficient Method for Deidentifying Protected Health Information in Chinese Electronic Health Records: Algorithm Development and Validation.

Authors:  Peng Wang; Yong Li; Liang Yang; Simin Li; Linfeng Li; Zehan Zhao; Shaopei Long; Fei Wang; Hongqian Wang; Ying Li; Chengliang Wang
Journal:  JMIR Med Inform       Date:  2022-08-30
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.