Literature DB >> 29032162

De-identification of medical records using conditional random fields and long short-term memory networks.

Zhipeng Jiang1, Chao Zhao2, Bin He3, Yi Guan4, Jingchi Jiang5.   

Abstract

The CEGS N-GRID 2016 Shared Task 1 in Clinical Natural Language Processing focuses on the de-identification of psychiatric evaluation records. This paper describes two participating systems of our team, based on conditional random fields (CRFs) and long short-term memory networks (LSTMs). A pre-processing module was introduced for sentence detection and tokenization before de-identification. For CRFs, manually extracted rich features were utilized to train the model. For LSTMs, a character-level bi-directional LSTM network was applied to represent tokens and classify tags for each token, following which a decoding layer was stacked to decode the most probable protected health information (PHI) terms. The LSTM-based system attained an i2b2 strict micro-F1 measure of 0.8986, which was higher than that of the CRF-based system.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Conditional random fields; De-identification; Long short-term memory networks; Protected health information

Mesh:

Year:  2017        PMID: 29032162      PMCID: PMC5890009          DOI: 10.1016/j.jbi.2017.10.003

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  14 in total

1.  CRFs based de-identification of medical records.

Authors:  Bin He; Yi Guan; Jianyi Cheng; Keting Cen; Wenlan Hua
Journal:  J Biomed Inform       Date:  2015-08-24       Impact factor: 6.317

2.  Rapidly retargetable approaches to de-identification in medical records.

Authors:  Ben Wellner; Matt Huyck; Scott Mardis; John Aberdeen; Alex Morgan; Leonid Peshkin; Alex Yeh; Janet Hitzeman; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

3.  Learning long-term dependencies with gradient descent is difficult.

Authors:  Y Bengio; P Simard; P Frasconi
Journal:  IEEE Trans Neural Netw       Date:  1994

Review 4.  De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.

Authors:  Amber Stubbs; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-11       Impact factor: 6.317

5.  LSTM: A Search Space Odyssey.

Authors:  Klaus Greff; Rupesh K Srivastava; Jan Koutnik; Bas R Steunebrink; Jurgen Schmidhuber
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2016-07-08       Impact factor: 10.451

Review 6.  Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1.

Authors:  Amber Stubbs; Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-07-28       Impact factor: 6.317

7.  Can physicians recognize their own patients in de-identified notes?

Authors:  Stéphane Meystre; Shuying Shen; Deborah Hofmann; Adi Gundlapalli
Journal:  Stud Health Technol Inform       Date:  2014

8.  De-identification of patient notes with recurrent neural networks.

Authors:  Franck Dernoncourt; Ji Young Lee; Ozlem Uzuner; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

9.  The emergence of national electronic health record architectures in the United States and Australia: models, costs, and questions.

Authors:  Tracy D Gunter; Nicolas P Terry
Journal:  J Med Internet Res       Date:  2005-03-14       Impact factor: 5.428

10.  Automatic detection of protected health information from clinic narratives.

Authors:  Hui Yang; Jonathan M Garibaldi
Journal:  J Biomed Inform       Date:  2015-07-29       Impact factor: 6.317

View more
  5 in total

Review 1.  De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.

Authors:  Amber Stubbs; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-11       Impact factor: 6.317

2.  A natural language processing challenge for clinical records: Research Domains Criteria (RDoC) for psychiatry.

Authors:  Özlem Uzuner; Amber Stubbs; Michele Filannino
Journal:  J Biomed Inform       Date:  2017-10-16       Impact factor: 6.317

Review 3.  Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks.

Authors:  Michele Filannino; Özlem Uzuner
Journal:  Yearb Med Inform       Date:  2018-08-29

4.  A study of deep learning methods for de-identification of clinical notes in cross-institute settings.

Authors:  Xi Yang; Tianchen Lyu; Qian Li; Chih-Yin Lee; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-05       Impact factor: 2.796

5.  Publicly available machine learning models for identifying opioid misuse from the clinical notes of hospitalized patients.

Authors:  Brihat Sharma; Dmitriy Dligach; Kristin Swope; Elizabeth Salisbury-Afshar; Niranjan S Karnik; Cara Joyce; Majid Afshar
Journal:  BMC Med Inform Decis Mak       Date:  2020-04-29       Impact factor: 3.298

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.