Literature DB >> 28602904

A hybrid approach to automatic de-identification of psychiatric notes.

Hee-Jin Lee1, Yonghui Wu1, Yaoyun Zhang1, Jun Xu1, Hua Xu2, Kirk Roberts3.   

Abstract

De-identification, or identifying and removing protected health information (PHI) from clinical data, is a critical step in making clinical data available for clinical applications and research. This paper presents a natural language processing system for automatic de-identification of psychiatric notes, which was designed to participate in the 2016 CEGS N-GRID shared task Track 1. The system has a hybrid structure that combines machine leaning techniques and rule-based approaches. The rule-based components exploit the structure of the psychiatric notes as well as characteristic surface patterns of PHI mentions. The machine learning components utilize supervised learning with rich features. In addition, the system performance was boosted with integration of additional data to the training set through domain adaptation. The hybrid system showed overall micro-averaged F-score 90.74 on the test set, second-best among all the participants of the CEGS N-GRID task.
Copyright © 2017. Published by Elsevier Inc.

Entities:  

Keywords:  De-identification; Natural language processing; Psychiatric notes

Mesh:

Year:  2017        PMID: 28602904      PMCID: PMC5705430          DOI: 10.1016/j.jbi.2017.06.006

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  22 in total

1.  Medical document anonymization with a semantic lexicon.

Authors:  P Ruch; R H Baud; A M Rassinoux; P Bouillon; G Robert
Journal:  Proc AMIA Symp       Date:  2000

2.  A de-identifier for medical discharge summaries.

Authors:  Ozlem Uzuner; Tawanda C Sibanda; Yuan Luo; Peter Szolovits
Journal:  Artif Intell Med       Date:  2007-11-28       Impact factor: 5.326

3.  A system for de-identifying medical message board text.

Authors:  Adrian Benton; Shawndra Hill; Lyle Ungar; Annie Chung; Charles Leonard; Cristin Freeman; John H Holmes
Journal:  BMC Bioinformatics       Date:  2011-06-09       Impact factor: 3.169

4.  Replacing personally-identifying information in medical records, the Scrub system.

Authors:  L Sweeney
Journal:  Proc AMIA Annu Fall Symp       Date:  1996

5.  Domain adaptation for semantic role labeling of clinical text.

Authors:  Yaoyun Zhang; Buzhou Tang; Min Jiang; Jingqi Wang; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2015-06-10       Impact factor: 4.497

6.  The MITRE Identification Scrubber Toolkit: design, training, and assessment.

Authors:  John Aberdeen; Samuel Bayer; Reyyan Yeniterzi; Ben Wellner; Cheryl Clark; David Hanauer; Bradley Malin; Lynette Hirschman
Journal:  Int J Med Inform       Date:  2010-10-14       Impact factor: 4.046

Review 7.  Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1.

Authors:  Amber Stubbs; Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-07-28       Impact factor: 6.317

8.  De-identification of patient notes with recurrent neural networks.

Authors:  Franck Dernoncourt; Ji Young Lee; Ozlem Uzuner; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

9.  Automatic detection of protected health information from clinic narratives.

Authors:  Hui Yang; Jonathan M Garibaldi
Journal:  J Biomed Inform       Date:  2015-07-29       Impact factor: 6.317

10.  Development and evaluation of an open source software tool for deidentification of pathology reports.

Authors:  Bruce A Beckwith; Rajeshwarri Mahaadevan; Ulysses J Balis; Frank Kuo
Journal:  BMC Med Inform Decis Mak       Date:  2006-03-06       Impact factor: 2.796

View more
  12 in total

1.  Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

Authors:  Hee-Jin Lee; Yaoyun Zhang; Kirk Roberts; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

Review 2.  De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.

Authors:  Amber Stubbs; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-11       Impact factor: 6.317

3.  Ensemble-based Methods to Improve De-identification of Electronic Health Record Narratives.

Authors:  Youngjun Kim; Paul Heider; Stéphane Meystre
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

4.  A natural language processing challenge for clinical records: Research Domains Criteria (RDoC) for psychiatry.

Authors:  Özlem Uzuner; Amber Stubbs; Michele Filannino
Journal:  J Biomed Inform       Date:  2017-10-16       Impact factor: 6.317

5.  Natural Language Processing for Enterprise-scale De-identification of Protected Health Information in Clinical Notes.

Authors:  Noor Abu-El-Rub; Jay Urbain; George Kowalski; Kristen Osinski; Robert Spaniol; Mei Liu; Bradley Taylor; Lemuel R Waitman
Journal:  AMIA Annu Symp Proc       Date:  2022-05-23

6.  Building a best-in-class automated de-identification tool for electronic health records through ensemble learning.

Authors:  Karthik Murugadoss; Ajit Rajasekharan; Bradley Malin; Vineet Agarwal; Sairam Bade; Jeff R Anderson; Jason L Ross; William A Faubion; John D Halamka; Venky Soundararajan; Sankar Ardhanari
Journal:  Patterns (N Y)       Date:  2021-05-12

Review 7.  Clinical concept extraction: A methodology review.

Authors:  Sunyang Fu; David Chen; Huan He; Sijia Liu; Sungrim Moon; Kevin J Peterson; Feichen Shen; Liwei Wang; Yanshan Wang; Andrew Wen; Yiqing Zhao; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2020-08-06       Impact factor: 6.317

8.  Improving domain adaptation in de-identification of electronic health records through self-training.

Authors:  Shun Liao; Jamie Kiros; Jiyang Chen; Zhaolei Zhang; Ting Chen
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

9.  Transferability of neural network clinical deidentification systems.

Authors:  Kahyun Lee; Nicholas J Dobbins; Bridget McInnes; Meliha Yetisgen; Özlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2021-11-25       Impact factor: 7.942

10.  A study of deep learning methods for de-identification of clinical notes in cross-institute settings.

Authors:  Xi Yang; Tianchen Lyu; Qian Li; Chih-Yin Lee; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-05       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.