Literature DB >> 29854175

Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

Hee-Jin Lee1, Yaoyun Zhang1, Kirk Roberts1, Hua Xu1.   

Abstract

De-identification of clinical notes is a special case of named entity recognition. Supervised machine-learning (ML) algorithms have achieved promising results for this task. However, ML-based de-identification systems often require annotating a large number of clinical notes of interest, which is costly. Domain adaptation (DA) is a technology that enables learning from annotated datasets from different sources, thereby reducing annotation cost required for ML training in the target domain. In this study, we investigate the use of DA methods for deidentification of psychiatric notes. Three state-of-the-art DA methods: instance pruning, instance weighting, and feature augmentation are applied to three source corpora of annotated hospital discharge summaries, outpatient notes, and a mixture of different note types written for diabetic patients. Our results show that DA can increase deidentification performance over the baselines, indicating that it can effectively reduce annotation cost for the target psychiatric notes. Feature augmentation is shown to increase performance the most among the three DA methods. Performance variation among the different types of clinical notes is also observed, showing that a mixture of different types of notes brings the biggest increase in performance.

Entities:  

Mesh:

Year:  2018        PMID: 29854175      PMCID: PMC5977650     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  26 in total

1.  Medical document anonymization with a semantic lexicon.

Authors:  P Ruch; R H Baud; A M Rassinoux; P Bouillon; G Robert
Journal:  Proc AMIA Symp       Date:  2000

2.  Domain adaptation for semantic role labeling in the biomedical domain.

Authors:  Daniel Dahlmeier; Hwee Tou Ng
Journal:  Bioinformatics       Date:  2010-02-23       Impact factor: 6.937

3.  A system for de-identifying medical message board text.

Authors:  Adrian Benton; Shawndra Hill; Lyle Ungar; Annie Chung; Charles Leonard; Cristin Freeman; John H Holmes
Journal:  BMC Bioinformatics       Date:  2011-06-09       Impact factor: 3.169

4.  Replacing personally-identifying information in medical records, the Scrub system.

Authors:  L Sweeney
Journal:  Proc AMIA Annu Fall Symp       Date:  1996

5.  Domain adaptation for semantic role labeling of clinical text.

Authors:  Yaoyun Zhang; Buzhou Tang; Min Jiang; Jingqi Wang; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2015-06-10       Impact factor: 4.497

6.  A hybrid approach to automatic de-identification of psychiatric notes.

Authors:  Hee-Jin Lee; Yonghui Wu; Yaoyun Zhang; Jun Xu; Hua Xu; Kirk Roberts
Journal:  J Biomed Inform       Date:  2017-06-07       Impact factor: 6.317

7.  The MITRE Identification Scrubber Toolkit: design, training, and assessment.

Authors:  John Aberdeen; Samuel Bayer; Reyyan Yeniterzi; Ben Wellner; Cheryl Clark; David Hanauer; Bradley Malin; Lynette Hirschman
Journal:  Int J Med Inform       Date:  2010-10-14       Impact factor: 4.046

Review 8.  Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1.

Authors:  Amber Stubbs; Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-07-28       Impact factor: 6.317

9.  Automatic detection of protected health information from clinic narratives.

Authors:  Hui Yang; Jonathan M Garibaldi
Journal:  J Biomed Inform       Date:  2015-07-29       Impact factor: 6.317

10.  Development and evaluation of an open source software tool for deidentification of pathology reports.

Authors:  Bruce A Beckwith; Rajeshwarri Mahaadevan; Ulysses J Balis; Frank Kuo
Journal:  BMC Med Inform Decis Mak       Date:  2006-03-06       Impact factor: 2.796

View more
  5 in total

1.  Evaluation of Automated Public De-Identification Tools on a Corpus of Radiology Reports.

Authors:  Jackson M Steinkamp; Taylor Pomeranz; Jason Adleberg; Charles E Kahn; Tessa S Cook
Journal:  Radiol Artif Intell       Date:  2020-10-14

2.  Improving domain adaptation in de-identification of electronic health records through self-training.

Authors:  Shun Liao; Jamie Kiros; Jiyang Chen; Zhaolei Zhang; Ting Chen
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

3.  Transferability of neural network clinical deidentification systems.

Authors:  Kahyun Lee; Nicholas J Dobbins; Bridget McInnes; Meliha Yetisgen; Özlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2021-11-25       Impact factor: 7.942

4.  Customization scenarios for de-identification of clinical notes.

Authors:  Tzvika Hartman; Michael D Howell; Jeff Dean; Shlomo Hoory; Ronit Slyper; Itay Laish; Oren Gilon; Danny Vainstein; Greg Corrado; Katherine Chou; Ming Jack Po; Jutta Williams; Scott Ellis; Gavin Bee; Avinatan Hassidim; Rony Amira; Genady Beryozkin; Idan Szpektor; Yossi Matias
Journal:  BMC Med Inform Decis Mak       Date:  2020-01-30       Impact factor: 2.796

Review 5.  Biomedical Ontologies to Guide AI Development in Radiology.

Authors:  Ross W Filice; Charles E Kahn
Journal:  J Digit Imaging       Date:  2021-11-01       Impact factor: 4.903

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.