Literature DB >> 28614702

De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.

Amber Stubbs1, Michele Filannino2, Özlem Uzuner3.   

Abstract

The 2016 CEGS N-GRID shared tasks for clinical records contained three tracks. Track 1 focused on de-identification of a new corpus of 1000 psychiatric intake records. This track tackled de-identification in two sub-tracks: Track 1.A was a "sight unseen" task, where nine teams ran existing de-identification systems, without any modifications or training, on 600 new records in order to gauge how well systems generalize to new data. The best-performing system for this track scored an F1 of 0.799. Track 1.B was a traditional Natural Language Processing (NLP) shared task on de-identification, where 15 teams had two months to train their systems on the new data, then test it on an unannotated test set. The best-performing system from this track scored an F1 of 0.914. The scores for Track 1.A show that unmodified existing systems do not generalize well to new data without the benefit of training data. The scores for Track 1.B are slightly lower than the 2014 de-identification shared task (which was almost identical to 2016 Track 1.B), indicating that these new psychiatric records pose a more difficult challenge to NLP systems. Overall, de-identification is still not a solved problem, though it is important to the future of clinical NLP.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Clinical records; Machine learning; Natural language processing; Shared task

Mesh:

Year:  2017        PMID: 28614702      PMCID: PMC5705537          DOI: 10.1016/j.jbi.2017.06.011

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  17 in total

1.  Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text.

Authors:  David Carrell; Bradley Malin; John Aberdeen; Samuel Bayer; Cheryl Clark; Ben Wellner; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2012-07-06       Impact factor: 4.497

2.  Rapidly retargetable approaches to de-identification in medical records.

Authors:  Ben Wellner; Matt Huyck; Scott Mardis; John Aberdeen; Alex Morgan; Leonid Peshkin; Alex Yeh; Janet Hitzeman; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

3.  Using machine learning for concept extraction on clinical documents from multiple data sources.

Authors:  Manabu Torii; Kavishwar Wagholikar; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2011-06-27       Impact factor: 4.497

4.  De-identification of medical records using conditional random fields and long short-term memory networks.

Authors:  Zhipeng Jiang; Chao Zhao; Bin He; Yi Guan; Jingchi Jiang
Journal:  J Biomed Inform       Date:  2017-10-13       Impact factor: 6.317

5.  Automatic de-identification of electronic medical records using token-level and character-level conditional random fields.

Authors:  Zengjian Liu; Yangxin Chen; Buzhou Tang; Xiaolong Wang; Qingcai Chen; Haodi Li; Jingfeng Wang; Qiwen Deng; Suisong Zhu
Journal:  J Biomed Inform       Date:  2015-06-26       Impact factor: 6.317

6.  A hybrid approach to automatic de-identification of psychiatric notes.

Authors:  Hee-Jin Lee; Yonghui Wu; Yaoyun Zhang; Jun Xu; Hua Xu; Kirk Roberts
Journal:  J Biomed Inform       Date:  2017-06-07       Impact factor: 6.317

7.  De-identification of clinical notes via recurrent neural network and conditional random field.

Authors:  Zengjian Liu; Buzhou Tang; Xiaolong Wang; Qingcai Chen
Journal:  J Biomed Inform       Date:  2017-06-01       Impact factor: 6.317

Review 8.  Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1.

Authors:  Amber Stubbs; Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-07-28       Impact factor: 6.317

9.  De-identification of patient notes with recurrent neural networks.

Authors:  Franck Dernoncourt; Ji Young Lee; Ozlem Uzuner; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

10.  Combining knowledge- and data-driven methods for de-identification of clinical narratives.

Authors:  Azad Dehghan; Aleksandar Kovacevic; George Karystianis; John A Keane; Goran Nenadic
Journal:  J Biomed Inform       Date:  2015-07-22       Impact factor: 6.317

View more
  34 in total

1.  Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation.

Authors:  Hee-Jin Lee; Yaoyun Zhang; Kirk Roberts; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings.

Authors:  Xi Yang; Tianchen Lyu; Chih-Yin Lee; Jiang Bian; William R Hogan; Yonghui Wu
Journal:  IEEE Int Conf Healthc Inform       Date:  2019-11-21

3.  Ensemble method-based extraction of medication and related information from clinical texts.

Authors:  Youngjun Kim; Stéphane M Meystre
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

4.  Ensemble-based Methods to Improve De-identification of Electronic Health Record Narratives.

Authors:  Youngjun Kim; Paul Heider; Stéphane Meystre
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

5.  Efficient Active Learning for Electronic Medical Record De-identification.

Authors:  Muqun Li; Martin Scaiano; Khaled El Emam; Bradley A Malin
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06

6.  Comparative Study of Various Approaches for Ensemble-based De-identification of Electronic Health Record Narratives.

Authors:  Youngjun Kim; Paul M Heider; Stéphane M Meystre
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

7.  Exploring associations of clinical and social parameters with violent behaviors among psychiatric patients.

Authors:  Hong-Jie Dai; Emily Chia-Yu Su; Mohy Uddin; Jitendra Jonnagaddala; Chi-Shin Wu; Shabbir Syed-Abdul
Journal:  J Biomed Inform       Date:  2017-08-16       Impact factor: 6.317

8.  2018 n2c2 shared task on adverse drug events and medication extraction in electronic health records.

Authors:  Sam Henry; Kevin Buchan; Michele Filannino; Amber Stubbs; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

9.  A Comparative Analysis of Speed and Accuracy for Three Off-the-Shelf De-Identification Tools.

Authors:  Paul M Heider; Jihad S Obeid; Stéphane M Meystre
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2020-05-30

10.  De-identification of Clinical Text via Bi-LSTM-CRF with Neural Language Models.

Authors:  Buzhou Tang; Dehuan Jiang; Qingcai Chen; Xiaolong Wang; Jun Yan; Ying Shen
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.