Literature DB >> 26315662

CRFs based de-identification of medical records.

Bin He1, Yi Guan2, Jianyi Cheng1, Keting Cen1, Wenlan Hua1.   

Abstract

De-identification is a shared task of the 2014 i2b2/UTHealth challenge. The purpose of this task is to remove protected health information (PHI) from medical records. In this paper, we propose a novel de-identifier, WI-deId, based on conditional random fields (CRFs). A preprocessing module, which tokenizes the medical records using regular expressions and an off-the-shelf tokenizer, is introduced, and three groups of features are extracted to train the de-identifier model. The experiment shows that our system is effective in the de-identification of medical records, achieving a micro-F1 of 0.9232 at the i2b2 strict entity evaluation level.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Conditional random fields; De-identification; Medical records; Protected health information

Mesh:

Year:  2015        PMID: 26315662      PMCID: PMC4988860          DOI: 10.1016/j.jbi.2015.08.012

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  8 in total

1.  A de-identifier for medical discharge summaries.

Authors:  Ozlem Uzuner; Tawanda C Sibanda; Yuan Luo; Peter Szolovits
Journal:  Artif Intell Med       Date:  2007-11-28       Impact factor: 5.326

2.  State-of-the-art anonymization of medical records using an iterative machine learning framework.

Authors:  György Szarvas; Richárd Farkas; Róbert Busa-Fekete
Journal:  J Am Med Inform Assoc       Date:  2007 Sep-Oct       Impact factor: 4.497

3.  Rapidly retargetable approaches to de-identification in medical records.

Authors:  Ben Wellner; Matt Huyck; Scott Mardis; John Aberdeen; Alex Morgan; Leonid Peshkin; Alex Yeh; Janet Hitzeman; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

4.  Evaluating the state-of-the-art in automatic de-identification.

Authors:  Ozlem Uzuner; Yuan Luo; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

5.  De-identification of health records using Anonym: effectiveness and robustness across datasets.

Authors:  Guido Zuccon; Daniel Kotzur; Anthony Nguyen; Anton Bergheim
Journal:  Artif Intell Med       Date:  2014-04-03       Impact factor: 5.326

6.  Transfer learning based clinical concept extraction on data from multiple sources.

Authors:  Xinbo Lv; Yi Guan; Benyang Deng
Journal:  J Biomed Inform       Date:  2014-05-21       Impact factor: 6.317

Review 7.  Automatic de-identification of textual documents in the electronic health record: a review of recent research.

Authors:  Stephane M Meystre; F Jeffrey Friedlin; Brett R South; Shuying Shen; Matthew H Samore
Journal:  BMC Med Res Methodol       Date:  2010-08-02       Impact factor: 4.615

8.  Automated de-identification of free-text medical records.

Authors:  Ishna Neamatullah; Margaret M Douglass; Li-wei H Lehman; Andrew Reisner; Mauricio Villarroel; William J Long; Peter Szolovits; George B Moody; Roger G Mark; Gari D Clifford
Journal:  BMC Med Inform Decis Mak       Date:  2008-07-24       Impact factor: 2.796

  8 in total
  7 in total

1.  Automatic prediction of coronary artery disease from clinical narratives.

Authors:  Kevin Buchan; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-27       Impact factor: 6.317

2.  De-identification of medical records using conditional random fields and long short-term memory networks.

Authors:  Zhipeng Jiang; Chao Zhao; Bin He; Yi Guan; Jingchi Jiang
Journal:  J Biomed Inform       Date:  2017-10-13       Impact factor: 6.317

3.  De-identification of clinical notes via recurrent neural network and conditional random field.

Authors:  Zengjian Liu; Buzhou Tang; Xiaolong Wang; Qingcai Chen
Journal:  J Biomed Inform       Date:  2017-06-01       Impact factor: 6.317

4.  A Context-Enhanced De-identification System.

Authors:  Kahyun Lee; Mehmet Kayaalp; Sam Henry; Özlem Uzuner
Journal:  ACM Trans Comput Healthc       Date:  2021-10-15

5.  Classifying Cyber-Risky Clinical Notes by Employing Natural Language Processing.

Authors:  Suzanna Schmeelk; Martins Samuel Dogo; Yifan Peng; Braja Gopal Patra
Journal:  Proc Annu Hawaii Int Conf Syst Sci       Date:  2022-01-04

6.  Entity recognition from clinical texts via recurrent neural network.

Authors:  Zengjian Liu; Ming Yang; Xiaolong Wang; Qingcai Chen; Buzhou Tang; Zhe Wang; Hua Xu
Journal:  BMC Med Inform Decis Mak       Date:  2017-07-05       Impact factor: 2.796

Review 7.  Privacy Protection and Secondary Use of Health Data: Strategies and Methods.

Authors:  Dingyi Xiang; Wei Cai
Journal:  Biomed Res Int       Date:  2021-10-07       Impact factor: 3.411

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.