Literature DB >> 19745408

Testing tactics to localize de-identification.

Cyril Grouin1, Arnaud Rosier, Olivier Dameron, Pierre Zweigenbaum.   

Abstract

Recent renewed interest in de-identification (also known as "anonymisation") has led to the development of a series of systems in the United States with very good performance on challenge test sets. De-identification needs however to be tuned to the local documents and their specificities. We address here two issues raised in this context. First, tuning is generally performed by language engineers who should not have to work on identified text. We therefore perform a first gross de-identification step in the hospital. Second, to set up a de-identification system for new documents in a language different from English, here French patient reports, we tested two methods: the first attempts to adapt an existing US de-identifier for English, the second re-develops a new system which applies the same methods. The first method involved localizing patterns designed for English, which proved cumbersome and did not quickly obtain good performance. With a similar effort, the latter method obtained much better results. Evaluated on a set of 23 randomly selected texts from a corpus of 21,749 clinical texts, it obtained 83% recall and 92% precision.

Entities:  

Mesh:

Year:  2009        PMID: 19745408

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  7 in total

Review 1.  Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.

Authors:  Clete A Kushida; Deborah A Nichols; Rik Jadrnicek; Ric Miller; James K Walsh; Kara Griffin
Journal:  Med Care       Date:  2012-07       Impact factor: 2.983

2.  Recognition and pseudonymisation of medical records for secondary use.

Authors:  Johannes Heurix; Stefan Fenz; Antonio Rella; Thomas Neubauer
Journal:  Med Biol Eng Comput       Date:  2015-06-04       Impact factor: 2.602

3.  A De-identification method for bilingual clinical texts of various note types.

Authors:  Soo-Yong Shin; Yu Rang Park; Yongdon Shin; Hyo Joung Choi; Jihyun Park; Yongman Lyu; Moo-Song Lee; Chang-Min Choi; Woo-Sung Kim; Jae Ho Lee
Journal:  J Korean Med Sci       Date:  2014-12-23       Impact factor: 2.153

4.  De-identifying Swedish clinical text - refinement of a gold standard and experiments with Conditional random fields.

Authors:  Hercules Dalianis; Sumithra Velupillai
Journal:  J Biomed Semantics       Date:  2010-04-12

5.  De-identification of primary care electronic medical records free-text data in Ontario, Canada.

Authors:  Karen Tu; Julie Klein-Geltink; Tezeta F Mitiku; Chiriac Mihai; Joel Martin
Journal:  BMC Med Inform Decis Mak       Date:  2010-06-18       Impact factor: 2.796

Review 6.  Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.

Authors:  Raphaël Chevrier; Vasiliki Foufi; Christophe Gaudet-Blavignac; Arnaud Robert; Christian Lovis
Journal:  J Med Internet Res       Date:  2019-05-31       Impact factor: 5.428

Review 7.  Clinical Natural Language Processing in languages other than English: opportunities and challenges.

Authors:  Aurélie Névéol; Hercules Dalianis; Sumithra Velupillai; Guergana Savova; Pierre Zweigenbaum
Journal:  J Biomed Semantics       Date:  2018-03-30
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.