A De-identification method for bilingual clinical texts of various note types.

Soo-Yong Shin, Yu Rang Park, Yongdon Shin, Hyo Joung Choi, Jihyun Park, Yongman Lyu, Moo-Song Lee, Chang-Min Choi, Woo-Sung Kim, Jae Ho Lee.

Abstract

De-identification of personal health information is essential if clinical records are to be used without requiring written informed consent from patients. Previous de-identification methods used natural language processing to remove identifiers from clinical narrative text, but they focused only on text written in English. In this study, we propose a regular expression-based de-identification method for bilingual clinical records written in Korean and English. To develop and validate the regular expression rules, we obtained a development dataset of 6,039 clinical notes of 20 types and a validation dataset of 5,000 notes of 33 types. Fifteen regular expression rules were constructed using the development dataset, and those rules achieved 99.87% precision and 96.25% recall on the validation dataset. Our de-identification method successfully removed the identifiers in diverse types of bilingual clinical narrative texts. This method should therefore help physicians perform retrospective research more easily.
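
As a rough illustration of the regular expression approach described in the abstract, the following Python sketch applies a few rules to a mixed Korean and English note and scores predicted spans against a gold annotation. The four patterns, the `환자명:` ("patient name") label, and the helper names `find_identifiers`, `deidentify`, and `precision_recall` are assumptions made for this example; they are not the fifteen rules developed in the study.

```python
import re

# Hypothetical rules for illustration only; NOT the fifteen rules from the study.
RULES = [
    ("RRN", re.compile(r"\b\d{6}-\d{7}\b")),                    # resident registration number shape
    ("PHONE", re.compile(r"\b0\d{1,2}-\d{3,4}-\d{4}\b")),        # Korean-style phone number
    ("DATE", re.compile(r"\b\d{4}[-./]\d{1,2}[-./]\d{1,2}\b")),  # numeric date
    ("NAME", re.compile(r"(?<=환자명: )[가-힣]{2,4}")),          # Hangul name after an assumed label
]


def find_identifiers(text):
    """Return a sorted list of (start, end, label) spans matched by any rule."""
    spans = []
    for label, pattern in RULES:
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), label))
    return sorted(spans)


def deidentify(text):
    """Replace every matched span with a [LABEL] placeholder."""
    pieces, pos = [], 0
    for start, end, label in find_identifiers(text):
        if start < pos:  # skip a span that overlaps one already replaced
            continue
        pieces.append(text[pos:start])
        pieces.append(f"[{label}]")
        pos = end
    pieces.append(text[pos:])
    return "".join(pieces)


def precision_recall(predicted, gold):
    """Span-level precision and recall using exact span matching."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


note = "환자명: 홍길동, 010-1234-5678, 2013-05-01 admitted with chest pain."
print(deidentify(note))
```

Exact span matching is only one possible scoring choice; a token-level or partial-overlap criterion would give different precision and recall values for the same output.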

Keywords:  Anonymization; Bilingual Text; Clinical Text; De-identification; Medical Informatics; Patient Privacy; Text Mining

Year:  2014        PMID: 25552878      PMCID: PMC4278030          DOI: 10.3346/jkms.2015.30.1.7

Source DB:  PubMed          Journal:  J Korean Med Sci        ISSN: 1011-8934            Impact factor:   2.153


