Literature DB >> 18952938

Repurposing the clinical record: can an existing natural language processing system de-identify clinical notes?

Frances P Morrison1, Li Li, Albert M Lai, George Hripcsak.   

Abstract

Electronic clinical documentation can be useful for activities such as public health surveillance, quality improvement, and research, but existing methods of de-identification may not provide sufficient protection of patient data. The general-purpose natural language processor MedLEE retains medical concepts while excluding the remaining text so, in addition to processing text into structured data, it may be able provide a secondary benefit of de-identification. Without modifying the system, the authors tested the ability of MedLEE to remove protected health information (PHI) by comparing 100 outpatient clinical notes with the corresponding XML-tagged output. Of 809 instances of PHI, 26 (3.2%) were detected in output as a result of processing and identification errors. However, PHI in the output was highly transformed, much appearing as normalized terms for medical concepts, potentially making re-identification more difficult. The MedLEE processor may be a good enhancement to other de-identification systems, both removing PHI and providing coded data from clinical text.

Entities:  

Mesh:

Year:  2008        PMID: 18952938      PMCID: PMC2605586          DOI: 10.1197/jamia.M2862

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  9 in total

1.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports.

Authors:  George Hripcsak; John H M Austin; Philip O Alderson; Carol Friedman
Journal:  Radiology       Date:  2002-07       Impact factor: 11.105

2.  Automated detection of adverse events using natural language processing of discharge summaries.

Authors:  Genevieve B Melton; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2005-03-31       Impact factor: 4.497

3.  Medical i2b2 NLP smoking challenge: the A-Life system architecture and methodology.

Authors:  Daniel T Heinze; Mark L Morsch; Brian C Potter; Ronald E Sheffer
Journal:  J Am Med Inform Assoc       Date:  2007-10-18       Impact factor: 4.497

4.  State-of-the-art anonymization of medical records using an iterative machine learning framework.

Authors:  György Szarvas; Richárd Farkas; Róbert Busa-Fekete
Journal:  J Am Med Inform Assoc       Date:  2007 Sep-Oct       Impact factor: 4.497

5.  Evaluating the state-of-the-art in automatic de-identification.

Authors:  Ozlem Uzuner; Yuan Luo; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

6.  Automating a severity score guideline for community-acquired pneumonia employing medical language processing of discharge summaries.

Authors:  C Friedman; C Knirsch; L Shagina; G Hripcsak
Journal:  Proc AMIA Symp       Date:  1999

7.  Hospital electronic medical record-based public health surveillance system deployed during the 2002 Winter Olympic Games.

Authors:  Adi V Gundlapalli; Jonathan Olson; Sean P Smith; Michael Baza; Robert R Hausam; Louise J Eutropius; Stanley L Pestotnik; Karen Duncan; Nancy Staggers; Pierre Pincetl; Matthew H Samore
Journal:  Am J Infect Control       Date:  2007-04       Impact factor: 2.918

8.  Limited parsing of notational text visit notes: ad-hoc vs. NLP approaches.

Authors:  R C Barrows Jr; M Busuioc; C Friedman
Journal:  Proc AMIA Symp       Date:  2000

9.  Evaluation of a deidentification (De-Id) software engine to share pathology reports and clinical documents for research.

Authors:  Dilip Gupta; Melissa Saul; John Gilbertson
Journal:  Am J Clin Pathol       Date:  2004-02       Impact factor: 2.493

  9 in total
  19 in total

Review 1.  Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.

Authors:  Clete A Kushida; Deborah A Nichols; Rik Jadrnicek; Ric Miller; James K Walsh; Kara Griffin
Journal:  Med Care       Date:  2012-07       Impact factor: 2.983

2.  Using a pipeline to improve de-identification performance.

Authors:  Frances P Morrison; Soumitra Sengupta; George Hripcsak
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

3.  Recognition and pseudonymisation of medical records for secondary use.

Authors:  Johannes Heurix; Stefan Fenz; Antonio Rella; Thomas Neubauer
Journal:  Med Biol Eng Comput       Date:  2015-06-04       Impact factor: 2.602

4.  The machine giveth and the machine taketh away: a parrot attack on clinical text deidentified with hiding in plain sight.

Authors:  David S Carrell; David J Cronkite; Muqun Rachel Li; Steve Nyemba; Bradley A Malin; John S Aberdeen; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

5.  Building gold standard corpora for medical natural language processing tasks.

Authors:  Louise Deleger; Qi Li; Todd Lingren; Megan Kaiser; Katalin Molnar; Laura Stoutenborough; Michal Kouril; Keith Marsolo; Imre Solti
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

6.  A De-identification method for bilingual clinical texts of various note types.

Authors:  Soo-Yong Shin; Yu Rang Park; Yongdon Shin; Hyo Joung Choi; Jihyun Park; Yongman Lyu; Moo-Song Lee; Chang-Min Choi; Woo-Sung Kim; Jae Ho Lee
Journal:  J Korean Med Sci       Date:  2014-12-23       Impact factor: 2.153

Review 7.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

Authors:  Kory Kreimeyer; Matthew Foster; Abhishek Pandey; Nina Arya; Gwendolyn Halford; Sandra F Jones; Richard Forshee; Mark Walderhaug; Taxiarchis Botsis
Journal:  J Biomed Inform       Date:  2017-07-17       Impact factor: 6.317

8.  Building a best-in-class automated de-identification tool for electronic health records through ensemble learning.

Authors:  Karthik Murugadoss; Ajit Rajasekharan; Bradley Malin; Vineet Agarwal; Sairam Bade; Jeff R Anderson; Jason L Ross; William A Faubion; John D Halamka; Venky Soundararajan; Sankar Ardhanari
Journal:  Patterns (N Y)       Date:  2021-05-12

9.  Preparing an annotated gold standard corpus to share with extramural investigators for de-identification research.

Authors:  Todd Lingren; Yizhao Ni; Louise Deleger; Megan Kaiser; Laura Stoutenborough; Keith Marsolo; Michal Kouril; Katalin Molnar; Imre Solti
Journal:  J Biomed Inform       Date:  2014-02-17       Impact factor: 6.317

10.  Resilience of clinical text de-identified with "hiding in plain sight" to hostile reidentification attacks by human readers.

Authors:  David S Carrell; Bradley A Malin; David J Cronkite; John S Aberdeen; Cheryl Clark; Muqun Rachel Li; Dikshya Bastakoty; Steve Nyemba; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.