
Identification of patient name references within medical documents using semantic selectional restrictions.

Ricky K Taira, Alex A T Bui, Hooshang Kangarloo.

Abstract

De-identification of a patient's personal data from medical records is a protective legal requirement imposed before medical documents can be used for research purposes or transferred to other healthcare providers (e.g., teachers, students, tele-consultations). This de-identification process is tedious if performed manually, and direct search-and-replace strategies are known to be error-prone [9]. In this paper, we report on the identification step of this process. The proposed algorithm estimates how well candidate patient name references fit a set of semantic selectional restrictions. These restrictions place tight contextual requirements upon candidate words in the report text and are determined automatically from a manually tagged corpus of training reports. Maximum entropy classifiers provide a probabilistic measure of the belief that a given candidate token satisfies a given semantic restriction. We report on the design and preliminary evaluation of the system within the domain of pediatric urology.
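The scoring step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single restriction ("this token is a patient name"), treats the maximum entropy classifier as its binary special case (logistic regression), and uses invented context features and an invented six-sentence toy corpus. The names `context_features`, `train_maxent`, `predict`, and `TAGGED` are hypothetical.

```python
import math

def context_features(tokens, i):
    """Hypothetical contextual features for the candidate token at index i."""
    prev_tok = tokens[i - 1].lower() if i > 0 else "<s>"
    next_tok = tokens[i + 1].lower() if i + 1 < len(tokens) else "</s>"
    return [
        "bias",
        "prev=" + prev_tok,
        "next=" + next_tok,
        "capitalized=" + str(tokens[i][:1].isupper()),
    ]

def predict(weights, feats):
    """Probability that the candidate satisfies the restriction."""
    z = sum(weights.get(f, 0.0) for f in feats)
    return 1.0 / (1.0 + math.exp(-z))

def train_maxent(examples, epochs=500, lr=0.2):
    """Fit a binary maximum entropy model by stochastic gradient ascent."""
    weights = {}
    for _ in range(epochs):
        for feats, label in examples:
            error = label - predict(weights, feats)
            for f in feats:
                weights[f] = weights.get(f, 0.0) + lr * error
    return weights

# Invented toy corpus: (sentence, candidate index, 1 if patient name else 0).
TAGGED = [
    ("Patient John was seen today", 1, 1),
    ("Mr. Smith returns for follow-up", 1, 1),
    ("Patient Mary has reflux", 1, 1),
    ("The Bladder was normal", 1, 0),
    ("Left Kidney was enlarged", 1, 0),
    ("Right Ureter was dilated", 1, 0),
]

examples = [(context_features(s.split(), i), y) for s, i, y in TAGGED]
weights = train_maxent(examples)

# Score two unseen candidates by their context alone.
p_name = predict(weights, context_features("Patient Alice was discharged".split(), 1))
p_organ = predict(weights, context_features("The Kidney was small".split(), 1))
print(round(p_name, 3), round(p_organ, 3))
```

In this sketch the candidate word itself is never a feature, only its context, which mirrors the idea that the restrictions constrain where a patient name may appear rather than which strings count as names.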


Year:  2002        PMID: 12463926      PMCID: PMC2244274     

Source DB:  PubMed          Journal:  Proc AMIA Symp        ISSN: 1531-605X


  3 in total

1.  Basic principles of ROC analysis.

Authors:  C E Metz
Journal:  Semin Nucl Med       Date:  1978-10       Impact factor: 4.446

2.  DataServer: an infrastructure to support evidence-based radiology.

Authors:  Alex A T Bui; John David N Dionisio; Craig A Morioka; Usha Sinha; Ricky K Taira; Hooshang Kangarloo
Journal:  Acad Radiol       Date:  2002-06       Impact factor: 3.173

3.  Automatic record hash coding and linkage for epidemiological follow-up data confidentiality.

Authors:  C Quantin; H Bouzelat; F A Allaert; A M Benhamiche; J Faivre; L Dusserre
Journal:  Methods Inf Med       Date:  1998-09       Impact factor: 2.176

  26 in total

Review 1.  Strategies for de-identification and anonymization of electronic health record data for use in multicenter research studies.

Authors:  Clete A Kushida; Deborah A Nichols; Rik Jadrnicek; Ric Miller; James K Walsh; Kara Griffin
Journal:  Med Care       Date:  2012-07       Impact factor: 2.983

2.  Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text.

Authors:  David Carrell; Bradley Malin; John Aberdeen; Samuel Bayer; Cheryl Clark; Ben Wellner; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2012-07-06       Impact factor: 4.497

3.  Using a pipeline to improve de-identification performance.

Authors:  Frances P Morrison; Soumitra Sengupta; George Hripcsak
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

4.  A secure protocol to distribute unlinkable health data.

Authors:  Bradley A Malin; Latanya Sweeney
Journal:  AMIA Annu Symp Proc       Date:  2005

5.  A de-identifier for medical discharge summaries.

Authors:  Ozlem Uzuner; Tawanda C Sibanda; Yuan Luo; Peter Szolovits
Journal:  Artif Intell Med       Date:  2007-11-28       Impact factor: 5.326

6.  State-of-the-art anonymization of medical records using an iterative machine learning framework.

Authors:  György Szarvas; Richárd Farkas; Róbert Busa-Fekete
Journal:  J Am Med Inform Assoc       Date:  2007 Sep-Oct       Impact factor: 4.497

7.  Problem-centric organization and visualization of patient imaging and clinical data.

Authors:  Vijayaraghavan Bashyam; William Hsu; Emily Watt; Alex A T Bui; Hooshang Kangarloo; Ricky K Taira
Journal:  Radiographics       Date:  2009-01-23       Impact factor: 5.333

8.  A system for de-identifying medical message board text.

Authors:  Adrian Benton; Shawndra Hill; Lyle Ungar; Annie Chung; Charles Leonard; Cristin Freeman; John H Holmes
Journal:  BMC Bioinformatics       Date:  2011-06-09       Impact factor: 3.169

9.  Resilience of clinical text de-identified with "hiding in plain sight" to hostile reidentification attacks by human readers.

Authors:  David S Carrell; Bradley A Malin; David J Cronkite; John S Aberdeen; Cheryl Clark; Muqun Rachel Li; Dikshya Bastakoty; Steve Nyemba; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

10.  A tool for improving the longitudinal imaging characterization for neuro-oncology cases.

Authors:  Ricky K Taira; Alex A T Bui; William Hsu; Vijayaraghavan Bashyam; Shishir Dube; Emily Watt; Lewellyn Andrada; Suzie El-Saden; Timothy Cloughesy; Hooshang Kangarloo
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06