Literature DB >> 22298567

A simple heuristic for blindfolded record linkage.

Susan C Weber1, Henry Lowe, Amar Das, Todd Ferris.   

Abstract

OBJECTIVES: To address the challenge of balancing privacy with the need to create cross-site research registry records on individual patients, while matching the data for a given patient as he or she moves between participating sites. To evaluate the strategy of generating anonymous identifiers based on real identifiers in such a way that the chances of a shared patient being accurately identified were maximized, and the chances of incorrectly joining two records belonging to different people were minimized.
METHODS: Our hypothesis was that most variation in names occurs after the first two letters, and that date of birth is highly reliable, so a single match variable consisting of a hashed string built from the first two letters of the patient's first and last names plus their date of birth would have the desired characteristics. We compared and contrasted the match algorithm characteristics (rate of false positive v. rate of false negative) for our chosen variable against both Social Security Numbers and full names.
RESULTS: In a data set of 19 000 records, a derived match variable consisting of a 2-character prefix from both first and last names combined with date of birth has a 97% sensitivity; by contrast, an anonymized identifier based on the patient's full names and date of birth has a sensitivity of only 87% and SSN has sensitivity 86%.
CONCLUSION: The approach we describe is most useful in situations where privacy policies preclude the full exchange of the identifiers required by more sophisticated and sensitive linkage algorithms. For data sets of sufficiently high quality this effective approach, while producing a lower rate of matching than more complex algorithms, has the merit of being easy to explain to institutional review boards, adheres to the minimum necessary rule of the HIPAA privacy rule, and is faster and less cumbersome to implement than a full probabilistic linkage.

Entities:  

Mesh:

Year:  2012        PMID: 22298567      PMCID: PMC3392854          DOI: 10.1136/amiajnl-2011-000329

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  10 in total

1.  Analysis of identifier performance using a deterministic linkage algorithm.

Authors:  Shaun J Grannis; J Marc Overhage; Clement J McDonald
Journal:  Proc AMIA Symp       Date:  2002

2.  STRIDE--An integrated standards-based translational research informatics platform.

Authors:  Henry J Lowe; Todd A Ferris; Penni M Hernandez; Susan C Weber
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

3.  Which are the best identifiers for record linkage?

Authors:  Catherine Quantin; Christine Binquet; Karima Bourquard; Ronny Pattisina; Béatrice Gouyon-Cornet; Cyril Ferdynus; Jean-Bernard Gouyon; Allaert François-André
Journal:  Med Inform Internet Med       Date:  2004 Sep-Dec

4.  Linking patients' records across organizations while maintaining anonymity.

Authors:  Boonchai Kijsanayotin; Stuart M Speedie; Donald P Connelly
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

5.  Private medical record linkage with approximate matching.

Authors:  Elizabeth Durham; Yuan Xue; Murat Kantarcioglu; Bradley Malin
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

6.  Probabilistic linkage of large public health data files.

Authors:  M A Jaro
Journal:  Stat Med       Date:  1995 Mar 15-Apr 15       Impact factor: 2.373

7.  Against simple universal health-care identifiers.

Authors:  P Szolovits; I Kohane
Journal:  J Am Med Inform Assoc       Date:  1994 Jul-Aug       Impact factor: 4.497

8.  Issues in identification and linkage of patient records across an integrated delivery system.

Authors:  M G Arellano; G I Weber
Journal:  J Healthc Inf Manag       Date:  1998

9.  Privacy-preserving record linkage using Bloom filters.

Authors:  Rainer Schnell; Tobias Bachteler; Jörg Reiher
Journal:  BMC Med Inform Decis Mak       Date:  2009-08-25       Impact factor: 2.796

10.  Some methods for blindfolded record linkage.

Authors:  Tim Churches; Peter Christen
Journal:  BMC Med Inform Decis Mak       Date:  2004-06-28       Impact factor: 2.796

  10 in total
  23 in total

1.  Data linkages between patient-powered research networks and health plans: a foundation for collaborative research.

Authors:  Abiy Agiro; Xiaoxue Chen; Biruk Eshete; Rebecca Sutphen; Elizabeth Bourquardez Clark; Cristina M Burroughs; W Benjamin Nowell; Jeffrey R Curtis; Sara Loud; Robert McBurney; Peter A Merkel; Antoine G Sreih; Kalen Young; Kevin Haynes
Journal:  J Am Med Inform Assoc       Date:  2019-07-01       Impact factor: 4.497

Review 2.  Privacy preserving interactive record linkage (PPIRL).

Authors:  Hye-Chung Kum; Ashok Krishnamurthy; Ashwin Machanavajjhala; Michael K Reiter; Stanley Ahalt
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

3.  Biomedical data privacy: problems, perspectives, and recent advances.

Authors:  Bradley A Malin; Khaled El Emam; Christine M O'Keefe
Journal:  J Am Med Inform Assoc       Date:  2012-12-06       Impact factor: 4.497

4.  Sharing data for the public good and protecting individual privacy: informatics solutions to combine different goals.

Authors:  Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2013-01-01       Impact factor: 4.497

5.  Oncoshare: lessons learned from building an integrated multi-institutional database for comparative effectiveness research.

Authors:  Susan C Weber; Tina Seto; Cliff Olson; Pragati Kenkare; Allison W Kurian; Amar K Das
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

6.  Design and implementation of a privacy preserving electronic health record linkage tool in Chicago.

Authors:  Abel N Kho; John P Cashy; Kathryn L Jackson; Adam R Pah; Satyender Goel; Jörn Boehnke; John Eric Humphries; Scott Duke Kominers; Bala N Hota; Shannon A Sims; Bradley A Malin; Dustin D French; Theresa L Walunas; David O Meltzer; Erin O Kaleba; Roderick C Jones; William L Galanter
Journal:  J Am Med Inform Assoc       Date:  2015-06-23       Impact factor: 4.497

7.  Linked Records of Children with Traumatic Brain Injury. Probabilistic Linkage without Use of Protected Health Information.

Authors:  T D Bennett; J M Dean; H T Keenan; M H McGlincy; A M Thomas; L J Cook
Journal:  Methods Inf Med       Date:  2015-05-29       Impact factor: 2.176

8.  An Associative Memory Model for Integration of Fragmented Research Data and Identification of Treatment Correlations in Breast Cancer Care.

Authors:  Ashis Gopal Banerjee; Mridul Khan; John Higgins; Annarita Giani; Amar K Das
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

9.  A practical approach to achieve private medical record linkage in light of public resources.

Authors:  Mehmet Kuzu; Murat Kantarcioglu; Elizabeth Ashley Durham; Csaba Toth; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2012-07-30       Impact factor: 4.497

Review 10.  Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States.

Authors:  Shuang Wang; Xiaoqian Jiang; Siddharth Singh; Rebecca Marmor; Luca Bonomi; Dov Fox; Michelle Dow; Lucila Ohno-Machado
Journal:  Ann N Y Acad Sci       Date:  2016-09-28       Impact factor: 5.691

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.