Literature DB >> 22904698

Quantifying the Correctness, Computational Complexity, and Security of Privacy-Preserving String Comparators for Record Linkage.

Elizabeth Durham1, Yuan Xue, Murat Kantarcioglu, Bradley Malin.   

Abstract

Record linkage is the task of identifying records from disparate data sources that refer to the same entity. It is an integral component of data processing in distributed settings, where the integration of information from multiple sources can prevent duplication and enrich overall data quality, thus enabling more detailed and correct analysis. Privacy-preserving record linkage (PPRL) is a variant of the task in which data owners wish to perform linkage without revealing identifiers associated with the records. This task is desirable in various domains, including healthcare, where it may not be possible to reveal patient identity due to confidentiality requirements, and in business, where it could be disadvantageous to divulge customers' identities. To perform PPRL, it is necessary to apply string comparators that function in the privacy-preserving space. A number of privacy-preserving string comparators (PPSCs) have been proposed, but little research has compared them in the context of a real record linkage application. This paper performs a principled and comprehensive evaluation of six PPSCs in terms of three key properties: 1) correctness of record linkage predictions, 2) computational complexity, and 3) security. We utilize a real publicly-available dataset, derived from the North Carolina voter registration database, to evaluate the tradeoffs between the aforementioned properties. Among our results, we find that PPSCs that partition, encode, and compare strings yield highly accurate record linkage results. However, as a tradeoff, we observe that such PPSCs are less secure than those that map and compare strings in a reduced dimensional space.

Entities:  

Year:  2012        PMID: 22904698      PMCID: PMC3418825          DOI: 10.1016/j.inffus.2011.04.004

Source DB:  PubMed          Journal:  Inf Fusion        ISSN: 1566-2535            Impact factor:   12.975


  13 in total

1.  Analysis of identifier performance using a deterministic linkage algorithm.

Authors:  Shaun J Grannis; J Marc Overhage; Clement J McDonald
Journal:  Proc AMIA Symp       Date:  2002

2.  An empirical comparison of record linkage procedures.

Authors:  Shanti Gomatam; Randy Carter; Mario Ariet; Glenn Mitchell
Journal:  Stat Med       Date:  2002-05-30       Impact factor: 2.373

3.  Analysis of a probabilistic record linkage technique without human review.

Authors:  Shaun J Grannis; J Marc Overhage; Siu Hui; Clement J McDonald
Journal:  AMIA Annu Symp Proc       Date:  2003

4.  How (not) to protect genomic data privacy in a distributed network: using trail re-identification to evaluate and design anonymity protection systems.

Authors:  Bradley Malin; Latanya Sweeney
Journal:  J Biomed Inform       Date:  2004-06       Impact factor: 6.317

5.  Real world performance of approximate string comparators for use in patient matching.

Authors:  Shaun J Grannis; J Marc Overhage; Clement McDonald
Journal:  Stud Health Technol Inform       Date:  2004

6.  Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper.

Authors:  Charles Safran; Meryl Bloomrosen; W Edward Hammond; Steven Labkoff; Suzanne Markel-Fox; Paul C Tang; Don E Detmer
Journal:  J Am Med Inform Assoc       Date:  2006-10-31       Impact factor: 4.497

7.  How to ensure data security of an epidemiological follow-up: quality assessment of an anonymous record linkage procedure.

Authors:  C Quantin; H Bouzelat; F A Allaert; A M Benhamiche; J Faivre; L Dusserre
Journal:  Int J Med Inform       Date:  1998-03       Impact factor: 4.046

8.  Zero-check: a zero-knowledge protocol for reconciling patient identities across institutions.

Authors:  Jules J Berman
Journal:  Arch Pathol Lab Med       Date:  2004-03       Impact factor: 5.534

9.  Privacy-preserving record linkage using Bloom filters.

Authors:  Rainer Schnell; Tobias Bachteler; Jörg Reiher
Journal:  BMC Med Inform Decis Mak       Date:  2009-08-25       Impact factor: 2.796

10.  Some methods for blindfolded record linkage.

Authors:  Tim Churches; Peter Christen
Journal:  BMC Med Inform Decis Mak       Date:  2004-06-28       Impact factor: 2.796

View more
  8 in total

Review 1.  Privacy preserving interactive record linkage (PPIRL).

Authors:  Hye-Chung Kum; Ashok Krishnamurthy; Ashwin Machanavajjhala; Michael K Reiter; Stanley Ahalt
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

2.  Efficient Privacy-Aware Record Integration.

Authors:  Mehmet Kuzu; Murat Kantarcioglu; Ali Inan; Elisa Bertino; Elizabeth Durham; Bradley Malin
Journal:  Adv Database Technol       Date:  2013

3.  Composite Bloom Filters for Secure Record Linkage.

Authors:  Elizabeth Ashley Durham; Murat Kantarcioglu; Yuan Xue; Csaba Toth; Mehmet Kuzu; Bradley Malin
Journal:  IEEE Trans Knowl Data Eng       Date:  2014-12       Impact factor: 6.977

4.  SOEMPI: A Secure Open Enterprise Master Patient Index Software Toolkit for Private Record Linkage.

Authors:  Csaba Toth; Elizabeth Durham; Murat Kantarcioglu; Yuan Xue; Bradley Malin
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

5.  A practical approach to achieve private medical record linkage in light of public resources.

Authors:  Mehmet Kuzu; Murat Kantarcioglu; Elizabeth Ashley Durham; Csaba Toth; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2012-07-30       Impact factor: 4.497

6.  Privacy preserving probabilistic record linkage (P3RL): a novel method for linking existing health-related data and maintaining participant confidentiality.

Authors:  Kurt Schmidlin; Kerri M Clough-Gorr; Adrian Spoerri
Journal:  BMC Med Res Methodol       Date:  2015-05-30       Impact factor: 4.615

7.  Evaluating privacy-preserving record linkage using cryptographic long-term keys and multibit trees on large medical datasets.

Authors:  Adrian P Brown; Christian Borgs; Sean M Randall; Rainer Schnell
Journal:  BMC Med Inform Decis Mak       Date:  2017-06-08       Impact factor: 2.796

8.  Validity of a stroke severity index for administrative claims data research: a retrospective cohort study.

Authors:  Sheng-Feng Sung; Cheng-Yang Hsieh; Huey-Juan Lin; Yu-Wei Chen; Chih-Hung Chen; Yea-Huei Kao Yang; Ya-Han Hu
Journal:  BMC Health Serv Res       Date:  2016-09-22       Impact factor: 2.655

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.