Literature DB >> 26707453

Privacy-preserving matching of similar patients.

Dinusha Vatsalan1, Peter Christen2.   

Abstract

The identification of similar entities represented by records in different databases has drawn considerable attention in many application areas, including in the health domain. One important type of entity matching application that is vital for quality healthcare analytics is the identification of similar patients, known as similar patient matching. A key component of identifying similar records is the calculation of similarity of the values in attributes (fields) between these records. Due to increasing privacy and confidentiality concerns, using the actual attribute values of patient records to identify similar records across different organizations is becoming non-trivial because the attributes in such records often contain highly sensitive information such as personal and medical details of patients. Therefore, the matching needs to be based on masked (encoded) values while being effective and efficient to allow matching of large databases. Bloom filter encoding has widely been used as an efficient masking technique for privacy-preserving matching of string and categorical values. However, no work on Bloom filter-based masking of numerical data, such as integer (e.g. age), floating point (e.g. body mass index), and modulus (numbers wrap around upon reaching a certain value, e.g. date and time), which are commonly required in the health domain, has been presented in the literature. We propose a framework with novel methods for masking numerical data using Bloom filters, thereby facilitating the calculation of similarities between records. We conduct an empirical study on publicly available real-world datasets which shows that our framework provides efficient masking and achieves similar matching accuracy compared to the matching of actual unencoded patient records.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Approximate matching; Bloom filters; Numerical data; Privacy; Similarity

Mesh:

Year:  2015        PMID: 26707453     DOI: 10.1016/j.jbi.2015.12.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  Accuracy of an Electronic Health Record Patient Linkage Module Evaluated between Neighboring Academic Health Care Centers.

Authors:  Mindy K Ross; Javier Sanz; Brian Tep; Rob Follett; Spencer L Soohoo; Douglas S Bell
Journal:  Appl Clin Inform       Date:  2020-11-04       Impact factor: 2.342

2.  Locational privacy-preserving distance computations with intersecting sets of randomly labeled grid points.

Authors:  Rainer Schnell; Jonas Klingwort; James M Farrow
Journal:  Int J Health Geogr       Date:  2021-03-20       Impact factor: 3.918

3.  Privacy-preserving record linkage in large databases using secure multiparty computation.

Authors:  Peeter Laud; Alisa Pankova
Journal:  BMC Med Genomics       Date:  2018-10-11       Impact factor: 3.063

4.  Good Practice Data Linkage (GPD): A Translation of the German Version.

Authors:  Stefanie March; Silke Andrich; Johannes Drepper; Dirk Horenkamp-Sonntag; Andrea Icks; Peter Ihle; Joachim Kieschke; Bianca Kollhorst; Birga Maier; Ingo Meyer; Gabriele Müller; Christoph Ohlmeier; Dirk Peschke; Adrian Richter; Marie-Luise Rosenbusch; Nadine Scholten; Mandy Schulz; Christoph Stallmann; Enno Swart; Stefanie Wobbe-Ribinski; Antke Wolter; Jan Zeidler; Falk Hoffmann
Journal:  Int J Environ Res Public Health       Date:  2020-10-27       Impact factor: 3.390

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.