Literature DB >> 27577394

A Generic Method for Assessing the Quality of De-Identified Health Data.

Fabian Prasser1, Raffael Bild1, Klaus A Kuhn1.   

Abstract

Data sharing plays an important role in modern biomedical research. Due to the inherent sensitivity of health data, patient privacy must be protected. De-identification means to transform a dataset in such a way that it becomes extremely difficult for an attacker to link its records to identified individuals. This can be achieved with different types of data transformations. As transformation impacts the information content of a dataset, it is important to balance an increase in privacy with a decrease in data quality. To this end, models for measuring both aspects are needed. Non-Uniform Entropy is a model for data quality which is frequently recommended for de-identifying health data. In this work we show that it cannot be used in a meaningful way for measuring the quality of data which has been transformed with several important types of data transformation. We introduce a generic variant, which overcomes this limitation. We performed experiments with real-world datasets, which show that our method provides a unified framework in which the quality of differently transformed data can be compared to find a good or even optimal solution to a given data de-identification problem. We have implemented our method into ARX, an open source anonymization tool for biomedical data.

Entities:  

Mesh:

Year:  2016        PMID: 27577394

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  3 in total

1.  To share or not to share? Expected pros and cons of data sharing in radiological research.

Authors:  Francesco Sardanelli; Marco Alì; Myriam G Hunink; Nehmat Houssami; Luca M Sconfienza; Giovanni Di Leo
Journal:  Eur Radiol       Date:  2018-01-18       Impact factor: 5.315

Review 2.  Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.

Authors:  Raphaël Chevrier; Vasiliki Foufi; Christophe Gaudet-Blavignac; Arnaud Robert; Christian Lovis
Journal:  J Med Internet Res       Date:  2019-05-31       Impact factor: 5.428

Review 3.  Utility-driven assessment of anonymized data via clustering.

Authors:  Maria Eugénia Ferrão; Paula Prata; Paulo Fazendeiro
Journal:  Sci Data       Date:  2022-07-30       Impact factor: 8.501

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.