Literature DB >> 22822040

Reducing patient re-identification risk for laboratory results within research datasets.

Ravi V Atreya1, Joshua C Smith, Allison B McCoy, Bradley Malin, Randolph A Miller.   

Abstract

OBJECTIVE: To try to lower patient re-identification risks for biomedical research databases containing laboratory test results while also minimizing changes in clinical data interpretation.
MATERIALS AND METHODS: In our threat model, an attacker obtains 5-7 laboratory results from one patient and uses them as a search key to discover the corresponding record in a de-identified biomedical research database. To test our models, the existing Vanderbilt TIME database of 8.5 million Safe Harbor de-identified laboratory results from 61 280 patients was used. The uniqueness of unaltered laboratory results in the dataset was examined, and then two data perturbation models were applied-simple random offsets and an expert-derived clinical meaning-preserving model. A rank-based re-identification algorithm to mimic an attack was used. The re-identification risk and the retention of clinical meaning for each model's perturbed laboratory results were assessed.
RESULTS: Differences in re-identification rates between the algorithms were small despite substantial divergence in altered clinical meaning. The expert algorithm maintained the clinical meaning of laboratory results better (affecting up to 4% of test results) than simple perturbation (affecting up to 26%). DISCUSSION AND
CONCLUSION: With growing impetus for sharing clinical data for research, and in view of healthcare-related federal privacy regulation, methods to mitigate risks of re-identification are important. A practical, expert-derived perturbation algorithm that demonstrated potential utility was developed. Similar approaches might enable administrators to select data protection scheme parameters that meet their preferences in the trade-off between the protection of privacy and the retention of clinical meaning of shared data.

Entities:  

Mesh:

Year:  2012        PMID: 22822040      PMCID: PMC3555327          DOI: 10.1136/amiajnl-2012-001026

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  42 in total

1.  How (not) to protect genomic data privacy in a distributed network: using trail re-identification to evaluate and design anonymity protection systems.

Authors:  Bradley Malin; Latanya Sweeney
Journal:  J Biomed Inform       Date:  2004-06       Impact factor: 6.317

2.  Revisiting HIPAA.

Authors:  Lisa A Gallagher
Journal:  Nurs Manage       Date:  2010-04

3.  Anonymization of longitudinal electronic medical records.

Authors:  Acar Tamersoy; Grigorios Loukides; Mehmet Ercan Nergiz; Yucel Saygin; Bradley Malin
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-01-27

4.  Business process optimization for RHIOs.

Authors:  Praveen Soti; Seema Pandey
Journal:  J Healthc Inf Manag       Date:  2007

5.  The NCBI dbGaP database of genotypes and phenotypes.

Authors:  Matthew D Mailman; Michael Feolo; Yumi Jin; Masato Kimura; Kimberly Tryka; Rinat Bagoutdinov; Luning Hao; Anne Kiang; Justin Paschall; Lon Phan; Natalia Popova; Stephanie Pretel; Lora Ziyabari; Moira Lee; Yu Shao; Zhen Y Wang; Karl Sirotkin; Minghong Ward; Michael Kholodov; Kerry Zbicz; Jeffrey Beck; Michael Kimelman; Sergey Shevelev; Don Preuss; Eugene Yaschenko; Alan Graeff; James Ostell; Stephen T Sherry
Journal:  Nat Genet       Date:  2007-10       Impact factor: 38.330

Review 6.  From genetic privacy to open consent.

Authors:  Jeantine E Lunshof; Ruth Chadwick; Daniel B Vorhaus; George M Church
Journal:  Nat Rev Genet       Date:  2008-05       Impact factor: 53.242

7.  Focus on electronic health records. 'HIPAA2' legislation means more delicate handling of data.

Authors:  Mike May
Journal:  Nat Med       Date:  2010-03       Impact factor: 53.440

Review 8.  Identifiability in biobanks: models, measures, and mitigation strategies.

Authors:  Bradley Malin; Grigorios Loukides; Kathleen Benitez; Ellen Wright Clayton
Journal:  Hum Genet       Date:  2011-07-08       Impact factor: 4.132

9.  The disclosure of diagnosis codes can breach research participants' privacy.

Authors:  Grigorios Loukides; Joshua C Denny; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

10.  Technical and policy approaches to balancing patient privacy and data sharing in clinical and translational research.

Authors:  Bradley Malin; David Karp; Richard H Scheuermann
Journal:  J Investig Med       Date:  2010-01       Impact factor: 2.895

View more
  9 in total

1.  Detecting the Presence of an Individual in Phenotypic Summary Data.

Authors:  Yongtai Liu; Zhiyu Wan; Weiyi Xia; Murat Kantarcioglu; Yevgeniy Vorobeychik; Ellen Wright Clayton; Abel Kho; David Carrell; Bradley A Malin
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  Biomedical data privacy: problems, perspectives, and recent advances.

Authors:  Bradley A Malin; Khaled El Emam; Christine M O'Keefe
Journal:  J Am Med Inform Assoc       Date:  2012-12-06       Impact factor: 4.497

Review 3.  Information technology for clinical, translational and comparative effectiveness research. Findings from the section clinical research informatics.

Authors:  C Daniel; R Choquet
Journal:  Yearb Med Inform       Date:  2014-08-15

Review 4.  Ensuring privacy in the study of pathogen genetics.

Authors:  Sanjay R Mehta; Staal A Vinterbo; Susan J Little
Journal:  Lancet Infect Dis       Date:  2014-04-07       Impact factor: 25.071

Review 5.  Regulations and Norms for Reuse of Residual Clinical Biospecimens and Health Data.

Authors:  Elizabeth E Umberfield; Sharon L R Kardia; Yun Jiang; Andrea K Thomer; Marcelline R Harris
Journal:  West J Nurs Res       Date:  2021-07-08       Impact factor: 1.774

6.  Points to consider for sharing variant-level information from clinical genetic testing with ClinVar.

Authors:  Danielle R Azzariti; Erin Rooney Riggs; Christa L Martin; Heidi L Rehm; Annie Niehaus; Laura Lyman Rodriguez; Erin M Ramos; Brandi Kattman; Melissa J Landrum
Journal:  Cold Spring Harb Mol Case Stud       Date:  2018-02-01

7.  Security controls in an integrated Biobank to protect privacy in data sharing: rationale and study design.

Authors:  Takako Takai-Igarashi; Kengo Kinoshita; Masao Nagasaki; Soichi Ogishima; Naoki Nakamura; Sachiko Nagase; Satoshi Nagaie; Tomo Saito; Fuji Nagami; Naoko Minegishi; Yoichi Suzuki; Kichiya Suzuki; Hiroaki Hashizume; Shinichi Kuriyama; Atsushi Hozawa; Nobuo Yaegashi; Shigeo Kure; Gen Tamiya; Yoshio Kawaguchi; Hiroshi Tanaka; Masayuki Yamamoto
Journal:  BMC Med Inform Decis Mak       Date:  2017-07-06       Impact factor: 2.796

8.  Lost in Anonymization - A Data Anonymization Reference Classification Merging Legal and Technical Considerations.

Authors:  Kerstin N Vokinger; Daniel J Stekhoven; Michael Krauthammer
Journal:  J Law Med Ethics       Date:  2020-03       Impact factor: 1.718

9.  Revolutionizing Medical Data Sharing Using Advanced Privacy-Enhancing Technologies: Technical, Legal, and Ethical Synthesis.

Authors:  James Scheibner; Jean Louis Raisaro; Juan Ramón Troncoso-Pastoriza; Marcello Ienca; Jacques Fellay; Effy Vayena; Jean-Pierre Hubaux
Journal:  J Med Internet Res       Date:  2021-02-25       Impact factor: 5.428

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.