Literature DB >> 26567325

A multi-institution evaluation of clinical profile anonymization.

Raymond Heatherly1, Luke V Rasmussen2, Peggy L Peissig3, Jennifer A Pacheco2, Paul Harris4, Joshua C Denny5, Bradley A Malin6.   

Abstract

BACKGROUND AND
OBJECTIVE: There is an increasing desire to share de-identified electronic health records (EHRs) for secondary uses, but there are concerns that clinical terms can be exploited to compromise patient identities. Anonymization algorithms mitigate such threats while enabling novel discoveries, but their evaluation has been limited to single institutions. Here, we study how an existing clinical profile anonymization fares at multiple medical centers.
METHODS: We apply a state-of-the-artk-anonymization algorithm, withkset to the standard value 5, to the International Classification of Disease, ninth edition codes for patients in a hypothyroidism association study at three medical centers: Marshfield Clinic, Northwestern University, and Vanderbilt University. We assess utility when anonymizing at three population levels: all patients in 1) the EHR system; 2) the biorepository; and 3) a hypothyroidism study. We evaluate utility using 1) changes to the number included in the dataset, 2) number of codes included, and 3) regions generalization and suppression were required.
RESULTS: Our findings yield several notable results. First, we show that anonymizing in the context of the entire EHR yields a significantly greater quantity of data by reducing the amount of generalized regions from ∼15% to ∼0.5%. Second, ∼70% of codes that needed generalization only generalized two or three codes in the largest anonymization.
CONCLUSIONS: Sharing large volumes of clinical data in support of phenome-wide association studies is possible while safeguarding privacy to the underlying individuals.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  anonymization; clinical codes; generalization; privacy; secondary use

Mesh:

Year:  2015        PMID: 26567325      PMCID: PMC4954623          DOI: 10.1093/jamia/ocv154

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  25 in total

Review 1.  From genetic privacy to open consent.

Authors:  Jeantine E Lunshof; Ruth Chadwick; Daniel B Vorhaus; George M Church
Journal:  Nat Rev Genet       Date:  2008-05       Impact factor: 53.242

2.  The inevitable application of big data to health care.

Authors:  Travis B Murdoch; Allan S Detsky
Journal:  JAMA       Date:  2013-04-03       Impact factor: 56.272

3.  The disclosure of diagnosis codes can breach research participants' privacy.

Authors:  Grigorios Loukides; Joshua C Denny; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

Review 4.  Publishing data from electronic health records while preserving privacy: a survey of algorithms.

Authors:  Aris Gkoulalas-Divanis; Grigorios Loukides; Jimeng Sun
Journal:  J Biomed Inform       Date:  2014-06-14       Impact factor: 6.317

5.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients.

Authors:  David W Bates; Suchi Saria; Lucila Ohno-Machado; Anand Shah; Gabriel Escobar
Journal:  Health Aff (Millwood)       Date:  2014-07       Impact factor: 6.301

6.  Phenome-wide association studies (PheWASs) for functional variants.

Authors:  Zhan Ye; John Mayer; Lynn Ivacic; Zhiyi Zhou; Min He; Steven J Schrodi; David Page; Murray H Brilliant; Scott J Hebbring
Journal:  Eur J Hum Genet       Date:  2014-07-30       Impact factor: 4.246

7.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors:  Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal:  Nat Biotechnol       Date:  2013-12       Impact factor: 54.908

8.  A systematic review of re-identification attacks on health data.

Authors:  Khaled El Emam; Elizabeth Jonker; Luk Arbuckle; Bradley Malin
Journal:  PLoS One       Date:  2011-12-02       Impact factor: 3.240

9.  Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.

Authors:  Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig
Journal:  PLoS Genet       Date:  2008-08-29       Impact factor: 5.917

10.  PCORnet: turning a dream into reality.

Authors:  Francis S Collins; Kathy L Hudson; Josephine P Briggs; Michael S Lauer
Journal:  J Am Med Inform Assoc       Date:  2014-05-12       Impact factor: 4.497

View more
  5 in total

1.  Privacy Policy and Technology in Biomedical Data Science.

Authors:  April Moreno Arellano; Wenrui Dai; Shuang Wang; Xiaoqian Jiang; Lucila Ohno-Machado
Journal:  Annu Rev Biomed Data Sci       Date:  2018-07

2.  Data Safe Havens and Trust: Toward a Common Understanding of Trusted Research Platforms for Governing Secure and Ethical Health Research.

Authors:  Nathan Christopher Lea; Jacqueline Nicholls; Christine Dobbs; Nayha Sethi; James Cunningham; John Ainsworth; Martin Heaven; Trevor Peacock; Anthony Peacock; Kerina Jones; Graeme Laurie; Dipak Kalra
Journal:  JMIR Med Inform       Date:  2016-06-21

3.  PhenoMeNal: processing and analysis of metabolomics data in the cloud.

Authors:  Kristian Peters; James Bradbury; Sven Bergmann; Marco Capuccini; Marta Cascante; Pedro de Atauri; Timothy M D Ebbels; Carles Foguet; Robert Glen; Alejandra Gonzalez-Beltran; Ulrich L Günther; Evangelos Handakas; Thomas Hankemeier; Kenneth Haug; Stephanie Herman; Petr Holub; Massimiliano Izzo; Daniel Jacob; David Johnson; Fabien Jourdan; Namrata Kale; Ibrahim Karaman; Bita Khalili; Payam Emami Khonsari; Kim Kultima; Samuel Lampa; Anders Larsson; Christian Ludwig; Pablo Moreno; Steffen Neumann; Jon Ander Novella; Claire O'Donovan; Jake T M Pearce; Alina Peluso; Marco Enrico Piras; Luca Pireddu; Michelle A C Reed; Philippe Rocca-Serra; Pierrick Roger; Antonio Rosato; Rico Rueedi; Christoph Ruttkies; Noureddin Sadawi; Reza M Salek; Susanna-Assunta Sansone; Vitaly Selivanov; Ola Spjuth; Daniel Schober; Etienne A Thévenot; Mattia Tomasoni; Merlijn van Rijswijk; Michael van Vliet; Mark R Viant; Ralf J M Weber; Gianluigi Zanetti; Christoph Steinbeck
Journal:  Gigascience       Date:  2019-02-01       Impact factor: 6.524

Review 4.  Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.

Authors:  Raphaël Chevrier; Vasiliki Foufi; Christophe Gaudet-Blavignac; Arnaud Robert; Christian Lovis
Journal:  J Med Internet Res       Date:  2019-05-31       Impact factor: 5.428

Review 5.  Lessons learned from the eMERGE Network: balancing genomics in discovery and practice.

Authors: 
Journal:  HGG Adv       Date:  2020-12-25
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.