Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A multi-institution evaluation of clinical profile anonymization.

Literature DB >> 26567325

A multi-institution evaluation of clinical profile anonymization.

Raymond Heatherly¹, Luke V Rasmussen², Peggy L Peissig³, Jennifer A Pacheco², Paul Harris⁴, Joshua C Denny⁵, Bradley A Malin⁶.

Abstract

BACKGROUND AND
OBJECTIVE: There is an increasing desire to share de-identified electronic health records (EHRs) for secondary uses, but there are concerns that clinical terms can be exploited to compromise patient identities. Anonymization algorithms mitigate such threats while enabling novel discoveries, but their evaluation has been limited to single institutions. Here, we study how an existing clinical profile anonymization fares at multiple medical centers.
METHODS: We apply a state-of-the-artk-anonymization algorithm, withkset to the standard value 5, to the International Classification of Disease, ninth edition codes for patients in a hypothyroidism association study at three medical centers: Marshfield Clinic, Northwestern University, and Vanderbilt University. We assess utility when anonymizing at three population levels: all patients in 1) the EHR system; 2) the biorepository; and 3) a hypothyroidism study. We evaluate utility using 1) changes to the number included in the dataset, 2) number of codes included, and 3) regions generalization and suppression were required.
RESULTS: Our findings yield several notable results. First, we show that anonymizing in the context of the entire EHR yields a significantly greater quantity of data by reducing the amount of generalized regions from ∼15% to ∼0.5%. Second, ∼70% of codes that needed generalization only generalized two or three codes in the largest anonymization.
CONCLUSIONS: Sharing large volumes of clinical data in support of phenome-wide association studies is possible while safeguarding privacy to the underlying individuals.

Entities: Disease Species

Keywords: anonymization; clinical codes; generalization; privacy; secondary use

Mesh：

Year: 2015 PMID： 26567325 PMCID： PMC4954623 DOI： 10.1093/jamia/ocv154

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN： 1067-5027 Impact factor: 4.497

25 in total

Review 1. From genetic privacy to open consent.

Authors: Jeantine E Lunshof; Ruth Chadwick; Daniel B Vorhaus; George M Church
Journal: Nat Rev Genet Date: 2008-05 Impact factor: 53.242

2. The inevitable application of big data to health care.

Authors: Travis B Murdoch; Allan S Detsky
Journal: JAMA Date: 2013-04-03 Impact factor: 56.272

3. The disclosure of diagnosis codes can breach research participants' privacy.

Authors: Grigorios Loukides; Joshua C Denny; Bradley Malin
Journal: J Am Med Inform Assoc Date: 2010 May-Jun Impact factor: 4.497

Review 4. Publishing data from electronic health records while preserving privacy: a survey of algorithms.

Authors: Aris Gkoulalas-Divanis; Grigorios Loukides; Jimeng Sun
Journal: J Biomed Inform Date: 2014-06-14 Impact factor: 6.317

5. Big data in health care: using analytics to identify and manage high-risk and high-cost patients.

Authors: David W Bates; Suchi Saria; Lucila Ohno-Machado; Anand Shah; Gabriel Escobar
Journal: Health Aff (Millwood) Date: 2014-07 Impact factor: 6.301

6. Phenome-wide association studies (PheWASs) for functional variants.

Authors: Zhan Ye; John Mayer; Lynn Ivacic; Zhiyi Zhou; Min He; Steven J Schrodi; David Page; Murray H Brilliant; Scott J Hebbring
Journal: Eur J Hum Genet Date: 2014-07-30 Impact factor: 4.246

7. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors: Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal: Nat Biotechnol Date: 2013-12 Impact factor: 54.908

8. A systematic review of re-identification attacks on health data.

Authors: Khaled El Emam; Elizabeth Jonker; Luk Arbuckle; Bradley Malin
Journal: PLoS One Date: 2011-12-02 Impact factor: 3.240

9. Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.

Authors: Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig
Journal: PLoS Genet Date: 2008-08-29 Impact factor: 5.917

10. PCORnet: turning a dream into reality.

Authors: Francis S Collins; Kathy L Hudson; Josephine P Briggs; Michael S Lauer
Journal: J Am Med Inform Assoc Date: 2014-05-12 Impact factor: 4.497

5 in total

1. Privacy Policy and Technology in Biomedical Data Science.

Authors: April Moreno Arellano; Wenrui Dai; Shuang Wang; Xiaoqian Jiang; Lucila Ohno-Machado
Journal: Annu Rev Biomed Data Sci Date: 2018-07

2. Data Safe Havens and Trust: Toward a Common Understanding of Trusted Research Platforms for Governing Secure and Ethical Health Research.

Authors: Nathan Christopher Lea; Jacqueline Nicholls; Christine Dobbs; Nayha Sethi; James Cunningham; John Ainsworth; Martin Heaven; Trevor Peacock; Anthony Peacock; Kerina Jones; Graeme Laurie; Dipak Kalra
Journal: JMIR Med Inform Date: 2016-06-21

3. PhenoMeNal: processing and analysis of metabolomics data in the cloud.

Authors: Kristian Peters; James Bradbury; Sven Bergmann; Marco Capuccini; Marta Cascante; Pedro de Atauri; Timothy M D Ebbels; Carles Foguet; Robert Glen; Alejandra Gonzalez-Beltran; Ulrich L Günther; Evangelos Handakas; Thomas Hankemeier; Kenneth Haug; Stephanie Herman; Petr Holub; Massimiliano Izzo; Daniel Jacob; David Johnson; Fabien Jourdan; Namrata Kale; Ibrahim Karaman; Bita Khalili; Payam Emami Khonsari; Kim Kultima; Samuel Lampa; Anders Larsson; Christian Ludwig; Pablo Moreno; Steffen Neumann; Jon Ander Novella; Claire O'Donovan; Jake T M Pearce; Alina Peluso; Marco Enrico Piras; Luca Pireddu; Michelle A C Reed; Philippe Rocca-Serra; Pierrick Roger; Antonio Rosato; Rico Rueedi; Christoph Ruttkies; Noureddin Sadawi; Reza M Salek; Susanna-Assunta Sansone; Vitaly Selivanov; Ola Spjuth; Daniel Schober; Etienne A Thévenot; Mattia Tomasoni; Merlijn van Rijswijk; Michael van Vliet; Mark R Viant; Ralf J M Weber; Gianluigi Zanetti; Christoph Steinbeck
Journal: Gigascience Date: 2019-02-01 Impact factor: 6.524

Review 4. Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.

Authors: Raphaël Chevrier; Vasiliki Foufi; Christophe Gaudet-Blavignac; Arnaud Robert; Christian Lovis
Journal: J Med Internet Res Date: 2019-05-31 Impact factor: 5.428

Review 5. Lessons learned from the eMERGE Network: balancing genomics in discovery and practice.

Authors:
Journal: HGG Adv Date: 2020-12-25

5 in total