Literature DB >> 20385806

Anonymization of electronic medical records for validating genome-wide association studies.

Grigorios Loukides1, Aris Gkoulalas-Divanis, Bradley Malin.   

Abstract

Genome-wide association studies (GWAS) facilitate the discovery of genotype-phenotype relations from population-based sequence databases, which is an integral facet of personalized medicine. The increasing adoption of electronic medical records allows large amounts of patients' standardized clinical features to be combined with the genomic sequences of these patients and shared to support validation of GWAS findings and to enable novel discoveries. However, disseminating these data "as is" may lead to patient reidentification when genomic sequences are linked to resources that contain the corresponding patients' identity information based on standardized clinical features. This work proposes an approach that provably prevents this type of data linkage and furnishes a result that helps support GWAS. Our approach automatically extracts potentially linkable clinical features and modifies them in a way that they can no longer be used to link a genomic sequence to a small number of patients, while preserving the associations between genomic sequences and specific sets of clinical features corresponding to GWAS-related diseases. Extensive experiments with real patient data derived from the Vanderbilt's University Medical Center verify that our approach generates data that eliminate the threat of individual reidentification, while supporting GWAS validation and clinical case analysis tasks.

Entities:  

Mesh:

Year:  2010        PMID: 20385806      PMCID: PMC2867915          DOI: 10.1073/pnas.0911686107

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  9 in total

Review 1.  Quality assurance of medical ontologies.

Authors:  J E Rogers
Journal:  Methods Inf Med       Date:  2006       Impact factor: 2.176

Review 2.  A call for the creation of personalized medicine databases.

Authors:  David Gurwitz; Jeantine E Lunshof; Russ B Altman
Journal:  Nat Rev Drug Discov       Date:  2006-01       Impact factor: 84.694

3.  The NCBI dbGaP database of genotypes and phenotypes.

Authors:  Matthew D Mailman; Michael Feolo; Yumi Jin; Masato Kimura; Kimberly Tryka; Rinat Bagoutdinov; Luning Hao; Anne Kiang; Justin Paschall; Lon Phan; Natalia Popova; Stephanie Pretel; Lora Ziyabari; Moira Lee; Yu Shao; Zhen Y Wang; Karl Sirotkin; Minghong Ward; Michael Kholodov; Kerry Zbicz; Jeffrey Beck; Michael Kimelman; Sergey Shevelev; Don Preuss; Eugene Yaschenko; Alan Graeff; James Ostell; Stephen T Sherry
Journal:  Nat Genet       Date:  2007-10       Impact factor: 38.330

4.  Development of a large-scale de-identified DNA biobank to enable personalized medicine.

Authors:  D M Roden; J M Pulley; M A Basford; G R Bernard; E W Clayton; J R Balser; D R Masys
Journal:  Clin Pharmacol Ther       Date:  2008-05-21       Impact factor: 6.875

Review 5.  A HapMap harvest of insights into the genetics of common disease.

Authors:  Teri A Manolio; Lisa D Brooks; Francis S Collins
Journal:  J Clin Invest       Date:  2008-05       Impact factor: 14.808

6.  The disclosure of diagnosis codes can breach research participants' privacy.

Authors:  Grigorios Loukides; Joshua C Denny; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

7.  Progress and challenges in genome-wide association studies in humans.

Authors:  Peter Donnelly
Journal:  Nature       Date:  2008-12-11       Impact factor: 49.962

8.  A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants.

Authors:  Laura J Scott; Karen L Mohlke; Lori L Bonnycastle; Cristen J Willer; Yun Li; William L Duren; Michael R Erdos; Heather M Stringham; Peter S Chines; Anne U Jackson; Ludmila Prokunina-Olsson; Chia-Jen Ding; Amy J Swift; Narisu Narisu; Tianle Hu; Randall Pruim; Rui Xiao; Xiao-Yi Li; Karen N Conneely; Nancy L Riebow; Andrew G Sprau; Maurine Tong; Peggy P White; Kurt N Hetrick; Michael W Barnhart; Craig W Bark; Janet L Goldstein; Lee Watkins; Fang Xiang; Jouko Saramies; Thomas A Buchanan; Richard M Watanabe; Timo T Valle; Leena Kinnunen; Gonçalo R Abecasis; Elizabeth W Pugh; Kimberly F Doheny; Richard N Bergman; Jaakko Tuomilehto; Francis S Collins; Michael Boehnke
Journal:  Science       Date:  2007-04-26       Impact factor: 47.728

9.  Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.

Authors:  Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig
Journal:  PLoS Genet       Date:  2008-08-29       Impact factor: 5.917

  9 in total
  38 in total

1.  Attribute Utility Motivated k-anonymization of datasets to support the heterogeneous needs of biomedical researchers.

Authors:  Huimin Ye; Elizabeth S Chen
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Anonymization of longitudinal electronic medical records.

Authors:  Acar Tamersoy; Grigorios Loukides; Mehmet Ercan Nergiz; Yucel Saygin; Bradley Malin
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-01-27

Review 3.  Electronic medical records as a tool in clinical pharmacology: opportunities and challenges.

Authors:  D M Roden; H Xu; J C Denny; R A Wilke
Journal:  Clin Pharmacol Ther       Date:  2012-06       Impact factor: 6.875

4.  Ethical and practical challenges to studying patients who opt out of large-scale biorepository research.

Authors:  S Trent Rosenbloom; Jennifer L Madison; Kyle B Brothers; Erica A Bowton; Ellen Wright Clayton; Bradley A Malin; Dan M Roden; Jill Pulley
Journal:  J Am Med Inform Assoc       Date:  2013-07-25       Impact factor: 4.497

5.  Charting a course for genomic medicine from base pairs to bedside.

Authors:  Eric D Green; Mark S Guyer
Journal:  Nature       Date:  2011-02-10       Impact factor: 49.962

6.  Birth month affects lifetime disease risk: a phenome-wide method.

Authors:  Mary Regina Boland; Zachary Shahn; David Madigan; George Hripcsak; Nicholas P Tatonetti
Journal:  J Am Med Inform Assoc       Date:  2015-06-02       Impact factor: 4.497

7.  PREMIX: PRivacy-preserving EstiMation of Individual admiXture.

Authors:  Feng Chen; Michelle Dow; Sijie Ding; Yao Lu; Xiaoqian Jiang; Hua Tang; Shuang Wang
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

Review 8.  Identifiability in biobanks: models, measures, and mitigation strategies.

Authors:  Bradley Malin; Grigorios Loukides; Kathleen Benitez; Ellen Wright Clayton
Journal:  Hum Genet       Date:  2011-07-08       Impact factor: 4.132

9.  Scalable privacy-preserving data sharing methodology for genome-wide association studies.

Authors:  Fei Yu; Stephen E Fienberg; Aleksandra B Slavković; Caroline Uhler
Journal:  J Biomed Inform       Date:  2014-02-06       Impact factor: 6.317

Review 10.  Personalized cardiovascular medicine: concepts and methodological considerations.

Authors:  Henry Völzke; Carsten O Schmidt; Sebastian E Baumeister; Till Ittermann; Glenn Fung; Janina Krafczyk-Korth; Wolfgang Hoffmann; Matthias Schwab; Henriette E Meyer zu Schwabedissen; Marcus Dörr; Stephan B Felix; Wolfgang Lieb; Heyo K Kroemer
Journal:  Nat Rev Cardiol       Date:  2013-03-26       Impact factor: 32.419

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.