Literature DB >> 33442528

Privacy preserving data visualizations.

Demetris Avraam1,2, Rebecca Wilson1,3, Oliver Butters1,3, Thomas Burton4, Christos Nicolaides2,5,6, Elinor Jones7, Andy Boyd8, Paul Burton1.   

Abstract

Data visualizations are a valuable tool used during both statistical analysis and the interpretation of results as they graphically reveal useful information about the structure, properties and relationships between variables, which may otherwise be concealed in tabulated data. In disciplines like medicine and the social sciences, where collected data include sensitive information about study participants, the sharing and publication of individual-level records is controlled by data protection laws and ethico-legal norms. Thus, as data visualizations - such as graphs and plots - may be linked to other released information and used to identify study participants and their personal attributes, their creation is often prohibited by the terms of data use. These restrictions are enforced to reduce the risk of breaching data subject confidentiality, however they limit analysts from displaying useful descriptive plots for their research features and findings. Here we propose the use of anonymization techniques to generate privacy-preserving visualizations that retain the statistical properties of the underlying data while still adhering to strict data disclosure rules. We demonstrate the use of (i) the well-known k-anonymization process which preserves privacy by reducing the granularity of the data using suppression and generalization, (ii) a novel deterministic approach that replaces individual-level observations with the centroids of each k nearest neighbours, and (iii) a probabilistic procedure that perturbs individual attributes with the addition of random stochastic noise. We apply the proposed methods to generate privacy-preserving data visualizations for exploratory data analysis and inferential regression plot diagnostics, and we discuss their strengths and limitations.
© The Author(s) 2020.

Entities:  

Keywords:  Anonymization; Data visualizations; Disclosure control; Privacy protection; Sensitive data

Year:  2021        PMID: 33442528      PMCID: PMC7790778          DOI: 10.1140/epjds/s13688-020-00257-4

Source DB:  PubMed          Journal:  EPJ Data Sci        ISSN: 2193-1127            Impact factor:   3.184


  8 in total

1.  Protecting privacy using k-anonymity.

Authors:  Khaled El Emam; Fida Kamal Dankar
Journal:  J Am Med Inform Assoc       Date:  2008-06-25       Impact factor: 4.497

2.  Visualizing biological data-now and in the future.

Authors:  Seán I O'Donoghue; Anne-Claude Gavin; Nils Gehlenborg; David S Goodsell; Jean-Karim Hériché; Cydney B Nielsen; Chris North; Arthur J Olson; James B Procter; David W Shattuck; Thomas Walter; Bang Wong
Journal:  Nat Methods       Date:  2010-03       Impact factor: 28.547

3.  Seeing is believing: good graphic design principles for medical research.

Authors:  Susan P Duke; Fabrice Bancken; Brenda Crowe; Mat Soukup; Taxiarchis Botsis; Richard Forshee
Journal:  Stat Med       Date:  2015-06-25       Impact factor: 2.373

4.  Data Visualization in Sociology.

Authors:  Kieran Healy; James Moody
Journal:  Annu Rev Sociol       Date:  2014-07

5.  DataSHIELD: taking the analysis to the data, not the data to the analysis.

Authors:  Amadou Gaye; Yannick Marcon; Julia Isaeva; Philippe LaFlamme; Andrew Turner; Elinor M Jones; Joel Minion; Andrew W Boyd; Christopher J Newby; Marja-Liisa Nuotio; Rebecca Wilson; Oliver Butters; Barnaby Murtagh; Ipek Demir; Dany Doiron; Lisette Giepmans; Susan E Wallace; Isabelle Budin-Ljøsne; Carsten Oliver Schmidt; Paolo Boffetta; Mathieu Boniol; Maria Bota; Kim W Carter; Nick deKlerk; Chris Dibben; Richard W Francis; Tero Hiekkalinna; Kristian Hveem; Kirsti Kvaløy; Sean Millar; Ivan J Perry; Annette Peters; Catherine M Phillips; Frank Popham; Gillian Raab; Eva Reischl; Nuala Sheehan; Melanie Waldenberger; Markus Perola; Edwin van den Heuvel; John Macleod; Bartha M Knoppers; Ronald P Stolk; Isabel Fortier; Jennifer R Harris; Bruce H R Woffenbuttel; Madeleine J Murtagh; Vincent Ferretti; Paul R Burton
Journal:  Int J Epidemiol       Date:  2014-09-26       Impact factor: 7.196

Review 6.  Data Safe Havens in health research and healthcare.

Authors:  Paul R Burton; Madeleine J Murtagh; Andy Boyd; James B Williams; Edward S Dove; Susan E Wallace; Anne-Marie Tassé; Julian Little; Rex L Chisholm; Amadou Gaye; Kristian Hveem; Anthony J Brookes; Pat Goodwin; Jon Fistein; Martin Bobrow; Bartha M Knoppers
Journal:  Bioinformatics       Date:  2015-06-25       Impact factor: 6.937

7.  Data Safe Havens and Trust: Toward a Common Understanding of Trusted Research Platforms for Governing Secure and Ethical Health Research.

Authors:  Nathan Christopher Lea; Jacqueline Nicholls; Christine Dobbs; Nayha Sethi; James Cunningham; John Ainsworth; Martin Heaven; Trevor Peacock; Anthony Peacock; Kerina Jones; Graeme Laurie; Dipak Kalra
Journal:  JMIR Med Inform       Date:  2016-06-21

8.  Synthetic ALSPAC longitudinal datasets for the Big Data VR project.

Authors:  Demetris Avraam; Rebecca C Wilson; Paul Burton
Journal:  Wellcome Open Res       Date:  2017-08-30
  8 in total
  1 in total

1.  A deterministic approach for protecting privacy in sensitive personal data.

Authors:  Demetris Avraam; Elinor Jones; Paul Burton
Journal:  BMC Med Inform Decis Mak       Date:  2022-01-28       Impact factor: 2.796

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.