Literature DB >> 18495290

Remote access methods for exploratory data analysis and statistical modelling: Privacy-Preserving Analytics.

Ross Sparks1, Chris Carter, John B Donnelly, Christine M O'Keefe, Jodie Duncan, Tim Keighley, Damien McAullay.   

Abstract

This paper is concerned with the challenge of enabling the use of confidential or private data for research and policy analysis, while protecting confidentiality and privacy by reducing the risk of disclosure of sensitive information. Traditional solutions to the problem of reducing disclosure risk include releasing de-identified data and modifying data before release. In this paper we discuss the alternative approach of using a remote analysis server which does not enable any data release, but instead is designed to deliver useful results of user-specified statistical analyses with a low risk of disclosure. The techniques described in this paper enable a user to conduct a wide range of methods in exploratory data analysis, regression and survival analysis, while at the same time reducing the risk that the user can read or infer any individual record attribute value. We illustrate our methods with examples from biostatistics using publicly available data. We have implemented our techniques into a software demonstrator called Privacy-Preserving Analytics (PPA), via a web-based interface to the R software. We believe that PPA may provide an effective balance between the competing goals of providing useful information and reducing disclosure risk in some situations.

Entities:  

Mesh:

Year:  2008        PMID: 18495290     DOI: 10.1016/j.cmpb.2008.04.001

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  8 in total

1.  Never too old for anonymity: a statistical standard for demographic data sharing via the HIPAA Privacy Rule.

Authors:  Bradley Malin; Kathleen Benitez; Daniel Masys
Journal:  J Am Med Inform Assoc       Date:  2011 Jan-Feb       Impact factor: 4.497

2.  Privacy by Design at Population Data BC: a case study describing the technical, administrative, and physical controls for privacy-sensitive secondary use of personal information for research in the public interest.

Authors:  Caitlin Pencarrick Hertzman; Nancy Meagher; Kimberlyn M McGrail
Journal:  J Am Med Inform Assoc       Date:  2012-08-30       Impact factor: 4.497

3.  Sharing Time-to-Event Data with Privacy Protection.

Authors:  Luca Bonomi; Liyue Fan
Journal:  IEEE Int Conf Healthc Inform       Date:  2022-09-08

4.  EXpectation Propagation LOgistic REgRession (EXPLORER): distributed privacy-preserving online model learning.

Authors:  Shuang Wang; Xiaoqian Jiang; Yuan Wu; Lijuan Cui; Samuel Cheng; Lucila Ohno-Machado
Journal:  J Biomed Inform       Date:  2013-04-04       Impact factor: 6.317

5.  Anonymization for outputs of population health and health services research conducted via an online data center.

Authors:  Christine M O'Keefe; Mark Westcott; Maree O'Sullivan; Adrien Ickowicz; Tim Churches
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

6.  A secure distributed logistic regression protocol for the detection of rare adverse drug events.

Authors:  Khaled El Emam; Saeed Samet; Luk Arbuckle; Robyn Tamblyn; Craig Earle; Murat Kantarcioglu
Journal:  J Am Med Inform Assoc       Date:  2012-08-07       Impact factor: 4.497

7.  Supporting Regularized Logistic Regression Privately and Efficiently.

Authors:  Wenfa Li; Hongzhe Liu; Peng Yang; Wei Xie
Journal:  PLoS One       Date:  2016-06-06       Impact factor: 3.240

8.  Secure Multi-pArty Computation Grid LOgistic REgression (SMAC-GLORE).

Authors:  Haoyi Shi; Chao Jiang; Wenrui Dai; Xiaoqian Jiang; Yuzhe Tang; Lucila Ohno-Machado; Shuang Wang
Journal:  BMC Med Inform Decis Mak       Date:  2016-07-25       Impact factor: 2.796

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.