Literature DB >> 33289676

Web-Based Privacy-Preserving Multicenter Medical Data Analysis Tools Via Threshold Homomorphic Encryption: Design and Development Study.

Yao Lu1, Tianshu Zhou1, Yu Tian1, Shiqiang Zhu2, Jingsong Li1,2.   

Abstract

BACKGROUND: Data sharing in multicenter medical research can improve the generalizability of research, accelerate progress, enhance collaborations among institutions, and lead to new discoveries from data pooled from multiple sources. Despite these benefits, many medical institutions are unwilling to share their data, as sharing may cause sensitive information to be leaked to researchers, other institutions, and unauthorized users. Great progress has been made in the development of secure machine learning frameworks based on homomorphic encryption in recent years; however, nearly all such frameworks use a single secret key and lack a description of how to securely evaluate the trained model, which makes them impractical for multicenter medical applications.
OBJECTIVE: The aim of this study is to provide a privacy-preserving machine learning protocol for multiple data providers and researchers (eg, logistic regression). This protocol allows researchers to train models and then evaluate them on medical data from multiple sources while providing privacy protection for both the sensitive data and the learned model.
METHODS: We adapted a novel threshold homomorphic encryption scheme to guarantee privacy requirements. We devised new relinearization key generation techniques for greater scalability and multiplicative depth and new model training strategies for simultaneously training multiple models through x-fold cross-validation.
RESULTS: Using a client-server architecture, we evaluated the performance of our protocol. The experimental results demonstrated that, with 10-fold cross-validation, our privacy-preserving logistic regression model training and evaluation over 10 attributes in a data set of 49,152 samples took approximately 7 minutes and 20 minutes, respectively.
CONCLUSIONS: We present the first privacy-preserving multiparty logistic regression model training and evaluation protocol based on threshold homomorphic encryption. Our protocol is practical for real-world use and may promote multicenter medical research to some extent. ©Yao Lu, Tianshu Zhou, Yu Tian, Shiqiang Zhu, Jingsong Li. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 08.12.2020.

Entities:  

Keywords:  confidentiality; logistic regression; machine learning; threshold homomorphic encryption

Year:  2020        PMID: 33289676      PMCID: PMC7755539          DOI: 10.2196/22555

Source DB:  PubMed          Journal:  J Med Internet Res        ISSN: 1438-8871            Impact factor:   5.428


  13 in total

1.  International, multicenter randomized preclinical trials in translational stroke research: it's time to act.

Authors:  Ulrich Dirnagl; Marc Fisher
Journal:  J Cereb Blood Flow Metab       Date:  2012-04-18       Impact factor: 6.200

2.  WebGLORE: a web service for Grid LOgistic REgression.

Authors:  Wenchao Jiang; Pinghao Li; Shuang Wang; Yuan Wu; Meng Xue; Lucila Ohno-Machado; Xiaoqian Jiang
Journal:  Bioinformatics       Date:  2013-09-25       Impact factor: 6.937

3.  SecureLR: Secure Logistic Regression Model via a Hybrid Cryptographic Protocol.

Authors:  Yichen Jiang; Jenny Hamer; Chenghong Wang; Xiaoqian Jiang; Miran Kim; Yongsoo Song; Yuhou Xia; Noman Mohammed; Md Nazmus Sadat; Shuang Wang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2018-05-07       Impact factor: 3.710

Review 4.  A guide to organizing a multicenter clinical trial.

Authors:  Kevin C Chung; Jae W Song
Journal:  Plast Reconstr Surg       Date:  2010-08       Impact factor: 4.730

5.  Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.

Authors:  George Hripcsak; Jon D Duke; Nigam H Shah; Christian G Reich; Vojtech Huser; Martijn J Schuemie; Marc A Suchard; Rae Woong Park; Ian Chi Kei Wong; Peter R Rijnbeek; Johan van der Lei; Nicole Pratt; G Niklas Norén; Yu-Chuan Li; Paul E Stang; David Madigan; Patrick B Ryan
Journal:  Stud Health Technol Inform       Date:  2015

6.  EXpectation Propagation LOgistic REgRession (EXPLORER): distributed privacy-preserving online model learning.

Authors:  Shuang Wang; Xiaoqian Jiang; Yuan Wu; Lijuan Cui; Samuel Cheng; Lucila Ohno-Machado
Journal:  J Biomed Inform       Date:  2013-04-04       Impact factor: 6.317

7.  POPCORN: A web service for individual PrognOsis prediction based on multi-center clinical data CollabORatioN without patient-level data sharing.

Authors:  Yu Tian; Yong Shang; Dan-Yang Tong; Sheng-Qiang Chi; Jun Li; Xiang-Xing Kong; Ke-Feng Ding; Jing-Song Li
Journal:  J Biomed Inform       Date:  2018-08-10       Impact factor: 6.317

8.  A secure distributed logistic regression protocol for the detection of rare adverse drug events.

Authors:  Khaled El Emam; Saeed Samet; Luk Arbuckle; Robyn Tamblyn; Craig Earle; Murat Kantarcioglu
Journal:  J Am Med Inform Assoc       Date:  2012-08-07       Impact factor: 4.497

9.  Privacy-preserving logistic regression training.

Authors:  Charlotte Bonte; Frederik Vercauteren
Journal:  BMC Med Genomics       Date:  2018-10-11       Impact factor: 3.063

10.  Logistic regression model training based on the approximate homomorphic encryption.

Authors:  Andrey Kim; Yongsoo Song; Miran Kim; Keewoo Lee; Jung Hee Cheon
Journal:  BMC Med Genomics       Date:  2018-10-11       Impact factor: 3.063

View more
  1 in total

Review 1.  Machine and cognitive intelligence for human health: systematic review.

Authors:  Xieling Chen; Gary Cheng; Fu Lee Wang; Xiaohui Tao; Haoran Xie; Lingling Xu
Journal:  Brain Inform       Date:  2022-02-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.