Literature DB >> 26554428

VERTIcal Grid lOgistic regression (VERTIGO).

Yong Li1, Xiaoqian Jiang2, Shuang Wang3, Hongkai Xiong1, Lucila Ohno-Machado3.   

Abstract

OBJECTIVE: To develop an accurate logistic regression (LR) algorithm to support federated data analysis of vertically partitioned distributed data sets.
MATERIAL AND METHODS: We propose a novel technique that solves the binary LR problem by dual optimization to obtain a global solution for vertically partitioned data. We evaluated this new method, VERTIcal Grid lOgistic regression (VERTIGO), in artificial and real-world medical classification problems in terms of the area under the receiver operating characteristic curve, calibration, and computational complexity. We assumed that the institutions could "align" patient records (through patient identifiers or hashed "privacy-protecting" identifiers), and also that they both had access to the values for the dependent variable in the LR model (eg, that if the model predicts death, both institutions would have the same information about death).
RESULTS: The solution derived by VERTIGO has the same estimated parameters as the solution derived by applying classical LR. The same is true for discrimination and calibration over both simulated and real data sets. In addition, the computational cost of VERTIGO is not prohibitive in practice. DISCUSSION: There is a technical challenge in scaling up federated LR for vertically partitioned data. When the number of patients m is large, our algorithm has to invert a large Hessian matrix. This is an expensive operation of time complexity O(m(3)) that may require large amounts of memory for storage and exchange of information. The algorithm may also not work well when the number of observations in each class is highly imbalanced.
CONCLUSION: The proposed VERTIGO algorithm can generate accurate global models to support federated data analysis of vertically partitioned data. Published by Oxford University Press on behalf of the American Medical Informatics Association 2015. This work is written by US Government employees and is in the public domain in the US.

Entities:  

Keywords:  dual optimization; federated data analysis; logistic regression; vertically partitioned data

Mesh:

Year:  2015        PMID: 26554428      PMCID: PMC4901373          DOI: 10.1093/jamia/ocv146

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  7 in total

1.  Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database.

Authors:  Mohammed Saeed; Mauricio Villarroel; Andrew T Reisner; Gari Clifford; Li-Wei Lehman; George Moody; Thomas Heldt; Tin H Kyaw; Benjamin Moody; Roger G Mark
Journal:  Crit Care Med       Date:  2011-05       Impact factor: 7.598

2.  Effect of data combination on predictive modeling: a study using gene expression data.

Authors:  Melanie Osl; Stephan Dreiseitl; Jihoon Kim; Kiltesh Patel; Christian Baumgartner; Lucila Ohno-Machado
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

3.  Early diagnosis of acute myocardial infarction using clinical and electrocardiographic data at presentation: derivation and evaluation of logistic regression models.

Authors:  R L Kennedy; A M Burton; H S Fraser; L N McStay; R F Harrison
Journal:  Eur Heart J       Date:  1996-08       Impact factor: 29.983

Review 4.  A comparison of goodness-of-fit tests for the logistic regression model.

Authors:  D W Hosmer; T Hosmer; S Le Cessie; S Lemeshow
Journal:  Stat Med       Date:  1997-05-15       Impact factor: 2.373

5.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

6.  Grid Binary LOgistic REgression (GLORE): building shared models without sharing data.

Authors:  Yuan Wu; Xiaoqian Jiang; Jihoon Kim; Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2012-04-17       Impact factor: 4.497

7.  pSCANNER: patient-centered Scalable National Network for Effectiveness Research.

Authors:  Lucila Ohno-Machado; Zia Agha; Douglas S Bell; Lisa Dahm; Michele E Day; Jason N Doctor; Davera Gabriel; Maninder K Kahlon; Katherine K Kim; Michael Hogarth; Michael E Matheny; Daniella Meeker; Jonathan R Nebeker
Journal:  J Am Med Inform Assoc       Date:  2014-04-29       Impact factor: 4.497

  7 in total
  18 in total

1.  Privacy-Preserving Methods for Vertically Partitioned Incomplete Data.

Authors:  Yi Deng; Xiaoqian Jiang; Qi Long
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

2.  VANTAGE6: an open source priVAcy preserviNg federaTed leArninG infrastructurE for Secure Insight eXchange.

Authors:  Arturo Moncada-Torres; Frank Martin; Melle Sieswerda; Johan Van Soest; Gijs Geleijnse
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

3.  PRINCESS: Privacy-protecting Rare disease International Network Collaboration via Encryption through Software guard extensionS.

Authors:  Feng Chen; Shuang Wang; Xiaoqian Jiang; Sijie Ding; Yao Lu; Jihoon Kim; S Cenk Sahinalp; Chisato Shimizu; Jane C Burns; Victoria J Wright; Eileen Png; Martin L Hibberd; David D Lloyd; Hai Yang; Amalio Telenti; Cinnamon S Bloss; Dov Fox; Kristin Lauter; Lucila Ohno-Machado
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

4.  Clinical Research Informatics for Big Data and Precision Medicine.

Authors:  C Weng; M G Kahn
Journal:  Yearb Med Inform       Date:  2016-11-10

5.  VERTICOX: Vertically Distributed Cox Proportional Hazards Model Using the Alternating Direction Method of Multipliers.

Authors:  Wenrui Dai; Xiaoqian Jiang; Luca Bonomi; Yong Li; Hongkai Xiong; Lucila Ohno-Machado
Journal:  IEEE Trans Knowl Data Eng       Date:  2020-04-22       Impact factor: 9.235

6.  Privacy Policy and Technology in Biomedical Data Science.

Authors:  April Moreno Arellano; Wenrui Dai; Shuang Wang; Xiaoqian Jiang; Lucila Ohno-Machado
Journal:  Annu Rev Biomed Data Sci       Date:  2018-07

Review 7.  Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States.

Authors:  Shuang Wang; Xiaoqian Jiang; Siddharth Singh; Rebecca Marmor; Luca Bonomi; Dov Fox; Michelle Dow; Lucila Ohno-Machado
Journal:  Ann N Y Acad Sci       Date:  2016-09-28       Impact factor: 5.691

8.  Federated Tensor Factorization for Computational Phenotyping.

Authors:  Yejin Kim; Jimeng Sun; Hwanjo Yu; Xiaoqian Jiang
Journal:  KDD       Date:  2017-08

9.  Privacy-Preserving Artificial Intelligence Techniques in Biomedicine.

Authors:  Reihaneh Torkzadehmahani; Reza Nasirigerdeh; David B Blumenthal; Tim Kacprowski; Markus List; Julian Matschinske; Julian Spaeth; Nina Kerstin Wenke; Jan Baumbach
Journal:  Methods Inf Med       Date:  2022-01-21       Impact factor: 1.800

10.  Secure Multi-pArty Computation Grid LOgistic REgression (SMAC-GLORE).

Authors:  Haoyi Shi; Chao Jiang; Wenrui Dai; Xiaoqian Jiang; Yuzhe Tang; Lucila Ohno-Machado; Shuang Wang
Journal:  BMC Med Inform Decis Mak       Date:  2016-07-25       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.