Luca Bonomi1, Xiaoqian Jiang2, Lucila Ohno-Machado1,3. 1. Department of Biomedical Informatics, UC San Diego Health, University of California, San Diego, La Jolla, California, USA. 2. School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA. 3. Division of Health Services Research and Development, VA San Diego Healthcare System, La Jolla, California, USA.
Abstract
OBJECTIVE: Survival analysis is the cornerstone of many healthcare applications in which the "survival" probability (eg, time free from a certain disease, time to death) of a group of patients is computed to guide clinical decisions. It is widely used in biomedical research and healthcare applications. However, frequent sharing of exact survival curves may reveal information about the individual patients, as an adversary may infer the presence of a person of interest as a participant of a study or of a particular group. Therefore, it is imperative to develop methods to protect patient privacy in survival analysis. MATERIALS AND METHODS: We develop a framework based on the formal model of differential privacy, which provides provable privacy protection against a knowledgeable adversary. We show the performance of privacy-protecting solutions for the widely used Kaplan-Meier nonparametric survival model. RESULTS: We empirically evaluated the usefulness of our privacy-protecting framework and the reduced privacy risk for a popular epidemiology dataset and a synthetic dataset. Results show that our methods significantly reduce the privacy risk when compared with their nonprivate counterparts, while retaining the utility of the survival curves. DISCUSSION: The proposed framework demonstrates the feasibility of conducting privacy-protecting survival analyses. We discuss future research directions to further enhance the usefulness of our proposed solutions in biomedical research applications. CONCLUSION: The results suggest that our proposed privacy-protection methods provide strong privacy protections while preserving the usefulness of survival analyses.
OBJECTIVE: Survival analysis is the cornerstone of many healthcare applications in which the "survival" probability (eg, time free from a certain disease, time to death) of a group of patients is computed to guide clinical decisions. It is widely used in biomedical research and healthcare applications. However, frequent sharing of exact survival curves may reveal information about the individual patients, as an adversary may infer the presence of a person of interest as a participant of a study or of a particular group. Therefore, it is imperative to develop methods to protect patient privacy in survival analysis. MATERIALS AND METHODS: We develop a framework based on the formal model of differential privacy, which provides provable privacy protection against a knowledgeable adversary. We show the performance of privacy-protecting solutions for the widely used Kaplan-Meier nonparametric survival model. RESULTS: We empirically evaluated the usefulness of our privacy-protecting framework and the reduced privacy risk for a popular epidemiology dataset and a synthetic dataset. Results show that our methods significantly reduce the privacy risk when compared with their nonprivate counterparts, while retaining the utility of the survival curves. DISCUSSION: The proposed framework demonstrates the feasibility of conducting privacy-protecting survival analyses. We discuss future research directions to further enhance the usefulness of our proposed solutions in biomedical research applications. CONCLUSION: The results suggest that our proposed privacy-protection methods provide strong privacy protections while preserving the usefulness of survival analyses.
Authors: N Foldvary; B Nashold; E Mascha; E A Thompson; N Lee; J O McNamara; D V Lewis; J S Luther; A H Friedman; R A Radtke Journal: Neurology Date: 2000-02-08 Impact factor: 9.910
Authors: T E Le Voyer; E R Sigurdson; A L Hanlon; R J Mayer; J S Macdonald; P J Catalano; D G Haller Journal: J Clin Oncol Date: 2003-08-01 Impact factor: 44.544
Authors: Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig Journal: PLoS Genet Date: 2008-08-29 Impact factor: 5.917