Luca Bonomi1, Zeyun Wu2, Liyue Fan3. 1. Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA. 2. Department of Electrical and Computer Engineering, University of California San Diego, La Jolla, California, USA. 3. Department of Computer Science, University of North Carolina at Charlotte, Charlotte, North Carolina, USA.
Abstract
OBJECTIVE: Emerging technologies (eg, wearable devices) have made it possible to collect data directly from individuals (eg, time-series), providing new insights on the health and well-being of individual patients. Broadening the access to these data would facilitate the integration with existing data sources (eg, clinical and genomic data) and advance medical research. Compared to traditional health data, these data are collected directly from individuals, are highly unique and provide fine-grained information, posing new privacy challenges. In this work, we study the applicability of a novel privacy model to enable individual-level time-series data sharing while maintaining the usability for data analytics. METHODS AND MATERIALS: We propose a privacy-protecting method for sharing individual-level electrocardiography (ECG) time-series data, which leverages dimensional reduction technique and random sampling to achieve provable privacy protection. We show that our solution provides strong privacy protection against an informed adversarial model while enabling useful aggregate-level analysis. RESULTS: We conduct our evaluations on 2 real-world ECG datasets. Our empirical results show that the privacy risk is significantly reduced after sanitization while the data usability is retained for a variety of clinical tasks (eg, predictive modeling and clustering). DISCUSSION: Our study investigates the privacy risk in sharing individual-level ECG time-series data. We demonstrate that individual-level data can be highly unique, requiring new privacy solutions to protect data contributors. CONCLUSION: The results suggest our proposed privacy-protection method provides strong privacy protections while preserving the usefulness of the data.
OBJECTIVE: Emerging technologies (eg, wearable devices) have made it possible to collect data directly from individuals (eg, time-series), providing new insights on the health and well-being of individual patients. Broadening the access to these data would facilitate the integration with existing data sources (eg, clinical and genomic data) and advance medical research. Compared to traditional health data, these data are collected directly from individuals, are highly unique and provide fine-grained information, posing new privacy challenges. In this work, we study the applicability of a novel privacy model to enable individual-level time-series data sharing while maintaining the usability for data analytics. METHODS AND MATERIALS: We propose a privacy-protecting method for sharing individual-level electrocardiography (ECG) time-series data, which leverages dimensional reduction technique and random sampling to achieve provable privacy protection. We show that our solution provides strong privacy protection against an informed adversarial model while enabling useful aggregate-level analysis. RESULTS: We conduct our evaluations on 2 real-world ECG datasets. Our empirical results show that the privacy risk is significantly reduced after sanitization while the data usability is retained for a variety of clinical tasks (eg, predictive modeling and clustering). DISCUSSION: Our study investigates the privacy risk in sharing individual-level ECG time-series data. We demonstrate that individual-level data can be highly unique, requiring new privacy solutions to protect data contributors. CONCLUSION: The results suggest our proposed privacy-protection method provides strong privacy protections while preserving the usefulness of the data.
Authors: A L Goldberger; L A Amaral; L Glass; J M Hausdorff; P C Ivanov; R G Mark; J E Mietus; G B Moody; C K Peng; H E Stanley Journal: Circulation Date: 2000-06-13 Impact factor: 29.690
Authors: Giorgio Quer; Jennifer M Radin; Matteo Gadaleta; Katie Baca-Motes; Lauren Ariniello; Edward Ramos; Vik Kheterpal; Eric J Topol; Steven R Steinhubl Journal: Nat Med Date: 2020-10-29 Impact factor: 53.440
Authors: Marco A F Pimentel; Alistair E W Johnson; Peter H Charlton; Drew Birrenkott; Peter J Watkinson; Lionel Tarassenko; David A Clifton Journal: IEEE Trans Biomed Eng Date: 2016-11-18 Impact factor: 4.538