| Literature DB >> 34457153 |
Eric W Lee1, Li Xiong1, Vicki Stover Hertzberg2, Roy L Simpson2, Joyce C Ho1.
Abstract
From electronic health records (EHRs), the relationship between patients' conditions, treatments, and outcomes can be discovered and used in various healthcare research tasks such as risk prediction. In practice, EHRs can be stored in one or more data warehouses, and mining from distributed data sources becomes challenging. Another challenge arises from privacy laws because patient data cannot be used without some patient privacy guarantees. Thus, in this paper, we propose a privacy-preserving framework using sequential pattern mining in distributed data sources. Our framework extracts patterns from each source and shares patterns with other sources to discover discriminative and representative patterns that can be used for risk prediction while preserving privacy. We demonstrate our framework using a case study of predicting Cardiovascular Disease in patients with type 2 diabetes and show the effectiveness of our framework with several sources and by applying differential privacy mechanisms. ©2021 AMIA - All rights reserved.Entities:
Mesh:
Year: 2021 PMID: 34457153 PMCID: PMC8378625
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc