Jiawei Yuan1, Bradley Malin2, François Modave3, Yi Guo3, William R Hogan3, Elizabeth Shenkman3, Jiang Bian4. 1. Department of Electrical, Computer, Software, & Systems Engineering, Embry-Riddle Aeronautical University, Daytona Beach, FL, United States. 2. Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States; Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, United States. 3. Health Outcomes & Policy, University of Florida, Gainesville, FL, United States. 4. Health Outcomes & Policy, University of Florida, Gainesville, FL, United States. Electronic address: bianjiang@ufl.edu.
Abstract
BACKGROUND: The last few years have witnessed an increasing number of clinical research networks (CRNs) focused on building large collections of data from electronic health records (EHRs), claims, and patient-reported outcomes (PROs). Many of these CRNs provide a service for the discovery of research cohorts with various health conditions, which is especially useful for rare diseases. Supporting patient privacy can enhance the scalability and efficiency of such processes; however, current practice mainly relies on policy, such as guidelines defined in the Health Insurance Portability and Accountability Act (HIPAA), which are insufficient for CRNs (e.g., HIPAA does not require encryption of data - which can mitigate insider threats). By combining policy with privacy enhancing technologies we can enhance the trustworthiness of CRNs. The goal of this research is to determine if searchable encryption can instill privacy in CRNs without sacrificing their usability. METHODS: We developed a technique, implemented in working software to enable privacy-preserving cohort discovery (PPCD) services in large distributed CRNs based on elliptic curve cryptography (ECC). This technique also incorporates a block indexing strategy to improve the performance (in terms of computational running time) of PPCD. We evaluated the PPCD service with three real cohort definitions: (1) elderly cervical cancer patients who underwent radical hysterectomy, (2) oropharyngeal and tongue cancer patients who underwent robotic transoral surgery, and (3) female breast cancer patients who underwent mastectomy) with varied query complexity. These definitions were tested in an encrypted database of 7.1 million records derived from the publically available Healthcare Cost and Utilization Project (HCUP) Nationwide Inpatient Sample (NIS). We assessed the performance of the PPCD service in terms of (1) accuracy in cohort discovery, (2) computational running time, and (3) privacy afforded to the underlying records during PPCD. RESULTS: The empirical results indicate that the proposed PPCD can execute cohort discovery queries in a reasonable amount of time, with query runtime in the range of 165-262s for the 3 use cases, with zero compromise in accuracy. We further show that the search performance is practical because it supports a highly parallelized design for secure evaluation over encrypted records. Additionally, our security analysis shows that the proposed construction is resilient to standard adversaries. CONCLUSIONS: PPCD services can be designed for clinical research networks. The security construction presented in this work specifically achieves high privacy guarantees by preventing both threats originating from within and beyond the network. Copyright Â
BACKGROUND: The last few years have witnessed an increasing number of clinical research networks (CRNs) focused on building large collections of data from electronic health records (EHRs), claims, and patient-reported outcomes (PROs). Many of these CRNs provide a service for the discovery of research cohorts with various health conditions, which is especially useful for rare diseases. Supporting patient privacy can enhance the scalability and efficiency of such processes; however, current practice mainly relies on policy, such as guidelines defined in the Health Insurance Portability and Accountability Act (HIPAA), which are insufficient for CRNs (e.g., HIPAA does not require encryption of data - which can mitigate insider threats). By combining policy with privacy enhancing technologies we can enhance the trustworthiness of CRNs. The goal of this research is to determine if searchable encryption can instill privacy in CRNs without sacrificing their usability. METHODS: We developed a technique, implemented in working software to enable privacy-preserving cohort discovery (PPCD) services in large distributed CRNs based on elliptic curve cryptography (ECC). This technique also incorporates a block indexing strategy to improve the performance (in terms of computational running time) of PPCD. We evaluated the PPCD service with three real cohort definitions: (1) elderly cervical cancerpatients who underwent radical hysterectomy, (2) oropharyngeal and tongue cancerpatients who underwent robotic transoral surgery, and (3) female breast cancerpatients who underwent mastectomy) with varied query complexity. These definitions were tested in an encrypted database of 7.1 million records derived from the publically available Healthcare Cost and Utilization Project (HCUP) Nationwide Inpatient Sample (NIS). We assessed the performance of the PPCD service in terms of (1) accuracy in cohort discovery, (2) computational running time, and (3) privacy afforded to the underlying records during PPCD. RESULTS: The empirical results indicate that the proposed PPCD can execute cohort discovery queries in a reasonable amount of time, with query runtime in the range of 165-262s for the 3 use cases, with zero compromise in accuracy. We further show that the search performance is practical because it supports a highly parallelized design for secure evaluation over encrypted records. Additionally, our security analysis shows that the proposed construction is resilient to standard adversaries. CONCLUSIONS:PPCD services can be designed for clinical research networks. The security construction presented in this work specifically achieves high privacy guarantees by preventing both threats originating from within and beyond the network. Copyright Â
Keywords:
Clinical research network (CRN); Data privacy; OneFlorida Clinical Data Research Network (CDRN); Patient-Centered Clinical Research Network (PCORnet); Privacy-preserving cohort discovery; Searchable encryption
Authors: Erin M George; Ana I Tergas; Cande V Ananth; William M Burke; Sharyn N Lewin; Eri Prendergast; Alfred I Neugut; Dawn L Hershman; Jason D Wright Journal: Gynecol Oncol Date: 2014-04-24 Impact factor: 5.482
Authors: Erica A Voss; Rupa Makadia; Amy Matcho; Qianli Ma; Chris Knoll; Martijn Schuemie; Frank J DeFalco; Ajit Londhe; Vivienne Zhu; Patrick B Ryan Journal: J Am Med Inform Assoc Date: 2015-02-10 Impact factor: 4.497
Authors: Lisa M Schilling; Bethany M Kwan; Charles T Drolshagen; Patrick W Hosokawa; Elias Brandt; Wilson D Pace; Christopher Uhrich; Michael Kamerick; Aidan Bunting; Philip R O Payne; William E Stephens; Joseph M George; Mark Vance; Kelli Giacomini; Jason Braddy; Mika K Green; Michael G Kahn Journal: EGEMS (Wash DC) Date: 2013-10-07
Authors: Rachael L Fleurence; Lesley H Curtis; Robert M Califf; Richard Platt; Joe V Selby; Jeffrey S Brown Journal: J Am Med Inform Assoc Date: 2014-05-12 Impact factor: 4.497
Authors: Jie Xu; Yu Zhang; Huamin Yu; Bo Lin; Dejian Wang; Hong Yuan; Bin Hu; Jun Jiang; Peng Xiang; Te Lin; Huizhe Lu; Guiying Zhang Journal: Ann Transl Med Date: 2022-09
Authors: Elizabeth Shenkman; Myra Hurt; William Hogan; Olveen Carrasquillo; Steven Smith; Andrew Brickman; David Nelson Journal: Acad Med Date: 2018-03 Impact factor: 6.893
Authors: Steven M Smith; Kathryn McAuliffe; Jaclyn M Hall; Caitrin W McDonough; Matthew J Gurka; Temple O Robinson; Ralph L Sacco; Carl Pepine; Elizabeth Shenkman; Rhonda M Cooper-DeHoff Journal: Prev Chronic Dis Date: 2018-03-01 Impact factor: 2.830
Authors: Kassaye Yitbarek Yigzaw; Andrius Budrionis; Luis Marco-Ruiz; Torje Dahle Henriksen; Peder A Halvorsen; Johan Gustav Bellika Journal: BMC Med Inform Decis Mak Date: 2020-06-22 Impact factor: 2.796
Authors: S L Filipp; M Cardel; J Hall; R Z Essner; D J Lemas; D M Janicke; S R Smith; J Nadglowski; W Troy Donahoo; R M Cooper-DeHoff; D R Nelson; W R Hogan; E A Shenkman; M J Gurka Journal: Obes Sci Pract Date: 2018-06-15