| Literature DB >> 32918447 |
J L Raisaro1, Francesco Marino2, Juan Troncoso-Pastoriza2, Raphaelle Beau-Lejdstrom3, Riccardo Bellazzi4,5, Robert Murphy6, Elmer V Bernstam6,7, Henry Wang8, Mauro Bucalo9, Yong Chen10, Assaf Gottlieb6, Arif Harmanci6, Miran Kim6, Yejin Kim6, Jeffrey Klann11, Catherine Klersy12, Bradley A Malin13, Marie Méan14, Fabian Prasser15,16, Luigia Scudeller17, Ali Torkamani18, Julien Vaucher14, Mamta Puppala19, Stephen T C Wong19, Milana Frenkel-Morgenstern20, Hua Xu6, Baba Maiyaki Musa21, Abdulrazaq G Habib21, Trevor Cohen22, Adam Wilcox22, Hamisu M Salihu23, Heidi Sofia24, Xiaoqian Jiang6, J P Hubaux2.
Abstract
Global pandemics call for large and diverse healthcare data to study various risk factors, treatment options, and disease progression patterns. Despite the enormous efforts of many large data consortium initiatives, scientific community still lacks a secure and privacy-preserving infrastructure to support auditable data sharing and facilitate automated and legally compliant federated analysis on an international scale. Existing health informatics systems do not incorporate the latest progress in modern security and federated machine learning algorithms, which are poised to offer solutions. An international group of passionate researchers came together with a joint mission to solve the problem with our finest models and tools. The SCOR Consortium has developed a ready-to-deploy secure infrastructure using world-class privacy and security technologies to reconcile the privacy/utility conflicts. We hope our effort will make a change and accelerate research in future pandemics with broad and diverse samples on an international scale.Entities:
Keywords: COVID-19; federated learning; healthcare privacy; international consortium; secure data analysis
Mesh:
Year: 2020 PMID: 32918447 PMCID: PMC7454652 DOI: 10.1093/jamia/ocaa172
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Comparison of SCOR with similar data-sharing initiatives
| Initiative | Type of analysis | Data storage | Scope | Type of data transferred | Data protection mechanism | Level of automation |
|---|---|---|---|---|---|---|
| 4CE | meta-analysis | decentralized | international | aggregate-level | local obfuscation | manual analysis |
| ACT Network | cohort exploration | decentralized | national (USA) | aggregate-level | local obfuscation | fully automated system (SHRINE |
| LEOSS | centralized analytics | centralized | international (only EU) | patient-level | anonymization | manual analysis |
| OHDSI | meta-analysis | decentralized | international | aggregate-level | local obfuscation | manual analysis |
| PCORNet CDRNs | meta-analysis | decentralized | national (USA) | aggregate-level | local obfuscation | manual analysis |
| N3C | centralized analytics | centralized | national (USA) | patient-level | anonymization | manual analysis |
| SCOR | cohort exploration and decentralized analytics | decentralized | international | aggregate-level | encryption & global obfuscation | fully automated system (MedCO |
Comparison of fully automated systems for COVID-19 data sharing is reported in Table 2 below.
Comparison between available medical distributed analysis platforms
| Functionalities | Safe settings | Safe output | |||
|---|---|---|---|---|---|
| Platform | Cohort exploration | Distributed analytics | Secure aggregation | Local obfuscation | Global obfuscation |
| SHRINE | • | • | |||
| Medical Informatics Platform | • | • | |||
| DataShield | • | • | • | ||
| MedCo | • | • | • | • | • |
Figure 1.MedCo core technologies. MedCo is a decentralized software system that uses cutting-edge privacy-preserving technologies to enable the secure sharing of medical data among health institutions. It builds on 3 core privacy-preserving technologies: homomorphic encryption, secure multiparty computation, and data obfuscation. These technologies are used in synergy to combine information owned by multiple institutions and reveal otherwise hidden global insights while addressing legal and privacy concerns.
Figure 2.The SCOR MedCo approach: when an institution queries the virtual collective dataset, it engages in a distributed cryptographic protocol with all the other institutions to securely obtain the result of the query. MedCo provides end-to-end protection against unauthorized access to data thanks to homomorphic encryption, which allows keeping the data in an encrypted state not only at rest and in transit but also during computation (safe settings). MedCo also removes the need for a central trusted authority by leveraging secure multiparty computation. The result of a query/analysis can be decrypted only through a distributed protocol that involves the approval of all the participating institutions. If 1 or more institutions are compromised by a cyber attack, the others can refuse to decrypt the data, thus keeping the data secure.