| Literature DB >> 35070169 |
Vasileios C Pezoulas1, Andreas Goules2, Fanis Kalatzis1, Luke Chatzis2, Konstantina D Kourou1, Aliki Venetsanopoulou2,3, Themis P Exarchos1,4, Saviana Gandolfo5, Konstantinos Votis6, Evi Zampeli7, Jan Burmeister8, Thorsten May8, Manuel Marcelino Pérez9, Iryna Lishchuk10, Thymios Chondrogiannis11, Vassiliki Andronikou11, Theodora Varvarigou11, Nenad Filipovic12, Manolis Tsiknakis13, Chiara Baldini14, Michele Bombardieri15, Hendrika Bootsma16, Simon J Bowman17, Muhammad Shahnawaz Soyfoo18, Dorian Parisis19, Christine Delporte19, Valérie Devauchelle-Pensec20, Jacques-Olivier Pers20, Thomas Dörner21, Elena Bartoloni22, Roberto Gerli22, Roberto Giacomelli23, Roland Jonsson24, Wan-Fai Ng25, Roberta Priori26, Manuel Ramos-Casals27, Kathy Sivils28, Fotini Skopouli7,29, Witte Torsten30, Joel A G van Roon31, Mariette Xavier32, Salvatore De Vita5, Athanasios G Tzioufas2, Dimitrios I Fotiadis1,33.
Abstract
For many decades, the clinical unmet needs of primary Sjögren's Syndrome (pSS) have been left unresolved due to the rareness of the disease and the complexity of the underlying pathogenic mechanisms, including the pSS-associated lymphomagenesis process. Here, we present the HarmonicSS cloud-computing exemplar which offers beyond the state-of-the-art data analytics services to address the pSS clinical unmet needs, including the development of lymphoma classification models and the identification of biomarkers for lymphomagenesis. The users of the platform have been able to successfully interlink, curate, and harmonize 21 regional, national, and international European cohorts of 7,551 pSS patients with respect to the ethical and legal issues for data sharing. Federated AI algorithms were trained across the harmonized databases, with reduced execution time complexity, yielding robust lymphoma classification models with 85% accuracy, 81.25% sensitivity, 85.4% specificity along with 5 biomarkers for lymphoma development. To our knowledge, this is the first GDPR compliant platform that provides federated AI services to address the pSS clinical unmet needs.Entities:
Keywords: Biomarkers; Data curation; Data harmonization; Data sharing; Federated AI; Lymphoma classification; Primary Sjögren’s syndrome
Year: 2022 PMID: 35070169 PMCID: PMC8760551 DOI: 10.1016/j.csbj.2022.01.002
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1An illustration of the core modules of the HarmonicSS cloud computing platform.
Fig. 2An illustration of the cohort data harmonization workflow.
Fig. 3An illustration of the federated AI model training and testing workflow, where the testing cohort is depicted in green color. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4An illustration of the UI interactions within the HarmonicSS platform. Green arrows denote secure communication protocols. POST/GET commands refer to REST service requests. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Demographic information.
| Demographics | Females | Males |
|---|---|---|
| Gender | 6,512 | 488 |
| Age at SS diagnosis (mean | 51.82 ( | 54.24 ( |
| Disease duration (mean) | 7.08 years | 5.59 years |
| Female to male ratio | 13.34% | |
Distribution of lymphoma and non-lymphoma patients per cohort.
| Cohort acronym | Cohort full name | Number of lymphoma patients | Number of non-lymphoma (or missing) patients |
|---|---|---|---|
| IDIBAPS | Consorci Institut D’Investigacions Biomediques August Pi I Sunyer | 0 | 300 |
| UNIPG | Università degli Studi di Perugia | 10 | 166 |
| UPSUd PARIS | Université Paris-Sud (database 1) | 24 | 483 |
| UoB | University of Birmingham | 3 | 156 |
| UNIVAQ | Università degli Studi dell'Aquila | 3 | 97 |
| ULB | Université libre de Bruxelles | 1 | 726 |
| HUA | Harokopion University of Athens | 8 | 151 |
| UMCG | University Medical Center Groningen | 20 | 166 |
| UiB | University of Bergen | 3 | 138 |
| UOI | University of Ioannina | 7 | 279 |
| UU | Utrecht University | 14 | 108 |
| UNIRO | Universita' Degli Studi Di Roma La Sapienza | 14 | 532 |
| QMUL | Queen Mary University of London | 1 | 47 |
| UMCU | Universitair Medisch Centrum Utrecht | 27 | 313 |
| MHH | Medizinische Hochschule Hannover | 5 | 178 |
| UNIPI | Universita di Pisa | 31 | 687 |
| CUMB | Charité – Universitätsmedizin Berlin | 0 | 71 |
| UBO | Université de Bretagne Occidentale | 4 | 77 |
| UOA | National and Kapodistrian University of Athens | 101 | 488 |
| AOUD | Azienda Sanitaria Universitaria Integrata di Udine | 16 | 281 |
| UNEW | University of Newcastle | 62 | 1358 |
A summary of the performance evaluation results across the four federated scenarios.
| Federated learning schema | Performance evaluation metrics | |||
|---|---|---|---|---|
| Accuracy | Sensitivity | Specificity | AUC | |
| 0.84 | 0.81 | 0.85 | 0.89 | |
| 0.86 | 0.75 | 0.87 | 0.87 | |
| 0.84 | 0.62 | 0.85 | 0.86 | |
| 0.83 | 0.81 | 0.84 | 0.89 | |
| 0.85 | 0.81 | 0.85 | 0.89 | |
| 0.83 | 0.87 | 0.83 | 0.88 | |
| 0.51 | 0.94 | 0.49 | 0.71 | |
| 0.64 | 0.75 | 0.63 | 0.69 | |
| 0.71 | 0.70 | 0.71 | 0.73 | |
| 0.69 | 0.70 | 0.69 | 0.76 | |
| 0.74 | 0.80 | 0.73 | 0.79 | |
| 0.71 | 0.70 | 0.71 | 0.71 | |
| 0.71 | 0.70 | 0.71 | 0.75 | |
| 0.71 | 0.70 | 0.71 | 0.76 | |
| 0.63 | 0.70 | 0.63 | 0.66 | |
| 0.68 | 0.70 | 0.68 | 0.69 | |
| 0.75 | 0.99 | 0.74 | 0.89 | |
| 0.78 | 0.99 | 0.76 | 0.90 | |
| 0.78 | 0.99 | 0.76 | 0.91 | |
| 0.76 | 0.99 | 0.74 | 0.90 | |
| 0.71 | 0.87 | 0.69 | 0.86 | |
| 0.74 | 0.75 | 0.74 | 0.86 | |
| 0.71 | 0.87 | 0.70 | 0.79 | |
| 0.85 | 0.62 | 0.87 | 0.74 | |
| 0.81 | 0.75 | 0.81 | 0.91 | |
| 0.78 | 0.87 | 0.78 | 0.92 | |
| 0.80 | 0.75 | 0.80 | 0.91 | |
| 0.80 | 0.87 | 0.79 | 0.91 | |
| 0.80 | 0.87 | 0.80 | 0.90 | |
| 0.78 | 0.75 | 0.78 | 0.91 | |
| 0.62 | 0.87 | 0.61 | 0.74 | |
| 0.85 | 0.62 | 0.86 | 0.74 | |
* With light blue color: The federated schema with the best performance, rd: dropout rate.
Fig. 5Receiver Operating Characteristic (ROC) curves for each federated algorithm across the two federated scenarios. From top to bottom: on the left for federated scenario 1 with testing cohorts AOUD, UNIPG, HUA and for federated scenario 2 testing cohort HUA.
Fig. 6An illustration of the SHAP plot in federated scenario 1 for the FGBT.
Fig. 7An illustration of the SHAP plot in federated scenario 1 for the FDART schemas.
| 1 | |
| 2 | train the initial model |
| 3 | store the weights of |
| 4 | for |
| 5 | retrieve weights and send them to location |
| 6 | update the weights of |
| 7 | store the weights of the model |
| 8 | retrieve the final federated model |
| 9 | evaluate the performance of |
| 10 | return |