| Literature DB >> 35614106 |
T V Nguyen1,2, M A Dakka3,4, S M Diakiw3, M D VerMilyea5,6, M Perugini3,7, J M M Hall3,8,9, D Perugini3.
Abstract
Training on multiple diverse data sources is critical to ensure unbiased and generalizable AI. In healthcare, data privacy laws prohibit data from being moved outside the country of origin, preventing global medical datasets being centralized for AI training. Data-centric, cross-silo federated learning represents a pathway forward for training on distributed medical datasets. Existing approaches typically require updates to a training model to be transferred to a central server, potentially breaching data privacy laws unless the updates are sufficiently disguised or abstracted to prevent reconstruction of the dataset. Here we present a completely decentralized federated learning approach, using knowledge distillation, ensuring data privacy and protection. Each node operates independently without needing to access external data. AI accuracy using this approach is found to be comparable to centralized training, and when nodes comprise poor-quality data, which is common in healthcare, AI accuracy can exceed the performance of traditional centralized training.Entities:
Mesh:
Year: 2022 PMID: 35614106 PMCID: PMC9133021 DOI: 10.1038/s41598-022-12833-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Illustrations of 5-node (a), 15-node (b) with single cluster scenarios, and 5-node each in 3-cluster scenario (c).
Model result comparison using 5 evaluation metrics: mean accuracy, class 0 (cat) accuracy, class 1 (dog) accuracy, balanced accuracy, and log loss.
| Models | Total accuracy | Class 0 (Cat) | Class 1 (Dog) | Balanced accuracy | Log loss |
|---|---|---|---|---|---|
| Baseline | 98.44 | 98.62 | 98.26 | 98.44 | 0.061 |
| 98.42 | 98.49 | 98.35 | 98.42 | 0.045 | |
| 98.62 | 98.13 | 99.11 | 98.62 | 0.038 | |
| Baseline | 75.31 | 57.34 | 93.32 | 75.33 | 1.235 |
| 78.00 | 57.68 | 98.35 | 78.01 | 0.421 | |
| 73.78 | 48.49 | 99.11 | 73.80 | 0.598 | |
Figure 2Comparison of decentralized model results for different transfer set scenarios. Baseline indicates an experiment where all data are centralized, and training occurs on this central node. Dc-1 to Dc-4 refer to experiments where individual nodes (1–4) are chosen as the only transfer set. Dc-m1 indicates a scenario where decentralized training occurs, but the transfer set is the theoretical centralized set of all data. Dc-m2 indicates a scenario where decentralized training is followed by a final process whereby all final models are distilled together at each node in term, with one full traversal of all nodes.
Figure 3Comparison of 15-node decentralized experiments where the number of node-level epochs for each node are altered and compared. For all node-level training of k epochs before transferring to neighboring nodes, the experiment is denoted Dc-ke. A clustering scenario where 15-nodes are split into 3 clusters of 5 nodes each, are compared to the results from a full ring of 15 nodes.
Trade-off between data transfer and model accuracy.
| Exp-id | Description | Total Acc | Class 0 | Class 1 | Balanced Acc |
|---|---|---|---|---|---|
| Baseline | 75.31 | 57.34 | 93.32 | 75.33 | |
| Dc-1e-5t | 1 epoch at each node | 76.98 | 75.01 | 78.95 | 76.98 |
| Dc-2e-5t | 2 epochs at each node | 79.91 | 71.01 | 88.83 | 79.92 |
| Baseline | 83.27 | 71.06 | 95.5 | 83.28 | |
| Dc-1e-5t | 1 epoch at each node | 85.04 | 73.5 | 96.61 | 85.06 |
| Dc-2e-5t | 2 epochs at each node | 87.75 | 90.14 | 85.36 | 87.75 |
Embryo dataset broken down to clinic owners.
| Clinic data allocation | Non-viable | Viable | Total |
|---|---|---|---|
| 1251 | 942 | 2193 | |
| Node1—Fertility Associates NZ (FANZ) | 269 | 318 | 587 |
| Node2—Institute for Reproductive Health (IRH) | 197 | 217 | 414 |
| Node3—Repromed Adelaide (REP) | 615 | 174 | 789 |
| Node4—Ovation Austin (OVA) | 79 | 157 | 236 |
| Node5—Midwest Fertility Specialists And San Antonio IVF (MISA) | 91 | 76 | 167 |
| Clean test set | 272 | 641 | 913 |
| Noisy blind test set | 517 | 681 | 1198 |
Figure 6The clinics’ data size shown in percentages. Training dataset (left) with clinics’ data allocated to 5 nodes, cleansed test set (middle) and noisy blind test set (right).
Figure 4The workflow of predicting/identifying the viability of an embryo image.
Comparison of model results on a medical (Embryo Viability) dataset.
| Models | Non-viable | Viable | Total accuracy |
|---|---|---|---|
| Baseline | 59.92 | 82.96 | 76.09 |
| 57.72 | 84.37 | 76.41 | |
| Baseline | 35.97 | 78.67 | 60.24 |
| 34.04 | 80.58 | 60.50 | |
Figure 5The decentralized model’s accuracy performance for individual clinic data in the cleansed test set (left graph) and in the noisy blind test set (right graph).