| Literature DB >> 34149305 |
Longling Zhang1, Bochen Shen1, Ahmed Barnawi2, Shan Xi1, Neeraj Kumar3,4,5, Yi Wu1.
Abstract
Existing deep learning technologies generally learn the features of chest X-ray data generated by Generative Adversarial Networks (GAN) to diagnose COVID-19 pneumonia. However, the above methods have a critical challenge: data privacy. GAN will leak the semantic information of the training data which can be used to reconstruct the training samples by attackers, thereby this method will leak the privacy of the patient. Furthermore, for this reason, that is the limitation of the training data sample, different hospitals jointly train the model through data sharing, which will also cause privacy leakage. To solve this problem, we adopt the Federated Learning (FL) framework, a new technique being used to protect data privacy. Under the FL framework and Differentially Private thinking, we propose a Federated Differentially Private Generative Adversarial Network (FedDPGAN) to detect COVID-19 pneumonia for sustainable smart cities. Specifically, we use DP-GAN to privately generate diverse patient data in which differential privacy technology is introduced to make sure the privacy protection of the semantic information of the training dataset. Furthermore, we leverage FL to allow hospitals to collaboratively train COVID-19 models without sharing the original data. Under Independent and Identically Distributed (IID) and non-IID settings, the evaluation of the proposed model is on three types of chest X-ray (CXR)images dataset (COVID-19, normal, and normal pneumonia). A large number of truthful reports make the verification of our model can effectively diagnose COVID-19 without compromising privacy.Entities:
Keywords: COVID-19; Differential privacy; Federated learning; Generative adversarial networks; Privacy protection
Year: 2021 PMID: 34149305 PMCID: PMC8204125 DOI: 10.1007/s10796-021-10144-6
Source DB: PubMed Journal: Inf Syst Front ISSN: 1387-3326 Impact factor: 6.191
Fig. 1Overview of federated differentially private generative adversarial network (FedDPGAN) framework
Fig. 2Overview of the generated dataset
Fig. 3Overview of non-IID data allocation method
Comparison of COVID-19 diagnostic performance between the proposed model and the benchmark models under IID setting
| Model | Accuracy | Data | Privacy |
|---|---|---|---|
| Augmentation | Protection | ||
| FedDPGAN-based ResNet | 94.45% | ||
| FedResNet | 93.96% | × |
|
| DPGAN-based ResNet | 93.77% |
|
|
| ResNet He et al. ( | 92.93% | × | × |
| CNN Tajbakhsh et al. ( | 90.72% | × | × |
| MLP Li et al. ( | 92.05% | × | × |
| KNN Park and Lee ( | 92.78% | × | × |
| SVM Morra et al. ( | 89.41% | × | × |
Fig. 4Comparison of COVID-19 diagnostic performance between the proposed model and the benchmark models
Fig. 5(a) Performance comparison of FedResNet model under IID and non-IID settings; (b) Performance comparison between FedResNet model and FedDPGAN-based ResNet model under non-IID settings
Fig. 6(a) Performance comparison of FedResNet model under IID setting and FedDPGAN-based ResNet model under non-IID settings; (b) Performance comparison between FedDPGAN-based ResNet model under IID setting and under non-IID settings
Fig. 7Performance comparison between ResNet model and FedDPGAN-based ResNet model under non-IID settings
Performance under the IID setting and non-IID setting of the proposed model under different privacy budgets σ
| Model | Data | Accuracy | Data | Privacy | |
|---|---|---|---|---|---|
| distribution | Augmentation | Protection | |||
| FedDPGAN- | non-IID | 10− 4 | 94.11% |
|
|
| based | non-IID | 10− 2 | 94.06% |
|
|
| ResNet | non-IID | 1 | 91.90% |
|
|
| FedDPGAN- | IID | 10− 4 | 94.45% |
|
|
| based | IID | 10− 2 | 93.81% |
|
|
| ResNet | IID | 1 | 94.01% |
|
|