| Literature DB >> 35632136 |
Trang-Thi Ho1, Khoa-Dang Tran1, Yennun Huang1.
Abstract
Coronavirus (COVID-19) has created an unprecedented global crisis because of its detrimental effect on the global economy and health. COVID-19 cases have been rapidly increasing, with no sign of stopping. As a result, test kits and accurate detection models are in short supply. Early identification of COVID-19 patients will help decrease the infection rate. Thus, developing an automatic algorithm that enables the early detection of COVID-19 is essential. Moreover, patient data are sensitive, and they must be protected to prevent malicious attackers from revealing information through model updates and reconstruction. In this study, we presented a higher privacy-preserving federated learning system for COVID-19 detection without sharing data among data owners. First, we constructed a federated learning system using chest X-ray images and symptom information. The purpose is to develop a decentralized model across multiple hospitals without sharing data. We found that adding the spatial pyramid pooling to a 2D convolutional neural network improves the accuracy of chest X-ray images. Second, we explored that the accuracy of federated learning for COVID-19 identification reduces significantly for non-independent and identically distributed (Non-IID) data. We then proposed a strategy to improve the model's accuracy on Non-IID data by increasing the total number of clients, parallelism (client-fraction), and computation per client. Finally, for our federated learning model, we applied a differential privacy stochastic gradient descent (DP-SGD) to improve the privacy of patient data. We also proposed a strategy to maintain the robustness of federated learning to ensure the security and accuracy of the model.Entities:
Keywords: COVID-19 detection; COVID-19 symptoms; chest X-ray images; convolutional neural network; differential privacy stochastic gradient descent; federated learning; spatial pyramid pooling layer
Mesh:
Year: 2022 PMID: 35632136 PMCID: PMC9147951 DOI: 10.3390/s22103728
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
The summary table of existing COVID-19 detection approaches.
| Author | Title | Data | Approach | Limitations |
|---|---|---|---|---|
| Horry et al. [ | COVID-19 detection | X-ray, | VGG19 | Sharing sensitive |
| Afshar et al. [ | Covid-caps: A | Chest X-ray | COVID-CAPS | Sharing sensitive |
| Mukherjee et al. [ | Deep neural network to detect | Chest X-ray, | CNN tailored Deep | Sharing sensitive |
| Otoom et al. [ | An IoT-based | COVID-19 | Eight algorithms | Sharing sensitive |
| Akib et al. [ | Machine learning based | COVID-19 | Logistic regression, | Sharing sensitive |
| Khaloufi et al. [ | Deep Learning Based | COVID-19 | ANN, AI-enabled | Sharing sensitive |
| Menni et al. [ | Real-time tracking | COVID-19 | Smartphone-based app, | Sharing sensitive |
| Canas et al. [ | Early detection of | Chest X-ray | MobileNetv2, | Sensitive data can still |
| Zhang et al. [ | Dynamic fusion-based | Chest X-ray | GhostNet, | Sensitive data can still |
| Abdul et al. [ | COVID-19 detection | Chest X-ray, | Federated with | Sensitive data can still |
Figure 1Federated COVID-19 detection system architecture.
The complexity of the proposed models using chest X-ray images.
| Model | No. Layers | No. Parameters |
|---|---|---|
| 5 × 5 CNN | 3 | 22 |
| ResNet18 | 18 | 11.2 |
| ResNet50 | 50 | 23.5 |
| 3 × 3 CNN | 3 | 1.6 |
| 3 × 3 CNN-SPP | 4 | 0.2 |
Figure 2The 2D CNN with 3 × 3 convolutional layers architecture.
Figure 3The 2D CNN with 3 × 3 convolutional layers and SPP architecture.
The complexity of the proposed models using symptom data.
| Model | No. Layers | No. Parameters |
|---|---|---|
| 1DCNN | 5 | 37.4 |
| ANN | 4 | 26.3 |
| LSTM | 5 | 90.2 |
Figure 4The structure of ANN.
Figure 5The structure of one cell conventional LSTM.
Figure 6Few samples of chest X-ray images.
The statistics of the chest X-ray dataset.
| Covid | Normal | Viral Pneumonia | Total Images | |
|---|---|---|---|---|
| Training | 3416 images | 9992 images | 1145 images | 14,553 images |
| Testing | 200 images | 200 images | 200 images | 600 images |
The statistics of the symptom dataset.
| Covid | Non-Covid | Total | |
|---|---|---|---|
| Training | 3949 images | 941 images | 4890 images |
| Testing | 434 images | 110 images | 544 images |
Figure 7Improvement in COVID-19 detection based on chest X-ray images and 3 × 3 CNN-SPP.
Figure 8Improvement in COVID-19 detection based on symptom data and ANN.
Figure 9An illustrative example of imbalance size for chest X-ray data with three clients and k = 1.
Figure 10Performance comparison between IID and Non-IID based on chest X-ray images.
The summary table of the performance comparison between IID and Non-IID based on chest X-ray images.
| Round | IID | Non-IID(1) | Non-IID(2) |
|---|---|---|---|
| 400 |
| 40.56% | 70.38% |
| 600 |
| 39.40% | 73.45% |
| 800 |
| 45.46% | 78.62% |
| 1000 |
| 44.68% | 80.93% |
Figure 11Performance comparison between IID and Non-IID based on symptom dataset.
The summary table of the performance comparison between IID and Non-IID based on symptom dataset.
| Round | IID | Non-IID(1) |
|---|---|---|
| 400 |
| 94.24% |
| 600 |
| 94.39% |
| 800 |
| 95.03% |
| 1000 |
| 95.37% |
Figure 12Non-IID with different numbers of clients based on chest X-ray images.
The summary table of the accuracy of Non-IID with different numbers of clients based on chest X-ray images.
| Round | 3 Clients | 30 Clients | 300 Clients |
|---|---|---|---|
| 400 | 40.56% | 64.57% |
|
| 600 | 39.40% | 65.93% |
|
| 800 | 45.46% | 66.52% |
|
| 1000 | 44.68% | 65.93% |
|
Figure 13Non-IID with different client-fraction based on chest X-ray images.
Figure 14Non-IID with different batch-size based on chest X-ray images.
Non-IID with refined Non-IID based on chest X-ray images.
| Round | Baseline | Non-IID (300 Clients) | Refined Non-IID |
|---|---|---|---|
| 400 | 40.56% | 73.36% |
|
| 600 | 39.40% | 73.37% |
|
| 800 | 45.46% | 74.89% |
|
| 1000 | 44.68% | 75.42% |
|
Figure 15Non-IID with different client-fraction based on symptom data.
Figure 16Non-IID with different batch-size and client based on symptom data.
Non-IID with refined Non-IID based on symptom data.
| Round | IID | Non-IID(1) | Refined Non-IID(1) |
|---|---|---|---|
| 400 | 95.88% | 94.24% | 95.06% |
| 600 | 96.68% | 94.39% | 95.17% |
| 800 | 96.67% | 95.03% | 95.52% |
| 1000 | 96.65% | 95.37% | 95.68% |
Figure 17Differential privacy with different noise values on chest X-ray model.
Figure 18Differential privacy with different noise values on symptom model.
Figure 19The robustness of differential privacy on chest X-ray images.
The summary table of the robustness of differential privacy on chest X-ray images.
| q | Noise | Accuracy |
|
|---|---|---|---|
| 1/3 | 0.1 |
| 9.7 × |
| 10/30 | 0.3 | 75.36% | 8.9 × |
| 30/90 | 0.7 | 68.35% | 5.6 × |
| 60/180 | 1.3 | 70.21% | 97.39 |
| 90/270 | 1.9 | 69.27% | 46.36 |
| 100/300 | 2.1 | 68.70% |
|
The robustness of differential privacy on symptom data.
| q | Noise | Accuracy |
|
|---|---|---|---|
| 1/4 | 1.0 | 93.56% | 1.6 × |
| 10/40 | 2.0 |
| 2.9 × |
| 15/60 | 3.0 | 93.39% | 1.2 × |
| 20/80 | 4.0 | 91.33% | 73.56 |
| 25/100 | 5.0 | 88.73% |
|