| Literature DB >> 36119394 |
Santosh Kumar1, Rishab Nagar1, Saumya Bhatnagar1, Ramesh Vaddi2, Sachin Kumar Gupta3, Mamoon Rashid4,5, Ali Kashif Bashir6, Tamim Alkhalifah7.
Abstract
All witnessed the terrible effects of the COVID-19 pandemic on the health and work lives of the population across the world. It is hard to diagnose all infected people in real time since the conventional medical diagnosis of COVID-19 patients takes a couple of days for accurate diagnosis results. In this paper, a novel learning framework is proposed for the early diagnosis of COVID-19 patients using hybrid deep fusion learning models. The proposed framework performs early classification of patients based on collected samples of chest X-ray images and Coswara cough (sound) samples of possibly infected people. The captured cough samples are pre-processed using speech signal processing techniques and Mel frequency cepstral coefficient features are extracted using deep convolutional neural networks. Finally, the proposed system fuses extracted features to provide 98.70% and 82.7% based on Chest-X ray images and cough (audio) samples for early diagnosis using the weighted sum-rule fusion method.Entities:
Keywords: COVID-19; Chest X-ray; Cough-breathing sounds; Deep learning; Multimodal; Segmentation
Year: 2022 PMID: 36119394 PMCID: PMC9472671 DOI: 10.1016/j.compeleceng.2022.108391
Source DB: PubMed Journal: Comput Electr Eng ISSN: 0045-7906 Impact factor: 4.152
Existing works for COVID-19 detection using machine learning techniques based on chest-X ray images and cough(audio) samples.
| Study | Year | R | RSS | NS | NR | PS | TM | Per (%) |
|---|---|---|---|---|---|---|---|---|
| 2020 | CXR | – | 247 | DL | FC | – | 92.5 | |
| 2019 | CXR | 125 | 200 | DNET | ML | SP | 98% | |
| 2020 | X-ray | 230 | DL | CON | SP | SP | 90% | |
| 2022 | B | BB | 220 | MFCCS | DL | Low-D | 70% | |
| 2022 | Sound | 200 | Chest | HSR | SFs | CNN | 70% | |
| 2020 | cough | 4352 | SP | MFCC | DL | CNN | 80% | |
| 2020 | Web | Cough | COD | 5320 | DL | R2 | 97.10% | |
| 2020 | X-ray | 30 | R1 | CNN | DL | SP | 95.4% | |
| 2021 | XCXR | – | 200 | CNN | CNET | SP | 92.4 | |
| 2020 | SP | Cough | CO | 3621 | ST | R-18 | AUC(0.72) | |
| 2021 | ES | B | COV10 | 10 | 2DFT | Inv3 | 80% |
Abbreviation: FC (FC)-DenseNet103, CON COVIDX-N, R1 ResNet50, S SFT SVMCNET COVID-Net, DNET Dark-CoviDNet MultiM Multimodal, In-v3 Inception v3, R (ResNet-18), R1 ResNet-50 COT 7 COVID-19 9 Healthy, SFT Singlet, Fourier transformation, SVM Support vector machine, SP Speech processing, T dataset consists of 4352 unique people collected from the web app 2261 unique people from the Android app 4352 and 5634 samples. Cough Crowd sourced Respiratory Sound Data, SF Shape chest Features, 2DFT Two-dimensional (2D) Fourier transformation, COVID10 [5 COVID19 5 healthy 10], ES Electronic stethoscope, BB 120 COVID 100 healthy) B Breathing, Sp Smartphone app, TM Trained Model, Per. Performance(%) R Recordings Source, RSS Respiratory Sound, NS Number of Subjects, NR Number of Recordings, PS Pre-processing Steps, ST Short-term magnitude spectrogram, CO COVID-19 1620 healthy, Mel Mel-frequency cepstral coefficients (MFCC), COD 2660 COVID-19 2660 healthy, CODD 114 COVID-19, 1388 healthy, MFCC Spectrogram Mel spectrum, power spectrum Tona, Mel-frequency cepstral coefficients (MFCC), ENCNN Ensemble CNN, Total (COVID-19 34 516 samples 2100 1500 chest X-ray positive 600 Chest X-ray: Negative).
Fig. 1Illustrates working of the proposed multimodal framework for the diagnosis of COVID-19.
Fig. 2Illustrates working of pipeline chest X-ray imaging-based model.
Fig. 3Shows segmentation model based on U-net learning architecture.
Database of chest X-ray images.
| Dataset | Size | Used model |
|---|---|---|
| A | 468 | Used to make COVID-19 class data |
| B | 5860 | Used to make non-COVID-19 class data |
| C | 566 | Segmentation model |
| D | 138 | Used for validation of segmentation model |
| 852 | 426N(190W |
A: IEEE-8023 CXR - Cohen dataset [21], B: Pneumonia and normal chest X-ray, C: Shenzhen CXR with Masks, D: Montgomery county CXR images, E: COVIDGR 1.0, W Women, M Men, N Negative cases, P positive D 426P(239W 187M) used training model.
Fig. 4Depicts the architecture of the proposed classification model.
Fig. 5Working prototype for the cough-based model.
Fig. 6Shows the spectrogram of cough (voice) based on pronounced vowels (a, e, i, o, u) by patients.
Fig. 7Working prototype for the cough-based model.
Fig. 8Extraction of MFCCs from the cough (voice) samples.
Fig. 9Illustration of cough for volume and its power spectrum.
Fig. 12Segmentation of chest X-ray images using the proposed model.
Performance of proposed chest X-ray model based on fold cross-validation.
| Folds | Sensitivity | Specificity | Precision | Accuracy | F1 |
|---|---|---|---|---|---|
| 1 | 0.9459 | 1.0000 | 1.0000 | 0.9890 | 0.9722 |
| 2 | 1.00009 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| 3 | 0.9453 | 0.91235 | 0.9367 | 0.9578 | 0.9646 |
| 4 | 0.9354 | 0.94285 | 0.9669 | 0.9677 | 0.9879 |
| 5 | 0.9891 | 0.97685 | 0.9556 | 0.9576 | 0.9789 |
Fig. 10Shows base image, mask, and predicted mask.
Fig. 11Loss and accuracy curves of the segmentation model.
Performance of proposed cough model based on fold cross-validation.
| Fold | Sensitivity | Specificity | Precision | Accuracy | F1 |
|---|---|---|---|---|---|
| 1 | 0.9279 | .9641 | 0.9538 | 0.9867 | 0.8792 |
| 2 | 0.9722 | 0.9861 | 0.9459 | 0.9833 | 0.9589 |
| 3 | 0.8657 | 0.91736 | 0.9367 | 0.9479 | 0.9746 |
| 4 | 0.9484 | 0.97288 | 0.9289 | 0.9576 | 0.9689 |
| 5 | 0.89594 | 0.94686 | 0.9266 | 0.9386 | 0.9979 |
Fig. 13Confusion Matrix features (a) chest X-ray images and (b) cough based diagnosis.
Fig. 14Confusion Matrix for (a) chest X-ray images and (b) cough based diagnosis.
Fig. 15(a) shows the accuracy of fold 3 and (b) fold 4 for CXR classification.
Performance of proposed cough model based on fold cross-validation.
| Fold | Sensitivity | Specificity | Precision | Accuracy | F1 |
|---|---|---|---|---|---|
| 1 | 0.9279 | .9641 | 0.9538 | 0.9867 | 0.8792 |
| 2 | 0.8131 | 0.8214 | 0.8131 | 0.8230 | 0.8131 |
Fig. 16Shows model accuracy vs epoch for COVID-19 classification based on CXR images.
Fig. 17Depicts accuracy of the proposed framework with segmentation method on Chest-X-ray images.
Fig. 18Illustrates the losses of the CXR classification model with segmentation method.
Accuracy (%) of the proposed Framework on selected weights.
| Model Type | Weight | Accuracy |
|---|---|---|
| X-ray based detection | 0.80 | 0.983 |
| Cough-based detection | 0.52 | 0.827 |
Comparisons with existing techniques on cough for COVID-19 detection.
| References | Dataset | Technique used | Performance (%) | Remark |
|---|---|---|---|---|
| chest-CT | Res-A | 86.7% | Takes more time, less accuracy | |
| Chest CT | MI | 82.90% | Less dataset | |
| Chest-X ray | DTL | 92.1% | Overfitting problem | |
| CC | ED | 77.1% | Less accuracy | |
Abbreviation: Ref. Reference, Prop. Proposed, DLT DLT-based classifier, ED Ensembles DNN, TL Transfer Learning with VGGish, chest-CT Chest CT Scan, REs-A ResNet and Location Attention, MI M-Inception techniques, CNN Convolutional neural Network , Coswara Coswara cough audio database, CC Coswara/ Coughvid.
Existing work for early detection of COVID-19 detection using ML techniques based on chest X and coughing samples.
| Study | Year | R | RSS | NS | NR | PS | TM | Per (%) |
|---|---|---|---|---|---|---|---|---|
| 2021 | XCXR | – | 200 | CNN | CNET | SP | 92.4 | |
| 2020 | X-ray | 30 | R1 | CNN | DL | SP | 95.4% | |
| 2019 | CXR | 125 | 200 | DNET | ML | SP | 98% | |
| 2020 | X-ray | 230 | DL | CON | SP | SP | 90% | |
| 2022 | Sound | 200 | Chest | HSR | SFs | CNN | 70% | |
| 2020 | cough | 4352 | SP | MFCC | DL | CNN | 80% | |
| 2020 | Web | Cough | COD | 5320 | DL | R2 | 97.10% | |
| 2020 | SP | Cough | CO | 3621 | ST | R-18 | AUC(0.72) | |
| 2021 | ES | B | COV10 | 10 | 2DFT | Inv3 | 80% | |
| 2022 | B | BB | 220 | MFCCS | DL | Low-D | 70% | |