| Literature DB >> 35818367 |
Adel Akbarimajd1, Nicolas Hoertel2,3,4, Mohammad Arafat Hussain5, Ali Asghar Neshat6, Mahmoud Marhamati6, Mahdi Bakhtoor7, Mohammad Momeny1.
Abstract
Deep convolutional neural networks (CNNs) are used for the detection of COVID-19 in X-ray images. The detection performance of deep CNNs may be reduced by noisy X-ray images. To improve the robustness of a deep CNN against impulse noise, we propose a novel CNN approach using adaptive convolution, with the aim to ameliorate COVID-19 detection in noisy X-ray images without requiring any preprocessing for noise removal. This approach includes an impulse noise-map layer, an adaptive resizing layer, and an adaptive convolution layer to the conventional CNN framework. We also used a learning-to-augment strategy using noisy X-ray images to improve the generalization of a deep CNN. We have collected a dataset of 2093 chest X-ray images including COVID-19 (452 images), non-COVID pneumonia (621 images), and healthy ones (1020 images). The architecture of pre-trained networks such as SqueezeNet, GoogleNet, MobileNetv2, ResNet18, ResNet50, ShuffleNet, and EfficientNetb0 has been modified to increase their robustness to impulse noise. Validation on the noisy X-ray images using the proposed noise-robust layers and learning-to-augment strategy-incorporated ResNet50 showed 2% better classification accuracy compared with state-of-the-art method.Entities:
Keywords: Adaptive convolution; Adaptive resize; COVID-19 classification; Data augmentation; Noise
Year: 2022 PMID: 35818367 PMCID: PMC9259198 DOI: 10.1016/j.jocs.2022.101763
Source DB: PubMed Journal: J Comput Sci
Fig. 1Sample chest X-ray images of (a) COVID-19, (b) healthy, and (c) non-COVID pneumonia cases from our dataset. The first row contains the noise-free images. The second and third rows show noise-corrupted images with the noise density of 5% and 10%, respectively.
Summary of Patients’ Diagnoses.
| Diagnosis | Number of subjects/patients | Data collection timeline (years) |
|---|---|---|
| COVID-19 | 452 | 2020–2021 |
| Non-COVID pneumonia | 1020 | 2018–2021 |
| Healthy | 621 | 2018–2021 |
Fig. 2The schematic diagram of the proposed method.
Fig. 3The 4-step pipeline of noise detection using the analysis of local statistical properties of an image [28].
Fig. 4The pixels processing window is divided into pixels overlapped sub-windows.
Fig. 5Fuzzy membership functions ‘’ or ‘’.
Fig. 6Noise detection rules.
Fig. 7Two channels for each image.
Fig. 8Comparison of image resizing techniques using the central pixel selection and adaptive pixel selection (i.e., adaptive resizing).
Fig. 9The architecture of the adaptive resizing layer [27].
Fig. 10The pipeline of adaptive resizing operation at the front end of the CNN.
Fig. 11Schematic diagram illustrating the shutting off a noisy pixel during convolution operation in a CNN convolution layer.
Fig. 12The architecture of the proposed noise-robust adaptive convolution layer.
Fig. 13Flowchart of learning-to-augment using noisy data.
Data partitioning for training, validation, and testing in this study.
| Phase | Data splitting | # of original images | # of augmented images | Total # of images |
|---|---|---|---|---|
| Train | 70% | 1466 | 1466 | |
| Validation | 10% | 209 | – | |
| Test | 20% | 418 | – |
Types of CNNs (conventional/noise-robust) used in training, validation, and testing.
| Phase | Type of CNN | # of noise-free images | # of noisy images | Total # of images |
|---|---|---|---|---|
| Train | Conventional CNN | 1466 | 1466 | |
| Validation | Conventional CNN | 209 | – | |
| Test | The noise-robust CNN | – | 418 |
Properties of the pretrained CNN models we used in this study.
| Network | Depth | Size | Parameters (Millions) | Input Image Size |
|---|---|---|---|---|
| SqueezeNet | 18 | 5.2 MB | 1.24 | 227 |
| GoogleNet | 22 | 27 MB | 7.00 | 224 |
| MobileNetv2 | 53 | 13 MB | 3.50 | 224 |
| ResNet18 | 18 | 44 MB | 11.70 | 224 |
| ResNet50 | 50 | 96 MB | 25.60 | 224 |
| ShuffleNet | 50 | 5.4 MB | 1.40 | 224 |
| EfficientNetb0 | 82 | 20 MB | 5.30 | 224 |
Fig. 14The architecture of the proposed noise-robust SqueezeNet model.
The configuration of the proposed noise-robust SqueezeNet model.
| # | Type | Descriptions | # | Type | Descriptions |
|---|---|---|---|---|---|
| 0 | Adaptive Resizing | 512 × 512 × 3 images | 35 | Convolution | 48 1 × 1 convolutions |
| stride [1 1] and padding [0 0 0 0] | |||||
| 1 | Input of Image | 227 × 227 × 3 images | 36 | ReLU | |
| and Noise-Map | with 'zerocenter' normalization | 37 | Convolution | 192 3 × 3 convolutions | |
| 2 | Adaptive Convolution | 64 3 × 3 convolutions | stride [1 1] and padding [1 1 1 1] | ||
| stride [2 2] and padding [0 0 0 0] | 38 | ReLU | |||
| 3 | ReLU | 39 | Convolution | 192 1 × 1 convolutions | |
| 4 | Max Pooling | 3 × 3 max pooling | stride [1 1] and padding [0 0 0 0] | ||
| stride [2 2] and padding [0 0 0 0] | 40 | ReLU | |||
| 5 | Convolution | 16 1 × 1 convolutions | 41 | Concatenation | Depth concatenation of 2 inputs |
| stride [1 1] and padding [0 0 0 0] | 42 | Convolution | 48 1 × 1 convolutions | ||
| 6 | ReLU | stride [1 1] and padding [0 0 0 0] | |||
| 7 | Convolution | 64 1 × 1 convolutions | 43 | ReLU | |
| stride [1 1] and padding [0 0 0 0] | 44 | Convolution | 192 1 × 1 convolutions | ||
| 8 | ReLU | stride [1 1] and padding [0 0 0 0] | |||
| 9 | Convolution | 64 3 × 3 convolutions | 45 | ReLU | |
| stride [1 1] and padding [1 1 1 1] | 46 | Convolution | 192 3 × 3 convolutions | ||
| 10 | ReLU | stride [1 1] and padding [1 1 1 1] | |||
| 11 | Concatenation | Depth concatenation of 2 inputs | 47 | ReLU | |
| 12 | Convolution | 16 1 × 1 convolutions | 48 | Concatenation | Depth concatenation of 2 inputs |
| stride [1 1] and padding [0 0 0 0] | 49 | Convolution | 64 1 × 1 convolutions | ||
| 13 | ReLU | stride [1 1] and padding [0 0 0 0] | |||
| 14 | Convolution | 64 3 × 3 convolutions | 50 | ReLU | |
| stride [1 1] and padding [1 1 1 1] | 51 | Convolution | 256 3 × 3 convolutions | ||
| 15 | ReLU | stride [1 1] and padding [1 1 1 1] | |||
| 16 | Convolution | 64 1 × 1 convolutions | 52 | ReLU | |
| stride [1 1] and padding [0 0 0 0] | 53 | Convolution | 256 1 × 1 convolutions | ||
| 17 | ReLU | stride [1 1] and padding [0 0 0 0] | |||
| 18 | Concatenation | Depth concatenation of 2 inputs | 54 | ReLU | |
| 19 | Max Pooling | 3 × 3 max pooling with | 55 | Concatenation | Depth concatenation of 2 inputs |
| [2 2] and padding [0 1 0 1] | 56 | Convolution | 64 1 × 1 convolutions | ||
| 20 | Convolution | 32 1 × 1 convolutions | stride [1 1] and padding [0 0 0 0] | ||
| stride [1 1] and padding [0 0 0 0] | 57 | ReLU | |||
| 21 | ReLU | 58 | Convolution | 256 1 × 1 convolutions | |
| 22 | Convolution | 128 3 × 3 convolutions | stride [1 1] and padding [0 0 0 0] | ||
| stride [1 1] and padding [1 1 1 1] | 59 | ReLU | |||
| 23 | ReLU | 60 | Convolution | 256 3 × 3 convolutions | |
| 24 | Convolution | 128 1 × 1 convolutions | stride [1 1] and padding [1 1 1 1] | ||
| stride [1 1] and padding [0 0 0 0] | 61 | ReLU | |||
| 25 | ReLU | 62 | Concatenation | Depth concatenation of 2 inputs | |
| 26 | Concatenation | Depth concatenation of 2 inputs | 63 | Dropout | 50% dropout |
| 27 | Convolution | 32 1 × 1 convolutions | 64 | Convolution | 1000 1 × 1 convolutions |
| stride [1 1] and padding [0 0 0 0] | stride [1 1] and padding [0 0 0 0] | ||||
| 28 | ReLU | 65 | ReLU | ||
| 29 | Convolution | 128 3 × 3 convolutions | 66 | Pooling | Global Average Pooling |
| stride [1 1] and padding [1 1 1 1] | 67 | Softmax | |||
| 30 | ReLU | 68 | Classification | Output | |
| 31 | Convolution | 128 1 × 1 convolutions | |||
| stride [1 1] and padding [0 0 0 0] | |||||
Fig. 15Aaccuracy vs. iteration and Loss vs. iteration curves for the training and validation of GoogleNet.
Fig. 16: The accuracy of COVID-19 detection by different methods for noisy X-ray images corrupted by the impulse noise with (a) , (b) , (c) , and (d) .
COVID-19 detection error (1/100) on X-ray images corrupted by the impulse noise with . Here, scenarios: (i) training conventional CNNs using data without augmentation, (ii) training conventional CNNs with data augmentated by learning-to-augment strategy, and (iii) training proposed noise-robust CNNs with data augmented by learning-to-augment strategy.
| Networks | Scenario | Impulse noise density | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1% | 2% | 3% | 4% | 5% | 6% | 7% | 8% | 9% | 10% | ||
| SqueezeNet | i | 0.79 | 0.80 | 0.80 | 0.81 | 0.82 | 0.82 | 0.82 | 0.82 | 0.83 | 0.83 |
| ii | 0.56 | 0.69 | 0.71 | 0.72 | 0.72 | 0.74 | 0.75 | 0.81 | 0.82 | 0.83 | |
| iii | 0.55 | 0.70 | 0.70 | 0.72 | 0.72 | 0.74 | 0.75 | 0.80 | 0.82 | 0.82 | |
| GoogleNet | i | 0.48 | 0.49 | 0.49 | 0.50 | 0.50 | 0.52 | 0.53 | 0.53 | 0.58 | 0.61 |
| ii | 0.33 | 0.39 | 0.44 | 0.45 | 0.48 | 0.50 | 0.52 | 0.53 | 0.53 | 0.56 | |
| iii | 0.26 | 0.28 | 0.28 | 0.29 | 0.29 | 0.30 | 0.31 | 0.31 | 0.32 | 0.32 | |
| MobileNetv2 | i | 0.59 | 0.62 | 0.67 | 0.72 | 0.72 | 0.73 | 0.77 | 0.80 | 0.80 | 0.82 |
| ii | 0.28 | 0.34 | 0.36 | 0.53 | 0.58 | 0.58 | 0.69 | 0.69 | 0.70 | 0.71 | |
| iii | 0.24 | 0.24 | 0.26 | 0.26 | 0.26 | 0.27 | 0.27 | 0.28 | 0.28 | 0.30 | |
| ResNet18 | i | 0.51 | 0.70 | 0.75 | 0.78 | 0.78 | 0.81 | 0.82 | 0.82 | 0.83 | 0.83 |
| ii | 0.26 | 0.27 | 0.30 | 0.38 | 0.39 | 0.45 | 0.45 | 0.46 | 0.53 | 0.57 | |
| iii | 0.22 | 0.25 | 0.25 | 0.26 | 0.26 | 0.27 | 0.28 | 0.30 | 0.30 | 0.30 | |
| ShuffleNet | i | 0.66 | 0.76 | 0.77 | 0.77 | 0.77 | 0.78 | 0.79 | 0.80 | 0.80 | 0.83 |
| ii | 0.35 | 0.38 | 0.41 | 0.47 | 0.47 | 0.54 | 0.54 | 0.60 | 0.60 | 0.60 | |
| iii | 0.22 | 0.25 | 0.26 | 0.26 | 0.28 | 0.28 | 0.29 | 0.29 | 0.32 | 0.33 | |
| ResNet50 | i | 0.65 | 0.66 | 0.71 | 0.73 | 0.78 | 0.79 | 0.79 | 0.81 | 0.82 | 0.82 |
| ii | 0.49 | 0.49 | 0.48 | 0.45 | 0.45 | 0.42 | 0.40 | 0.31 | 0.28 | 0.31 | |
| iii | |||||||||||
Fig. 17Line chart of COVID-19 detection accuracy using the impulse noise-corrupted X-ray data with .