| Literature DB >> 33067538 |
Mizuho Nishio1, Shunjiro Noguchi2, Hidetoshi Matsuo3, Takamichi Murakami3.
Abstract
This study aimed to develop and validate computer-aided diagnosis (CXDx) system for classification between COVID-19 pneumonia, non-COVID-19 pneumonia, and the healthy on chest X-ray (CXR) images. From two public datasets, 1248 CXR images were obtained, which included 215, 533, and 500 CXR images of COVID-19 pneumonia patients, non-COVID-19 pneumonia patients, and the healthy samples, respectively. The proposed CADx system utilized VGG16 as a pre-trained model and combination of conventional method and mixup as data augmentation methods. Other types of pre-trained models were compared with the VGG16-based model. Single type or no data augmentation methods were also evaluated. Splitting of training/validation/test sets was used when building and evaluating the CADx system. Three-category accuracy was evaluated for test set with 125 CXR images. The three-category accuracy of the CAD system was 83.6% between COVID-19 pneumonia, non-COVID-19 pneumonia, and the healthy. Sensitivity for COVID-19 pneumonia was more than 90%. The combination of conventional method and mixup was more useful than single type or no data augmentation method. In conclusion, this study was able to create an accurate CADx system for the 3-category classification. Source code of our CADx system is available as open source for COVID-19 research.Entities:
Mesh:
Year: 2020 PMID: 33067538 PMCID: PMC7567783 DOI: 10.1038/s41598-020-74539-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Representative CXR images of COVID-19 pneumonia, non-COVID-19 pneumonia, and the healthy. CXR chest X-ray imaging, COVID-19 novel coronavirus disease. (A) COVID-19 pneumonia of 30-year-old male. (B) Non-COVID-19 pneumonia of 56-year-old male. (C) No pneumonia of 60-year-old female.
Patients’ characteristics and CXR attributes.
| Category | Value |
|---|---|
| Number of images | 1248 |
| Male | 512 |
| Female | 702 |
| Not available | 34 |
| Available | 1205 |
| Not available | 43 |
| Mean ± SD of age (years) | 48.1 ± 17.5 |
| COVID-19 | 215 |
| Non-COVID-19 pneumonia | 533 |
| The healthy | 500 |
| PA | 666 |
| AP | 582 |
CXR chest X-ray imaging, COVID-19 novel coronavirus disease, SD standard deviation, PA posterior–anterior view, AP anterior–posterior view.
Figure 2Outline of deep learning model of the proposed method. Note: For pre-trained models, VGG16, Resnet-50, MobileNet, DenseNet-121, and EfficientNet were used in the current study. Activation function is omitted for brevity. GAP global averaging pooling layer, FC fully-connected layer, D dropout layer.
Results of five pre-trained models.
| Models | Loss of test set | 3-category accuracy of test set (%) |
|---|---|---|
| VGG16 (proposed method) | 0.4682 ± 0.0289 | 83.68 ± 2.00 |
| Resnet-50 | 0.5237 ± 0.0161 | 77.76 ± 1.18 |
| MobileNet | 0.4919 ± 0.0300 | 78.72 ± 3.22 |
| DenseNet-121 | 0.5276 ± 0.0082 | 78.24 ± 2.23 |
| EfficientNet | 0.5206 ± 0.0177 | 78.40 ± 1.82 |
Value of each cell was mean ± standard deviation of 5 trials.
Representative confusion matrix of 3-category classification in test set.
| Prediction by the proposed model | ||||
|---|---|---|---|---|
| The healthy | Non-COVID-19 pneumonia | COVID-19 pneumonia | ||
| The healthy | 43 | 7 | 0 | |
| Non-COVID-19 pneumonia | 9 | 41 | 3 | |
| COVID-19 pneumonia | 2 | 0 | 20 | |
Accuracy was 83.2% (104/125).
Results of ablation study of the proposed method for data augmentation methods and layer freezing.
| Models | Loss of test set | 3-category accuracy of test set (%) |
|---|---|---|
| Proposed method | 0.4682 ± 0.0289 | 83.68 ± 2.00 |
| No data augmentation with layer freezing | 0.9009 ± 0.1967 | 78.72 ± 1.65 |
| Conventional data augmentation method only with layer freezing | 0.4863 ± 0.0274 | 82.56 ± 2.45 |
| Mix-up only with layer freezing | 0.6407 ± 0.0674 | 79.20 ± 1.75 |
| Conventional data augmentation method and mixup without layer freezing | 0.5143 ± 0.0179 | 79.04 ± 2.60 |
Value of each cell was mean ± standard deviation of 5 trials.