| Literature DB >> 35840945 |
Hantian Dong1, Biaokai Zhu2, Xinri Zhang3, Xiaomei Kong4.
Abstract
PURPOSE: This paper aims to develop a successful deep learning model with data augmentation technique to discover the clinical uniqueness of chest X-ray imaging features of coal workers' pneumoconiosis (CWP). PATIENTS AND METHODS: We enrolled 149 CWP patients and 68 dust-exposure workers for a prospective cohort observational study between August 2021 and December 2021 at First Hospital of Shanxi Medical University. Two hundred seventeen chest X-ray images were collected for this study, obtaining reliable diagnostic results through the radiologists' team, and confirming clinical imaging features. We segmented regions of interest with diagnosis reports, then classified them into three categories. To identify these clinical features, we developed a deep learning model (ShuffleNet V2-ECA Net) with data augmentation through performances of different deep learning models by assessment with Receiver Operation Characteristics (ROC) curve and area under the curve (AUC), accuracy (ACC), and Loss curves.Entities:
Keywords: Chest X-ray; Coal workers' pneumoconiosis classification; Data augmentation; Deep learning; ECA-Net; ShuffleNet
Mesh:
Substances:
Year: 2022 PMID: 35840945 PMCID: PMC9284687 DOI: 10.1186/s12890-022-02068-x
Source DB: PubMed Journal: BMC Pulm Med ISSN: 1471-2466 Impact factor: 3.320
Definition of CWP stages (according to GBZ70-2015)
| Summary of pneumoconiosis standard (CXR) | |
|---|---|
| Dust-exposure workers (Coal Mining Workers with clear dust-exposure history) | No opacities discovered, or small opacities (Level 1 profusion) discovered in one subregion |
| CWP Stage I | Small opacities (Level 1 profusion) discovered in two subregions at least, or small opacities (Level 2 profusion)discovered in four subregions at most |
| CWP Stage II | Small opacities (Level 2 profusion) discovered in four subregions at least, or small opacitie (Level 3 profusion) discovered |
| CWP Stage III | Small opacities (Level 3 profusion) discovered in four subregions at least, or large opacities discovered |
Baseline characteristics
| All subjects | Dust-exposed workers | CWP | CWP | CWP | |
|---|---|---|---|---|---|
| M (SD) | |||||
| Age (yr), mean (SD) | 55 (13) | 48.7 (6.0) | 60 (11) | 57.9 (7.9) | 56.3 (7.6) |
| Exposure duration (yr), mean (SD) | 29.0 (14) | 21.6 (7.5) | 32.0 (10) | 27.0 (14) | 16.4 (1.5) |
| Dyspnoea (SD) | 3 (9) | 0 (3) | 6 (9) | 4 (11) | 3.7 (2.1) |
| Cough (SD) | 0.3 (6) | 0 (3) | 2 (9) | 0 (7) | 8.3 (4.6) |
| mMRC score (SD) | 2 (2) | 0 (2) | 2 (2) | 2 (0) | 1.8 (0.8) |
| Total | 217 | 63 | 130 | 22 | 2 |
| Mining (n) | 63 | 32 | 26 | 5 | 0 |
| Tunnelling (n) | 105 | 29 | 60 | 14 | 2 |
| Comprehensive digging (n) | 17 | 0 | 16 | 1 | 0 |
| Mixing (n) | 19 | 2 | 15 | 2 | 0 |
| Other (n) | 13 | 0 | 13 | 0 | 0 |
M: mean, SD: standard deviation, mMRC: modified Medical Research Council
Fig. 1Original CXR a with identifying target lung region. We segmented b regions of interest (ROIs) classified into three types
Distribution of clinical CXR imaging features
| CXR Imaging feature | Lung zone | Number of ROIs |
|---|---|---|
| Pulmonary nodules | Top-right | 83 |
| Middle-right | 77 | |
| Bottom-right | 26 | |
| Top-left | 112 | |
| Middle-left | 55 | |
| Bottom-left | 23 | |
| Total | 376 | |
| Pulmonary interstitial changes | Top-right | 2 |
| Middle-right | 21 | |
| Bottom-right | 24 | |
| Top-left | 1 | |
| Middle-left | 33 | |
| Bottom-left | 35 | |
| Total | 116 | |
| Emphysema | Top-right | 8 |
| Middle-right | 3 | |
| Bottom-right | 4 | |
| Top-left | 6 | |
| Middle-left | 4 | |
| Bottom-left | 3 | |
| Total | 28 |
ROI: Region of interest
Fig. 2Flowsheet clarifying the procedure of classifying CXR clinical features among CWP
Fig. 3Comparison of accuracy in CWP classification with different algorithms
Detailed classification of image dataset augmentation
Performance of ShuffleNet v2, ResNet 50, GoogleNet, DenseNet 121 and ShuffleNet-Attention on the test set
| Accuracy | Recall | F1-score | Precision | |||
|---|---|---|---|---|---|---|
| Class A | Class B | Class C | ||||
| Shufflenet-Attenion | 0.96 | 0.97 | 0.96 | 0.96 | 0.92 | 0.93 |
| Shufflenet v2 | 0.94 | 0.95 | 0.95 | 0.95 | 0.89 | 0.9 |
| Resnet 50 | 0.9 | 0.94 | 0.93 | 0.93 | 0.85 | 0.84 |
| Googlenet | 0.91 | 0.94 | 0.92 | 0.92 | 0.91 | 0.82 |
| Densnet 121 | 0.89 | 0.9 | 0.86 | 0.86 | 0.82 | 0.88 |
Class A: pulmonary nodules, Class B: pulmonary interstitial changes, Class C: emphysema
Fig. 4The accuracy in classification with different models with epochs
Fig. 5The losses in classification with different models with epochs
Fig. 6The ROC curve in classification with different models with epochs. Class A: pulmonary nodules, Class B: pulmonary interstitial changes, Class C: emphysema