| Literature DB >> 30654560 |
Seok-Jae Heo1, Yangwook Kim2, Sehyun Yun3, Sung-Shil Lim4, Jihyun Kim5, Chung-Mo Nam6,7, Eun-Cheol Park8, Inkyung Jung9, Jin-Ha Yoon10,11.
Abstract
We aimed to use deep learning to detect tuberculosis in chest radiographs in annual workers' health examination data and compare the performances of convolutional neural networks (CNNs) based on images only (I-CNN) and CNNs including demographic variables (D-CNN). The I-CNN and D-CNN models were trained on 1000 chest X-ray images, both positive and negative, for tuberculosis. Feature extraction was conducted using VGG19, InceptionV3, ResNet50, DenseNet121, and InceptionResNetV2. Age, weight, height, and gender were recorded as demographic variables. The area under the receiver operating characteristic (ROC) curve (AUC) was calculated for model comparison. The AUC values of the D-CNN models were greater than that of I-CNN. The AUC values for VGG19 increased by 0.0144 (0.957 to 0.9714) in the training set, and by 0.0138 (0.9075 to 0.9213) in the test set (both p < 0.05). The D-CNN models show greater sensitivity than I-CNN models (0.815 vs. 0.775, respectively) at the same cut-off point for the same specificity of 0.962. The sensitivity of D-CNN does not attenuate as much as that of I-CNN, even when specificity is increased by cut-off points.Entities:
Keywords: computer-assisted diagnosis; convolutional neural network; deep learning; image; tuberculosis
Mesh:
Year: 2019 PMID: 30654560 PMCID: PMC6352082 DOI: 10.3390/ijerph16020250
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Lung segmentation using U-Net before training the convolutional neural network: (a) the original chest X-ray image, (b) a mask of lung structures segmented through U-Net, and (c) the final segmented image of the lungs.
Summary of demographic variables for training and test datasets.
| Variables | Training | Test | ||||
|---|---|---|---|---|---|---|
| Tuberculosis | Tuberculosis | |||||
| Positive | Negative | Positive | Negative | |||
| Age | 50.08 ± 10.74 | 40.33 ± 11.07 | <0.001 | 50.42 ± 10.48 | 40.30 ± 10.86 | <0.001 |
| Gender | <0.001 | <0.001 | ||||
| Male | 682 (68.20) | 561 (56.10) | 125 (61.88) | 20,445 (54.56) | ||
| Female | 318 (31.80) | 439 (43.90) | 77 (38.12) | 17,030 (45.44) | ||
| Height | 168.36 ± 8.33 | 167.85 ± 8.43 | 0.170 | 168.04 ± 8.53 | 167.54 ± 8.37 | 0.401 |
| Weight | 63.76 ± 11.42 | 64.98 ± 12.99 | 0.025 | 62.51 ± 10.74 | 64.43 ± 12.99 | 0.006 |
Values are presented as number (%) or mean ± standard deviation. * p value was calculated from t-test or chi-squared test.
Figure 2Flowchart of tuberculosis classification using the convolutional neural network (CNN) model. Flow 1 uses only chest X-rays for tuberculosis classification. Flow 2 uses demographic variables as well as chest X-rays.
Comparison of the area under the curve (AUC) when using only images and when adding demographic variables.
| Models | Training | Test | ||||||
|---|---|---|---|---|---|---|---|---|
| AUC | AUC | |||||||
| I-CNN * | D-CNN ** | Difference | I-CNN * | D-CNN ** | Difference | |||
| VGG19 | 0.9570 | 0.9714 | 0.0144 | <0.001 | 0.9075 | 0.9213 | 0.0138 | 0.049 |
| InceptionV3 | 0.9523 | 0.9616 | 0.0093 | 0.014 | 0.8821 | 0.9045 | 0.0224 | 0.033 |
| ResNet50 | 0.9219 | 0.9250 | 0.0031 | 0.434 | 0.8780 | 0.8955 | 0.0175 | 0.051 |
| DenseNet121 | 0.9315 | 0.9472 | 0.0157 | 0.002 | 0.8605 | 0.8893 | 0.0288 | 0.011 |
| InceptionResNetV2 | 0.9482 | 0.9455 | 0.0027 | 0.407 | 0.8851 | 0.8864 | 0.0013 | 0.888 |
* I-CNN: convolutional neural network only using images; ** D-CNN: I-CNN with demographic variables added.
AUC comparison of various demographic variable combinations for the training dataset: in reference to convolutional neural networks using I-CNN.
| Input Variables | AUC | |
|---|---|---|
| I-CNN | 0.9075 | - |
| I-CNN + Gender | 0.9107 | 0.657 |
| I-CNN + Age | 0.9111 | 0.602 |
| I-CNN + Weight | 0.9122 | 0.468 |
| I-CNN + Height | 0.9091 | 0.802 |
| I-CNN + Weight + Age | 0.9212 | 0.039 |
| I-CNN + Weight + Age + Gender | 0.9207 | 0.023 |
| I-CNN + Weight + Age + Gender + Height | 0.9213 | 0.049 |
* p value was calculated for the difference between the AUC based on I-CNN.
Figure 3Value of sensitivity and specificity changed according to the cut-off point: (a) the sensitivity and specificity for the training data set and (b) the sensitivity and specificity for the test data set.