| Literature DB >> 34121816 |
Erdal Tasci1, Caner Uluturk1, Aybars Ugur1.
Abstract
Tuberculosis (TB) is known as a potentially dangerous and infectious disease that affects mostly lungs worldwide. The detection and treatment of TB at an early stage are critical for preventing the disease and decreasing the risk of mortality and transmission of it to others. Nowadays, as the most common medical imaging technique, chest radiography (CXR) is useful for determining thoracic diseases. Computer-aided detection (CADe) systems are also crucial mechanisms to provide more reliable, efficient, and systematic approaches with accelerating the decision-making process of clinicians. In this study, we propose voting and preprocessing variations-based ensemble CNN model for TB detection. We utilize 40 different variations in fine-tuned CNN models based on InceptionV3 and Xception by also using CLAHE (contrast-limited adaptive histogram equalization) preprocessing technique and 10 different image transformations for data augmentation types. After analyzing all these combination schemes, three or five best classifier models are selected as base learners for voting operations. We apply the Bayesian optimization-based weighted voting and the average of probabilities as a combination rule in soft voting methods on two TB CXR image datasets to get better results in various numbers of models. The computational results indicate that the proposed method achieves 97.500% and 97.699% accuracy rates on Montgomery and Shenzhen datasets, respectively. Furthermore, our method outperforms state-of-the-art results for the two TB detection datasets in terms of accuracy rate.Entities:
Keywords: Augmentation; CLAHE; Deep learning; Ensemble learning; Fine-tuning; Image processing; Pattern recognition; Tuberculosis detection; Voting
Year: 2021 PMID: 34121816 PMCID: PMC8182991 DOI: 10.1007/s00521-021-06177-2
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.606
Fig. 1The overview of the proposed method
Fig. 2Example of the CLAHE operation on an image of the Montgomery dataset (i) Original image (ii) The image after CLAHE operation
Fig. 3Examples of the transformation for data augmentation of an image of the Montgomery dataset (i) The image (ii) Rotation (iii) Reflection (iv) Scaling (v) Shearing (vi) Translation
Types of image data augmentation and parameter values
| Transformation | Value |
|---|---|
| RandXReflection | True |
| RandYReflection | True |
| RandRotation | [0 360] |
| RandXScale | [1 2] |
| RandYScale | [1 2] |
| RandXShear | [0 30] |
| RandYShear | [0 30] |
| RandXTranslation | [1 3] |
| RandYTranslation | [1 3] |
Generated combination sets for the experimental process
| Set | Minibatch | CNN Model | Preprocessing | Augmentation |
|---|---|---|---|---|
| 1 | 64 | Inceptionv3 | No preprocessing | No augmentation |
| 2 | 64 | Inceptionv3 | No preprocessing | No augmentation |
| 3 | 64 | Inceptionv3 | No preprocessing | RandXReflection |
| 4 | 64 | Inceptionv3 | No preprocessing | RandYReflection |
| 5 | 64 | Inceptionv3 | No preprocessing | RandRotation |
| 6 | 64 | Inceptionv3 | No preprocessing | RandXScale |
| 7 | 64 | Inceptionv3 | No preprocessing | RandYScale |
| 8 | 64 | Inceptionv3 | No preprocessing | RandXShear |
| 9 | 64 | Inceptionv3 | No preprocessing | RandYShear |
| 10 | 64 | Inceptionv3 | No preprocessing | RandXTranslation |
| 11 | 64 | Inceptionv3 | No preprocessing | RandYTranslation |
| 12 | 64 | Inceptionv3 | CLAHE | No augmentation |
| 13 | 64 | Inceptionv3 | CLAHE | RandXReflection |
| 14 | 64 | Inceptionv3 | CLAHE | RandYReflection |
| 15 | 64 | Inceptionv3 | CLAHE | RandRotation |
| 16 | 64 | Inceptionv3 | CLAHE | RandXScale |
| 17 | 64 | Inceptionv3 | CLAHE | RandYScale |
| 18 | 64 | Inceptionv3 | CLAHE | RandXShear |
| 19 | 64 | Inceptionv3 | CLAHE | RandYShear |
| 20 | 64 | inceptionv3 | CLAHE | RandXTranslation |
| 21 | 64 | Inceptionv3 | CLAHE | RandYTranslation |
| 22 | 16 | Xception | No preprocessing | No augmentation |
| 23 | 16 | Xception | No Preprocessing | No augmentation |
| 24 | 16 | xception | No preprocessing | RandXReflection |
| 25 | 16 | Xception | No preprocessing | RandYReflection |
| 26 | 16 | Xception | No preprocessing | RandRotation |
| 27 | 16 | Xception | No preprocessing | RandXScale |
| 28 | 16 | Xception | No preprocessing | RandYScale |
| 29 | 16 | Xception | No preprocessing | RandXShear |
| 30 | 16 | Xception | No preprocessing | RandYShear |
| 31 | 16 | Xception | No preprocessing | RandXTranslation |
| 32 | 16 | Xception | No preprocessing | RandYTranslation |
| 33 | 16 | Xception | CLAHE | No augmentation |
| 34 | 16 | Xception | CLAHE | RandXReflection |
| 35 | 16 | Xception | CLAHE | RandYReflection |
| 36 | 16 | Xception | CLAHE | RandRotation |
| 37 | 16 | Xception | CLAHE | RandXScale |
| 38 | 16 | Xception | CLAHE | RandYScale |
| 39 | 16 | Xception | CLAHE | RandXShear |
| 40 | 16 | Xception | CLAHE | RandYShear |
| 41 | 16 | Xception | CLAHE | RandXTranslation |
| 42 | 16 | Xception | CLAHE | RandYTranslation |
Fine-tuning times and experimental results of the combination sets for the Montgomery, Shenzhen datasets, and mean values in terms of accuracy rate
| Set | Montgomery FT | Shenzhen FT | Montgomery | Shenzhen | Mean datasets |
|---|---|---|---|---|---|
| Time (mm:ss) | Time (mm:ss) | ACC (%) | ACC (%) | ACC (%) | |
| 1 | 03 m 15 s | 11 m 51 s | 67.8571 | 88.4956 | 78.1764 |
| 2 | 00 m 58 s | 06 m 43 s | 67.8571 | 84.9558 | 76.4065 |
| 3 | 02 m 54 s | 13 m 32 s | 82.1429 | 89.3805 | 85.7617 |
| 4 | 02 m 53 s | 13 m 26 s | 75.0000 | 86.7257 | 80.8629 |
| 02 m 53 s | 82.1429 | 86.2042 | |||
| 6 | 02 m 52 s | 13 m 26 s | 78.5714 | 86.7257 | 82.6486 |
| 7 | 02 m 53 s | 13 m 48 s | 75.0000 | 85.8407 | 80.4204 |
| 8 | 02 m 54 s | 13 m 33 s | 82.1429 | 88.4956 | 85.3193 |
| 9 | 02 m 54 s | 13 m 32 s | 78.5714 | 84.9558 | 81.7636 |
| 10 | 03 m 00 s | 13 m 26 s | 82.1429 | 86.7257 | 84.4343 |
| 11 | 02 m 56 s | 13 m 46 s | 78.5714 | 85.8407 | 82.2061 |
| 12 | 00 m 58 s | 07 m 07 s | 75.0000 | 87.6106 | 81.3053 |
| 13 | 02 m 56 s | 13m 44s | 82.1429 | 89.3805 | 85.7617 |
| 14 | 02 m 54 s | 13 m 35 s | 85.7143 | 88.4954 | 87.1049 |
| 15 | 02 m 56 s | 13 m 25 s | 85.7143 | 85.8407 | 85.7775 |
| 13 m 29 s | 87.6106 | ||||
| 17 | 02 m 54 s | 13 m 26 s | 82.1429 | 87.6106 | 84.8768 |
| 18 | 02 m 58 s | 13 m 27 s | 89.2857 | 89.3805 | 89.3331 |
| 19 | 02 m 54 s | 13 m 49 s | 82.1429 | 88.4956 | 85.3193 |
| 20 | 02 m 57 s | 13 m 33 s | 85.7143 | 88.4956 | 87.1050 |
| 21 | 02 m 56 s | 13 m 36 s | 89.2857 | 89.3805 | 89.3331 |
| 22 | 05 m 48 s | 15 m 50 s | 82.1429 | 89.3805 | 85.7617 |
| 23 | 02 m 17 s | 10 m 35 s | 85.7143 | 88.4956 | 87.1050 |
| 24 | 04 m 57 s | 21 m 12 s | 89.2857 | 87.6106 | 88.4482 |
| 25 | 04 m 56 s | 21 m 11 s | 85.7143 | 85.8407 | 85.7775 |
| 26 | 04 m 56 s | 21 m 12 s | 85.7143 | 87.6106 | 86.6625 |
| 27 | 04 m 58 s | 21 m 12 s | 82.1429 | 89.3805 | 85.7617 |
| 28 | 04 m 54 s | 21 m 11 s | 82.1429 | 85.8407 | 83.9918 |
| 21 m 14 s | 85.8407 | 89.3489 | |||
| 30 | 04 m 59 s | 21 m 11 s | 85.7143 | 89.3805 | 87.5474 |
| 31 | 04 m 56 s | 21 m 27 s | 71.4286 | 85.8407 | 78.6347 |
| 32 | 04 m 54 s | 21 m 13 s | 82.1429 | 89.3805 | 85.7617 |
| 33 | 02 m 1 6s | 10 m 37 s | 89.2857 | 88.4956 | 88.8907 |
| 34 | 04 m 59 s | 21 m 17 s | 85.7143 | 87.6106 | 86.6625 |
| 35 | 04 m 56 s | 21 m 11 s | 82.1429 | 87.6106 | 84.8768 |
| 36 | 04 m 57 s | 21 m 14 s | 78.5714 | 88.4956 | 83.5335 |
| 37 | 04 m 57 s | 21 m 09 s | 85.7143 | 89.3805 | 87.5474 |
| 38 | 05 m 00 s | 21 m 09 s | 89.2857 | 87.6106 | 88.4482 |
| 39 | 05 m 01 s | 21 m 24 s | 78.5714 | 86.7257 | 82.6486 |
| 40 | 04 m 58 s | 21 m 11 s | 78.5714 | 88.4956 | 83.5335 |
| 41 | 04 m 57 s | 21 m 10 s | 85.7143 | 88.4956 | 87.1050 |
| 42 | 04 m 56 s | 21 m 20 s | 89.2857 | 88.4956 | 88.8907 |
Fig. 4A graphical representation of ACC values according to all variations in fine-tuning process for datasets
Soft voting and Bayesian optimization-based weighted voting results according to Montgomery, Shenzhen, and both mean datasets’ accuracy (%)
| Montgomery ACC-based | ||||||
|---|---|---|---|---|---|---|
| Soft voting | Dataset | Number of votes | Mean accuracy (%) | SD (%) | MeanAUC | SD |
| 10 seed | Montgomery | 3 | 96.7857 | 3.5515 | 0.9880 | 0.0208 |
| Montgomery | 5 | 96.7857 | 3.5515 | 0.9880 | 0.0208 | |
Fig. 5The diagram of the best voting scheme for the Montgomery dataset
Fig. 6The diagram of the best voting scheme for the Shenzhen dataset
Comparison of our proposed method with state-of-the-art methods on TB detection for two CXR image datasets used
| Study | Method | Montgomery | Montgomery | Shenzhen | Shenzhen |
|---|---|---|---|---|---|
| ACC (%) | AUC | ACC (%) | AUC | ||
| Our study, 2021 | VoPreCNNFT | 0.989 | 0.994 | ||
| Ayaz et al. [ | HCDEL | 93.470 | 0.970 | 97.590 | 0.990 |
| Win et al. [ | HDHFS | 92.700 | 0.995 | 95.500 | 0.995 |
| Xie et al. [ | FRCNN | 92.600 | 0.977 | 90.200 | 0.941 |
| Rajaraman et al. [ | SLMHDF | 87.500 | 0.962 | 93.400 | 0.991 |
| Santosh and Antani [ | SETFV | 83.000 | 0.900 | 91.000 | 0.960 |
| Lopes and Valiati [ | EDFPCNN | 82.600 | 0.926 | 84.700 | 0.926 |
| Pasa et al. [ | OptCNN | 79.000 | 0.811 | 84.400 | 0.900 |
| Hwang et al. [ | PreACNNF | 67.400 | 0.884 | 83.700 | 0.926 |