| Literature DB >> 35494520 |
Olfa Hrizi1, Karim Gasmi1, Ibtihel Ben Ltaifa2, Hamoud Alshammari3, Hanen Karamti4, Moez Krichen5, Lassaad Ben Ammar6, Mahmood A Mahmood3.
Abstract
Computer science plays an important role in modern dynamic health systems. Given the collaborative nature of the diagnostic process, computer technology provides important services to healthcare professionals and organizations, as well as to patients and their families, researchers, and decision-makers. Thus, any innovations that improve the diagnostic process while maintaining quality and safety are crucial to the development of the healthcare field. Many diseases can be tentatively diagnosed during their initial stages. In this study, all developed techniques were applied to tuberculosis (TB). Thus, we propose an optimized machine learning-based model that extracts optimal texture features from TB-related images and selects the hyper-parameters of the classifiers. Increasing the accuracy rate and minimizing the number of characteristics extracted are our goals. In other words, this is a multitask optimization issue. A genetic algorithm (GA) is used to choose the best features, which are then fed into a support vector machine (SVM) classifier. Using the ImageCLEF 2020 data set, we conducted experiments using the proposed approach and achieved significantly higher accuracy and better outcomes in comparison with the state-of-the-art works. The obtained experimental results highlight the efficiency of modified SVM classifier compared with other standard ones.Entities:
Mesh:
Year: 2022 PMID: 35494520 PMCID: PMC9041161 DOI: 10.1155/2022/8950243
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 3.822
Figure 1Separating hyperplane.
Figure 2Slices of typical CT images with three types of TB-related findings (https://www.imageclef.org/2020/medical/tuberculosis).
Figure 3Illustration of the proposed approach.
Figure 4Block diagram of feature extraction by applying WT and SGLDM.
Extracted features.
| Features (mean/range) | |
|---|---|
| Frequency domain (32,33, 34) | Spatial domain (32,35) |
| Angular second moment | Correlation |
| Contrast | Variance |
| Inverse difference moment | Sum variance |
| Sum average | Difference variance |
| Entropy, sum entropy, difference entropy | Information measure of correlation I |
| Cluster prominence | Information measure of correlation II |
| Cluster shade | Maximal correlation coefficient |
| Dissimilarity | Correlation matrix |
| Homogeneity and inverse difference normalized | Maximum probability |
| Energy | |
Figure 5Genetic algorithm.
Class distribution in tuberculosis data set.
| Label | Number of occurrences | Test |
|---|---|---|
| Left lung affected | 211 | 75 |
| Right lung affected | 233 | 99 |
| Caverns left | 66 | 3 |
| Caverns right | 79 | 5 |
| Pleurisy left | 7 | 28 |
| Pleurisy right | 14 | 46 |
The percentage of train and test data for each label.
| Left lung affected | Right lung affected | Left lung | Right lung pleurisy | Left lung caverns | Right lung caverns | |
|---|---|---|---|---|---|---|
| Train | 211 (75%) | 233 (82%) | 7 (2%) | 14 (4%) | 66 (23%) | 79 (28%) |
| Test | 75 (63%) | 99 (83%) | 3 (3%) | 5 (4%) | 28 (23%) | 46 (38%) |
Truth table.
| Outcomes | Disease | ||
|---|---|---|---|
| Test | + | TP | FP |
| - | FN | TN | |
GA parameters.
| GA property | Value/method |
|---|---|
| Size of generation | 100 |
| Initial population size | 30 |
| Selection method | Tournament |
| Number of crossover points | 1 |
| Crossover probability | 0.9 |
| Mutation method | Uniform mutation |
| Mutation probability | 0.05 |
Sample of binary classification results based on an adaptive genetic algorithm.
| Accuracy | ||||||
|---|---|---|---|---|---|---|
| Left lung affected | Right lung affected | Caverns left | Caverns right | Pleurisy left | Pleurisy right | |
|
| 0.76 | 0.82 | 0.78 | 0.74 | 0.97 | 0.95 |
|
| 0.68 | 0.75 | 0.64 | 0.64 | 0.95 | 0.93 |
|
| 0.69 | 0.74 | 0.69 | 0.65 | 0.95 | 0.94 |
|
| 0.75 | 0.81 | 0.76 | 0.73 | 0.97 | 0.95 |
|
| 0.73 | 0.81 | 0.78 | 0.71 | 0.97 | 0.95 |
Comparison between performance algorithms based on the rate of accuracy.
| SVM | KNN | CART | NB | LDA | RF | |
|---|---|---|---|---|---|---|
| Left lung affected | 0.77 | 0.72 | 0.63 | 0.46 | 0.72 | 0.70 |
| Right lung affected | 0.82 | 0.80 | 0.69 | 0.74 | 0.81 | 0.81 |
| Caverns left | 0.78 | 0.76 | 0.68 | 0.72 | 0.75 | 0.75 |
| Caverns right | 0.77 | 0.73 | 0.51 | 0.42 | 0.76 | 0.68 |
| Pleurisy left | 0.97 | 0.97 | 0.95 | 0.82 | 0.95 | 0.97 |
| Pleurisy right | 0.95 | 0.95 | 0.93 | 0.89 | 0.93 | 0.95 |
| Mean accuracy | 0.84 | 0.82 | 0.73 | 0.67 | 0.82 | 0.81 |
Sample of selected feature results for the left and right lung affected category.
| Range of selected features | Left lung | Left lung |
|---|---|---|
| [1 : 20] | 0.75 | 0.79 |
| [3 : 12] | 0.78 | 0.83 |
| [3 : 14] | 0.77 | 0.82 |
| [3 : 20] | 0.77 | 0.81 |
| [3 : 10] | 0.75 | 0.8 |