| Literature DB >> 35204365 |
Jung Han Hwang1, Jae Won Seo2, Jeong Ho Kim1, Suyoung Park1, Young Jae Kim3, Kwang Gi Kim3.
Abstract
In this study, we aimed to investigate quantitative differences in performance in terms of comparing the automated classification of deep vein thrombosis (DVT) using two categories of artificial intelligence algorithms: deep learning based on convolutional neural networks (CNNs) and conventional machine learning. We retrospectively enrolled 659 participants (DVT patients, 282; normal controls, 377) who were evaluated using contrast-enhanced lower extremity computed tomography (CT) venography. Conventional machine learning consists of logistic regression (LR), support vector machines (SVM), random forests (RF), and extreme gradient boosts (XGB). Deep learning based on CNN included the VGG16, VGG19, Resnet50, and Resnet152 models. According to the mean generated AUC values, we found that the CNN-based VGG16 model showed a 0.007 higher performance (0.982 ± 0.014) as compared with the XGB model (0.975 ± 0.010), which showed the highest performance among the conventional machine learning models. In the conventional machine learning-based classifications, we found that the radiomic features presenting a statistically significant effect were median values and skewness. We found that the VGG16 model within the deep learning algorithm distinguished deep vein thrombosis on CT images most accurately, with slightly higher AUC values as compared with the other AI algorithms used in this study. Our results guide research directions and medical practice.Entities:
Keywords: computed tomography; deep learning; deep vein thrombosis; machine learning; radiomics
Year: 2022 PMID: 35204365 PMCID: PMC8871174 DOI: 10.3390/diagnostics12020274
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flowchart of deep vein thrombosis (DVT) classification using deep learning models based on convolutional neural networks (CNN) and conventional machine learning.
Figure 2Example of generating the vein region image containing thrombus: (a) a contrast-enhanced lower extremity computed tomography venography image in coronal view; (b) computed tomography venography images containing thrombus; (c) generated vein region image from (b).
Selected features from machine learning algorithms.
| Feature Number | Features | |
|---|---|---|
| Logistic regression | 20 | First order: 10th percentile, 90th percentile, entropy, maximum, median, minimum, skewness |
| Support vector machine | 20 | First order: 10th percentile, 90th percentile, entropy, maximum, median, robust mean absolute deviation, skewness |
| Random forest | 19 | First order: mean, median, root mean squared, skewness |
| Extreme gradient boost | 18 | First order: mean, median, range, root mean squared, skewness, uniformity |
GLCM, gray level co-occurrence matrix; GLRLM, gray level run length matrix; GLSZM, gray level size zone matrix.
Comparison of the mean values for each model’s AUC, sensitivity, specificity, and accuracy.
| Mean AUC | Mean Sensitivity | Mean Specificity | Mean Accuracy | |
|---|---|---|---|---|
| LR | 0.954 (±0.002) | 0.879 (±0.002) | 0.902 (±0.004) | 0.889 (±0.002) |
| SVM | 0.964 (±0.001) | 0.913 (±0.001) | 0.919 (±0.003) | 0.915 (±0.001) |
| RF | 0.969 (±0.001) | 0.932 (±0.002) | 0.921 (±0.002) | 0.926 (±0.001) |
| XGB |
|
|
|
|
| VGG16 |
|
| 0.916 (±0.005) | 0.934 (±0.003) |
| VGG19 | 0.981 (±0.002) | 0.950 (±0.004) |
|
|
| Resnet50 | 0.904 (±0.008) | 0.858 (±0.007) | 0.859 (±0.010) | 0.849 (±0.006) |
| Resnet152 | 0.888 (±0.014) | 0.825 (±0.006) | 0.873 (±0.012) | 0.841 (±0.008) |
AUC, area under a receiver operating characteristic curve; CI, confidence interval; LR, logistic regression; SVM, support vector machine; RF, random forest; XGB, extreme gradient boost. The highest performance values are shown in bold.
Figure 3Receiver operation characteristic (ROC) curves: (a) ROC curves of machine learning algorithms; (b) ROC curves of deep learning algorithms; (c) ROC curves of all algorithms.
Comparison of the mean performance values of deep learning and machine learning algorithms.
| Deep Learning Algorithms | Machine Learning Algorithms | |
|---|---|---|
| Mean AUC (±SD) | 0.939 (±0.043) |
|
| Mean sensitivity (±SD) | 0.897 (±0.057) |
|
| Mean specificity (±SD) | 0.894 (±0.028) |
|
| Mean accuracy (±SD) | 0.890 (±0.045) |
|
AUC, area under a receiver operating characteristic curve; SD, standard deviation. The highest performance values are shown in bold.