| Literature DB >> 35741169 |
Haizhen Li1, Ying Xu1, Yi Lei2, Qing Wang3,4, Xuemei Gao1.
Abstract
(1) Background: The present study aims to evaluate and compare the model performances of different convolutional neural networks (CNNs) used for classifying sagittal skeletal patterns. (2)Entities:
Keywords: artificial intelligence; convolutional neural networks; orthodontic sagittal skeletal pattern classification
Year: 2022 PMID: 35741169 PMCID: PMC9221941 DOI: 10.3390/diagnostics12061359
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1(a) Flow chart of the experimental process. (b) Schematic diagram of data preprocessing. (c) Schematic diagram of data augmentation. (d) Schematic diagram of model visualization.
Numbers of patients assigned to the training, validation, and test sets of the three sagittal skeletal subgroups.
| Class I | Class II | Class III | Total | |
|---|---|---|---|---|
| Training set | 642 | 525 | 534 | 1701 |
| Validation set | 138 | 113 | 115 | 366 |
| Test set | 138 | 112 | 115 | 365 |
| Total | 918 | 750 | 764 | 2432 |
Figure 2Training results for the four candidate CNN models. DenseNet161 had the highest accuracy. (a) DesneNet161; (b) ResNet152; (c) VGG16; (d) GoogLeNet. The horizontal axis of the graph represents the training epochs. The left vertical axis of the graph represents the classification accuracy, and the right vertical axis of the graph represents the loss value.
Model size, training time, classification accuracy, inference time, and AUC value of the four CNNs.
| Model Size | Training Time (min) | Accuracy | Inference Time | AUC Value | |
|---|---|---|---|---|---|
| DenseNet161 | 102 | 40 | 89.58 | 0.32 | 0.977 |
| ResNet152 | 222 | 38 | 89.04 | 0.32 | 0.974 |
| VGG16 | 512 | 33 | 88.76 | 0.26 | 0.973 |
| GoogLeNet | 21.5 | 27 | 87.94 | 0.083 | 0.972 |
Figure 3Confusion matrix of the four CNN models of the test set: data concentrating in the diagonal line indicates better predictive performance in the four models. (a) DesneNet161; (b) ResNet152; (c) VGG16; (d) GoogLeNet.
Precision, recall rates, and F1 scores of four CNNs on the test set for each sagittal skeletal pattern subgroup.
| Precision (95% CI) | Recall (95% CI) | F1 Score (95% CI) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| I | II | III | I | II | III | I | II | III | |
| DenseNet161 | 0.83(0.77–0.88) | 0.93(0.86–0.96) | 0.95(0.90–0.98) | 0.91(0.85–0.94) | 0.88(0.81–0.93) | 0.90(0.83–0.94) | 0.87(0.81–0.691) | 0.90(0.83–0.94) | 0.92(0.86–0.96) |
| ResNet152 | 0.83(0.75–0.87) | 0.89(0.82–0.94) | 0.97(0.92–0.99) | 0.89(0.83–0.93) | 0.88(0.80–0.92) | 0.90(0.84–0.95) | 0.86(0.79–0.90) | 0.88(0.81–0.93) | 0.94(0.88–0.97) |
| VGG16 | 0.84(0.78–0.89) | 0.88(0.81–0.93) | 0.95(0.90–0.98) | 0.86(0.79–0.91) | 0.90(0.83–0.94) | 0.90(0.84–0.95) | 0.85(0.78–0.90) | 0.89(0.82–0.93) | 0.93(0.87–0.96) |
| GoogLeNet | 0.87(0.80–0.92) | 0.84(0.77–0.90) | 0.93(0.87–0.96) | 0.80(0.73–0.86) | 0.91(0.84–0.95) | 0.94(0.88–0.97) | 0.83(0.76–0.89) | 0.88(0.80–0.92) | 0.94(0.87–0.96) |
Figure 4ROC curves of the four CNN models of the test sets. All four CNNs models have good ability to classify the sagittal skeletal patterns. (a) DesneNet161; (b) ResNet152; (c) VGG16; (d) GoogLeNet.
Figure 5Representative class activation maps of different CNNs; (a–d) representative images of Class I; (e–h) representative images of Class II; (i–l) representative images of Class III.
The average ANB angles and Wits values of the misjudged samples by the four CNNs.
| DenseNet161 | ResNet152 | VGG16 | GoogLeNet | |||||
|---|---|---|---|---|---|---|---|---|
| ANB | Wits | ANB | Wits | ANB | Wits | ANB | Wits | |
| I–II * | 4.38 ± 0.34 | 1.63 ± 0.25 | 4.43 ± 0.31 | 1.45 ± 0.3 | 4.38 ± 0.29 | 1.51 ± 0.17 | 4.51 ± 0.29 | 1.48 ± 0.26 |
| II–I * | 5.61 ± 0.35 | 2.85 ± 0.48 | 5.45 ± 0.33 | 3.23 ± 0.57 | 5.57 ± 0.29 | 2.85 ± 0.61 | 5.63 ± 0.16 | 3.07 ± 0.61 |
| I–III * | 0.66 ± 0.34 | −1.92 ± 0.64 | 0.50 ± 0.20 | −1.83 ± 0.84 | 0.54 ± 0.30 | −1.94 ± 0.32 | 0.44 ± 0.32 | −1.79 ± 0.65 |
| III–I * | −0.38 ± 0.31 | −3.34 ± 0.32 | −0.42 ± 0.29 | −3.75 ± 0.29 | −0.55 ± 0.30 | −3.46 ± 0.22 | −0.33 ± 0.22 | −3.31 ± 0.23 |
* I–II represents the skeletal Class I samples misjudged as skeletal Class II, II–I represents the skeletal Class II samples misjudged as skeletal Class I, I–III represents the skeletal Class I samples misjudged as skeletal Class III, and III–I represents the skeletal Class III samples misjudged as skeletal Class I.