| Literature DB >> 36010318 |
Balazs Feher1,2, Ulrike Kuchler1, Falk Schwendicke3, Lisa Schneider3, Jose Eduardo Cejudo Grano de Oro3, Tong Xi4, Shankeeth Vinayahalingam4, Tzu-Ming Harry Hsu5, Janet Brinz6, Akhilanand Chaurasia7, Kunaal Dhingra8, Robert Andre Gaudin9,10, Hossein Mohammad-Rahimi11, Nielsen Pereira12, Francesc Perez-Pastor13, Olga Tryfonos14, Sergio E Uribe15,16,17, Marcel Hanisch18, Joachim Krois3.
Abstract
The detection and classification of cystic lesions of the jaw is of high clinical relevance and represents a topic of interest in medical artificial intelligence research. The human clinical diagnostic reasoning process uses contextual information, including the spatial relation of the detected lesion to other anatomical structures, to establish a preliminary classification. Here, we aimed to emulate clinical diagnostic reasoning step by step by using a combined object detection and image segmentation approach on panoramic radiographs (OPGs). We used a multicenter training dataset of 855 OPGs (all positives) and an evaluation set of 384 OPGs (240 negatives). We further compared our models to an international human control group of ten dental professionals from seven countries. The object detection model achieved an average precision of 0.42 (intersection over union (IoU): 0.50, maximal detections: 100) and an average recall of 0.394 (IoU: 0.50-0.95, maximal detections: 100). The classification model achieved a sensitivity of 0.84 for odontogenic cysts and 0.56 for non-odontogenic cysts as well as a specificity of 0.59 for odontogenic cysts and 0.84 for non-odontogenic cysts (IoU: 0.30). The human control group achieved a sensitivity of 0.70 for odontogenic cysts, 0.44 for non-odontogenic cysts, and 0.56 for OPGs without cysts as well as a specificity of 0.62 for odontogenic cysts, 0.95 for non-odontogenic cysts, and 0.76 for OPGs without cysts. Taken together, our results show that a combined object detection and image segmentation approach is feasible in emulating the human clinical diagnostic reasoning process in classifying cystic lesions of the jaw.Entities:
Keywords: artificial intelligence; cysts; diagnosis; machine learning; oral; radiography; surgery
Year: 2022 PMID: 36010318 PMCID: PMC9406703 DOI: 10.3390/diagnostics12081968
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Dataset characteristics.
| Vienna | Nijmegen | Total | |
|---|---|---|---|
|
| |||
| Age, median (IQR), years | 50 (39–60) | 57 (45–66) | 53.5 (41–63) |
| Female, n (% of total) | 391 (43) | 115 (35) | 506 (41) |
| Male, n (% of total) | 519 (57) | 214 (65) | 733 (59) |
|
| |||
| Cysts, n.f.s., n (%) | 215 (23.6) | 0 (0) | 215 (17.3) |
| Odontogenic cysts, n (%) | 485 (53.3) | 102 (31) | 587 (47.3) |
| Non-odontogenic cysts, n (%) | 90 (9.9) | 139 (42.2) | 229 (18.5) |
| Negative controls, n (%) | 120 (13.2) | 88 (26.7) | 208 (16.8) |
IQR, interquartile range.
Figure 1Overview of the diagnostic pipeline. (a) First, the object detection model detects cystic lesions and marks them with a bounding box. (b) Next, multiple segmentation models are employed which segment structures of maxilla, mandible, mandibular canal, maxillary sinuses, the complete dentition, as well as each individual tooth. (c) The overlaps between the marked bounding box and segmented structures are then calculated. (d) Finally, the Random Forest classifier gives a preliminary diagnosis based on the computed overlaps for each sample.
Figure A1Sample panoramic radiographs for odontogenic and non-odontogenic cysts. Top, odontogenic (radicular) cyst at the upper right first molar. The radiolucency appears centered around the roots of the affected tooth, which clearly shows multiple signs of pathology. Bottom, non-odontogenic (globulomaxillary) cyst between the upper left lateral incisor and canine. There is no sign of primary pathology of either the incisor or the canine; their displacement is an assumed consequence of cystic growth.
Detection performance.
| IoU | MD | Value | |
|---|---|---|---|
| Average precision | 0.50–0.95 | 100 | 0.213 |
| 0.50 | 100 | 0.424 | |
| 0.75 | 100 | 0.203 | |
| Average recall | 0.50–0.95 | 1 | 0.244 |
| 0.50–0.95 | 10 | 0.331 | |
| 0.50–0.95 | 100 | 0.394 |
IoU, intersection over union; MD, maximal detections.
Classification performance.
| Odontogenic Cyst | Non-Odontogenic Cyst | |
|---|---|---|
|
| ||
| Sensitivity (Recall) | 0.84 | 0.56 |
| Specificity | 0.56 | 0.84 |
| PPV (Precision) | 0.89 | 0.45 |
| NPV | 0.45 | 0.89 |
| F1-score | 0.86 | 0.50 |
|
| ||
| Sensitivity (Recall) | 0.91 | 0.51 |
| Specificity | 0.51 | 0.91 |
| PPV (Precision) | 0.83 | 0.68 |
| NPV | 0.68 | 0.83 |
| F1-score | 0.87 | 0.58 |
IoU, intersection over union; NPV, negative predictive value, PPV, positive predictive value.
Figure A2Performance of Random Forest classifier on test set predictions with increasing thresholds in terms of F1-score, sensitivity, specificity, and precision. The sample count visualizes that with higher threshold, the number of samples included in the test set decreases as the Random Forest is only applicable to samples with detected cysts.
Human diagnostic performance.
| Odontogenic Cyst | Non-Odontogenic Cyst | No Cyst | |
|---|---|---|---|
| Sensitivity (Recall) | 0.70 | 0.44 | 0.56 |
| Specificity | 0.62 | 0.95 | 0.76 |
| PPV (Precision) | 0.53 | 0.58 | 0.78 |
| NPV | 0.83 | 0.94 | 0.62 |
| F1-score | 0.56 | 0.45 | 0.61 |
NPV, negative predictive value, PPV, positive predictive value.
Individual human diagnostic performance.
| PPV (Precision) | Sensitivity (Recall) | F1-Score | |
|---|---|---|---|
|
| |||
| Odontogenic cyst | 0.49 | 0.92 | 0.64 |
| Non-odontogenic cyst | 0.65 | 0.69 | 0.67 |
| No cyst | 0.91 | 0.41 | 0.56 |
| Accuracy | 0.61 | ||
| Macro average | 0.68 | 0.67 | 0.62 |
| Weighted average | 0.74 | 0.61 | 0.60 |
|
| |||
| Odontogenic cyst | 0.55 | 0.64 | 0.59 |
| Non-odontogenic cyst | 0.86 | 0.38 | 0.52 |
| No cyst | 0.71 | 0.71 | 0.71 |
| Accuracy | 0.65 | ||
| Macro average | 0.70 | 0.57 | 0.61 |
| Weighted average | 0.67 | 0.65 | 0.65 |
|
| |||
| Odontogenic cyst | 0.57 | 0.80 | 0.66 |
| Non-odontogenic cyst | 0.58 | 0.22 | 0.32 |
| No cyst | 0.82 | 0.71 | 0.76 |
| Accuracy | 0.69 | ||
| Macro average | 0.66 | 0.58 | 0.58 |
| Weighted average | 0.71 | 0.69 | 0.68 |
|
| |||
| Odontogenic cyst | 0.56 | 0.62 | 0.59 |
| Non-odontogenic cyst | 0.47 | 0.69 | 0.56 |
| No cyst | 0.79 | 0.67 | 0.72 |
| Accuracy | 0.65 | ||
| Macro average | 0.61 | 0.66 | 0.62 |
| Weighted average | 0.68 | 0.65 | 0.66 |
|
| |||
| Odontogenic cyst | 0.53 | 0.73 | 0.61 |
| Non-odontogenic cyst | 0.62 | 0.47 | 0.54 |
| No cyst | 0.78 | 0.64 | 0.70 |
| Accuracy | 0.65 | ||
| Macro average | 0.64 | 0.61 | 0.62 |
| Weighted average | 0.68 | 0.65 | 0.65 |
|
| |||
| Odontogenic cyst | 0.57 | 0.50 | 0.53 |
| Non-odontogenic cyst | 0.66 | 0.59 | 0.62 |
| No cyst | 0.68 | 0.75 | 0.71 |
| Accuracy | 0.65 | ||
| Macro average | 0.64 | 0.61 | 0.62 |
| Weighted average | 0.64 | 0.65 | 0.64 |
|
| |||
| Odontogenic cyst | 0.46 | 0.67 | 0.55 |
| Non-odontogenic cyst | 0.31 | 0.34 | 0.32 |
| No cyst | 0.75 | 0.53 | 0.62 |
| Accuracy | 0.55 | ||
| Macro average | 0.50 | 0.51 | 0.50 |
| Weighted average | 0.60 | 0.55 | 0.56 |
|
| |||
| Odontogenic cyst | 0.87 | 0.13 | 0.22 |
| Non-odontogenic cyst | 0.69 | 0.34 | 0.46 |
| No cyst | 0.60 | 0.97 | 0.74 |
| Accuracy | 0.62 | ||
| Macro average | 0.72 | 0.48 | 0.47 |
| Weighted average | 0.70 | 0.62 | 0.54 |
|
| |||
| Odontogenic cyst | 0.42 | 0.97 | 0.58 |
| Non-odontogenic cyst | 0.75 | 0.09 | 0.17 |
| No cyst | 0.90 | 0.32 | 0.47 |
| Accuracy | 0.51 | ||
| Macro average | 0.69 | 0.46 | 0.41 |
| Weighted average | 0.72 | 0.51 | 0.48 |
|
| |||
| Odontogenic cyst | 0.42 | 0.89 | 0.57 |
| Non-odontogenic cyst | 0.44 | 0.81 | 0.57 |
| No cyst | 0.79 | 0.11 | 0.20 |
| Accuracy | 0.45 | ||
| Macro average | n/a | ||
| Weighted average | n/a |
PPV, positive predictive value.
Figure 2Diagnostic performance. (a) Human controls in increasing order of mean F1-score. The black line represents the mean F1-score of the entire human control group, the dashed lines represent plus or minus one standard deviation, respectively. (b) Classification models in increasing order of mean F1-score.