| Literature DB >> 35330386 |
Van Nhat Thang Le1,2,3,4, Junhyeok Kang5, Il-Seok Oh5, Jae-Gon Kim1,2,3, Yeon-Mi Yang1,2,3, Dae-Woo Lee1,2,3.
Abstract
Detection of cephalometric landmarks has contributed to the analysis of malocclusion during orthodontic diagnosis. Many recent studies involving deep learning have focused on head-to-head comparisons of accuracy in landmark identification between artificial intelligence (AI) and humans. However, a human-AI collaboration for the identification of cephalometric landmarks has not been evaluated. We selected 1193 cephalograms and used them to train the deep anatomical context feature learning (DACFL) model. The number of target landmarks was 41. To evaluate the effect of human-AI collaboration on landmark detection, 10 images were extracted randomly from 100 test images. The experiment included 20 dental students as beginners in landmark localization. The outcomes were determined by measuring the mean radial error (MRE), successful detection rate (SDR), and successful classification rate (SCR). On the dataset, the DACFL model exhibited an average MRE of 1.87 ± 2.04 mm and an average SDR of 73.17% within a 2 mm threshold. Compared with the beginner group, beginner-AI collaboration improved the SDR by 5.33% within a 2 mm threshold and also improved the SCR by 8.38%. Thus, the beginner-AI collaboration was effective in the detection of cephalometric landmarks. Further studies should be performed to demonstrate the benefits of an orthodontist-AI collaboration.Entities:
Keywords: cephalometric landmark detection; clinical application; deep learning
Year: 2022 PMID: 35330386 PMCID: PMC8954049 DOI: 10.3390/jpm12030387
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Descriptive summary of study data.
| Variables | Mean ± SD/N (%) |
|---|---|
| Training data | |
| Age (years) | 9.31 ± 2.77 |
| Gender (male) | 566 (47.44) |
| Angle classification | |
| Class I | 324 (27.16) |
| Class II division 1 | 291 (24.39) |
| Class II division 2 | 139 (11.65) |
| Class III | 439 (36.80) |
| Test data | |
| Age (years) | 9.74 ± 3.12 |
| Gender (male) | 47 (47.00) |
| Angle classification | |
| Class I | 27 (27.00) |
| Class II division 1 | 24 (24.00) |
| Class II division 2 | 12 (12.00) |
| Class III | 37 (37.00) |
Figure 1Workflow diagram of the study plan. In step 1, JBNU dataset including 1193 images for training and 100 images for testing was used to evaluate the performance of the DACFL model in clinical applications. In step 2, 10 images were extracted randomly from JBNU test data to evaluate the effect of DACFL-based support on the clinical skills of beginners in cephalometric diagnosis. Abbreviations: AI, artificial intelligence; DACFL, deep anatomical context feature learning; JBNU, Jeonbuk National University.
Results of landmark detection in terms of mean radial error.
| No. | Landmarks | AI | |
|---|---|---|---|
| MRE (mm) | SD | ||
| 1 | Sella | 0.76 | 0.44 |
| 2 | Porion | 1.40 | 1.20 |
| 3 | Basion | 2.09 | 1.98 |
| 4 | Hinge axis | 1.70 | 1.06 |
| 5 | Pterygoid | 2.38 | 1.46 |
| 6 | Nasion | 1.33 | 0.87 |
| 7 | Orbitale | 2.23 | 1.81 |
| 8 | A-point | 1.43 | 1.11 |
| 9 | PM | 1.32 | 0.88 |
| 10 | Pogonion | 1.19 | 0.96 |
| 11 | B-point | 1.69 | 1.22 |
| 12 | Posterior nasal spine | 1.63 | 1.37 |
| 13 | Anterior nasal spine | 1.27 | 0.91 |
| 14 | R1 | 2.32 | 1.62 |
| 15 | R3 | 1.84 | 1.21 |
| 16 | Articulare | 1.03 | 0.74 |
| 17 | Menton | 1.22 | 0.91 |
| 18 | Maxilla 1 crown | 1.03 | 0.92 |
| 19 | Maxilla 1 root | 3.31 | 2.78 |
| 20 | Mandible 1 crown | 0.90 | 0.52 |
| 21 | Mandible 1 root | 2.89 | 4.04 |
| 22 | Maxilla 6 distal | 1.41 | 1.86 |
| 23 | Maxilla 6 root | 2.20 | 1.53 |
| 24 | Mandible 6 distal | 1.64 | 2.32 |
| 25 | Mandible 6 root | 2.97 | 2.50 |
| 26 | Glabella | 5.18 | 5.13 |
| 27 | Soft tissue nasion | 3.10 | 2.44 |
| 28 | Pronasale | 2.06 | 8.16 |
| 29 | Columella | 1.05 | 0.87 |
| 30 | Subnasale | 1.06 | 0.99 |
| 31 | Soft tissue A | 1.21 | 1.34 |
| 32 | Upper lip | 1.48 | 4.10 |
| 33 | Stms | 1.82 | 1.46 |
| 34 | Stmi | 1.03 | 0.79 |
| 35 | Lower lip | 1.16 | 0.90 |
| 36 | Soft tissue B | 2.08 | 2.70 |
| 37 | Soft tissue pogonion | 4.70 | 10.39 |
| 38 | Gnathion | 1.34 | 2.08 |
| 39 | Gonion | 2.70 | 2.14 |
| 40 | APOcc | 1.02 | 1.19 |
| 41 | PPOcc | 2.33 | 2.86 |
| Average | 1.87 | 2.04 | |
Abbreviations: AI, artificial intelligence; MRE, mean radial error; SD, standard deviation.
Results of landmark detection in terms of successful detection rate.
| No. | Landmarks | SDR (%) | |||
|---|---|---|---|---|---|
| <2 mm | <2.5 mm | <3 mm | <4 mm | ||
| 1 | Sella | 97% | 100% | 100% | 100% |
| 2 | Porion | 84% | 92% | 94% | 95% |
| 3 | Basion | 59% | 73% | 82% | 89% |
| 4 | Hinge axis | 64% | 79% | 90% | 97% |
| 5 | Pterygoid | 48% | 58% | 69% | 85% |
| 6 | Nasion | 83% | 92% | 93% | 99% |
| 7 | Orbitale | 59% | 70% | 79% | 90% |
| 8 | A-point | 82% | 89% | 91% | 96% |
| 9 | PM | 80% | 88% | 95% | 100% |
| 10 | Pogonion | 86% | 91% | 95% | 97% |
| 11 | B-point | 69% | 80% | 84% | 91% |
| 12 | Posterior nasal spine | 74% | 86% | 89% | 97% |
| 13 | Anterior nasal spine | 85% | 92% | 92% | 99% |
| 14 | R1 | 51% | 59% | 73% | 85% |
| 15 | R3 | 61% | 77% | 88% | 94% |
| 16 | Articulare | 90% | 95% | 97% | 99% |
| 17 | Menton | 86% | 89% | 96% | 96% |
| 18 | Maxilla 1 crown | 91% | 94% | 94% | 97% |
| 19 | Maxilla 1 root | 38% | 49% | 60% | 73% |
| 20 | Mandible 1 crown | 96% | 98% | 100% | 100% |
| 21 | Mandible 1 root | 48% | 58% | 70% | 82% |
| 22 | Maxilla 6 distal | 87% | 92% | 95% | 98% |
| 23 | Maxilla 6 root | 56% | 68% | 78% | 93% |
| 24 | Mandible 6 distal | 84% | 88% | 88% | 92% |
| 25 | Mandible 6 root | 36% | 53% | 68% | 80% |
| 26 | Glabella | 32% | 40% | 46% | 58% |
| 27 | Soft tissue nasion | 38% | 47% | 60% | 76% |
| 28 | Pronasale | 95% | 95% | 95% | 95% |
| 29 | Columella | 96% | 96% | 97% | 97% |
| 30 | Subnasale | 93% | 96% | 97% | 98% |
| 31 | Soft tissue A | 89% | 93% | 94% | 97% |
| 32 | Upper lip | 89% | 91% | 93% | 97% |
| 33 | Stms | 68% | 77% | 83% | 92% |
| 34 | Stmi | 87% | 93% | 96% | 100% |
| 35 | Lower lip | 86% | 88% | 92% | 99% |
| 36 | Soft tissue B | 74% | 81% | 82% | 87% |
| 37 | Soft tissue pogonion | 62% | 69% | 76% | 80% |
| 38 | Gnathion | 90% | 92% | 95% | 96% |
| 39 | Gonion | 51% | 56% | 68% | 79% |
| 40 | APOcc | 93% | 94% | 95% | 98% |
| 41 | PPOcc | 69% | 78% | 81% | 86% |
| Average | 73.32% | 80.39% | 85.61% | 91.68% | |
Abbreviation: SDR, successful detection rate.
Quantitative comparison by average successful detection rate and mean radial error.
| Group | SDR (%) | MRE (mm) | SD | |||
|---|---|---|---|---|---|---|
| <2 mm | <2.5 mm | <3 mm | <4 mm | |||
| AI | 73.17% | 79.02% | 83.17% | 89.51% | 1.89 | 2.63 |
| Beginners | 47.40% | 54.83% | 60.80% | 70.21% | 3.72 | 4.52 |
| Beginners + AI | 52.73% | 61.16% | 67.77% | 77.01% | 3.14 | 4.06 |
Figure 2Comparison between the beginner-only and beginner–artificial intelligence groups in terms of the successful detection rate. A t-test was applied to compare the average successful detection rates between the beginner-only and beginner–AI groups within 2, 2.5, 3, and 4 mm thresholds. The beginner–AI collaboration improved the successful detection rates within 2, 2.5, 3, and 4 mm thresholds. Abbreviations: AI, artificial intelligence; ns, not significant.
Successful detection rate and mean radial error for each landmark within a 2 mm threshold.
| No. | Landmarks | AI | Beginners | Beginners + AI | |||
|---|---|---|---|---|---|---|---|
| SDR | MRE ± SD | SDR | MRE ± SD | SDR | MRE ± SD | ||
| 1 | Sella | 90% | 1.14 ± 0.52 | 83.5% | 1.67 ± 2.36 | 84.5% | 1.74 ± 2.42 |
| 2 | Porion | 90% | 1.19 ± 0.6 | 25.5% | 5.3 ± 4.22 | 41% | 3.34 ± 3.25 |
| 3 | Basion | 70% | 2.21 ± 1.88 | 22% | 6.3 ± 4.92 | 44% | 3.58 ± 3.3 |
| 4 | Hinge axis | 70% | 1.77 ± 1.35 | 54.5% | 2.56 ± 2.38 | 56% | 2.47 ± 2.44 |
| 5 | Pterygoid | 70% | 2.12 ± 1.56 | 41.5% | 3.71 ± 3.11 | 47.5% | 3.27 ± 2.88 |
| 6 | Nasion | 70% | 1.62 ± 0.72 | 42.5% | 5.06 ± 5.73 | 54% | 3.25 ± 4.51 |
| 7 | Orbitale | 40% | 3.01 ± 2.46 | 21.5% | 4.66 ± 3.4 | 22% | 4.26 ± 3.02 |
| 8 | A-point | 70% | 1.8 ± 0.98 | 45.5% | 3.21 ± 3.13 | 55% | 2.74 ± 3.05 |
| 9 | PM | 70% | 1.89 ± 0.92 | 48% | 3.06 ± 3.53 | 49.5% | 2.88 ± 3.41 |
| 10 | Pogonion | 90% | 1.33 ± 1.06 | 64% | 2.51 ± 3.61 | 63.5% | 2.35 ± 3.41 |
| 11 | B-point | 70% | 1.76 ± 0.86 | 43.5% | 3.27 ± 3.6 | 45.5% | 3.1 ± 3.4 |
| 12 | Posterior nasal spine | 70% | 1.5 ± 1.02 | 36.5% | 3.77 ± 4.21 | 40% | 3.45 ± 4.16 |
| 13 | Anterior nasal spine | 80% | 1.29 ± 0.76 | 57.5% | 2.79 ± 3.92 | 61.5% | 2.59 ± 3.88 |
| 14 | R1 | 60% | 2.02 ± 1.32 | 28% | 4.05 ± 3.85 | 36% | 3.49 ± 2.74 |
| 15 | R3 | 70% | 1.74 ± 1.53 | 36% | 3.82 ± 3.55 | 38% | 3.44 ± 3.35 |
| 16 | Articulare | 100% | 0.88 ± 0.45 | 58.5% | 2.51 ± 2.62 | 81% | 1.73 ± 2.41 |
| 17 | Menton | 90% | 1.26 ± 0.74 | 67% | 2.26 ± 3.15 | 67% | 2.28 ± 3.14 |
| 18 | Maxilla 1 crown | 90% | 1.1 ± 0.57 | 86% | 1.48 ± 3.14 | 88.5% | 1.41 ± 3.16 |
| 19 | Maxilla 1 root | 50% | 2.9 ± 1.66 | 32% | 3.71 ± 2.96 | 34% | 3.41 ± 3.02 |
| 20 | Mandible 1 crown | 90% | 0.99 ± 0.58 | 92.5% | 1.31 ± 2.99 | 93.5% | 1.23 ± 2.96 |
| 21 | Mandible 1 root | 70% | 1.91 ± 1.52 | 42% | 3.2 ± 3.21 | 39.5% | 3.12 ± 3.06 |
| 22 | Maxilla 6 distal | 100% | 0.93 ± 0.44 | 73% | 2.76 ± 3.46 | 80.5% | 2.11 ± 2.9 |
| 23 | Maxilla 6 root | 50% | 2.13 ± 1.27 | 42% | 3.16 ± 2.8 | 50.5% | 2.62 ± 2.46 |
| 24 | Mandible 6 distal | 100% | 0.93 ± 0.6 | 67.5% | 3.1 ± 4.24 | 75% | 2.29 ± 3.46 |
| 25 | Mandible 6 root | 20% | 3.38 ± 1.97 | 29% | 4.53 ± 4.29 | 35% | 3.49 ± 3.47 |
| 26 | Glabella | 20% | 5.72 ± 3.54 | 19.5% | 8.9 ± 7.05 | 15.5% | 7.31 ± 5.42 |
| 27 | Soft tissue nasion | 30% | 3.7 ± 2.95 | 18.5% | 6.29 ± 5.19 | 23% | 4.93 ± 4.26 |
| 28 | Pronasale | 90% | 4.67 ± 11.98 | 82.5% | 5.24 ± 11.91 | 82% | 5.29 ± 11.93 |
| 29 | Columella | 90% | 1.27 ± 1.54 | 36.5% | 3.47 ± 3.42 | 43.5% | 3.04 ± 3.28 |
| 30 | Subnasale | 100% | 0.86 ± 0.43 | 83.5% | 1.79 ± 2.86 | 79.5% | 1.84 ± 2.85 |
| 31 | Soft tissue A | 70% | 1.37 ± 0.93 | 31.5% | 4.02 ± 3.68 | 50% | 2.66 ± 3.07 |
| 32 | Upper lip | 90% | 1.68 ± 2.23 | 53% | 3.12 ± 4.02 | 58.5% | 2.89 ± 3.98 |
| 33 | Stms | 50% | 2.3 ±1.91 | 0% | 7.11 ± 3.02 | 7.5% | 6.38 ± 3.2 |
| 34 | Stmi | 90% | 0.8 ± 0.57 | 63% | 2.51 ± 3.52 | 64% | 2.39 ± 3.49 |
| 35 | Lower lip | 100% | 0.62 ± 0.35 | 65.5% | 2.45 ± 3.49 | 75% | 2.01 ± 3.36 |
| 36 | Soft tissue B | 60% | 2.39 ± 1.87 | 45% | 3.05 ± 3.54 | 45.5% | 2.93 ± 3.33 |
| 37 | Soft tissue pogonion | 70% | 1.22 ± 0.94 | 41% | 4.61 ± 4.86 | 54% | 3.14 ± 3.94 |
| 38 | Gnathion | 100% | 0.7 ± 0.52 | 66.5% | 2.16 ± 3.15 | 66.5% | 2.12 ± 3.13 |
| 39 | Gonion | 50% | 2.92 ± 1.74 | 23% | 4.23 ± 3.07 | 26.5% | 3.92 ± 2.89 |
| 40 | APOcc | 90% | 1.83 ± 2.76 | 69% | 2.8 ± 4.07 | 71.5% | 2.75 ± 4.03 |
| 41 | PPOcc | 60% | 2.53 ± 2.91 | 6% | 7.09 ± 5.56 | 17% | 5.59 ± 5.04 |
Figure 3Benefit of beginner–AI collaboration in the detection of cephalometric landmarks. Based on successful detection rate for each landmark within a 2 mm threshold, the benefits of beginner–AI collaboration were analyzed. In general, this collaboration showed a positive impact on the majority of cephalometric landmarks.
Figure 4Number of decision changes among beginners across 41 landmarks. In the second experiment, the beginners traced the anatomical landmarks on 10 images with the AI’s answer view. The recorded changes are represented as number of ratings. In general, the number of decision changes was small despite being shown at most anatomical landmarks.
Successful classification rate for eight clinical measurements.
| Measurements | SCR (%) | ||
|---|---|---|---|
| AI | Beginners | Beginners + AI | |
| ANB | 60% | 46.5% | 52% |
| SNB | 90% | 58.5% | 72.5% |
| SNA | 100% | 51% | 64% |
| ODI | 70% | 62.5% | 66% |
| APDI | 90% | 47.5% | 57.5% |
| FHI | 90% | 66% | 76% |
| FHA | 100% | 77.5% | 88% |
| MW | 70% | 81% | 81.5% |
| Average | 83.75% | 61.31% | 69.69% |
Figure 5Comparison of eight clinical measurements between the beginner-only and beginner–AI groups. From the SCRs of two groups, a figure was presented to demonstrate the AI’s support. As a result, the beginner–AI collaboration improved the SCRs of eight clinical measurements. Abbreviation: SCR, successful classification rate.