| Literature DB >> 35676524 |
Joon Im1, Ju-Yeong Kim2, Hyung-Seog Yu1, Kee-Joon Lee1, Sung-Hwan Choi1, Ji-Hoi Kim1, Hee-Kap Ahn3, Jung-Yul Cha4.
Abstract
This study evaluates the accuracy and efficiency of automatic tooth segmentation in digital dental models using deep learning. We developed a dynamic graph convolutional neural network (DGCNN)-based algorithm for automatic tooth segmentation and classification using 516 digital dental models. We segmented 30 digital dental models using three methods for comparison: (1) automatic tooth segmentation (AS) using the DGCNN-based algorithm from LaonSetup software, (2) landmark-based tooth segmentation (LS) using OrthoAnalyzer software, and (3) tooth designation and segmentation (DS) using Autolign software. We evaluated the segmentation success rate, mesiodistal (MD) width, clinical crown height (CCH), and segmentation time. For the AS, LS, and DS, the tooth segmentation success rates were 97.26%, 97.14%, and 87.86%, respectively (p < 0.001, post-hoc; AS, LS > DS), the means of MD widths were 8.51, 8.28, and 8.63 mm, respectively (p < 0.001, post hoc; DS > AS > LS), the means of CCHs were 7.58, 7.65, and 7.52 mm, respectively (p < 0.001, post-hoc; LS > DS, AS), and the means of segmentation times were 57.73, 424.17, and 150.73 s, respectively (p < 0.001, post-hoc; AS < DS < LS). Automatic tooth segmentation of a digital dental model using deep learning showed high segmentation success rate, accuracy, and efficiency; thus, it can be used for orthodontic diagnosis and appliance fabrication.Entities:
Mesh:
Year: 2022 PMID: 35676524 PMCID: PMC9178028 DOI: 10.1038/s41598-022-13595-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Reliability analysis of tooth measurement.
| Segmentation type | MD width | CCH | ||
|---|---|---|---|---|
| ICC (95% CI) | ICC (95% CI) | |||
| LS | 0.994 (0.992–0.995) | < 0.0001 | 0.992 (0.990–0.994) | < 0.0001 |
| DS | 0.987 (0.983–0.989) | < 0.0001 | 0.989 (0.985–0.991) | < 0.0001 |
| AS | 0.997 (0.997–0.998) | < 0.0001 | 0.991 (0.998–0.993) | < 0.0001 |
| REF | 0.997 (0.997–0.998) | < 0.0001 | 0.993 (0.990–0.994) | < 0.0001 |
MD width and CCH, presented as continuous variables, were verified for intra-rater reliability using ICC. The ICC of MD width and CCH were 0.994–0.997 and 0.989–0.993, respectively, showing very high reproducibility.
ICC > 0.7: excellent.
Kappa statistics of agreement of measurement results on success/failure of tooth segmentation.
| Segmentation type | First | Kappa | |||||
|---|---|---|---|---|---|---|---|
| Success | Failure | Total | |||||
| LS | Second | Success | 271 (100.0%) | 0 (0.0%) | 271 (96.8%) | 1.000 | < 0.0001 |
| Failure | 0 (0.0%) | 9 (100.0%) | 9 (3.2%) | ||||
| Total | 271 (100.0%) | 9 (100.0%) | 280 (100.0%) | ||||
| DS | Second | Success | 244 (99.6%) | 0 (0.0%) | 244 (87.1%) | 0.984 | < 0.0001 |
| Failure | 1 (0.4%) | 35 (100.0%) | 36 (12.9%) | ||||
| Total | 245 (100.0%) | 35 (100.0%) | 280 (100.0%) | ||||
| AS | Second | Success | 270 (99.3%) | 0 (0.0%) | 270 (96.4%) | 0.885 | < 0.0001 |
| Failure | 2 (0.7%) | 8 (100.0%) | 10 (3.6%) | ||||
| Total | 272 (100.0%) | 8 (100.0%) | 280 (100.0%) | ||||
Success and failure of segmentation, presented as nominal variables using Cohen's kappa, indicated very high evaluation reproducibility from 0.885 to 1.000.
Kappa values were interpreted as follows: poor, < 0.0; slight, 0.0 to 0.2; fair, 0.2 to 0.4; moderate, 0.4 to 0.6; substantial, 0.6 to 0.8, and almost perfect, 0.8 to 1.0.
Comparison of success rate among three methods.
| LS | DS | AS | Post hoc test | ||
|---|---|---|---|---|---|
| Success | 816 (97.14%) | 738 (87.86%) | 817 (97.26%) | < 0.001* | LS, AS > DS |
| Failure | 24 (2.86%) | 102 (12.14%) | 23 (2.74%) |
Success rates for automatic segmentation were 97.26 and 97.14% for the AS and LS, respectively, which were higher than the 87.86% success rate of the DS.
Data are given as n (percentage).
ap values were derived from Cochran's Q test; *p < 0.05.
Comparison of MD width, CCH, and segmentation time among three segmentation groups.
| LS | DS | AS | REF | Post hoc test | ||
|---|---|---|---|---|---|---|
| MD width (mm) | 8.28 (8.15, 8.41) | 8.63 (8.49, 8.76) | 8.51 (8.37, 8.65) | 8.52 (8.40, 8.63) | < 0.001* | DS > REF, AS > LS |
| CCH (mm) | 7.65 (7.52, 7.78) | 7.52 (7.39, 7.65) | 7.58 (7.45, 7.70) | 7.62 (7.50, 7.74) | < 0.001* | LS, REF > DS, AS |
| Time (sec) | 424.17 (404.28, 444.05) | 150.73 (140.70, 160.77) | 57.73 (54.43, 61.04) | < 0.001* | LS > DS > AS |
MD width and CCH showed statistically significant differences, depending on segmentation method. The segmentation time also showed statistically significant differences in the three groups, with the AS having the least manual intervention being the shortest.
Data are given as the mean (95% confidence interval).
ap values were derived from Friedman test; Shapiro–Wilk’s test was employed to test the normality assumption; *p < 0.05.
Comparison of MD width and CCH among the three methods by tooth group.
| Variable | MD width (mm) | CCH (mm) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| LS | DS | AS | Post-hoc test | LS | DS | AS | Post-hoc test | ||||
| Upper | Incisal | −0.26 (−0.29, −0.23) | 0.07 (0.03, 0.11) | 0.00 (−0.05, 0.05) | < 0.001 | DS > AS > LS | −0.01 (−0.02, 0.00) | −0.21 (−0.26, −0.16) | −0.11 (−0.15, −0.07) | < 0.001 | LS > AS, DS |
| Canine | −0.21 (−0.26, −0.17) | −0.06 (−0.10, −0.01) | −0.15 (−0.21, −0.09) | < 0.001 | DS, AS > LS | −0.02 (−0.04, −0.01) | −0.19 (−0.25, −0.13) | −0.10 (−0.12, −0.07) | < 0.001 | LS > AS, DS | |
| Premolar | −0.31 (−0.35, −0.27) | −0.06 (−0.09, −0.02) | −0.35 (−0.39, −0.31) | < 0.001 | DS > LS, AS | −0.03 (−0.06, −0.01) | −0.14 (−0.19, −0.08) | −0.09 (−0.12, −0.06) | < 0.001 | LS > AS, DS | |
| Molar | −0.08 (−0.14, −0.02) | 0.68 (0.60, 0.76) | 0.61 (0.51, 0.71) | < 0.001 | DS, AS > LS | −0.02 (−0.06, 0.02) | −0.11 (−0.18, −0.04) | −0.08 (−0.12, −0.05) | < 0.001 | LS > AS, DS | |
| Lower | Incisal | −0.26 (−0.30, −0.23) | −0.01 (−0.06, 0.04) | −0.07 (−0.11, −0.04) | < 0.001 | DS > AS > LS | 0.00 (−0.01, 0.01) | −0.18 (−0.23, −0.13) | −0.11 (−0.13, −0.08) | < 0.001 | LS > AS, DS |
| Canine | −0.26 (−0.32, −0.20) | 0.06 (−0.02, 0.14) | −0.14 (−0.18, −0.10) | < 0.001 | DS > AS > LS | 0.00 (−0.02, 0.02) | −0.13 (−0.20, −0.07) | −0.09 (−0.13, −0.06) | < 0.001 | LS > AS, DS | |
| Premolar | −0.31 (−0.35, −0.28) | −0.06 (−0.10, −0.03) | −0.15 (−0.19, −0.12) | < 0.001 | DS > AS > LS | −0.03 (−0.07, 0.01) | −0.15 (−0.18, −0.11) | −0.10 (−0.12, −0.08) | < 0.001 | LS > AS, DS | |
| Molar | −0.13 (−0.17, −0.09) | 0.24 (0.19, 0.29) | 0.20 (0.16, 0.24) | < 0.001 | DS, AS > LS | 0.00 (−0.01, 0.01) | −0.09 (−0.14, −0.04) | −0.06 (−0.08, −0.04) | < 0.001 | LS > AS, DS | |
The means of the MD width error ranged from −0.31 to −0.08 mm, −0.09 to 0.68 mm, and −0.35 to 0.61) mm in the LS, DS, and AS groups, respectively. There were statistically significant differences in all tooth groups (p < 0.001, post hoc: DS > AS > LS in upper incisal, lower incisal, lower canine, and lower premolar; DS, AS > LS in upper canine, upper and lower molar, DS > LS, AS in upper premolar). The means of the CCH error ranged from −0.03 to 0.00 mm, −0.21 to −0.09 mm, and −0.11 to −0.06 mm in the LS, DS, and AS groups, respectively. There were statistically significant differences in all tooth groups (p < 0.001, post hoc test: LS > DS, AS).
Data are given as the mean (95% confidence interval).
ap values were derived from Friedman test; Shapiro–Wilk’s test was employed to test the normality assumption; *p < 0.05.
Statistical analysis of main effects and first-order interactions affecting measurement error using GLMM.
| Variable | DF | F | Post hoc testb | |
|---|---|---|---|---|
| Interceptc | 1:4723 | 319.03 | < 0.001 | |
| Tooth group | 3:4723 | 296.67 | < 0.001 | Molar (0.102) > incisal (−0.098), Canine (−0.110) > premolar (−0.149) |
| Software | 2:4723 | 70.00 | < 0.001 | DS (−0.019) > AS (−0.052) > LS (−0.121) |
| MD width/CCH | 1:4723 | 32.92 | < 0.001 | MD width (−0.043) > CCH (−0.084) |
| Tooth group * Software | 6:4723 | 24.70 | < 0.001 | |
| Tooth group * MD width/CCH | 3:4723 | 201.37 | < 0.001 | |
| Software * MD width/CCH | 2:4723 | 428.81 | < 0.001 |
The tooth size errors were statistically different (p < 0.001) depending on the software used, and the post hoc test showed that DS (−0.019) > AS (−0.052) > LS (−0.121).
DF degrees of freedom.
F F value.
ap values were derived from a generalised linear mixed model.
bCategory (estimated mean) was presented for Bonferroni’s corrected post hoc test.
cIntercept represents the mean value of the response variable when all predictor variables in the model are zero.
Figure 1Schematic view of the deep learning process. After the original dental model was converted to a point cloud model, the dentition and the gingiva were segmented using the two-class DGCNN model. To increase the accuracy of semantic segmentation, we segmented individual teeth and gingiva using the seventeen-class DGCNN model with the vertices of the point cloud of the gingiva reduced to less than twice that of the tooth group. The gingival vertices were then restored. Since some of the segmentation results using DGCNN showed an unclear teeth margin, curvature-based mesh segmentation was used as post-processing to segment the teeth margin. The images of the digital dental model used in this figure were obtained using MeshLab (ver. 1.3.4 BETA, ISTI-CNR, Italy) and Unity Editor (ver.2020.3.23f1, Unity Technologies, USA) software.
Figure 2Schematic diagram indicating the study flow. Thirty digital dental models were segmented using three types of software (i.e., OrthoAnalyzer, Autolign, and LaonSetup). The manually corrected segmented tooth was used as the reference group. The size of the segmented teeth such as MD width and CCH, success and failure of tooth segmentation, and tooth segmentation time were evaluated.
Figure 3Three different methods of tooth segmentation. The automatic tooth segmentation process has three steps in common: (1) orientation, (2) setting of the mesiodistal (MD) points, and (3) segmentation, but the details differ depending on the software’s algorithm. (a) Landmark-based segmentation (LS) using OrthoAnalyzer software: precise MD point setting is required prior to tooth segmentation, and the tooth number and axis are determined. (b) Tooth designation and segmentation (DS) using Autolign software: approximate MD point setting is required, and the tooth number is designated. (c) Automatic tooth segmentation (AS) using LaonSetup software: fully automatic segmentation based on deep learning does not require MD point setting, and manual intervention is not required.
Figure 4Measurement of the mesiodistal (MD) width and the clinical crown height (CCH) of segmented teeth. (a) Reorient the segmented teeth of the LS, DS, AS, and REF groups using Meshmixer software. (b) Measurement of the MD width of segmented teeth in REF group. (c) Measurement of the CCH of segmented teeth as the distance between the virtual occlusal plane and the lowest point of the gingival margin of the clinical crown.
Figure 5Examples of segmentation failure; an arrow marks the part with segmentation failure. (a) Left: segmented tooth beyond the gingival margin; right: reference tooth. (b) Left: segmented tooth with partial loss of occlusal surface; right: reference tooth.