| Literature DB >> 35748080 |
Jisun Hwang1, Hee Mang Yoon2, Jae-Yeon Hwang3, Pyeong Hwa Kim4, Boram Bak5, Byeong Uk Bae6, Jinkyeong Sung6, Hwa Jung Kim7, Ah Young Jung4, Young Ah Cho4, Jin Seong Lee4.
Abstract
PURPOSE: To evaluate the applicability of Greulich-Pyle (GP) standards to bone age (BA) assessment in healthy Korean children using manual and deep learning-based methods.Entities:
Keywords: Age determination by skeleton; child; deep learning; hand bones; radiography
Mesh:
Year: 2022 PMID: 35748080 PMCID: PMC9226834 DOI: 10.3349/ymj.2022.63.7.683
Source DB: PubMed Journal: Yonsei Med J ISSN: 0513-5796 Impact factor: 3.052
Fig. 1Number of included children per age and sex (A: boys, B: girls) from two hospitals. PNUYH, Pusan National University Yangsan Hospital.
ICC Values of the Comparison between Chronological Age and Bone Age Assessment Methods
| Parameter | Estimated bone age | ||||
|---|---|---|---|---|---|
| Radiologist 1 | Radiologist 2 | Original DLBAA model | Modified DLBAA model | ||
| Chronological age | 0.978 | 0.978 | 0.982 | 0.982 | |
| Estimated bone age | |||||
| Radiologist 1 | 0.995 | 0.994 | 0.994 | ||
| Radiologist 2 | 0.993 | 0.994 | |||
| Original DLBAA model | 0.999 | ||||
DLBAA, deep learning-based bone age assessment; ICC, intraclass correlation.
All p-values were <0.001 by ICC analysis.
Bland-Altman Analysis with Slope from the Linear Regression Between Estimated Bone Ages and Chronological Age
| Measurements | Mean difference | Standard deviation | Slope | Intercept | |
|---|---|---|---|---|---|
| Chronological age vs. | |||||
| Radiologist 1 | −2.24 | 16.30 | −0.16 | 17 | |
| Radiologist 2 | −0.48 | 16.55 | −0.15 | 18 | |
| Original DLBAA model | −1.64 | 14.62 | −0.11 | 12 | |
| Modified DLBAA model | −1.40 | 14.43 | −0.11 | 12 | |
DLBAA, deep learning-based bone age assessment.
Fig. 2Bland-Altman plots and trend curve for comparison between chronological age (CA) and estimated bone age by radiologist 1 (A), radiologist 2 (B), original model of deep learning-based bone age assessment (DLBAA) system (C), and modified model of DLBAA system (D). Limits of agreement are shown as the top and bottom dashed lines and average bias (the center dashed line) with 95% confidence intervals of each value (dotted line). The regression fit of the differences on the means are shown as solid blue lines with 95% confidence intervals (gray shaded area).
Linear Regression Results for Bone Age Estimation by Radiologists and Deep Learning-Based Software Compared to Chronological Age
| Measurements | Regression coefficient | R2 value | Intercept | SD | ||
|---|---|---|---|---|---|---|
| Chronological age vs. | ||||||
| Radiologist 1 | 1.130 | 0.939 | −13.370 | 14.878 | <0.001 | |
| Radiologist 2 | 1.123 | 0.935 | −14.363 | 15.290 | <0.001 | |
| Original DLBAA model | 1.086 | 0.942 | −8.751 | 13.936 | <0.001 | |
| Modified DLBAA model | 1.082 | 0.942 | −8.452 | 13.809 | <0.001 | |
DLBAA, deep learning-based bone age assessment; SD, standard deviation of residuals of the regression.
Fig. 3Linear regression scatter plots between chronological age (CA) and estimated bone age by radiologist 1 (A), radiologist 2 (B), original model of deep learning-based bone age assessment (DLBAA) system (C), and modified model of DLBAA system (D). Lines represent the line of linear regression (blue line) and identity line (black line).
Fig. 4Screenshot result of the original model of DLBAA system in a girl with chronological age of 6 years 9 months. Among the top three GP-assigned bone ages, the estimated bone age with the highest probability was 5 years 9 months. In this patient, the estimated bone ages by radiologist 1, radiologist 2, and modified model of DLBAA system were 5 years 9 months, 5 years, and 6 years 3 months, respectively. DLBAA, deep learning-based bone age assessment.
Results of Pair-Wise Comparison of the Mean Absolute Error and Root Mean Square Error between Radiologists and Deep Learning-Based Software When Using Chronological Age as a Reference Standard
| Reader | Bonferroni-corrected | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| R1 | R2 | Original DLBAA model | Modified DLBAA model | R1 vs. R2 | R1 vs. Original DLBAA model | R1 vs. Modified DLBAA model | R2 vs. Original DLBAA model | R2 vs. Modified DLBAA model | Original vs. Modified DLBAA model | |
| MAE (month) | 13.09 | 13.12 | 11.52 | 11.31 | >0.999 | <0.001* | <0.001* | <0.001* | <0.001* | 0.81 |
| RMSE (month) | 16.44 | 16.54 | 14.69 | 14.48 | >0.999 | <0.001* | 0.018* | <0.001* | <0.001* | >0.999 |
R1, radiologist 1; R2, radiologist 2; DLBAA, deep learning-based bone age assessment; MAE, mean absolute error; RMSE, root mean square error.
Difference is statistically significant at the 0.05 level.
*Significant differences by the repeated measures of analysis of variance.
Comparison of Proportions of Bone Age Estimations >12, 18, and 24 Months Compared to Chronological Age between Radiologists and Deep Learning-Based Software
| Proportions (%) | Cochran's Q Test | Post-hoc (McNemar Test) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| R1 | R2 | Original DLBAA model | Modified DLBAA model | R1 vs. R2 | R1 vs. Original DLBAA model | R1 vs. Modified DLBAA model | R2 vs. Original DLBAA model | R2 vs. Modified DLBAA model | Original vs. Modified DLBAA model | ||
| >12 months | 44.3 | 44.5 | 39.2 | 36.1 | <0.001 | >0.999 | 0.022 | <0.001* | 0.016 | <0.001* | 0.028 |
| >18 months | 27.0 | 28.9 | 21.0 | 20.0 | <0.001 | 0.28 | 0.002* | <0.001* | <0.001* | <0.001* | 0.487 |
| >24 months | 14.2 | 15.3 | 8.0 | 8.7 | <0.001 | 0.511 | <0.001* | <0.001* | <0.001* | <0.001* | 0.678 |
R1, radiologist 1; R2, radiologist 2; DLBAA, deep learning-based bone age assessment.
*Statistically significant differences by post-hoc tests using Bonferroni correction (p<0.0083).