| Literature DB >> 35281250 |
Xi Wang1, Bo Zhou1, Ping Gong2, Ting Zhang3, Yan Mo2, Jie Tang2, Xinmiao Shi1, Jianhong Wang1, Xinyu Yuan4, Fengsen Bai4, Lei Wang1, Qi Xu1, Yu Tian1, Qing Ha2, Chencui Huang2, Yizhou Yu2, Lin Wang1.
Abstract
Background: The accuracy and consistency of bone age assessments (BAA) using standard methods can vary with physicians' level of experience.Entities:
Keywords: China 05 RUS-CHN; accuracy; artificial intelligence; bone age; consistency; different levels of experience
Year: 2022 PMID: 35281250 PMCID: PMC8908427 DOI: 10.3389/fped.2022.818061
Source DB: PubMed Journal: Front Pediatr ISSN: 2296-2360 Impact factor: 3.418
Figure 1Study design flowchart. The upper part illustrates inclusion and exclusion criteria. After stratified sampling by age, an age-balanced cohort of 316 samples were extracted, which were further randomly divided into group A and group B, each with 158 samples. Both groups were independently evaluated by 4 reference standard experts and 9 physicians of different levels of experience. For group B, the 9 physicians were given AI reports before performing BAA by themselves.
Characteristics of children in groups A and B.
|
|
|
|
|
|---|---|---|---|
| Cases | 158 | 158 | – |
| Calendar age (Age ± SD) | 9.805 ± 3.568 | 9.903 ± 3.521 | 0.807 |
| Gender (%) | 0.910 | ||
| female | 70 (44.3%) | 71 (44.9%) | |
| male | 88 (55.7%) | 87 (55.1%) |
Data are expressed as mean (standard deviation) or count (percent). P-value was calculated by
Two independent sample t-test or the
Chi-squared test.
Figure 2Bland-Altman plot of differences between the artificial intelligence model and reference standard bone age assessments. AI, artificial intelligence; RS, reference standard; RUS-CHN, Chinese Standard of Skeletal Maturity of the Hand and Wrist.
Assessment performance for physicians with different levels of experience with no additional information (group A) and with artificial intelligence model assistance (group B).
|
|
|
| |
|---|---|---|---|
| Senior physicians | group A | 0.403 ± 0.368 | 0.987 (0.983, 0.99) |
| group B | 0.325 ± 0.326 | 0.996 (0.995, 0.997) | |
| <0.001 | <0.001 | ||
| Mid-level physicians | group A | 0.469 ± 0.415 | 0.989 (0.9786, 0.992) |
| group B | 0.344 ± 0.356 | 0.996(0.995, 0.997) | |
| <0.001 | <0.001 | ||
| Junior physicians | group A | 0.755 ± 0.679 | 0.941 (0.91, 0.96) |
| group B | 0.370 ± 0.365 | 0.992 (0.989, 0.994) | |
| <0.001 | <0.001 | ||
Data are expressed as mean (standard deviation). P-value was calculated by the rank-sum test or the Chi-squared test where appropriate. ICC, intraclass correlation coefficient. 95% CI, 95% confidence interval.
Figure 3Box plot of bone age assessment errors without AI assistance (group A) and with AI model assistance (group B) for physicians of different experience. AI, artificial intelligence; BAA, bone age assessment.
MAE for assessments of 13 bones (radius, ulna, and short bones).
|
|
|
| ||
|---|---|---|---|---|
|
|
| |||
| Senior physicians | radius | 0.403 ± 0.46 | 0.347 ± 0.367 | 0.224 |
| ulna | 0.371 ± 0.571 | 0.289 ± 0.386 | 0.331 | |
| metacarpal I | 0.266 ± 0.356 | 0.265 ± 0.334 | 0.839 | |
| metacarpal III | 0.313 ± 0.375 | 0.327 ± 0.348 | 0.174 | |
| metacarpal V | 0.361 ± 0.423 | 0.281 ± 0.298 | 0.198 | |
| proximal phalanx I | 0.31 ± 0.332 | 0.258 ± 0.302 | 0.021 | |
| proximal phalanx III | 0.305 ± 0.357 | 0.271 ± 0.321 | 0.179 | |
| proximal phalanx V | 0.302 ± 0.369 | 0.233 ± 0.315 | 0.004 | |
| middle phalanx III | 0.291 ± 0.358 | 0.262 ± 0.333 | 0.177 | |
| middle phalanx V | 0.792 ± 1.37 | 1.009 ± 1.843 | 0.398 | |
| distal phalanx I | 0.353 ± 0.556 | 0.344 ± 0.594 | 0.954 | |
| distal phalanx III | 0.319 ± 0.415 | 0.291 ± 0.283 | 0.383 | |
| distal phalanx V | 0.301 ± 0.382 | 0.277 ± 0.306 | 0.927 | |
| Mid-level physicians | radius | 0.468 ± 0.464 | 0.368 ± 0.375 | 0.004 |
| ulna | 0.373 ± 0.498 | 0.288 ± 0.389 | 0.045 | |
| metacarpal I | 0.354 ± 0.452 | 0.292 ± 0.345 | 0.329 | |
| metacarpal III | 0.411 ± 0.465 | 0.334 ± 0.373 | 0.112 | |
| metacarpal V | 0.335 ± 0.396 | 0.297 ± 0.317 | 0.855 | |
| proximal phalanx I | 0.36 ± 0.388 | 0.281 ± 0.304 | 0.006 | |
| proximal phalanx III | 0.38 ± 0.436 | 0.291 ± 0.343 | 0.004 | |
| proximal phalanx V | 0.32 ± 0.382 | 0.251 ± 0.318 | 0.008 | |
| middle phalanx III | 0.32 ± 0.394 | 0.291 ± 0.365 | 0.231 | |
| middle phalanx V | 1.03 ± 1.888 | 0.951 ± 1.858 | 0.429 | |
| distal phalanx I | 0.424 ± 0.594 | 0.365 ± 0.799 | 0.020 | |
| distal phalanx III | 0.347 ± 0.459 | 0.31 ± 0.33 | 0.347 | |
| distal phalanx V | 0.377 ± 0.441 | 0.296 ± 0.327 | 0.073 | |
| Junior physicians | radius | 0.765 ± 0.781 | 0.396 ± 0.409 | <0.001 |
| ulna | 0.786 ± 1.094 | 0.313 ± 0.449 | <0.001 | |
| metacarpal I | 0.499 ± 0.536 | 0.308 ± 0.385 | <0.001 | |
| metacarpal III | 0.483 ± 0.505 | 0.343 ± 0.361 | <0.001 | |
| metacarpal V | 0.475 ± 0.511 | 0.342 ± 0.358 | 0.002 | |
| proximal phalanx I | 0.485 ± 0.504 | 0.312 ± 0.341 | <0.001 | |
| proximal phalanx III | 0.514 ± 0.498 | 0.293 ± 0.362 | <0.001 | |
| proximal phalanx V | 0.516 ± 0.505 | 0.289 ± 0.362 | <0.001 | |
| middle phalanx III | 0.476 ± 0.535 | 0.31 ± 0.385 | <0.001 | |
| middle phalanx V | 1.29 ± 1.895 | 1.071 ± 1.85 | <0.001 | |
| distal phalanx I | 0.61 ± 0.704 | 0.409 ± 0.679 | <0.001 | |
| distal phalanx III | 0.516 ± 0.542 | 0.368 ± 0.378 | 0.001 | |
| distal phalanx V | 0.598 ± 0.572 | 0.388 ± 0.412 | <0.001 | |
Data are expressed as mean (standard deviation). P-value was calculated by the rank-sum test.
MAE, Mean absolute error.