| Literature DB >> 35629198 |
Bo-Kyeong Kang1,2, Yelin Han3, Jaehoon Oh2,4, Jongwoo Lim2,3, Jongbin Ryu5,6, Myeong Seong Yoon2,4, Juncheol Lee2,4, Soorack Ryu7.
Abstract
PURPOSE: This study aimed to develop and validate an automatic segmentation algorithm for the boundary delineation of ten wrist bones, consisting of eight carpal and two distal forearm bones, using a convolutional neural network (CNN).Entities:
Keywords: CNN; carpal bone; deep learning; segmentation; wrist
Year: 2022 PMID: 35629198 PMCID: PMC9147335 DOI: 10.3390/jpm12050776
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1Anatomy and labeling of ten wrist bones on a wrist radiograph. (a) The anatomy of ten wrist bones, consisting of eight carpal bones and two distal forearm bones, on an anteroposterior radiograph. (b) Labeling process for the ground truth masking of wrist bones using a self-made customized tool. (1) The classification as one of ten wrist bones and the delineation of each bone’s boundary; (2) Labeling and extraction of each bone.
Figure 2The Fine Mask R-CNN architecture. Our proposed network operates on a 2-stage method. (a) Detection of the regions of interest in the input wrist radiographs using SSD (blue) in the first stage and the delineation of 10 wrist bones using Mask R-CNN with the extended mask head in the second stage (yellow). (b) The structure of the extended mask head. This is an encoder–decoder structure, which can use previous information for the prediction of a specific part. ROI, region of interest; CNN, convolutional neural networks; SSD, Single-Shot Multibox Detector.
Comparison of the performance outcomes between the Mask R-CNN and the Fine Mask R-CNN for the automatic segmentation of ten wrist bones.
| Tm | Td | C | H | P | Tr | L | S | Carpal | R | U | Forearm | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mask R-CNN Dice, mean [SD] | 0.92 (0.03) | 0.90 (0.05) | 0.93 (0.04) | 0.93 (0.02) | 0.91 (0.05) | 0.93 (0.02) | 0.93 (0.02) | 0.93 (0.02) | 0.92 (0.01) | 0.94 (0.02) | 0.93 (0.02) | 0.94 (0.01) | 0.93 (0.01) |
| Fine Mask R-CNN Dice, mean [SD] | 0.93 (0.03) | 0.91 (0.05) | 0.95 (0.04) | 0.95 (0.02) | 0.93 (0.04) | 0.95 (0.02) | 0.95 (0.02) | 0.96 (0.02) | 0.94 (0.01) | 0.96 (0.01) | 0.96 (0.02) | 0.96 (0.01) | 0.95 (0.01) |
| Comparison between two networks’ | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * |
Dice, Dice coefficient; SD, standard deviation; Tm, trapezium; Td, trapezoid; C, capitate; H, hamate; P, pisiform; Tr, triquetrum; L, lunate; S, scaphoid; R, distal radius; U, distal ulna. Paired t-tests were used to compare the performance between two networks according to normality. * p-values < 0.05 were considered statistically significant.
Result of the Turing test between the ground truth masking segmented by clinicians and the predicted masking segmented by Fine Mask R-CNN for the automatic segmentation of ten wrist bones.
| Tm | Td | C | H | P | Tr | L | S | Carpal | R | U | Forearm | Total | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Prediction | Score | Median | 4 | 5 | 4 | 5 | 5 | 5 | 5 | 5 | 37 | 5 | 5 | 10 | 47 |
| IQR | 4, 5 | 5, 5 | 4, 5 | 4, 5 | 4, 5 | 4, 5 | 5, 5 | 5, 5 | 36, 38 | 5, 5 | 5, 5 | 9, 10 | 45, 48 | ||
| ICC | Mean | 0.58 | 0.59 | 0.60 | 0.60 | 0.54 | 0.77 | 0.71 | 0.31 | 0.51 | 0.61 | 0.51 | 0.56 | 0.54 | |
| 95% CI | 0.45, 0.69 | 0.46, 0.70 | 0.47, 0.70 | 0.46, 0.70 | 0.39, 0.65 | 0.70, 0.83 | 0.62, 0.78 | 0.10, 0.48 | 0.35, 0.64 | 0.48, 0.71 | 0.36, 0.63 | 0.42, 0.67 | 0.36, 0.66 | ||
| Ground Truth | Score | Median | 5 | 5 | 5 | 5 | 5 | 5 | 5 | 5 | 39 | 5 | 5 | 10 | 48 |
| IQR | 4, 5 | 5, 5 | 4, 5 | 5, 5 | 5, 5 | 5, 5 | 5, 5 | 5, 5 | 37, 39 | 5, 5 | 5, 5 | 10, 10 | 47, 49 | ||
| ICC | Mean | 0.57 | 0.04 | 0.39 | 0.56 | 0.42 | 0.61 | 0.55 | 0.52 | 0.48 | 0.65 | 0.40 | 0.57 | 0.54 | |
| 95% CI | 0.36, 0.70 | 0.25, 0.27 | 0.21, 0.54 | 0.42, 0.67 | 0.24, 0.56 | 0.49, 0.71 | 0.41, 0.66 | 0.36, 0.64 | 0.27, 0.63 | 0.54, 0.74 | 0.22, 0.55 | 0.44, 0.68 | 0.34, 0.67 | ||
| Score between two maskings | <0.001 * | 0.25 | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | 0.39 | <0.001 * | <0.001 * | <0.001 * | <0.001 * | <0.001 * | ||
IQR, interquartile range; ICC, intraclass correlation coefficient; Tm, trapezium; Td, trapezoid; C, capitate; H, hamate; P, pisiform; Tr, triquetrum; L, lunate; S, scaphoid; R, distal radius; U, distal ulna. The Wilcoxon signed rank test was used to compare the Turing test results between the prediction and the ground truth masking. * p-values < 0.05 were considered statistically significant. Values of ICC less than 0.5, between 0.5 and 0.75, between 0.75 and 0.9, and greater than 0.90 were indicative of poor, moderate, good, and excellent reliability, respectively.
Figure 3Visualization of Fine Mask R-CNN and Mask R-CNN network for the segmentation of ten wrist bones. (a) Original image of each wrist bone on the radiograph, (b) Delineation of segmented bone by physicians manually, (c) Delineation of segmented bone by Mask R-CNN, (d) Delineation of segmented bone by Fine Mask R-CNN with an extended mask head. Black lines indicate the ground truth masking segmented by physicians and yellow lines indicate the predicted masking segmented by CNN. CNN; convolutional neural networks.