| Literature DB >> 36192521 |
Takaki Inoue1, Satoshi Maki2,3, Takeo Furuya1, Yukio Mikami1, Masaya Mizutani1, Ikko Takada1, Sho Okimatsu1, Atsushi Yunde1, Masataka Miura1, Yuki Shiratani1, Yuki Nagashima1, Juntaro Maruyama1, Yasuhiro Shiga1, Kazuhide Inage1, Sumihisa Orita1,4, Yawara Eguchi1, Seiji Ohtori1.
Abstract
The emergency department is an environment with a potential risk for diagnostic errors during trauma care, particularly for fractures. Convolutional neural network (CNN) deep learning methods are now widely used in medicine because they improve diagnostic accuracy, decrease misinterpretation, and improve efficiency. In this study, we investigated whether automatic localization and classification using CNN could be applied to pelvic, rib, and spine fractures. We also examined whether this fracture detection algorithm could help physicians in fracture diagnosis. A total of 7664 whole-body CT axial slices (chest, abdomen, pelvis) from 200 patients were used. Sensitivity, precision, and F1-score were calculated to evaluate the performance of the CNN model. For the grouped mean values for pelvic, spine, or rib fractures, the sensitivity was 0.786, precision was 0.648, and F1-score was 0.711. Moreover, with CNN model assistance, surgeons showed improved sensitivity for detecting fractures and the time of reading and interpreting CT scans was reduced, especially for less experienced orthopedic surgeons. Application of the CNN model may lead to reductions in missed fractures from whole-body CT images and to faster workflows and improved patient care through efficient diagnosis in polytrauma patients.Entities:
Mesh:
Year: 2022 PMID: 36192521 PMCID: PMC9529907 DOI: 10.1038/s41598-022-20996-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flow chart showing the overall study process. CT, computed tomography.
Figure 2A screenshot of the image annotation process. We removed the black background and trimmed to include the area of the patient's trunk. A rectangular bounding box covering the minimum area of the fracture site was then drawn on every fracture of the CT slice and was labeled as spine fractures, pelvic fractures, or rib fractures using labelImg (version: 1.8.1, available at https://github.com/tzutalin/labelImg). CT, computed tomography.
Figure 3Sample CT images correctly annotated by neural networks on the testing dataset. (A and D) Pelvic fracture. (B and E) Rib fracture. (C and F) Spine fracture. The CNN model also generated a confidence score for each of the detected points as continuous values of a range from 0 to 100%). CT, computed tomography; CNN, convolutional neural network.
Demographic data of patients included in the training and validation dataset.
| Training and validation dataset | |
|---|---|
| No. of patients | 181 |
| Age, mean ± SD | 54.3 ± 20.8 |
| Sex (male: female) | 117:64 |
| No. of CT images | 5217 |
| No. of annotations | 6357 |
| Pelvic fracture | 2014 |
| Rib fracture | 1897 |
| Spine fracture | 2446 |
Data are shown as n and mean ± standard deviation.
CT computed tomography, No. number, SD standard deviation.
Demographic data of patients included in the testing dataset.
| Testing dataset | |
|---|---|
| No. of patients | 19 |
| Age, mean ± SD | 51.2 ± 19.2 |
| Sex (male:female) | 13:6 |
| No. of CT images | 2447 |
| Pelvic fracture | 143 |
| Rib fracture | 87 |
| Spine fracture | 136 |
| Pelvic fracture | 5.8 |
| Rib fracture | 3.6 |
| Spine fracture | 5.5 |
Data are shown as n and mean ± standard deviation.
CT computed tomography, No. number, SD standard deviation.
Performance of the CNN model.
| Pelvic fracture | Rib fracture | Spine fracture | Mean | |
|---|---|---|---|---|
| Sensitivity (%) | 0.839 | 0.713 | 0.780 | 0.786 |
| Precision (%) | 0.645 | 0.602 | 0.683 | 0.648 |
| F1-score (%) | 0.729 | 0.652 | 0.729 | 0.711 |
CNN convolutional neural network.
Figure 4Representative images of true positive, false positive, and false negative. (A–C) CT axial slices detected and diagnosed correctly by the CNN model for three fractures (true positive). (D–F) CT axial slices of a misdiagnosed fracture (false positive). The vascular groove was mistaken for a pelvic fracture (D); The boundary between costal cartilage and rib was mistaken for a rib fracture (E); The vascular groove was mistaken for a spine fracture (F). (G–I) CT axial slices of a missed fracture (false negative were indicated by arrowheads). CT, computed tomography; CNN, convolutional neural network.
Comparison of sensitivity for orthopedic surgeons with and without CNN model assistance.
| Sensitivity (%) | Without CNN model assistance | With CNN model assistance | |
|---|---|---|---|
| Orthopedic surgeon 1 | 0.822 | 0.892 | 0.0158* |
| Orthopedic surgeon 2 | 0.808 | 0.893 | 0.0060* |
| Orthopedic surgeon 3 | 0.853 | 0.881 | 0.0184* |
| Orthopedic surgeon 1 | 0.580 | 0.747 | 0.0006* |
| Orthopedic surgeon 2 | 0.596 | 0.729 | 0.0093* |
| Orthopedic surgeon 3 | 0.651 | 0.719 | 0.1573 |
| Orthopedic surgeon 1 | 0.649 | 0.814 | 0.0023* |
| Orthopedic surgeon 2 | 0.676 | 0.795 | < 0.0001* |
| Orthopedic surgeon 3 | 0.762 | 0.838 | 0.0143* |
| Orthopedic surgeon 1 | 0.697 | 0.829 | < 0.0001* |
| Orthopedic surgeon 2 | 0.676 | 0.818 | < 0.0001* |
| Orthopedic surgeon 3 | 0.764 | 0.826 | 0.0003* |
Orthopedic surgeon 1 and 2 had 3 years of experience. Orthopedic surgeon 3 had 8 years of experience. CNN, convolutional neural network.
*p < 0.05.
Comparison of precision for orthopedic surgeons with and without CNN model assistance.
| Precision (%) | Without CNN model assistance | With CNN model assistance |
|---|---|---|
| Orthopedic surgeon 1 | 0.957 | 0.976 |
| Orthopedic surgeon 2 | 0.974 | 0.976 |
| Orthopedic surgeon 3 | 0.961 | 0.947 |
| Orthopedic surgeon 1 | 0.962 | 0.942 |
| Orthopedic surgeon 2 | 0.949 | 0.885 |
| Orthopedic surgeon 3 | 0.935 | 0.984 |
| Orthopedic surgeon 1 | 0.977 | 0.940 |
| Orthopedic surgeon 2 | 0.952 | 0.956 |
| Orthopedic surgeon 3 | 0.961 | 0.950 |
| Orthopedic surgeon 1 | 0.965 | 0.955 |
| Orthopedic surgeon 2 | 0.961 | 0.948 |
| Orthopedic surgeon 3 | 0.955 | 0.956 |
Orthopedic surgeon 1 and 2 had 3 years of experience. Orthopedic surgeon 3 had 8 years of experience. CNN, convolutional neural network.
Diagnosis time of orthopedic surgeons with and without CNN model assistance.
| Without CNN model assistance (s) | With CNN model assistance (s) | ||
|---|---|---|---|
| Orthopedic surgeon 1 | 278.4 | 162.3 | < 0.0001* |
| Orthopedic surgeon 2 | 205.2 | 144.5 | < 0.0001* |
| Orthopedic surgeon 3 | 233.7 | 155.5 | < 0.0001* |
Orthopedic surgeon 1 and 2 had 3 years of experience. Orthopedic surgeon 3 had 8 years of experience. CNN, convolutional neural network.
*p < 0.05.