| Literature DB >> 35589847 |
Akifumi Niiya1, Kouzou Murakami2, Rei Kobayashi2, Atsuhito Sekimoto2, Miho Saeki2, Kosuke Toyofuku2, Masako Kato2, Hidenori Shinjo2, Yoshinori Ito2, Mizuki Takei3, Chiori Murata3, Yoshimitsu Ohgiya2.
Abstract
Artificial intelligence algorithms utilizing deep learning are helpful tools for diagnostic imaging. A deep learning-based automatic detection algorithm was developed for rib fractures on computed tomography (CT) images of high-energy trauma patients. In this study, the clinical effectiveness of this algorithm was evaluated. A total of 56 cases were retrospectively examined, including 46 rib fractures and 10 control cases from our hospital, between January and June 2019. Two radiologists annotated the fracture lesions (complete or incomplete) for each CT image, which is considered the "ground truth." Thereafter, the algorithm's diagnostic results for all cases were compared with the ground truth, and the sensitivity and number of false positive (FP) results per case were assessed. The radiologists identified 199 images with a fracture. The sensitivity of the algorithm was 89.8%, and the number of FPs per case was 2.5. After additional learning, the sensitivity increased to 93.5%, and the number of FPs was 1.9 per case. FP results were found in the trabecular bone with the appearance of fracture, vascular grooves, and artifacts. The sensitivity of the algorithm used in this study was sufficient to aid the rapid detection of rib fractures within the evaluated validation set of CT images.Entities:
Mesh:
Year: 2022 PMID: 35589847 PMCID: PMC9119970 DOI: 10.1038/s41598-022-12453-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1CNN architecture design. From left to right, the legend on the lower right shows the type of each layer (convolution or max pooling), kernel size, and the number of channels.
Results of preliminary experiments.
| Cases | Ground truths | Detections | Sensitivity | False positives per case |
|---|---|---|---|---|
| 56 | 199 | 178 | 89.4% | 2.5 |
46 rib fractures 10 control cases | Complete fractures: 151 Incomplete fractures: 48 | Complete fractures: 138 Incomplete fractures: 40 | Complete fractures: 91.4% Incomplete fractures: 83.3% |
Figure 2False negative results. These results emerged mainly in the upper ribs, in the proximity of vertebral bodies, and for minor incomplete fractures; additional learning reduced false negatives.
Results after additional learning.
| Cases | Ground truths | Detections | Sensitivity | False positives per case |
|---|---|---|---|---|
| 56 | 199 | 186 | 93.5% | 1.9 |
46 rib fractures 10 control cases | Complete fractures: 151 Incomplete fractures: 48 | Complete fractures: 143 Incomplete fractures: 43 | Complete fractures: 94.7% Incomplete fractures: 89.6% | |
Figure 3Fractures identified by the algorithm. The algorithm helped identify one case of incomplete fracture, in addition to some complete fractures.
Figure 4False positive results. These features resembled bone fractures and included strains, vessel grooves, and artifacts.
Figure 5False negative results. These fractures were more frequently unrecognized in the upper ribs and in the proximity of vertebral bodies. It is important to reduce false negative results for clinical application.
Figure 6False positive results. These features were classified as fractures in 6 of 10 normal cases.