| Literature DB >> 34011107 |
Masafumi Kaiume1,2, Shigeru Suzuki1, Koichiro Yasaka3, Haruto Sugawara3, Yun Shen1, Yoshiaki Katada1, Takuya Ishikawa1, Rika Fukui1, Osamu Abe2.
Abstract
ABSTRACT: To evaluate the rib fracture detection performance in computed tomography (CT) images using a software based on a deep convolutional neural network (DCNN) and compare it with the rib fracture diagnostic performance of doctors.We included CT images from 39 patients with thoracic injuries who underwent CT scans. In these images, 256 rib fractures were detected by two radiologists. This result was defined as the gold standard. The performances of rib fracture detection by the software and two interns were compared via the McNemar test and the jackknife alternative free-response receiver operating characteristic (JAFROC) analysis.The sensitivity of the DCNN software was significantly higher than those of both Intern A (0.645 vs 0.313; P < .001) and Intern B (0.645 vs 0.258; P < .001). Based on the JAFROC analysis, the differences in the figure-of-merits between the results obtained via the DCNN software and those by Interns A and B were 0.057 (95% confidence interval: -0.081, 0.195) and 0.071 (-0.082, 0.224), respectively. As the non-inferiority margin was set to -0.10, the DCNN software is non-inferior to the rib fracture detection performed by both interns.In the detection of rib fractures, detection by the DCNN software could be an alternative to the interpretation performed by doctors who do not have intensive training experience in image interpretation.Entities:
Mesh:
Year: 2021 PMID: 34011107 PMCID: PMC8137061 DOI: 10.1097/MD.0000000000026024
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Figure 1Flowchart for the patient selection procedure used in this study. CT = computed tomography.
Characteristics of included patients.
| Number of Patients | |
| All patients | 39 |
| Male patients | 28 |
| Female patients | 11 |
| Age (yr) | |
| All patients | 58 ± 21 (20–91)∗ |
| Male patients | 59 ± 20 (20–84)∗ |
| Female patients | 53 ± 23 (22–91)∗ |
| With or without contrast agent | |
| Without contrast agent | 6 |
| With intravenous contrast agent | 33 |
| Iohexol 600 mg Iodine/kg | 19 |
| Iopamidol 600 mg Iodine/kg | 14 |
Figure 2A screenshot of the user interface of the DCNN software displaying the list of suspected lesions on the right side of the screen. The square region of interest indicates where the fracture is presumed to be. DCNN = deep convolutional neural network.
Sensitivities and positive predictive values for rib fracture detection in chest computed tomography images by 2 radiologists.
| Sensitivity (95% CI) | Positive Predictive Value (95% CI) | |
| Radiologist A | 224/256 | 224/233 |
| 0.875 (0.834–0.916) | 0.961 (0.935–0.988) | |
| Radiologist B | 215/256 | 215/227 |
| 0.840 (0.795–0.885) | 0.947 (0.916–0.978) |
Sensitivities, positive predictive values, and highest F1 scores for rib fracture detection in chest computed tomography images via the DCNN software and by two interns.
| Sensitivity (95% CI) | Positive Predictive Value (95% CI) | F1 Score | |
| The DCNN software (confidence score from 26% to 100%) | 165/256 | 165/208 | 0.711 |
| 0.645 (0.586–0.703) | 0.793 (0.738–0.848) | ||
| Intern A (confidence score from 20% to 100%) | 80/256 | 80/94 | 0.457 |
| 0.313 (0.256–0.369) | 0.851 (0.778–0.924) | ||
| Intern B (confidence score from 53% to 100%) | 66/256 | 66/89 | 0.383 |
| 0.258 (0.204–0.311) | 0.742 (0.651–0.832) |
Figure 3Sensitivities of the DCNN software and those of two interns (Intern A and Intern B), and P value between the software and each of the two interns. Forest plot showing the sensitivities of the DCNN software and those of two interns (Intern A and Intern B) with 95% confidence intervals for rib fracture detection. The P values between the sensitivities of the DCNN software and each of the two interns are also shown. The sensitivity of the software was significantly better than those of both Intern A (P < .001) and Intern B (P < .001). DCNN = deep convolutional neural network.
Figure-of-merits for the DCNN software and the two interns.
| The DCNN software | Intern A | Intern B | |
| Figure-Of-Merit (95%CI) | 0.571 (0.454–0.689) | 0.514 (0.415–0.614) | 0.500 (0.393–0.607) |
Figure 4Estimated differences in Figure-Of-Merits between the software and each intern (Intern A and Intern B). Forest plot showing estimated differences in jackknife alternative free-response receiver operating characteristic Figure-Of-Merits between the observer performance of the software and each intern (Intern A and Intern B) for rib fracture detection. Since the non-inferiority margin was set to −0.10, the DCNN software is non-inferior to the rib fracture detection performed by both interns. The P-values between the performance of the software and each intern are also shown. There was no significant difference between the two groups. DCNN = deep convolutional neural network.