| Literature DB >> 34873241 |
Liding Yao1, Xiaojun Guan1, Xiaowei Song1, Yanbin Tan1, Chun Wang2, Chaohui Jin2, Ming Chen2, Huogen Wang3, Minming Zhang4.
Abstract
Rib fracture detection is time-consuming and demanding work for radiologists. This study aimed to introduce a novel rib fracture detection system based on deep learning which can help radiologists to diagnose rib fractures in chest computer tomography (CT) images conveniently and accurately. A total of 1707 patients were included in this study from a single center. We developed a novel rib fracture detection system on chest CT using a three-step algorithm. According to the examination time, 1507, 100 and 100 patients were allocated to the training set, the validation set and the testing set, respectively. Free Response ROC analysis was performed to evaluate the sensitivity and false positivity of the deep learning algorithm. Precision, recall, F1-score, negative predictive value (NPV) and detection and diagnosis were selected as evaluation metrics to compare the diagnostic efficiency of this system with radiologists. The radiologist-only study was used as a benchmark and the radiologist-model collaboration study was evaluated to assess the model's clinical applicability. A total of 50,170,399 blocks (fracture blocks, 91,574; normal blocks, 50,078,825) were labelled for training. The F1-score of the Rib Fracture Detection System was 0.890 and the precision, recall and NPV values were 0.869, 0.913 and 0.969, respectively. By interacting with this detection system, the F1-score of the junior and the experienced radiologists had improved from 0.796 to 0.925 and 0.889 to 0.970, respectively; the recall scores had increased from 0.693 to 0.920 and 0.853 to 0.972, respectively. On average, the diagnosis time of radiologist assisted with this detection system was reduced by 65.3 s. The constructed Rib Fracture Detection System has a comparable performance with the experienced radiologist and is readily available to automatically detect rib fracture in the clinical setting with high efficacy, which could reduce diagnosis time and radiologists' workload in the clinical practice.Entities:
Mesh:
Year: 2021 PMID: 34873241 PMCID: PMC8648839 DOI: 10.1038/s41598-021-03002-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flow chart showing overall study process.
The overview of dataset.
| Cohorts | No. Patients | No. CT slices | No. Fractures |
|---|---|---|---|
| Training | 1507 | 581,701 | 7362 |
| Validation | 100 | 36,697 | 473 |
| Testing | 100 | 37,183 | 436 |
Figure 2The pipeline for detecting rib fractures from CT scans. The fracture detection task was divided into three stages, including bone segmentation, rib location, and rib fracture classification (a). (b) bone segmentation; (c) rib location; (d) rib fracture classification.
The overview of dataset for the training of U-Net and 3D DenseNet.
| Cohorts | U-Net | 3D DenseNet | |
|---|---|---|---|
| No. CT images | No. Fracture blocks | No. Normal blocks | |
| Training | 4496 | 91,574 | 50,078,825 |
| Validation | 3145 | 5981 | 3,323,151 |
| Testing | 3568 | 5992 | 3,452,162 |
Figure 3Rib 3D view. The red rectangular box was the selected fracture lesion, and the green parts were the other suspected fracture lesions detected by our Rib Fracture Detection System.
Figure 4Free-response ROC (FROC) curve for our model.
The comparison of the performance between the model, radiologist and radiologist-model collaboration.
| Group | Model | Radiologist A | Radiologist B | Radiologist A-model collaboration | Radiologist B-model collaboration |
|---|---|---|---|---|---|
| F1-score | 0.890 | 0.796 | 0.889 | 0.925 | 0.970 |
| Recall | 0.913 | 0.693 | 0.853 | 0.920 | 0.972 |
| Precision | 0.869 | 0.935 | 0.928 | 0.930 | 0.968 |
| NPV | 0.969 | 0.989 | 0.985 | 0.985 | 0.993 |
| Time (seconds) | 20 ± 5.8 | 242.6 ± 83.0* | 153.6 ± 34.2* | 207.0 ± 47.9a | 58.6 ± 31.4a |
NPV negative predictive value.
*Indicated the p value of the comparison between model and radiologists was < 0.001.
aIndicated the p value of the comparison before and after using the model was < 0.001.
Figure 5Impact of the Rib Fracture Detection System in clinical practice for patients with suspicion of rib fracture in the Department of Radiology. In a cohort of patients with suspected rib fractures who underwent chest CT investigation, radiologists should pay close attention to all of the ribs without the help of our model in order to look for 18.2% of the fractured ribs. Since 80.9% of the ribs were diagnosed as non-fracture ribs by this model with a 96.9% true-negative rate, it demonstrated high accuracy in identifying true non-fractured ribs by the constructed model. As a result, with the assistance of this deep learning model, radiologists only had to pay more attention to 19.1% of the ribs that were categorized as high-risk for fracture, which significantly reduced their workload in detecting rib fracture.
The comparison of the performance between our Rib Fracture Detection System with Fast RCNN, Faster RCNN, YOLOv3.
| Group | Model | Fast RCNN | Faster RCNN | YOLOv3 |
|---|---|---|---|---|
| F1-score | 0.890 | 0.863 | 0.870 | 0.877 |
| Recall | 0.913 | 0.874 | 0.889 | 0.894 |
| Precision | 0.869 | 0.853 | 0.852 | 0.861 |
| NPV | 0.969 | 0.925 | 0.932 | 0.942 |