| Literature DB >> 33804831 |
Julie Wang1, Alexander Wood2, Chao Gao2, Kayvan Najarian1,2,3,4,5, Jonathan Gryak2,5.
Abstract
The spleen is one of the most frequently injured organs in blunt abdominal trauma. Computed tomography (CT) is the imaging modality of choice to assess patients with blunt spleen trauma, which may include lacerations, subcapsular or parenchymal hematomas, active hemorrhage, and vascular injuries. While computer-assisted diagnosis systems exist for other conditions assessed using CT scans, the current method to detect spleen injuries involves the manual review of scans by radiologists, which is a time-consuming and repetitive process. In this study, we propose an automated spleen injury detection method using machine learning. CT scans from patients experiencing traumatic injuries were collected from Michigan Medicine and the Crash Injury Research Engineering Network (CIREN) dataset. Ninety-nine scans of healthy and lacerated spleens were split into disjoint training and test sets, with random forest (RF), naive Bayes, SVM, k-nearest neighbors (k-NN) ensemble, and subspace discriminant ensemble models trained via 5-fold cross validation. Of these models, random forest performed the best, achieving an Area Under the receiver operating characteristic Curve (AUC) of 0.91 and an F1 score of 0.80 on the test set. These results suggest that an automated, quantitative assessment of traumatic spleen injury has the potential to enable faster triage and improve patient outcomes.Entities:
Keywords: computer-assisted diagnosis; image segmentation; machine learning; spleen injury detection
Year: 2021 PMID: 33804831 PMCID: PMC8063804 DOI: 10.3390/e23040382
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1A schematic diagram of the proposed method.
Figure 2Segmentation of healthy and lacerated spleens.
Mean and standard deviation (SD) of performance metrics for spleen injury classification from 5-fold cross validation on the training set. The highest value for each performance metric is bolded while the lowest SD is italicized.
| Metric | RF | Naive Bayes | SVM | Subspace Discriminant | |
|---|---|---|---|---|---|
| Accuracy | 0.71 (0.11) | 0.73 ( | 0.73 ( | 0.67 ( | |
| Sensitivity | 0.66 (0.17) | 0.61 (0.17) | 0.56 (0.18) | 0.44 (0.19) | |
| Specificity | 0.75 (0.15) | 0.84 (0.13) | 0.87 (0.13) | ||
| F1 | 0.68 (0.13) | 0.67 (0.14) | 0.65 (0.16) | 0.54 (0.18) | |
| AUC | 0.75 (0.12) | 0.81 ( 0.10) | 0.84 (0.10) | 0.77 (0.13) |
Performance metrics for spleen injury classification on the test set. The highest value for each performance metric is bolded.
| Metric | RF | Naive Bayes | SVM | Subspace Discriminant | |
|---|---|---|---|---|---|
| Accuracy |
| 0.70 | 0.71 | 0.75 | 0.64 |
| Sensitivity |
| 0.63 | 0.56 | 0.59 | 0.40 |
| Specificity |
| 0.76 | 0.85 | 0.88 | 0.85 |
| F1 |
| 0.66 | 0.64 | 0.68 | 0.50 |
| AUC |
| 0.74 | 0.80 | 0.84 | 0.76 |
Performance metrics for the RF classifier trained using hand-crafted features and for the deep learning method. The highest value for each performance metric is bolded.
| Metric | RF (Hand-Crafted) | ResNet + LSTM (Deep Learning) |
|---|---|---|
| Accuracy |
| 0.79 |
| Sensitivity |
| 0.67 |
| Specificity | 0.89 |
|
| F1 |
| 0.75 |
| AUC |
| 0.72 |
Performance metrics for the RF classifier trained on Michigan Medicine samples and tested on CIREN samples.
| Metric | RF |
|---|---|
| Accuracy | 0.75 |
| Sensitivity | 0.59 |
| Specificity | 0.94 |
| F1 | 0.71 |
| AUC | 0.91 |
RF classification accuracy by injury grades. The mean accuracy and standard deviation (SD) across 5-fold cross validation on the training set, as well as the mean accuracy on the test set are reported.
| Injury Grade | Training Accuracy | Testing Accuracy |
|---|---|---|
| Healthy | 0.89 (0.12) | 0.89 |
| AIS = 2 | 0.72 (0.30) | 0.70 |
| AIS = 3 | 0.74 (0.27) | 0.78 |
| AIS = 4, 5 | 0.88 (0.22) | 0.79 |
Figure 3Classification results. (a,b) lacerated (AIS = 2) samples correctly classified as lacerated; (c) lacerated (AIS = 2) sample incorrectly classified as healthy; (d,e) healthy samples incorrectly classified as lacerated.