| Literature DB >> 34415503 |
Bokai Zhang1, Amer Ghanem2, Alexander Simes2, Henry Choi2, Andrew Yoo2.
Abstract
PURPOSE: Surgical workflow recognition is a crucial and challenging problem when building a computer-assisted surgery system. Current techniques focus on utilizing a convolutional neural network and a recurrent neural network (CNN-RNN) to solve the surgical workflow recognition problem. In this paper, we attempt to use a deep 3DCNN to solve this problem.Entities:
Keywords: 3D ConvNet; Computer-assisted surgery; Focal loss; Surgical workflow recognition
Mesh:
Year: 2021 PMID: 34415503 PMCID: PMC8589754 DOI: 10.1007/s11548-021-02473-3
Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 2.924
Training, validation and test datasets (hours of video)
| Phase name | Training data | Validation data | Testing data |
|---|---|---|---|
| Not a phase | 95.50 | 24.35 | 20.03 |
| Ligation of short gastric vessels | 70.79 | 18.03 | 13.80 |
| Gastric transection | 66.47 | 15.90 | 11.51 |
| Bougie | 5.08 | 1.07 | 0.84 |
| Oversew staple line | 42.71 | 13.46 | 6.63 |
| Exploration/inspection | 3.03 | 0.64 | 0.45 |
| Liver retraction | 1.09 | 0.43 | 0.11 |
| Hiatal hernia repair | 7.48 | 1.21 | 1.71 |
| Gastric band removal | 0.88 | 0.71 | 0.52 |
Fig. 1An overview of the proposed workflow: initial predictions are generated by 3D CNN from image sequences. Prior knowledge filtering is used to finalize the prediction results
Overall accuracy and weighted Jaccard score using different training techniques and different deep learning pipelines
| Method | Sampling | Augmentation | Loss | PKF | Accuracy | Jaccard |
|---|---|---|---|---|---|---|
| C3D | ASBS | CE | 0.7548 | 0.4010 | ||
| C3D-PKF | ASBS | CE | 0.7929 | 0.6591 | ||
| I3D | ASBS | CE | 0.7795 | 0.6506 | ||
| I3D-PKF | ASBS | CE | 0.8257 | 0.7099 | ||
| I3D-PKF | SMOTE | CE | 0.7892 | 0.6598 | ||
| InceptionV3-BiLSTM-PKF | ASBS | CE | 0.8078 | 0.6856 | ||
| InceptionV3-BiLSTM-FL-PKF | ASBS | FL | 0.8161 | 0.6989 | ||
| I3D-FL-PKF | ASBS | FL | 0.8340 | 0.7187 | ||
| I3D-FL-PKF | ASBS | FL | 0.8416 | 0.7327 |
Detailed performance for the I3D-FL-PKF workflow pipeline
| Phase name | Precision | Recall | F1 score |
|---|---|---|---|
| Not a phase | 0.80 | 0.75 | 0.78 |
| Ligation of short gastric vessels | 0.86 | 0.93 | 0.89 |
| Gastric transection | 0.89 | 0.90 | 0.90 |
| Bougie | 0.35 | 0.30 | 0.32 |
| Oversew staple line | 0.86 | 0.94 | 0.90 |
| Exploration/inspection | 0.74 | 0.17 | 0.28 |
| Liver retraction | 0.18 | 0.68 | 0.28 |
| Hiatal hernia repair | 0.85 | 0.92 | 0.88 |
| Gastric band removal | 0.85 | 0.69 | 0.76 |
Fig. 2Confusion matrices for phase recognition results: a I3D prediction results, b I3D-PKF prediction results, c I3D-FL-PKF prediction results. The X and Y-axis represent predicted label and ground truth, respectively. The “Not a phase” is denoted as P0. The “Ligation of short gastric vessels” phase is denoted as P1. The “Gastric transection” phase is denoted as P2. The “Bougie” phase is denoted as P3. The “Oversew staple line” phase is denoted as P4. The “Exploration/inspection” phase is denoted as P5. The “Liver retraction” phase is denoted as P6. The “Hiatal hernia repair” phase is denoted as P7. The “Gastric band removal” phase is denoted as P8
Fig. 3Color-coded ribbon illustration for phase recognition results: a InceptionV3-BiLSTM-FL prediction results, b InceptionV3-BiLSTM-FL-PKF prediction results, c I3D-FL prediction results, d I3D-FL-PKF prediction results, e Ground Truth