| Literature DB >> 36157842 |
Yu-Wen Chen1, Ju Zhang1, Peng Wang2, Zheng-Yu Hu1, Kun-Hua Zhong1.
Abstract
Computer-assisted surgery (CAS) has occupied an important position in modern surgery, further stimulating the progress of methodology and technology. In recent years, a large number of computer vision-based methods have been widely used in surgical workflow recognition tasks. For training the models, a lot of annotated data are necessary. However, the annotation of surgical data requires expert knowledge and thus becomes difficult and time-consuming. In this paper, we focus on the problem of data deficiency and propose a knowledge transfer learning method based on artificial neural network to compensate a small amount of labeled training data. To solve this problem, we propose an unsupervised method for pre-training a Convolutional-De-Convolutional (CDC) neural network for sequencing surgical workflow frames, which performs neural convolution in space (for semantic abstraction) and neural de-convolution in time (for frame level resolution) simultaneously. Specifically, through neural convolution transfer learning, we only fine-tuned the CDC neural network to classify the surgical phase. We performed some experiments for validating the model, and it showed that the proposed model can effectively extract the surgical feature and determine the surgical phase. The accuracy (Acc), recall, precision (Pres) of our model reached 91.4, 78.9, and 82.5%, respectively.Entities:
Keywords: convolutional-de-convolutional; deep learning; neural networks; surgical workflow; transfer learning
Year: 2022 PMID: 36157842 PMCID: PMC9491113 DOI: 10.3389/fncom.2022.998096
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 3.387
FIGURE 1Architecture of the network. *The network is pre-trained and its parameters are fixed.
FIGURE 2Our task for pretraining a CDC whether is the order of the given L frames correct? (Answer: The Left is correct).
List of phases in the dataset.
| ID | Phase |
| P0 | Trocar placement |
| P1 | Preparation |
| P2 | Calot triangle dissection |
| P3 | Clipping and cutting |
| P4 | Gallbladder dissection |
| P5 | Galbladder packaging |
| P6 | Cleaning and coagulation |
| P7 | Gallbladder retraction |
FIGURE 3Phase distribution (training data at 1 fps).
FIGURE 4Sampling data at all phases and transitional moment.
FIGURE 5Examples of negative and positive transitional delays and transitional moment.
ATD, TRR metrics for phase recognition.
| Methods | ATD | TRR |
| Ours | [−15 s; 30 s] | 6.0 |
| Twinanda | [−23 s; 54 s] | 3.8 |
| Dergachyova | [−45 s; 70 s] | 2.7 |
Time delay standard scores metrics for phase recognition.
| Methods | Scores | ||||||||
|
|
|
| |||||||
| Acc | Rec | Pres | Acc | Rec | Pres | Acc | Rec | Pres | |
| Ours |
|
|
|
|
|
|
|
|
|
| Twinanda | 75.2 | 64.6 | 69.0 | 80.5 | 70.6 | 77.8 | 82.9 | 74.9 | 79.5 |
| Dergachyova | 68.6 | 60.9 | 64.1 | 72.1 | 65.3 | 66.2 | 76.6 | 71.4 | 78.1 |
Bold values indicate the optimal result in the algorithm comparison.
Comparison results with no time delay.
| Methods | Rec | Pres |
| Dergachyova | 60.9 | 64.1 |
| Twinanda | 64.6 | 69.0 |
| CNN-biLSTM-CRF | 69.9 | 74.5 |
| Cnn-lstm-net | 72.2 | 60.8 |
| Spatial-net | 72.9 | 73.4 |
| CAE | 68.3 | 72.7 |
|
|
|
|
Bold values indicate the optimal result in the algorithm comparison.