| Literature DB >> 29184668 |
Odysseas Zisimopoulos1, Evangello Flouty1, Mark Stacey1, Sam Muscroft1, Petros Giataganas1, Jean Nehme1, Andre Chow1, Danail Stoyanov1,2.
Abstract
Computer-assisted interventions (CAI) aim to increase the effectiveness, precision and repeatability of procedures to improve surgical outcomes. The presence and motion of surgical tools is a key information input for CAI surgical phase recognition algorithms. Vision-based tool detection and recognition approaches are an attractive solution and can be designed to take advantage of the powerful deep learning paradigm that is rapidly advancing image recognition and classification. The challenge for such algorithms is the availability and quality of labelled data used for training. In this Letter, surgical simulation is used to train tool detection and segmentation based on deep convolutional neural networks and generative adversarial networks. The authors experiment with two network architectures for image segmentation in tool classes commonly encountered during cataract surgery. A commercially-available simulator is used to create a simulated cataract dataset for training models prior to performing transfer learning on real surgical data. To the best of authors' knowledge, this is the first attempt to train deep learning models for surgical instrument detection on simulated data while demonstrating promising results to generalise on real data. Results indicate that simulated data does have some potential for training advanced classification methods for CAI systems.Entities:
Keywords: CAI surgical phase recognition algorithms; Vision20 based tool detection; biomedical optical imaging; cataract surgery; computer-assisted interventions; deep convolutional neural networks; deep learning; generative adversarial networks; image classification; image recognition; image segmentation; learning (artificial intelligence); medical image processing; neural nets; surgery; surgical simulation; tool detection; tool segmentation; video signal processing
Year: 2017 PMID: 29184668 PMCID: PMC5683210 DOI: 10.1049/htl.2017.0064
Source DB: PubMed Journal: Healthc Technol Lett ISSN: 2053-3713
Fig. 1Example synthetic (top row) and real (bottom row) datasets used in this work; (a–c) three exemplar tools used in cataract surgery
a1, b1, c1 Images generated through a commercial simulation environment
a2, b2, c2 Segmentation masks delineating instruments from the operating site
a3, b3, c3 Real cataract surgery images (https://cataracts.grand-challenge.org/)
Summary of all deep learning models trained on the simulated datasets with various sizes and resolutions
| Model | Resolution | Training set size |
|---|---|---|
| FCN-VGG-400 | 400 | |
| FCN-VGG-10K-Small | 10,376 | |
| FCN-VGG-10K-Large | 10,376 | |
| P2P-400 | 400 | |
| P2P-10K | 10,376 |
Fig. 2Experimental results presenting all the outputs of the two learning models for different parameters (training dataset size and resolution) applied on a simulation testing dataset. The simulated images are shown in the first column and the ground truth segmentation images in the second column
Evaluation metrics for the performance of all deep learning models on the simulated test set
| Model | Pixel accuracy | Mean accuracy | Mean IU | fwIU |
|---|---|---|---|---|
| FCN-VGG-400 | 0.936 | 0.334 | 0.254 | 0.883 |
| FCN-VGG-10K-Small | 0.959 | 0.372 | 0.354 | 0.922 |
| FCN-VGG-10K-Large | 0.977 | 0.639 | 0.526 | 0.958 |
| P2P-400 | 0.981 | 0.395 | 0.196 | 0.969 |
| P2P-10K | 0.982 | 0.503 | 0.260 | 0.974 |
Fig. 3Experimental results presenting all the outputs of the two learning models for different parameters (training dataset number and resolution) applied on a real dataset. The real images are shown in the first row, while zoomed image results are presented on the right. We have manually labelled the ground-truth segmentations of the images in the second row
Evaluation metrics for the comparison between the balanced and imbalanced models on the simulated domain
| Model | Pixel accuracy | Mean accuracy | Mean IU | fwIU |
|---|---|---|---|---|
| FCN-VGG-10K-Large | 0.977 | 0.639 | 0.526 | 0.958 |
| P2P-10K | 0.982 | 0.503 | 0.260 | 0.974 |
| FCN-VGG-10K-Large-balanced | 0.972 | 0.709 | 0.531 | 0.950 |
| P2P-balanced | 0.985 | 0.527 | 0.432 | 0.974 |