| Literature DB >> 35095454 |
Min-Seok Kim1, Joon Hyuk Cha2, Seonhwa Lee3, Lihong Han1,4, Wonhyoung Park5, Jae Sung Ahn5, Seong-Cheol Park1,6,7,8.
Abstract
There have been few anatomical structure segmentation studies using deep learning. Numbers of training and ground truth images applied were small and the accuracies of which were low or inconsistent. For a surgical video anatomy analysis, various obstacles, including a variable fast-changing view, large deformations, occlusions, low illumination, and inadequate focus occur. In addition, it is difficult and costly to obtain a large and accurate dataset on operational video anatomical structures, including arteries. In this study, we investigated cerebral artery segmentation using an automatic ground-truth generation method. Indocyanine green (ICG) fluorescence intraoperative cerebral videoangiography was used to create a ground-truth dataset mainly for cerebral arteries and partly for cerebral blood vessels, including veins. Four different neural network models were trained using the dataset and compared. Before augmentation, 35,975 training images and 11,266 validation images were used. After augmentation, 260,499 training and 90,129 validation images were used. A Dice score of 79% for cerebral artery segmentation was achieved using the DeepLabv3+ model trained using an automatically generated dataset. Strict validation in different patient groups was conducted. Arteries were also discerned from the veins using the ICG videoangiography phase. We achieved fair accuracy, which demonstrated the appropriateness of the methodology. This study proved the feasibility of operating field view of the cerebral artery segmentation using deep learning, and the effectiveness of the automatic blood vessel ground truth generation method using ICG fluorescence videoangiography. Using this method, computer vision can discern blood vessels and arteries from veins in a neurosurgical microscope field of view. Thus, this technique is essential for neurosurgical field vessel anatomy-based navigation. In addition, surgical assistance, safety, and autonomous surgery neurorobotics that can detect or manipulate cerebral vessels would require computer vision to identify blood vessels and arteries.Entities:
Keywords: blood vessel; cerebral artery; computer vision; deep learning; indocyanine green; neural network; neurosurgical field; semantic segmentation
Year: 2022 PMID: 35095454 PMCID: PMC8790180 DOI: 10.3389/fnbot.2021.735177
Source DB: PubMed Journal: Front Neurorobot ISSN: 1662-5218 Impact factor: 2.650
Figure 1A data pair of full color visible light image data and artery ground truth map data generated from neurosurgical operating microscope ICG fluorescence videoangiography (IR800). Large veins with darker color are excluded (green arrows). (A) Visible light image. (B) Ground truth map from ICG. (C) Deep learning semantic segmentation class allocation according to anatomical structure and object types.
Figure 2This flowchart demonstrates the pipeline, which includes recording, pre-processing, and augmentation. (A) Recording visible ray color video. (B) ICG infrared fluorescence videos. (C) Image co-registration of color images. (D) Augmentations are done.
Deeplab V3+ header architecture based on ResNet-101.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| ResNet-101 low level feature | 256 × 128 × 128 | – | – | – | – |
| Convolution 1 | 48 × 128 × 128 | 1 × 1 | 1 | 1 | 0.1 |
| ResNet-101 out feature | 2,048 × 64 × 64 | – | – | – | – |
| Atrous spatial pyramidal pooling | 1,280 × 64 × 64 | – | – | – | – |
| Project convolution | 256 × 64 × 64 | 1 × 1 | 1 | – | 0.1 |
| Up sampling (interpolate) | 256 × 128 × 128 | – | – | – | – |
| Concatenation | 304 × 128 × 128 | – | – | – | – |
| Convolution | 256 × 128 × 128 | 3 × 3 | 1 | 1 | – |
| Convolution | 2 × 128 × 128 | 1 × 1 | 1 | – | – |
| Up sampling (interpolate) | 2 × 512 × 512 | – | – | – | – |
Figure 3Ground truth and inference results by models. For the first row, cerebral artery ground truth was shown and veins which are not included for the ground truth were marked by green arrows. For each box of four 2 × 2 images, the left upper image is the visible light color image from neurosurgical operating microscope video for analyses. The right lower image is the generated map by the model. In the right upper image, the ground truth in the first row boxes or result map from the second row boxes to the last row boxes is overlapped on monotone source image. In the left lower image, ground truth was shown in the first row. From the second row, result map and ground truth were superimposed to show true positive, false positive, false negative and true negative pixels.
Mean accuracies and confusion matrix for the best results.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
| 92-artery patient group: 84 artery phase training/8 artery phase validation | 0.795 | 0.660 | 0.746 | 10.7% | 4.3% | 76.8% | 8.23% |
| 99-patient group: 88 artery phase + 4 delayed phase train/8 artery phase + 3 delayed phase validation | 0.775 | 0.632 | 0.763 | 9.99% | 4.67% | 76.1% | 9.23% |
The DeepLabv3+ algorithm was used. Values in the confusion matrix are the percentages of pixels.
Result comparisons among algorithms.
|
|
|
|
|
|---|---|---|---|
| DeepLabv3+ | 0.77618 | 0.63423 | 0.86286 |
| DeepLabv3 | 0.74065 | 0.58813 | 0.83537 |
| FCN | 0.74273 | 0.59075 | 0.84306 |
| U-Net | 0.67751 | 0.51231 | 0.79522 |
Resnet-101 backbones were used for three algorithms except U-Net. Training was conducted in the total 99 patient group: 88 patients train group and 11 patients validation group.
Figure 4Performance by algorithm and training data size.
Hyperparameter settings and results among the 99-patient group with 84 artery phases and 4 delayed phases in the training group.
|
|
|
|
|
|
|---|---|---|---|---|
|
|
|
|
| |
| 10−6 | 0.99 | 5 × 10−5 | Low accuracy 0.4–0.5 | |
| 10−7 | 0.99 | 5 × 10−5 | 0.753 | 0.604 |
| 10−8 | 0.99 | 5 × 10−5 | 0.776 | 0.634 |
| 10−9 | 0.99 | 5 × 10−5 | 0.771 | 0.628 |
| 10−10 | 0.99 | 5 × 10−4 | 0.775 | 0.632 |
| 10−10 | 0.95 | 5 × 10−4 | 0.768 | 0.624 |
| 10−10 | 0.90 | 5 × 10−4 | 0.766 | 0.621 |
| 10−10 | 0.99 | 5 × 10−5 | 0.773 | 0.630 |
Test results in 11 patient group of 8 artery phases and 3 delayed phases are shown. Accuracies were about 2.5% lower than homogeneous artery phase 92 patient dataset shown in .
Hyperparameter settings and results in the 92-patient group with all artery phases.
|
|
|
|
|
|
|---|---|---|---|---|
|
|
|
|
| |
| 10−5 | 0.95 | 5 × 10−4 | Low accuracy 0.4–0.5 | |
| 10−6 | 0.99 | 5 × 10−4 | ||
| 10−6 | 0.99 | 5 × 10−5 | ||
| 10−6 | 0.95 | 5 × 10−4 | ||
| 7 × 10−7 | 0.99 | 5 × 10−5 | ||
| 5 × 10−7 | 0.99 | 5 × 10−4 | ||
| 10−7 | 0.99 | 5 × 10−4 | 0.637 | 0.778 |
| 10−8 | 0.99 | 5 × 10−4 | 0.650 | 0.788 |
| 10−9 | 0.99 | 5 × 10−5 | 0.656 | 0.792 |
| 7 × 10−10 | 0.95 | 5 × 10−3 | 0.660 | 0.795 |
| 5 × 10−10 | 0.99 | 5 × 10−5 | 0.660 | 0.795 |
| 5 × 10−10 | 0.90 | 5 × 10−5 | 0.654 | 0.791 |
| 7 × 10−10 | 0.95 | 5 × 10−3 | 0.660 | 0.795 |
| 10−10 | 0.99 | 5 × 10−4 | 0.656 | 0.792 |
Accuracies are for the test results for eight patients with artery phases.