| Literature DB >> 33194545 |
Nikolaos-Kosmas Chlis1,2,3, Angelos Karlas2,4,5,6, Nikolina-Alexia Fasoula2,4, Michael Kallmayer6, Hans-Henning Eckstein6, Fabian J Theis1,7, Vasilis Ntziachristos2,4,5, Carsten Marr1.
Abstract
Multispectral Optoacoustic Tomography (MSOT) resolves oxy- (HbO2) and deoxy-hemoglobin (Hb) to perform vascular imaging. MSOT suffers from gradual signal attenuation with depth due to light-tissue interactions: an effect that hinders the precise manual segmentation of vessels. Furthermore, vascular assessment requires functional tests, which last several minutes and result in recording thousands of images. Here, we introduce a deep learning approach with a sparse-UNET (S-UNET) for automatic vascular segmentation in MSOT images to avoid the rigorous and time-consuming manual segmentation. We evaluated the S-UNET on a test-set of 33 images, achieving a median DICE score of 0.88. Apart from high segmentation performance, our method based its decision on two wavelengths with physical meaning for the task-at-hand: 850 nm (peak absorption of oxy-hemoglobin) and 810 nm (isosbestic point of oxy-and deoxy-hemoglobin). Thus, our approach achieves precise data-driven vascular segmentation for automated vascular assessment and may boost MSOT further towards its clinical translation.Entities:
Keywords: Artificial intelligence; Clinical; Deep learning; Machine learning; Multispectral optoacoustic tomography; Segmentation; Translational
Year: 2020 PMID: 33194545 PMCID: PMC7644749 DOI: 10.1016/j.pacs.2020.100203
Source DB: PubMed Journal: Photoacoustics ISSN: 2213-5979
Fig. 1The S-UNET identifies important illumination wavelengths in MSOT images while learning to predict segmentation masks of human blood vessels. Each wavelength is weighted by a corresponding non-negative weight and all weighted wavelengths are combined before being inserted as input into a UNET architecture. Sparsity of wavelength selection is enforced by L1 regularization on the non-negative wavelength weights and the weights themselves are learned through standard back-propagation, along with the rest of the UNET parameters.
Model Performance. Dice results correspond to mean ± std.
| Model | Test Set Dice | Parameters | Wavelength Selection |
|---|---|---|---|
| UNET (original) | 0.75 ± 0.28 | 31,416,897 | No |
| UNET (downsized) | 0.90 ± 0.08 | 495,881 | No |
| UNET++ | 0.61 ± 0.26 | 9,049,377 | No |
| S-UNET | 0.86 ± 0.11 | 493,965 | Yes |
Fig. 2The S-UNET identifies wavelengths relevant to the segmentation task. Each boxplot (the box’s edges correspond to quartiles 1 and 3 while whiskers extend to ±1.5 times the interquartile range) corresponds to the weights assigned by the ensemble of 100 S-UNET instances to each wavelength. Averaging results is necessary since feature selection is an inherently noisy process. For every S-UNET instance, each of the 28 wavelengths of the input image is multiplied by its corresponding weight and all 28 weighted single-wavelength images are added in a pixel-wise manner. This step results in a single-channel image being passed on to the following layers of the network. According to the median weight of each wavelength, the two most important wavelengths are 850 nm and 810 nm, corresponding to the maximum absorption of HbO2 and total hemoglobin (THb), respectively.
Fig. 3The S-UNET successfully segments human vasculature from MSOT images. Each row corresponds to a different image of the test set. The first column (images a, e, i, m) shows the 850 nm channel of the MSOT image. The second (images b, f, j, n) and third columns (c, g, k, o) show the ground truth (true mask, blue) and predicted segmentation masks (red), respectively, visualized on top of the input image. The true segmentation mask is identified by expert physicians, while the S-UNET predicted segmentation mask corresponds to the output of the S-UNET ensemble. The fourth column (images d, h, l, p) corresponds to the absolute difference between the true and predicted binary segmentation masks and is equivalent to the logical operation of XOR (exclusive or). The predicted masks almost completely overlap with the ground truth segmentation. The S-UNET is successful even in the last two cases (rows) where the mask is relatively small and located in an area where similar bright spots are present. The white dashed line represents the skin surface. The white arrows point to the blood vessel of interest. The scale bar is 5 mm. The gray color bar ranges from 0 to 1 and corresponds to the normalized intensity of each image (columns 1-3) or the difference of the true and predicted segmentation masks (column 4).