| Literature DB >> 33149150 |
Philipp D Lösel1,2, Thomas van de Kamp3,4, Alejandra Jayme5,6, Alexey Ershov3,4, Tomáš Faragó3, Olaf Pichler5,7, Nicholas Tan Jerome8, Narendar Aadepu9,10, Sabine Bremer3,4,9, Suren A Chilingaryan8, Michael Heethoff11, Andreas Kopmann8, Janes Odar3,4, Sebastian Schmelzle11, Marcus Zuber3,4, Joachim Wittbrodt9, Tilo Baumbach3,4, Vincent Heuveline5,6,7.
Abstract
We present Biomedisa, a free and easy-to-use open-source online platform developed for semi-automatic segmentation of large volumetric images. The segmentation is based on a smart interpolation of sparsely pre-segmented slices taking into account the complete underlying image data. Biomedisa is particularly valuable when little a priori knowledge is available, e.g. for the dense annotation of the training data for a deep neural network. The platform is accessible through a web browser and requires no complex and tedious configuration of software and model parameters, thus addressing the needs of scientists without substantial computational expertise. We demonstrate that Biomedisa can drastically reduce both the time and human effort required to segment large images. It achieves a significant improvement over the conventional approach of densely pre-segmented slices with subsequent morphological interpolation as well as compared to segmentation tools that also consider the underlying image data. Biomedisa can be used for different 3D imaging modalities and various biomedical applications.Entities:
Mesh:
Year: 2020 PMID: 33149150 PMCID: PMC7642381 DOI: 10.1038/s41467-020-19303-w
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Comparison between a conventional segmentation approach and Biomedisa.
Both procedures require manual pre-segmentation of the 3D image stack. While the widely used morphological interpolation solely considers labels on pre-segmented slices, Biomedisa takes both the underlying 3D image data and the pre-segmented slices into account, resulting in a significantly lower amount of required manual input. Moreover, interpolation artifacts are avoided and fine details like hairs, which are usually omitted during manual segmentation, are included.
Fig. 4Uncertainty of Biomedisa results based on different numbers of pre-segmented slices.
For 215, 108, 54 and 37 pre-segmented slices, the results show approximately the same degree of uncertainty, while for 27 or fewer slices the uncertainty increases significantly. The conspicuous bright line represents the boundary between elytra and thorax, which are closely interlocked. The boundary is almost invisible in the tomographic scan, thus resulting in a high uncertainty of the segmentation result. The uncertainty values range from 0 (blue) to 1 (red), with 0 meaning no uncertainty and 1 meaning a high degree of uncertainty, i.e. a voxel can be assigned to at least two labels with the same probability.
Fig. 2Biomedisa segmentation using a Trigonopterus weevil as an example.
a Photograph of the original specimen. b Result of Biomedisa segmentation based on 37 pre-segmented slices of the tomographic volume adapted to the weevil’s morphology. c The 64 isolated body parts of (b). The surface meshes shown in this figure are based on the original Biomedisa result. If necessary, outliers or minor flaws in the reconstruction (e.g. tiny holes) can be corrected with low effort.
Fig. 5Biomedisa examples.
a Medaka fish with segmented skeleton and selected internal organs (based on µCT scan). b Mouse molar tooth showing enamel (white) and dentine (yellow) (µCT). c Human heart with segmented heart muscle and blood vessels (MRI). d Fossil parasitoid wasp from Baltic amber (SR-µCT). e Tracheal system of a hissing cockroach (µCT). f Claw of a theropod dinosaur from Burmese amber (SR-µCT). g Fossil parasitoid wasp preserved inside a mineralized fly pupa (SR-µCT). h Head of an Australian bull ant queen (SR-µCT). See “Methods” and Supplementary Table 2 for details on the specimens.
Quantitative comparison of different semi-automatic segmentation tools for the segmentation of different datasets.
| Dataset | Method | Dice (%) | ASD (pixels) | Time (min) |
|---|---|---|---|---|
| Mineralized wasp (56 labels, every 20th slice pre-segmented, 1077 × 992 × 2553 voxels) | Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 94.21 79.42 88.29 79.96 79.04 | 0.583 4.196 3.803 3.008 3.762 | 30 332 372 1943 10 21 |
Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 96.99 80.79 36.13 82.75 84.66 | 0.553 6.001 loss of 38 labels loss of 4 labels loss of 3 labels | 20 415 494 2419 11 13 | |
| Wasp from amber (15 labels, every 40th slice pre-segmented, 1417 × 2063 × 2733 voxels) | Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 93.55 88.80 90.80 80.78 81.52 | 1.751 6.987 8.136 5.234 6,892 | 48 1057 1815 5591 28 19 |
| Theropod claw (1 label, every 80th slice pre-segmented, 1986 × 1986 × 3602 voxels) | Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 60.02 16.69 0.0 69.97 66.35 | 3.845 18.174 29.542 4.318 5.815 | 21 182 541 563 36 3 |
| Medaka skeleton (1 label, every 10th−80th slice pre-segmented, 900 × 1303 × 4327 voxels) | Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 76.40 5.01 7.59 39.04 23.69 | 1.694 17.122 20.072 7.220 11.976 | 28 196 537 629 17 2 |
| Human hearts (2 labels, every 20th slice pre-segmented, 157 × 216 × 167 voxels on average) | Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 89.58 ± 1.71 81.37 ± 2.57 88.36 ± 1.49 70.14 ± 3.60 68.67 ± 4.35 | 0.715 ± 0.210 1.764 ± 0.307 1.021 ± 0.223 3.468 ± 0.610 3.862 ± 0.618 | 3 ± 2 s 13 ± 6 s 52 ± 24 s 61 ± 28 s 1 ± 1 s <30 s |
| Mouse molar teeth (3 labels, every 40th slice pre-segmented, 438 × 543 × 418 voxels on average) | Biomedisa GeodisTK (GeoS) Scikit-image (RW) MedPy (GC) ITK interpolation Amira interpolation | 98.20 ± 0.20 81.89 ± 1.11 89.90 ± 2.44 80.69 ± 1.93 79.20 ± 2.25 | 0.585 ± 0.055 6.813 ± 0.493 6.620 ± 1.397 6.375 ± 0.883 6.651 ± 0.885 | 1.3 ± 0.1 8.1 ± 0.6 17.4 ± 1.6 52.7 ± 6.4 0.2 ± 0.1 <1 |
For the configuration, the values of the default parameters were chosen, i.e. β = 130 (RW), norw = 10 and sorw = 4000 (Biomedisa). Graph Cut and GeoS have no default values for σ and the number of iterations, respectively. The values were therefore chosen from the examples in the documentation, i.e. σ = 15 (GC) and iterations = 4 (GeoS). If not explicitly stated otherwise, Dice and ASD scores are twofold cross-validation accuracies. If the dataset consists of several images, the standard deviation is given (±). Highest accuracy and best result are shown in bold font.
Fig. 6Speedup of computing times of GPU-based Biomedisa compared to CPU-based segmentation tools.
Speedup of computing times of different semi-automatic segmentation tools that take the image data into account compared to the slowest method according to Table 1. The values for mouse molar teeth and human hearts are average values.
Fig. 7Visual comparison of robustness to input errors.
Visual comparison of different segmentation methods for segmenting a human mandible based on flawed and inconsistent ground truth data labeled by two annotators. For the configuration, the values of the default parameters were chosen. If no default values were given, the values were selected from the examples in the documentation. Here, every 20th slice of the manual segmentation was used as initialization. The images show the segmentation result between two pre-segmented slices.
Fig. 3Biomedisa results based on different numbers of pre-segmented slices.
Inputs of 215 (as used for morphological interpolation), 108 and 54 equally spaced slices that correspond to pre-segmentation of every 5th, 10th and 20th slice provided accurate results. By adapting the spacing between the slices to the weevil’s morphology, a much lower count of only 37 slices yielded a dataset of equal quality. Lower numbers of pre-segmented slices resulted in increasingly flawed outputs.