| Literature DB >> 35128605 |
Hristina Uzunova1, Matthias Wilms2, Nils D Forkert2, Heinz Handels3,4, Jan Ehrhardt3,4.
Abstract
PURPOSE: This work aims for a systematic comparison of popular shape and appearance models. Here, two statistical and four deep-learning-based shape and appearance models are compared and evaluated in terms of their expressiveness described by their generalization ability and specificity as well as further properties like input data format, interpretability and latent space distribution and dimension.Entities:
Keywords: Comparison; Generative models; Shape and appearance models
Mesh:
Year: 2022 PMID: 35128605 PMCID: PMC9206635 DOI: 10.1007/s11548-022-02567-6
Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 3.421
Fig. 1Example image data used in our experiments
Fig. 2Generalization ability (left) and specificity (right) for the 2D (top) and 3D shapes (bottom) measured in ASSD (left y-axis) – smaller values are better. The bars indicate the percentage of not generated structures (right y-axis)
Fig. 3Absolute surface distances from a reconstructed left putamen (blue segmentation label in Fig. 1) to the original input. Smaller values indicate better reconstruction
Compactness for all models and data. Smaller values are better. For the statistical methods, the compactness is specified in a range depending on the training set size. For the models that observe shape and appearance separately, the values are noted as
| Data | SS(A)M | LSS(A)M | AE | VAE | AE-GAN | (A)DAE |
|---|---|---|---|---|---|---|
| Shapes 2D | [3,14] | [4,55] | 512 | 512 | 512 | 512 |
| Shapes 3D | [4,51] | [31,100] | 1024 | 1024 | 1024 | 1024 |
| Appearance 3D | [4+4,257+189] | [37+41,383+351] | 1024 | 1024 | 1024 | 1024+64 |
Latent space ambiguity scores (LAS) for all models and training scenarios. Values close to zero indicate an unambiguous latent space
| Data | SS(A)M | LSS(A)M | AE | VAE | AE-GAN | (A)DAE |
|---|---|---|---|---|---|---|
| Shapes 2D | 0 | 0 | 0.4 | 0.02 | 0.5 | 0.03 |
| Shapes 3D | 0 | 0 | 0.95 | 0.14 | 0.45 | 0.05 |
| Appearance 3D | 0 | 0 | 0.27 | 0.06 | 0.14 | 0.07 |
Fig. 4Visualization of the linear interpolation between latent vectors of two shapes (first and last in a row). The bars underneath indicate the ASSD between a reconstruction (20 steps) and the first shape (top bar); and a reconstruction and the last shape (bottom bar). First and last values are the ASSD between real shape to itself or to the second shape and vice versa: yellow(max)blue(0). Some models skipped due to space constrains
Fig. 5Generalization ability, specificity and likeness for the 3D appearance modeling measured as L1 distances: smaller values are better
Fig. 6Interpolation experiment for the 3D appearance images (axial slices). Visualization analogous to Fig. 4 (except for measuring L1 distances between images)