| Literature DB >> 34362386 |
Marlen Runz1,2, Daniel Rusche3, Stefan Schmidt4, Martin R Weihrauch5, Jürgen Hesser6,7,8, Cleo-Aron Weis3.
Abstract
BACKGROUND: Histological images show strong variance (e.g. illumination, color, staining quality) due to differences in image acquisition, tissue processing, staining, etc. This can impede downstream image analysis such as staining intensity evaluation or classification. Methods to reduce these variances are called image normalization techniques.Entities:
Keywords: Deep learning; Digital pathology; Generative adversarial networks; HE-stain; Histology stain normalization; Style transfer; Unpaired image-to-image translation
Mesh:
Substances:
Year: 2021 PMID: 34362386 PMCID: PMC8349020 DOI: 10.1186/s13000-021-01126-y
Source DB: PubMed Journal: Diagn Pathol ISSN: 1746-1596 Impact factor: 2.644
Fig. 1Illustration of the applied CycleGAN architecture for mapping images from domain A to domain B. A real sample image is mapped to domain B by the generator and then back to domain A by the generator . The discriminator D differentiates between the generated image and a real sample image . The same process is done for the reverse direction when mapping a real sample image from domain B to domain A and backwards, i.e . During training, the loss is computed by the adversarial loss and the cycle consistency loss
Fig. 2Exemplary miniature image of the WSI that forms the HEV data set. Serial tissue sections from a thyroid tissue with a follicular carcinoma with HE-staining. For every slide the staining protocol is intentionally modified: A) Standard protocol at the Institute of Pathology, Medical Faculty Mannheim, Heidelberg University (HE) B) Shortened staining time (shortHE) C) Prolonged staining time (longHE) D) Only hematoxylin-stain (onlyH) E) Only eosin-stain (onlyE)
Fig. 3Results gallery from our experiments on the Mitos-Atypia-14 challenge data set. Columns A-C refer to the image tiles scanned by the Aperio scanner () being mapped by the generator G to produce the corresponding image in the domain of the Hamamatsu scanner () and the reconstruction from mapping the image back to its source domain (), i.e . The same process is done in the reverse direction for image tiles scanned in domain B, i.e (column D-F). Each row 1-4 presents another example tissue section
Fig. 4Results gallery from our experiments on the HEV data set for the mapping . Here, the input image is from domain A of the standard stained tissue (HE) being mapped to domain B corresponding to the image-sets shortHE, longHE, onlyH, onlyE. Each block A-D shows another example tissue section. The top row of each block represents an exemplary image tile of the stain to be mapped into, while the bottom row depicts the input image and the corresponding output for each stain
Fig. 5Evaluation of our experiments using FID and SSIM scores. A) FID scores between real and generated (fake, rec) images. For identical images the FID is zero, whereas it increases with noise and disturbances. B) SSIM scores between real vs. rec images. The SSIM scale ranges from 0 to 1 and is close to zero for hardly similar images. A table with all FID and SSIM scores is presented in the Appendix
Kappa-values for the ResNet models that were trained on different versions of the Camelyon16 data set
| model / data | ||||||
|---|---|---|---|---|---|---|
| ResNet_ori | 0.00 | 0.12 | 0.00 | 0.00 | ||
| ResNet_HE | 0.02 | 0.28 | 0.13 | 0.01 | ||
| ResNet_TL | 0.36 | 0.10 | 0.08 | 0.24 |
The training images (from the Camelyon16 data set) are 1) original (Cam_ori), 2) normalized by the CycleGAN to the HEV data set (Cam_HE) or 3) the TL data set (Cam_TL), respectively. For each training sets, a ResNet model was trained: 1) ResNet_ori, 2) ResNet_HE and 3) ResNet_TL. All models were tested on images from the Camelyon16 data set (n=1728 images) and the TL data set (n=1802 images). There were again three versions of both test data sets: one original version (Cam_ori and TL_ori), one version normalized to the HEV data set (Cam_HE and TL_HE), and one version normalized to the Camelyon16 (TL_Cam) or the TL data set (Cam_TL). The best kappa value obtained for each test set (column-wise) on all models is shown in bold
Overview of our stain normalization experiments
| Data set | Experiment Name | Set | Set |
|---|---|---|---|
| Mistos-Atypia-14 | Aperio scanner | Hamamatsu scanner | |
| HEV | standard HE stained | shortened staining time | |
| prolonged staining time | |||
| only stained with hematoxylin | |||
| only stained with eosin | |||
| Camelyon16 | Camelyon16 | standard HE stained | |
| TumorLymphnode | |||
| TumorLymphnode | TumorLymphnode | standard HE stained | |
| Camelyon16 |
FID scores for all experiments between real and generated (fake, rec) images for A and B
| FID | |||||
|---|---|---|---|---|---|
| 31.5017 | 59.4240 | 51.4460 | 119.0061 | 203.6761 | |
| 12.1464 | 4.5465 | 6.0007 | 4.1793 | 8.5647 | |
| 4.0544 | 4.2877 | 7.8630 | 4.0363 | 8.5685 | |
| 10.3222 | 4.0365 | 5.3136 | 7.0321 | 7.9218 | |
| 2.6451 | 6.3173 | 2.9931 | 4.9206 | 11.0160 |
SSIM scores (SD = standard deviation) for all experiments between real and rec images for A and B
| SSIM (SD) | |||||
|---|---|---|---|---|---|
| 0.9406 (0.0147) | 0.9724 (0.0055) | 0.9572 (0.0073) | 0.9731 (0.0057) | 0.9534 (0.0080) | |
| 0.9606 (0.0148) | 0.9760 (0.0063) | 0.9702 (0.0098) | 0.9763 (0.0056) | 0.9648 (0.0107) |
Kappa-values for the ResNet models that were trained on different versions of the Camelyon16 data set
| model / data | ||||||||
|---|---|---|---|---|---|---|---|---|
| ResNet_ori | 0.00 | 0.00 | 0.12 | 0.00 | 0.00 | 0.00 | ||
| ResNet_HE | 0.02 | 0.01 | 0.28 | 0.13 | 0.00 | 0.01 | ||
| ResNet_onlyH | 0.00 | 0.00 | 0.14 | 0.03 | 0.00 | 0.00 | ||
| ResNet_TL | 0.36 | 0.10 | 0.53 | 0.38 | 0.08 | 0.24 |
The best kappa value obtained for each test set (column-wise) on all models is shown in bold