| Literature DB >> 32953248 |
Hongrun Zhang1, Helen Kalirai2,3, Amelia Acha-Sagredo2, Xiaoyun Yang4, Yalin Zheng1, Sarah E Coupland2,3.
Abstract
Background: Uveal melanoma (UM) is the most common primary intraocular malignancy in adults. Monosomy 3 and BAP1 mutation are strong prognostic factors predicting metastatic risk in UM. Nuclear BAP1 (nBAP1) expression is a close immunohistochemical surrogate for both genetic alterations. Not all laboratories perform routine BAP1 immunohistochemistry or genetic testing, and rely mainly on clinical information and anatomic/morphologic analyses for UM prognostication. The purpose of our study was to pilot deep learning (DL) techniques to predict nBAP1 expression on whole slide images (WSIs) of hematoxylin and eosin (H&E) stained UM sections.Entities:
Keywords: BAP1; artificial intelligence; choroidal melanoma; deep learning; hematoxylin-and-eosin (H&E); prognostication; uveal melanoma; whole slide imaging
Mesh:
Substances:
Year: 2020 PMID: 32953248 PMCID: PMC7476670 DOI: 10.1167/tvst.9.2.50
Source DB: PubMed Journal: Transl Vis Sci Technol ISSN: 2164-2591 Impact factor: 3.283
Figure 1.Low- and high-power magnification images of enucleated eyes stained for hematoxylin and eosin (H&E) and BAP1 immunohistochemistry, taken after whole slide scanning. Top row: This is a nBAP1 positive UM; bottom row: a nBAP1 negative UM. The insets provide the higher power magnification of the tumor morphology and the location of the BAP1 staining.
Figure 2.Generation of tumor patches. The tumor region is first segmented from a whole slide image. The tiling operation is then applied to the tumor region. A tile with tumor ratio over 90% is cropped out as a tumor patch.
Data With Respect to Patch Size and Numbers in the Training and Test Sets
| Training Set (Including Validation Set) | Test Set | |||
|---|---|---|---|---|
| Patch Dimension | Positive | Negative | Positive | Negative |
| 512 × 512 | 160,059 | 253,097 | 49,769 | 76,923 |
| 1024 × 1024 | 38,589 | 61,189 | 12,066 | 18,627 |
| 2048 × 2048 | 9,007 | 14,275 | 2,826 | 4,569 |
Figure 3.Schematic diagram to show the process to predict patch-level nBAP1 expression. All the tumor patches in a slide are fed to a trained ResNet-18, which outputs the posterior probabilities of the patches (this is referred to independent patch classification). The feature vector corresponding to a tumor patch is extracted from the convolutional module (a) in the ResNet-18. The feature vectors of all the tumor patches are re-assembled into global feature maps according to the locations of tumor patches in the slide. The global feature maps are then forward to a U-Net (b), that outputs the probability map of the slide. The posterior probabilities of patches can be produced by a region correlation classification from the probability map of the slide. a A diagram shows a standard residual block used in the ResNet-18; b a diagram represents the U-net architecture used.
Different Subsets (S1–S5) in the Training Set for Training and Validation, Respectively
| Subsets for Training | Subset for Validation | |
|---|---|---|
| Model 1 | S2, S3, S4, S5 | S1 |
| Model 2 | S1, S3, S4, S5 | S2 |
| Model 3 | S1, S2, S4, S5 | S3 |
| Model 4 | S1, S2, S3, S5 | S4 |
| Model 5 | S1, S2, S3, S4 | S5 |
Patch-Level Area Under Curve (AUC) on the Test Set
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Independent | 0.730 | 0.665 | 0.676 | 0.805 | 0.675 | 0.710/0.058 |
| (0.727–0.733) | (0.662–0.668) | (0.673–0.679) | (0.803–0.807) | (0.672–0.678) | ||
| Region correlation | 0.804 | 0.721 | 0.766 | 0.797 | 0.674 | 0.753/0.054 |
| (0.802–0.80) | (0.719–0.724) | (0.763–0.769) | (0.794–0.799) | (0.672–0.677) | ||
|
| ||||||
|
|
|
|
|
|
| |
| Independent | 0.824 | 0.809 | 0.823 | 0.843 | 0.823 | 0.8249/0.011 |
| (0.819–0.829) | (0.805–0.814) | (0.818–0.828) | (0.839–0.847) | (0.819–0.828) | ||
| Region correlation | 0.853 | 0.837 | 0.850 | 0.904 | 0.858 | 0.8605/0.025 |
| (0.849–0.857) | (0.832–0.841) | (0.845–0.854) | (0.900–0.907) | (0.853–0.862) | ||
|
| ||||||
|
|
|
|
|
|
| |
| Independent | 0.773 | 0.788 | 0.819 | 0.826 | 0.817 | 0.8048/0.022 |
| (0.762–0.784) | (0.777–0.798) | (0.809–0.828) | (0.816–0.835) | (0.807–0.827) | ||
| Region correlation | 0.797 | 0.793 | 0.857 | 0.828 | 0.840 | 0.8233/0.027 |
| (0.786–0.808) | (0.783–0.804) | (0.848–0.866) | (0.818–0.838) | (0.830–0.849) | ||
The 95% confidence intervals (CIs) are in parentheses.
Patch-Level Area Under Curve (AUC) on the Test Set of the Method from Sun et al. and Our Method (with Region Correlation)
| Model (1024)-1 | Model (1024)-2 | Model (1024)-3 | Model (1024)-4 | Model (1024)-5 | |
|---|---|---|---|---|---|
| Sun et al. | 0.846 (0.842–0.850) | 0.799 (0.794–0.804) | 0.849 (0.844–0.853) | 0.854 (0.850–0.859) | 0.847 (0.843–0.851) |
| Our method | 0.853 (0.849–0.857) | 0.837 (0.832–0.841) | 0.850 (0.845–0.854) | 0.904 (0.900–0.907) | 0.858 (0.853–0.862) |
The 95% confidence intervals (CIs) are in parentheses.
Patch-Level Area Under Curve (AUC) on the Test Set of the Ensemble Model
| Patch Dimension | 512 × 512 | 1024 × 1024 | 2048 × 2048 |
|---|---|---|---|
| Independent | 0.753 (0.752, 0.753) | 0.849 (0.847, 0.851) | 0.823 (0.812, 0.831) |
| Region-Correlation | 0.825 (0.824, 0.826) | 0.8841 (0.878, 0.886) | 0.8401 (0.835, 0.848) |
The 95% confidence intervals (CIs) are in parentheses.
Figure 4.Receiver operating characteristic (ROC) curve for patch classification (Patch dimension: 1024 × 1024).
Figure 5.Probability maps. The first column lists the UM slide stained with H&E, and their probability maps by independent patch classification (from ResNet-18) and by region-correlation patch classification (from U-Net) were shown on the second and third columns, respectively. The probability maps corresponding to the red dotted regions in the UM slide with zooming. Grey corresponds to nBAP1 negative, whereas blue corresponds to nBAP1 positive.
Slide-Level Performances of the 5 Models (on 1024 × 1024 Patches) on the Test Set
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Accuracy | 0.931 | 0.795 | 0.886 |
| 0.818 | 0.863 |
| Sensitivity |
| 0.875 | 0.875 | 0.813 | 0.750 | 0.875 |
| Specificity | 0.928 | 0.750 | 0.892 |
| 0.857 | 0.857 |
| Precision | 0.882 | 0.666 | 0.823 |
| 0.750 | 0.777 |
| F1 | 0.909 | 0.756 | 0.848 |
| 0.750 | 0.823 |
| AUC | 0.953 | 0.915 | 0.928 | 0.940 | 0.915 | 0.944 |
|
| ||||||
|
|
|
|
|
|
| |
| Accuracy | 0.886 | 0.840 | 0.840 |
| 0.863 | 0.886 |
| Sensitivity |
| 0.750 | 0.812 | 0.812 | 0.750 | 0.812 |
| Specificity | 0.892 | 0.892 | 0.857 |
| 0.928 | 0.928 |
| Precision | 0.823 | 0.800 | 0.764 |
| 0.857 | 0.866 |
| F1 | 0.848 | 0.774 | 0.787 |
| 0.800 | 0.838 |
| AUC | 0.912 | 0.863 | 0.890 |
| 0.915 | 0.935 |
Figure 6.Slide-level receiver operating characteristic (ROC) curve with 95% confidence intervals of the ensemble of 5 models (independent patch classification) trained on 1024 × 1024 patches.
Figure 7.Slide-level receiver operating characteristic (ROC) curve with 95% confidence intervals of the ensemble of 5 models (region-correlation) trained on 1024 × 1024 patches.