| Literature DB >> 36268074 |
Andrew Broad1,2, Alexander I Wright3,4, Marc de Kamps1,2,5, Darren Treanor3,6,7,8.
Abstract
Improvements to patient care through the development of automated image analysis in pathology are restricted by the small image patch size that can be processed by convolutional neural networks (CNNs), when compared to the whole-slide image (WSI). Tile-by-tile processing across the entire WSI is slow and inefficient. While this may improve with future computing power, the technique remains vulnerable to noise from uninformative image areas. We propose a novel attention-inspired algorithm that selects image patches from informative parts of the WSI, first using a sparse randomised grid pattern, then iteratively re-sampling at higher density in regions where a CNN classifies patches as tumour. Subsequent uniform sampling across the enclosing region of interest (ROI) is used to mitigate sampling bias. Benchmarking tests informed the adoption of VGG19 as the main CNN architecture, with 79% classification accuracy. A further CNN was trained to separate false-positive normal epithelium from tumour epithelium, in a novel adaptation of a two-stage model used in brain imaging. These subsystems were combined in a processing pipeline to generate spatial distributions of classified patches from unseen WSIs. The ROI was predicted with a mean F1 (Dice) score of 86.6% over 100 evaluation WSIs. Several algorithms for evaluating tumour-stroma ratio (TSR) within the ROI were compared, giving a lowest root mean square (RMS) error of 11.3% relative to pathologists' annotations, against 13.5% for an equivalent tile-by-tile pipeline. Our pipeline processed WSIs between 3.3x and 6.3x faster than tile-by-tile processing. We propose our attention-based sampling pipeline as a useful tool for pathology researchers, with the further potential for incorporating additional diagnostic calculations.Entities:
Keywords: Artificial intelligence; Attention; Colorectal cancer; Region of interest; Sampling; Tumour–stroma ratio
Year: 2022 PMID: 36268074 PMCID: PMC9577057 DOI: 10.1016/j.jpi.2022.100110
Source DB: PubMed Journal: J Pathol Inform
Fig. 1Annotated QUASAR data, showing WSI overlaid with pathologist-generated ROI (blue outline) and cell patch classifications (in 3 mm box, red).
Fig. 2Typical cell patches extracted from QUASAR slides. Tissue types (left-right, top-bottom): A) non-informative, b) tumour, c) stroma or fibrosis, d) necrosis, e) vessels, f) inflammation, g) lumen, h) mucin, and i) muscle. Patch sizes shown: 100x100px, 224x224px, 448x448px. The pathologists’ ground-truth classification refers to the central pixel of each patch, although surrounding tissue was examined to provide structural context.
Fig. 3WSI sampling and patch-classification pipeline.
Comparative performance of CNN architectures in 9-way classification of 224x224px QUASAR patches.
| CNN type | Model size (MB) | Inference time (ms/patch) | Accuracy (random initial weights) | Accuracy (pre-trained on ImageNet) | Best % tumour in ROI from pipeline (pre- FP correction) |
|---|---|---|---|---|---|
| VGG19 | 548 | 63.9 | 72% | ||
| GoogLeNet | 25.4 | 63.8 | 75% | 92.0% | |
| DenseNet | 111 | 35.6 | 74% | 78% | 93.7% |
| VGG16 | 528 | 35.3 | 74% | 78% | 91.6% |
| MobileNet | 13.6 | 18.2 | 73% | 77% | 91.0% |
| AlexNet | 233 | 18.9 | 75% | 76% | |
| Inception 3 | 91.2 | 57.8 | 72% | 70% | |
| ResNet 50 | 97.8 | 35.0 | 71% | Model unavailable | |
| ResNet 18 | 44.7 | 37.1 | 68% | 68% |
The highest percentage score in each column is shown in bold text.
Fig. 4Sampling iterations applied during WSI processing, showing increasing density around detected tumour patches.
Pipeline performance and ROI accuracy for various sampling grid sizes and numbers of sampling iterations. The 224px grid size represents tile-by-tile processing, for comparison.
| Grid h, w (pixels) | Grid h, w (μm) | Resampling iterations (before final step) | Processing time, mins per WSI | Patches per WSI | ROI agreement (F1 Score) |
|---|---|---|---|---|---|
| 1024 | 502 | 1 | 23:19 | 4041 | 79.3% |
| 1024 | 502 | 2 | 23:05 | 7007 | 83.0% |
| 768 | 376 | 1 | 25:12 | 6542 | 83.2% |
| 768 | 376 | 2 | 36:27 | 12,016 | 86.5% |
| 640 | 313 | 1 | 29:05 | 7826 | 83.6% |
| 640 | 313 | 2 | 44:18 | 14,257 | 86.6% |
| 224 | 110 | – | 145:21 | 33,065 | 89.3% |
Accuracy of tumour–stroma ratio estimation for various sampling grid sizes and numbers of sampling iterations. The 224px grid size represents tile-by-tile processing, for comparison.
| Grid h, w (pixels) | Resampling iterations (before final step) | Min TSR RMSE | Sampling strategy giving min RMS error |
|---|---|---|---|
| 1024 | 1 | 12.7% | 100 patches, estimated ROI |
| 1024 | 2 | 12.9% | 3 mm box, max tumour density point |
| 768 | 1 | 12.7% | 100 patches, estimated ROI |
| 768 | 2 | 11.8% | 3 mm box, max tumour density point |
| 640 | 1 | 11.3% | 3 mm box, max tumour density point |
| 640 | 2 | 12.4% | Mean TSR over estimated ROI. |
| 224 | - | 13.5% | 3 mm box, max tumour density point |
Comparative performance of TSR algorithms for 640px grid size, 1 resampling iteration
| Sampling region (using post-FPC tumour points unless stated) | TSR mean error (offset) | TSR RMS error |
|---|---|---|
| 1) Predicted ROI (average over ≈100 points) | 0.09 | 14.9% |
| 2) 3 mm box at max tumour density point with RandomSpot layout (120 points) | -0.04 | 11.3% |
| 3) Predicted ROI (mean Sliding Window output) | 0.05 | 12.7% |
| 4) Ground-truth locations | 0.00 | 6.4% |
| 5) 4-layer CNN trained for TSR prediction | 0.12 | 20.7% |