| Literature DB >> 35511469 |
Mark D Zarella1,2, Keysabelis Rivera Alvarez1.
Abstract
Digital pathology and artificial intelligence (AI) rely on digitization of patient material as a necessary first step. AI development benefits from large sample sizes and diverse cohorts, and therefore efforts to digitize glass slides must meet these needs in an efficient and cost-effective manner. Technical innovation in whole-slide imaging has enabled high-throughput slide scanning through the coordinated increase in scanner capacity, speed, and automation. Combining these hardware innovations with automated informatics approaches has enabled more efficient workflows and the opportunity to provide higher-quality imaging data using fewer personnel. Here we review several practical considerations for deploying high-throughput scanning and we present strategies to increase efficiency with a focus on quality. Finally, we review remaining challenges and issue a call to vendors to innovate in the areas of automation and quality control in order to make high-throughput scanning realizable to laboratories with limited resources.Entities:
Keywords: artificial intelligence; computational pathology; data repository; digital pathology; machine learning; virtual slide; whole-slide imaging; whole-slide scanning
Mesh:
Year: 2022 PMID: 35511469 PMCID: PMC9327504 DOI: 10.1002/path.5923
Source DB: PubMed Journal: J Pathol ISSN: 0022-3417 Impact factor: 9.883
Figure 1Conventional versus automated slide scanning. (A) A conventional scanning workflow involves significant pre‐ and post‐scanning interaction. (B) Automation leverages image analysis, streamlined QC, and automated processing. Steps that require manual intervention are represented in tan. Automated steps are represented in blue.
Figure 2Slide scanning area is impacted by tissue geometry. (A) A line scanner scans regions of the slide within one or more bounding boxes positioned around the tissue. An example is depicted by the dotted line. Sample image was acquired using a Hamamatsu NanoZoomer S360. (B) A tile scanner captures tiles of fixed size surrounding the tissue with a padding value that can be customized in some scanners. This has the potential to reduce scan time. Regions in darker gray as indicated by the arrows depict blank regions of the slide that were scanned, whereas the lighter regions represent unscanned areas. Sample image was from the same slide as A but acquired using a Zeiss AxioScan Z1.
Common whole‐slide scanner speeds and throughput
| A | B | C | D | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Scanner | Scan time (s) | Size (mm2) | Norm. time | Scan time (s) | Size (mm2) | Norm. time | Scan time (s) | Size (mm2) | Norm. time | Scan time (s) | Size (mm2) | Norm. time |
| Resection | ||||||||||||
| 1 | 66 | 479.4 | 31 | 257 | 692.1 | 83.6 | 541 | 537.1 | 226.6 | 1,022 | 506.2 | 454.2 |
| 2 | 87 | 695.7 | 28.1 | 272 | 677.1 | 90.4 | 778 | 777 | 225.3 | 1,150 | 448.8 | 576.6 |
| 3 | 67 | 490.7 | 30.7 | 195 | 457.7 | 95.9 | 528 | 571.7 | 207.8 | 905 | 439.6 | 463.2 |
| Biopsy | ||||||||||||
| 4 | 33 | 157.9 | 47 | 94 | 156.8 | 134.9 | 250 | 184.8 | 304.4 | 302 | 111.7 | 608.2 |
| 5 | 42 | 266.5 | 35.5 | 126 | 353.9 | 80.1 | 384 | 425.2 | 203.2 | 417 | 100 | 938 |
| 6 | 15 | 67.7 | 49.9 | 76 | 81.3 | 210.4 | 92 | 84 | 246.4 | 186 | 67.2 | 622.4 |
| IHC | ||||||||||||
| 7 | 120 | 768 | 35.2 | 184 | 377.5 | 109.7 | 810 | 897.7 | 203 | 959 | 290.9 | 741.7 |
| 8 | 108 | 625.2 | 38.9 | 150 | 486 | 69.4 | 681 | 803.2 | 190.8 | 775 | 199.2 | 875.4 |
| 9 | 11 | 41.4 | 59.8 | 37 | 34.6 | 240.6 | 81 | 57.7 | 315.6 | 207 | 36.7 | 1,268.9 |
| Capacity | 360 slides | 6 slides | 210 slides | 100 slides | ||||||||
We tested four scanners on a set of nine slides: three H&E‐stained resections, three H&E‐stained biopsies, and three immunohistochemistry (IHC) slides. Slides were purposefully selected to test a range of tissue geometries, and scanners were selected to demonstrate the impact of different strategies on scan time, noting that not all of the scanners tested share the attributes required to be considered ‘high throughput’. Pixel size of the generated images ranged from 0.22 to 0.25 μm per pixel. Normalized time (norm. time) represents the estimated time it would have taken to scan a 15 × 15 mm area of tissue, calculated by dividing scan time by tissue area and multiplying by 225 mm2. All images were confirmed to be of sufficient quality and to have captured the tissue in its entirety. Discrepancies in image size across scanners were due mostly to different scanning strategies employed, where some scanners conformed tight to the boundaries of the tissue or discarded blank space between biopsy cores, for example. The scanners were as follows: (A) Hamamatsu NanoZoomer S360 (Hamamatsu Photonics K.K., Hamamatsu City, Japan); (B) Roche VENTANA DP200 (Ventana Medical Systems, Oro Valley, AZ, USA); (C) Hamamatsu NanoZoomer S210 (Hamamatsu Photonics K.K.); and (D) Zeiss AxioScan Z1 (Zeiss, Oberkochen, Germany).
Figure 3Slide scanning efficiency following AI‐enabled QC strategy. (A) In this AI‐enabled QC strategy, following slide scanning, the WSI is evaluated automatically within the scanner (using the scanner vendor's software) and if rejected is automatically rescanned with a more stringent (but perhaps slower) scan profile. Accepted images pass to a potentially more accurate external QC algorithm which diverts the images to visual review if rejected. The outcome of visual review dictates whether a slide is accepted, rescanned, or rejected without further rescanning for slides in which a rescan has already been attempted. A small number of slides that have passed automated QC are also reviewed visually as a control. (B) The relative scan time of a batch increases linearly as a function of the proportion of slides that undergo auto‐rescan (‘rescan rate’). The model assumed that the more stringent profile took 1.5, 2, 3, or 4 times longer than the original scan profile (‘rescan multiplier’). Simulated scan times were drawn from a Poisson distribution with mean scan time of 60 s, and a constant 10 s slide‐loading time was added to each scan. (C) The sensitivity of the rescan threshold dictates the balance between the scan time of a batch and the proportion of slides that should have been rejected but were missed (‘missed slides’). This balance relies critically on the accuracy of the algorithm to detect such slides, which was modeled by adding Gaussian noise with standard deviation 0.1, 0.15, 0.2, and 0.25 to a simulated quality value uniformly sampled from 0 to 1. A threshold was then applied to the resultant quality value to determine whether a slide will be rescanned. The rescan multiplier was selected to be 2.0 and the rescan rate was 0.05 for this analysis. A thousand simulations were performed for B and C, and mean values are shown.