| Literature DB >> 36268077 |
Olsi Kusta1,2, Charlotte Vestrup Rift3, Torsten Risør4,5, Eric Santoni-Rugiu3,6, John Brandt Brodersen7,8.
Abstract
Introduction: Digital pathology solutions are increasingly implemented for primary diagnostics in departments of pathology around the world. This has sparked a growing engagement on validation studies to evaluate the diagnostic performance of whole slide imaging (WSI) regarding safety, reliability, and accuracy. The aim of this review was to evaluate the performance of digital pathology for diagnostic purposes compared to light microscopy (LM) in human pathology, based on validation studies designed to assess such technologies.Entities:
Keywords: Diagnostic concordance; Diagnostic test accuracy; Human pathology; Overdiagnosis; Validation studies; Whole slide imaging (WSI)
Year: 2022 PMID: 36268077 PMCID: PMC9577136 DOI: 10.1016/j.jpi.2022.100136
Source DB: PubMed Journal: J Pathol Inform
Fig. 1Flowchart based on Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMAa) guidelines.
aThe figure was drafted based on a freely available template at http://prisma-statement.org/documents/PRISMA%202009%20flow%20diagram.pdf.
Judgement for Risk of Bias summarized for domains (QUADAS 2)a.
| Authors | Patient selection | Index test | Reference standard | Flow and timing |
|---|---|---|---|---|
| Ammendola et al. | ? | ? | ||
| Brunyé et al. | ||||
| Cima et al. | ||||
| Elmore et al. | ||||
| Larghi et al. | ||||
| Nielsen et al. | ? | |||
| Perez et al. | ? | |||
| Ribback et al. | ? | |||
| Tawfik et al. | ? | |||
| Tawfik et al. | ? | |||
| Tissier et al. | ? | ? | ||
| Zoroquiain et al. | ? |
Table adapted from the freely available template at https://view.officeapps.live.com/op/view.aspx?src=http%3A%2F%2Fwww.bristol.ac.uk%2Fmedia-library%2Fsites%2Fquadas%2Fmigrated%2Fdocuments%2Ftable.docx&wdOrigin=BROWSELINK.
Applicability concerns for the respective domains (QUADAS 2)a.
| Authors | Patient selection | Index test | Reference standard |
|---|---|---|---|
| Ammendola et al. | |||
| Brunyé et al. | |||
| Cima et al. | ? | ||
| Elmore et al. | |||
| Larghi et al. | |||
| Nielsen et al. | |||
| Perez et al. | |||
| Ribback et al. | |||
| Tawfik et al. | |||
| Tawfik et al. | |||
| Tissier et al. | ? | ? | |
| Zoroquiain et al. |
Table adapted from the freely available templates at https://view.officeapps.live.com/op/view.aspx?src=http%3A%2F%2Fwww.bristol.ac.uk%2Fmedia-library%2Fsites%2Fquadas%2Fmigrated%2Fdocuments%2Ftable.docx&wdOrigin=BROWSELINK.
Because final FS-FFPE diagnosis based on frozen sections (FS) or formalin-fixed and paraffin embedded (FFPE) biopsies may differ from the original assessment even during routine use of LM with frozen section.
This refers to the comparison of accuracy of WSI with LM to identify microorganisms and not human cells.
Fig. 2The proportion of the Risk of Bias and Applicability Concerns (QUADAS 2)a.
aThe drafted figure is a template freely available at https://view.officeapps.live.com/op/view.aspx?src=http%3A%2F%2Fwww.bristol.ac.uk%2Fmedia-library%2Fsites%2Fquadas%2Fmigrated%2Fdocuments%2Fgraphs.xlsx&wdOrigin=BROWSELINK.
Primary outcomes of diagnostic test accuracy (DTA) indicators and diagnostic concordance.
| Source | Subspecialty | Diagnostic purpose | Primary outcomes | ||||||||
| Ammendola et al. | Surgical Neuropathology | Grading of meningioma | |||||||||
| Observer 1 | Observer 2 | Observer 3 | Observer 4 | ||||||||
| Brain invasion | 0.50 | 0.50 | 0.51 | 0.51 | 0.53 | 0.55 | 0.50 | 0.55 | |||
| High mitotic index | 0.64 | 0.72 | 0.60 | 0.61 | 0.58 | 0.65 | 0.56 | 0.68 | |||
| Hypercellularity | 0.54 | 0.52 | 0.58 | 0.58 | 0.50 | 0.50 | 0.54 | 0.50 | |||
| Sheeting | 0.57 | 0.52 | 0.59 | 0.59 | 0.55 | 0.59 | 0.50 | 0.62 | |||
| Macronucleoli | 0.53 | 0.51 | 0.55 | 0.53 | 0.51 | 0.53 | 0.53 | 0.53 | |||
| Small cells | 0.55 | 0.51 | 0.63 | 0.61 | 0.54 | 0.53 | 0.52 | 0.54 | |||
| Spontaneous necrosis | 0.51 | 0.52 | 0.61 | 0.61 | 0.51 | 0.51 | 0.56 | 0.54 | |||
| Brunyé et al. | Breast pathology | Classification of breast neoplasms | |||||||||
| Benign | 71% (61–82%) | 29% (20–40%) | - | ||||||||
| Atypia | 37% (29–45%) | 21% (15–28%) | 43% (35–50%) | ||||||||
| Ductal Carcinoma in Situ (DCIS) | 52% (43–61%) | 17% (12–23%) | 31% (25–39%) | ||||||||
| Invasive breast cancer | 94% (88–99%) | – | 6% (2–14%) | ||||||||
| Cima et al. | Multiple subspecialties and organs | Cancer staging (surgical margins, tumor biology, lymph node status) and organ quality for transplantation | |||||||||
| 100% | 96% | ||||||||||
| 96% | 75% | ||||||||||
| 95% | 96% | ||||||||||
| 100% | 75% | ||||||||||
| 97% (к=0.96, CI: 0.941–0.985) | 86% (к=0.91, CI: 0.877–0.958) | ||||||||||
| Elmore et al. | Breast pathology | Diagnosis of breast cancer | |||||||||
| Benign without atypia | 97.1% (96.7–97.4%) | 95.7% (95.0–96.4%) | |||||||||
| Atypia | 37.8% (33.6–42.7%) | 27.8% (23.9–32.5%) | |||||||||
| Ductal Carcinoma in situ (DCIS) | 69.6% (64.4–75.3%) | 57.1% (50.6–64.8%) | |||||||||
| Invasive breast cancer | 97.7% (96.5–98.7%) | 97.2% (95.6–98.6%) | |||||||||
| Larghi et al. | Pancreatic pathology | Diagnostic classification according to the Papanicolau Society of Cytopathology system for reporting pancreatobiliary cytology | |||||||||
| 92% | 93% | ||||||||||
| 96% | 88% | ||||||||||
| 99% | 99% | ||||||||||
| 51% | 52% | ||||||||||
| 92% | 92% | ||||||||||
| Nielsen et al. | Dermatopathology | Diagnosing neoplasms of the skin: benign, premalignant, and malignant | |||||||||
| 92% (85–96%) | 86% (78–91%) | ||||||||||
| 99.5% (97–99.5%) | 99% (97–99.5%) | ||||||||||
| 93% (86–96.5%) | 92% (84.5–95.5%) | ||||||||||
| 98% (97–99%) | 97% (96–98%) | ||||||||||
| 72.4% | 69.6% | ||||||||||
| Perez et al. | Not specified | Diagnosing neoplasms: benign, suspicious, and malignant | |||||||||
| 87.9% | |||||||||||
| 95.7% | |||||||||||
| 97.1% | |||||||||||
| 82.7%. | |||||||||||
| 87% (163/186) | |||||||||||
| Ribback et al. | Urology, gynecology, and dermatopathology | Tumor diagnosis and assessment of surgical margin | |||||||||
| 92.6% | |||||||||||
| 99.0% | |||||||||||
| 98.3% | |||||||||||
| 97.7% | |||||||||||
| 98.35% | |||||||||||
| Tawfik et al. | Gynecological pathology | Assessing if negative for intraepithelial lesion or malignancy | |||||||||
| Bacterial vaginosis | 92% | ||||||||||
| Trichomona vaginalis | 91% | ||||||||||
| Fungi | 95% | ||||||||||
| Tawfik et al. | Gynecological pathology | Diagnosing for neoplasms, cellular changes, and infectious agents according to 2001 Bethesda reporting system and terminology | |||||||||
| Atypical squamous cells of undetermined significance (ASCUS) | 58.3% | 85.1% | |||||||||
| Low-grade squamous intraepithelial lesions (LSIL) | 54.1% | 93.9% | |||||||||
| High-grade squamous intraepithelial lesions (HSIL) | 51.8% | 98.8% | |||||||||
| Atypical glandular cells of undetermined significance (AGUS) | 32.8% | 99.1% | |||||||||
| Atypical squamous cells, cannot exclude high-grade squamous intraepithelial lesion (ASC-H) | 23.5% | 99.5% | |||||||||
| Any condition | 82.1% | 86.2% | |||||||||
| Tissier et al. | Nephropathology | Classification of adrenocortical tumor by Weiss score | |||||||||
| 86% | 94% | ||||||||||
| 100% | 93% | ||||||||||
| Zoroquiain et al. | Ocular pathology | Identification of prognostic factors for retinoblastoma | |||||||||
| Optic nerve invasion | Invasion and spread | Growth pattern of retinoblastoma | Calcification | ||||||||
| 100% | 100% | 100% | 97.8% | ||||||||
| 100% | 100% | 100% | 100% | ||||||||
Area under the curve (AUC) is the probability where the test with the target condition will have a higher value than the test without the target condition. It is represented with values from 0 to 1 and not in percentage.
Histopathological features are the main diagnostic findings that help to grade meningioma.
Above consensus means over-interpretation of the test to a higher breast cancer stage.
Below consensus is the opposite, under-interpretation to a lower stage.
Pathologist interpretation is used to denote the comparison during the validation study between WSI and LM, where pathologists have used both technologies.
Range of percentages in diagnostic concordance not reported.
Range of diagnostic concordance consists in the ratio of the cases that agreed with the consensus diagnosis and the total number of cases.
This is the average performance of WSI for all the above diagnostic categories but adjusted for the number of cases for each of the category.
Weiss score is a reference method to distinguish between a benign and a malignant adrenocortical tumor (ACT).
The study was designed in two stages of using WSI for the examination of the sample and the term ‘reading’ is used by the authors.
Additional outcomes for intra- and interobserver variability
| Source | Secondary outcome | |||||||
| Ammendola et al. | Surgical neuropathology | |||||||
| Atypical meningioma | 91% | 86% | 74% | 94% | 89% | |||
| Brain invasion | 100% | 91% | 86% | 97% | 94% | |||
| High mitotic index | 80% | 79% | 77% | 71% | 78% | |||
| Hypercellularity | 94% | 82% | 97% | 91% | 93% | |||
| Sheeting | 97% | 97% | 77% | 94% | 96% | |||
| Macronucleoli | 94% | 82% | 100% | 83% | 89% | |||
| Small cells | 97% | 94% | 97% | 91% | 96% | |||
| Spontaneous necrosis | 97% | 91% | 94% | 94% | 94% | |||
| Atypical meningioma | 54% | 63% | 60% | 74% | ||||
| Atypical for major criteria | 69% | 86% | 80% | 86% | ||||
| Atypical for minor criteria | 46% | 60% | 63% | 77% | ||||
| Brain invasion | 83% | 97% | 93% | 97% | ||||
| High mitotic index | 80% | 86% | 69% | 80% | ||||
| Hypercellularity | 74% | 77% | 86% | 86% | ||||
| Sheeting | 57% | 74% | 66% | 77% | ||||
| Macronucleoli | 37% | 49% | 40% | 51% | ||||
| Small cells | 34% | 49% | 34% | 49% | ||||
| Spontaneous necrosis | 26% | 51% | 31% | 54% | ||||
| Brain invasion | 83% | 89% | ||||||
| High mitotic index | 80% | 69% | ||||||
| Hypercellularity | 74% | 86% | ||||||
| Sheeting | 57% | 66% | ||||||
| Macronucleoli | 37% | 40% | ||||||
| Small cells | 34% | 34% | ||||||
| Spontaneous necrosis | 27% | 31% | ||||||
| Elmore et al. 2017 | Breast pathology | |||||||
| LM VS LM | 79% | |||||||
| WSI VS WSI | 73% | |||||||
| LM VS WSI | 77% | |||||||
| WSI VS LM | 76% | |||||||
| Larghi et al. | Pancreatic pathology | |||||||
| Diagnostic classification | к | 84.5% [к 0.79; CI 0.71–0.88] | 83.5% [к 0.78; CI 0.69–0.87] | |||||
| Presence of core tissue | к = 0.68, 95% CI 0.59−0.77 | 79.3% [к 0.59; CI 0.45–0.72] | 76.3% [к 0.53; CI 0.40–0.66] | |||||
| Number of lesional cells | к = 0.67, 95% CI 0.56−0.77 | 74.3% [к 0.62; CI 0.52–0.71] | 68.7% [к 0.53; CI 0.43–0.63] | |||||
| Percentage of lesional cells | к = 0.77, 95% CI 0.71−0.83 | 50.2% [к 0.40; CI 0.30–0.50] | 50.2% [к 0.38; CI 0.28–0.47] | |||||
| Mean | 78.3% [к 0.67; CI 0.57–0.78] | 77.8% [к 0.67; CI 0.57–0.77] | ||||||
| Nielsen et al. | Dermatopathology | |||||||
| LM | 0.91 | 0.94 | 0.91 | 0.97 | 0.84 | 0.81 | ||
| WSI | 0.97 | 0.86 | 0.95 | 0.95 | 0.85 | 0.82 | ||
| Tawfik et al. | Gynecological pathology | |||||||
| Negative | 0.74 (0.67–0.80) | 0.49 (0.39–0.60) | 0.63 (0.52–0.73) | 0.79 (0.70–0.87) | 0.61 (0.52–0.70) | 0.68 | ||
| Atypical squamous cells of undetermined significance (ASCUS) (95% CI) | 0.46 (0.39–0.52) | 0.21 (0.10–0.32) | 0.36 (0.25–0.46) | 0.45 (0.36–0.44) | 0.33 (0.24–0.43) | 0.39 | ||
| Low-grade squamous intraepithelial lesions (LSIL) (95% CI) | 0.53 (0.47–0.59) | 0.41 (0.31–0.52) | 0.52 (0.42–0.63) | 0.55 (0.46–0.64) | 0.51 (0.42–0.60) | 0.51 | ||
| High-grade squamous intraepithelial lesions (HSIL) (95% CI) | 0.58 (0.52–0.64) | 0.36 (0.26–0.46) | 0.42 (0.31–0.52) | 0.58 (0.49–0.67) | 0.54 (0.45–0.63) | 0.52 | ||
| Tissier et al. | Nephropathology | |||||||
| Weiss≥3 vs 0–2 | 0.83 | 0.70 (0.67–0.74) | 0.75 (0.72–0.79) | |||||
| Necrosis | 0.75 | 0.78 (0.74–0.81) | 0.83 (0.79–0.86) | |||||
| ≤25% clear cells | 0.42 | 0.71 (0.68–0.75) | 0.80 (0.77–0.83) | |||||
| Venous Invasion | 0.58 | 0.54 (0.50–0.57) | 0.54 (0.50–0.57) | |||||
| Mitotic figures | 0.42 | 0.54 (0.50–0.57) | 0.65 (0.62–0.69) | |||||
| Capsular Invasion | 0.25 | 0.49 (0.45–0.52) | 0.50 (0.47–0.54) | |||||
| Diffuse architecture | 0.33 | 0.41 (0.37–0.44) | 0.50 (0.46–0.53) | |||||
| Nuclear grade | 0.25 | 0.39 (0.36–0.43) | 0.45 (0.41–0.48) | |||||
| Atypical mitotic figures | 0.25 | 0.29 (0.26–0.33) | 0.46 (0.43–0.50) | |||||
| Sinusoidal invasion | 0 | 0.40 (0.37–0.44) | 0.30 (0.27–0.33) | |||||
| Weiss modified by Aubert et al ≥3 vs 0–2 | 0.50 | 0.67 (0.64–0.70) | 0.75 (0.72–0.78) | |||||
Interobserver concordance was measured between all the observers (pathologists), but also between senior pathologists versus all the observers that participated in the validation study.
Here all the possible combination of comparisons between LM and WSI were tried based on intraobserver agreement.
Beside the diagnostic classification, in this study other diagnostic features were considered, therefore we use the term “parameters”.
Kappa (к) statistics is used to assess observer agreement for intervention(s).
At Nielsen et al., they use the term ‘review’ instead of ‘reading’. We have chosen the latter for a consistent terminology (as it is used e.g. in Tissier et al.).
The case does not have the target condition.
Weiss score is a reference method to distinguish between a benign and a malignant adrenocortical tumor (ACT).