| Literature DB >> 35602201 |
Ellery Wulczyn1, Kunal Nagpal1, Matthew Symonds1, Melissa Moran1, Markus Plass2, Robert Reihs2, Farah Nader2, Fraser Tan1, Yuannan Cai1, Trissia Brown3, Isabelle Flament-Auvigne3, Mahul B Amin4, Martin C Stumpe1,5, Heimo Müller2, Peter Regitnig2, Andreas Holzinger2, Greg S Corrado1, Lily H Peng1, Po-Hsuan Cameron Chen1, David F Steiner1, Kurt Zatloukal2, Yun Liu1, Craig H Mermel1.
Abstract
Background: Gleason grading of prostate cancer is an important prognostic factor, but suffers from poor reproducibility, particularly among non-subspecialist pathologists. Although artificial intelligence (A.I.) tools have demonstrated Gleason grading on-par with expert pathologists, it remains an open question whether and to what extent A.I. grading translates to better prognostication.Entities:
Keywords: Prognostic markers; Prostate cancer
Year: 2021 PMID: 35602201 PMCID: PMC9053226 DOI: 10.1038/s43856-021-00005-3
Source DB: PubMed Journal: Commun Med (Lond) ISSN: 2730-664X
Cohort characteristics.
| Validation set 1 | Validation set 2 (subset of set 1) | ||
|---|---|---|---|
| Number of cases | 2807 | 1517 | |
| Number of slides | Total | 83,645 | 47,626 |
| Median per case (interquartile range) | 29 (25, 34) | 30 (26, 35) | |
| Overall survival (OS) | Median years of follow-up (interquartile range) | 13.1 (8.5, 17.2) | 11.2 (7.4, 15.2) |
| Censored (%) | 2150 (77%) | 1306 (86%) | |
| Observed (%) | 657 (23%) | 211 (14%) | |
| Disease-specific survival (DSS) (%) | Censored | 2673 (95%) | 1464 (97%) |
| Observed | 134 (5%) | 53 (3%) | |
| Grade Group (%) | 1 | 611 (22%) | 608 (40%) |
| 2 | 476 (17%) | 473 (31%) | |
| 3 | 224 (8%) | 224 (15%) | |
| 4 | 128 (5%) | 127 (8%) | |
| 5 | 85 (3%) | 85 (6%) | |
| Unknown | 1283 (46%) | 0 (0%) | |
| Pathologic T-stage (%) | T2 | 1640 (58%) | 1113 (73%) |
| T3 | 791 (28%) | 366 (24%) | |
| T4 | 25 (1%) | 6 (<1%) | |
| Unknown | 351 (13%) | 32 (2%) | |
| Age at diagnosis (%) | <60 | 952 (34%) | 537 (35%) |
| 60–70 | 1546 (55%) | 817 (54%) | |
| ≥70 | 309 (11%) | 163 (11%) | |
| Margin status (%) | Negative | 448 (16%) | 153 (10%) |
| Positive | 242 (9%) | 96 (6%) | |
| Unknown | 2117 (75%) | 1268 (84%) | |
| Pathologic N-stage (%) | N0 | 1395 (50%) | 879 (58%) |
| N1 | 77 (3%) | 62 (4%) | |
| N2 | 13 (<1%) | 4 (<1%) | |
| N3 | 10 (<1%) | 8 (1%) | |
| Unknown | 1312 (47%) | 564 (37%) | |
| Received hormone or chemotherapy (%) | Yes | 53 (2%) | 33 (2%) |
| No/unknown | 2754 (98%) | 1484 (98%) | |
| Received radiation therapy (%) | Yes | 277 (10%) | 176 (12%) |
| No/unknown | 2530 (90%) | 1341 (88%) | |
| Biochemical recurrence (%) | Censored | 338 (12%) | 228 (15%) |
| Observed | 95 (3%) | 55 (4%) | |
| No follow-up | 2374 (85%) | 1234 (81%) | |
Validation set 1 contains all prostatectomy cases from the Biobank Graz between 1995 and 2014. Validation set 2 was derived by first considering cases in the Gleason grading era at the institution (years 2000–2014; n = 2191), and then further filtering for cases where a Gleason score was recorded and available in the pathology report (n = 1517).
C-index for pathologist and A.I. grading.
| Validation set 1 ( | Validation set ( | |
|---|---|---|
| (A) Pathologist Grade Groups | N/Aa | 0.79 [0.71, 0.86] |
| (B) A.I. risk score (continuous) | ||
| (C) A.I. risk groups (discretized) | 0.82 [0.78–0.85] | 0.85 [0.79, 0.90] |
| (D) Average of (A) and (C) | N/Aa | 0.86 [0.80–0.91] |
The A.I. risk score (B) is a continuous risk score from a Cox regression fit on Gleason pattern percentages from the A.I. The A.I. risk group (C) is a discretized version of the A.I. risk score. The discretization was done to match the number and frequency of pathologist Grade Groups in validation set 2. (D) Represents the average of the Pathologist Grade Group and A.I. risk groups. In validation set 2, the C-index for the A.I. risk score was statistically significantly higher than that for the pathologists’ Grade Group (p < 0.05, prespecified analysis). Bold indicates the highest value in each column (dataset).
aNot available because pathologist Grade Groups were not available for all cases in validation set 1 due to the earlier time period.