| Literature DB >> 33057024 |
Lina Maria Carmona Echeverria1,2, Aiman Haider3, Alex Freeman3, Urszula Stopka-Farooqui4, Avi Rosenfeld5, Benjamin S Simpson4, Yipeng Hu6, David Hawkes6, Hayley Pye4, Susan Heavey4, Vasilis Stavrinides4,7, Joseph M Norris4,7, Ahmed El-Shater Bosaily7,8, Cristina Cardona Barrena4, Simon Bott9, Louise Brown10, Nick Burns-Cox11, Tim Dudderidge12, Alastair Henderson13, Richard Hindley14, Richard Kaplan10, Alex Kirkham6,15, Robert Oldroyd16, Maneesh Ghei17, Raj Persad18, Shonit Punwani6,15, Derek Rosario19, Iqbal Shergill20, Mathias Winkler21, Hashim U Ahmed21,22, Mark Emberton7, Hayley C Whitaker4.
Abstract
Gleason score 7 prostate cancer with a higher proportion of pattern 4 (G4) has been linked to genomic heterogeneity and poorer patient outcome. The current assessment of G4 proportion uses estimation by a pathologist, with a higher proportion of G4 more likely to trigger additional imaging and treatment over active surveillance. This estimation method has been shown to have inter-observer variability. Fifteen patients with Prostate Grade Group (GG) 2 (Gleason 3 + 4) and fifteen patients with GG3 (Gleason 4 + 3) disease were selected from the PROMIS study with 192 haematoxylin and eosin-stained slides scanned. Two experienced uropathologists assessed the maximum cancer core length (MCCL) and G4 proportion using the current standard method (visual estimation) followed by detailed digital manual annotation of each G4 area and measurement of MCCL (planimetric estimation) using freely available software by the same two experts. We aimed to compare visual estimation of G4 and MCCL to a pathologist-driven digital measurement. We show that the visual and digital MCCL measurement differs up to 2 mm in 76.6% (23/30) with a high degree of agreement between the two measurements; Visual gave a median MCCL of 10 ± 2.70 mm (IQR 4, range 5-15 mm) compared to digital of 9.88 ± 3.09 mm (IQR 3.82, range 5.01-15.7 mm) (p = 0.64) The visual method for assessing G4 proportion over-estimates in all patients, compared to digital measurements [median 11.2% (IQR 38.75, range 4.7-17.9%) vs 30.4% (IQR 18.37, range 12.9-50.76%)]. The discordance was higher as the amount of G4 increased (Bias 18.71, CI 33.87-48.75, r 0.7, p < 0.0001). Further work on assessing actual G4 burden calibrated to clinical outcomes might lead to the use of differing G4 thresholds of significance if the visual estimation is used or by incorporating semi-automated methods for G4 burden measurement.Entities:
Mesh:
Year: 2020 PMID: 33057024 PMCID: PMC7561724 DOI: 10.1038/s41598-020-73524-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Objective measurement of MCCL and shows a discrepancy with visual measurement and pathologist estimation. (A) MCCL difference between visual and digital MCCL shows under-estimation in visual compared to digital MCCL. Bar plot of visual MCCL in yellow and digital MCCL in blue, organised by Gleason score. MCCL is plotted on the y-axis; each patient is plotted on the x-axis. Red dashed lines represent a threshold of 6 mm as the MCCL criterion for significance (PROMIS definition 1). Patients highlighted in red were over or underestimated in the original visual measurement. (B) Waterfall plot representing the difference between visual and digital measurements as digital MCCL-visual MCCL by Gleason score (y-axis), patients plotted on the x-axis. Visual Gleason score is represented in yellow for 3 + 4 and blue for 4 + 3. Bars with a negative value represent measurements where the visual MCCL was shorter than the digital MCCL (underestimation). Bars with a positive value represent cases were the visual MCCL was higher than the digital MCCL. The difference in 80% of cases is ± 2 mm (n = 24), red dashed line at − 2 and 2 mm difference. (C) Density plots representing the MCCL distribution between visual and digital images by Gleason scores. Y-axis represents the Kernel density estimation. The X-axis contains MCCL values. Visual MCCL score is represented in yellow and blue for the digital measurement. 4 + 3.The mean visual MCCL was 9.53 mm (5–15 mm) and the mean digital MCCL was 9.88 mm (5.01–15.74).
Figure 2Visual Gleason 4 appraisal overestimates burden of disease. (A) Bar plot of the proportion of Gleason 4 estimation average between two uropathologists (yellow) and digital estimation (blue). %G4 is plotted on the y-axis; each patient is plotted on the x-axis. A threshold of 50% g4 for clinical significance is shown as a red dashed line. Patient number on the x-axis is highlighted in bold and underlined if the digital measurement of their %G4 would lead to reclassification based on the digital value. Patient marked with * has ≥ 50% G4 in the digital measurement. (B) Bland–Altman plot representing the difference in measurement in the y-axis as visual %G4 – digital %G4. The x-axis represents the mean %G4 measurement of both techniques as (visual %G4 + digital %G4)/2. The bold black line represents complete agreement at 0. The purple dashed line corresponds to the bias at 18.71; the dotted purple line corresponds to the bias confidence interval (33.87–48.75). Dash and dotted blue lines correspond to the upper and lower limit of agreement and confidence intervals are plotted with dotted blue lines. Upper limit of agreement: 41.31 (33.87–48.75), lower limit of agreement: − 3.87 (− 11.31 to 3.56). Regression line is plotted as a continuous blue line.
Figure 3Objective measurement of Gleason 4 burden shows a discrepancy between visual measurement and the digital measurement for the index block. (A) Visual %G4 for the index block 30 patients shown in yellow overlaid with digital %G4 in blue. Patients separated by original Gleason grade grouping; 3 + 4 or 4 + 3, and organized by visual %G4. A threshold of 50% G4 for clinical significance is shown as a red dashed line. Patient number on the x-axis highlighted in bold and underlined if the objective measurement of their %G4 would cause reclassification. (B) Bland–Altman plot representing the difference in measurement in the y-axis as visual %G4 − digital %G4. The x-axis represents the mean %G4 measurement of both techniques as (visual %G4 + digital %G4)/2. The bold black line represents complete agreement at 0. The purple dashed line corresponds to the bias at 14.36; the dotted purple line corresponds to the bias confidence interval (9.78–18.94). Dash and dotted blue lines correspond to the upper and lower limit of agreement and confidence intervals are plotted with dotted blue lines. Upper limit of agreement: 38.40 (30.49–46.32), lower limit of agreement: − 9.67 (− 17.59 to − 1.76). The regression line is plotted as a continuous blue line.
Gleason 7 patients in the PROMIS cohort and 30 selected patients for in-depth analysis.
| UCH—PROMIS cohort (4 + 3 or ≥ 6 mm MCCL) | Selected 30 patients | |||||
|---|---|---|---|---|---|---|
| Gleason score | 3 + 4 | 4 + 3 | 3 + 4 | 4 + 3 | ||
| n = 67 (78%) | n = 18 (22%) | n = 15 (50%) | n = 15 (50%) | |||
| Age (years) | 63 (43–77) | 64 (48–79) | 0.44* | 62 (50–72) | 65 (48–79) | 0.30* |
| Prostate volume (cc) | 38.34 (16–83) | 38.18 (26–55) | 0.65** | 34 (21–62) | 38 (26–55) | 0.11** |
| Presenting PSA (ng/dL) | 7.46 (1.30–13) | 10.76 (5.7–15) | < 0.0001* | 7.60 (4.9–10.1) | 10.74 (6.2–15) | 0.0005* |
| PSA density (PSAd) | 0.22 (0.06–0.59) | 0.29 (0.11–0.53) | 0.002** | 0.24 (0.10–0.38) | 0.29 (0.11–0.53) | 0.14* |
| Likert 2 | 1 (1.4%) | 0 | 0 | 0 | ||
| Likert 3 | 8 (11.9%) | 3 (16.6%) | 1 (6.6%) | 0 | ||
| Likert 4 | 21 (31.3%) | 3 (16.6%) | 6 (40%) | 4 (26.6%) | ||
| Likert 5 | 5 (7.46%) | 12 (66.6%) | 8 (53.3%) | 11 (73.3%) | ||
| Likert NA | 4 (5.87%) | 0 | 0 | 1 (6.6%) | ||
Table comparing the Gleason 7 patients from University College London (UCH) within the PROMIS study. UCH PROMIS cohort is on the left, selected patients on the right. Number of patients per group by Gleason score in each cohort as n = , percentage in parenthesis. Mean value for age, prostate volume, presenting PSA and PSA density, with range in parenthesis. Age is denoted in years, prostate volume in cubic centimetres (cc), PSA in ng/dL and PSA density calculated as PSA/prostate volume. Likert scores are presented as number of patients and percentage in parenthesis, Likert NA when no Likert score was given. *p-value obtained using an unpaired t-test, **if using Mann–Whitney test.
Figure 4Patient selection and methods of digital manual annotation. (A) Euler diagram representing patient selection process for 30 patients for in-depth analysis. (B) NDPview2 image of scanned H&E slide of prostate cores from transperineal biopsies, where nuclei are shown in blue, and other structures in pink. From left to right, MCCL measurement in a straight core of 8.5 mm. Approximate visual pathologist measurement marked with a red line (7.76 mm). Following the axis of the core, three measurements in black of 2.53 mm, 2.11 mm and 4.48 mm for a total of 9.12 mm for the digital measurement. (C) Three prostate cores, areas with cancer were contoured in yellow, areas with Gleason 4 were contoured in black. Close up of contours shown in black box. Non-contoured areas correspond to benign prostatic tissue. (D) Equation used to derive percentage Gleason 4.