Literature DB >> 34194136

A second opinion pathology review improves the diagnostic concordance between prostate cancer biopsy and radical prostatectomy specimens.

Takanori Maehara1, Takuya Sadahira1, Yuki Maruyama1, Koichiro Wada1, Motoo Araki1, Masami Watanabe1, Toyohiko Watanabe1, Hiroyuki Yanai2, Yasutomo Nasu1.   

Abstract

OBJECTIVES: The Gleason scoring system is an essential tool for determining the treatment strategy in prostate cancer (PCa). However, the Gleason grade group (GGG) often differs between needle-core biopsy (NCB) and radical prostatectomy (RP) specimens. We investigated the diagnostic value of a second opinion pathology review using NCB specimens in PCa.
MATERIALS AND METHODS: We retrospectively evaluated 882 patients who underwent robot-assisted RP from January 2012 to September 2019. Of these, patients whose original biopsy specimens were obtained from another hospital and reviewed by the urological pathology expert at our institution were included in the study. Patients who received neoadjuvant hormonal therapy were excluded from the study. Weighted kappa (k) coefficients were used to evaluate the diagnostic accuracy of each review.
RESULTS: A total of 497 patients were included in this study. Substantial agreement (weighted k = 0.783) in the GGG between initial- and second-opinion diagnoses based on NCB specimens was observed in 310 cases (62.4%). Although diagnoses based on a single opinion showed moderate agreement with the GGG of RP specimens (initial: 35.2%, weighted k = 0.522; second opinion; 38.8%, weighted k = 0.560), matching initial and second opinion diagnoses improved the concordance (42.9%, 133/310 cases) to substantial agreement (weighted k = 0.626).
CONCLUSIONS: A second opinion of PCa pathology helps to improve the diagnostic accuracy of NCB specimens. However, over half of diagnoses that matched between the initial and second opinions differed from the diagnosis of RP specimens. Copyright:
© 2021 Urology Annals.

Entities:  

Keywords:  Gleason grade group; Gleason score; prostate biopsy; prostate cancer; prostatectomy

Year:  2021        PMID: 34194136      PMCID: PMC8210712          DOI: 10.4103/UA.UA_81_20

Source DB:  PubMed          Journal:  Urol Ann        ISSN: 0974-7796


INTRODUCTION

The Gleason score (GS) is the most important parameter used to predict tumor aggressiveness and select the appropriate therapeutic management strategy for prostate cancer (PCa).[1] An accurate diagnosis is especially essential for PCa, as there are several treatments available for this cancer.[2] However, previous studies have reported a significant discrepancy in the GS between needle core biopsy (NCB) and radical prostatectomy (RP) specimens (28%–76% of cases).[345] In many of these cases, the GS is underestimated in NCB specimens, which leads to an upgraded diagnosis in up to 43% of patients after RP.[56] These diagnostic errors are associated with multiple factors, including the number and length of the biopsy cores obtained, tumor location, pathologist misinterpretation, and interobserver variability.[347] To improve accuracy, a mandatory second opinion review by urological pathology experts has been recommended in several publications.[389] Such reviews can result in a change in GS from the original diagnosis in a significant number of specimens, leading to an alteration in therapeutic management.[7810] At our institution, pathology slides of biopsy specimens and reports from previous institutions are routinely reviewed by a specialized urological pathologist before further clinical management, as a mandatory review program.[11] A new PCa pathological grading system was recently proposed to reflect tumor biology better than the previous Gleason scoring system.[110] This system comprises five modified grading groups. Specifically, GS 3 + 3 disease is categorized as Gleason grade group (GGG) 1, GS 3 + 4 disease as GGG2, GS 4 + 3 disease as GGG3, GS 4 + 4, 3 + 5, and 5 + 3 disease as GGG4, and GS 4 + 4, 5 + 4, and 5 + 5 disease as GGG5. In a validation study that included <25,000 PCa patients, the GGG resulted in a more accurate prognostic prediction, increasing the C statistic by 0.02–0.05 compared with the three GS groups (≤6, 7, and 8–10).[12] To date, only a few reports have evaluated the value of a second pathology review using this new grading system. In this study, we determined whether a second-opinion review is still mandatory for improving GS accuracy between NCB and RP specimens in the GGG era.

MATERIALS AND METHODS

We retrospectively collected data from 882 PCa patients who underwent robot-assisted RP (RARP) at Okayama University Hospital between January 2012 and September 2019. NCB slides, which were prepared elsewhere, were routinely reviewed by a urological pathology expert at our hospital. Of these 882 patients, 497 fulfilled all inclusion criteria for this study: (a) initial NCB pathology evaluation conducted at a different hospital (patients who underwent the initial NCB at our hospital were excluded), (b) second opinion evaluation of the same specimen conducted at our hospital, (c) an available GGG (specimens from a different hospital with an inadequate grade (e.g., GS 1 + 2) were excluded), and (d) no hormonal therapy received before RARP. We collected data on age, body mass index, prostate volume, initial prostate-specific antigen (PSA) level, PSA density, and numbers of biopsy cores and positive cores. Formal ethical approval for this study was obtained from the Okayama University Institutional Review Board (registration no. 1004) before study initiation. All patients provided written informed consent for the use of their clinical records. NCB slides from elsewhere were reviewed, and evaluations of RP specimens were performed by the same urological pathology expert from our hospital, who was blinded to the previous NCB results. A GS was assigned to each lesion based on the sum of the primary and secondary patterns of the lesion; for patients with more than one positive lesion, the highest GS among all NCB and RP specimen evaluations was adopted. The GS assigned to each NCB specimen was compared with that assigned to the corresponding RP specimen, and a major GS discrepancy was defined as a difference in the GGG category between the two specimen types. The five GGG categories are GGG1 (GS 3 + 3), GGG2 (GS 3 + 4 = 7), GGG3 (GS 4 + 3 = 7), GGG4 (GS 8), and GGG5 (GS 9 or 10).[18] To determine whether a second opinion pathology review improves the diagnostic accuracy, the agreement in the GGG between the NCB and RP specimens was calculated according to weighted Cohen's kappa (k) coefficients (5 × 5 tables; linear weights for categorical variables and quadratic weights for ordinal variables). k = 1 indicates that the raters are in complete agreement. k = 0 indicates no agreement among the raters other than what would be expected by chance. The agreement was categorized as almost perfect (k ≥ 0.81), substantial (k = 0.61–0.80), moderate (k = 0.41–0.60), fair (k = 0.21–0.40), or slight (k ≤ 0.20). The statistical analyses were performed using (Saitama Medical Center, Jichi Medical University, Saitama, Japan), a graphical user interface for R.[13]

RESULTS

From January 2012 to September 2019, 882 patients underwent RARP for PCa. Of these patients, 497 with available outside initial opinion, internal second opinion, and internal RP evaluations were included in the analysis. For the diagnosis of PCa, transrectal ultrasound-guided biopsy (TRUS-GB) was performed; no magnetic resonance imaging-guided biopsy (MRI-GB) or saturation template biopsy was performed. The characteristics of the patients and GGG distribution among the specimens are shown in Table 1. The rates of GGG1 and GGG4 were decreased in the RP compared with the NCB specimens, and that of GGG5 tended to be increased in the RP specimens. However, the median GGG was the same among the initial and second opinion NCB and RP specimens: three (interquartile range, 2–4). One case, which was diagnosed as GGG4 by the initial pathologist and as GGG2 by the second pathologist, was diagnosed with high-grade prostatic intraepithelial neoplasia after RARP.
Table 1

Demographic and clinical characteristics of the overall cohort (n=497)

n (%)
Age, median (IQR)68 (64-72)
BMI, median (IQR)23.7 (22.0-25.5)
Prostate volume, median (IQR)28 (21-37)
Initial PSA, median (IQR)7.4 (5.3-10.9)
PSA density, median (IQR)0.28 (0.19-0.44)
Transrectal/transperineal biopsy, n (%)352 (70.8)/145 (29.2)
Clinical T-stage, n (%)
 T1100 (20)
 T2365 (73)
 T332 (7)
Number of biopsy cores, median (IQR)11 (10-14)
Positive cores on initial opinion, median (IQR)3 (2-5)
Positive cores on second opinion, median (IQR)3 (2-5)

Gleason grade groupInitial, n (%)Second, n (%)Pathology, n (%)

Atypical--1 (1)
 1123 (25)105 (21)38 (8)
 2122 (25)106 (21)183 (36)
 375 (15)82 (16)122 (25)
 4121 (24)140 (29)54 (11)
 556 (11)64 (13)99 (20)

BMI: Body mass index, IQR: Interquartile range, PSA: Prostate specific antigen

Demographic and clinical characteristics of the overall cohort (n=497) BMI: Body mass index, IQR: Interquartile range, PSA: Prostate specific antigen Discrepancies in the GGG of the NCB specimens between the initial and second opinions are listed in Table 2. Of the 497 cases, the agreement was observed in 310 (62.3%). Among the 187 cases with a major discrepancy, the diagnosis was upgraded in 119 (63.6%) and downgraded in 68 (36.4%) after review. Both the GGG1 and GGG5 assignments had a higher rate of agreement (both 69%) between the initial and second opinions, whereas agreement between the initial and second opinions for GGG3 was observed in only half of the cases.
Table 2

Agreement in the Gleason grade group between the initial and second evaluations of needle core biopsy specimens

Initial GGGSecond GGGTotal

GGG 1GGG 2GGG 3GGG 4GGG 5
GGG1 (≤3+3)84 (69)23 (19)8 (6)8 (6)0 (0)123
GGG2 (3+4)17 (14)70 (57)17 (14)17 (14)1 (1)122
GGG3 (4+3)1 (1)8 (11)38 (51)21 (28)7 (9)75
GGG4 (8)3 (2)5 (4)17 (14)79 (66)17 (14)121
GGG5 (9, 10)0 (0)0 (0)2 (4)15 (27)39 (69)56
Total1051068214064497

GGG: Gleason grade group

Agreement in the Gleason grade group between the initial and second evaluations of needle core biopsy specimens GGG: Gleason grade group Discrepancies in the GGG of the initial and second opinion NCB specimens compared with the RP specimens are shown in Tables 3 and 4, respectively. GGG agreement was observed in 35.2% (175/497) of cases between the initial opinion NCB and RP specimens and in 38.8% (193/497) of cases between the second opinion NCB and RP specimens. The rate of GGG upgrade after RARP was higher for the initial opinion evaluation (41.0%; 204/497) than second opinion evaluation (33.4%; 166/497).
Table 3

Agreement in the Gleason grade group between radical prostatectomy specimens and the initial evaluation of needle core biopsy specimens

Initial GGGAtypicalRadical prostatectomy GGGTotal

GGG 1GGG 2GGG 3GGG 4GGG 5
GGG1 (≤3+3)0 (0)29 (24)63 (50)18 (15)6 (5)7 (6)123
GGG2 (3+4)0 (0)6 (5)71 (58)29 (24)6 (5)10 (8)122
GGG3 (4+3)0 (0)2 (3)25 (33)20 (27)16 (21)12 (16)75
GGG4 (8)1 (1)1 (1)15 (12)45 (36)22 (18)37 (31)121
GGG5 (9, 10)0 (0)0 (0)9 (16)10 (18)4 (7)33 (59)56
Total1381831225499497

GGG: Gleason grade group

Table 4

Agreement in the Gleason grade group between radical prostatectomy specimens and second-opinion review of needle core biopsy specimens

Second GGGAtypicalRadical prostatectomy GGGTotal

GGG 1GGG 2GGG 3GGG 4GGG 5
GGG 1 (≤3+3)0 (0)29 (28)52 (49)13 (12)5 (5)6 (6)105
GGG 2 (3+4)1 (1)6 (6)67 (63)22 (20)4 (4)6 (6)106
GGG 3 (4+3)0 (0)1 (1)32 (39)28 (35)11 (13)10 (12)82
GGG 4 (8)0 (0)2 (1)24 (17)48 (34)29 (21)37 (27)140
GGG 5 (9, 10)0 (0)0 (0)8 (13)11 (17)5 (8)40 (62)64
Total1381831225499497

GGG: Gleason grade group

Agreement in the Gleason grade group between radical prostatectomy specimens and the initial evaluation of needle core biopsy specimens GGG: Gleason grade group Agreement in the Gleason grade group between radical prostatectomy specimens and second-opinion review of needle core biopsy specimens GGG: Gleason grade group The GGG concordance rates between the RP specimens and the NCB specimens in which the first and second pathologists’ assignments were the same (n = 310) are summarized in Table 5. In this analysis, the GGG concordance rate between RP and NCB specimens improved to 42.9% (133/310), compared with the comparisons based on the NCB initial or second opinions alone. Of the cases without concordance, the GGG was upgraded in 35.2% (109/310) and downgraded in 21.9% (68/310) of cases after RARP. Specifically, 50% (42/84) and 20% (17/84) of the GGG1 cases were upgraded to GGG2 and an even higher-grade group, respectively.
Table 5

Agreement in the Gleason grade group between needle core biopsy (with matching initial- and second-opinion Gleason grade group) and radical prostatectomy specimens

1st=2nd GGGRadical prostatectomy GGGTotal

GGG 1GGG 2GGG 3GGG 4GGG 5
GGG 1 (≤3+3)25 (30)42 (50)9 (10)4 (5)4 (5)84
GGG 2 (3+4)2 (3)50 (71)13 (19)1 (1)4 (6)70
GGG 3 (4+3)0 (0)15 (39)13 (34)6 (16)4 (11)38
GGG 4 (8)1 (1)6 (8)32 (41)18 (23)22 (27)79
GGG 5 (9, 10)0 (0)5 (13)5 (13)2 (5)27 (69)39
Total28118723161310

GGG: Gleason grade group

Agreement in the Gleason grade group between needle core biopsy (with matching initial- and second-opinion Gleason grade group) and radical prostatectomy specimens GGG: Gleason grade group To determine the diagnostic improvement potential of second opinion pathology review, weighted Cohen's kappa coefficients were used [Table 6]. Initial and second opinion GGGs showed substantial agreement (weighted k = 0.783). The weighted k value improved from 0.522 when based on the initial opinion to 0.560 when based on the second opinion, both values indicating “moderate” agreement. In addition, when only the NCB specimens exhibiting GGG agreement between the initial and second opinions were compared with the RP specimens, the agreement (k = 0.626) improved to “substantial.”
Table 6

The diagnostic concordance between specimens and the weighted kappa coefficients

Concordance rateWeighted kappa coefficient (95% CI)
Initial×second opinion62.3% (310/497)0.783 (0.743-0.822)
Initial×RP specimen35.2% (175/497)0.522 (0.453-0.592)
Second opinion×RP specimen38.8% (193/497)0.560 (0.495-0.627)
1st=2nd×RP specimen42.9% (133/310)0.626 (0.547-0.706)

CI: Confidence interval, RP: Radical prostatectomy

The diagnostic concordance between specimens and the weighted kappa coefficients CI: Confidence interval, RP: Radical prostatectomy

DISCUSSION

The present study demonstrates the efficacy of second opinion pathology review for improving the GGG concordance between NCB and RP specimens. The accuracy of the initial opinion GGG was improved by the second opinion review (from k = 0.52 to k = 0.56). The GGG concordance further improved to “substantial” (k = 0.63) when the RP specimens were compared exclusively with the NCB specimens exhibiting agreement between the initial and second opinions. However, it should be noted that the GGG in over half of those NCB specimens was discordant with that of the RP specimens. The GGG has been identified as the most important independent risk factor for PSA recurrence-free survival and strongly influences PCa treatment strategies.[1214] Thus, an accurate determination of the GGG in NCB specimens is critical for the management of PCa patients. Interpretation bias and sampling error are two major factors potentially affecting the GS interpretation in NCB specimens.[1516] Regarding interpretation bias, numerous studies have investigated the reproducibility of this classification tool across pathologists, with GS agreement values (k) ranging from 0.41–0.64 in NCB specimens.[717] Such bias is caused by the different interpretations of the gray areas between adjacent grades of the GS system, particularly between GS 3 and GS 4.[18] A previous study revealed that variations in the interpretation of criteria result in a lower rate of agreement: Only 9.9% of 71 specimens exhibited total agreement among three pathologists, with a total disagreement rate of 26.8%.[18] In addition, it was reported that at least 17% of cases of GS overestimation in NCB specimens, compared with prostatectomy specimens, are due to the misinterpretation of the pathologist.[4] To resolve these problems, a consensus diagnosis among two or more pathologists has been recommended.[19] Numerous publications showed that mandatory second opinion pathology review by specialized urological pathologists leads to alteration of the management strategy and improved care in some cases.[3781120] The accuracy of the GS in NCB specimens depends on the experience in uropathology and workload of the reviewing pathologist.[3] A major discrepancy in the GS assigned by general versus specialized pathologists occurs in 15%–41% of NCB specimens.[820] The adoption of second opinion pathology review improves the GS agreement between NCB and RP specimens, and thus GS accuracy, and leads to a change in the treatment recommendation in 9%–26% of cases.[21] In our study, the GGG agreement in NCB specimens between the initial and second opinions was relatively high (weighted k = 0.78) compared with previous reports (0.41–0.64).[717] Hence, an improvement in GGG accuracy was observed in a small number of cases (35%–39%). This may be explained by the improvements in GS accuracy of general pathologists with increasing experience; one study reported that the concordance of GSs assigned by general pathologists was significantly higher during the second half of their 13-year training period.[21] Our concordance rate between NCB and RP specimens was similar to others (35% and 39%), and a tendency for a higher GGG in the second opinion review was also observed. Even with mandatory second-opinion pathology reviews, it is difficult to control interpretation bias. In this study, 57% of NCB cases with a matching GGG between the initial and second opinions were discordant with the GGG of RP specimens. Similarly, Gleason himself in 1992 reported an exact GS agreement in only 50% of cases after a second review of the same specimens used for his original classification.[22] To overcome this bias, the diagnoses of the second-opinion NCB specimen and RP specimen were determined by the same urological pathology expert using the same criteria; furthermore, the diagnosis of the NCB specimens was performed before that of the RP specimens, and no retrospective change in the NCB diagnosis was conducted in the current study. Thus, the inaccuracy might have been caused by sampling error rather than interpretation bias. Sampling error is caused by the extensive, multifocal, and heterogeneous characteristics of PCa, which render proper sampling of the prostate gland difficult.[23] Significant sampling variation occurs with the use of a systematic number of biopsy cores from prostate glands that fluctuate in volume. Thus, increasing the number of biopsy cores improves both sampling and accuracy.[11] Conversely, a small number of biopsy cores with a shorter length might lead to the overestimation of the GGG in NCB specimens. Another way to reduce sampling error is to improve tumor visualization and targeting during the biopsy. To this end, MRI-GB has been rapidly introduced worldwide.[523] Several studies showed a similar cancer detection rate between MRI-GB and TRUS-GB; however, the GS of RP specimens has a higher concordance with that of MRI-GB (57%–90%) than that of TRUS-GB specimens (28%–76%).[34523] MRI-GB also leads to a shift in the GGG distribution of newly diagnosed PC patients toward a diagnosis of higher-risk disease, which allows detection of 30% more high-risk cancers and 17% fewer low-risk cancers than those detected by systemic biopsy.[24] Similarly, Xu et al. reported that 32% of TRUS-GB cases were upgraded, compared with only 21% of MRI-GB cases, following analysis of RP specimens.[5] With the growing availability of prostate MRI, different functional imaging modalities have increased the role of prostate MRI in detecting, locating, and staging PCa.[25] The adoption of multi-parametric MRI can improve the detection of PCa, with a specificity of 0.88 and a sensitivity of 0.74.[26] The Prostate Imaging Reporting and Data System, version 2, (PI-RADS) score was proposed to detect PCa with the aim of increasing multiparametric MRI efficacy.[25] A high PI-RADS score can predict >80% of cases with clinically significant PCa.[2526] The present results suggest that the adoption of multiparametric MRI is important, in addition to a second opinion pathology review. The present study had several limitations. First, only cases treated with RARP were included. Cases diagnosed as benign or deemed unsuitable for RARP are not referred to our institution. Such selection bias may have increased the potential for overestimating the GGG. Second, the biopsy method was not unified, as the NCB specimens were obtained from different institutions; thus, the number of cores and the biopsy location differed among the initial institutions. Third, the same urological pathologist from our hospital determined the GS for both the NCB and RP specimens. Therefore, interpretation bias might have influenced the agreement between the initial and subsequent diagnoses.

CONCLUSIONS

Second opinion pathology review can improve the GGG concordance between NCB and RP specimens. However, even in the comparison limited to the NCB cases with a matching GGG between the initial and second opinions, the GGG rate was different after RARP in over half of the cases.

Financial support and sponsorship

Nil.

Conflicts of interest

There are no conflicts of interest.
  26 in total

1.  Accurate Gleason grading of prostatic adenocarcinoma in prostate needle biopsies by general pathologists.

Authors:  Andrew A Renshaw; Delray Schultz; Kerri Cote; Marian Loffredo; David E Ziemba; Anthony V D'Amico
Journal:  Arch Pathol Lab Med       Date:  2003-08       Impact factor: 5.534

2.  The value of second-opinion pathology diagnoses on prostate biopsies from patients referred for management of prostate cancer.

Authors:  Al B Barqawi; Ruslan Turcanu; Eduard J Gamito; Scott M Lucia; Colin I O'Donnell; E David Crawford; David D La Rosa; Francisco G La Rosa
Journal:  Int J Clin Exp Pathol       Date:  2011-06-12

3.  A UK-based investigation of inter- and intra-observer reproducibility of Gleason grading of prostatic biopsies.

Authors:  J Melia; R Moseley; R Y Ball; D F R Griffiths; K Grigor; P Harnden; M Jarmulowicz; L J McWilliam; R Montironi; M Waller; S Moss; M C Parkinson
Journal:  Histopathology       Date:  2006-05       Impact factor: 5.087

4.  Accuracy of multiparametric MRI for prostate cancer detection: a meta-analysis.

Authors:  Maarten de Rooij; Esther H J Hamoen; Jurgen J Fütterer; Jelle O Barentsz; Maroeska M Rovers
Journal:  AJR Am J Roentgenol       Date:  2014-02       Impact factor: 3.959

5.  Pelvic magnetic resonance imaging parameters predict urinary incontinence after robot-assisted radical prostatectomy.

Authors:  Takuya Sadahira; Yosuke Mitsui; Motoo Araki; Yuki Maruyama; Koichiro Wada; Kohei Edamura; Yasuyuki Kobayashi; Masami Watanabe; Toyohiko Watanabe; Yasutomo Nasu
Journal:  Low Urin Tract Symptoms       Date:  2018-12-12       Impact factor: 1.592

6.  Clinical and cost impact of second-opinion pathology. Review of prostate biopsies prior to radical prostatectomy.

Authors:  J I Epstein; P C Walsh; F Sanfilippo
Journal:  Am J Surg Pathol       Date:  1996-07       Impact factor: 6.394

7.  Does prostate biopsy Gleason score accurately express the biologic features of prostate cancer?

Authors:  Kobi Stav; Sandbank Judith; Herbert Merald; Dan Leibovici; Arie Lindner; Amnon Zisman
Journal:  Urol Oncol       Date:  2007 Sep-Oct       Impact factor: 3.498

8.  Investigation of the freely available easy-to-use software 'EZR' for medical statistics.

Authors:  Y Kanda
Journal:  Bone Marrow Transplant       Date:  2012-12-03       Impact factor: 5.483

9.  Factors predicting pathological upgrading after prostatectomy in patients with Gleason grade group 1 prostate cancer based on opinion-matched biopsy specimens.

Authors:  Yuki Maruyama; Takuya Sadahira; Motoo Araki; Yosuke Mitsui; Koichiro Wada; Acosta Gonzalez Herik Rodrigo; Kazuaki Munetomo; Yasuyuki Kobayashi; Masami Watanabe; Hiroyuki Yanai; Toyohiko Watanabe; Yasutomo Nasu
Journal:  Mol Clin Oncol       Date:  2020-02-10

10.  Direct comparison of multiparametric magnetic resonance imaging (MRI) results with final histopathology in patients with proven prostate cancer in MRI/ultrasonography-fusion biopsy.

Authors:  Angelika Borkowetz; Ivan Platzek; Marieta Toma; Theresa Renner; Roman Herout; Martin Baunacke; Michael Laniado; Gustavo Baretton; Michael Froehner; Stefan Zastrow; Manfred Wirth
Journal:  BJU Int       Date:  2016-04-02       Impact factor: 5.588

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.