Although several models have been developed to predict the probability of Gleason sum upgrading between biopsy and radical prostatectomy specimens, most of these models are restricted to prostate-specific antigen screening-detected prostate cancer. This study aimed to build a nomogram for the prediction of Gleason sum upgrading in clinically diagnosed prostate cancer. The study cohort comprised 269 Chinese prostate cancer patients who underwent prostate biopsy with a minimum of 10 cores and were subsequently treated with radical prostatectomy. Of all included patients, 220 (81.8%) were referred with clinical symptoms. The prostate-specific antigen level, primary and secondary biopsy Gleason scores, and clinical T category were used in a multivariate logistic regression model to predict the probability of Gleason sum upgrading. The developed nomogram was validated internally. Gleason sum upgrading was observed in 90 (33.5%) patients. Our nomogram showed a bootstrap-corrected concordance index of 0.789 and good calibration using 4 readily available variables. The nomogram also demonstrated satisfactory statistical performance for predicting significant upgrading. External validation of the nomogram published by Chun et al. in our cohort showed a marked discordance between the observed and predicted probabilities of Gleason sum upgrading. In summary, a new nomogram to predict Gleason sum upgrading in clinically diagnosed prostate cancer was developed, and it demonstrated good statistical performance upon internal validation.
Although several models have been developed to predict the probability of Gleason sum upgrading between biopsy and radical prostatectomy specimens, most of these models are restricted to prostate-specific antigen screening-detected prostate cancer. This study aimed to build a nomogram for the prediction of Gleason sum upgrading in clinically diagnosed prostate cancer. The study cohort comprised 269 Chinese prostate cancerpatients who underwent prostate biopsy with a minimum of 10 cores and were subsequently treated with radical prostatectomy. Of all included patients, 220 (81.8%) were referred with clinical symptoms. The prostate-specific antigen level, primary and secondary biopsy Gleason scores, and clinical T category were used in a multivariate logistic regression model to predict the probability of Gleason sum upgrading. The developed nomogram was validated internally. Gleason sum upgrading was observed in 90 (33.5%) patients. Our nomogram showed a bootstrap-corrected concordance index of 0.789 and good calibration using 4 readily available variables. The nomogram also demonstrated satisfactory statistical performance for predicting significant upgrading. External validation of the nomogram published by Chun et al. in our cohort showed a marked discordance between the observed and predicted probabilities of Gleason sum upgrading. In summary, a new nomogram to predict Gleason sum upgrading in clinically diagnosed prostate cancer was developed, and it demonstrated good statistical performance upon internal validation.
Characterizing cancer parameters before treatment is of paramount importance in determining the prognosis and choosing therapeutic strategies. The Gleason sum, although introduced approximately 50 years ago, still provides significant predictive value for localized prostate cancer[1]–[3]. Several treatment recommendations are made according to the Gleason sum; for example, a Gleason sum≤ 6 is an eligibility criterion for active surveillance and brachytherapy, and multimodality treatment should be considered in subjects with a high Gleason sum[4],[5]. Currently, the Gleason sum plays an important role in a number of prediction models, and it cannot easily be replaced by other clinicopathologic factors[6].The biopsy Gleason sum, however, often underestimates the final pathologic Gleason sum because it is based on a sample. Previous reports suggested that approximately 30% to 43% of patients will have a diagnosis of more aggressive prostate cancer variant at radical prostatectomy than was diagnosed at biopsy[7]–[13]. Any upgrading of the Gleason sum from the biopsy to the final pathologic specimen may alter the treatment options. The choice of conservative treatment, for example, might be reconsidered in men who are at a greater risk of biopsy Gleason sum upgrading. Furthermore, the probability of Gleason sum upgrading should be taken into account in the interpretation of the outcomes of non-surgical interventions.In the last 5 years, several models have been developed to predict the probability of Gleason sum upgrading in European or American patients[7]–[13]. Most of these models are restricted to screening-detected prostate cancer, and the subjects often have a prostate-specific antigen (PSA) value of less than 10 ng/mL. Due to the low incidence of prostate cancer (age-standardized incidence = 1.6 per 100,000[14]), PSA screening is not widely conducted in mainland China. Thus, greater proportions of Chinese patients are diagnosed with clinical symptoms and have advanced disease compared with their western counterparts. We developed a nomogram to predict the probability of Gleason sum upgrading in a cohort with clinically diagnosed prostate cancer. An external validation of the Chun et al. nomogram[7] was also performed.
Patients and Methods
Patients
The study population consisted of 269 assessable patients treated with radical prostatectomy and no neoadjuvant hormonal therapy at Fudan University Shanghai Cancer Center between April 2006 and May 2011. Of all the patients, 220 (81.8%) were referred to the hospital due to clinical symptoms such as urinary frequency, urgency, dysuria, and hematuria.
Clinical and pathologic evaluation
The indications for biopsy included elevated PSA level (>4 ng/mL), abnormal digital rectal examination, and/or suspicious findings in a radiologic examination. Clinical stage was assigned according to the 2002 TNM system. Pretreatment PSA level was measured before digital rectal examination and transrectal ultrasound (TRUS) examination. Prostate biopsies were performed under TRUS guidance, with the number of cores ranging from 10 to 16. Of the 269 men included in the study, 200 (74.3%) were biopsied with a 12-core schema. All outside biopsy slides were reviewed by a trained pathologist before the results were recorded in the database. Radical prostatectomy specimens were processed according to the Stanford protocol[5] and were also graded according to the Gleason system. Primary and secondary Gleason scores were assessed by two experienced genitourinary pathologists. Discrepancies were resolved by a joint review of the slides.
Statistical analysis
The data analyses in this study consisted of 3 stages. First, a logistic regression model was used to determine the probability of any Gleason sum upgrading between the biopsy and radical prostatectomy specimens. To develop a robust prediction model, we restricted the number of covariates to 20 events per variable. The PSA level, clinical T stage, primary Gleason score, and secondary Gleason score were used as predictors because they are readily available in the clinic and are well supported by evidence in the literature. For categorical variables, small units (≤5%) were merged into adjacent subgroups. Nonlinear or interaction effects were included in the final model if required to optimize the Akaike Information Criterion. A nomogram was constructed based on the results of multivariate logistic regression analysis. Second, the prediction model was further evaluated for its ability to predict significant upgrading, which was defined as a change in biopsy Gleason sum from ≤6 to ≥7 or from 7 to ≥8, as described in a previous report[15]. The predicted probability of any Gleason sum upgrading was correlated with the outcome of significant upgrading in patients with biopsy Gleason sum < 8. Third, we externally validated the nomogram proposed by Chun et al. to assess its predictive value in our set of patients clinically diagnosed with prostate cancer.The statistical performance of the prediction model was assessed by discrimination and calibration. Discrimination was measured using the concordance index (C-index), which is similar to an area under the receiver operating characteristic curve. Bootstrap-corrected C-indexes were used to better gauge expected future predictive accuracy. Calibration was assessed by visually inspecting the plots of predicted probability of Gleason sum upgrading versus actual outcomes and by the Spiegelhalter Z-test.Statistical tests were performed using software R 2.10.0, with a 2-sided significance level of 0.05.
Results
Clinicopathologic data
shows the descriptive characteristics of the entire cohort compared with those of Chun's cohort. Overall, a higher PSA level, more advanced tumor stage, and higher Gleason score (biopsy and radical prostatectomy) were observed in our cohort. shows the concordance between the biopsy and prostatectomy Gleason sums. Of the 269 patients, the Gleason sum was consistent in 158 (58.7%), upgraded in 90 (33.5%), and downgraded in 21 (7.8%). We further divided the entire cohort into three groups according to the biopsy Gleason sum (≤ 6, 7, ≥ 8). The probability of Gleason sum upgrading was the highest in patients with a low biopsy grade (Gleason sum≤ 6) (). In those patients with a biopsy Gleason sum of 7, 45.5% of the Gleason sum upgrades were increased by 2 or 3 units. High-magnitude Gleason sum upgrading was less frequently observed among the patients with low-grade disease with Gleason sum upgrading (19.1%).
Table 1.
Descriptive characteristics of enrolled patients from Fudan University Shanghai Cancer Center (FUSCC) and the Hamburg cohort reported by Chun et al.
Concordance between biopsy and radical prostatectomy Gleason sums
Biopsy Gleason sum
Radical prostatectomy Gleason sum
5
6
7
8
9
Total
4-5
1
6
3
0
0
10
6
0
51
51
3
4
109
7
0
4
80
6
5
95
8
0
0
11
15
11
37
9-10
0
0
2
3
13
18
Total
1
62
147
27
33
269
Figure 1.
The distribution and magnitude of Gleason sum upgrading in three groups with different biopsy Gleason scores.
For example, the Gleason sum was consistent in 80 (84.2%), upgraded by 1 unit in 6 (6.3%), upgraded by 2 or 3 units in 5 (5.3%), and downgraded in 4 (4.2%) patients with a biopsy Gleason sum of 7.
The distribution and magnitude of Gleason sum upgrading in three groups with different biopsy Gleason scores.
For example, the Gleason sum was consistent in 80 (84.2%), upgraded by 1 unit in 6 (6.3%), upgraded by 2 or 3 units in 5 (5.3%), and downgraded in 4 (4.2%) patients with a biopsy Gleason sum of 7.
Nomogram development
The PSA level, clinical stage, and primary and secondary Gleason scores were included in multivariate analysis. Because there was a strong association between the primary and secondary Gleason scores (), the interaction term between the primary and secondary Gleason scores was added to the model for assessment. We found that the C-index was significantly improved from 0.758 to 0.789 after including the interaction term (P < 0.001). The bootstrap-corrected C-index was 0.789, indicating acceptable discrimination. The calibration plot demonstrates that the rate of predicted Gleason sum upgrading closely paralleled the observed rate (). Based on these results, a nomogram was constructed to predict Gleason sum upgrading in a user-friendly manner ().
Table 3.
Statistical results of covariates in the prediction model for Gleason sum upgrading
Calibration plot of the predicted and observed probabilities of Gleason sum upgrading in the entire group (n = 269).
The calibration plot shows the performance of the nomogram. Specifically, the nomogram-predicted probabilities are comparable to the observed rates of biopsy Gleason sum upgrading. The nomogram-predicted probability of biopsy Gleason sum upgrading is plotted on the X-axis, and the observed rate of upgrading is plotted on the Y-axis. Perfect prediction would correspond to a slope of 1 (diagonal 45-degree broken line). The solid line indicates the bootstrap-corrected nomogram performance.
Figure 3.
Nomogram for predicting Gleason sum upgrading between biopsy and radical prostatectomy specimens in clinically diagnosed prostate cancer.
Calibration plot of the predicted and observed probabilities of Gleason sum upgrading in the entire group (n = 269).
The calibration plot shows the performance of the nomogram. Specifically, the nomogram-predicted probabilities are comparable to the observed rates of biopsy Gleason sum upgrading. The nomogram-predicted probability of biopsy Gleason sum upgrading is plotted on the X-axis, and the observed rate of upgrading is plotted on the Y-axis. Perfect prediction would correspond to a slope of 1 (diagonal 45-degree broken line). The solid line indicates the bootstrap-corrected nomogram performance.
Nomogram for predicting Gleason sum upgrading between biopsy and radical prostatectomy specimens in clinically diagnosed prostate cancer.
PSA, prostate-specific antigen; pGS, primary biopsy Gleason score; sGS, secondary biopsy Gleason score; cT category, clinical T category.To investigate the nomogram's ability to predict significant upgrading, we used it to evaluate 214 patients with a biopsy Gleason sum < 8. In this subgroup, 72 (33.6%) patients had significant upgrading. The prediction model showed a good discrimination ability for significant upgrading [C-index = 0.795, 95% confidence interval (CI) = 0.735-0.855]. Although the calibration plot shows a slight overestimation of the probability of significant upgrading, the Spiegelhalter Z-test indicated good calibration (P = 0.758) ().
Figure 4.
Calibration plot of the predicted and observed probabilities of significant Gleason sum upgrading in patients with Gleason sum < 8 (n = 214).
The prediction calculated using the nomogram is plotted on the X-axis, and the observed rate of significant Gleason sum upgrading is plotted on the Y-axis.
Calibration plot of the predicted and observed probabilities of significant Gleason sum upgrading in patients with Gleason sum < 8 (n = 214).
The prediction calculated using the nomogram is plotted on the X-axis, and the observed rate of significant Gleason sum upgrading is plotted on the Y-axis.The statistical performance of the nomogram proposed by Chun et al. was assessed in our entire cohort. The C-index was 0.755 (95% CI = 0.691-0.819). The calibration plot, however, shows poor concordance between the predicted and observed probabilities of Gleason sum upgrading (P < 0.001) (). Compared to the actual probability, the risk of underestimated or overestimated Gleason sum upgrading was more than 5%, with a wide range of predicted probability.
Figure 5.
Calibration plot of the Chun et al. nomogram in our cohort. The model-predicted probability of biopsy Gleason sum upgrading is plotted on the X-axis, and the observed rate of upgrading is plotted on the Y-axis.
Perfect prediction would correspond to a slope of 1 (diagonal 45-degree broken line). The solid line indicates the logistic calibration curve, and open circles represent the data for prediction strata specified by Chun et al.
Calibration plot of the Chun et al. nomogram in our cohort. The model-predicted probability of biopsy Gleason sum upgrading is plotted on the X-axis, and the observed rate of upgrading is plotted on the Y-axis.
Perfect prediction would correspond to a slope of 1 (diagonal 45-degree broken line). The solid line indicates the logistic calibration curve, and open circles represent the data for prediction strata specified by Chun et al.
Discussion
We successfully developed and validated a model to predict Gleason sum upgrading from biopsy to final pathology using four clinical variables. Our model is 78.9% accurate, and its prediction closely approximates the observed rate of Gleason sum upgrading between biopsy and final pathology. At the same time, the prediction model showed good discrimination ability for significant upgrading, with a C-index of 0.795. Conversely, testing of a previously published model predicting Gleason sum upgrading showed poor concordance between the predicted and observed probabilities of Gleason sum upgrading.Biopsy upgrading has important clinical implications in terms of selecting candidates for watchful waiting, surgery, and radiotherapy approaches. The previous studies were focused on Gleason sum upgrading in low- or intermediate-grade prostate cancer. However, this phenomenon is also meaningful in high-grade prostate cancer because Gleason score is a significant predictor of disease progression and survival even in the subgroup of patients with high-grade disease. Albertsen et al.[16] have reported that men with a biopsy Gleason score of 7 to 10 had a high risk (42% to 87%) of death from prostate cancer when treated conservatively, even when cancer was diagnosed as late as 74 years of age. Brachytherapy also showed poor outcomes in patients with intermediate- or high-grade disease: Gleason score 7 tumors had an approximately 50% probability of biochemical relapse within 5 years[4]. After radical prostatectomy, the incidence of extracapsular extension, seminal vesical invasion, and lymph node metastases was significantly higher in patients with high-grade disease than in patients with low-grade disease[17]. The results of the RTOG 92-02 trial showed that long-term androgen deprivation and radiotherapy provided a survival advantage only in patients with Gleason score 8-10 tumors[18]. Even in the subgroup with biopsy Gleason score > 7, the prognosis of prostate cancer worsens with increasing Gleason sum. The 8-year progression-free survival rates after radical prostatectomy were 40%, 32%, and 27% for patients with final Gleason sums of 8, 9, and 10, respectively (P = 0.043) [19]. Audenet et al. [20] also reported distinctive prognosis in high-grade prostate cancer treated with surgery; the individual 5-year global PSA-free recurrence rates in patients with Gleason sums of 8 and 9 were 50% and 35%, respectively (P = 0.002). Therefore, inaccurate tumor grade estimation definitively results in improper risk attribution and treatment assignment.The prevalence of Gleason sum upgrading in this study was 33.5%, which is within the range of previously reported values. It should be noted that the prevalence of Gleason sum upgrading was related to the number of prostate biopsy cores taken. King et al.[15] found that the probability of Gleason sum upgrading was 25% when using the sextant biopsy scheme but only 13% when using the extended 10-core biopsy scheme (P = 0.045). The risk of Gleason sum upgrading was reported to decrease when more than 12 biopsy cores were taken[10], although this biopsy scheme is not widely accepted as the first choice for prostate cancer diagnosis. In our study, all patients had a minimum of 10 biopsy cores, which represents the current standard of care.Consistent with previous studies, we found that PSA level and primary and secondary Gleason scores were independent prognostic factors for Gleason sum upgrading. The inclusion of an interaction term in the final model was motivated by the strong association between primary and secondary Gleason scores. The improved model performance confirmed the hypothesis that the effect of the secondary Gleason score depended on that of the primary Gleason score.King et al.[15] defined significant upgrading as a Gleason sum upgrading either from ≤ 6 to ≥ 7 or from 7 to ≥ 8 between biopsy and radical prostatectomy specimens. We observed that most (91.1%) of the Gleason sum upgrading in patients with low- or intermediate-grade disease was significant. When applied to this subgroup, our nomogram showed satisfactory discrimination and calibration.To the best of our knowledge, 3 models have already been developed to predict the probability of any Gleason sum upgrading[7],[9]. D'Amico's model was externally validated in a European cohort and yielded a C-index of 0.5, which is equivalent to the flip of a coin[7]. The second model, published by Kulkarni et al.[9], showed a C-index of 0.71 upon internal validation. However, this model relied on 9 predictor variables, some of which are not routinely recorded in patients' charts. The third model, developed by Chun et al.[7], demonstrated a C-index of 0.8 in the original patient cohort. It was later applied to two independent datasets from Italy and Japan for external validation[17],[21]. Both the development and validation cohorts showed a low PSA level (median value < 10 ng/mL) and early T stage (mostly < cT2b). The predictive accuracy was 74.9% in the Italian cohort and 79.2% in the Japanese cohort. The Chun et al. nomogram, however, was not validated in patients with clinically diagnosed prostate cancer, which had distinct characteristics. According to a multicenter study in mainland China, only 6.2% of prostate cancerpatients presented with increased PSA level without other symptoms[22]. Urinary symptoms were the major reason for prostate cancer diagnosis referral (75.9%) for Chinese patients. In populations without widespread PSA screening, patients who undergo radical prostatectomy tend to have higher PSA levels and more advanced tumor stage. When applied to our dataset, the accuracy of the Chun et al. nomogram in predicting Gleason sum upgrading was 75.5%; however, the calibration plot shows poor concordance between the predicted and observed probabilities of Gleason sum upgrading (P < 0.001) (). The poor performance of the Chun et al. nomogram in our series confirmed that the application of nomograms in different patient populations should be performed with great caution.The limitations of the present study include its single-center nature and limited sample size, which is mainly due to the low incidence of prostate cancer in mainland China. Furthermore, our nomogram was based on four readily available predictors. The accuracy of our model could potentially be improved by integrating additional variables. However, some potentially predictive factors, such as positive biopsy cores and the proportion or length of tumor involvement per biopsy core, were not recorded for patients who underwent biopsy outside of our center. Finally, we have not tested the performance of our nomogram in an external dataset. Despite these limitations, our model represents the first predictive model concerning the rate of Gleason sum upgrading between biopsy and final pathology among patients with clinically diagnosed prostate cancer.
Conclusions
We developed a new nomogram to predict Gleason sum upgrading in patients with clinically diagnosed prostate cancer. The model demonstrated good discrimination and calibration in internal validation. The existing nomogram constructed by Chun et al. should not be used in this setting.
Authors: Umberto Capitanio; Pierre I Karakiewicz; Claudio Jeldres; Alberto Briganti; Andrea Gallina; Nazareno Suardi; Andrea Cestari; Giorgio Guazzoni; Andrea Salonia; Francesco Montorsi Journal: Int J Urol Date: 2009-03-26 Impact factor: 3.369
Authors: Michael J Zelefsky; Leah Ben-Porat; Howard I Scher; Heather M Chan; Paul A Fearn; Zvi Y Fuks; Steven A Leibel; E S Venkatraman Journal: J Clin Oncol Date: 2005-02-01 Impact factor: 44.544
Authors: Stephen J Freedland; Christopher J Kane; Christopher L Amling; William J Aronson; Martha K Terris; Joseph C Presti Journal: Urology Date: 2007-03 Impact factor: 2.649
Authors: Michael J Zelefsky; Deborah A Kuban; Larry B Levy; Louis Potters; David C Beyer; John C Blasko; Brian J Moran; Jay P Ciezki; Anthony L Zietman; Thomas M Pisansky; Mohamed Elshaikh; Eric M Horwitz Journal: Int J Radiat Oncol Biol Phys Date: 2006-11-02 Impact factor: 7.038
Authors: Danielle A Stackhouse; Leon Sun; Florian R Schroeck; Jayakrishnan Jayachandran; Arthur A Caire; Cyril O Acholo; Cary N Robertson; David M Albala; Thomas J Polascik; Craig F Donatucci; Kelly E Maloney; Judd W Moul Journal: J Urol Date: 2009-05-17 Impact factor: 7.450
Authors: Girish S Kulkarni; Gina Lockwood; Andrew Evans; Ants Toi; John Trachtenberg; Michael A S Jewett; Antonio Finelli; Neil E Fleshner Journal: Cancer Date: 2007-06-15 Impact factor: 6.860
Authors: Eric M Horwitz; Kyounghwa Bae; Gerald E Hanks; Arthur Porter; David J Grignon; Harmar D Brereton; Varagur Venkatesan; Colleen A Lawton; Seth A Rosenthal; Howard M Sandler; William U Shipley Journal: J Clin Oncol Date: 2008-04-14 Impact factor: 44.544
Authors: Umberto Capitanio; Pierre I Karakiewicz; Luc Valiquette; Paul Perrotte; Claudio Jeldres; Alberto Briganti; Andrea Gallina; Nazareno Suardi; Andrea Cestari; Giorgio Guazzoni; Andrea Salonia; Francesco Montorsi Journal: Urology Date: 2009-02-04 Impact factor: 2.649