Literature DB >> 31118768

Development and external validation of prognostic nomograms in hepatocellular carcinoma patients: a population based study.

Zhiyu Xiao1,2, Yongcong Yan1,2, Qianlei Zhou1,2, Haohan Liu1,2, Pinbo Huang1,2, Qiming Zhou1,2, Changliang Lai1,2, Jianlong Zhang1,2, Jie Wang1,2, Kai Mao1,2.   

Abstract

Background: We attempted to construct and validate novel nomograms to predict overall survival (OS) and cancer-specific survival (CSS) in patients with hepatocellular carcinoma (HCC).
Methods: Models were established using a discovery set (n=10,262) obtained from the Surveillance, Epidemiology, and End Results (SEER) database. Based on univariate and multivariate Cox regression analyses, we identified independent risk factors for OS and CSS. Concordance indexes (c-indexes) and calibration plots were used to evaluate model discrimination. The predictive accuracy and clinical values of the nomograms were measured by decision curve analysis (DCA).
Results: Our OS nomogram with a c-index of 0.753 (95% confidence interval (CI), 0.745-0.761) was based on age, sex, race, marital status, histological grade, TNM stage, tumor size, and surgery performed, and it performed better than TNM stage. Our CSS nomogram had a c-index of 0.748 (95% CI, 0.740-0.756). The calibration curves fit well. DCA showed that the two nomograms provided substantial clinical value. Internal validation produced c-indexes of 0.758 and 0.752 for OS and CSS, respectively, while external validation in the Sun Yat-sen Memorial Hospital (SYMH) cohort produced a c-indexes of 0.702 and 0.686 for OS and CSS, respectively. Conclusions: We have developed nomograms that enable more accurate individualized predictions of OS and CSS to help doctors better formulate individual treatment and follow-up management strategies.

Entities:  

Keywords:  cancer-specific survival; decision curve analysis; epidemiology and end results; overall survival; surveillance

Year:  2019        PMID: 31118768      PMCID: PMC6489568          DOI: 10.2147/CMAR.S191287

Source DB:  PubMed          Journal:  Cancer Manag Res        ISSN: 1179-1322            Impact factor:   3.989


Introduction

Hepatocellular carcinoma (HCC) is the sixth most common cancer and the second most deadly cause of cancer mortality worldwide according to global cancer statistics obtained in 2012, and nearly half of the total number of cases and deaths occur in China.1–3 Improvements in treatment strategies have markedly improved the overall survival (OS) and cancer-specific survival (CSS) of HCC patients, although the long-term survival rate remains low. Many HCC patients die because of disease rather than of other causes. Factors related to prognostic predictions are based on the American Joint Committee on Cancer (AJCC) TNM staging system and the National Comprehensive Cancer Network (NCCN) guidelines.4 However, many factors not included in TNM staging may influence the survival of HCC patients, including patient background characteristics (ie, patient age, sex, race and geographical location), tumor-related factors (tumor size, invasion and histological grade) and treatment received (surgery performed).1,5–7 Nomograms are reliable statistical predictive models that are used to accurately calculate and predict individual survival by combining all risk factors for tumor development.8,9 An increasing number of nomograms are being widely established to provide assistance in formulating individual treatment and follow-up management strategies in several cancers, such as oropharyngeal cancer,10 gastrointestinal stromal tumors,11 adenoid cystic carcinoma,12 bladder cancer,13 and prostate cancer.14 In HCC, many nomograms have been constructed to predict recurrence-free survival and OS after liver resection,6,7,15,16 but these nomograms are all based on a single population or have been unvalidated in an external cohort. More importantly, few studies have focused on nomogram models specific to CSS in HCC patients. To the best of our knowledge, no study has been carried out to predict prognosis using data gathered from HCC patients in the Surveillance, Epidemiology, and End Results (SEER) database. In the present study, we aimed to establish and validate the first effective and convenient HCC nomogram model to calculate and predict OS and CSS based on clinicopathological risk factors obtained from the SEER database. Moreover, these nomogram models were validated using both an internal SEER dataset and an independent external cohort obtained from our hospital. These nomograms can guide individual treatment and follow-up management in HCC patients.

Materials and methods

Patients and study design

We downloaded clinical data related to all patients under the liver heading (Site Recode ICD-O-3/WHO 2008) in the SEER 18 registry database (1973–2015) using SEER*Stat 8.3.5 software. The flow chart used for data selection is shown in Figure 1. “The International Classification of Diseases for Oncology (ICD-O-3) Hist/behav, malignant” was used to screen HCC cases. The “Year of diagnosis” ranged from 2004 to 2014. “Derived AJCC Stage Group 6th (2004+)”, “CS tumor size (2004+)”, “RX Summ-Surg Prim Site (1998+)”, and “Grading and differentiation codes in ICD-O-2” were used in the present study. “Vital status recode” and “SEER cause-specific death classification” were used to set endpoints for OS and CSS, respectively, while patient survival time was defined using “survival months code”. The inclusion criteria were as follows: diagnostic confirmation achieved based on microscopic analysis and patient background characteristics (ie, age, sex, race and marital status), tumor-related factors (tumor size, invasion and histological grade) and treatment received (surgery performed) were known and available. The exclusion criteria were as follows: death certificate or autopsy only and age <15 years. A total of 15,394 cases in the SEER cohort were included and analyzed in the present study. All HCC patients were randomly divided into a discovery set with N*q samples and an internal testing set with N*(1-q) samples (q=2/3). To further validate our results in a responsible manner, we sought an external testing cohort from Sun Yat-sen Memorial Hospital (SYMH) that included data obtained between January 1, 2009, and December 31, 2012. That dataset included 244 postoperative HCC patients (the SYMH cohort) who were recruited using the above inclusion and exclusion criteria. All diagnoses were confirmed by pathology. The data from the SEER Registry and the Sun Yat-sen Memorial Hospital were rendered anonymous.This retrospective study was reviewed and approved by the ethics committee of Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, and written informed consent was obtained from each patient. The procedure was conducted in accordance with the Declaration of Helsinki.
Figure 1

Study flowchart.

Abbreviations: HCC, hepatocellular carcinoma; OS, overall survival; CSS, cancer-specific survival; DCA, decision curve analysis.

Study flowchart. Abbreviations: HCC, hepatocellular carcinoma; OS, overall survival; CSS, cancer-specific survival; DCA, decision curve analysis.

Nomogram construction

We validated that the clinicopathological features of the patients in the discovery and internal testing sets were well balanced (P>0.05) according to the chi-square test. We used a univariate Cox regression analysis to screen for clinicopathological risk factors for OS and CSS in the SEER discovery set. We further performed multivariate Cox regression analysis to screen for independent important factors for HCC patients without violating the PH assumption. All variables were screened using the forward stepwise selection method in a Cox multivariate analysis regression model.17,18 Based on the identified independent important factors, we constructed two nomograms to predict OS and CSS at 1, 3 and 5 years using R software.

Nomogram validation

We calculated concordance indexes (c-indexes) and drew calibration plots for the internal and external validating cohorts, respectively. The c-index quantified the discrimination between two random patients, with a c-index of 0.5 indicating no discrimination and 1 indicating perfect discrimination.19 Calibration plots were constructed to validate the accuracy and reliability of the nomograms for OS and CSS by comparing the nomogram-predicted and actual survival rates determined in a Kaplan-Meier analysis with 1000 bootstrap samples.20

Clinical application value assessment

Decision curve analysis (DCA) was performed to identify and compare the clinical application value between the nomogram model and other clinical features by calculating the net benefits at each risk threshold probability.21,22 The net benefit was determined by subtracting the proportion of all false-positive results from the proportion of true-positive results and weighted by the relative harm caused by giving up treatment compared with the negative consequences of unnecessary treatments.23 Based on the DCA, we further plotted curves to evaluate the clinical impact of the nomogram to help us more intuitively understand its significant value. Accordingly, we evaluated the number of high-risk patients and the number of high-risk patients with outcomes for OS and CSS at different threshold probabilities in a given population.24

Statistical analysis

The chi-square test was used to compare categorical variables. OS was defined as the time from diagnosis to the time of death from any cause or the most recent follow-up, and CSS was calculated from the date of diagnosis to the date of HCC-related death or the most recent follow-up. Kaplan-Meier survival curves were used to compare OS and CSS among different groups, and survival differences were assessed by a two-sided log-rank test in R version 3.3.4 (www.R-project.org). Univariate and multivariate Cox regression analyses were performed to generate hazard ratios (HRs) and 95% confidence intervals (CIs) in IBM SPSS Statistics version 24 (SPSS Inc., Chicago, IL, USA). The OS and CSS nomograms were constructed based on the factors identified in the multivariate analysis. C-indexes, calibration plots, DCA and clinical impact curves were analyzed in R version 3.3.4 with relevant packages. All statistical tests were two-sided, and a P-value <0.05 was considered statistically significant.

Results

Study flowchart and basic clinical characteristics

The study flowchart is presented in Figure 1. A total of 15,394 HCC patients were included in the study and randomly divided into a discovery set (n=10,262) and an internal testing set (n=5,132). Patient clinicopathological features in the discovery and internal testing sets are shown in Table S1. There were no significant differences between the two sets (P>0.05, Table S1). The detailed clinical characteristics of patients in the SYMH cohort (n=244) are shown in Table S1. The median ages (interquartile range) of the patients in the SEER discovery set, the internal testing set, the entire SEER cohort and the SYMH cohort were 63 (56–72), 63 (56–72), 63 (56–72) and 52 (42–59) years, respectively, and the median OS durations were 630, 600, 630 and 1448 days, respectively. The 1-, 3- and 5-year OS rates were 63.99%, 40.91%, and 30.78% and 80.10%, 58.36%, and 45.19% in the SEER and the SYMH cohorts, respectively. The median CSS durations in the above four sets were 1,230, 1,200, 1,200 and 1,701 days, respectively; while the 1-, 3- and 5-year CSS rates were 73.12%, 54.25%, and 45.42% in the SEER cohort. OS and CSS curves are plotted in Figure S1.
Table 1

The c-index for the nomogram to predict OS and CSS

GroupVariableOSCSS
c-index95% CIc-index95% CI
SEER discovery set (n=10,262)Nomogram0.7530.745–0.7610.7480.740–0.756
Grade0.6540.646–0.6620.680.670–0.690
TNM0.5550.547–0.5630.5720.562–0.580
Size0.6180.612–0.6240.6360.628–0.644
SEER testing set (n=5,132)Nomogram0.7580.746–0.7700.7520.740–0.764
Grade0.5630.553–0.5730.5810.569–0.593
TNM0.6590.649–0.6690.6880.676–0.699
Size0.6190.609–0.6290.6370.625–0.649
SYMH cohort (n=244)Nomogram0.7020.647–0.7570.6860.629–0.742
Grade0.5850.540–0.6300.5770.529–0.624
TNM0.6780.629–0.7270.6670.614–0.719
Size0.620.573–0.6670.6090.560–0.658

Abbreviations: SYMH, Sun Yat-Sen Memorial Hospital; TNM, tumor lymph node metastasis; OS, overall Survival; CSS, cancer-specific survival.

The c-index for the nomogram to predict OS and CSS Abbreviations: SYMH, Sun Yat-Sen Memorial Hospital; TNM, tumor lymph node metastasis; OS, overall Survival; CSS, cancer-specific survival.

Independent significant factors in the discovery set

To further identify the candidate predictors of OS and CSS, we evaluated all clinicopathological features by Cox proportional hazards regression analysis. Univariate analysis of the Cox regression was performed in 10,262 patients in the discovery set. All clinicopathological features, including age, sex, marital status, race, histological grade, tumor size, surgery performed, and TNM stage, affected OS (Table S2). Multivariate regression analysis was performed on the 8 factors that were shown to significantly affect OS, and all 8 factors were independent prognostic predictors of OS (Figure 2). Similarly, the univariate analysis showed that all evaluated clinical indexes were associated with CSS. In the multivariate analysis, sex, marital status, histological grade, tumor size, surgery performed, and TNM stage were the 6 independent significant factors that predicted CSS (Figure 2). OS and CSS curves stratified by these independent prognostic factors showed significant differences among the groups, as shown in Figures S2 and S3.
Figure 2

Multivariate Cox regression analysis and forest plots of the HR and 95% CIs of OS (A) and CSS (B) in the SEER discovery set. Grade: 1, well-differentiated; 2, moderately differentiated; 3, poorly differentiated; 4, undifferentiated.

Multivariate Cox regression analysis and forest plots of the HR and 95% CIs of OS (A) and CSS (B) in the SEER discovery set. Grade: 1, well-differentiated; 2, moderately differentiated; 3, poorly differentiated; 4, undifferentiated.

Prognostic nomograms for OS and CSS

Based on the independent prognostic factors identified in the multivariate Cox regression analysis, two nomograms were developed to predict 1-, 3-, and 5-year OS (Figure 3A) and CSS (Figure 3B) in HCC patients. The point assignments and prognostic scores for every variable involved in the score models are shown in Table S3.
Figure 3

Nomograms for predicting the 1-, 3- and 5-year probabilities of (A) OS and (B) CSS in patients with HCC in the SEER discovery set. All the points identified on the top scale for each factor were added to generate a total score. The total points projected on the bottom scale were used to determine the probabilities of 1-, 3- and 5-year OS and CSS in individuals. Grade: 1, well-differentiated; 2, moderately differentiated; 3, poorly differentiated; and 4, undifferentiated. TNM: 1, stage I; 2, stage II; 3, stage III; and 4, stage IV.

Nomograms for predicting the 1-, 3- and 5-year probabilities of (A) OS and (B) CSS in patients with HCC in the SEER discovery set. All the points identified on the top scale for each factor were added to generate a total score. The total points projected on the bottom scale were used to determine the probabilities of 1-, 3- and 5-year OS and CSS in individuals. Grade: 1, well-differentiated; 2, moderately differentiated; 3, poorly differentiated; and 4, undifferentiated. TNM: 1, stage I; 2, stage II; 3, stage III; and 4, stage IV.

Performance of the nomograms in the discovery set

All indexes in the nomograms and other single factors in all cohorts are listed in Table 1. The c-index for the OS prediction nomogram was 0.753 (95% CI, 0.745–0.761) in the discovery set, while the c-indexes for TNM stage, histologic grade and tumor size for OS prediction were 0.555 (95% CI, 0.547–0.563), 0.654 (95% CI, 0.646–0.662), and 0.618 (95% CI, 0.612–0.624), respectively, and were much lower than those of the nomogram model. Similarly, the c-index for the CSS prediction nomogram was higher than the c-indexes for TNM stage, grade and tumor size. The calibration plots for the probability of 1-, 3- and 5-year OS demonstrated good concordance between the nomogram prediction and actual observations in the discovery set (Figure 4A); similar results were found for the CSS nomogram (Figure 4B). The discriminatory ability of the OS nomogram was further assessed in a survival analysis. All patients were divided into three groups according to optimal cutoff values determined by X-tile software (for OS, low risk: <133, intermediate risk: 133–214, and high risk: >214; for CSS, low risk: <118, intermediate risk: 118–206, and high risk: >206) according to the nomogram predictions. Kaplan-Meier curves for OS and CSS were plotted for the entire SEER cohort (Figure 5A and B). Based on risk stratification, in the low-, intermediate- and high-risk subgroups, the 1-year OS rates were 85.59%, 52.78%, and 20.57%, the 3-year OS rates were 63.72%, 22.64%, and 4.38%, the 1-year CSS rates were 90.69%, 59.95%, and 23.39%, and the 3 year CSS rates were 75.23%, 31.18%, and 6.88%, respectively.
Figure 4

Calibration curves for predicting 1-, 3- and 5-year OS and CSS in patients with HCC. OS (A) and CSS (B) in the SEER discovery set; OS (C) and CSS (D) in the internal testing set; and OS (E) and CSS (F) in the external validation cohort. The nomogram-predicted probability of survival is plotted on the X-axis, and the actual survival rate is plotted on the Y-axis. Vertical bars indicate 95% CIs measured by Kaplan-Meier analysis.

Figure 5

Curves for OS and CSS in the entire SEER cohort (A and B) and curves for OS (C) and CSS (D) in the external validation cohort based on risk stratification by the nomogram models. DCA of the predictive nomograms and single factor models (TNM stage, histological grade and tumor size). The nomograms were compared against single factor models in terms of 1- and 3-year OS (E and F) and CSS (H and I). Clinical impact curves of the nomograms for OS (G) and CSS (J) in patients with HCC in the SEER discovery set. (E–H) Dashed lines indicate the net benefit of the models across a range of threshold probabilities. The horizontal solid black line represents the hypothesis that no patients reached the endpoint, and the solid gray line represents the hypothesis that all patients reached the endpoint. (G and J) At different threshold probabilities within a given population, the number of high-risk patients and the number of high-risk patients with the outcome were plotted.

Calibration curves for predicting 1-, 3- and 5-year OS and CSS in patients with HCC. OS (A) and CSS (B) in the SEER discovery set; OS (C) and CSS (D) in the internal testing set; and OS (E) and CSS (F) in the external validation cohort. The nomogram-predicted probability of survival is plotted on the X-axis, and the actual survival rate is plotted on the Y-axis. Vertical bars indicate 95% CIs measured by Kaplan-Meier analysis. Curves for OS and CSS in the entire SEER cohort (A and B) and curves for OS (C) and CSS (D) in the external validation cohort based on risk stratification by the nomogram models. DCA of the predictive nomograms and single factor models (TNM stage, histological grade and tumor size). The nomograms were compared against single factor models in terms of 1- and 3-year OS (E and F) and CSS (H and I). Clinical impact curves of the nomograms for OS (G) and CSS (J) in patients with HCC in the SEER discovery set. (E–H) Dashed lines indicate the net benefit of the models across a range of threshold probabilities. The horizontal solid black line represents the hypothesis that no patients reached the endpoint, and the solid gray line represents the hypothesis that all patients reached the endpoint. (G and J) At different threshold probabilities within a given population, the number of high-risk patients and the number of high-risk patients with the outcome were plotted.

Validation of the OS and CSS nomograms

The c-index for the OS prediction nomogram was 0.758 (95% CI, 0.746–0.770) in the SEER testing set, which was higher than the c-indexes for the TNM stage, grade and tumor size (Table 1). The c-index for CSS prediction was 0.752 (95% CI, 0.740–0.764) in the SEER testing set, which was higher than the c-index for any of the three single factors (Table 1). Likewise, the c-index for the OS prediction nomogram was 0.702 (95% CI, 0.647–0.757) in the SYMH cohort, which was higher than the c-indexes for TNM stage, grade and tumor size (Table 1). The curves for 1- and 3-year OS and CSS were generally well calibrated in the SEER testing set (Figure 4C and D) and the SYMH cohort (Figure 4E and F). Kaplan-Meier curves for OS and CSS in the low-, intermediate- and high-risk subgroups were plotted for the SYMH cohort (Figure 5C and D), and the results showed that the high-risk subgroups had the worst OS and CSS in the validation cohort.

Assessment of the value of the nomograms as clinical applications

The DCA results of the nomograms and other indexes (ie, histological grade, TNM stage and tumor size) are presented in Figure 5. The results showed that the nomogram indicated a better net benefit after 1 and 3 years than that achieved for the other indexes for predicting OS (Figure 5E and F) and CSS (Figure 5H and I) in the discovery set. Based on these results, we plotted additional curves for the clinical impact of the nomograms to help us more intuitively evaluate its significance. The OS nomogram showed that cost/benefit ratios were lower when the risk threshold was <0.7 (Figure 5G), while the CSS nomogram showed that cost/benefit ratios were lower when the risk threshold was <0.6 (Figure 5J) in the discovery set. Therefore, based on the DCA, the nomograms provided more net benefits for predicting OS (Figure S4A and B) and CSS (Figure S4C and D) in the testing set. Similarly, the nomograms had a stronger clinical impact in all validation sets including the SEER database and SYMH cohorts (Figure S4E and H).

Discussion

HCC is one of the most common cancers and the leading cause of cancer-related deaths worldwide, accounting for more than half of all cases and deaths in China.3,25 Thus, more precise treatment guidelines and follow-up management strategies are urgently needed for HCC patients. Nomogram models are statistical tools that can meet these requirements. Shim et al established and validated predictive nomograms with c-indicex of 0.69 based on data obtained from 1,085 patients with early-stage HCC who underwent curative resection in Asan Liver Center.7 He et al developed and validated a nomogram that predicted survival after recurrence in HCC patients with a high c-index of 0.797.15 However, previous studies on OS and CSS in HCC patients have been limited by a number of factors, such as single-center datasets, small sample sizes, and a lack of external validation. In the present study, we identified several conventional factors including age, sex, race, marital status, histological grade, TNM stage, tumor size, and surgery performed that significantly affected OS. Histological grade (undifferentiated) (HR=1.920, 95% CI=1.608–2.292, P<0.001) and TNM stage (stage IV) (HR=1.920, 95% CI=1.804–2.378, P<0.001) resulted in higher HRs than did other variables in the multivariate regression analysis. TNM stage has been regarded as the most important factor for predicting OS,4 and the staging system has been considered valuable for predicting survival in patients after liver transplantation.4,26 Interestingly, we found that being married was associated with a better prognosis of HCC (HR=0.9 and 0.822 relative to the OS and CSS of unmarried patients, respectively), which is consistent with the results of many other studies.27,28 Married patients with small intestinal adenocarcinoma have better OS and CSS than unmarried patients. Psychological and economic support from spouses may contribute to improvements in survival in married patients.29 A novel nomogram model that included age, tumor size, margin status and vascular invasion and alpha-fetoprotein (AFP) levels performed well in predicting prognosis in HCC patients after liver resection.30 Another interesting study found that a tumor size greater than 2 cm, multifocal tumors, and vascular invasion were three independent predictors of poor survival in patients with early-stage HCC after surgery.31 However, the effectiveness and accuracy of our OS prediction nomogram was better than that achieved using TNM stages (c-index, 0.753 vs 0.555) in the SEER discovery set. Additionally, the results obtained using our nomograms are more reasonable than those obtained used the universally acknowledged TNM stage and provide a prognostic estimate that can predict individual results. For example, imagine two HCC patients: X and Y. They have the same sex (male; 9 points) and TNM stage (II, 13 points), but different ages and histological grades, with one patients being 45 years old (0 point) and having a well-differentiated tumor (0 point) and the other being 75 years old (26 points) and having an undifferentiated tumor (56 points). However, both patients are stratified into stage II disease based on the TNM staging system, which is associated with specific outcomes. As has been widely acknowledged, these two patients will probably have different prognoses, but the question regarding how to quantify these prognoses remains unresolved. Calculating the total scores of these patients individually is simple according to our OS prediction nomogram: patient X has 22 points, while patient Y has 104 points. The OS nomogram indicates that the 1-, 3-, and 5-year OS probabilities for patient X are nearly 88.5%, 77% and 68%, respectively, while the probabilities are nearly 75%, 49.5% and 35%, respectively, for patient Y. More interestingly, for the CSS nomogram, we identified six clinical factors, namely, sex, marital status, histological grade, TNM stage, tumor size and surgery performed, but not patient age or race, as significant independent prognostic predictors. In contrast, in the OS prediction nomogram, the magnitude of a poor prognosis was found to increase with age, given the same histological grade and TNM stage. Most of the predictive factors included in our models were the same as those included in other well-accepted models (ie, age, sex, marital status, histological grade, and TNM stage),30,31 but our data included a larger and worldwide sample, which allowed us to estimate the contribution of additional factors and to build independent nomogram models for OS and CSS at cutoff points of 1, 3 and 5 years. However, neither risk stratification nor discrimination could be used associate the clinical consequences of fixed discrimination or calibration. Therefore, using DCA, we evaluated whether nomogram-based medical decisions and strategies could improve patient prognoses to show the clinical value of the nomograms,23,24 even though many other clinical predictive models ignore this factor. We showed that our nomograms add more benefit when used to predict OS and CSS. Huang et al revealed EXT2, ETV5, and CHODL as independent prognostic factors of HCC by univariate and multivariate Cox analyses from a public database and established a novel nomogram by integrating the three molecular proteins and the TNM staging system, which displayed good performance in predicting long-term prognosis in HCC patients. They focused on the correlation between the expression of molecular markers and patient prognosis. However, we used a large population from the international SEER database to avoid heterogeneity among different medical centers and used the variables that are available and easily obtainable in clinical practice to construct our nomogram models.32 Adhoute et al found that the Barcelona Clinic Liver Cancer (BCLC) nomogram and the NIACE score provided the best prognostic information, but the NIACE could even help treatment strategies because of the low Akaike’s information criterion (AIC) value in a large French HCC cohort. Interestingly, they focused on several current common staging systems and compared their prognostic prediction abilities. In particular, the authors reassigned detailed and accurate scores of risk factors in the BCLC staging system using the nomogram model, whereas our study mainly focused on mining and building a new prognostic staging system based on clinical factors from a worldwide cohort.33 The present study has several merits that should be noted. First, a large population from the international SEER database was used to avoid heterogeneity among different medical centers. The results were also externally validated using the SYMH cohort. Second, the variables included in the nomogram models are available and easily obtainable in routine clinical practice. Third, using DCA, we plotted curves for the clinical impact of the nomograms, and this helped us to more intuitively understand the significant value of the nomograms in a clinical setting. Nevertheless, several potential limitations should also be noted. First, this is a retrospective study. Second, we only analyzed common prognostic factors, while blood and inflammation indicators were not used in multivariable survival analyses. Third, there is a possibility that residual confounding occurred after internal validation as a consequence of overfitting from variables and selection of threshold. However, we performed bootstrapping during internal testing and external validation. Fourth, the BCLC stages were not recorded in the SEER database; thus, we could not compare different outcomes from our nomogram model with those using the BCLC staging system. In the future, we will optimize the model by combining prognostic factors in our study with chronic liver disease condition, other interventions and the severity of liver dysfunction in HCC patients from multicenter cohorts in China.

Conclusions

In conclusion, based on the clinical risk factors identified in a large population-based cohort, we established practical prognostic nomograms that can objectively and accurately predict long-term OS and CSS in HCC around the world. Moreover, the internal and external cohort validation results demonstrate that these nomograms perform very well and have high accuracy and reliability. Our nomograms were demonstrated to be clinically useful based on a DCA and should therefore help clinicians improve individual treatment, make clinical decisions and guide follow-up management strategies in HCC patients.
Table S1

Demographic and clinical characteristics of HCC patients in the SEER and SYMH cohorts.

Clinicopathological VariablesSEER Cohort (n=15394)P ValueExternal validation cohort
Entire cohortTrainingn=10262 (%)Testingn=5132 (%)SYMH (n=244)
Age-6057303800(37.03)1930(37.61)0.425185(75.82)
60-7041662822(27.50)1344(26.19)43(17.62)
70-8030161999(19.48)1017(19.82)16(6.56)
80-1304878(8.56)426(8.30)0(0)
GenderFemale37832541(24.76)1242(24.20)0.45821(8.61)
male116117721(75.24)3890(75.80)223(91.39)
RaceWhite104046968(67.90)3436(66.95)0.066244(100)
Asian28561898(18.50)958(18.67)0(0)
Black19621297(12.64)665(12.96)0(0)
American Indian17299(0.96)73(1.42)0(0)
MarriageNo27621869(18.21)893(17.40)0.22488(36.07)
Yes126328393(81.79)4239(82.60)156(63.93)
Tumor size≥5 cm81865466(53.26)2720(53.00)0.770105(43.03)
<5 cm72084796(46.74)2412(47.00)139(56.97)
Histological gradeI49883337(32.52)1651(32.17)0.759142(58.20)
II70804730(46.09)2350(45.79)
III30852032(19.80)1053(20.52)102(41.80)
IV241163(1.59)78(1.52)
TNM stageI66654442(43.29)2223(43.32)0.356100(40.98)
II34992368(23.08)1131(22.04)()41(16.80)
III36012389(23.28)1212(23.62)93(38.11)
IV16291063(10.36)566(11.03)10(4.10)
SurgeryNo75465006(48.78)2540(49.49)0.4150(0)
Yes78485256(51.22)2592(50.51)244(100)
OS057033830(37.32)1873(36.50)0.326123(50.41)
196916432(62.68)3259(63.50)121(49.59)
CSS087575870(57.20)2887(56.25)0.271136(55.74)
166374392(42.80)2245(43.75)108(44.26)

Abbreviations: SYMH, Sun Yat-Sen Memorial Hospital; TNM, tumor lymph node metastasis; OS, overall survival; CSS, cancer-specific survival. The chi-square test was used for comparisons between the discovery and testing sets.

Table S2

Univariate Cox regression analyses of clinicopathological parameters in the SEER discovery set.

Overall survivalCancer-specific survival
SEER discovery setHR95% CIP valueHR95% CIP value
Age
-6011
60-701.1551.085-1.228<0.0011.0751.000-1.1560.049
70-801.5411.444-1.645<0.0011.2231.129-1.325<0.001
80-2.1101.945-2.290<0.0011.5171.366-1.684<0.001
Gender
Female11
Male1.1041.042-1.1700.0011.1241.048-1.2060.001
Marriage
Single11
Married0.9000.845-0.9580.0010.8220.763-0.885
Race
Asian11
American1.3711.052-1.7850.0191.3691.007-1.8610.045
White1.2671.185-1.354<0.0011.1401.054-1.2340.001
Black1.5201.390-1.661<0.0011.4101.269-1.566<0.001
Grade
I11
II1.0080.952-1.0670.7831.0751.001-1.1530.046
III1.5791.477-1.689<0.0011.8411.698-1.996<0.001
IV1.9411.630-2.312<0.0012.5292.084-3.067<0.001
TNM
I11
II1.0260.958-1.0990.4601.1121.020-1.2130.016
III2.6962.536-2.867<0.0013.3073.068-3.565<0.001
IV4.9784.605-5.382<0.0016.4435.879-7.061<0.001
Surgery
No11
Yes0.2380.225-0.251<0.0010.2130.200-2.228<0.001
Size
<5 cm11
≥5 cm2.2972.186-2.415<0.0012.7172.556-2.889<0.001

Abbreviations: SYMH, Sun Yat-Sen Memorial Hospital; TNM, tumor lymph node metastasis; HR: hazard ratio; 95% CI, 95% confidence interval. * P<0.05, ** P<0.01.

Table S3

Point assignments and prognostic scores for each variable in the nomogram models

VariablesClassificationNomogram score
OSCSS
GenderFemale00
Male96
Age-600NA
60-7010NA
70-8026NA
80-37NA
MarriageSingle911
Married00
RaceAsian0NA
American25NA
White16NA
Black24NA
Gradewell differentiated00
moderately differentiated1619
poorly differentiated4146
undifferentiated5668
TNMI00
II1318
III4144
IV8284
SurgeryNo100100
Yes00
Tumor size<5 cm00
≥5 cm2837

Abbreviations: TNM, tumor lymph node metastasis; OS, overall survival; CSS, cancer-specific survival; NA, not available.

  10 in total

1.  Development and Validation of Nomogram for Predicting Survival of Primary Liver Cancers Using Machine Learning.

Authors:  Rui Chen; Beining Hou; Shaotian Qiu; Shuai Shao; Zhenjun Yu; Feng Zhou; Beichen Guo; Yuhan Li; Yingwei Zhang; Tao Han
Journal:  Front Oncol       Date:  2022-06-20       Impact factor: 5.738

2.  Diagnostic and prognostic values of upregulated SPC25 in patients with hepatocellular carcinoma.

Authors:  Xiaolin Yang; Hongzhi Sun; Ying Song; Li Yang; Haibo Liu
Journal:  PeerJ       Date:  2020-07-16       Impact factor: 2.984

3.  A practical nomogram and risk stratification system predicting the cancer-specific survival for patients with early hepatocellular carcinoma.

Authors:  Bing Yan; Bing-Bing Su; Dou-Sheng Bai; Jian-Jun Qian; Chi Zhang; Sheng-Jie Jin; Guo-Qing Jiang
Journal:  Cancer Med       Date:  2020-12-06       Impact factor: 4.452

4.  A Prognostic Scoring System for Predicting Overall Survival of Patients with the TNM 8th Edition Stage I and II Hepatocellular Carcinoma After Surgery: A Population-Based Study.

Authors:  Yannan Bai; Yuan'e Lian; Jiayi Wu; Shi Chen; Jianlin Lai; Yu Zheng; Yifeng Tian; Maolin Yan; Yaodong Wang
Journal:  Cancer Manag Res       Date:  2021-03-02       Impact factor: 3.989

5.  A Web-Based Prediction Model for Cancer-Specific Survival of Elderly Patients With Early Hepatocellular Carcinoma: A Study Based on SEER Database.

Authors:  Taiyu He; Tianyao Chen; Xiaozhu Liu; Biqiong Zhang; Song Yue; Junyi Cao; Gaoli Zhang
Journal:  Front Public Health       Date:  2022-01-13

6.  Construction and Validation of a Risk Prediction Model for Postoperative Urinary Retention in Lung Cancer Patients.

Authors:  Wei Zheng; Xu Zhang; Xu Zheng; Yicheng Liang; Yan Liu; Yushun Gao
Journal:  J Healthc Eng       Date:  2022-03-11       Impact factor: 2.682

7.  Development and Validation of a Nomogram to Predict Cancer-Specific Survival for Middle-Aged Patients With Early-Stage Hepatocellular Carcinoma.

Authors:  Chong Wen; Jie Tang; Hao Luo
Journal:  Front Public Health       Date:  2022-02-28

8.  A prediction model for postoperative urinary retention after thoracic surgery.

Authors:  Benjamin Wei; Ammar Asban; Rongbing Xie; Zachary Sollie; Luqin Deng; Thomas K DeLay; William B Swicord; Rajat Kumar; James K Kirklin; James Donahue
Journal:  JTCVS Open       Date:  2021-05-26

9.  A competing risk nomogram predicting cause-specific mortality in patients with lung adenosquamous carcinoma.

Authors:  Xiao Wu; Wenfeng Yu; R H Petersen; Hongxu Sheng; Yiqing Wang; Wang Lv; Jian Hu
Journal:  BMC Cancer       Date:  2020-05-16       Impact factor: 4.430

10.  Prognostic nomogram to predict the overall survival of patients with early-onset colorectal cancer: a population-based analysis.

Authors:  Junxian Wu; Linbin Lu; Hong Chen; Yihong Lin; Huanlin Zhang; Enlin Chen; Weiwei Lin; Jie Li; Xi Chen
Journal:  Int J Colorectal Dis       Date:  2021-07-29       Impact factor: 2.571

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.