| Literature DB >> 32368186 |
Haohan Liu1,2, Yongcong Yan1,2, Ruibing Chen1,2, Mengdi Zhu1, Jianhong Lin1,2, Chuanchao He2, Bingchao Shi2, Kai Wen2, Kai Mao2, Zhiyu Xiao2.
Abstract
BACKGROUND: The primary tumor, regional lymph nodes and distant metastasis (TNM) stage is an independent risk factor for 1-year hepatocellular carcinoma (HCC) recurrence but has insufficient predictive efficiency. We attempt to develop and validate a nomogram to predict 1-year recurrence in HCC and improve the predictive efficiency of the TNM stage.Entities:
Keywords: 1-year recurrence; Hepatocellular carcinoma; Nomogram; Risk score model; TNM stage
Year: 2020 PMID: 32368186 PMCID: PMC7189530 DOI: 10.1186/s12935-020-01216-9
Source DB: PubMed Journal: Cancer Cell Int ISSN: 1475-2867 Impact factor: 5.722
Fig. 2Independent risk factors for 1-year recurrence and stage-related gene identification in the discovery cohort. a The red line vertical to the X axis highlights the 1-year cutoff for samples showing recurrence, and the blue line highlights the 2-year cutoff. The black line vertical to the Y axis indicates the median recurrence time. b HRs and 95% CIs for risk factors are respectively represented by blue blocks and lines. c The ROC curve of the TNM stage for predicting 1-year recurrence was plotted, and the AUC was calculated. d Blue circles represent DEGs associated with recurrence at 1 year, and red circles represent DEGs associated with advanced stages. Purple areas indicate overlapping DEGs, and the counts of DEGs are shown. e Representative box plots of tumor/normal DEGs are shown. Red boxes: tumors, gray boxes: normal tissues, black dots: samples, black bars: means and standard deviations. *P < 0.05, **P < 0.01, ***P < 0.001
Fig. 1Study flowchart. The design and procedure of our study were shown. Stage-related genes are screened from discovery cohort (n = 434). Novel RS model and nomogram are constructed with TCGA training cohort (n = 182). Testing cohort consists of internal validation cohort (n = 92) and external validation cohort (n = 107)
Clinical characteristics of the training cohort and testing cohort
| Clinical characteristics | Entire TCGA | Chi square test | ||
|---|---|---|---|---|
| Training cohort (n = 182) | Testing cohort (n = 92) | |||
| Sex | ||||
| Female | 83 | 56 | 27 | 0.809 |
| Male | 191 | 126 | 65 | |
| Age | ||||
| < 60 | 134 | 94 | 50 | 0.395 |
| ≥ 60 | 140 | 98 | 42 | |
| TNM stage | ||||
| I | 149 | 93 | 46 | 0.303 |
| II | 65 | 38 | 27 | |
| III | 68 | 50 | 18 | |
| IV | 2 | 1 | 1 | |
| Histologic grade | ||||
| G1 | 37 | 25 | 12 | 0.685 |
| G2 | 130 | 88 | 42 | |
| G3 | 97 | 61 | 36 | |
| G4 | 10 | 8 | 2 | |
TCGA The Cancer Genome Atlas, TNM primary tumor, regional lymph nodes and distant metastasis
Fig. 3Construction of the prognostic RS model in the training cohort. a Selection of the tuning parameter (lambda) in the LASSO model by tenfold cross-validation based on minimum criteria for 1-year recurrence; the lower X axis shows the log (lambda), and the upper X axis shows the average number of stage-related genes. The Y axis indicates the partial likelihood deviance error. The red dots represent the average partial likelihood deviances for every model with a given lambda, and the vertical bars indicate the upper and lower values of the partial likelihood deviance error. The vertical gray lines define the optimal values of lambda, which provide the best fit. b LASSO coefficient profiles of 16 stage-related genes. The vertical black dotted lines are plotted at the value selected. c Kaplan–Meier analysis of 1-year DFS between the high-risk group (red) and the low-risk group (blue). d Heat map of five stage-related genes in the prognostic signature. e Representative GSEA plot (DNA replication KEGG pathway) of the high-risk group versus the low-risk group. ES and FDR were also shown
Univariate and multivariate Cox analysis of the training cohort
| Recurrence at 1 year | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI | HR | 95% CI | |||
| Sex | ||||||
| Female | ||||||
| Male | 0.596 | 0.356–0.996 | 0.048* | 0.670 | 0.397–1.129 | 0.132 |
| Age | ||||||
| < 60 | ||||||
| ≥ 60 | 0.842 | 0.510–1.391 | 0.502 | |||
| TNM stage | ||||||
| I | ||||||
| II | 2.011 | 0.993–4.074 | 0.052 | 1.949 | 0.962–3.949 | 0.064 |
| III | 3.423 | 1.909–6.139 | < 0.001*** | 2.911 | 1.616–5.243 | < 0.001*** |
| IV | 5.163 | 0.688–38.733 | 0.110 | 6.424 | 0.816–50.602 | 0.077 |
| Histologic grade | ||||||
| G1 | ||||||
| G2 | 0.983 | 0.447–2.165 | 0.967 | |||
| G3 | 1.230 | 0.550–2.751 | 0.613 | |||
| G4 | 1.259 | 0.334–4.748 | 0.734 | |||
| RS | ||||||
| Low | ||||||
| High | 3.416 | 2.041–5.717 | < 0.001*** | 3.199 | 1.891–5.411 | < 0.001*** |
HR Hazard ratio, CI confidence interval, TNM primary tumor, regional lymph nodes and distant metastasis, RS risk score
*P < 0.05, ***P < 0.001
Fig. 4Nomogram based on the TNM stage and RS model for predicting 1-year recurrence in the training cohort. All points assigned on the top point scale for each factor are summed together to generate a total point score. The total point score is projected on the bottom scale to determine the 1-year DFS probability for an individual
Fig. 5ROC curve analysis, DCA and clinical impact curve analysis in the training cohort, the testing cohort and the external validation cohort. a–c Comparisons of the predictive value of the nomogram (orange), TNM stage (blue) and RS (green) for 1-year recurrence according to ROC analysis. ROC curves in the training cohort (a), the testing cohort (b) and the external validation cohort (c). The AUC and 95% CI were calculated. d–f DCA of the nomogram (red) and TNM stage (blue) for predicting 1-year recurrence in the training cohort (d), the testing cohort (e) and the external validation cohort (f). The X axis shows the high-risk threshold, and the Y axis represents the standardized net benefit. g–i Clinical impact curves of the nomogram for predicting 1-year recurrence in the training cohort (g), the testing cohort (h) and the external validation cohort (i). The number of high-risk patients (black dotted line) and the number of high-risk patients with events (red solid line) are plotted