| Literature DB >> 34453418 |
Qiang Su1,2,3,4, Zhenyu Liu5,6,7, Chi Chen1,4, Han Gao1,4, Yongbei Zhu1,4, Liusu Wang1,4, Meiqing Pan1,4, Jiangang Liu1,4, Xin Yang5,7, Jie Tian1,4,5,6,8.
Abstract
BACKGROUND: This study evaluated the predictive value of gene signatures for biochemical recurrence (BCR) in primary prostate cancer (PCa) patients.Entities:
Keywords: LASSO-Cox regression; biochemical recurrence-free survival; gene signature; primary prostate cancer; radical therapy
Mesh:
Substances:
Year: 2021 PMID: 34453418 PMCID: PMC8446568 DOI: 10.1002/cam4.4092
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Characteristics of the included datasets
| Dataset | Country | Number of samples | GPL | Number of genes |
|---|---|---|---|---|
|
| United Kingdom | 110T | GPL10558 | 48,107 |
|
| United Kingdom | 86T | GPL10558 | 48,107 |
|
| United Kingdom | 223T | GPL 25318 | 121,563 |
| TCGA | N/A | 403T | N/A | 5,6754 |
Abbreviations: GPL, Gene Expression Omnibus Platform; GSE, Gene Expression Omnibus Series; N/A, not applicable; T, tumor samples; TCGA, The Cancer Genome Atlas.
The characteristics of patients with prostate cancer in the training set and validation set
| Characteristics | Training set ( | Validation set ( |
|---|---|---|
| cT stage | ||
| T1 | 151 (36.0) | 150 (37.2) |
| T2 | 147 (35.1) | 140 (34.7) |
| T3 | 117 (27.9) | 45 (11.2) |
| T4 | 4 (1) | 1 (0.3) |
| Unknow | 0 (0) | 67 (16.6) |
| Gleason | ||
| 5 | 2 (0.5) | 0 (0) |
| 6 | 72 (17.2) | 37 (9.2) |
| 7 | 227 (54.2) | 198 (49.1) |
| 8 | 60 (14.3) | 56 (13.9) |
| 9 | 57 (13.6) | 109 (27.0) |
| 10 | 1 (0.2) | 3 (0.8) |
| Biochemical recurrence | ||
| Yes | 93 (22.2) | 52 (12.9) |
| No | 326 (77.8) | 351 (87.1) |
| Follow‐up time (months, mean ± SD) | 45.61±19.49 | 28.53±17.70 |
Abbreviations: cT, clinical tumor; SD, standard deviation.
FIGURE 1Selection strategy for gene signatures. (A) “Leave‐one‐out” cross‐validation for parameter selection in LASSO‐COX regression models, and the optimal λ value of 0.11517381 with log(λ) = −2.1613129 was selected; (B) six BCRFS‐associated gene signatures were selected by LASSO‐COX models. LASSO, least absolute shrinkage and selection operator method; BCRFS, biochemical recurrence‐free survival
FIGURE 2Risk score distribution, BCRFS status, and expression pattern of BCRFS‐associated gene signatures in both cohorts. (A) The scattergram of the risk score in the training set; (B) the scattergram of the risk score in the validation set; (C) BCRFS time/BCR status in the training set; (D) BCRFS time/BCR status in the validation set; (E) the expression pattern of six BCRFS‐associated gene signatures in the training set; (F) the expression pattern of six BCRFS‐associated gene signatures in the validation set. BCRFS, biochemical recurrence‐free survival
Univariate and multivariate Cox proportional hazards regression analyses for predicting biochemical recurrence in the training set (n = 419) and validation set (n = 403)
| Variables | Univariate Cox analysis | Multivariate Cox analysis | ||
|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | |||
| Training set ( | ||||
| Gleason score | ||||
| Cont. | 1.355 (1.108–1.656) | 0.003** | 1.426 (1.158–1.757) | <0.001*** |
| Risk score | ||||
| Cont. | 11.417 (7.160–18.206) | <0.001*** | 11.584 (7.313–18.349) | <0.001*** |
| Validation set ( | ||||
| Gleason score | ||||
| Cont. | 2.036 (1.540–2.693) | <0.001*** | 1.639 (1.178–2.281) | 0.003** |
| Risk score | ||||
| Cont. | 3.884 (2.383–6.331) | <0.001*** | 2.215 (1.181–4.154) | 0.013* |
Abbreviations: CI, confidence interval; Cont, continuous; HR, hazard ratio.
P value < 0.05
P value < 0.01
P value < 0.001.
FIGURE 3Gene signatures can predict BCRFS in both cohorts. (A) K–M survival curves for the training set indicated that better BCRFS was associated with significantly lower risk score; (B) K–M survival curves for the validation set indicated that better BCRFS was associated with significantly lower risk score; (C) Time‐dependent ROC revealed that the risk score was an excellent predictor for BCRFS in the training set; (D) Time‐dependent ROC revealed that the risk score was an excellent predictor for BCRFS in the validation set. BCRFS, biochemical recurrence‐free survival; K–M, Kaplan–Meier; ROC, receiver operating characteristic
FIGURE 4Nomogram prediction of BCRFS probability. The risk score and Gleason score were used to establish the nomogram for predicting 3 and 5‐year BCRFS in the training set. The dominant factor was the risk score. BCRFS, biochemical recurrence‐free survival
FIGURE 5Calibration plots of the nomogram. (A) Three‐year calibration plot of nomogram in the training set; (B) 5‐year calibration plot of nomogram in the training set; (C) 3‐year calibration plot of nomogram in the validation set; (D) 5‐year calibration plot of nomogram in the validation set; the nomogram's performance was excellent for predicting the 3‐year BCRFS and 5‐year BCRFS in both cohorts. BCRFS, biochemical recurrence‐free survival
The C‐index of the nomogram and other factors in the training and validation sets
| Variables | Training set | Validation set |
|---|---|---|
| C‐index | C‐index | |
| Nomogram | 0.793 | 0.722 |
| Gleason score | 0.588 | 0.676 |
| Risk score | 0.790 | 0.710 |
Abbreviation: C‐index, concordance index.
DEGs between low‐risk cases and high‐risk cases in the training set, under cut‐off criteria of |logFC| > 0.5 and adjusted P value < 0.05. For each gene, the LogFC, AveExpr, P value, and FDR from limma are given
| Gene | LogFC | AveExpr | FDR | |
|---|---|---|---|---|
| TPX2 | 4.79279 | 5.08858 | 6.63E−55 | 3.98E−54 |
| PHYHD1 | −5.05780 | 8.83455 | 3.52E−43 | 1.06E−42 |
| AURKA | 0.79642 | 3.53217 | 1.81E−10 | 3.61E−10 |
Abbreviations: AveExpr, average expression; DEGs, differentially expressed genes; FDR, false discovery rate adjusted P value; LogFC, log fold change.
FIGURE 6Bioinformatical analysis of three DEGs. (A) Three major categories were included in the bubble plots of GO analysis; (B) two enriched terms of KEGG pathway shown in bubble plot; (C) a chord plot was used to visualize the top three GO terms of BP, CC, and MF, respectively. DEGs, differentially expressed genes; GO, Gene Ontology; BP, biological process; CC, cellular component; MF, molecular function; KEGG, Kyoto Encyclopedia of Gene and Genomes