| Literature DB >> 34885197 |
Zihang Mai1,2,3, Qianwen Liu1,2,3, Xinye Wang1,2,3, Jiaxin Xie4, Jianye Yuan1,2,3, Jian Zhong1,2,3, Shuogui Fang1,2,3, Xiuying Xie2,3, Hong Yang1,2,3, Jing Wen2,3, Jianhua Fu1,2,3.
Abstract
Esophageal squamous cell carcinoma (ESCC) is one of the deadliest malignancies in China. The prognostic value of mutations, especially those in minor tumor clones, has not been systematically investigated. We conducted targeted deep sequencing to analyze the mutation status and the cancer cell fraction (CCF) of mutations in 201 ESCC patients. Our analysis showed that the prognostic effect of mutations was relevant to the CCF, and it should be considered in prognosis prediction. EP300 was a promising biomarker for overall survival, impairing prognosis in a CCF dose-dependent manner. We constructed a CCF-based predictor using a smooth clipped absolute deviation Cox model in the training set of 143 patients. The 3-year disease-free survival rates were 6.3% (95% CI: 1.6-23.9%), 29.8% (20.9-42.6%) and 70.5% (56.6-87.7%) in high-, intermediate- and low-risk patients, respectively, in the training set. The prognostic accuracy was verified in a validation set of 58 patients and the TCGA-ESCC cohort. The eight-gene model predicted prognosis independent of clinicopathological factors and the combination of our model and pathological staging markedly improved the prognostic accuracy of pathological staging alone. Our study describes a novel recurrence predictor for ESCC patients and provides a new perspective for the clinical translation of genomic findings.Entities:
Keywords: cancer cell fraction; esophageal cancer; mutation; prognosis prediction; tumor heterogeneity
Year: 2021 PMID: 34885197 PMCID: PMC8656931 DOI: 10.3390/cancers13236084
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Main clinical characteristics of patients in the training and validation cohorts.
| Variables | Training Set (N = 143) | Validation Set (N = 58) | Full Set (N = 201) | |
|---|---|---|---|---|
| N (%) | N (%) |
| N (%) | |
| Sex | 0.002 | |||
| Female/Male | 17/126 (11.9/88.1) | 18/40 (31.0/69) | 35/166 (17.4/82.6) | |
|
| 0.206 | |||
| <60/≥60 | 70/73 (49.0/51.0) | 22/36 (37.9/62.1) | 92/109 (45.8/54.2) | |
|
| 0.222 | |||
| <4 cm/≥4 cm | 67/76 (46.9/53.1) | 21/37 (36.2/63.8) | 88/113 (43.8/56.2) | |
| Smoking status | 0.063 | |||
| Yes/No | 98/45 (68.5/31.5) | 31/27 (53.5/46.5) | 129/72 (64.2/35.8) | |
| Alcoholism | 0.520 | |||
| Yes/No | 73/70 (51.0/49.0) | 26/32 (44.8/55.2) | 99/102 (49.3/50.7) | |
| Differentiation | 0.343 | |||
| Well/Moderate/Poor | 17/71/55 (11.9/49.7/38.5) | 8/34/16 (13.8/58.6/27.6) | 25/105/71 (12.4/52.2/35.3) | |
| Surgical approach | 0.609 | |||
| Left thoracotomy | 49(34.3) | 17(29.3) | 66(32.8) | |
| Right thoracotomy | 94(65.7) | 41(70.7) | 135(67.2) | |
| Lesion location | 0.151 | |||
| Upper/Middle/Lower | 8/54/51 (5.6/58.7/35.7) | 8/31/19 (13.8/53.4/32.8) | 16/115/70 (7.9/57.2/34.9) | |
|
| 0.848 | |||
|
| 23/120(16.1/83.9) | 8/50(13.7/86.3) | 31/170 (15.4/84.6) | |
|
| 1.000 | |||
|
| 70/73 (49.0/51.0) | 29/29 (50.0/50.0) | 99/102(49.3/20.7) | |
|
| 0.696 | |||
| ≥21/<21 | 108/35 (75.5/24.5) | 46/12 (79.3/20.7) | 154/47 (76.4/23.4) | |
| Adjuvant therapy | 1.00 | |||
| Yes/No | 43/100 (30.1/69.9) | 17/41 (29.3/70.7) | 60/141 (29.8/69.2) | |
†: pathological T classification. ‡: pathological LN classification. §: stratified by median of variables.
Figure 1Heterogeneity and clinical impact of alterations in ESCC. (A) Bar plot for comparison of mutation frequencies of the most frequently mutated genes observed in previous results and our cohort. (B) Scatter plot of cancer cell fraction of mutations in these frequently mutated genes. Clonal mutations are shown in red and subclonal mutations in blue. (C) Predicted number of subclones in ESCC. For comparison, the predicted number of subclones from patients in TCGA-ESCC cohort is also shown. Error bar represents the standard deviation. (D) Disease-free survival difference between patients with subclonal FRY mutations and wildtype FRY. (E) Volcano plot displays the relationship between genetic alterations and DFS. The X and Y axes indicate and , respectively. “Amp” and “Del” represent the amplification and deletion of the gene, respectively. (F) TP53 R141C hotspot mutation was associated with inferior DFS. (G) EP300 mutation was associated with grave OS.
Figure 2CCF–based patterns of prognostic value. The clinical endpoint analyzed here was DFS. Three prognostic effect patterns were identified by on our classification pipeline: CCF-independent pattern (A), CCF–dominant pattern (B) and CCF dose–dependent pattern (C).
Figure 3Performance of the eight-gene model in the training and validation sets. (A) AUC of the time–dependent ROC curve for the eight–gene predictor and the individual genes across the entire cohort. (B,C) Patients stratified by risk scores had distinct DFS in the training set (B) and validation set (C). (D–F) Time–dependent ROC curves compared the prognostic accuracies of the combined model integrating the eight–gene signature and pathological staging with pathological staging in the (D) training cohort, (E) validation cohort and TCGA–ESCC cohort (F).
Multivariate Cox regression of the eight-gene-based predictor and clinicopathological factors. †: pathological T classification. ‡: pathological LN classification. §: stratified by median of variables.
| Variables | Entire Cohort | Training Cohort | Validation Cohort | |||
|---|---|---|---|---|---|---|
| HR (95%CI) |
| HR (95%CI) |
| HR (95%CI) |
| |
| Sex | 0.45 (0.26–0.77) | 0.004 | 1.18 (0.61–2.31) | 0.625 | 0.13 (0.05–0.38) | 0.0001 |
| 0.75 (0.52–1.09) | 0.130 | 0.67 (0.44–1.03) | 0.071 | 0.45 (0.20–0.99) | 0.046 | |
| Surgical | 0.72 (0.50–1.05) | 0.102 | 0.52 (0.33–0.82) | 0.005 | 1.17 (0.54–2.56) | 0.688 |
| Alb (≥40 vs. <40) | 0.46 (0.11–2.01) | 0.304 | 1.86 (0.24–14.18) | 0.548 | 0.30 (0.033–2.56) | 0.294 |
| 1.19 (0.83–1.71) | 0.351 | 0.99 (0.64–1.53) | 0.952 | 1.51 (0.74–3.08) | 0.262 | |
| 1.03 (0.60–1.77) | 0.902 | 1.31 (0.69–2.50) | 0.408 | 0.43 (0.15–1.28) | 0.128 | |
| 2.37 (1.63–3.44) | <0.001 | 2.71 (1.73–4.25) | <0.001 | 2.29 (1.07–4.92) | 0.033 | |
| Genetic model | - | - | - | - | - | - |
| Intermediate risk | 2.74 (1.59–4.72) | <0.001 | 2.65 (1.40–5.03) | 0.003 | 3.52 (1.23–10.10) | 0.019 |
| High risk | 6.50 (3.60–11.75) | <0.001 | 6.74 (3.34–13.58) | <0.001 | 8.19 (2.484–27.01) | <0.001 |
Figure 4Risk stratification of patients with staging. (A) Sankey plot displaying the relationship between the risk scores of patients and different pathological stages. (B) Survival curves showing that patients with but low genetic risks had a relatively longer DFS time. (C) Heatmap showing the mutation profile of ESCC patients in different recurrence risk. PREX2 and SPATA31D1 mutations were enriched in low-risk patients.