| Literature DB >> 34257547 |
Lin Wang1,2, Xueping Li1,2, Lan Zhao1,2, Longyang Jiang1,2, Xinyue Song1,2, Aoshuang Qi1,2, Ting Chen1,2, Mingyi Ju1,2, Baohui Hu1,2, Minjie Wei1,2, Miao He1,2, Lin Zhao1,2.
Abstract
Esophageal cancer (ESCA) is a leading cause of cancer-related mortality, with poor prognosis worldwide. DNA damage repair is one of the hallmarks of cancer. Loss of genomic integrity owing to inactivation of DNA repair genes can increase the risk of cancer progression and lead to poor prognosis. We aimed to identify a novel gene signature related to DNA repair to predict the prognosis of ESCA patients. Based on gene expression profiles of ESCA patients from The Cancer Genome Atlas and gene set enrichment analysis, 102 genes related to DNA repair were identified as candidates. After stepwise Cox regression analysis, we established a five-gene prognostic model comprising DGCR8, POM121, TAF9, UPF3B, and BCAP31. Kaplan-Meier survival analysis confirmed a strong correlation between the prognostic model and survival. Moreover, we verified the clinical value of the prognostic signature under the influence of different clinical parameters. We found that small-molecule drugs (trametinib, selumetinib, and refametinib) could help to improve patient survival. In summary, our study provides a novel and promising prognostic signature based on DNA-repair-related genes to predict survival of patients with ESCA. Systematic data mining provides a theoretical basis for further exploring the molecular pathogenesis of ESCA and identifying therapeutic targets.Entities:
Keywords: DNA repair; esophageal cancer; overall survival; prognostic biomarkers; small molecular drugs; targeted therapy
Mesh:
Substances:
Year: 2021 PMID: 34257547 PMCID: PMC8262199 DOI: 10.3389/pore.2021.596899
Source DB: PubMed Journal: Pathol Oncol Res ISSN: 1219-4956 Impact factor: 3.201
Summary of clinical characteristics of ESCA patients in three cohorts.
| Characteristic | Patients in entire TCGA set ( | Patients in subgroup 1 ( | Patients in subgroup 2 ( |
|---|---|---|---|
| Age (years) | |||
| ≤60 | 81 (50.94%) | 44 (55.70%) | 37 (46.25%) |
| >60 | 78 (49.06%) | 35 (44.30%) | 43 (53.75%) |
| Gender | |||
| Female | 23 (14.47%) | 13 (16.46%) | 10 (12.50%) |
| Male | 136 (85.53%) | 66 (83.54%) | 70 (87.5%) |
| Histological type | |||
| Esophagus adenocarcinoma, NOS | 79 (49.69%) | 36 (45.57%) | 43 (53.75%) |
| Esophagus squamous cell carcinoma | 80 (50.31%) | 43 (54.43%) | 37 (46.25%) |
| Vital status | |||
| Alive | 96 (60.38%) | 47 (59.49%) | 49 (61.25%) |
| Dead | 63 (39.62%) | 32 (40.51%) | 31 (38.75%) |
| Pathologic stage | |||
| Stage I-II | 87 (54.72%) | 46 (58.23%) | 41 (51.25%) |
| Stage III-IV | 68 (42.77%) | 32 (40.51%) | 36 (45.00%) |
| NA | 4 (2.51%) | 1 (1.26%) | 3 (3.75%) |
| Race | |||
| Asian | 38 (23.90%) | 22 (27.85%) | 16 (20.00%) |
| Black or african american | 5 (3.14%) | 2 (2.53%) | 3 (3.75%) |
| White | 98 (61.64%) | 50 (63.29%) | 48 (60.00%) |
| NA | 18 (11.32%) | 5 (6.33%) | 13 (16.25%) |
| N Classification | |||
| N0-N1 | 133 (83.65%) | 65 (82.28%) | 68 (85.00%) |
| N2-N3 | 14 (8.80%) | 7 (8.86%) | 7 (8.75%) |
| NA | 12 (7.55%) | 7 (8.86%) | 5 (6.25%) |
| T classification | |||
| T1 | 25 (15.72%) | 14 (17.72%) | 11 (13.75%) |
| T2-T4 | 132 (83.02%) | 63 (79.75%) | 69 (86.25%) |
| NA | 2 (1.26%) | 2 (2.53%) | 0 (0.00%) |
| M classification | |||
| M0 | 126 (79.25%) | 59 (74.68%) | 67 (83.75%) |
| M1 | 15 (9.43%) | 7 (8.86%) | 8 (10.00%) |
| NA | 18 (11.32%) | 13 (16.46%) | 5 (6.25%) |
| Neoplasm cancer status | |||
| Tumor free | 91 (57.23%) | 49 (62.03%) | 42 (52.50%) |
| With tumor | 58 (36.48%) | 26 (32.91%) | 32 (40.00%) |
| NA | 10 (6.29%) | 4 (5.06%) | 6 (7.5%) |
| Tumor central location | |||
| Distal | 111 (69.81%) | 54 (68.35%) | 57 (71.25%) |
| Mid | 41 (25.79%) | 22 (27.85%) | 19 (23.75%) |
| Proximal | 6 (3.77%) | 3 (3.80%) | 3 (3.75%) |
| NA | 1 (0.63%) | 0 (0.00%) | 1 (1.25%) |
| Neoplasm histologic grade | |||
| G1 | 16 (10.06%) | 6 (7.6%) | 10 (12.5%) |
| G2 | 65 (40.88%) | 33 (41.77%) | 32 (40.00%) |
| G3 | 43 (27.05%) | 23 (29.11%) | 20 (25.00%) |
| NA | 35 (22.01%) | 17 (21.52%) | 18 (22.50%) |
| Residual tumor | |||
| R0 | 119 (74.84%) | 60 (75.95%) | 59 (73.75%) |
| R1+R2 | 13 (8.18%) | 3 (3.80%) | 10 (12.50%) |
| NA | 27 (16.98%) | 16 (20.25%) | 11 (13.75%) |
| Lymph node metastasis | |||
| NO | 83 (52.20%) | 34 (43.04%) | 49 (61.25%) |
| Yes | 43 (27.04%) | 27 (34.18%) | 16 (20.00%) |
| NA | 33 (20.76%) | 18 (22.78%) | 15 (18.75%) |
Abbreviations: ESCA, esophageal cancer; NA, not available.
FIGURE 1Flow diagram of data and analyses in this work.
FIGURE 2Screening of genes related to DNA repair in ESCA. (A) Enrichment plots showing differential expression of DNA-repair-related genes in normal tissues (n = 11) and tumor tissues (n = 159) according to GSEA. (B) Protein–protein interaction network (n = 102). (C) Functional enrichment (GO and KEGG) analyses of DNA repair genes (n = 102). (D) Interaction of five genes by GeneMANIA. (E) Biological function analysis of the individual five genes in ESCA. Abbreviation: ESCA, esophageal carcinoma; GSEA, gene set enrichment analysis; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
The detailed information of selected five genes related to overall survival in patients with ESCA.
| Gene | Ensemble ID | B (cox) | HR |
|
|---|---|---|---|---|
| BCAP31 | ENSG00000185825.14 | 0.440 | 1.938 | 0.0046 |
| TAF9 | ENSG00000273841.3 | 0.397 | 1.683 | 0.0243 |
| UPF3B | ENSG00000125351.9 | 0.383 | 1.657 | 0.0048 |
| POM121 | ENSG00000196313.10 | −0.373 | 0.603 | 0.0338 |
| DGCR8 | ENSG00000128191.12 | −0.864 | 0.408 | 0.0070 |
Abbreviations: ESCA, esophageal cancer; HR, hazard ratio.
FIGURE 3Alterations and differential expression of the five genes. (A) Alterations of the five genes in different cancers. (B) Alterations of the five genes in ESCA patients. (C) Genomic alterations of the five genes in patients with ESCA subtypes. (D) Differential expression of the selected five genes in normal group (n = 11) and tumor group (n = 159). Two-sided log-rank and Wilcoxon p < 0.05 were considered significant. Abbreviation: ESCA, esophageal carcinoma; ESCC, esophageal squamous carcinoma; EAD, esophageal adenocarcinoma.
FIGURE 4Construction of prognostic risk score system and identification of five-gene prognostic model. Risk score distribution of five genes in three cohorts: entire TCGA group (n = 159), TCGA subgroup 1 (n = 79), and TCGA subgroup 2 (n = 80). Top (A)–(C) and middle (D)–(F) plots show patient survival time and status based on risk score system (G)–(I) Heatmap of expression of the five genes; color from blue to red illustrates a trend from low expression to high expression.
Univariable and multivariable Cox linear regression analysis for risk score and different clinical pathological parameters.
| Univariable analysis | Multivariable analysis | ||||||
|---|---|---|---|---|---|---|---|
| Clinical feature | Number | HR | 95%CI of HR |
| HR | 95%CI of HR |
|
| Risk score (low/high) | 79/80 | 3.819 | 2.161–6.748 | <0.0001 | 3.388 | 1.664–6.899 | 0.001 |
| Cancer stage (stage I-II/III-IV) | 87/68 | 3.182 | 1.774–5.710 | <0.0001 | 2.732 | 1.328–5.623 | 0.006 |
| Stage-M (M0/M1) | 126/15 | 4.92 | 2.243–10.794 | <0.0001 | 2.535 | 1.024–6.276 | 0.044 |
| Residual tumor (R0/R1+ R2) | 119/13 | 2.324 | 1.143–4.724 | 0.020 | 1.199 | 0.539–2.668 | 0.657 |
Abbreviations: HR, hazard ratio; CI, confidence interval.
FIGURE 5Validation of prognostic signature for patients with ESCA. K–M survival curves for prognostic model and time-dependent ROC curve for (A, B) entire TCGA group (n = 159), (C, D) TCGA subgroup 1 (n = 79), and (E, F) TCGA subgroup 2 (n = 80). Two-sided log-rank and Wilcoxon p < 0.05 were considered significant. Abbreviation: MST, median survival time.
FIGURE 6Stratified analysis for further data mining. Validation of the five-gene prognostic signature in patients with ESCA for (A) cancer stage, (B) residual tumor, (C) cancer status and (D) lymph node metastasis in entire TCGA dataset (n = 159). Two-sided log-rank and Wilcoxon p < 0.05 were considered significant.
FIGURE 7Analysis of potential drug sensitivity of five genes. (A) Genomics of drug sensitivity in cancer (GDSC). (B) Structure of potential targeted drugs including trametinib, selumetinib and refametinib.