| Literature DB >> 35495147 |
Min-Yi Lv1,2, Wei Wang3, Min-Er Zhong1,2, Du Cai1,2, Dejun Fan1,2,4, Cheng-Hang Li1,2, Wei-Bin Kou5, Ze-Ping Huang1,2, Xin Duan1,2, Chuling Hu1,2, Qiqi Zhu1,2, Xiaosheng He1,2, Feng Gao1,2.
Abstract
Background: Increasing evidence have depicted that DNA repair-related genes (DRGs) are associated with the prognosis of colorectal cancer (CRC) patients. Thus, the aim of this study was to evaluate the impact of DNA repair-related gene signature (DRGS) in predicting the prognosis of CRC patients. Method: In this study, we retrospectively analyzed the gene expression profiles from six CRC cohorts. A total of 1,768 CRC patients with complete prognostic information were divided into the training cohort (n = 566) and two validation cohorts (n = 624 and 578, respectively). The LASSO Cox model was applied to construct a prediction model. To further validate the clinical significance of the model, we also validated the model with Genomics of Drug Sensitivity in Cancer (GDSC) and an advanced clear cell renal cell carcinoma (ccRCC) immunotherapy data set.Entities:
Keywords: DNA repair–related genes; colorectal cancer; immunotherapy; microsatellite instability; prognostic
Year: 2022 PMID: 35495147 PMCID: PMC9048823 DOI: 10.3389/fgene.2022.872238
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Characteristics of cohorts included in this study.
| Characteristics | Training cohort GSE39582 | Validation-1 TCGA | Validation-2 (combination of GSE14333, GSE33113, GSE37892, and GSE39084) |
|---|---|---|---|
| Number of patients | 566 | 624 | 578 |
| Mean age | 66.85 | 66.27 | 66.37 |
| Gender | |||
| Male | 256 (45.23%) | 292 (46.79%) | 270 (46.71%) |
| Female | 310 (54.77%) | 332 (53.21%) | 308 (53.29%) |
| TNM stage | |||
| Stage I | 37 (6.54%) | 105 (16.83%) | 53 (9.17%) |
| Stage II | 264 (46.64%) | 230 (36.86%) | 280 (48.44%) |
| Stage III | 205 (6.22%) | 180 (28.85%) | 164 (28.37%) |
| Stage IV | 60 (10.60%) | 88 (14.10%) | 81 (14.01%) |
| NA | - | 21 (3.37%) | - |
| Tumor location | |||
| Left | 342 (60.42%) | 354 (56.73%) | 269 (46.54%) |
| Right | 224 (39.58%) | 270 (43.27%) | 216 (37.73%) |
| NA | - | - | 93 (16.09%) |
| RFS event | |||
| Yes | 177 (30.62%) | 100 (16.03%) | 130 (22.50%) |
| No | 380 (65.74%) | 416 (66.67%) | 382 (66.09%) |
| NA | 9 (1.56%) | 108 (17.31%) | 66 (11.42%) |
| MMR status | |||
| MSI | 75 (13.25%) | 189 (30.29%) | 44 (7.61%) |
| MSS | 444 (78.45%) | 431 (69.07%) | 114 (19.72%) |
| NA | 47 (8.30%) | 4 (0.64%) | 420 (72.66%) |
| CIMP status | |||
| Positive | 91 (16.07%) | NA | 39 (6.75%) |
| Negative | 405 (71.56%) | NA | 118 (20.42%) |
| NA | 70 (12.37%) | 624 (100%) | 421 (72.84%) |
| TP53 status | |||
| Wide-type | 161 (28.45%) | - | 39 (6.75%) |
| Mutation | 190 (33.57%) | - | 29 (5.02%) |
| NA | 215 (37.99%) | - | 510 (88.24%) |
| KRAS status | |||
| Wide-type | 328 (57.95%) | 34 (5.45%) | 110 (19.03%) |
| Mutation | 217 (38.34%) | 30 (4.81%) | 48 (8.30%) |
| NA | 21 (3.71%) | 560 (89.74%) | 420 (72.66%) |
| BRAF status | |||
| Wide-type | 461 (81.45%) | 32 (5.13%) | 133 (23.01%) |
| Mutation | 51 (9.01%) | 3 (0.48%) | 25 (4.33%) |
| NA | 54 (9.54%) | 589 (94.39%) | 420 (72.66%) |
FIGURE 1Schematic flow chart of the study.
FIGURE 2(A) Identification and selection of prognostic genes by LASSO Cox proportional hazards regression. (B) Establishment of 11 DNA repair–related genes signature from the LASSO COX regression. (C) Optimal cutoff point of the prognostic gene signature at 5-y OS endpoint from the ROC curve. (D) Heat map of the 11 DNA repair–related genes in two risk groups.
FIGURE 3(A,D,G) Distribution of the DRGS risk score and its correlation to recurrence in the training, validation-1, and validation-2 cohort. (B,E,H) Time-dependent ROC analysis of disease-free survival for CRC patients in the training, validation-1, and validation-2 cohorts at the time points of 2, 3, and 5 y. (C,F,I) Kaplan–Meier curves comparing survival of patients within the low- and high-risk groups in the training cohort, validation-1, and validation-2 cohorts. p-values were calculated using log-rank tests.
C-index for DRGS risk compared with Oncotype DX.
| Cohorts | DNA repair risk | Oncotype DX colon | ||
|---|---|---|---|---|
| C-index | 95% CI | C-index | 95% CI | |
| Training cohort | 0.78 | 0.69–0.86 | 0.60 | 0.52–0.68 |
| Validation-1 cohort | 0.65 | 0.51–0.79 | 0.51 | 0.37–0.65 |
| Validation-2 cohort | 0.66 | 0.55–0.76 | 0.62 | 0.53–0.70 |
Univariate and multivariate analyses of DRGS, and clinical and pathologic factors.
| Characteristic | Training cohort GSE39582 | Validation-1 TCGA | Validation-2 (GSE14333,GSE33113,GSE37892, and GSE39084) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Univariate | Multivariate | Univariate | Multivariate | Univariate | Multivariate | |||||||
| HR (95%CI) |
| HR (95%CI) |
| HR (95%CI) |
| HR (95%CI) |
| HR (95%CI) |
| HR (95%CI) |
| |
| DRGS | 2.40 (1.67–3.44) | 9.40E-07 | 1.80 (1.22–2.64) | 0.0028 | 2.20 (1.38–3.49) | <0.001 | 1.85 (1.13–3.02) | 0.015 | 2.12 (1.40–3.21) | <0.001 | 1.75 (1.15–2.65) | 0.0085 |
| Gender | 1.21 (0.89–1.64) | 0.21 | - | - | 1.25 (0.82–1.89) | 0.3 | - | - | 1.06 (0.74–1.50) | 0.77 | - | - |
| Age | 1.00 (0.99–1.01) | 1 | - | - | 0.99 (0.97–1.00) | 0.16 | - | - | 0.99 (0.98–1.00) | 0.16 | - | - |
| Tumor location | 1.27 (0.92–1.74) | 0.14 | - | - | 0.88 (0.58–1.33) | 0.55 | - | - | 1.19 (0.81–1.75) | 0.38 | - | - |
| MMR status | 2.76 (1.46–5.24) | 0.0012 | 1.94 (0.97–3.87) | 0.062 | 0.94 (0.61–1.45) | 0.77 | - | - | 1.74 (0.80–3.77) | 0.15 | - | - |
| CIMP status | 0.71 (0.44–1.15) | 0.16 | - | - | - | - | - | - | 0.82 (0.39–1.72) | 0.6 | - | - |
| CIN status | 1.09 (0.70–1.71) | 0.71 | - | - | - | - | - | - | - | - | - | - |
| TP53 mutation | 1.41 (0.99–2.03) | 0.059 | - | - | - | - | - | - | 1.59 (0.70–3.61) | 0.26 | - | - |
| KRAS mutation | 1.40 (1.03–1.91) | 0.031 | 1.16 (0.84–1.59) | 0.37 | 0.76 (0.31–1.83) | 0.54 | - | - | 1.25 (0.65–2.38) | 0.5 | - | - |
| BRAF mutation | 0.96 (0.54–1.69) | 0.88 | NA | - | 0.00 (0.00–Inf) | 0.46 | - | - | 1.61 (0.77–3.37) | 0.2 | - | - |
FIGURE 4(A) Gene ontology of the differentially expressed genes between the two risk groups. “GeneRatio” is the percentage of total differential genes in the given GO term. (B) GSEA showed several metastasis-related processes enriched in the high-risk group, including angiogenesis, KRAS signaling, epithelial mesenchymal transit (EMT), and myogenesis signal pathways.
FIGURE 5CRC cell lines in the GDSC database were divided into the high-risk and low-risk groups based on DNA repair–related signature and the differences in response to chemotherapies between the two groups were analyzed. (A) Relationship between the cell line of the high-risk and low-risk groups and IC50 of oxaliplatin. (B) Relationship between the cell line of the high-risk and low-risk groups and IC50 of fluorouracil. (C) Relationship between the cell line of the high-risk and low-risk groups and IC50 of irinotecan.
FIGURE 6Patients in the advanced clear cell renal cell carcinoma (ccRCC) database were divided into the high-risk and low-risk groups based on the DNA repair–related signature. (A) Kaplan–Meier curves comparing the survival of patients within the low- and high-risk groups in the ccRCC database. (B) OS curve of the objective response rate (ORR) of immunotherapy in ccRCC database.
FIGURE 7Tumor immune dysfunction and exclusion (TIDE) algorithm was validated in the training set GSE39582 (A,B,C,D) and the validation set TCGA (E,F,G,H).