| Literature DB >> 35685866 |
Yong Yang1,2, Jieqing Yu3, Yuanping Xiong3, Jiansheng Xiao4, Daofeng Dai3, Feng Zhang5.
Abstract
Bladder cancer (BCa) is the tenth most common tumor in humans. DNA damage repair genes (DDRGs) play important roles in many malignant tumors; thus, their functions in BCa should also be explored. We performed a comprehensive analysis of the expression profiles of DDRGs in 410 BCa tumors and 19 normal tissues from The Cancer Genome Atlas database. We identified 123 DDRGs differentially expressed between BCa tumors and normal tissues, including 95 upregulated and 28 downregulated genes. We detected 22 DDRGs associated with overall survival (OS) of patients with BCa by performing univariate Cox regression analysis. To explore the interactions between OS-associated DDRGs, we constructed a PPI network, which showed that the top six DDRGs (CDCA2, FOXM1, PBK, RRM2, ORC1, and HDAC4) with the highest scores in the PPI network might play significant roles in OS of BCa. Moreover, to investigate the latent regulatory mechanism of these OS-associated DDRGs, we analyzed the transcription factors (TFs)-DDRGs regulatory network. The core seven TFs (NCAPG, DNMT1, LMNB1, BRCA1, E2H2, CENPA, and E2F7) were shown to be critical regulators of the OS-related DDRGs. The 22 DDRGs were incorporated into a stepwise multivariable Cox analysis. Then, we built the index of risk score based on the expression of 8 DDRGs (CAD, HDAC10, JDP2, LDLR, PDGFRA, POLA2, SREBF1, and STAT1). The p-value < 0.0001 in the Kaplan-Meier survival plot and an area under the ROC curve (AUC) of 0.771 in TCGA-BLCA training dataset suggested the high specificity and sensitivity of the prognostic index. Furthermore, we validated the risk score in the internal TCGA-BLCA and an independent GSE32894 dataset, with AUC of 0.743 and 0.827, respectively. More importantly, the multivariate Cox regression and stratification analysis demonstrated that the predictor was independent of various clinical parameters, including age, tumor stage, grade, and number of positive tumor lymph nodes. In summary, a panel of 8 DNA damage repair genes associated with overall survival in bladder cancer may be a useful prognostic tool.Entities:
Keywords: DNA damage repair genes; biomarkers; bladder cancer; differentially expressed genes; prognosis
Mesh:
Substances:
Year: 2022 PMID: 35685866 PMCID: PMC9172279 DOI: 10.3389/pore.2022.1610267
Source DB: PubMed Journal: Pathol Oncol Res ISSN: 1219-4956 Impact factor: 2.874
The clinicopathological features of BCa patients in TCGA-BLCA dataset and GSE32894 cohort.
| Variables | TCGA-BLCA dataset ( | GSE32894 cohort ( |
|
|---|---|---|---|
| Age | 4.00E-02 | ||
| 61–89 years | 244 (62.24%) | 176 (79.64%) | |
| 34–60 years | 97 (24.75%) | 45 (20.36%) | |
| Unknown | 51 (13.01%) | — | |
| Diagnosis subtype | — | ||
| Papillary | 126 (32.14%) | — | |
| Non-papillary | 261 (66.58%) | — | |
| Unknown | 5 (1.28%) | — | |
| Gender | 8.34E-01 | ||
| Female | 102 (26.02%) | 60 (27.15%) | |
| Male | 290 (73.98%) | 161 (72.85%) | |
| Lymph node examined count | — | ||
| <=12 | 78 (19.90%) | — | |
| >12 | 178 (45.41%) | — | |
| Unknown | 136 (34.69%) | — | |
| Lymph nodes | — | ||
| Negative | 149 (38.01%) | — | |
| Positive | 114 (29.08%) | — | |
| Unknown | 129 (32.91%) | — | |
| Race | — | ||
| White | 315 (80.36%) | — | |
| Black or African American or Asian | 61 (15.56%) | — | |
| Unknown | 16 (4.08%) | — | |
| Tobacco smoking history | — | ||
| No | 103 (26.28%) | — | |
| Yes | 276 (70.41%) | — | |
| Unknown | 13 (3.31%) | — | |
| Histologic grade | |||
| Low | 18 (4.59%) | 130 (58.82%) | 2.20E-16 |
| High | 371 (94.64%) | 91 (41.18%) | |
| Unknow | 3 (0.77%) | — | |
| Chemotherapy | |||
| No | 224 (57.14%) | — | |
| Yes | 139 (35.46%) | — | |
| not reported | 29 (7.40%) | — | |
| T stage | 2.71E-04 | ||
| I_II | 125 (31.89%) | 104 (47.06%) | |
| III_IV | 267 (68.11%) | 117 (52.94%) | |
| N stage | — | ||
| N0 | 226 | — | |
| N1-3 | 130 | — | |
| Unknown | 36 | — | |
| M stage | — | ||
| M0 | 187 (47.70%) | — | |
| M1 | 10 (2.55%) | — | |
| Unknown | 195 (49.75) | — |
FIGURE 1Differentially expressed DNA damage repair genes (DDRGs). Differentially expressed DDRGs are shown in the heat map (A) and volcano map (B). The red dot indicates the highly expressed genes, the green dot indicates the low expressed genes, and the black dot indicates the genes without differentially expressed genes.
The prognostic value of differentially expressed DDRGs by univariate Cox regression analysis.
| Gene | HR | 95% CI |
|
|---|---|---|---|
| ATXN1 | 1.183 | 1.032–1.356 | 0.016 |
| CAD | 1.055 | 1.026–1.085 | 0.000 |
| CDCA2 | 1.075 | 1.009–1.145 | 0.024 |
| CDK5R1 | 1.064 | 1.007–1.124 | 0.026 |
| FOXM1 | 1.020 | 1.004–1.036 | 0.011 |
| HDAC10 | 0.847 | 0.764–0.939 | 0.002 |
| HDAC4 | 1.366 | 1.103–1.691 | 0.004 |
| ISG15 | 0.999 | 0.998–1.000 | 0.047 |
| JDP2 | 1.059 | 1.013–1.106 | 0.012 |
| LATS2 | 1.092 | 1.019–1.170 | 0.013 |
| LDLR | 1.021 | 1.007–1.035 | 0.003 |
| MT1A | 1.015 | 1.007–1.023 | 0.000 |
| NEIL3 | 1.141 | 1.016–1.282 | 0.026 |
| ORC1 | 1.071 | 1.004–1.141 | 0.036 |
| PBK | 1.022 | 1.002–1.041 | 0.027 |
| PDGFRA | 1.045 | 1.018–1.073 | 0.001 |
| POLA2 | 1.066 | 1.005–1.130 | 0.033 |
| RRM2 | 1.006 | 1.000–1.012 | 0.037 |
| SREBF1 | 1.007 | 1.000–1.013 | 0.046 |
| STAT1 | 0.996 | 0.992–0.999 | 0.015 |
| TACC1 | 1.013 | 1.001–1.024 | 0.026 |
| THBS1 | 1.004 | 1.001–1.007 | 0.009 |
FIGURE 2GO terms and pathways analysis of the differentially expressed overall survival (OS)-related DDRGs. (A) Gene ontology analysis: biological processes. (B) Gene ontology analysis: molecular functions. (C) Gene ontology analysis: cellular components. (D) The significant enriched KEGG pathways.
FIGURE 3Transcription factors-mediated regulatory network. (A) Volcano plot of differentially expressed Transcription Factors (TFs)between BCa and non-tumors tissues. (B) The transcription regulatory network according to the clinically relevant DDRGs and differentially expressed TFs. The circle in a node reflects clinically relevant DDRGs and triangle represented as differentially expressed TFs. The shades of color reflect the correlation.
Multivariate cox analysis to develop a prognostic index based on these differentially expressed DNA damage repair genes.
| Gene | coef | HR | 95% CI |
|
|---|---|---|---|---|
| CAD | 0.042 | 1.043 | 0.989–1.100 | 0.117 |
| HDAC10 | −0.146 | 0.864 | 0.759–0.984 | 0.028 |
| JDP2 | 0.069 | 1.071 | 1.014–1.131 | 0.014 |
| LDLR | 0.022 | 1.022 | 1.000–1.045 | 0.049 |
| PDGFRA | 0.070 | 1.073 | 1.003–1.147 | 0.039 |
| POLA2 | 0.108 | 1.114 | 1.015–1.221 | 0.022 |
| SREBF1 | 0.008 | 1.008 | 1.000–1.017 | 0.057 |
| STAT1 | −0.009 | 0.991 | 0.985–0.996 | 0.001 |
FIGURE 4The prognosis model in TCGA-BLCA training dataset, TCGA-BLCA internal validation dataset and GSE32894 independent validation cohort. Kaplan-Meier survival curves between high-risk group and low-risk group (left) and the receiver operating characteristic (ROC) curve of the risk scores (right) in TCGA-BLCA training dataset (A), TCGA-BLCA internal validation dataset (B), and GSE32894 validation cohort (C).
The Cox regression analysis of clinical characteristics and the prognostic signature in TCGA-BLCA cohort.
| Variables | Untivariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI |
| HR | 95% CI |
| |
| risk-group (high vs. low) | 2.67 | 1.90–3.74 | 1.17E-08 | 2.39 | 1.25–4.57 | 8.24E-03 |
| age (>60 vs. <=60) | 1.98 | 1.26–3.11 | 2.89E-03 | 2.11 | 0.91–4.92 | 8.30E-02 |
| diagnosis subtype (non-papillary vs. papillary) | 1.85 | 1.22–2.79 | 3.54E-03 | 1.05 | 0.45–2.46 | 9.15E-01 |
| gender (male vs. female) | 0.85 | 0.6–1.21 | 3.79E-01 | — | — | — |
| race (not white vs. white) | 0.86 | 0.52–1.43 | 5.58E-01 | — | — | — |
| tobacco smoking history (yes vs. no) | 1.43 | 0.97–2.1 | 7.19E-02 | — | — | — |
| lymph node examined count (>12 vs. <=12) | 0.63 | 0.42–0.95 | 2.61E-02 | 0.37 | 0.18–0.75 | 6.08E-03 |
| lymph nodes (positive vs. negative) | 2.16 | 1.48–3.15 | 6.49E-05 | 1.73 | 0.74–4.05 | 2.05E-01 |
| stage T (III_IV vs. I_II) | 2.67 | 1.74–4.1 | 7.53E-06 | 2.06 | 0.69–6.16 | 1.96E-01 |
| stage N (N1-3 vs. N0) | 2.42 | 1.72–3.4 | 3.39119E-07 | 1.40 | 0.57–3.42 | 4.59E-01 |
| stage M (M1 vs. M0) | 2.67 | 1.15–6.22 | 2.27E-02 | 0.64 | 0.07–6.1 | 6.96E-01 |
FIGURE 5The stratification analysis of prognostic signature in BCa patients with different clinical parameters. (A) The prognostic utility of the signature in BCa patients with different age groups. (B) The prognostic utility of the signature in BCa patients with number of lymph nodes. (C) The prognostic utility of the signature in BCa patients with number of positive lymph nodes. (D) The prognostic utility of the signature in BCa patients with tumor mutational burden.
Difference analysis of gene expression between non papillary and papillary in the TCGA database.
| Variables | Total ( | Group | Statistics |
| |
|---|---|---|---|---|---|
| Non-Papillary ( | Papillary ( | ||||
| CAD, M(Q1,Q3) | 8.74 (6.28, 12.31) | 9.38 (6.93, 12.86) | 7.31 (5.63, 11.35) | Z = -3.408 |
|
| HDAC10, M(Q1,Q3) | 2.54 (1.81, 3.96) | 2.29 (1.72, 3.57) | 3.28 (1.99, 4.87) | Z = 4.043 |
|
| JDP2, M(Q1,Q3) | 3.09 (1.77, 5.03) | 3.45 (2.10, 5.39) | 2.30 (1.18, 4.01) | Z = -4.920 |
|
| LDLR, M(Q1,Q3) | 7.04 (3.32, 12.75) | 7.43 (3.59, 13.33) | 5.80 (2.83, 11.72) | Z = -2.211 |
|
| PDGFRA, M(Q1,Q3) | 1.66 (0.75, 3.45) | 1.78 (0.90, 3.71) | 1.30 (0.51, 3.16) | Z = -2.413 |
|
| POLA2, M(Q1,Q3) | 4.95 (3.65, 6.67) | 5.10 (3.90, 6.68) | 4.59 (3.23, 6.53) | Z = -2.387 |
|
| SREBF1, M(Q1,Q3) | 18.51 (12.17, 32.91) | 17.43 (11.57, 32.26) | 20.60 (12.98, 35.93) | Z = 1.442 | 0.149 |
| STAT1, M(Q1,Q3) | 33.14 (17.59, 70.31) | 39.10 (19.95, 80.48) | 22.07 (13.26, 49.97) | Z = -4.707 |
|
| riskScore, M(Q1,Q3) | 0.95 (0.59, 1.51) | 1.02 (0.66, 1.63) | 0.74 (0.49, 1.31) | Z = -3.604 |
|
| risk, n (%) | χ2 = 11.806 |
| |||
| High | 193 (49.87) | 146 (55.94) | 47 (37.30) | ||
| Low | 194 (50.13) | 115 (44.06) | 79 (62.70) | ||
Difference analysis of gene expression among the molecular subtypes in the GEO database.
| Variables | Total ( | Group | Statistics |
| ||||
|---|---|---|---|---|---|---|---|---|
| SCC-like ( | genomically unstable ( | infiltrated ( | Urobasal A ( | Urobasal B ( | ||||
| CAD, M(Q1,Q3) | −0.04 (−0.27, 0.23) | 0.40 (−0.24, 0.71) | 0.21 (−0.12, 0.44) | −0.02 (−0.27, 0.23) | −0.13 (−0.32, 0.09) | −0.14 (−0.30, 0.04) | χ2 = 26.152 |
|
| HDAC10, M(Q1,Q3) | 0.00 (−0.15, 0.16) | −0.09 (−0.21, −0.04) | −0.11 (−0.21, 0.05) | −0.05 (−0.20, 0.06) | 0.10 (−0.04, 0.28) | −0.06 (−0.19, 0.08) | χ2 = 35.819 |
|
| JDP2, M(Q1,Q3) | −0.05 (−0.19, 0.12) | 0.01 (−0.12, 0.13) | −0.05 (−0.18, 0.12) | −0.03 (−0.16, 0.12) | −0.06 (−0.20, 0.14) | −0.01 (−0.07, 0.09) | χ2 = 1.636 | 0.802 |
| LDLR, M(Q1,Q3) | 0.05 (−1.30, 0.76) | 0.58 (0.26, 1.33) | −0.02 (−0.93, 0.76) | 0.44 (−0.58, 1.13) | −0.47 (−1.78, 0.37) | 0.81 (0.39, 1.45) | χ2 = 24.410 |
|
| PDGFRA, M(Q1,Q3) | −0.24 (−0.83, 0.52) | 0.22 (0.11, 0.80) | −0.30 (−0.96, 0.31) | 1.44 (0.59, 2.12) | −0.60 (−0.99, 0.14) | 0.02 (−0.59, 0.56) | χ2 = 58.297 |
|
| POLA2, M(Q1,Q3) | −0.17 (−0.45, 0.37) | 0.46 (0.04, 0.80) | 0.52 (0.16, 0.73) | −0.22 (−0.39, 0.14) | −0.40 (−0.57, −0.17) | 0.09 (−0.17, 0.53) | χ2 = 93.192 |
|
| SREBF1, M(Q1,Q3) | 0.05 (−0.48, 0.58) | 0.09 (−0.33, 0.66) | 0.14 (−0.32, 0.90) | −0.60 (−0.81, −0.11) | 0.05 (−0.32, 0.56) | 0.55 (−0.19, 0.66) | χ2 = 19.973 |
|
| STAT1, M(Q1,Q3) | −0.21 (−0.87, 0.66) | 1.31 (0.66, 2.11) | −0.19 (−0.88, 0.76) | 0.36 (−0.15, 1.44) | −0.59 (−1.04, −0.07) | 0.85 (0.28, 1.43) | χ2 = 57.366 |
|
| risk_score, M(Q1,Q3) | −0.02 (−0.11, 0.05) | 0.13 (0.02, 0.16) | 0.04 (−0.03,0.08) | 0.06 (0.03, 0.14) | −0.10 (−0.15, −0.03) | 0.03 (−0.02, 0.08) | χ2 = 109.387 |
|
| risk_group, n (%) | χ2 = 80.861 |
| ||||||
| High | 110 (49.77) | 10 (90.91) | 37 (67.27) | 30 (96.77) | 23 (20.91) | 10 (71.43) | ||
| Low | 111 (50.23) | 1 (9.09) | 18 (32.73) | 1 (3.23) | 87 (79.09) | 4 (28.57) | ||