| Literature DB >> 35664768 |
Jinyang Liu1,2, Yu Lan1,2, Geng Tian1,2, Jialiang Yang1,2,3.
Abstract
As one of the most common cancers of the digestive system, colon cancer is a predominant cause of cancer-related deaths worldwide. To investigate prognostic genes in the tumor microenvironment of colon cancer, we collected 461 colon adenocarcinoma (COAD) and 172 rectal adenocarcinoma (READ) samples from The Cancer Genome Atlas (TCGA) database, and calculated the stromal and immune scores of each sample. We demonstrated that stromal and immune scores were significantly associated with colon cancer stages. By analyzing differentially expressed genes (DEGs) between two stromal and immune score groups, we identified 952 common DEGs. The significantly enriched Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) terms for these DEGs were associated with T-cell activation, immune receptor activity, and cytokine-cytokine receptor interaction. Through univariate Cox regression analysis, we identified 22 prognostic genes. Furthermore, nine key prognostic genes, namely, HOXC8, SRPX, CCL22, CD72, IGLON5, SERPING1, PCOLCE2, FABP4, and ARL4C, were identified using the LASSO Cox regression analysis. The risk score of each sample was calculated using the gene expression of the nine genes. Patients with high-risk scores had a poorer prognosis than those with low-risk scores. The prognostic model established with the nine-gene signature was able to effectively predict the outcome of colon cancer patients. Our findings may help in the clinical decisions and improve the prognosis for colon cancer.Entities:
Keywords: DEGs; colon cancer; prognostic genes; prognostic model; risk score; tumor microenvironment
Year: 2022 PMID: 35664768 PMCID: PMC9161737 DOI: 10.3389/fonc.2022.899156
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Clinical characterizations of patients.
| Clinicopathologic variables | Category | Count (%) ( |
|---|---|---|
| Female | 294 (46.4) | |
| Male | 335 (52.9) | |
| Unknown | 4 (0.6) | |
| ≤65 | 253 (39.9) | |
| >65 | 376 (59.3) | |
| Unknown | 4 (0.6) | |
| Stage I | 109 (17.2) | |
| Stage II | 229 (36.1) | |
| Stage III | 181 (28.5) | |
| Stage IV | 90 (14.2) | |
| Unknown | 24 (3.7) | |
| T1 | 20 (3.1) | |
| T2 | 109 (17.2) | |
| T3 | 428 (67.6) | |
| T4 | 70 (11.0) | |
| Unknown | 6 (0.9) | |
| N0 | 357 (56.3) | |
| N1 | 151 (23.8) | |
| N2 | 118 (18.6) | |
| Unknown | 7 (1.1) | |
| M0 | 467 (73.7) | |
| M1 | 89 (14.0) | |
| Unknown | 77 (12.1) | |
| Alive | 499 (78.8) | |
| Death | 130 (20.5) | |
| Unknown | 4(0.6) |
Figure 1Stromal and immune scores were markedly related to colon cancer stages. (A–D) Distribution of immune scores in nonmetastatic (M0) patients and distant metastases (M1) patients (A), N0-2 patients (B), stage I–IV patients (C), T1–4 patients (D). (E, F) Distribution of stromal scores in N0-2 patients (E) and T1-4 patients (F).
Figure 2Analysis of DEGs-based stromal and immune scores. The heatmap of the top 40 DEGs based on stromal scores (A) and immune scores (B). Venn diagrams displaying the number of upregulated DEGs (C) and downregulated DEGs (D) detected in both groups. Top 30 enriched (p < 0.05) GO terms (E) and KEGG terms (F).
Figure 3Identification of key prognostic genes within intersection DEGs. (A) Forest plot of risk genes: Red represented high-risk genes (hazard ratios, HR > 1); green represented low-risk genes (HR < 1). (B) Constructing the LASSO coefficient prediction model. (C) Selecting variables in LASSO regression with minimum criteria by 1,000 times cross-validation. (D) Overall survival between high- and low-risk score groups. (E) ROC curves for predicting 3- and 5-year overall survival probability with the nine-gene score.
Nine key prognostic genes.
| Gene | Coef |
|---|---|
| CCL22 | −0.103078 |
| FABP4 | 0.000222 |
| ARL4C | 0.000224 |
| SERPING1 | 0.000982 |
| SRPX | 0.002037 |
| PCOLCE2 | 0.077076 |
| HOXC8 | 0.108125 |
| CD72 | 0.209473 |
| IGLON5 | 0.211910 |
Figure 4The composition of 22 immune cells in colon cancer tumors from the TCGA dataset. (A) The relative abundances of 22 immune cells in the tumor tissue of colon cancer patients. (B) The correlation matrix between different cell types; the size of the circle represented the degree of correlation. (C) Fractions of infiltrating immune cells in high versus low risk score groups.
Figure 5External validation of key prognostic genes using the GEO database. (A) Forest plot of risk genes: Red represents high-risk genes (hazard ratios, HR > 1); green represents low-risk genes (HR < 1). (B, C) Overall survival between high and low CCL22 (B) and ARL4C (C) expression groups.