| Literature DB >> 35300428 |
Wen-Yu Zhai1,2, Fang-Fang Duan3, Si Chen1,2, Jun-Ye Wang1, Ze-Rui Zhao1,2, Yi-Zhi Wang1,2, Bing-Yu Rao1,2, Yao-Bin Lin1,2, Hao Long1,2.
Abstract
Aging is an inevitable process characterized by a decline in many physiological activities, and has been known as a significant risk factor for many kinds of malignancies, but there are few studies about aging-related genes (ARGs) in lung squamous carcinoma (LUSC). We designed this study to explore the prognostic value of ARGs and establish an ARG-based prognosis signature for LUSC patients. RNA-sequencing and corresponding clinicopathological data of patients with LUSC were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). The ARG risk signature was developed on the basis of results of LASSO and multivariate Cox analysis in the TCGA training dataset (n = 492). Furthermore, the GSE73403 dataset (n = 69) validated the prognostic performance of this ARG signature. Immunohistochemistry (IHC) staining was used to verify the expression of the ARGs in the signature. A five ARG-based signature, including A2M, CHEK2, ELN, FOS, and PLAU, was constructed in the TCGA dataset, and stratified patients into low- and high-risk groups with significantly different overall survival (OS) rates. The ARG risk score remained to be considered as an independent indicator of OS in the multivariate Cox regression model for LUSC patients. Then, a prognostic nomogram incorporating the ARG risk score with T-, N-, and M-classification was established. It achieved a good discriminative ability with a C-index of 0.628 (95% confidence interval [CI]: 0.586-0.671) in the TCGA cohort and 0.648 (95% CI: 0.535-0.762) in the GSE73403 dataset. Calibration curves displayed excellent agreement between the actual observations and the nomogram-predicted survival. The IHC staining discovered that these five ARGs were overexpression in LUSC tissues. Besides, the immune infiltration analysis in the TCGA cohort represented a distinctly differentiated infiltration of anti-tumor immune cells between the low- and high-risk groups. We identified a novel ARG-related prognostic signature, which may serve as a potential biomarker for individualized survival predictions and personalized therapeutic recommendation of anti-tumor immunity for patients with LUSC.Entities:
Keywords: aging; anti-tumor immune cells infiltration; lung squamous carcinoma (LUSC); prognostic signature; risk stratification
Year: 2022 PMID: 35300428 PMCID: PMC8921527 DOI: 10.3389/fcell.2022.770550
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1Flowchart of data collection and analysis.
Patients’ characteristics.
| TCGA training cohort | GEO validation cohort | |
|---|---|---|
| Gender | ||
| Male | 364 (74.0) | 65 (94.2) |
| Female | 128 (26.0) | 4 (5.8) |
| Age (year) | 61.3 ± 9.6 | 58.3 ± 8.5 |
| ≤65 | 171 (34.8) | 44 (63.8) |
| >65 | 321 (65.2) | 25 (36.2) |
| Smoking history | ||
| Yes or ever or unknown | 474 (96.3) | 58 (84.1) |
| No | 18 (3.7) | 11 (15.9) |
| T stage | ||
| T1 | 114 (23.2) | 4 (5.8) |
| T2 | 286 (58.1) | 42 (60.9) |
| T3 | 69 (14.0) | 20 (29.0) |
| T4 | 23 (4.7) | 3 (4.3) |
| N stage | ||
| N0 | 320 (65.0) | 35 (50.7) |
| N1 | 127 (25.8) | 17 (24.6) |
| N2 | 40 (8.1) | 17 (24.6) |
| N3 | 5 (1.0) | 0 (0) |
| M stage | ||
| M0 | 486 (98.8) | 69 (100) |
| M1 | 6 (1.2) | 0 (0) |
FIGURE 2Identification of a prognosis-related ARG-based signature in the TCGA training cohort. (A) Selection of the optimal candidate genes in the LASSO model. (B) LASSO coefficients of prognosis-associated ARGs, each curve represents a gene. (C) Forest plots showing results of univariate Cox regression analysis between the candidate ARG expression and overall survival.
FIGURE 3Assessment of prognostic value of the ARG signature model in the TCGA training cohort. (A) Determination of cut-off value of ARG risk scores by the maximally selected log-rank statistics. (B) The distribution of risk scores in the TCGA cohort. (C) Patient distribution in the high- and low-risk groups according to overall survival status. (D) The heatmap showing expression profiles of the five ARGs. (E) Kaplan-Meier curves for the overall survival of patients in the high- and low-risk groups. (F) Multivariate Cox regression analysis of the ARG signature and other clinicopathological factors.
FIGURE 4Assessment of prognostic value of the ARG signature model in the GSE20685 validation cohort. (A) The distribution of risk scores in the GSE20685 cohort. (B) Patient distribution in the high- and low-risk groups according to overall survival status. (C) The heatmap showing expression profiles of the five ARGs. (D) Kaplan-Meier curves for the overall survival of patients in the high- and low-risk groups.
Multivariate Analysis of GEO validation.
| Factors | Multivariate analysis | |
|---|---|---|
| HR (95%CI) |
| |
| Gender | ||
| Female | References | |
| Male | 1.68 (0.25–11.36) | 0.59 |
| Age (year) | ||
| ≤65 | References | |
| >65 | 1.01 (0.96–1.05) | 0.81 |
| Smoking history | 1.77 (0.44–7.08) | 0.42 |
| T stage | 1.86 (1.02–3.41) | 0.04 |
| N stage | 1.50 (0.92–2.44) | 0.10 |
| M stage | — | — |
| Risk score | 9.51e+18 (752.28–1.20e+35) | 0.02 |
FIGURE 5Gene set enrichment analysis between the low- and high-risk subgroups. (A) Enriched GO terms between high- and low-risk groups. (B) Enriched KEGG pathways between high- and low-risk groups.
FIGURE 6The landscape of immune cell infiltration between the high- and low-risk groups in the TCGA training cohort. (A) Heatmap of the 28 tumor-infiltrating cell proportions in ssGSEA. (B) Barplot of 22 immune cell infiltrations in CIBERSORT. (C) Correlation matrix of the association between the expression level of the five ARGs and tumor-infiltrating immune cell infiltrations. (D) Violin plot showing differences of infiltrating immune cell types between the low- and the high-risk groups. (E) Expression of the immune score between the low- and the high-risk groups. (F) Expression of the stromal score between the low- and the high-risk groups.
FIGURE 7(A) Expression of PD-1 between the low- and the high-risk groups. (B) Expression of PD-L1 between the low- and the high-risk groups. (C) Expression of PD-L2 between the low- and the high-risk groups. (D) Expression of CTL4 between the low- and the high-risk groups. (E) TMB between the low- and the high-risk groups.
FIGURE 8Development of a nomogram based on the ARG signature for predicting overall survival of patients with LUSC. (A) The nomogram plot integrating ARG risk score and T-, N-, and M-classification in the TCGA training cohort. (B) The calibration plot for the probability of 1-, 3-, and 5 years OS in the TCGA training cohort; 1-year: red; 3 years: blue; 5 years: black. (C) The calibration plot for the probability of 1-, 3-, and 5 years OS in the GSE73403 validation cohort; 1 year: red; 3 years: blue; 5 years: black. (D) Time-dependent ROC curves comparing the prognostic accuracy of the risk score combining ARGs and the TNM system in the training cohort; risk score + TNM: red; TNM only: black. (E) Time-dependent ROC curves comparing the prognostic accuracy of the risk score combining ARGs and the TNM system in the validation cohort; risk score + TNM: red; TNM only: black.
FIGURE 9The representative images of IHC staining of five ARGs from SYSUCC. (A) A2M; (B) CHEK2; (C) FOS; (D) PLAU; and (E) ELN. The representative images of IHC staining of four ARGs from Human Protein Atlas. (F) A2M; (G) CHEK2; (H) FOS; and (I) PLAU. (J) The IHC score of 5 ARGs.