| Literature DB >> 35046992 |
Yiyuan Han1, Xiaolin Cao1,2, Xuemei Wang1, Qing He1.
Abstract
Head and neck squamous cell carcinoma (HNSCC) is one of the most common cancer worldwide and seriously threats public health safety. Despite the improvement of diagnostic and treatment methods, the overall survival for advanced patients has not improved yet. This study aimed to sort out prognosis-related molecular biomarkers for HNSCC and establish a prognostic model to stratify the risk hazards and predicate the prognosis for these patients, providing a theoretical basis for the formulation of individual treatment plans. We firstly identified differentially expressed genes (DEGs) between HNSCC tissues and normal tissues via joint analysis based on GEO databases. Then a total of 11 hub genes were selected for single-gene prognostic analysis to identify the prognostic genes. Later, the clinical information and transcription information of HNSCC were downloaded from the TCGA database. With the application of least absolute shrinkage and selection operator (LASSO) algorithm analyses for the prognostic genes on the TCGA cohort, a prognostic model consisting of three genes (COL4A1, PLAU and ITGA5) was successfully established and the survival analyses showed that the prognostic model possessed a robust performance in the overall survival prediction. Afterward, the univariate and multivariate regression analysis indicated that the prognostic model could be an independent prognostic factor. Finally, the predicative efficiency of this model was well confirmed in an independent external HNSCC cohort.Entities:
Keywords: cox regression; head and neck squamous cell carcinoma; lasso algorithm; prognostic model; survival analysis
Year: 2022 PMID: 35046992 PMCID: PMC8762258 DOI: 10.3389/fgene.2021.721199
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The clinical information of the HNSCC patients in TCGA datasets and GSE65858.
| Clinicopathological features | TCGA datasets (n = 395) | GSE65858 (n = 270) |
|---|---|---|
| State of survival | ||
| Dead | 144 | 94 |
| Alive | 251 | 176 |
| Age | ||
| >65 | 133 | 184 |
| ≤65 | 262 | 86 |
| Gender | ||
| Male | 289 | 223 |
| Female | 106 | 47 |
| Stage | ||
| 1–2 | 75 | 55 |
| 3–4 | 320 | 215 |
| T | ||
| 1–2 | 140 | 115 |
| 3–4 | 255 | 155 |
| N | ||
| 0–1 | 168 | 126 |
| 2–3 | 227 | 144 |
| M | ||
| 0 | 386 | 263 |
| 1 | 9 | 7 |
FIGURE 1(A): The differential expression results of 3 GEO datasets. (B): Venn diagram of DEGs common to 3 GEO datasets.
FIGURE 2GO annotation and KEGG pathway enrichment analysis of upregulated DEGs (A) and downregulated DEGs (B).
FIGURE 3(A): The PPI network of DEGs. Upregulated DEGs are marked in red; downregulated DEGs are marked in blue. (B): The survival analyses of hub genes. p < 0.05 was considered statistically significant.
FIGURE 4(A): Differential expression of prognostic genes in TCGA databases.“***” represents “p < 0.05”; “**” represents “p < 0.01”; “***” represents “p < 0.001”. (B): The Pearson correlation analysis results of 6 prognostic genes.
FIGURE 5(A): LASSO coefficient profiles of the 7 prognosis-related genes. A coefficient profile plot was generated against the log (λ) sequence; (B): Selection of the optimal parameter (λ) in the LASSO model. The results of LASSO algorithm and cross-validation; (C): The Kaplan-Meier survival curve of HNSCC patients in the TCGA cohort; (D): The ROC curve and AUC value of the prognostic model; (E,F): The results of the univariate and multivariate Cox regression analyses.
FIGURE 6(A,B): The Kaplan-Meier survival curve and ROC curve of HNSCC patients in the external dataset; (C,D): The results of the univariate and multivariate Cox regression analyses in the external dataset.