| Literature DB >> 34257533 |
Wei Xin1,2, Chaoran Zhao1,2, Longyang Jiang1,2, Dongmei Pei3, Lin Zhao1,2, Chengpu Zhang3.
Abstract
Head and neck squamous cell cancer (HNSCC) is one of the most common types of cancer worldwide. There have been many reports suggesting that biomarkers explored via database mining plays a critical role in predicting HNSCC prognosis. However, a single biomarker for prognostic analysis is not adequate. Additionally, there is growing evidence indicating that gene signature could be a better choice for HNSCC prognosis. We performed a comprehensive analysis of mRNA expression profiles using clinical information of HNSCC patients from The Cancer Genome Atlas (TCGA). Gene Set Enrichment Analysis (GSEA) was performed, and we found that a set of genes involved in epithelial mesenchymal transition (EMT) contributed to HNSCC. Cox proportional regression model was used to identify a four-gene (WIPF1, PPIB, BASP1, PLOD2) signature that were significantly associated with overall survival (OS), and all the four genes were significantly upregulated in tumor tissues. We successfully classified the patients with HNSCC into high-risk and low-risk groups, where in high-risk indicated poorer patient prognosis, indicating that this gene signature might be a novel potential biomarker for the prognosis of HNSCC. The prognostic ability of the gene signature was further validated in an independent cohort from the Gene Expression Omnibus (GEO) database. In conclusion, we identified a four-EMT-based gene signature which provides the potentiality to serve as novel independent biomarkers for predicting survival in HNSCC patients, as well as a new possibility for individualized treatment of HNSCC.Entities:
Keywords: HNSCC; Prognostic; epithelial-mesenchymal transition; mRNAs; survival
Mesh:
Substances:
Year: 2021 PMID: 34257533 PMCID: PMC8262154 DOI: 10.3389/pore.2021.585192
Source DB: PubMed Journal: Pathol Oncol Res ISSN: 1219-4956 Impact factor: 3.201
Summary information of clinical characteristics of HNSCC patients in entire TCGA set (n = 494), TCGA validation set 1 (n = 346), TCGA validation set 2 (n = 148).
| Clinical feature | TCGA-HNSCC entire cohort (n = 494), n (%) | Patients in validation set 1 (n = 346), n (%) | Patients in validation set 2 (n = 148), n (%) |
|---|---|---|---|
| Gender | |||
| Male | 363 (73.48) | 255 (73.70) | 108 (72.97) |
| Female | 131 (26.52) | 91 (26.30) | 40 (27.03) |
| Age | |||
| ≥61 | 253 (51.21) | 182 (52.60) | 71 (47.97) |
| <61 | 241 (48.79) | 164 (47.40) | 77 (52.03) |
| Clinical T | |||
| T1-T2 | 173 (29.85) | 126 (37.39) | 47 (33.10) |
| T3-T4 | 306 (63.88) | 211 (62.61) | 95 (66.90) |
| Clinical N | |||
| N0 | 235 (49.68) | 174 (52.57) | 61 (42.96) |
| N1- N3 | 238 (50.32) | 157 (47.43) | 81 (57.04) |
| Clinical stage | |||
| I-II | 111 (23.13) | 80 (23.67) | 31 (21.83) |
| III-IV | 369 (76.88) | 258 (76.33) | 111 (78.17) |
| Grade | |||
| I-II | 356 (74.95) | 252 (76.12) | 104 (72.22) |
| III-IV | 119 (25.05) | 79 (23.87) | 40 (27.78) |
| HPV P16 status | |||
| Negative | 71 (70.30) | 46 (70.77) | 25 (69.44) |
| Positive | 30 (29.70) | 19 (29.23) | 11 (30.56) |
| Person neoplasm cancer status | |||
| Tumor free | 311 (69.73) | 212 (68.61) | 99 (72.26) |
| With tumor | 135 (30.27) | 97 (31.39) | 38 (27.74) |
| New tumor event after initial treatment | |||
| Yes | 46 (24.86) | 36 (26.67) | 10 (20.00) |
| No | 139 (75.14) | 99 (73.33) | 40 (80.00) |
| Alcohol history | |||
| Yes | 327 (67.70) | 229 (67.95) | 98 (67.12) |
| No | 156 (32.30) | 108 (32.05) | 48 (32.88) |
| Tobacco smoking history | |||
| I-II | 277 (57.23) | 186 (55.03) | 91 (62.33) |
| III-IV | 207 (42.77) | 152 (44.97) | 55 (37.67) |
| Lymph node neck dissection | |||
| Yes | 402 (81.71) | 280 (81.40) | 122 (82.43) |
| No | 90 (18.29) | 64 (18.60) | 26 (17.57) |
| Lymph vascular invasion | |||
| Yes | 119 (35.63) | 81 (33.61) | 38 (40.86) |
| No | 215 (64.37) | 160 (66.39) | 55 (59.14) |
| Perineural invasion | |||
| Yes | 162 (46.82) | 116 (47.54) | 46 (45.10) |
| No | 184 (53.18) | 128 (52.46) | 56 (54.90) |
| Radiation therapy | |||
| Yes | 118 (67.05) | 84 (65.63) | 34 (70.83) |
| No | 58 (32.95) | 44 (34.38) | 14 (29.17) |
FIGURE 1Enrichment plots of five gene sets which were significantly different in normal and HNSCC tissues using GSEA. [Including (A) MYC targets V2; (B) MYC targets V1; (C) E2F targets; (D) Epithelial Mesenchymal Transition; (E) G2/M checkpoint].
Gene sets enriched in HNSCC.
| GS follow link to MSigDB | Size | ES | NOM | Rank at Max |
|---|---|---|---|---|
| E2F Targets | 197 | −0.8 | 0.002 | 7,217 |
| G2M Checkpoint | 196 | −0.74 | 0.002 | 7,248 |
| Epithelial mesenchymal transition | 197 | −0.65 | 0.013 | 5,720 |
| MYC Targets V1 | 197 | −0.67 | 0.016 | 7,656 |
| MYC Targets V2 | 58 | −0.66 | 0.041 | 9,384 |
Detailed information of four prognostic mRNAs significantly associated with OS in patients with HNSCC.
| mRNA | Ensemble Id | Location | β(cox) |
|---|---|---|---|
| WIPF1 | ENST00000359761.7 | chr2:174,562,204–174,682,883 | −0.23734 |
| PPIB | ENST00000300026.3 | chr15:64,155,812–64,163,205 | 0.19805 |
| BASP1 | ENST00000322611.3 | chr5:17,217,560–17,276,834 | 0.09904 |
| PLOD2 | ENST00000360060.7 | chr3:146,069,444–146,161,167 | 0.15977 |
Four-mRNA risk model p-value = 1.166e-05.
FIGURE 2Differential expression of four selected genes.
FIGURE 3The four-mRNA signature related to risk score predicts OS in patients with HNSCC (A) mRNA risk score distribution in each patient. (B) Survival (in days) of patients in order of the value of risk scores. (C) A heatmap showing the expression profile of the four selected genes.
After grouping demographic and clinical characteristics of TCGA-HNSCC cohort.
| Clinical feature | Risk score | χ2 |
| |
|---|---|---|---|---|
| High risk n (%) | Low risk n (%) | |||
| Gender | 0.208 | 0.649 | ||
| Male | 183 (74.4%) | 180 (72.6%) | ||
| Female | 63 (25.6%) | 68 (27.4%) | ||
| Age | 0.453 | 0.5008 | ||
| ≥61 | 123 (49.8%) | 131 (52.8%) | ||
| <61 | 124 (50.2%) | 117 (47.2%) | ||
| Clinical T | 4.21 | 0.240 | ||
| T1 | 11 (4.6%) | 22 (9.0%) | ||
| T2 | 67 (28.1%) | 73 (29.8%) | ||
| T3 | 66 (27.7%) | 64 (26.1%) | ||
| T4-T4b | 94 (39.5%) | 86 (35.1%) | ||
| Clinical N | 1.601 | 0.206 | ||
| N0 | 112 (49.8%) | 123 (51.5%) | ||
| N1- N3 | 133 (50.2%) | 116 (48.5%) | ||
| Clinical stage | 3.283 | 0.350 | ||
| I | 7 (2.9%) | 12 (5.0%) | ||
| II | 46 (19.3%) | 46 (19.0%) | ||
| III | 47 (19.8%) | 55 (22.7%) | ||
| IV | 139 (58.0%) | 129 (53.3%) | ||
| Grade | 10.619 | 0.014 | ||
| I | 28 (11.7%) | 33 (14.0%) | ||
| II | 164 (68.3%) | 131 (55.7%) | ||
| III | 48 (20.0%) | 69 (29.4%) | ||
| IV | 0 (0%) | 2 (0.9%) | ||
| HPV P16 status | 2.760 | 0.097 | ||
| Negative | 34 (79.1%) | 37 (63.8%) | ||
| Positive | 9 (21.0%) | 21 (36.2) | ||
| Person neoplasm cancer status | 6.948 | 0.008 | ||
| Tumor free | 139 (63.8%) | 173 (75.2%) | ||
| With tumor | 79 (36.2%) | 57 (24.8%) | ||
| New tumor event after initial treatment | 5.535 | 0.018 | ||
| Yes | 32 (31.7%) | 14 (16.7%) | ||
| No | 69 (68.3%) | 71 (83.3%) | ||
| Alcohol history | 1.206 | 0.2722 | ||
| Yes | 157 (65.4%) | 171 (70.0%) | ||
| No | 83 (34.6%) | 73 (30.0%) | ||
| Tobacco smoking history | 0.8138 | 0.846 | ||
| I | 58 (24.3%) | 52 (21.4%) | ||
| II | 79 (33.0%) | 87 (35.8%) | ||
| III | 34 (14.2%) | 37 (15.2%) | ||
| IV | 68 (28.5%) | 67 (27.6%) | ||
| Lymph node neck dissection | 1.461 | 0.226 | ||
| Yes | 207 (83.8%) | 195 (79.6%) | ||
| No | 40 (16.2%) | 50 (20.4%) | ||
| Lymph vascular invasion | 1.141 | 0.286 | ||
| Yes | 55 (32.7%) | 64 (38.3%) | ||
| No | 113 (67.3%) | 103 (61.7%) | ||
| Perineural invasion | 1.559 | 0.212 | ||
| Yes | 91 (50%) | 71 (43.3%) | ||
| No | 91 (50%) | 93 (56.7%) | ||
| Radiation therapy | 0.2363 | 0.627 | ||
| Yes | 64 (66.7%) | 54 (67.5%) | ||
| No | 32 (33.3%) | 26 (32.5%) | ||
Univariable and multivariable analyses for each clinical feature.
| Univariate analysis | ||||
|---|---|---|---|---|
| Clinical feature | HR | 95% Cl |
| HR |
| Risk score | 2.177 | 1.627–2.912 | <0.001 |
|
| Clinical T | 1.078 | 0.936–1.242 | 0.296 | |
| Clinical N | 1.173 | 1.012–1.360 | 0.034 | |
| Clinical M | 4.785 | 1.762–12.994 | 0.002 | |
| Margin status | 1.340 | 1.123–1.599 | 0.001 | |
| Tumor subsite | 0.980 | 0.937–1.025 | 0.980 | |
| Perineural invasion present | 2.186 | 1.545–3.095 | <0.001 | |
| Lymph vascular invasion present | 1.704 | 1.211–2.397 | 0.002 | |
|
| ||||
|
|
|
|
|
|
| Risk score | 1.844 | 1.198–2.836 | 0.005 |
|
| Clinical T | 0.926 | 0.748–1.146 | 0.479 | |
| Clinical N | 1.161 | 0.932–1.446 | 0.182 | |
| Clinical M | 3.744 | 0.839–16.713 | 0.084 | |
| Margin status | 1.410 | 1.107–1.798 | 0.005 | |
| Tumor subsite | 0.971 | 0.913–1.033 | 0.347 | |
| Perineural invasion present | 2.218 | 1.490–3.302 | <0.001 | |
| Lymph vascular invasion present | 1.250 | 0.839–1.861 | 0.273 | |
FIGURE 4Kaplan-Meier survival analysis for patients with HNSCC stratified by the prognostic signature. (A) The receiver operating characteristic (ROC) for EMT-based prognostic signature, margin status, lymph node vascular invasion and perineural invasion only, and the EMT-based risk score combined with the above clinical parameters for OS of HNSCC patients. (B) The Kaplan-Meier curve of OS for patients in TCGA-HNSCC cohort divided into high-risk and low-risk groups. (C) The Kaplan-Meier curve of DFS for patients in TCGA-HNSCC cohort divided into high-risk and low-risk groups. (D) The Kaplan-Meier curve of prognostic signature for HNSCC patients in TCGA validation set 1 (n = 346). (E) The Kaplan-Meier curve of prognostic signature for HNSCC patients in TCGA validation set 2 (n = 148). (F) The Kaplan-Meier curve of OS in the high- and low-risk groups stratified by the four-mRNA signature in the GSE27020. Different clinical features including (G) margin status, (H) lymph vascular invasion, and (I) presence of perineural invasion predicts patient survival.
FIGURE 5Kaplan–Meier curves for prognostic value of risk score signature for patients divided by each clinical feature. (A) Age (B) Gender (C) Expression status of HPV P16 (D) Status of radiation therapy (E) Alcohol status (F) Status of lymph node neck dissection (G) Presence of perineural invasion (H) Presence of lymph vascular invasion.
FIGURE 6The top ten of GO term and pathway by target genes of four-mRNA signature in TCGA HNSCC cohort. The analysis was considered significant when P values were corrected for false discovery rate (FDR) (A) Parts of GO-CC categories of four-mRNA signature (B) Parts of GO-BP categories of four-mRNA signature (C) Parts of GO-MF categories of four-mRNA signature (D) Parts of KEGG pathway of four-mRNA signature.