
Identification and Validation of an Immune-Related RNA Signature to Predict Survival of Patients With Head and Neck Squamous Cell Carcinoma.

Shuo Wu, Xinyi Dai, Dielai Xie.

Abstract

Head and neck squamous cell carcinoma (HNSCC) is a heterogeneous disease characterized by different molecular subgroups and clinical features. It is therefore important to identify reliable molecular biomarkers that distinguish patient subgroups at different risk. Here, we conducted a multi-omics analysis to examine the joint predictive power of a multi-type RNA signature in the prognosis of HNSCC patients by integrating mRNA, miRNA, and lncRNA expression profiles with clinical data from a large number of HNSCC patients. We constructed a multi-type RNA signature (15SigRS) that classifies patients into a high-risk group and a low-risk group with significantly different outcomes [hazard ratio (HR) = 2.718, 95% confidence interval (CI), 2.258-3.272, p < 0.001] in the discovery data set. The signature was subsequently validated in the Cancer Genome Atlas (TCGA) testing data set (HR = 1.299, 95% CI, 1.170-1.442, p < 0.001) and in the independent GSE65858 data set (HR = 1.077, 95% CI, 1.016-1.143, p = 0.013). Multivariate Cox regression analysis and stratification analysis demonstrated that the predictive performance of the 15SigRS is independent of conventional clinicopathological factors. Furthermore, the 15SigRS outperformed single RNA type-based signatures in prognostic prediction. Functional analysis suggested that the 15SigRS is involved in immune- or metabolism-related KEGG pathways. In summary, our study demonstrates the potential of mixed RNA types as molecular markers for predicting the outcome of cancer patients.
Copyright © 2019 Wu, Dai and Xie.


Keywords:  biomarkers; head and neck squamous cell carcinoma; immune; prognosis; signature

Year:  2019        PMID: 31921296      PMCID: PMC6915042          DOI: 10.3389/fgene.2019.01252

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Head and neck squamous cell carcinoma (HNSCC), the most frequent histological type of head and neck cancer, is the sixth most common cancer worldwide and accounts for nearly 5% of all malignancies (Marur and Forastiere, 2016). Smoking tobacco, drinking alcohol, and human papillomavirus (HPV) infection are important risk factors and have been implicated in the pathogenesis of HNSCC (Kobayashi et al., 2018). Surgery combined with radiation therapy, chemotherapy, and targeted therapy is the main treatment option. Although TNM stage is considered an important clinical prognostic factor for guiding treatment, patients with the same clinical features may have different prognoses because of molecular heterogeneity. Therefore, there is an urgent need to identify reliable biomarkers for predicting the prognosis of HNSCC patients. With advances in high-throughput omics techniques, increasing efforts have been made to meet this need. Some previous studies used gene expression data and identified mRNA-based signatures. For example, Bai and colleagues identified a 12-gene signature for predicting progression and prognosis (Bai et al., 2019). Another six-mRNA signature was identified by Tian et al. to predict the death risk of HNSCC patients using gene expression profiles in the Cancer Genome Atlas (TCGA) (Tian et al., 2019). Recently, non-coding RNAs (ncRNAs) have been found to be an important class of RNA molecules that are involved in a wide range of biological processes (Fatica and Bozzoni, 2014; Bracken et al., 2016). The dysregulation of ncRNAs has been implicated in various human diseases including cancers (Esteller, 2011), demonstrating the role of ncRNAs as potential biomarkers in cancer diagnosis, prognosis, and treatment (Li et al., 2014; Gonzalez et al., 2015; Jiang et al., 2016; Zhou et al., 2017; Zhou et al., 2018a; Zhou et al., 2018b; Zhou et al., 2019).
For HNSCC, recent studies have revealed altered expression of ncRNAs during development and progression of the disease (Salyakina and Tsinoremas, 2016; Sannigrahi et al., 2018), and several miRNA- or lncRNA-related signatures have been identified to improve clinical outcome (Irani, 2016; Wong et al., 2016; Cao et al., 2017; Liu et al., 2018; Diao et al., 2019). However, previous signatures often focus on one type of RNA, and the joint predictive power of multiple types of RNAs has not yet been investigated. In this study, we investigated the joint predictive power of multi-type RNAs as novel prognostic biomarkers by integrating mRNA expression profiles, miRNA expression profiles, lncRNA expression profiles, and clinical data from a large number of HNSCC patients.

Materials and Methods

Patient Data Set

RNA-Seq data (HTSeq), miRNA expression data (Illumina HiSeq), and corresponding clinical data were derived from the TCGA database (https://cancergenome.nih.gov/). Ensembl gene IDs of mRNAs, miRNAs, and lncRNAs were derived from the HUGO Gene Nomenclature Committee (HGNC) database (https://www.genenames.org/). After cross-referencing by Ensembl gene ID and tumor barcode, and after removing patient samples without survival information and genes with zero expression values in more than 10% of samples, a total of 19,163 mRNAs, 3,931 lncRNAs, and 1,854 miRNAs in 489 patients were obtained. All patients were randomly split into two approximately equal patient cohorts: a discovery data set (n = 245) and a validation data set (n = 244). Another independent validation data set including 270 HNSCC patients was obtained from the Gene Expression Omnibus (GEO) database under the accession number GSE65858 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE65858). Clinical features of the HNSCC patients used in this study are shown in Table 1.
Table 1

Summary of clinical characteristics of three HNSCC patient data sets in the study.

| Characteristic | Category | Discovery dataset (N = 245) | Validation dataset (N = 244) | TCGA dataset (N = 489) | GSE65858 dataset (N = 270) |
| --- | --- | --- | --- | --- | --- |
| Vital status, n (%) | Alive | 150 (61.2) | 128 (52.5) | 278 (56.9) | 176 (65.2) |
| | Dead | 95 (38.8) | 116 (47.5) | 211 (43.1) | 94 (34.8) |
| Age (years), n (%) | >= 60 | 132 (53.9) | 141 (57.8) | 273 (55.8) | 117 (49.3) |
| | < 60 | 113 (46.1) | 103 (42.2) | 216 (44.2) | 153 (56.7) |
| Gender, n (%) | Female | 64 (26.1) | 66 (27.0) | 130 (26.6) | 47 (17.4) |
| | Male | 181 (73.9) | 178 (73.0) | 359 (73.4) | 223 (82.6) |
| Stage, n (%) | Stage I/II | 57 (23.3) | 37 (15.2) | 94 (19.2) | 55 (20.4) |
| | Stage III/IV | 157 (64.1) | 171 (70.1) | 328 (67.1) | 215 (79.6) |
| | NA | 31 (12.6) | 36 (14.7) | 67 (13.7) | |
| Grade, n (%) | G1 | 29 (11.8) | 32 (13.1) | 61 (12.5) | |
| | G2 | 147 (60) | 144 (59.0) | 291 (59.5) | |
| | G3 | 59 (24.1) | 58 (23.8) | 117 (23.9) | |
| | NA | 10 (4.1) | 10 (4.1) | 20 (4.1) | |
| Race, n (%) | White | 201 (82) | 216 (88.5) | 417 (85.3) | |
| | Other_race | 34 (13.9) | 24 (9.8) | 58 (11.9) | |
| | NA | 10 (4.1) | 4 (1.7) | 14 (2.9) | |
| ANGIOLYMPHATIC_INVASION, n (%) | Yes | 54 (22) | 61 (25) | 115 (23.5) | |
| | No | 112 (45.7) | 104 (42.6) | 216 (44.2) | |
| | NA | 79 (32.2) | 79 (32.4) | 158 (32.3) | |
| PERINEURAL_INVASION, n (%) | Yes | 72 (29.4) | 86 (35.2) | 158 (32.3) | |
| | No | 98 (40) | 86 (35.2) | 184 (37.6) | |
| | NA | 75 (30.6) | 72 (29.5) | 147 (30.1) | |
| Smoking_pack_years, n (%) | >= 40 | 82 (33.5) | 78 (32.0) | 160 (32.7) | 222 (YES, 82.2) |
| | < 40 | 61 (24.9) | 58 (23.8) | 119 (24.3) | 48 (NO, 17.8) |
| | NA | 102 (41.6) | 108 (44.2) | 210 (42.9) | |
| ALCOHOL_HISTORY_DOCUMENTED, n (%) | Yes | 159 (64.9) | 165 (67.6) | 324 (66.3) | |
| | No | 81 (33.1) | 73 (29.9) | 154 (31.5) | |
| | NA | 5 (2) | 6 (2.5) | 11 (2.2) | |
| HPV_STATUS_P16, n (%) | Negative | 37 (15.1) | 32 (13.1) | 69 (14.1) | |
| | Positive | 15 (6.1) | 15 (6.1) | 30 (6.1) | |
| | NA | 193 (78.8) | 197 (80.8) | 390 (79.8) | |

HNSCC, head and neck squamous cell carcinoma; TCGA, the Cancer Genome Atlas.
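The random split described under Patient Data Set can be sketched in a few lines. This is only an illustration: the source does not say how the randomization was done, so the seed, the `split_cohort` helper, and the placeholder patient IDs are all assumptions.

```python
import random

def split_cohort(patient_ids, seed=0):
    """Shuffle patient IDs and split them into two near-equal halves."""
    ids = list(patient_ids)
    rng = random.Random(seed)   # fixed seed (an assumption) for reproducibility
    rng.shuffle(ids)
    half = (len(ids) + 1) // 2  # the larger half goes to the discovery set
    return ids[:half], ids[half:]

# Hypothetical placeholder IDs standing in for the 489 TCGA tumor barcodes.
patients = [f"patient_{i:03d}" for i in range(489)]
discovery, validation = split_cohort(patients)
print(len(discovery), len(validation))  # 245 244
```

A single random split like this does not guarantee balanced clinical characteristics between the two halves, which is why a summary such as Table 1 is useful as a check.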

Identification of a Survival-Related Multi-Type RNA Prognostic Signature

To identify survival-related genes, univariate Cox proportional hazards analyses were used to identify candidate prognostic mRNAs, miRNAs, and lncRNAs. Candidates were retained only if they had significant p values (p < 0.05). These candidate prognostic mRNAs, miRNAs, and lncRNAs were then fitted in a multivariable Cox regression analysis to identify independent survival-related genes. Finally, a multi-type RNA prognostic signature was constructed as the linear combination of the expression values of the independent survival-related mRNAs, miRNAs, and lncRNAs, weighted by their estimated regression coefficients in the multivariate Cox regression analysis, according to previous studies (Zhou et al., 2015a; Zhou et al., 2015b).
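Written out, the linear combination described above takes the following form. The notation is ours rather than the source's: $\mathrm{RS}_j$ denotes the risk score of patient $j$, $x_{ij}$ the expression value of the $i$-th retained RNA in that patient, $\beta_i$ its multivariate Cox regression coefficient, and $n$ the number of RNAs in the signature.

```latex
\mathrm{RS}_j = \sum_{i=1}^{n} \beta_i \, x_{ij}
```

A positive $\beta_i$ means that higher expression of that RNA raises the score (higher estimated risk), and a negative $\beta_i$ lowers it.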

Statistical Analysis

Kaplan–Meier survival curve analysis and the log-rank test were used to compare overall survival (OS) time between the high-risk group and the low-risk group. Univariate and multivariate Cox regression analyses were performed on the individual clinical variables with and without the multi-type RNA prognostic signature in each data set. Hazard ratios (HRs) and 95% confidence intervals (CIs) were calculated. Time-dependent receiver operating characteristic (ROC) curves at 3 and 5 years were then calculated to compare the sensitivity and specificity of survival prediction. Hierarchical clustering of the expression values of the independent prognostic gene biomarkers was performed using Euclidean distance and complete linkage. The chi-square test was used to test for differences in survival status between two groups. All statistical analyses were performed using R/Bioconductor (version 3.0.2).
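For readers unfamiliar with the Kaplan–Meier estimator mentioned above, a minimal pure-Python sketch follows. It is not the authors' R code; the function name `kaplan_meier` and the toy follow-up data are hypothetical and serve only to show how the survival curve is built from event times and censoring flags.

```python
def kaplan_meier(times, events):
    """Return [(time, survival probability)] at each distinct event time.

    times  -- follow-up time for each patient
    events -- 1 if death was observed at that time, 0 if censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all patients whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths > 0:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving  # deaths and censored patients both leave the risk set
    return curve

# Toy data (hypothetical): four patients, the second one censored at t = 2.
print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
```

Censored patients never lower the curve directly; they only shrink the risk set used at later event times.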

Functional Enrichment Analysis

GO and KEGG functional enrichment analysis was performed using Bioconductor package “clusterProfiler” (Yu et al., 2012).

Results

Identification of Independent Survival-Related mRNAs, miRNAs, and lncRNAs

To identify survival-related mRNAs, miRNAs, and lncRNAs, we performed univariate Cox regression analysis to evaluate the association between the expression of each type of RNA and OS in the discovery data set. A total of 23 mRNAs, 15 lncRNAs, and 1 miRNA were found to be significantly associated with OS and were considered candidate prognostic mRNAs, miRNAs, and lncRNAs. All of these candidates were then fitted into a multivariate Cox regression analysis, and 15 of the 39 genes were identified as independent prognostic gene biomarkers. Hierarchical clustering of the expression values of the 15 independent prognostic gene biomarkers revealed two distinct sample clusters in the discovery data set (Figure 1A). The survival status of the two sample clusters was significantly different (dead 57.8% vs. 23.5%, p = 9.349e-08, chi-square test). Survival analysis suggested that the OS time of the two sample clusters was also significantly different (Figure 1B, p < 0.001, log-rank test). Similar results were observed in the validation data set. Two distinct sample clusters were again obtained using hierarchical clustering analysis (Figure 1C). These two sample clusters had significantly different survival status (dead 55.5% vs. 34.1%, p = 0.002, chi-square test) and survival time (Figure 1D, p < 0.001, log-rank test). These results revealed the potential of these 15 candidate independent prognostic genes as biomarkers in the prognosis of HNSCC patients.
Figure 1

Identification of independent survival-related mRNAs, miRNAs, and lncRNAs. (A) Hierarchical clustering analysis of 245 patients in the discovery data set using 15 prognostic genes. (B) Kaplan–Meier survival curves of overall survival between two clusters in the discovery data set. (C) Hierarchical clustering analysis of 244 patients in the validation data set using 15 prognostic genes. (D) Kaplan–Meier survival curves of overall survival between two clusters in the validation data set.

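The chi-square comparison of survival status between two sample clusters can be illustrated as follows. The source reports only percentages, so the counts below are invented for illustration. The sketch also omits the continuity correction that R's `chisq.test` applies to 2x2 tables by default, so it should not be expected to reproduce the reported p values.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square test (no continuity correction) for a 2x2 table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # With 1 degree of freedom the upper-tail p value equals erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value

# Hypothetical counts: rows are clusters, columns are (dead, alive).
stat, p_value = chi_square_2x2([[10, 20], [20, 10]])
print(round(stat, 3), round(p_value, 4))
```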

Establishment and Evaluation of a Multi-Type RNA Prognostic Signature in Predicting Survival in the Discovery Data Set

To establish a multi-type RNA prognostic signature for survival prediction, these 15 candidate independent prognostic genes were fitted in a multivariate Cox regression analysis in the discovery data set. A multi-type RNA prognostic signature (15SigRS) was then constructed from the expression of the 15 prognostic genes, with the multivariate Cox regression coefficients as weights, using the risk scoring method described previously, as follows: 15SigRS = (0.5344*CDH6)+(1.0462*CYP19A1)+(0.4723*TRPA1)+(0.2764*PPARG)+(0.0068*KRT84)+(−0.2291*FGD3)+(0.3113*ADGRE1)+(−0.7948*SLC25A45)+(0.4878*OXCT2)+(−4.0659*OTUD7A)+(1.2231*FAM198B-AS1)+(−0.3978*LINC00968)+(1.8352*LINC01123)+(0.1240*ZBED5-AS1)+(−0.0602*MIR4664). We computed a 15SigRS for each HNSCC patient and classified patients into the high-risk group or the low-risk group using the median risk score in the discovery data set (−0.04) as the cutoff point. Using the 15SigRS, the 245 patients in the discovery data set were divided into high-risk (n = 123) and low-risk (n = 122) groups. The survival time of the high-risk group was significantly shorter than that of the low-risk group (Figure 2A, p < 0.001, log-rank test). The time-dependent ROC curve analysis for the 15SigRS achieved an area under the ROC curve (AUC) of 0.781 at 3 years and 0.768 at 5 years (Figure 2B). The distribution of risk scores, the survival status of patients, and the expression patterns of the 15 prognostic genes in the 15SigRS are shown in Figure 2C.
Figure 2

Development and evaluation of the 15SigRS in the discovery data set. (A) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group. (B) Time-dependent receiver operating characteristic (ROC) analysis at 3 and 5 years. (C) The distribution of risk scores and survival status of patients and expression patterns of 15 prognostic genes in the 15SigRS.

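The 15SigRS formula and the median cutoff reported above translate directly into code. The coefficients and the cutoff of −0.04 are taken from the text. Everything else is an assumption made for illustration: the helper names, the rule that a score strictly above the cutoff counts as high-risk (the source does not say how ties are handled), and the expectation that expression values are on the same scale the authors used.

```python
# Multivariate Cox coefficients of the 15SigRS as reported in the text.
COEFFICIENTS = {
    "CDH6": 0.5344, "CYP19A1": 1.0462, "TRPA1": 0.4723, "PPARG": 0.2764,
    "KRT84": 0.0068, "FGD3": -0.2291, "ADGRE1": 0.3113, "SLC25A45": -0.7948,
    "OXCT2": 0.4878, "OTUD7A": -4.0659, "FAM198B-AS1": 1.2231,
    "LINC00968": -0.3978, "LINC01123": 1.8352, "ZBED5-AS1": 0.1240,
    "MIR4664": -0.0602,
}
CUTOFF = -0.04  # median risk score in the discovery data set

def risk_score(expression):
    """Weighted sum of expression values; raises KeyError if a gene is missing."""
    return sum(coef * expression[gene] for gene, coef in COEFFICIENTS.items())

def risk_group(expression, cutoff=CUTOFF):
    """Assumption: scores strictly above the cutoff are labeled high-risk."""
    return "high-risk" if risk_score(expression) > cutoff else "low-risk"

# Hypothetical patient with every gene at 0 except OTUD7A.
patient = {gene: 0.0 for gene in COEFFICIENTS}
patient["OTUD7A"] = 1.0
print(risk_score(patient), risk_group(patient))
```

Because OTUD7A carries by far the largest absolute weight, small changes in its measured expression move the score more than any other gene in the signature.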

Independent Confirmation of the 15SigRS for Survival Prediction in the Validation Data Set and TCGA Data Set

To evaluate the robustness of the prognostic performance of the 15SigRS, the signature was tested in the independent validation data set. With the 15SigRS and the cutoff derived from the discovery data set, all 244 patients in the validation data set were classified into a high-risk group (n = 119) and a low-risk group (n = 125). As shown in Figure 3A, patients in the low-risk group had a better outcome than those in the high-risk group (p < 0.001, log-rank test). The time-dependent ROC curve analysis for the 15SigRS achieved an AUC of 0.658 at 3 years and 0.663 at 5 years (Figure 3B). In univariate analysis, the HR of the high-risk group versus the low-risk group for OS was 1.299 (p < 0.001; 95% CI, 1.170–1.442) (Table 2).
Figure 3

Independent validation of the 15SigRS in the Cancer Genome Atlas (TCGA) data set. (A) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group in the validation data set. (B) Time-dependent ROC analysis at 3 and 5 years in the validation data set. (C) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group in the TCGA data set. (D) Time-dependent ROC analysis at 3 and 5 years in the TCGA data set.

Table 2

Univariate and multivariate Cox regression analysis of OS in each data set.

| Variable | Univariate HR | Univariate 95% CI of HR | Univariate P value | Multivariable HR | Multivariable 95% CI of HR | Multivariable P value |
| --- | --- | --- | --- | --- | --- | --- |
| Discovery data set (n = 245) | | | | | | |
| 15SigRS | 2.718 | 2.258–3.272 | <0.001 | 2.562 | 1.999–3.284 | <0.001 |
| Age | 1.043 | 1.024–1.063 | 0.000 | 1.016 | 0.991–1.042 | 0.218 |
| Gender (male/female) | 0.643 | 0.42–0.984 | 0.042 | 0.499 | 0.272–0.916 | 0.025 |
| Stage (III&IV/I&II) | 1.254 | 0.753–2.088 | 0.385 | 1.187 | 0.352–4.001 | 0.783 |
| Grade (G2/G1) | 1.362 | 0.703–2.640 | 0.360 | 1.685 | 0.584–4.861 | 0.334 |
| Grade (G3/G1) | 1.139 | 0.554–2.343 | 0.723 | 1.829 | 0.599–5.582 | 0.289 |
| Race (White/other race) | 0.578 | 0.329–1.014 | 0.056 | 1.623 | 0.679–3.880 | 0.276 |
| ALCOHOL_HISTORY_DOCUMENTED (yes/no) | 0.734 | 0.483–1.113 | 0.145 | 1.010 | 0.562–1.815 | 0.974 |
| ANGIOLYMPHATIC_INVASION (yes/no) | 1.687 | 0.988–2.881 | 0.056 | | | |
| PERINEURAL_INVASION (yes/no) | 2.879 | 1.689–4.907 | 0.000 | | | |
| SMOKING_PACK_YEARS | 1.001 | 0.995–1.008 | 0.737 | | | |
| Validation data set (n = 244) | | | | | | |
| 15SigRS | 1.299 | 1.170–1.442 | <0.001 | 1.311 | 1.158–1.484 | <0.001 |
| Age | 1.003 | 0.986–1.02 | 0.721 | 1.020 | 0.996–1.045 | 0.111 |
| Gender (male/female) | 0.904 | 0.605–1.352 | 0.624 | 1.063 | 0.606–1.866 | 0.832 |
| Stage (III & IV/I & II) | 3.294 | 1.595–6.805 | 0.001 | 3.102 | 0.569–16.905 | 0.191 |
| Grade (G2/G1) | 2.264 | 1.157–4.430 | 0.017 | 1.326 | 0.593–2.967 | 0.492 |
| Grade (G3/G1) | 1.927 | 0.936–3.964 | 0.075 | 1.248 | 0.520–2.993 | 0.620 |
| Race (White/other race) | 0.871 | 0.477–1.587 | 0.651 | 0.900 | 0.452–1.790 | 0.763 |
| ALCOHOL_HISTORY_DOCUMENTED (yes/no) | 1.185 | 0.788–1.78 | 0.415 | 1.536 | 0.882–2.675 | 0.130 |
| ANGIOLYMPHATIC_INVASION (yes/no) | 1.809 | 1.138–2.874 | 0.012 | | | |
| PERINEURAL_INVASION (yes/no) | 1.698 | 1.061–2.716 | 0.027 | | | |
| SMOKING_PACK_YEARS | 1.001 | 0.992–1.011 | 0.801 | | | |
| TCGA data set (n = 489) | | | | | | |
| 15SigRS | 1.496 | 1.393–1.606 | <0.001 | 1.482 | 1.348–1.629 | <0.001 |
| Age | 1.022 | 1.009–1.035 | 0.001 | 1.027 | 1.010–1.044 | 0.002 |
| Gender (male/female) | 0.759 | 0.568–1.016 | 0.064 | 0.786 | 0.526–1.174 | 0.239 |
| Stage (III & IV/I & II) | 1.812 | 1.216–2.701 | 0.003 | 1.846 | 0.768–4.437 | 0.171 |
| Grade (G2/G1) | 1.749 | 1.102–2.777 | 0.018 | 1.240 | 0.679–2.264 | 0.483 |
| Grade (G3/G1) | 1.507 | 0.913–2.487 | 0.109 | 1.441 | 0.754–2.754 | 0.269 |
| Race (White/other race) | 0.710 | 0.473–1.065 | 0.098 | 0.811 | 0.492–1.335 | 0.410 |
| ALCOHOL_HISTORY_DOCUMENTED (yes/no) | 0.951 | 0.712–1.27 | 0.734 | 1.165 | 0.792–1.714 | 0.437 |
| ANGIOLYMPHATIC_INVASION (yes/no) | 1.750 | 1.239–2.473 | 0.001 | | | |
| PERINEURAL_INVASION (yes/no) | 2.222 | 1.563–3.16 | 0.000 | | | |
| SMOKING_PACK_YEARS | 1.001 | 0.995–1.006 | 0.765 | | | |
| HPV_STATUS_P16 (yes/no) | 0.504 | 0.172–1.477 | 0.212 | | | |
| GSE65858 data set (n = 270) | | | | | | |
| 15SigRS | 1.077 | 1.016–1.143 | 0.013 | 1.073 | 1.012–1.137 | 0.019 |
| Age | 1.037 | 1.006–1.048 | 0.012 | 1.03 | 1.007–1.053 | 0.01 |
| Gender (male/female) | 1.046 | 0.6174–1.771 | 0.868 | 1.026 | 0.602–1.749 | 0.923 |
| Stage (II/I) | 0.386 | 0.112–1.333 | 0.132 | 0.306 | 0.088–1.071 | 0.064 |
| Stage (III/I & II) | 0.459 | 0.1447–1.454 | 0.185 | 0.423 | 0.133–1.343 | 0.144 |
| Stage (IV/I&) | 1.495 | 0.603–3.705 | 0.385 | 1.339 | 0.537–3.336 | 0.531 |
| SMOKING (yes/no) | 0.941 | 0.555–1.595 | 0.821 | 1.294 | 0.733–2.284 | 0.373 |

OS, overall survival; HR, hazard ratio.

A similar analysis was also performed in the TCGA data set. The patients of the TCGA data set were segregated into a high-risk group (n = 242) and a low-risk group (n = 247) with significantly different OS (Figure 3C, p < 0.001, log-rank test). The time-dependent ROC curve analysis for the 15SigRS achieved an AUC of 0.681 at 3 years and 0.649 at 5 years (Figure 3D). In univariate analysis, the HR of the high-risk group versus the low-risk group for OS was 1.496 (p < 0.001; 95% CI, 1.393–1.606) (Table 2).

Further Confirmation of the 15SigRS for Survival Prediction in GEO Data Set With Microarray Platform

Further validation of the 15SigRS for survival prediction was performed using another independent data set (GSE65858) of 270 patients profiled on a microarray platform (Illumina HumanHT-12 V4.0). Expression values for 9 mRNAs of the 15SigRS could be obtained from GSE65858. With the same score model, the 15SigRS could distinguish between patients with high and low risks of death (Figure 4A, p = 0.021, log-rank test). The OS rates of patients in the low-risk group were 69.9% at 3 years and 59.2% at 5 years, significantly higher than those in the high-risk group (60.6% at 3 years and 37% at 5 years). The AUC of the time-dependent ROC curve analysis was 0.581 at 3 years and 0.595 at 5 years (Figure 4B). In univariate analysis, the HR of the high-risk group versus the low-risk group for OS was 1.077 (p = 0.013; 95% CI, 1.016–1.143) (Table 2).
Figure 4

Independent validation of the 15SigRS in the Gene Expression Omnibus (GEO) data set. (A) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group in the GSE65858 data set. (B) Time-dependent ROC analysis at 3 and 5 years in the GSE65858 data set.


Independent Predictive Power of the 15SigRS From Clinicopathological Factors

To further investigate whether the predictive power of the 15SigRS was independent of other clinicopathological factors, we performed multivariate Cox regression analysis of the 15SigRS with selected covariables including age, gender, stage, grade, race, and alcohol history. The results suggested that the 15SigRS retained a significant association with OS after adjustment for other clinicopathological factors in the discovery data set (HR = 2.562, p < 0.001; 95% CI, 1.999–3.284), validation data set (HR = 1.311, p < 0.001; 95% CI, 1.158–1.484), TCGA data set (HR = 1.482, p < 0.001; 95% CI, 1.348–1.629), and independent GSE65858 data set (HR = 1.073, p = 0.019; 95% CI, 1.012–1.137) (Table 2). We next performed a stratification analysis by smoking and alcohol history. A total of 279 patients with smoking information were first divided into two patient data sets: a smoking-light data set (n = 119) and a smoking-heavy data set (n = 160). Using the 15SigRS, patients in the smoking-light data set could be subdivided into a high-risk group and a low-risk group with significantly different outcomes (Figure 5A, p = 0.005, log-rank test). Similar results were observed when the 15SigRS was tested in the smoking-heavy data set (Figure 5B, p < 0.001, log-rank test). Then 478 patients with alcohol information were divided into two patient data sets: an alcohol-no data set (n = 154) and an alcohol-yes data set (n = 324). Using the 15SigRS, patients in the alcohol-no data set could be subdivided into a high-risk group (n = 83) and a low-risk group (n = 71) with significantly different outcomes (Figure 5C, p < 0.001, log-rank test). Similar results were observed when the 15SigRS was tested in the alcohol-yes data set (Figure 5D, p < 0.001, log-rank test). Together, the multivariate and stratification analyses show that the predictive power of the 15SigRS for survival in patients with HNSCC was independent of other clinicopathological factors.
Figure 5

Stratification analysis for smoking and alcohol. (A) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group for smoking-light patients. (B) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group for smoking-heavy patients. (C) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group for alcohol-no patients. (D) Kaplan–Meier survival curves of overall survival between the high-risk group and low-risk group for alcohol-yes patients.


Performance Comparison of the 15SigRS With the Single RNA Type-Based Signatures

We then compared the predictive performance of the 15SigRS with that of single RNA type-based signatures. We performed ROC analysis and computed AUCs for the 15SigRS and the three single-type RNA signatures in each of the three data sets. As shown in Figure 6A, the 15SigRS achieved an AUC of 0.79 in the discovery data set, which is higher than that of the three single-type RNA signatures (mRNA-based signature AUC = 0.777, lncRNA-based signature AUC = 0.574, and miRNA signature AUC = 0.539). The 15SigRS also performed well in the validation data set and the TCGA data set compared with the three single-type RNA signatures (Figure 6B and C). Taken together, the 15SigRS generated by our approach outperformed single RNA type-based signatures in prognostic prediction.
Figure 6

ROC analysis of the 15SigRS with the single RNA type-based signatures in the discovery data set (A), validation data set (B), and TCGA data set (C).


Functional Characteristics of the 15SigRS

To further explore the potential function of the 15SigRS, we first calculated the Pearson correlation coefficient between the expression levels of mRNAs and the lncRNAs in the 15SigRS and identified the top-ranked 5% of mRNAs as lncRNA-related mRNAs. We then performed GO and KEGG functional enrichment analysis for these lncRNA-related mRNAs. The GO enrichment analysis suggested that these lncRNA-related mRNAs are enriched in immune- or cell differentiation-related GO terms (Figure 7A). The KEGG enrichment analysis suggested that they are enriched in immune- or metabolism-related KEGG pathways (Figure 7B).
Figure 7

Function enrichment analysis. (A) GO enrichment analysis. (B) KEGG enrichment analysis.

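The co-expression step that precedes the enrichment analysis, in which mRNAs are ranked by Pearson correlation with a signature lncRNA and the top fraction is kept, can be sketched as below. The source does not state whether ranking used the signed or the absolute coefficient; this sketch assumes the absolute value. The function names and the toy expression vectors are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient; returns 0.0 if either vector is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)

def top_correlated(lncrna, mrnas, fraction=0.05):
    """Names of the mRNAs in the top `fraction` by |r| with the lncRNA (assumption)."""
    ranked = sorted(mrnas, key=lambda g: abs(pearson(lncrna, mrnas[g])), reverse=True)
    keep = max(1, math.ceil(fraction * len(ranked)))
    return ranked[:keep]

# Toy expression vectors across four samples (hypothetical values).
lncrna = [1.0, 2.0, 3.0, 4.0]
mrnas = {
    "gene_a": [2.0, 4.0, 6.0, 8.0],
    "gene_b": [1.0, 3.0, 2.0, 4.0],
    "gene_c": [5.0, 5.0, 5.0, 5.0],
}
print(top_correlated(lncrna, mrnas, fraction=0.34))
```

The resulting gene list would then be passed to an enrichment tool such as clusterProfiler, which is outside the scope of this sketch.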

Discussion

The molecular landscape has highlighted that HNSCC is a heterogeneous disease characterized by different molecular subgroups and clinical features (Leemans et al., 2018). Despite improvements in the diagnosis and treatment of HNSCC, patient subgroups with the same TNM stage but different molecular features might benefit from more personalized treatment options. It is therefore critical to identify reliable molecular biomarkers that distinguish patient subgroups at different risk. Although increasing efforts have been made to meet this need, previously reported gene signatures involved only one type of RNA, such as mRNAs, lncRNAs, or miRNAs. Cooperative roles among different RNA molecules have been unveiled in cancer development and progression (Zhou et al., 2016a; Zhou et al., 2016b; Pan et al., 2019; Zhu et al., 2019). Therefore, in this study, we performed a systematic analysis to examine the joint predictive power of a multi-type RNA signature in the prognosis of HNSCC patients by integrating mRNA, miRNA, and lncRNA expression profiles with clinical data from a large number of HNSCC patients. Because few HNSCC patient data sets provide paired mRNA, miRNA, and lncRNA profiles together with clinical data, the TCGA HNSCC patient data were first split randomly into two independent patient data sets for discovery and independent validation. We then identified 15 RNA genes (10 mRNAs, 4 lncRNAs, and 1 miRNA) as independent biomarkers and constructed a 15-RNA signature (15SigRS) that classifies patients into a high-risk group and a low-risk group with significantly different outcomes in the discovery data set. The 15SigRS was further validated in the independent patient data set, which demonstrated the robustness of its performance in survival prediction.
Further multivariate Cox regression analysis and stratification analysis demonstrated that the predictive performance of the 15SigRS is independent of conventional clinicopathological factors, such as age, gender, stage, grade, race, smoking, and drinking, in both the discovery data set and the validation data set. Among the 15 RNAs in the signature, several have been reported to be associated with cancer development and prognosis. For example, ADGRE1 encodes the F4/80 antigen, which is expressed in immune cells and used as a monocyte-macrophage marker in mice (Waddell et al., 2018). KRT84 has been reported to be up-regulated in squamous cell carcinoma and involved in metabolic pathways (Koringa et al., 2016). Sancisi et al. found that CDH6 was highly expressed in thyroid tumor patients and could act as a regulator of invasiveness in thyroid tumors (Sancisi et al., 2013). A pan-cancer analysis suggested that hsa-mir-4664 was over-expressed in eight cancers (Hu et al., 2018). Low LINC00968 expression has recently been reported to be associated with poor prognosis in breast cancer, where LINC00968 attenuates drug resistance (Xiu et al., 2019). To gain a global view of the biological function of the 15SigRS, we performed GO and KEGG functional enrichment analysis, which indicated that the 15SigRS may be involved in immune- or metabolism-related biological functions. There are several limitations in our study that need to be noted. First, only some of the 15 RNAs in the 15SigRS have been experimentally studied, and the remaining RNAs should be investigated in further experiments, which may provide new therapeutic targets in HNSCC. Second, the 15SigRS was validated in only one independent patient data set because of data limitations, and more patient data sets are needed to validate its performance before clinical application. Taken together, our study identified a novel multi-type RNA signature associated with the clinical outcome of HNSCC patients. This signature may be a novel independent molecular prognostic marker for selecting high-risk patients who may benefit from more individualized treatment.

Data Availability Statement

The data analyzed in this study were obtained from the Cancer Genome Atlas (TCGA) database (https://cancergenome.nih.gov/) and the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE65858).

Author Contributions

SW conceived and designed the experiments. SW, XD, and DX performed the experiments and analyzed the data. SW wrote the paper. All authors read and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1.  Transcriptome Analysis of Triple-Negative Breast Cancer Reveals an Integrated mRNA-lncRNA Signature with Predictive and Prognostic Value.

Authors:  Yi-Zhou Jiang; Yi-Rong Liu; Xiao-En Xu; Xi Jin; Xin Hu; Ke-Da Yu; Zhi-Ming Shao
Journal:  Cancer Res       Date:  2016-03-03       Impact factor: 12.701

Review 2.  The molecular landscape of head and neck cancer.

Authors:  C René Leemans; Peter J F Snijders; Ruud H Brakenhoff
Journal:  Nat Rev Cancer       Date:  2018-03-02       Impact factor: 60.716

3.  A three-lncRNA signature derived from the Atlas of ncRNA in cancer (TANRIC) database predicts the survival of patients with head and neck squamous cell carcinoma.

Authors:  Wei Cao; Jian-Nan Liu; Zeqi Liu; Xu Wang; Ze-Guang Han; Tong Ji; Wan-Tao Chen; Xin Zou
Journal:  Oral Oncol       Date:  2017-01-03       Impact factor: 5.337

Review 4.  Head and Neck Squamous Cell Carcinoma: Update on Epidemiology, Diagnosis, and Treatment.

Authors:  Shanthi Marur; Arlene A Forastiere
Journal:  Mayo Clin Proc       Date:  2016-03       Impact factor: 7.616

5.  A potential signature of eight long non-coding RNAs predicts survival in patients with non-small cell lung cancer.

Authors:  Meng Zhou; Maoni Guo; Dongfeng He; Xiaojun Wang; Yinqiu Cui; Haixiu Yang; Dapeng Hao; Jie Sun
Journal:  J Transl Med       Date:  2015-07-17       Impact factor: 5.531

6.  ADGRE1 (EMR1, F4/80) Is a Rapidly-Evolving Gene Expressed in Mammalian Monocyte-Macrophages.

Authors:  Lindsey A Waddell; Lucas Lefevre; Stephen J Bush; Anna Raper; Rachel Young; Zofia M Lisowski; Mary E B McCulloch; Charity Muriuki; Kristin A Sauter; Emily L Clark; Katharine M Irvine; Clare Pridans; Jayne C Hope; David A Hume
Journal:  Front Immunol       Date:  2018-10-01       Impact factor: 7.561

7.  A six-mRNA prognostic model to predict survival in head and neck squamous cell carcinoma.

Authors:  Saisai Tian; Guofeng Meng; Weidong Zhang
Journal:  Cancer Manag Res       Date:  2018-12-20       Impact factor: 3.989

8.  Long non-coding RNA LINC00968 attenuates drug resistance of breast cancer cells through inhibiting the Wnt2/β-catenin signaling pathway by regulating WNT2.

Authors:  Dian-Hui Xiu; Gui-Feng Liu; Shao-Nan Yu; Long-Yun Li; Guo-Qing Zhao; Lin Liu; Xue-Feng Li
Journal:  J Exp Clin Cancer Res       Date:  2019-02-21

9.  Cadherin 6 is a new RUNX2 target in TGF-β signalling pathway.

Authors:  Valentina Sancisi; Greta Gandolfi; Moira Ragazzi; Davide Nicoli; Ione Tamagnini; Simonetta Piana; Alessia Ciarrocchi
Journal:  PLoS One       Date:  2013-09-12       Impact factor: 3.240

Review 10.  miRNAs Signature in Head and Neck Squamous Cell Carcinoma Metastasis: A Literature Review.

Authors:  Soussan Irani
Journal:  J Dent (Shiraz)       Date:  2016-06