| Literature DB >> 31620361 |
Bowen Yang1,2, Jiming Shen1,2, Lu Xu1,2, Ying Chen1,2, Xiaofang Che1,2, Xiujuan Qu1,2, Yunpeng Liu1,2, Yuee Teng1,2, Zhi Li1,2.
Abstract
Objectives: LncRNAs are essential survival prognostic indicators with important biological functions in tumorigenesis and tumor progression. This study aimed to establish a long non-coding RNA (lncRNA) signature that can effectively predict the prognosis of patients with head and neck squamous cell carcinoma (HNSCC) and explore the potential functions of these lncRNAs. Materials andEntities:
Keywords: gene set enrichment analysis (GSEA); head and neck squamous cell carcinoma (HNSCC); long non-coding RNA (lncRNA); prognostic signature; weighted co-expression network construction (WGCNA)
Year: 2019 PMID: 31620361 PMCID: PMC6759597 DOI: 10.3389/fonc.2019.00898
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Analysis of flowchart illustrates the exploration procedure for the HNSC prognostic lncRNAs and the related mechanisms.
Figure 2The seed lncRNAs were extracted by 1,000 times Cox LASSO regression. (A) Highly consistency was demonstrated in the lncRNAs among the 8 extracted lncRNA sets. The left ordinate indicates the seed lncRNA set and the number of seeds lncRNAs found by every single iteration of LASSO. The right ordinate is the frequency of the seed lncRNA set disclosed through the 1,000 times Cox LASSO regression. The horizontal ordinate is the lncRNA name. The yellow block represents the occurrence of the particular lncRNA in the specific lncRNA set; (B) Totally 28 seed lncRNAs with >990 occurrences in the most common lncRNA set were filtered out for further analysis. The blue column indicates the frequency of each lncRNA occurs in the most common lncRNA set.
Descriptions of the eight lncRNAs.
| MIR99AHG | ENSG00000215386.12 | OTTHUMG00000074377.5 | lincRNA | 21 | 15928296 | 16645065 | + |
| AC008655.1 | ENSG00000267815.1 | OTTHUMG00000183045.1 | Antisense | 19 | 50310022 | 50310539 | – |
| AC010624.1 | ENSG00000204666.3 | OTTHUMG00000165525.1 | Sense_overlapping | 19 | 50050589 | 50066793 | + |
| AC055758.2 | ENSG00000244358.1 | OTTHUMG00000159391.1 | lincRNA | 3 | 145939912 | 145961536 | + |
| AC130456.4 | ENSG00000260681.1 | OTTHUMG00000177223.1 | Antisense | 16 | 19,410,729 | 19,411,662 | – |
| AC118553.1 | ENSG00000228084.1 | OTTHUMG00000010804.1 | Antisense | 1 | 99968383 | 99969864 | – |
| LINC00608 | ENSG00000236445.4 | OTTHUMG00000154665.5 | Antisense | 2 | 218975393 | 218989940 | + |
| LINC01300 | ENSG00000253595.5 | OTTHUMG00000164482.1 | Antisense | 8 | 141340549 | 141344621 | + |
The correlations of the lncRNAs with patients' overall survival in HNSCC based on GSE65858 dataset using uni- and multi-variate Cox analysis.
| AC006213.1 | 0.02 | 0–0.25 | 0.002 | 0.14 | 0.01–2.15 | 0.16 |
| AC008655.1 | 0.33 | 0.18–0.63 | 0.001 | 0.47 | 0.23–0.94 | 0.033 |
| AC009093.1 | 26.05 | 2.81–241.84 | 0.004 | 9.1 | 0.45–182.4 | 0.149 |
| AC010624.1 | 0.01 | 0–0.23 | 0.003 | 0 | 0–0.1 | 0.001 |
| AC011447.3 | 0.53 | 0.24–1.17 | 0.117 | 0.45 | 0.16–1.26 | 0.129 |
| AC026954.3 | 1.55 | 0.98–2.47 | 0.063 | 1.32 | 0.73–2.41 | 0.363 |
| AC055758.2 | 0.04 | 0–0.74 | 0.031 | 0.02 | 0–0.67 | 0.029 |
| AC073957.2 | 14.28 | 1.07–190.83 | 0.044 | 9.94 | 0.54–184.8 | 0.123 |
| AC092368.3 | 5.29 | 1.93–14.52 | 0.001 | 3.2 | 0.82–12.55 | 0.095 |
| AC097376.2 | 0.04 | 0–0.46 | 0.01 | 0.36 | 0.02–6.28 | 0.482 |
| AC097478.1 | 0.71 | 0.51–1 | 0.049 | 0.89 | 0.57–1.37 | 0.589 |
| AC118553.1 | 0.02 | 0–0.22 | 0.001 | 0.03 | 0–0.4 | 0.008 |
| AC130456.4 | 5.02 | 1.3–19.41 | 0.019 | 22.51 | 4.02–126.25 | 0 |
| AC244034.2 | 0.17 | 0.04–0.65 | 0.01 | 0.99 | 0.19–4.99 | 0.986 |
| AL121839.2 | 5.2 | 0.68–39.61 | 0.111 | 10.86 | 0.92–127.89 | 0.058 |
| AL356157.1 | 12.25 | 2.18–68.67 | 0.004 | 2.51 | 0.2–32.17 | 0.479 |
| AL359649.1 | 30.21 | 2.05–445.86 | 0.013 | 7.09 | 0.19–260.04 | 0.286 |
| AL391069.2 | 0.03 | 0–0.36 | 0.005 | 0.07 | 0–1.06 | 0.055 |
| AL451065.1 | 10.35 | 1.03–103.75 | 0.047 | 2.74 | 0.12–62.27 | 0.527 |
| AL512625.3 | 11.94 | 1.26–113.15 | 0.031 | 8.72 | 0.57–133.69 | 0.12 |
| AL589863.1 | 0.01 | 0–0.25 | 0.005 | 0.09 | 0–2.54 | 0.16 |
| CERNA2 | 0.54 | 0.33–0.88 | 0.014 | 0.7 | 0.36–1.38 | 0.305 |
| CYTOR | 49.02 | 4.42–543.27 | 0.002 | 1.05 | 0.06–19.48 | 0.972 |
| FLJ13224 | 0.01 | 0–0.32 | 0.008 | 0.03 | 0–1.13 | 0.058 |
| LINC00346 | 21.89 | 2.24–213.51 | 0.008 | 8.91 | 0.41–196.03 | 0.165 |
| LINC00608 | 0.04 | 0–0.74 | 0.03 | 0.01 | 0–0.27 | 0.006 |
| LINC01300 | 9.26 | 1.67–51.37 | 0.011 | 39.02 | 3.79–401.34 | 0.002 |
| MIR99AHG | 0.19 | 0.05–0.69 | 0.012 | 0.23 | 0.05–0.97 | 0.046 |
P < 0.05,
P < 0.01,
P < 0.001.
Figure 3Nomogram plot for patients with Head and Neck Squamous Cell Carcinoma. (A) Overall survival analysis on signature score in the GSE65858 dataset. (B) The gene nomogram was developed with signature score, T category, M category and HPV status. (C) Calibration curves of the gene nomogram. The y-axis represents the actual overall survival rate. The x-axis represents the predicted overall survival rate. The gray diagonal represents a perfect prediction of an ideal model. Dark red solid lines indicate the performance of the nomogram, where closer to the diagonal dashed line indicates a better prediction. (D) Decision curve analysis for the gene signature nomogram and the model without Score. The y-axis measures the net benefit. The red solid line represents the gene signature nomogram. The blue dashed line represents model without Score. The green line represents the assumption that all patients have died. Thin black line represents the assumption that no patients have died.
Model discussion for 8 lncRNA HNSCC.
| HPV_status | 0.53 | 0.31–0.9 | 0.018 | 0.53 | 0.31–0.9 | 0.018 |
| M_category | 3.29 | 1.41–7.63 | 0.006 | 3.47 | 1.5–8.01 | 0.004 |
| N_category | 1.18 | 0.66–2.1 | 0.576 | 1.27 | 0.71–2.26 | 0.425 |
| T_category | 2.42 | 1.28–4.58 | 0.007 | 2.41 | 1.27–4.57 | 0.007 |
| Uicc_stage | 1.03 | 0.37–2.88 | 0.949 | 0.92 | 0.33–2.57 | 0.874 |
| Score | 0.28 | 0.17–0.45 | 0 | |||
| C-index | 0.68 (0.65–0.70) | 0.75 (0.73–0.78) | ||||
P < 0.05,
P < 0.01,
P < 0.001.
Figure 4Score correlated enrichment gene analysis with GSEA. (A) Hallmark IL6 JAK SATA3 SIGNALING (P = 0.004; FDR = 0.104; ES = 0.72); (B) HALLMARK COMPLEMENT (P = 0.002; FDR = 0.110; ES = 0.62); (C) HALLMARK ALLOGRAFT REJECTION (P = 0.031; FDR = 0.180; ES = 0.69).
Gene Set Enrichment Analysis (GSEA) group by score.
| HALLMARK_IL6_JAK_STAT3_SIGNALING | 66 | 0.72197646 | 1.6556728 | 0.004040404 | 0.104168214 | 0.065 | 1,383 | Tags = 42%, list = 11%, signal = 48% |
| HALLMARK_COMPLEMENT | 144 | 0.6246483 | 1.593254 | 0.002061856 | 0.1100425 | 0.138 | 1,045 | Tags = 34%, list = 8%, signal = 37% |
| HALLMARK_ALLOGRAFT_REJECTION | 146 | 0.6888915 | 1.5099503 | 0.031189084 | 0.18018493 | 0.287 | 1,987 | Tags = 51%, list = 16%, signal = 60% |
Figure 5Determination of soft-thresholding power in WGCNA. (A) Analysis of the scale-free fit index for various soft-thresholding powers (β) and analysis of the mean connectivity for various soft-thresholding powers. (B) Dendrogram of all differentially expressed genes clustered based on a dissimilarity measure. (C) Heatmap of the correlation between module eigengenes and clinical traits of HNSC. Scatter plot for correlation between gene module membership in the blue module and gene significance.
Figure 6LncRNA-mRNA network in Black module and functional enrichment. (A) The lncRNA-mRNA network in Black module, (B) GO-BP enrichment analysis (All genes in Black module), (C) GO-BP enrichment analysis (AC01 0624.1 related genes in Black module), and (D) GO-BP enrichment analysis (AC130456.4 related genes in Black module).
Figure 10LncRNA-mRNA network in Magenta module and functional enrichment. (A) The lncRNA-mRNA network in Magenta module, (B) GO-CC enrichment analysis (All genes in Magenta module), and (C) GO-CC enrichment analysis (AC118553.1 related genes in Magenta module).
Figure 11Overall survival analyses and Gene expression difference analysis on MIR99AHG in the TCGA data set. (A) Kaplan-Meier curves of overall survival according to expression of MIR99AHG in TCGA-HNSC database. (B) Differentially expressed MIR99AHG in HNSC and normal tissues (P < 0.05).