| Literature DB >> 33869232 |
Qiuxu Wang1,2, Weiwei Yang3, Wei Peng2, Xuemei Qian3, Minghui Zhang4, Tianzhen Wang3.
Abstract
Increasing evidence has demonstrated the crosstalk between DNA epigenetic alterations and aberrant expression of long non-coding RNAs (lncRNAs) during carcinogenesis. However, epigenetically dysregulated lncRNAs and their functional and clinical roles in Head and Neck Squamous Cell Carcinoma (HNSCC) are still not explored. In this study, we performed an integrative analysis of DNA methylation data and transcriptome data and identified a DNA methylation-dysregulated four-lncRNA signature (DNAMeFourLncSig) from 596 DNA methylation-dysregulated lncRNAs using a machine-learning-based feature selection method, which classified the patients of the discovery cohort into two risk groups with significantly different survival including overall survival, disease-specific survival, and progression-free survival. Then the DNAMeFourLncSig was implemented to another two HNSCC patient cohorts and showed similar prognostic values in both. Results from multivariable Cox regression analysis revealed that the DNAMeFourLncSig might be an independent prognostic factor. Furthermore, the DNAMeFourLncSig was substantially correlated with the complete response rate of chemotherapy and may predict chemotherapy response. Functional in silico analysis found that DNAMeFourLncSig-related mRNAs were mainly enriched in cell differentiation, tissue development and immune-related pathways. Overall, our study will improve our understanding of underlying transcriptional and epigenetic mechanisms in HNSCC carcinogenesis and provided a new potential biomarker for the prognosis of patients with HNSCC.Entities:
Keywords: DNA methylation; biomarker; head and neck squamous cell carcinoma; long-coding RNAs; signature
Year: 2021 PMID: 33869232 PMCID: PMC8047109 DOI: 10.3389/fcell.2021.666349
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1Unsupervised hierarchical clustering heatmap of 528 HNSCC tumor tissues and 50 normal tissues based on 780 differentially methylated sites.
Detailed information of four prognostic lncRNA biomarkers.
| Ensembl ID | Gene ID | Chromosomal location | HR | 95%CI | |
| ENSG00000230747 | AC021188.1 | Chr2: 96,307,263–96,321,731(−) | 0.71 | 0.57–0.89 | 0.003 |
| ENSG00000269427 | AC024075.3 | Chr 19: 16,630,743–16,643,942(+) | 0.76 | 0.62–0.93 | 0.0072 |
| ENSG00000233532 | LINC00460 | Chr 13: 106,374,477–106,384,315(+) | 1.26 | 1.07–1.48 | 0.0056 |
| ENSG00000265751 | AC015878.1 | Chr 18: 21,380,286–21,451,017(−) | 1.26 | 1.08–1.46 | 0.003 |
FIGURE 2Performance evaluation of the DNAMeFourLncSig in the discovery cohort. (A–C) Kaplan-Meier survival curves of survival between low-risk and high-risk groups. (D) ROC analysis of the DNAMeFourLncSig at 3- and 5-years. (E) The distribution of DNAMeFourLncSig risk score, survival status and expression heatmap of patients. (F) Boxplots for expression levels of four lncRNA biomarkers between low-risk and high-risk groups.
FIGURE 3Performance validation of the DNAMeFourLncSig. (A–C) Kaplan-Meier survival curves of survival between low-risk and high-risk groups in the validation cohort. (D) ROC analysis of the DNAMeFourLncSig at 3- and 5-years in the validation cohort. (E–G) Kaplan-Meier survival curves of survival between low-risk and high-risk groups in the TCGA cohort. (H) ROC analysis of the DNAMeFourLncSig at 3- and 5-years in the TCGA cohort.
Univariable and multivariable Cox regression analyses in each patient cohort.
| Variables | Univariate analysis | Multivariate analysis | |||||
| HR | 95%CI | HR | 95%CI | ||||
| DNAMeFourLncSig | High vs. Low | 2.45 | 1.69–3.56 | 2.3e−06 | 2.28 | 1.31–3.97 | 0.0038 |
| Age | >60 vs. =60 | 1.05 | 0.73–1.5 | 0.8 | 1.13 | 0.7–1.83 | 0.62 |
| Stage | III/IV vs. I/II | 1 | 0.63–1.57 | 0.99 | 1.07 | 0.58–1.95 | 0.84 |
| Grade | (III/IV vs. I/II) | 1.08 | 0.73–1.62 | 0.69 | 1.65 | 0.96–2.84 | 0.068 |
| Treatment response | CR vs. non-CR | 0.16 | 0.09–0.29 | 6.6e−10 | 0.25 | 0.13–0.47 | 2.4e−05 |
| DNAMeFourLncSig | High vs. Low | 2.06 | 1.35–3.16 | 0.00089 | 3.21 | 1.72–5.97 | 0.00024 |
| Age | >60 vs. =60 | 1.51 | 1–2.28 | 0.049 | 2.13 | 1.2–3.79 | 0.01 |
| Stage | III/IV vs. I/II | 1.41 | 0.88–2.26 | 0.15 | 1.74 | 0.87–3.5 | 0.12 |
| Grade | (III/IV vs. I/II) | 0.79 | 0.51–1.23 | 0.3 | 1.07 | 0.59–1.96 | 0.82 |
| Treatment response | CR vs. non-CR | 0.2 | 0.1–0.4 | 3.8e−06 | 0.27 | 0.14–0.53 | 0.00015 |
| DNAMeFourLncSig | High vs. Low | 2.22 | 1.68–2.94 | 2e–08 | 2.53 | 1.69–3.79 | 5.9e−06 |
| Age | >60 vs. =60 | 1.23 | 0.94–1.61 | 0.14 | 1.49 | 1.04–2.13 | 0.03 |
| Stage | III/IV vs. I/II | 1.22 | 0.88–1.69 | 0.23 | 1.43 | 0.91–2.25 | 0.12 |
| Grade | (III/IV vs. I/II) | 0.9 | 0.67–1.22 | 0.5 | 1.26 | 0.85–1.88 | 0.25 |
| Treatment response | CR vs. non-CR | 0.18 | 0.11–0.27 | 8e–15 | 0.24 | 0.15–0.37 | 5.9e−10 |
FIGURE 4Association of the DNAMeFourLncSig with treatment response. (A) Boxplots for DNAMeFourLncSig risk score in patients with and without complete response. (B) Correlation between DNAMeFourLncSig risk score and complete response rate. (C) Boxplots for the probability of patients with and without complete response in the low-risk and high-risk groups. Kaplan-Meier survival curves of survival between low-risk and high-risk groups for CR patients (D) and non-CR patients (E).
FIGURE 5Function in silico analysis of the DNAMeFourLncSig. (A) The network of enriched GO terms. (B) Enriched KEGG pathways.