Literature DB >> 32218699

Integrative analysis of DNA methylation-driven genes for the prognosis of lung squamous cell carcinoma using MethylMix.

Rui Li1, Yun-Hong Yin1, Jia Jin2, Xiao Liu1, Meng-Yu Zhang1, Yi-E Yang3, Yi-Qing Qu1.   

Abstract

Background: DNA methylation acts as a key component in epigenetic modifications of genomic function and functions as disease-specific prognostic biomarkers for lung squamous cell carcinoma (LUSC). This present study aimed to identify methylation-driven genes as prognostic biomarkers for LUSC using bioinformatics analysis. Materials and
Methods: Differentially expressed RNAs were obtained using the edge R package from 502 LUSC tissues and 49 adjacent non-LUSC tissues. Differentially methylated genes were obtained using the limma R package from 504 LUSC tissues and 69 adjacent non-LUSC tissues. The methylation-driven genes were obtained using the MethylMix R package from 500 LUSC tissues with matched DNA methylation data and gene expression data and 69 non-LUSC tissues with DNA methylation data. Gene ontology and ConsensusPathDB pathway analysis were performed to analyze the functional enrichment of methylation-driven genes. Univariate and multivariate Cox regression analyses were performed to identify the independent effect of differentially methylated genes for predicting the prognosis of LUSC.
Results: A total of 44 methylation-driven genes were obtained. Univariate and multivariate Cox regression analyses showed that twelve aberrant methylated genes (ATP6V0CP3, AGGF1P3, RP11-264L1.4, HIST1H4K, LINC01158, CH17-140K24.1, CTC-523E23.14, ADCYAP1, COX11P1, TRIM58, FOXD4L6, CBLN1) were entered into a Cox predictive model associated with overall survival in LUSC patients. Methylation and gene expression combined survival analysis showed that the survival rate of hypermethylation and low-expression of DQX1 and WDR61 were low. The expression of DQX1 had a significantly negatively correlated with the methylation site cg02034222.
Conclusion: Methylation-driven genes DQX1 and WDR61 might be potential biomarkers for predicting the prognosis of LUSC. © The author(s).

Entities:  

Keywords:  A Cox predictive model; Biomarkers; Lung squamous cell carcinoma; Methylation-driven genes; Overall survival

Mesh:

Substances:

Year:  2020        PMID: 32218699      PMCID: PMC7085273          DOI: 10.7150/ijms.43272

Source DB:  PubMed          Journal:  Int J Med Sci        ISSN: 1449-1907            Impact factor:   3.738


Introduction

Lung cancer is the leading cause of cancer related deaths worldwide 1. Lung squamous cell carcinoma (LUSC) accounts for about 30% of all lung cancers with the highest mortality in the world 2. LUSC accounts for approximately more than 400,000 deaths worldwide each year 3. The five year survival rate of lung squamous cell carcinoma is less than 15% 4. Due to the limitation of treatment and poor survival rate for lung squamous cell carcinoma 5, it is imperative that we explore specific diagnostic and prognostic biomarkers for LUSC. In the present study, we aimed to identify novel specific diagnostic and prognostic biomarkers for predicting survival in LUSC. Genetic aberrant expression is important for the etiology of human cancer and the combined effect of genetic and epigenetic changes contribute to the progress of human cancer 6-10. DNA methylation is one of the most important elements in epigenetic modifications and participates in the regulation of cellular functions and carcinogenesis 11. Epigenetic modification, especially DNA methylation, plays a significant role in predicting the prognosis of lung cancer 12-15. For instance, the identification of eight DNA methylation biomarkers using high-throughput DNA methylation analysis can predict the prognosis of lung squamous cell carcinoma 16. Aberrant ANK1 methylation contributes to miR-486-5p repression and discriminates lung tumors by histology and smoking status 17. Pharmacologic inhibition of DNA methylation combined with gene expression reveals new specific diagnostic and prognostic biomarkers in lung squamous cell carcinoma 18. MethylMix is an R package used for identifying disease-specific hyper- and hypo-methylated genes 19. More precisely, MethylMix includes two major characteristics: automatic download of DNA methylation and gene expression data sets from TCGA and automated pre-processing of such data sets: value interpolation, batch correction, and CpG positions within each gene point clustering. Currently, few studies based on using MethylMix R package to identify specific methylation-driven genes have been reported 20-22. Recently, a study based on MethylMix reveals potential prognostic methylation-driven genes for predicting the prognosis in lung adenocarcinoma (LUAD) has been reported 23. In the present study, we extracted the DNA methylation and RNA-Seq data using bioinformatics methods from The Cancer Genome Atlas (TCGA) database, and then the MethylMix R package was performed to obtain methylation-driven genes. Furthermore, a Cox predictive model was established to predict the diagnosis and prognosis of LUSC. Eventually, the joint of methylation and gene expression combined survival analysis was performed to reveal potential specific methylation-driven genes for predicting the prognosis of LUSC.

Materials and Methods

Data extraction and analysis

RNA-Seq data and methylation data were downloaded from the Cancer Genome Atlas (TCGA) database. The methylation data consists of 504 LUSC samples and 69 normal samples from the Illumina Infinium Human Methylation 450k platform. The RNA-Seq data (level 3) incorporating lncRNA and mRNA expression was obtained from 502 LUSC samples and 49 normal samples from the IlluminaHiSeq_RNASeq platform. First, based on the limma R package, we retrieved the aberrant methylated genes with the screening criteria absolute fold change (log2) > 0 and adjusting the false discovery rate (FDR) to a P value < 0.05. Then, we obtained the differentially expressed lncRNA and mRNA using the edge R package in R software with the absolute fold change (log2) > 0 and adjusted the FDR to a P value < 0.05. Next, we used the MethylMix R package with the screening criteria (|logFC|>0, P<0.05, Cor<-0.3) to extract the methylation-driven genes. Analysis of average methylation differences at 3000 bp (base pair) sites upstream of genes to identify differential methylation levels in gene promoters 23-25. The differential level of methylation in the promoter of genes was performed by using the limma R package 26. Eventually, we identified methylation-driven genes and aberrant methylated genes to establish a β-mixture model. The data was directly from the TCGA database. No approval was required from the local ethics committee.

Enrichment analysis of methylation-driven genes in LUSC

We used the Database for Annotation, Visualization and Integrated Discovery (DAVID) (http://david.abcc.ncifcrf.gov/) database to analyze the biological function in methylation-driven genes by using Gene ontology and ConsensusPathDB pathway analysis. In the GO analysis, a P value was less than 0.05 was considered as statistically significance. We used the GOCircle and GOChord plotting to explore the relationship between the methylation-driven genes and their biological function. ConsensusPathDB (http://cpdb.molgen.mpg.de/) is an online software incorporating gene regulatory, drug-target interactions and binary complex signaling. In the ConsensusPathDB pathway analysis, P < 0.05 was considered as statistically significance.

Establishment of predictive model of differentially methylated genes based on DNA methylation in LUSC

First, we obtained the expression of differentially methylated genes and retrieved the survival time and survival status of 366 LUSC patients, then used the survival.pl script to extract the data that combined the expression of differentially methylated genes and survival data. Next, we used the univariate R package to obtain the result of univariate Cox regression. We selected 92 aberrant methylated genes to submit to multivariate Cox regression analysis with the screening criteria P < 0.05. Based on the median risk score, LUSC patients were divided into two groups, incorporating high-risk groups and low-risk groups. To test the influence on differentially methylated genes signature (high risk vs low risk) on overall survival, we performed the receiver operating characteristic (ROC) curves to calculate the area under the curve (AUC) to reveal prognostic specific biomarkers for predicting survival in LUSC.

Univariate and multivariate Cox regression independent prognosis analysis of patients of LUSC

In order to further explore the twelve gene signature that can be used as independent prognosis factor, we performed the univariate and multivariate Cox regression independent prognosis analysis of LUSC patients. We extracted 276 LUSC patients with complete clinical information and combined the twelve genes signature risk score of 276 LUSC patients and the expression data of twelve genes of 276 LUSC patients to perform the univariate and multivariate Cox regression independent prognosis analysis. P < 0.05 was considered as statistically significant.

Methylation and gene expression combined survival analysis in LUSC

We used the methylation and gene expression combined survival analysis to further explore the effect of methylation-driven genes on patient prognosis in terms of expression and methylation levels. We performed the joint of methylation and gene expression combined survival analysis to identify potential methylation-driven genes for predicting the prognosis of LUSC patients. Therefore, we performed the Kaplan-Meier curve analysis. P < 0.05 was considered as statistically significant.

Correlation analysis between site methylation and gene expression in LUSC

In order to further explore the relationship between the methylation-driven genes expression and their methylation sites, we used the R package and Perl Package to perform the correlation analysis. First, site methylation data for methylation-driven genes associated with overall survival extracted from the TCGA database using the Perl package, then, we merged site methylation with gene expression data. Finally, we used the R package to figure out the correlation between site methylation and methylation-driven genes expression.

Survival analysis of methylation site in LUSC patients

In order to further explore the survival rate of methylation site cg02034222 in DQX1 in LUSC patients, we performed the Kaplan-Meier curve analysis of methylation site cg02034222 in DQX1 in LUSC patients. P value less than 0.05 was considered as statistically significant.

Results

Identification of methylation-driven genes in LUSC

A total of 44 methylation-driven genes were identified to be connected with DNA methylation using the MethylMix R package. The methylation-driven genes incorporating 42 methylation-driven mRNAs and two methylation-driven lncRNAs were shown in Table 1. Figure 1B and 1D shows methylation-driven gene ATL3 and DQX1 have significant negative correlation in methylation and gene expression level. The distribution of the methylation degree shows that ATL3 is hyper-methylated in LUSC patients and hypo-methylated in normal patients (Figure 1C). The distribution of the methylation degree shows that DQX1 is hyper-methylated in non-LUSC patients and hypo-methylated in LUSC patients (Figure 1E). A flow diagram of methylation-driven genes is shown in Figure 1A. A heat-map of methylation driven genes mRNAs and lncRNAs is shown in Figure 2.
Table 1

Methylation-driven mRNAs and lncRNAs in LUSC

GeneNormal MeanTumor MeanlogFCP-ValueAdjusted-PcorCor P-Value
ATL30.172487990.196935930.191232.67E-050.001522-0.57428853.28E-45
EID30.119479060.222831560.8991961.50E-068.54E-05-0.5579972122.91E-42
HSPA1A0.176816540.17435814-0.02020.0004980.028362-0.5467603662.54E-40
KRT70.327036550.402709310.3002875.94E-083.39E-06-0.5357063291.75E-38
ALG1L0.264238980.14912111-0.825362.11E-201.20E-18-0.5228263992.01E-36
HOXB20.193798990.329467710.7655762.62E-071.49E-05-0.496423961.81E-32
ZNF7010.15113660.216218340.5166362.00E-191.14E-17-0.4953793612.55E-32
SMIM100.113042370.202234320.8391641.07E-050.000613-0.4539291248.73E-27
DQX10.54660670.41245817-0.406261.12E-176.41E-16-0.4497923032.84E-26
ZNF3820.090302490.205445861.185921.78E-151.01E-13-0.4491223523.43E-26
MKRN30.599706210.46968784-0.352554.32E-122.46E-10-0.4401573154.15E-25
RNF410.30898540.37203670.2679063.83E-082.18E-06-0.4261988231.75E-23
TBX180.120783270.179278410.5697811.55E-078.83E-06-0.4139423684.04E-22
MRPL490.44024780.531504920.2717671.10E-076.27E-06-0.4094742471.23E-21
HIGD1A0.284095620.356684440.3282724.55E-102.59E-08-0.4079027931.81E-21
ZNF4180.149608370.275389190.8802812.27E-201.29E-18-0.4031010055.86E-21
EWSR10.238558470.19310873-0.304931.42E-078.07E-06-0.4021884967.30E-21
GOLGA40.230638980.20194077-0.19170.0001720.009832-0.3963815172.93E-20
ZSCAN180.247751910.290988150.2320645.04E-082.87E-06-0.396029963.18E-20
ME30.245756260.315866150.3620858.19E-124.67E-10-0.3902983491.22E-19
TOR4A0.434811320.503423090.2113821.19E-126.78E-11-0.3860939973.20E-19
ZNF3350.175444740.13484788-0.379682.95E-050.001681-0.3858387763.39E-19
GON4L0.248339520.18937769-0.391051.24E-137.09E-12-0.3788848951.63E-18
GPN10.448409680.531718480.2458450.0003660.020889-0.36883741.47E-17
NDST10.372442270.393756270.0802863.03E-050.001725-0.3678887441.80E-17
LYRM10.269403250.331398270.2987991.71E-109.73E-09-0.3636196954.48E-17
ZNF8800.109551730.164122830.5831643.15E-081.80E-06-0.3585517321.29E-16
HNRNPU0.181856540.14481692-0.328573.32E-081.89E-06-0.3559477752.22E-16
PKP10.337300170.28220385-0.25731.19E-096.79E-08-0.3555191192.42E-16
CLDN80.597297070.5081929-0.233071.40E-077.95E-06-0.3530710853.99E-16
GPN30.440537660.541890410.2987361.93E-101.10E-08-0.3510396056.03E-16
STX120.298646430.380698390.350211.20E-106.82E-09-0.3502778077.03E-16
MAFG0.301949950.26400157-0.193764.30E-060.000245-0.3473747611.26E-15
BIRC60.180053620.14959829-0.267330.0003590.020475-0.3434418912.74E-15
NSMCE20.253834240.286607130.1751880.000310.017663-0.3422864493.44E-15
IKZF50.279246720.324748610.2177836.03E-050.00344-0.3365526521.05E-14
TIMM10B0.418010390.517475190.3079518.21E-104.68E-08-0.330368223.38E-14
SLC35A50.305000290.367143830.2675352.83E-081.61E-06-0.3268276496.55E-14
WDR610.255397930.294896360.2074611.50E-068.55E-05-0.3264788416.98E-14
TCAIM0.237099620.294666130.3135881.15E-106.54E-09-0.3234595421.22E-13
RNF50.391358380.483351060.3045811.75E-099.96E-08-0.312503458.67E-13
UBE2D30.291645470.345579350.2448012.00E-060.000114-0.3090877281.57E-12
HOXA10-AS0.149207170.242125650.6984392.43E-139.74E-13-0.3432455432.85E-15
LINC008570.286417270.349630670.2877130.00460.018401-0.5122632668.45E-35
Figure 1

Identification of methylation and gene expression most relevant genes between LUSC and normal tissue. A The flow diagram of identification of methylation driven genes in LUSC. B, D The correlation between methylation and gene expression in methylation driven genes. C, E The methylation degree of the methylation-driven genes between LUSC and normal tissue. The red curve indicates the methylation degree from the LUSC tissue, the green curve indicates the methylation degree from the normal tissue. The black line above the figure is the distribution of methylation level in normal tissue.

Figure 2

Heat map of methylation-driven genes between LUSC and normal tissue. Red represents highly methylated genes and green represents low methylated genes between LUSC and normal tissue.

Functional enrichment analysis of methylation-driven genes in LUSC

Functional enrichment analysis shows that eight GO terms (transcription, DNA-templated; transcription factor activity, sequence-specific DNA binding; RNA polymerase II complex import to nucleus; RNA polymerase III complex localization to nucleus; regulation of transcription, DNA-templated; Smc5-Smc6 complex; ligase activity; histone H3-K4 tri-methylation) with statistical significance (P < 0.05). The highest GO term was biological process “GO0006351 transcription, DNA templated” (Figure 3A and 3C). Figure 3B shows the all methylation-driven mRNAs with their related eight GO terms. Figure 4 shows that 13 pathways (BARD1 signaling events; Protein processing in endoplasmic reticulum - Homo sapiens (human); E3 ubiquitin ligases ubiquitinate target proteins; SUMOylation of DNA damage response and repair proteins; Protein ubiquitination; Apoptosis Modulation and Signaling; SUMO E3 ligases SUMOylate target proteins; SUMOylation; Keratinization; Spliceosome - Homo sapiens (human); Ubiquitin mediated proteolysis - Homo sapiens (human); NRF2 pathway; EMT transition in Colorectal Cancer) were considered statistically significant (P < 0.05). As we can see from the Figure 4, the methylation-driven genes were most enriched in BARD1 signaling events, Protein processing in endoplasmic reticulum - Homo sapiens (human) and E3 ubiquitin ligases ubiquitinate target proteins (P<0.01).
Figure 3

Functional enrichment analysis of methylation-driven genes in LUSC. A The outer circle represents the expression of methylation driven mRNAs in each enriched GO terms: red dots which were on each GO terms indicated the up-regulated methylation driven mRNAs, the inner circle indicates the significance of GO terms (log10-adjusted P values). Blue dots indicate the down-regulated methylation driven mRNAs. B The circle represents the correlation between 42 methylation driven mRNAs and their GO terms. C The distribution of methylation driven mRNAs in significant GO terms.

Figure 4

ConsensusPathDB pathway analysis of methylation-driven genes in LUSC. The red circles indicate the number of methylation driven genes on each pathway. The line between the two red circles indicates the ratio of methylation driven genes present in the common genes of the two pathways.

The pathway analysis is shown in Table 2.
Table 2

Pathway analysis

PathwaycountP-valueq-value
BARD1 signaling events20.001660.05648
Protein processing in endoplasmic reticulum - Homo sapiens (human)30.004830.07633
E3 ubiquitin ligases ubiquitinate target proteins20.006740.07633
SUMOylation of DNA damage response and repair proteins20.010980.08233
Protein ubiquitination20.012110.08233
Apoptosis Modulation and Signaling20.015470.08766
SUMO E3 ligases SUMOylate target proteins20.025990.10038
SUMOylation20.028440.10038
Keratinization20.029280.10038
Spliceosome - Homo sapiens (human)20.031860.10038
Ubiquitin mediated proteolysis - Homo sapiens (human)20.033180.10038
NRF2 pathway20.035430.10038
EMT transition in Colorectal Cancer20.043990.11506

Establishment of predictive model of 12 differentially methylated genes associated with overall survival in LUSC

Univariate Cox regression analysis was first used to identify differentially methylated genes associated with overall survival in LUSC; incorporating 92 differentially methylated genes were selected to submit to perform multivariate Cox regression analysis. (P < 0.05) Multivariate Cox regression analysis shows that 12 differentially methylated genes were finally selected to establish a predictive model. The linear combination of the expression of 12 aberrant methylated genes was performed to establish the predictive model. The relative coefficients weighted in the multivariate Cox regression are as follows: survival risk score = ((-1.7055) × expression value of ATP6V0CP3 + (-2.4550) × expression value of AGGF1P3 + (-2.0585) × expression value of RP11-264L1.4 + 3.1730 × expression value of HIST1H4K + 4.3379 × expression value of LINC01158 + 1.2379 × expression value of CH17-140K24.1 + (-2.3096) × expression value of CTC-523E23.14 + (-3.5519) × expression value of ADCYAP1 + (-1.1394) × expression value of COX11P1 + (-2.0323) ×expression value of TRIM58 + (-1.8202) × expression value of FOXD4L6 + 3.9318 × expression value of CBLN1). The multivariate Cox regression analysis is shown in Table 3.
Table 3

Multivariate Cox regression analysis of 12 genes associated with overall survival in LUSC patients

coefexp(coef)se(coef)zp
ATP6V0CP3-1.70550.18171.06-1.610.108
AGGF1P3-2.4550.08591.586-1.550.122
RP11-264L1.4-2.05850.12761.034-1.990.046
HIST1H4K3.17323.87911.19642.650.008
LINC011584.337976.54812.10682.060.039
CH17-140K24.11.23793.44820.82871.490.135
CTC-523E23.14-2.30960.09931.1061-2.090.037
ADCYAP1-3.55190.02871.6107-2.210.027
COX11P1-1.13940.320.7318-1.560.119
TRIM58-2.03230.1310.9696-2.10.036
FOXD4L6-1.82020.1620.8431-2.160.031
CBLN13.931851.00051.90692.060.039

ROC curve analysis and risk groupings

The heat-map shows that 12 differentially methylated genes (ATP6V0CP3, AGGF1P3, RP11-264L1.4, HIST1H4K, LINC01158, CH17-140K24.1, CTC-523E23.14, ADCYAP1, COX11P1, TRIM58, FOXD4L6, CBLN1) were divided into two groups based on the median risk scores (Figure 5A). A total of 366 patients with complete survival information were divided into a high-risk group (n=183) and a low-risk group (n=183). The Kaplan-Meier curve with a Log-rank statistical examination was used to perform survival analysis (Figure 5B). As shown in Figure 5B, patients in the high-risk group had significantly poor survival rate than in the low-risk group (P =1e-05). Receiver operating characteristic (ROC) curve was performed to identify the effect of 12 aberrant methylated genes signature associated with overall survival in LUSC (Figure 5C). The graph of the risk score between high risk group and low risk group in the Cox model is shown in Figure 5D.
Figure 5

Prognostic value of 12-differentially methylated genes based on DNA methylation in LUSC. A Risk heat-map predictive model established from 12 differentially methylated genes from 366 LUSC patients. B Kaplan-Meier curve analysis of 12 differentially methylated genes for overall survival of LUSC patients. C ROC curve analysis of prognostic 12 differentially methylated genes signature. D The riskscore in the Cox model between high risk group and low risk group of LUSC.

Univariate and multivariate Cox regression analysis for the various DNA methylation classifiers of patients with LUSC

In order to further verify the twelve gene signature is an independent prognosis factor, we used the univariate and multivariate Cox regression analysis. We extracted the complete LUSC clinical features with 276 samples and then combined the twelve gene expression data and the prognostic risk score with the LUSC clinical features, including LUSC age, gender, pathology stage, pathology T stage, pathology M stage, pathology N stage to perform the univariate and multivariate Cox regression analysis. The forest plot of univariate and multivariate Cox regression analysis was showed in Figure 6. Univariate Cox regression analysis showed that pathology stage, pathology T stage and twelve gene signature risk score can act as independent prognostic factor (Figure 6A). Multivariate Cox regression analysis showed that twelve gene signature is an independent prognosis factor (Figure 6B) (Table 4).
Figure 6

The forest plot of univariate and multivariate Cox regression analysis of the various DNA methylation classifiers of LUSC patients. A The forest plot of univariate Cox regression independent prognosis analysis of LUSC patients. B The forest plot of multivariate Cox regression independent prognosis analysis of LUSC patients. The red mark in the plot indicates that this clinical feature is a high risk factor. The green mark in the plot indicates that this clinical feature is a low risk factor.

Table 4

Univariate and multivariate Cox regression analysis of LUSC clinical characteristics based on predictive model

VariablesUnivariate analysisMultivariate analysis
Hazards ratio (95% CI)P valueHazards ratio (95% CI)P value
Age1.018 (0.994-1.043)0.1361.022 (0.996-1.049)0.099
Gender0.971 (0.617-1.529)0.8990.957 (0.603-1.520)0.854
Stage1.381 (1.091-1.747)0.0071.198 (0.644-2.231)0.568
T1.317 (1.033-1.679)0.0261.080 (0.707-1.650)0.722
M2.465 (0.605-10.044)0.2082.187 (0.318-15.043)0.427
N1.254 (0.947-1.661)0.1131.103 (0.629-1.933)0.732
riskScore2.117 (1.738-2.578)<0.0012.063 (1.692-2.517)<0.001
The combined survival analysis revealed that the joint of methylation and gene expression of the genes (DQX1, WDR61) had significant correlation with the prognosis of LUSC patients (Figure 7A and 7B). As shown in Figure 6, the hypermethylation and low-expression survival rate of DQX1 and WDR61 were low. The joint of methylation and gene expression survival analysis shows that the DNA methylation and gene expression of DQX1 and WDR61 were associated with overall survival in LUSC patients (P < 0.05). We used the median to define the DNA methylation cut-off to classify a sample as hypermethylated or hypomethylated about the survival rate based on DQX1 and WDR61 data.
Figure 7

Survival analysis and correlation analysis of methylation and gene expression and survival analysis of methylation site of LUSC. A, B Kaplan-Meier curve analysis of the joint of DNA methylation and gene expression in LUSC patients. C The correlation analysis between the expression of DQX1 and cg02034222 methylation in LUSC patients. D Kaplan-Meier curve analysis of methylation site cg02034222 in DQX1 in LUSC patients.

Correlation analysis between gene expression and methylation sites

To figure out the relationship between the DNA methylation and gene expression level in DQX1 and WDR61, we used the Perl package to obtain the methylation sites in DQX1 and WDR61, the expression of DQX1 had 8 methylation sites in TCGA database, while only one methylation site had significant correlation between expression and DNA methylation level. The expression of WDR61 had two methylation sites in TCGA database, while no methylation site had significant correlation with the expression of WDR61. As is shown in Figure 7C, the expression of DQX1 had a significant correlation (Cor = -0.725) with cg02034222 methylation (P = 1.26e-74).

Survival analysis of methylation site cg02034222 in DQX1 in LUSC patients

In order to further figure out whether the one methylation site (cg02034222) in DQX1 is responsible of the survival rate of LUSC patients, we performed survival analysis of methylation site (cg02034222) in DQX1 in hypermethylation and hypomethylation LUSC patients (Figure 7D). The one CpG site cg02034222 in DQX1 in hypermethylation LUSC patients had a poor survival rate than the one CpG site cg02034222 in DQX1 in hypomethylation LUSC patients (P = 0.021).

Discussion

In recent years, with the increasing numbers of advanced diagnosis and poor prognosis in lung squamous cell carcinoma, it is crucial to explore more effective diagnostic and prognostic biomarkers for predicting survival in LUSC. Genetic and epigenetic changes facilitate the progression of LUSC. DNA methylation and RNA-Seq data analysis provide a novel perspective to reveal disease specific diagnostic and prognostic biomarkers in human lung cancer 27,28. The rapid development of RNA-Seq analysis technologies provides a novel perspective to explore the molecular characteristic and pathogenesis of LUSC and provides significant evidence for predicting the prognosis of LUSC. Emerging evidences shows that the studies on the molecular mechanism of LUSC and the prognostic biomarkers of the LUSC associated with methylation driven genes is still lacking. In our study, we used the MethylMix R package to identify methylation driven genes and a cox predictive model was established to predict the prognosis of LUSC and the joint of methylation and gene expression combined survival analysis reveals potential prognostic biomarkers for predicting the prognosis of LUSC. Epigenetics modification, especially DNA methylation, participates in the pathogenesis of LUSC. Accumulating evidences have demonstrated that DNA methylation acts as the major molecular mechanism of epigenetic modification was associated with the human malignant cancer, incorporating lung cancer 29-31. The joint of gene and DNA methylation using bioinformatics analysis revealing new diagnostic and prognostic biomarkers for predicting the prognosis of cancer 32-34. Therefore, it is pivotal to identify disease-specific prognostics biomarkers to determine the exploration of the molecular mechanism of LUSC. The methylation of L1RE1, RARB, and RASSF1 acts as potential disease specific biomarkers for predicting the prognosis of lung cancer 35. TRIM58/cg26157385 methylation associated with eight genes including A2ML1, CCNE1, COBL, ESCO2, GPR115, MMP10, OVOL1 and SCGB1A1 in lung squamous cell carcinoma 36. The prognostic value of HOXA9 promoter methylation was associated with lung cancer and can become a new diagnostic and prognostic biomarker for predicting the prognosis of stage I lung adenocarcinoma 37. Therefore, bioinformatics analysis of DNA methylation and gene expression provide a significant horizon for identifying disease specific diagnostic and prognostic biomarkers in lung squamous cell carcinoma. Functional enrichment analysis of methylation-driven mRNA reveals the methylation-driven genes might be mostly associated with the biologic function of transcription, DNA template; transcription factor activity; RNA polymerase II complex import to nucleus (P < 0.05). The pathway analysis shows that the methylation-driven genes were associated with 13 pathways (P < 0.05); the most enriched pathway was the Protein processing in endoplasmic reticulum-Homo sapiens (human). In our study, univariate and multivariate Cox regression analysis shows that ATP6V0CP3, AGGF1P3, RP11-264L1.4, HIST1H4K, LINC01158, CH17-140K24.1, CTC-523E23.14, ADCYAP1, COX11P1, TRIM58, FOXD4L6, CBLN1 were associated with overall survival and could establish a survival predictive model to predict the prognosis of LUSC. The 12 differentially methylated genes could also act as an independent prognostic factor for predicting the prognosis of LUSC. HIST1H4K can be used as prognostic factors for predicting the prognosis of cervical cancer patients 38. HIST1H4K can be used as DNA methylation biomarkers for predicting the prognosis of prostate cancer 39. ADCYAP1 can act as prognostic biomarkers to predict risk of endometrial cancer 40. Recently a study based on TCGA database reveal ADCYAP1 can act as prognostic biomarkers for predicting the prognosis of LUSC 4. ADCYAP1 can act as novel biomarkers for predicting the prognosis of cervical precancer and cancer 41. TRIM58/cg26157385 methylation may play a significant role for predicting the prognosis of LUSC 36. CBLN1 can be used as marker genes for predicting the prognosis of the Ventromedial Hypothalamic Nucleus 42. In our study, the joint of methylation and gene expression combined survival analysis shows that the survival rate of hypermethylation and low-expression of DQX1 was significant lower than the survival rate of hypomethylation and high-expression of DQX1. Epigenome-wide association studies (EWASs) suggested a novel association between T2D and cg06721411(DQX1; P value=1.18×10(-9)) 43. The survival rate of hypomethylation and high-expression of WDR61 was significant higher than the survival rate of hypermethylation and low-expression of WDR61. Furthermore, we performed the correlation analysis between DNA methylation sites and gene expression. The expression of DQX1 was significantly negatively associated with the methylation sites cg02034222 (P value= 1.26e-74), which might provide a significant horizon to explore prognostic biomarkers for predicting the diagnosis and prognosis of LUSC. Compared with previous studies4, our study first obtained aberrant methylated genes using the limma R package, obtained differentially expressed genes using the edge R package and then we filtered the missing values and low expression genes and intersected gene expression data with DNA methylation data to input MethylMix R package to identify methylation driven genes, which may provide a novel perspective to reveal disease-specific prognostic biomarkers in LUSC and may play a significant role in predicting the diagnosis and prognosis of LUSC.

Conclusions

In our study, we aimed to identify methylation-driven genes using MethylMix between LUSC patients and normal samples from the TCGA database for predicting the prognosis of lung squamous cell carcinoma. Univariate and multivariate Cox regression analysis showed that the survival predictive model established by twelve differentially methylated genes (ATP6V0CP3, AGGF1P3, RP11-264L1.4, HIST1H4K, LINC01158, CH17-140K24.1, CTC-523E23.14, ADCYAP1, COX11P1, TRIM58, FOXD4L6, CBLN1) can act as independent prognostic factor for predicting the prognosis of LUSC. In addition, the DNA methylation and gene expression levels of DQX1 and WDR61 are significantly associated with overall survival and the expression of DQX1 has a significantly negatively correlated with the methylation site cg02034222. Our study may provide a novel perspective for predicting the prognosis of LUSC.
  43 in total

Review 1.  The promise of epigenetic therapy: reprogramming the cancer epigenome.

Authors:  Andrew D Kelly; Jean-Pierre J Issa
Journal:  Curr Opin Genet Dev       Date:  2017-04-14       Impact factor: 5.578

2.  An Integrated Prognostic Classifier for Stage I Lung Adenocarcinoma Based on mRNA, microRNA, and DNA Methylation Biomarkers.

Authors:  Ana I Robles; Eri Arai; Ewy A Mathé; Hirokazu Okayama; Aaron J Schetter; Derek Brown; David Petersen; Elise D Bowman; Rintaro Noro; Judith A Welsh; Daniel C Edelman; Holly S Stevenson; Yonghong Wang; Naoto Tsuchiya; Takashi Kohno; Vidar Skaug; Steen Mollerup; Aage Haugen; Paul S Meltzer; Jun Yokota; Yae Kanai; Curtis C Harris
Journal:  J Thorac Oncol       Date:  2015-07       Impact factor: 15.609

3.  Validation of a clinical assay of multi-locus DNA methylation for prognosis of newly diagnosed AML.

Authors:  Courtney D Dinardo; Marlise R Luskin; Martin Carroll; Catherine Smith; Jenna Harrison; Sherry Pierce; Steve Kornblau; Marina Konopleva; Tapan Kadia; Hagop Kantarjian; Gerald B Wertheim; Stephen R Master
Journal:  Am J Hematol       Date:  2017-02       Impact factor: 10.047

4.  DNA methylation biomarkers of prostate cancer: confirmation of candidates and evidence urine is the most sensitive body fluid for non-invasive detection.

Authors:  Shannon R Payne; Jurgen Serth; Martin Schostak; Jorn Kamradt; Arne Strauss; Paul Thelen; Fabian Model; J Kevin Day; Volker Liebenberg; Andrew Morotti; Su Yamamura; Joe Lograsso; Andrew Sledziewski; Axel Semjonow
Journal:  Prostate       Date:  2009-09-01       Impact factor: 4.104

Review 5.  Novel treatment opportunities for sulfur mustard-related cancers: genetic and epigenetic perspectives.

Authors:  Soheila Rahmani; Mohammad Abdollahi
Journal:  Arch Toxicol       Date:  2017-10-10       Impact factor: 5.153

6.  ANK1 Methylation regulates expression of MicroRNA-486-5p and discriminates lung tumors by histology and smoking status.

Authors:  Mathewos Tessema; Christin M Yingling; Maria A Picchi; Guodong Wu; Tyrone Ryba; Yong Lin; Aaron O Bungum; Eric S Edell; Avrum Spira; Steven A Belinsky
Journal:  Cancer Lett       Date:  2017-09-29       Impact factor: 8.679

7.  Analysis of DNA methylation in endometrial biopsies to predict risk of endometrial cancer.

Authors:  Francesco Multinu; Jun Chen; Joseph D Madison; Michelle Torres; Jvan Casarin; Daniel Visscher; Viji Shridhar; Jamie Bakkum-Gamez; Mark Sherman; Nicolas Wentzensen; Andrea Mariani; Marina Walther-Antonio
Journal:  Gynecol Oncol       Date:  2020-01-03       Impact factor: 5.482

8.  Comparative transcriptomes of adenocarcinomas and squamous cell carcinomas reveal molecular similarities that span classical anatomic boundaries.

Authors:  Eric W Lin; Tatiana A Karakasheva; Dong-Jin Lee; Ju-Seog Lee; Qi Long; Adam J Bass; Kwok K Wong; Anil K Rustgi
Journal:  PLoS Genet       Date:  2017-08-07       Impact factor: 5.917

9.  DNA methylation biomarkers offer improved diagnostic efficiency in lung cancer.

Authors:  Georgios Nikolaidis; Olaide Y Raji; Soultana Markopoulou; John R Gosney; Julie Bryan; Chris Warburton; Martin Walshaw; John Sheard; John K Field; Triantafillos Liloglou
Journal:  Cancer Res       Date:  2012-09-07       Impact factor: 12.701

10.  Integrative analysis of the epigenetic basis of muscle-invasive urothelial carcinoma.

Authors:  Thomas Sanford; Maxwell V Meng; Reema Railkar; Piyush K Agarwal; Sima P Porten
Journal:  Clin Epigenetics       Date:  2018-02-12       Impact factor: 6.551

View more
  6 in total

1.  Pan-Cancer Methylated Dysregulation of Long Non-coding RNAs Reveals Epigenetic Biomarkers.

Authors:  Ning Zhao; Maozu Guo; Chunlong Zhang; Chunyu Wang; Kuanquan Wang
Journal:  Front Cell Dev Biol       Date:  2022-05-27

2.  Potential Genes Associated with the Survival of Lung Adenocarcinoma Were Identified by Methylation.

Authors:  Ziyuan Shen; Chenlu He; Haimiao Chen; Lishun Xiao; Yingliang Jin; Shuiping Huang
Journal:  Comput Math Methods Med       Date:  2020-11-18       Impact factor: 2.238

3.  Integral Analysis of the RNA Binding Protein-associated Prognostic Model for Renal Cell Carcinoma.

Authors:  Xin Qin; Zhengfang Liu; Keqiang Yan; Zhiqing Fang; Yidong Fan
Journal:  Int J Med Sci       Date:  2021-01-01       Impact factor: 3.738

4.  Specific Lung Squamous Cell Carcinoma Prognosis-Subtype Distinctions Based on DNA Methylation Patterns.

Authors:  Guichuan Huang; Jing Zhang; Ling Gong; Daishun Liu; Xin Wang; Yi Chen; Shuliang Guo
Journal:  Med Sci Monit       Date:  2021-03-04

5.  Regulatory pattern of abnormal promoter CpG island methylation in the glioblastoma multiforme classification.

Authors:  Rendong Wang; Lei Zhao; Shijia Wang; Xiaoxiao Zhao; Chuanyu Liang; Pei Wang; Dongguo Li
Journal:  Front Genet       Date:  2022-09-19       Impact factor: 4.772

Review 6.  Clinicopathological implications of lncRNAs, immunotherapy and DNA methylation in lung squamous cell carcinoma: a narrative review.

Authors:  Gabriel B K Sasa; Cheng Xuan; Meiyue Chen; Zhenggang Jiang; Xianfeng Ding
Journal:  Transl Cancer Res       Date:  2021-12       Impact factor: 1.241

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.