Literature DB >> 31558162

Methylation and transcriptome analysis reveal lung adenocarcinoma-specific diagnostic biomarkers.

Rui Li1, Yi-E Yang2, Yun-Hong Yin1, Meng-Yu Zhang1, Hao Li3, Yi-Qing Qu4.   

Abstract

BACKGROUND: DNA methylation can regulate the role of long noncoding RNAs (lncRNAs) in the development of lung adenocarcinoma (LUAD). The present study aimed to identify methylation-driven lncRNAs and mRNAs as biomarkers in the prognosis of LUAD using bioinformatics analysis.
METHODS: Differentially expressed RNAs were obtained using the edge R package from 535 LUAD tissues and 59 adjacent non-LUAD tissues. Differentially methylated genes were obtained using the limma R package from 475 LUAD tissues and 32 adjacent non-LUAD tissues. Methylation-driven mRNA and lncRNA were obtained using the MethylMix R package from 465 LUAD tissues with matched DNA methylation and RNA expression and 32 non-LUAD tissues with DNA methylation. Gene ontology and ConsensusPathDB pathway analysis were performed to identify functional enrichment of methylation-driven mRNAs. Univariate and multivariate Cox regression analyses were performed to identify the independent effect of each variable for predicting the prognosis of LUAD. Kaplan-Meier curve analysis of DNA methylation and gene expression might provide potential prognostic biomarkers for LUAD patients.
RESULTS: A total of 99 methylation-driven mRNAs and 17 methylation-driven lncRNAs were obtained. Univariate and multivariate Cox regression analysis showed that 6 lncRNAs (FOXE1, HOXB13-AS1_2, VMO1, HIST1H3F, AJ003147.8, ASXL3) were retrieved to construct a predictive model associated with overall survival in LUAD patients. Combined DNA methylation and gene expression survival analysis revealed that 4 lncRNAs (AC023824.1, AF186192.1, LINC01354 and WASIR2) and 8 mRNAs (S1PR1, CCDC181, F2RL1, EFS, KLHDC9, MPV17L, GKN2, ITPRIPL1) might act as independent biomarkers for the prognosis of LUAD.
CONCLUSIONS: Methylation-driven lncRNA and mRNA contribute to the survival of LUAD, and 4 lncRNAs and 8 mRNAs might be potential biomarkers for the prognosis of LUAD.

Entities:  

Keywords:  Biomarkers; Lung adenocarcinoma; Methylation-driven lncRNA; Methylation-driven mRNA; Overall survival rate

Year:  2019        PMID: 31558162      PMCID: PMC6764142          DOI: 10.1186/s12967-019-2068-z

Source DB:  PubMed          Journal:  J Transl Med        ISSN: 1479-5876            Impact factor:   5.531


Background

Lung cancer is the leading cause of cancer-related death worldwide [1]. Lung adenocarcinoma (LUAD) accounts for 45–55% of non-small-cell lung cancer (NSCLC), with a 5-year overall survival rate of less than 15% [2]. Due to the heterogeneity of lung adenocarcinoma, it is still a great challenge to develop successful individual-based treatment [3]. Therefore, there is an urgently need to identify effective and promising biomarkers in predicting the prognosis of LUAD. Genetic aberrant expression is crucial for cancer etiology, and the joint effect of both genetic and epigenetic changes facilitates the development of human cancer [4-6]. DNA methylation acts as the key element in epigenetic modifications and plays a significant role in the regulation of cellular functions and carcinogenesis [7-9]. Epigenetic modification, especially DNA methylation, can provide a novel horizon to explore new biomarkers in predicting the prognosis of cancer [10-14]. A large number of studies have demonstrated that DNA methylation can regulate the expression of lncRNA, and this phenomenon may be associated with the prognosis of lung cancer [15]. For instance, the lncRNA AFAP1-AS1 acts as an oncogene in NSCLC, while its expression is tightly regulated by DNA methylation, which might provide prognostic and diagnostic values for NSCLC patients [16]. MethylMix, an algorithm implemented in the R programming environment, identifies disease-specific hyper- and hypomethylated genes [17]. Currently, few studies on methylation-driven genes have been reported [18]. Recently, a study based on using MethylMix to explore methylation-driven genes for predicting the prognosis of LUAD was reported; however, they only obtained information about methylation-driven mRNA [19]. In this study, DNA methylation and RNA-Seq data were extracted from The Cancer Genome Atlas (TCGA) database, and we used the MethylMix R package to obtain LUAD-specific methylation-driven lncRNA sequences. Furthermore, a Cox survival predictive model with 6 lncRNAs was constructed to predict the diagnosis and prognosis of LUAD. Finally, the combined effect of DNA methylation and gene expression survival analysis was examined, which might provide a novel insight to explore methylation-driven lncRNA and mRNA for predicting the prognosis of LUAD.

Methods

Data retrieving and analyzing

Methylation and RNA-Seq expression data were retrieved from LUADs from the TCGA database. The methylation data were downloaded from 475 cancer tissues and 32 noncancer tissues from the Illumina Human methylation 450k platform. The RNA-Seq data (level 3), including mRNA and lncRNA expression, were retrieved from 535 cancer tissues and 59 noncancer tissues from the IlluminaHiSeq_RNASeq platform. First, on the basis of “limma R” packages in R with absolute fold change (log 2) > 0 and adjusting the false discovery rate (FDR) to a P value < 0.05, we obtained aberrant methylated genes. The methylation difference of the mean of the 3000 bp (base pair) sites upstream of the gene was analyzed to identify the differential level of methylation in the gene promoter [19-22]. We analyzed the differential level of methylation in the promoter of genes by using the limma R package [23]. Then, based on the “edge R” packages in R with absolute fold change (log 2) > 3 and adjusting the false discovery rate (FDR) to a P value < 0.01 to correct the statistical significance of multiple experiments, we retrieved differentially expressed mRNA and lncRNA. MethylMix is a kind of R statistical package for integrating DNA methylation data and RNA expression data to identify methylation-driven genes in kinds of cancers [17, 24–26]. Filtering or eliminating missing value genes is the preprocessing common step when running MethylMix R software [17]. In this present manuscript, we filtered or eliminated missing value genes and intersected DNA methylation data with RNA expression data for matching. Finally, there were a total of 465 LUAD samples with matched DNA methylation and RNA expression and 32 non-LUAD samples with DNA methylation data for entering the MethylMix R package. Then, calculated the correlation between DNA methylation level and RNA expression to find significantly negatively related genes, a beta mixture model was constructed for the degree of methylation of samples, Wilcoxon rank test was used to calculate differential methylation in LUAD and adjacent non-LUAD samples. Finally, the methylation-driven mRNA and lncRNA was obtained. Since the data were directly obtained from the TCGA database, no approval was required from the local ethics committee.

Functional enrichment analysis of methylation-driven mRNA in LUAD

To determine the function represented in the methylation-driven mRNA, we used the Database for Annotation, Visualization and Integrated Discovery (DAVID) (http://david.abcc.ncifcrf.gov/) to perform a functional and enrichment analysis on the methylation-driven mRNA by using GO and ConsensusPathDB analysis. In the GO analysis, a P value of less than 0.05 was considered statistically significant. Furthermore, the GOCircle and GOChord plotting functions of the GOplot R package were used to allow data from expression analysis and data from functional annotation enrichment analysis. ConsensusPathDB (http://cpdb.molgen.mpg.de/) is an online software program that includes binary and complex signaling, gene regulatory and drug-target interactions, and biochemical pathways. P < 0.05 was considered statistically significant.

Construction of a differentially methylated, lncRNA-related predictive model in LUAD

We identified the differentially methylated lncRNA associated with overall survival with P < 0.05 to act as prognostic methylation lncRNA candidates for multivariate Cox regression analysis. On the basis of the median risk score, LUAD patients were divided into two cohorts, high-risk cohorts and low-risk cohorts. Receiver operating curves were used to test the effect of the lncRNA signature (high risk vs low risk) on overall survival. We analyzed the receiver operating curve by calculating the area under the curve (AUC) under the binomial exact confidence interval to reveal prognostic biomarkers for predicting survival in LUAD.

Combined methylation and gene expression survival analysis

To further explore the effect of methylation level and gene expression level of the same methylation-driven key gene on LUAD patient prognosis, we performed a combined methylation and gene expression survival analysis to identify potential methylation-driven mRNAs and methylation-driven lncRNAs for predicting the prognosis of LUAD patients. Therefore, a Kaplan–Meier curve was performed. P < 0.05 was regarded as statistically significant.

Results

Identification of methylation-driven mRNA and lncRNA in LUAD

A total of 99 mRNAs and 17 lncRNAs were identified to be associated with DNA methylation using MethylMix criteria. The methylation-driven mRNAs and IncRNAs are shown in Tables 1 and 2, respectively. We constructed a mixed model and performed a Wilcoxon rank test for determining differential methylation (logFC > 0, P < 0.05, Cor < − 0.3). Figure 1 shows that two methylation-driven mRNAs (ZNF454 and ZNF471) (Fig. 1f, g) and two methylation-driven lncRNAs (TUSC8 and LINC00676) (Fig. 1h, i) have significant negative correlations between methylation and gene expression levels. In Fig. 1, the distribution of the methylation degree shows that ZNF454 (Fig. 1b), ZNF471 (Fig. 1c), and LINC00676 (Fig. 1e) are hypermethylated in LUAD patients and hypomethylated in the normal group, while TUSC8 (Fig. 1d) is hypomethylated in LUAD patients and hypermethylated in the normal group. A heat map of methylation-driven mRNAs and lncRNAs is shown in Fig. 2a, b. A flow diagram of the exploration of methylation-driven mRNA and lncRNA in LUAD is shown in Fig. 1a.
Table 1

Methylation-driven mRNAs

mRNANormal meanTumor meanlogFCP ValueAdjusted-PCorCor P-value
ECSCR0.6515497450.7778752070.2556633956.97E−211.03E−18− 0.4016943461.85E−19
TBX40.34918940.5030874080.5267993157.58E−211.12E−18− 0.3538911863.64E−15
TIE10.5598308670.6750471370.2699972121.21E−201.79E−18− 0.4066299016.08E−20
NEFH0.2544441220.4355423740.7754642241.40E−202.07E−18− 0.3217988881.16E−12
ACVRL10.3989753450.5175351290.3753571953.26E−204.82E−18− 0.3868776744.73E−18
USHBP10.4191998870.5250594560.3248424713.63E−205.37E−18− 0.3230072479.42E−13
ERN20.2646479250.3876482780.5506739143.92E−205.80E−18− 0.4507286211.20E−24
COX7A10.5603884440.6660595790.2492240269.41E−201.39E−17− 0.3425692423.00E−14
SULT1C40.1769800420.31131280.8147782242.45E−193.62E−17− 0.3753383185.28E−17
ART40.5312341990.6526807880.2970295482.53E−193.75E−17− 0.4253402947.44E−22
HIST1H3E0.3138894790.4561381760.5392142441.45E−182.15E−16− 0.3904701962.19E−18
ZNF4920.0776504270.1963642341.338466421.18E−171.75E−15− 0.3788663422.55E−17
ZNF7280.1387310420.2909551211.0685060011.32E−171.95E−15− 0.3463566621.50E−14
S1PR10.336442150.4244564450.335258064.34E−176.43E−15− 0.3754950785.11E−17
MUC130.6578417860.551979674− 0.2531255074.44E−176.57E−15− 0.4026042021.51E−19
CCDC80.3606715720.4805009590.4138535968.99E−171.33E−14− 0.3890644822.96E−18
ZNF5780.2811455570.3946156140.4891307941.12E−161.66E−14− 0.4596215781.11E−25
FES0.4249188250.499460570.2331835312.74E−164.05E−14− 0.3709604121.29E−16
ASCL10.0967162990.2007037481.0532366042.98E−164.41E−14− 0.3317638792.08E−13
ALG1L0.4613797190.287062126− 0.6845915893.14E−164.65E−14− 0.4669083781.49E−26
ELF30.4624121930.359973363− 0.3612892854.47E−166.61E−14− 0.450677241.22E−24
TMEM880.5836256180.7381896250.3389482497.89E−161.17E−13− 0.3903088982.27E−18
GSTM50.3568963030.4819420790.4333548151.13E−151.68E−13− 0.3534627693.95E−15
IRX10.0986298870.2205524171.1610247831.21E−151.79E−13− 0.3441576872.24E−14
ZNF4540.1836161210.3385014110.882469121.44E−152.13E−13− 0.6022353213.13E−47
TK20.6152916410.7029219160.1920940421.71E−152.52E−13− 0.3653522133.94E−16
ZSCAN10.2897684040.4003553480.4663807832.83E−154.19E−13− 0.3974889984.73E−19
ZNF6770.2204971960.3156265970.5174584712.86E−154.23E−13− 0.5273520481.21E−34
ZNF5820.1303720530.2396791070.8784695055.74E−158.49E−13− 0.5589686421.43E−39
DAPP10.5104325740.394016678− 0.3734637041.29E−141.91E−12− 0.4905845591.57E−29
SRPX20.6005913350.480798168− 0.320952261.42E−142.10E−12− 0.3834301499.83E−18
CCDC1810.2836055850.418238990.5604416173.23E−144.78E−12− 0.3190447541.84E−12
SULT4A10.246524820.3480991930.4977655677.69E−141.14E−11− 0.3582081981.59E−15
LRRC40.3213703510.4186185020.381399241.26E−131.87E−11− 0.4210451832.10E−21
ZSCAN230.122510130.2093360950.7729200461.57E−132.32E−11− 0.3625841376.80E−16
F2RL10.2427906450.208656331− 0.2185841381.74E−132.57E−11− 0.359413421.26E−15
ZNF3340.174666990.2568299330.5562063712.31E−133.43E−11− 0.4594510821.16E−25
ZNF4710.1307380510.2790805461.0940024655.46E−138.07E−11− 0.5480114948.37E−38
HOXB20.343549420.5130880340.5786887336.70E−139.91E−11− 0.4911556951.32E−29
PRR190.6664072620.7287560310.1290317969.23E−131.37E−10− 0.4208361292.20E−21
AGR20.6699539340.554274136− 0.2734622091.68E−122.49E−10− 0.512056841.93E−32
NQO10.7021580990.525979398− 0.4167896162.08E−123.08E−10− 0.4254440787.26E−22
GIPC20.1793104150.2815721650.6510454333.52E−125.21E−10− 0.367210462.73E−16
OXT0.5532652880.6567855730.2474510264.61E−126.83E−10− 0.5193447441.77E−33
B3GALT20.5342569640.6586269010.3019276316.78E−121.00E−09− 0.4777781936.83E−28
EFS0.1976136250.2971572420.5885441151.42E−112.10E−09− 0.3914226311.78E−18
RAB340.2955288690.254270265− 0.216936312.05E−113.03E−09− 0.4244114539.32E−22
ACTRT30.3911860630.353188058− 0.1474184085.54E−118.21E−09− 0.3291545723.27E−13
CLDN80.7278302550.618464903− 0.2349102979.70E−111.44E−08− 0.3286900813.55E−13
AQP10.3665699840.5014822990.4521101212.34E−103.46E−08− 0.5972459372.74E−46
SP80.2028413560.2979169670.5545584662.69E−103.98E−08− 0.3170096282.58E−12
HORMAD20.3504762030.4611468190.3959096583.14E−104.65E−08− 0.4579077371.76E−25
GALM0.2559486250.226747119− 0.1747700416.86E−101.02E−07− 0.3562396062.33E−15
RABGGTB0.478356780.385164319− 0.3126129856.89E−101.02E−07− 0.33300191.67E−13
KCNE30.2566711420.3143161990.2922955257.52E−101.11E−07− 0.3500377227.53E−15
ZNF8790.1476298210.20003670.4382805351.01E−091.50E−07− 0.4538565875.23E−25
ZNF2570.1575890160.2456099730.6402021551.28E−091.89E−07− 0.3591209321.34E−15
ZNF3820.1014920.2954889811.5417383161.35E−091.99E−07− 0.3915665211.73E−18
GSTM10.1563964310.2539100310.6991097972.76E−094.08E−07− 0.5632121622.84E−40
PLAU0.621501960.7021459350.1760119734.00E−095.92E−07− 0.5709344251.41E−41
PIGR0.6679525010.7085755820.0851762365.52E−098.16E−07− 0.6586181493.68E−59
CFTR0.2829430870.3949726660.4812409269.95E−091.47E−06− 0.421508431.88E−21
SPDYC0.7041474680.7248371070.0417792131.76E−082.60E−06− 0.475883251.18E−27
ZNF4180.2425110450.3479066190.5206496752.19E−083.23E−06− 0.604359231.23E−47
KLHDC90.2182233260.21602453− 0.0146101822.63E−083.90E−06− 0.3249840316.72E−13
ZNF690.1175643790.1426194040.2787192742.66E−083.94E−06− 0.3211045041.30E−12
FAM84A0.2121753320.2830415650.4157573.53E−085.23E−06− 0.3359164419.96E−14
PPP1R14D0.5549478270.482097369− 0.2030275896.64E−089.82E−06− 0.5607104027.40E−40
TCP110.6241046270.6599129430.0804878077.18E−081.06E−05− 0.6517468791.43E−57
ZNF3000.3284090050.4152500760.3384867469.38E−081.39E−05− 0.4721101833.46E−27
MPV17L0.0923857160.1632539780.8213764311.86E−072.75E−05− 0.4417925341.23E−23
KRT200.7903742370.8040700210.024785222.92E−074.33E−05− 0.5718002351.00E−41
GKN20.5992992480.6712129350.1634939554.56E−076.76E−05− 0.3406976544.22E−14
ZNF5020.3292088790.4026970690.2906917256.68E−079.88E−05− 0.6632711712.91E−60
C17orf980.5454965570.6025661940.1435496441.03E−060.000151843− 0.5298653335.12E−35
ZNF8800.1408750180.2387869260.761308051.04E−060.000154315− 0.5607150997.38E−40
ZNF7010.2407002250.2904761880.2711824111.55E−060.000229592− 0.4086432073.83E−20
NR0B10.2836142740.3926517760.4693202791.59E−060.000235507− 0.3207097031.39E−12
ZNF430.0671790390.1157889420.785414421.79E−060.000265629− 0.3751241865.52E−17
HCAR10.434002310.376162376− 0.2063471671.93E−060.000285658− 0.4011995742.07E−19
IRX20.2898872570.402814960.4746253513.59E−060.000530859− 0.5394218251.83E−36
TMEM63A0.1887449010.177446694− 0.0890519734.10E−060.000607535− 0.3450969821.89E−14
ITPRIPL10.2945028020.3740692040.3450238394.14E−060.000613146− 0.449604881.61E−24
LYZ0.6992316980.7336827940.0693858644.77E−060.000705417− 0.4248404668.40E−22
IFNLR10.2462261990.232227751− 0.0844438919.60E−060.001421224− 0.4206731592.29E−21
TUSC10.136591810.182963410.4216841849.60E−060.001421229− 0.3955875977.19E−19
MAGEB20.8424828350.77366049− 0.1229466941.08E−050.001598591− 0.3520088195.20E−15
BVES0.1914066380.2526645510.4005824071.12E−050.001655808− 0.3214490811.23E−12
LRRIQ40.7043790760.645150221− 0.1267169261.21E−050.001791741− 0.389126312.93E−18
RASSF100.1204200320.1810390380.5882254165.00E−050.007400364− 0.3282527943.83E−13
PRICKLE40.6053306620.6863797710.1812836038.15E−050.012068682− 0.3188833471.89E−12
SYCP20.6650379970.6753036890.0220996680.0001230740.018214969− 0.6559435211.55E−58
WBP2NL0.5245793670.489275744− 0.10051330.0001840040.027232614− 0.3432460322.65E−14
BST20.4228812060.380549863− 0.1521669440.0001872920.027719241− 0.5896170957.07E−45
PLSCR40.175713660.1966640140.1625066490.0002419160.035803514− 0.3651490844.10E−16
GBP40.1499418940.137445221− 0.1255467840.0002492460.036888372− 0.3164765162.82E−12
RPL7A0.8019958470.754500966− 0.0880720190.0002752070.040730644− 0.3192713721.77E−12
ARHGDIB0.3276073530.313354218− 0.0641733230.0002820850.04174863− 0.5522431.77E−38
CYB5A0.1273001530.1413094760.1506240550.0003029430.044835593− 0.3711060981.25E−16
Table 2

Methylation-driven lncRNAs

lncRNANormal meanTumor meanlogFCP-valueAdjusted-PCorCor P-value
HOTAIRM10.2504635180.443383840.8239557092.68E−196.44E−18− 0.3268786464.85E−13
HOXB-AS30.3196866330.4491954380.4906848535.50E−181.32E−16− 0.4129136721.43E−20
HOXB-AS10.3350419070.4841218930.5310287791.17E−172.80E−16− 0.4591437981.26E−25
AF186192.10.1535171150.2898652740.9169830021.36E−163.26E−15− 0.4320330571.44E−22
WASIR20.675680.534227796− 0.338885113.03E−167.26E−15− 0.380411641.85E−17
HOXC-AS30.1804576480.3044257090.7544299093.10E−157.45E−14− 0.336844338.44E−14
LINC013540.5071199550.6569773520.3735165942.20E−145.28E−13− 0.4918256211.08E−29
BARX1-AS10.4694749280.5719056610.2847290721.61E−133.87E−12− 0.4854000127.36E−29
AC005498.30.0773939110.1507628430.961988949.66E−112.32E−09− 0.3476152951.18E−14
AC147651.40.5458278830.591349570.1155651229.66E−112.32E−09− 0.354242623.41E−15
LINC006760.656371480.7703294640.2309630532.88E−106.92E−09− 0.6523054151.07E−57
LINC014600.2323708260.19315723− 0.2666532764.57E−091.10E−07− 0.3649972724.23E−16
AC023824.10.6465229450.7034298030.1217048877.06E−091.69E−07− 0.3658417353.58E−16
LINC015350.2278399480.2920018280.3579566844.99E−071.20E−05− 0.3719765921.05E−16
LINC005060.1416700970.2035603610.5229213833.03E−050.000726506− 0.3304614432.61E−13
TUSC80.8060788730.749074143− 0.1058124889.91E−050.002378923− 0.7130800722.02E−73
LINC008470.8209169160.8245589070.0063863480.0011306540.027135699− 0.350467976.95E−15
Fig. 1

Identification of top hypermethylated and hypomethylated mRNAs and lncRNAs in LUAD. a A flow diagram of the exploration of methylation-driven mRNA and lncRNA in LUAD. b–e The methylation degree when comparing cancer patients to normal patients in LUAD. The red curve indicates the methylation degree from the cancer group, the green curve indicates the methylation degree from the normal group, and the black line above the figure is the distribution of methylation levels in normal patients. f–i The correlation between methylation and gene expression in methylation-driven mRNAs and lncRNAs

Fig. 2

Heat map of methylation-driven mRNAs and lncRNAs in LUAD. a The hierarchical clustering heat map of LUAD-specific methylation-driven mRNAs. b The hierarchical clustering heat map of LUAD-specific methylation-driven lncRNAs. In the figure, red represents highly methylated genes and green represents low methylated genes between LUAD and adjacent tissues

Methylation-driven mRNAs Methylation-driven lncRNAs Identification of top hypermethylated and hypomethylated mRNAs and lncRNAs in LUAD. a A flow diagram of the exploration of methylation-driven mRNA and lncRNA in LUAD. b–e The methylation degree when comparing cancer patients to normal patients in LUAD. The red curve indicates the methylation degree from the cancer group, the green curve indicates the methylation degree from the normal group, and the black line above the figure is the distribution of methylation levels in normal patients. f–i The correlation between methylation and gene expression in methylation-driven mRNAs and lncRNAs Heat map of methylation-driven mRNAs and lncRNAs in LUAD. a The hierarchical clustering heat map of LUAD-specific methylation-driven mRNAs. b The hierarchical clustering heat map of LUAD-specific methylation-driven lncRNAs. In the figure, red represents highly methylated genes and green represents low methylated genes between LUAD and adjacent tissues

Enrichment analysis of methylation-driven mRNAs in LUAD

Gene ontology analysis showed that there were 5 GO terms (regulation of transcription, DNA-templated; transcription factor activity, sequence-specific DNA binding; nucleic acid binding; transcription, DNA-templated; metal ion binding) with significant differences (P < 0.05), and the highest GO biological process was “GO:0006355 regulation of transcription, DNA-templated” (Fig. 3a, c). The GOChord plot shows the top 30 methylation-driven mRNAs with their related GO terms (Fig. 3b). Figure 4 shows that 11 pathways (Generic Transcription Pathway, Benzene metabolism, RNA Polymerase II Transcription, Gene expression (Transcription), Platinum Pathway, Pharmacokinetics/Pharmacodynamics, Phase II—Conjugation of compounds, Sulfation Biotransformation Reaction, Estrogen metabolism, Glutathione-mediated detoxification, Cytosolic sulfonation of small molecules, Drug metabolism—other enzymes—Homo sapiens (human)) were considered statistically significant (P < 0.05). Furthermore, pathway analysis showed that the methylation-driven mRNAs were most enriched in the Generic Transcription Pathway, RNA Polymerase II Transcription and Gene expression (Transcription) pathways (P < 0.01) (Fig. 4). The pathway analysis is shown in Table 3.
Fig. 3

Functional enrichment analysis of methylation-driven mRNAs in LUAD. a The outer circle represents the expression (logFC) of methylation-driven mRNAs in each enriched GO (gene ontology) term: red dots on each GO term indicate upregulated methylation-driven mRNAs and blue dots indicate downregulated methylation-driven mRNAs. The inner circle indicates the significance of GO terms (log10-adjusted P values). b The circle indicates the correlation between the top 30 methylation-driven mRNAs and their gene ontology terms. c The distribution of the methylation-driven mRNAs in significant GO terms

Fig. 4

Pathway analysis of methylation-driven mRNAs in LUAD. The red circles indicate the number of methylation-driven mRNAs on each pathway. The line between the two red circles represents the ratio of methylation-driven mRNAs present in the common genes of the two pathways; the thicker the line, the more common methylation-driven mRNAs are represented

Table 3

Pathway analysis

IDPathwayCountP-valueq-value
R-HSA-212436Generic transcription pathway150.0001692480.01031453
WP3891Benzene metabolism20.0003381810.01031453
R-HSA-73857RNA polymerase II transcription150.0005641620.011471298
R-HSA-74160Gene expression (transcription)150.0016680040.020896371
PA150642262Platinum pathway, Pharmacokinetics/pharmacodynamics20.0017207260.020896371
R-HSA-156580Phase II—conjugation of compounds40.0020553810.020896371
WP692Sulfation biotransformation reaction20.0029632690.025340686
WP697Estrogen metabolism20.0033233690.025340686
PWY-4061Glutathione-mediated detoxification20.0058848460.035982304
R-HSA-156584Cytosolic sulfonation of small molecules20.0063768380.035982304
path:hsa00983Drug metabolism—other enzymes—Homo sapiens (human)30.0064886120.035982304
Functional enrichment analysis of methylation-driven mRNAs in LUAD. a The outer circle represents the expression (logFC) of methylation-driven mRNAs in each enriched GO (gene ontology) term: red dots on each GO term indicate upregulated methylation-driven mRNAs and blue dots indicate downregulated methylation-driven mRNAs. The inner circle indicates the significance of GO terms (log10-adjusted P values). b The circle indicates the correlation between the top 30 methylation-driven mRNAs and their gene ontology terms. c The distribution of the methylation-driven mRNAs in significant GO terms Pathway analysis of methylation-driven mRNAs in LUAD. The red circles indicate the number of methylation-driven mRNAs on each pathway. The line between the two red circles represents the ratio of methylation-driven mRNAs present in the common genes of the two pathways; the thicker the line, the more common methylation-driven mRNAs are represented Pathway analysis

Construction of a predictive model of six differentially methylated lncRNAs in LUAD

Univariate Cox regression analysis was performed first to identify the prognosis associated with differentially methylated genes in LUAD, incorporating 10 methylation genes that were conspicuously associated with overall survival (P < 0.05). Next, multivariate Cox regression was used and showed that six lncRNAs were eventually selected to construct a predictive model. We used the linear combination of the expression of the 6 lncRNAs to construct the predictive model. The relative coefficients weighted in the multivariate Cox regression were as follows: survival risk score = (3.0040 × expression value of FOXE1 + 1.0226 × expression value of HOXB13-AS1_2 + 1.0540 × expression value of VMO1 + 1.0050 × expression value of HIST1H3F + (− 3.0925) × expression value of AJ003147.8 + 1.4791 × expression value of ASXL3). The multivariate Cox analysis is shown in Table 4.
Table 4

Multivariate Cox regression analysis of 6 lncRNAs associated with overall survival in LUAD patients

coefexp(coef)se(coef)zP
FOXE13.00420.16651.0182.950.0032
HOXB13-AS1_21.02262.78040.56571.810.0706
VMO11.0542.8690.59921.760.0786
HIST1H3F1.0052.73190.38722.60.0094
AJ003147.8− 3.09250.04540.6695− 4.623.80E−06
ASXL31.47914.38880.79691.860.0635
Multivariate Cox regression analysis of 6 lncRNAs associated with overall survival in LUAD patients

Risk groupings and ROC curve analysis

As shown in the heat map, the expression of six prognostic methylation genes was profiled (Fig. 5a). Based on the median risk scores, a total of 449 samples of complete survival information were divided into a high-risk group (n = 224) and a low-risk group (n = 225). We used the Kaplan–Meier curve with a log-rank statistical examination to perform survival analysis. As shown in Fig. 5b, patients in the low-risk group had conspicuously better overall survival than those in the high-risk group (Fig. 5b). The receiver operating characteristic (ROC) curve was analyzed to test the influence on the 6-lncRNA signature associated with overall survival in LUAD (Fig. 5c).
Fig. 5

Prognostic value of 6-methylation lncRNAs in LUAD. a A risk heat map established from 6 lncRNAs from 449 LUAD patients. b Kaplan–Meier curve analysis for OS (overall survival) of LUAD patients using the 6 lncRNA signatures. c ROC curve analysis of the prognostic 6-lncRNA signature

Prognostic value of 6-methylation lncRNAs in LUAD. a A risk heat map established from 6 lncRNAs from 449 LUAD patients. b Kaplan–Meier curve analysis for OS (overall survival) of LUAD patients using the 6 lncRNA signatures. c ROC curve analysis of the prognostic 6-lncRNA signature

Combined methylation and gene expression survival analysis in LUAD

The combined Kaplan–Meier curve analysis revealed that the combination of methylation and expression of lncRNAs AC023824.1, AF186192.1, LINC01354 and WASIR2 had a conspicuous correlation with the prognosis of LUAD patients (Fig. 6a–d). The hypermethylation and low-expression survival rate of AC023824.1 was high, while the hypermethylation and low-expression survival rate of AF186192.1, LINC01354 and WASIR2 were low. The combined Kaplan–Meier curve analysis showed that the combination of methylation and mRNA expression of mRNAs CCDC181, EFS, F2RL1, GKN2, ITPRIPL1, KLHDC9, MPV17L, and S1PR1 were associated with overall survival of LUAD (P < 0.05) (Fig. 6e–l). The hypermethylation and low-expression survival rate of F2RL1 was high. However, the hypermethylation and low-expression survival rates of EFS, CCDC181, GKN2, ITPRIPL1, KLHDC9, MPV17L, and S1PR1 were low (Fig. 6).
Fig. 6

The combined methylation and gene expression data survival analysis in LUAD. a–d Kaplan–Meier curve analysis of four lncRNAs in LUAD. e–l Kaplan–Meier curve analysis of eight mRNAs in LUAD

The combined methylation and gene expression data survival analysis in LUAD. a–d Kaplan–Meier curve analysis of four lncRNAs in LUAD. e–l Kaplan–Meier curve analysis of eight mRNAs in LUAD

Discussion

In recent years, with the increasing numbers of advanced diagnoses and poor prognoses in lung adenocarcinoma, it is pivotal to find more effective prognostic biomarkers to predict survival in LUAD. LncRNA-related studies have attracted the attention of various cancer fields. Accumulating studies show that cancer-related lncRNAs may serve as diagnostic or predictive biomarkers of cancer and have a significant effect on the therapeutic treatment of cancer [27]. Emerging evidence shows that studies on the molecular mechanisms and prognostic biomarkers of LUAD associated with methylation-driven lncRNA and mRNA are still lacking. In recent years, epigenetic alterations in DNA methylation, noncoding RNA expression, chromatin modeling and post-transcriptional regulators have been found to play significant roles in the regulation and development of lung cancer pathogenesis [28-32]. Some studies have shown that epigenetic changes in DNA methylation cause changes in the expression of lncRNA, which might provide a novel insight to explore new biomarkers for predicting the prognosis of human cancer [33-36]. For instance, analysis of microarray data on gene expression and methylation showed that the expressions of lncRNAs LOC146880 and ENST00000439577 were regulated by DNA methylation, which might provide a new horizon to predict the diagnosis and prognosis of NSCLC [37]. Lu et al. indicated that MEG3 is significantly downregulated in NSCLC tissues that could be affected by DNA methylation [38]. Previous studies have shown that survival-associated, methylation-driven lncRNAs might serve as novel prognostic biomarkers for predicting the prognosis of LUAD. Recent studies have shown that the roles of lncRNA in tumorigenesis and metastasis can indicate that lncRNA may function as a novel biomarker for the diagnosis and prognosis of cancer [39-44]. LncRNA TUBA4B has been reported to serve as a new predictor for prognosis and modulate cell viability in non-small-cell lung cancer [45]. lncRNA AFAP1-AS1 may act as an oncogenic to facilitate the migration of non-small-cell lung cancer (NSCLC) [16]. Long noncoding RNA ANRIL acts as an oncogene by silencing KLF2 and P21 expression to promote the development of NSCLC [46]. LncRNA PANDAR acts as a cancer suppressor gene by regulating Bcl-2 to affect cell apoptosis in NSCLC [47]. In the present study, we retrieved methylation and lncRNA and mRNA expression from the TCGA database by using bioinformatics analysis and obtained methylation-driven lncRNAs and mRNAs to predict the prognosis of LUAD. First, we obtained differentially expressed methylation and lncRNA and mRNA using the MethylMix R package to obtain methylation-driven lncRNA and mRNA. Functional enrichment analysis was performed to analyze the methylation-driven mRNA to identify its biological functions in the regulation and development of LUAD. Furthermore, univariate and multivariate Cox regressions were performed to construct a predictive model for predicting the prognosis of LUAD. Finally, a combined methylation and lncRNA expression survival analysis was carried out, which might provide novel insight to predict the diagnosis and prognosis of LUAD. In the present study, we combined methylation and lncRNA and mRNA expression data with survival analysis to identify 4 lncRNAs and 8 mRNAs that act as independent prognostic factors for predicting the diagnosis and prognosis of LUAD. LINC01354 acts as a ceRNA to predict the early diagnosis and prognosis of colorectal cancer [48]. The combined survival analysis showed that the low expression of AC023824.1 with hypermethylation, compared to the high expression of AC023824.1 with hypomethylation, had a higher survival rate. (P = 0). The combined hypermethylation and low-expression survival rate of AF186192.1 was lower than the hypomethylation and high-expression survival rate of AF186192.1. (P = 0.01). The low expression and hypermethylation survival rate of LINC01354 was low. (P = 0.038). The high expression and hypomethylation survival rate of WASIR2 was high. The survival analysis showed that lncRNA AF186192.1, LINC01354 and WASIR2 might act as cancer suppressor genes regulated by DNA methylation to play significant roles in predicting the prognosis of LUAD. LncRNA AC023824.1 might act as an oncogene regulated by DNA methylation to have a pivotal effect on predicting the prognosis of LUAD. The survival rate of hypermethylation and low expression of CCDC181, EFS, GKN2, ITPRIPL1, KLHDC9, MPV17L, and S1PR1 were low. However, the hypermethylation and low-expression of the survival rate of F2RL1 was high. F2RL1 might act as an oncogene for predicting the prognosis of LUAD. Previous studies have shown that CCDC181, KLHDC9, and S1PR1 act as methylation-driven genes to reveal prognostic biomarkers in LUAD [19]. GKN2 may contribute to the homeostasis of gastric epithelial cells by inhibiting GKN1 activity [49]. F2RL1 may act as novel acute myeloid leukemia subsets that are meaning for treatment guidance [50]. MPV17L acts as a unique interacting protein and regulator of HtrA2 protease, mediating antioxidant and antiapoptotic functions in mitochondria [51]. Compared with previous studies, our study first obtained differentially expressed mRNA and lncRNA using the edge R package and aberrant methylated genes using the limma R package, and then we filtered low expression genes and intersected mRNA and lncRNA expression data with DNA methylation data to obtain methylation-driven genes by using the MethylMix R package. The MethylMix (https://bioconductor.riken.jp/packages/3.1/bioc/html/MethylMix.html) is an algorithm implemented to integrate DNA methylation with RNA expression to identify methylation-driven genes in cancers [52]. In summary, MethylMix provides a tool that contributes to the analysis of methylation-driven lncRNAs and mRNAs in cancer studies from TCGA [17, 18]. However, the MethylMix focuses on identifying cis-regulatory effects of DNA methylation on gene expression and does not currently model trans-regulatory effects [18]. Further studies are needed to solve the multiple testing challenge on identifying trans-regulatory effects of DNA methylation on gene expression. Our study may provide a novel method for determining disease-specific prognostic biomarkers in LUAD and may play a significant role in predicting the diagnosis and prognosis of LUAD. Our study subjects were retrieved from the TCGA database, which is a significant tool for analyzing prognostic biomarkers. It is not known whether our results are applicable to other groups. The predictive prognostic lncRNA and mRNA signature needs to be verified by molecular biologic experiments on clinical samples in future studies. Eventually, large-scale samples and experimental studies could validate the biological function of prognostic biomarkers in LUAD.

Conclusion

In conclusion, our study identified methylation-driven mRNA and lncRNA by using bioinformatics analysis from the TCGA database. A Cox predictive model was performed to identify independent prognostic factors. Methylation and gene expression data combined with survival analysis was used to identify LUAD-specific, methylation-driven lncRNAs and mRNAs for predicting the diagnosis and prognosis of LUAD.
  52 in total

Review 1.  Rules for making human tumor cells.

Authors:  William C Hahn; Robert A Weinberg
Journal:  N Engl J Med       Date:  2002-11-14       Impact factor: 91.245

2.  Long noncoding RNA-mediated maintenance of DNA methylation and transcriptional gene silencing.

Authors:  Faizaan Mohammad; Gaurav Kumar Pandey; Tanmoy Mondal; Stefan Enroth; Lisa Redrup; Ulf Gyllensten; Chandrasekhar Kanduri
Journal:  Development       Date:  2012-06-21       Impact factor: 6.868

Review 3.  Long non-coding RNAs: a new frontier in the study of human diseases.

Authors:  Xuefei Shi; Ming Sun; Hongbing Liu; Yanwen Yao; Yong Song
Journal:  Cancer Lett       Date:  2013-06-18       Impact factor: 8.679

4.  Mpv17l protects against mitochondrial oxidative stress and apoptosis by activation of Omi/HtrA2 protease.

Authors:  Stefanie Krick; Shaolin Shi; Wenjun Ju; Christian Faul; Su-yi Tsai; Peter Mundel; Erwin P Böttinger
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-04       Impact factor: 11.205

5.  Long interspersed nuclear element 1 hypomethylation is a marker of poor prognosis in stage IA non-small cell lung cancer.

Authors:  Kenichiro Saito; Kazuyuki Kawakami; Isao Matsumoto; Makoto Oda; Go Watanabe; Toshinari Minamoto
Journal:  Clin Cancer Res       Date:  2010-04-06       Impact factor: 12.531

6.  Impact of proposed IASLC/ATS/ERS classification of lung adenocarcinoma: prognostic subgroups and implications for further revision of staging based on analysis of 514 stage I cases.

Authors:  Akihiko Yoshizawa; Noriko Motoi; Gregory J Riely; Cami S Sima; William L Gerald; Mark G Kris; Bernard J Park; Valerie W Rusch; William D Travis
Journal:  Mod Pathol       Date:  2011-01-21       Impact factor: 7.842

Review 7.  The mammalian epigenome.

Authors:  Bradley E Bernstein; Alexander Meissner; Eric S Lander
Journal:  Cell       Date:  2007-02-23       Impact factor: 41.582

8.  Expression of LINC00312, a long intergenic non-coding RNA, is negatively correlated with tumor size but positively correlated with lymph node metastasis in nasopharyngeal carcinoma.

Authors:  Wenling Zhang; Chen Huang; Zhaojian Gong; Yanhua Zhao; Ke Tang; Xiaoling Li; Songqing Fan; Lei Shi; Xiayu Li; Pan Zhang; Yanhong Zhou; Donghai Huang; Fang Liang; Xinli Zhang; Minghua Wu; Li Cao; Jieru Wang; Yong Li; Wei Xiong; Zhaoyang Zeng; Guiyuan Li
Journal:  J Mol Histol       Date:  2013-03-26       Impact factor: 2.611

9.  Combination epigenetic therapy has efficacy in patients with refractory advanced non-small cell lung cancer.

Authors:  Rosalyn A Juergens; John Wrangle; Frank P Vendetti; Sara C Murphy; Ming Zhao; Barbara Coleman; Rosa Sebree; Kristen Rodgers; Craig M Hooker; Noreli Franco; Beverly Lee; Salina Tsai; Igor Espinoza Delgado; Michelle A Rudek; Steven A Belinsky; James G Herman; Stephen B Baylin; Malcolm V Brock; Charles M Rudin
Journal:  Cancer Discov       Date:  2011-11-09       Impact factor: 39.397

10.  Cause and consequences of genetic and epigenetic alterations in human cancer.

Authors:  B Sadikovic; K Al-Romaih; J A Squire; M Zielenska
Journal:  Curr Genomics       Date:  2008-09       Impact factor: 2.236

View more
  21 in total

1.  Risk coefficient model of necroptosis-related lncRNA in predicting the prognosis of patients with lung adenocarcinoma.

Authors:  GenYi Qu; NianXi Tan; HuiWei Chen; Zhimin Xie; QingZhu Li; YuLong Zhang
Journal:  Sci Rep       Date:  2022-06-29       Impact factor: 4.996

2.  Centromere protein N promotes lung adenocarcinoma progression by activating PI3K/AKT signaling pathway.

Authors:  Yi Zheng; Hui You; Jingzhu Duan; Biyu Chen; Chenlin Wu; Peipei Chen; Meifang Wang
Journal:  Genes Genomics       Date:  2022-02-12       Impact factor: 2.164

3.  Establishment of a Gene Signature to Predict Prognosis for Patients with Lung Adenocarcinoma.

Authors:  Zhaodong Li; Fangyuan Qi; Fan Li
Journal:  Int J Mol Sci       Date:  2020-11-11       Impact factor: 5.923

4.  A Novel Prognostic Model of Early-Stage Lung Adenocarcinoma Integrating Methylation and Immune Biomarkers.

Authors:  Jin Ren; Yun Yang; Chuanyin Li; Lu Xie; Ronggui Hu; Xiong Qin; Menghuan Zhang
Journal:  Front Genet       Date:  2021-01-21       Impact factor: 4.599

5.  Construction of a Myc-associated ceRNA network reveals a prognostic signature in hepatocellular carcinoma.

Authors:  Dan-Dan Zhang; Yi Shi; Ji-Bin Liu; Xiao-Li Yang; Rui Xin; Hui-Min Wang; Pei-Yao Wang; Cheng-You Jia; Wen-Jie Zhang; Yu-Shui Ma; Da Fu
Journal:  Mol Ther Nucleic Acids       Date:  2021-05-01       Impact factor: 8.886

6.  A methylation-based nomogram for predicting survival in patients with lung adenocarcinoma.

Authors:  Xuelong Wang; Bin Zhou; Yuxin Xia; Jianxin Zuo; Yanchao Liu; Xin Bi; Xiong Luo; Chengwei Zhang
Journal:  BMC Cancer       Date:  2021-07-12       Impact factor: 4.430

7.  A risk score system based on DNA methylation levels and a nomogram survival model for lung squamous cell carcinoma.

Authors:  Ming Zhang; Libing Sun; Yi Ru; Shasha Zhang; Junjun Miao; Pengda Guo; Jinghuan Lv; Feng Guo; Biao Liu
Journal:  Int J Mol Med       Date:  2020-04-27       Impact factor: 4.101

8.  PNO1, which is negatively regulated by miR-340-5p, promotes lung adenocarcinoma progression through Notch signaling pathway.

Authors:  Dongming Liu; Li Lin; Yajie Wang; Lu Chen; Yuchao He; Yi Luo; Lisha Qi; Yan Guo; Liwei Chen; Zhiqiang Han; Guangtao Li; Qiang Li; Zhiyong Liu; Peng Chen; Hua Guo
Journal:  Oncogenesis       Date:  2020-06-01       Impact factor: 7.485

9.  Prognostic Value of DNA Methylation-Driven Genes in Clear Cell Renal Cell Carcinoma: A Study Based on Methylation and Transcriptome Analyses.

Authors:  Maolin Hu; Jiangling Xie; Huiming Hou; Ming Liu; Jianye Wang
Journal:  Dis Markers       Date:  2020-07-11       Impact factor: 3.434

10.  Zc3h12d, a Novel of Hypomethylated and Immune-Related for Prognostic Marker of Lung Adenocarcinoma.

Authors:  Bo Yang; Lin-Lin Ji; Hong-Liang Xu; Xiao-Ping Li; Hong-Gang Zhou; Ting Xiao; Xiao-He Li; Zhou-Yong Gao; Jian-Zhong Li; Wei-Dong Zhang; Guang-Shun Wang; Ming-Jiang Li
Journal:  J Inflamm Res       Date:  2021-06-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.