Literature DB >> 29029488

Integrated analysis of competing endogenous RNA network revealing lncRNAs as potential prognostic biomarkers in human lung squamous cell carcinoma.

Jing Sui1, Si-Yi Xu1, Jiali Han2, Song-Ru Yang3, Cheng-Yun Li1, Li-Hong Yin1, Yue-Pu Pu1, Ge-Yu Liang1.   

Abstract

Accumulating evidence shows the important role of long non-coding RNAs (lncRNAs) in competing endogenous RNA (ceRNA) networks for predicting survival in tumor patients. However, prognostic biomarkers for lung squamous cell carcinoma (LUSC) are still lacking. The objective of this study is to identify a lncRNA signature for evaluation of overall survival (OS) in 474 LUSC patients from The Cancer Genome Atlas (TCGA) database. A total of 474 RNA sequencing profiles in LUSC patients with clinical data were obtained, providing a large sample of RNA sequencing data, and 83 LUSC-specific lncRNAs, 26 miRNAs, and 85 mRNAs were identified to construct the ceRNA network (fold change>2, P<0.05). Among these above 83 LUSC-specific lncRNAs, 22 were assessed as closely related to OS in LUSC patients using a univariate Cox proportional regression model. Meanwhile, two (FMO6P and PRR26) of the above 22 OS-related lncRNAs were identified using a multivariate Cox regression model to construct a risk score as an independent indicator of the prognostic value of the lncRNA signature in LUSC patients. LUSC patients with low-risk scores were more positively correlated with OS (P<0.001). The present study provides a deeper understanding of the lncRNA-related ceRNA network in LUSC and suggests that the two-lncRNA signature could serve as an independent biomarker for prognosis of LUSC.

Entities:  

Keywords:  LUSC; ceRNA network; lncRNA; prognostic biomarker

Year:  2017        PMID: 29029488      PMCID: PMC5630388          DOI: 10.18632/oncotarget.19627

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

Lung cancer remains one of the most frequently diagnosed and fatal cancers globally. In 2012 nearly 1.8 million new cases were diagnosed, causing 1.6 million deaths worldwide, with a sharp rise from 2008 [1]. Non-small cell lung cancer, including lung squamous cell carcinoma (LUSC) and lung adenocarcinoma (LUAD), is the most pathological type (approximate 80%) in lung cancer. Nearly 30% of NSCLC is LUSC, which causes approximately 400,000 deaths annually worldwide, with both high incidence and poor prognosis (5-year survival rate<15%) [2]. Based on tumor node metastasis (TNM) taxonomy, LUSC can be classified into stages I, II, III, and IV [3]. Recent studies show that LUSC is closely associated with smoking and is more common in men than in women [4]. It is important to distinguish between LUSC and LUAD in the management of NSCLC since their therapeutic regimens and targeted agents differ [5]. Thus, identify effective potential molecular biomarkers for distinguishing between LUSC and LUAD is urgent. In the present study, we aim to find effective potential molecular biomarkers for predicting survival in LUSC. Long non-coding RNAs (lncRNAs), ranging from 200 nucleotides to 100 kb in length, can modulate gene expression at the transcriptional, post-transcriptional, and epigenetic levels and are broadly distributed in the genome [6-9]. A growing body of evidence demonstrates that lncRNA expression profiles are different in tumors tissues compared to the adjacent non-tumor tissues in various cancers [10-12], including LUSC [13, 14]. It has been proposed that the differentially expressed lncRNAs may correlate with progression and survival in various cancers, which have also been detected in LUSC [15-19]. In 2011, the ceRNA (competing endogenous RNA) hypothesis was presented as a novel regulatory mechanism between non-coding RNA and coding RNA [20]. The central concept is that RNA interacts with miRNA response elements (MREs); this kind of RNA competition crosstalk also exists between lncRNAs and mRNAs [21]. Although numerous lncRNAs have been determined to predict outcomes for lung cancer, the conclusions of previous studies are inconsistent, possibly due to small sample sizes. Recently, lncRNA expression profiles were obtained from The Cancer Genome Atlas (TCGA) database, an open-access and publicly available large-scale database. In the present study, the TCGA database was first used to obtain lncRNA expression profiles and combined with clinical features to construct a lncRNA-miRNA-mRNA ceRNA network in LUSC. Through an integrated analysis of lncRNA expression patterns in the ceRNA network, we identified a lncRNA signature in LUSC with two lncRNAs (FMO6P and PRR26) as a new candidate indicator with the potential to predict overall survival (OS) in LUSC patients.

RESULTS

Identification of significantly differentially expressed lncRNAs

In 474 LUSC patients from TCGA database, we initially performed differential expression analysis by comparing the expression of 1801 lncRNAs in LUSC and adjacent normal lung tissue in the TCGA database. We set fold change>2 and P value>0.05 as cutoffs to identify significantly differentially expressed lncRNAs. Then we obtained 171 differentially expressed lncRNAs between stages I-II (non-lymphatic metastasis) LUSC and adjacent-normal lung tissue, 161 differentially expressed lncRNAs between stages III-IV (non-lymphatic metastasis) LUSC and adjacent-normal lung tissue, 184 differentially expressed lncRNAs between stages I-II (lymphatic metastasis) LUSC and adjacent-normal lung tissue, and 180 differentially expressed lncRNAs between stages III-IV (lymphatic metastasis) LUSC and adjacent-normal lung tissue (fold change>2, P value<0.05). When we combined these four groups of differentially expressed lncRNAs, 127 differentially expressed lncRNAs (55 up-regulated and 72 down-regulated) showed consistently differential expression (Figures 1A and 2A, Supplementary Table 1) and were thus selected to construct the ceRNA network.
Figure 1

Venn diagram analysis of differentially expressed RNA in LUSC

(A) lncRNAs; (B) miRNAs; (C) mRNAs. Lym, lymphatic metastasis; nLym, non- lymphatic metastasis. Each oval represents a group. The brown intersection in the middle represents RNAs, which are consistently and significantly differentially expressed in four groups.

Figure 2

The differential expression of intersected RNAs in LUSC

(A) lncRNAs; (B) miRNAs; (C) miRNAs. A heatmap showing the differentially expressed RNAs.

Venn diagram analysis of differentially expressed RNA in LUSC

(A) lncRNAs; (B) miRNAs; (C) mRNAs. Lym, lymphatic metastasis; nLym, non- lymphatic metastasis. Each oval represents a group. The brown intersection in the middle represents RNAs, which are consistently and significantly differentially expressed in four groups.

The differential expression of intersected RNAs in LUSC

(A) lncRNAs; (B) miRNAs; (C) miRNAs. A heatmap showing the differentially expressed RNAs.

miRNA predicted targets analysis and ceRNA network construction

A total of 1,030 miRNAs and 18,633 mRNAs were identified from the TCGA database, 101 miRNAs and 4,289 mRNAs of which were found to be differentially expressed between LUSC and adjacent normal lung tissue (fold change>2, P<0.05). In total, 42 differentially expressed miRNAs (17 up-regulated and 25 down-regulated) and 2950 differentially expressed mRNAs (1180 up-regulated and 1770 down-regulated) (shown in Figures 1B, 1C and 2B, 2C; Supplementary Tables 2, 3) were selected to construct the ceRNA network. The next step was to predict the mRNA and lncRNAs targeted by miRNA. We then focused on the targeting relationship between the 42 miRNAs and 2,950 mRNAs as mentioned above. miRTarBase and Targetscan were used to predict miRNA-targeted mRNAs. The result showed that 26 of 42-LUSC specific miRNAs might target to the 85 of 2,950 LUSC-specific mRNAs (Table 1). A few targets are tumor-associated genes such as CDC25A, COL11A1, FZD10, INHBE, ISL1, NAT8L, PAX7, PLAU, SIX4, and TFAP4.
Table 1

miRNAs targeting LUSC specific mRNAs

miRNAmRNA
hsa-miR-130b-3pATP1A2, CHRM2, CSF1, IL6ST, LDLR, LRP2, MAGI1, PPARG, ROBO2, S1PR1, TGFBR2
hsa-miR-130b-5pNRG3, PLCL1
hsa-miR-144-3pTNFSF11, UCK2
hsa-miR-182-5pCACNB4, COL4A4, DKK2, EPAS1, MITF, SGMS2, SORT1, ST6GALNAC3, ZFP36
hsa-miR-183-5pC8B, NTN4, PRKCB, ZEB1
hsa-miR-196a-5pAQP4, COL14A1
hsa-miR-196b-5pCOL14A1
hsa-miR-205-5pACACB, ERBB4, MAGI1, MAGI3, RAB11FIP1, SHROOM3
hsa-miR-210-3pGPD1L
hsa-miR-218-5pBRCA1, GNG4, HOXA10, UGT8
hsa-miR-3065-3pCOL11A1, COL7A1, GRM4, ISL1, PLAU
hsa-miR-30a-3pFANCI, SIX4
hsa-miR-30a-5pGCLC
hsa-miR-30c-2-3pINHBE, NAT8L, PAX7
hsa-miR-30d-3pSIX4
hsa-miR-30d-5pCCNE2
hsa-miR-31-5pFZD4, LATS2, PRKCE, RORC
hsa-miR-326FANCE
hsa-miR-490-3pADCY10
hsa-miR-497-5pCDC25A, FZD10, PAX7, TFAP4
hsa-miR-511-5pCACNA1B, FZD10
hsa-miR-629-3pCOL4A4, DHH, IL6R, KCNJ5, NEGR1, SLC34A2, ZEB1
hsa-miR-708-3pFGR, SHROOM4
hsa-miR-708-5pCNTFR, MASP1
hsa-miR-9-5pANK2, CNTFR, GABRB2, ID4, IL6R, LIFR, MEF2C, SGMS2, SHC3, SHROOM4
hsa-miR-96-5pCACNA1C, CACNA2D2, CACNB4, COL13A1, HBEGF, MAGI1, MAGI3, NR4A3, NTN4, SCNN1G, SLC1A1, SLC26A9, SORT1
Subsequently, the targeting relationships among 127 differentially expressed lncRNAs (Supplementary Table 1) and 26 differentially expressed miRNAs (Table 1) were assessed. In the ceRNA network, lncRNA might interact with the miRNA through MREs. Using miRanda to identify the potential MREs, the result showed these 26 LUSC-specific miRNAs might target to 83 of the 127 LUSC-specific lncRNAs (Table 2).
Table 2

miRNAs that may target LUSC specific lncRNAs

lncRNAmiRNA
ABCC13hsa-miR-130b-3p
ABCC6P1hsa-miR-196b-5p, hsa-miR-31-5p, hsa-miR-96-5p
ABHD11-AS1hsa-miR-182-5p
AFAP1-AS1hsa-miR-511-5p
ALOX12P2hsa-miR-3065-3p, hsa-miR-30a-3p, hsa-miR-30d-3p, hsa-miR-326, hsa-miR-497-5p
C1orf140hsa-miR-182-5p, hsa-miR-196a-5p, hsa-miR-9-5p, hsa-miR-96-5p
C1orf220hsa-miR-3065-3p, hsa-miR-30d-3p, hsa-miR-511-5p
CCL15-CCL14hsa-miR-205-5p
CHIAP2hsa-miR-182-5p, hsa-miR-708-3p
CMAHPhsa-miR-205-5p, hsa-miR-629-3p, hsa-miR-708-3p, hsa-miR-708-5p
CYP1B1-AS1hsa-miR-130b-3p, hsa-miR-708-3p
CYP2B7Phsa-miR-130b-3p, hsa-miR-708-5p
CYP2D7hsa-miR-3065-3p
CYP4Z2Phsa-miR-96-5p
DDX12Phsa-miR-497-5p
DGCR5hsa-miR-326, hsa-miR-490-3p
DIRC3hsa-miR-3065-3p, hsa-miR-30a-3p
DLX6-AS1hsa-miR-30c-2-3p
DNM1P46hsa-miR-205-5p
FAM86JPhsa-miR-218-5p
FAR2P1hsa-miR-3065-3p, hsa-miR-30c-2-3p, hsa-miR-497-5p
FER1L4hsa-miR-3065-3p, hsa-miR-30c-2-3p, hsa-miR-490-3p
FIRREhsa-miR-30a-3p, hsa-miR-30d-3p
FLJ34503hsa-miR-130b-5p, hsa-miR-9-5p
FMO6Phsa-miR-30a-3p
GGTA1Phsa-miR-9-5p
GVINP1hsa-miR-196a-5p, hsa-miR-205-5p, hsa-miR-9-5p
HOTAIRhsa-miR-30a-5p, hsa-miR-326
KC6hsa-miR-326
KIAA0087hsa-miR-708-5p, hsa-miR-9-5p
KRTAP5-AS1hsa-miR-130b-5p
KTN1-AS1hsa-miR-30c-2-3p, hsa-miR-511-5p
LHFPL3-AS2hsa-miR-205-5p
LINC00092hsa-miR-629-3p
LINC00173hsa-miR-30c-2-3p
LINC00261hsa-miR-130b-5p, hsa-miR-182-5p, hsa-miR-196a-5p, hsa-miR-629-3p, hsa-miR-708-5p
LINC00312hsa-miR-708-5p
LINC00319hsa-miR-326, hsa-miR-497-5p
LINC00341hsa-miR-205-5p, hsa-miR-708-5p
LINC00472hsa-miR-205-5p, hsa-miR-96-5p
LINC00482hsa-miR-196a-5p
LINC00704hsa-miR-511-5p
LINC00887hsa-miR-490-3p
LINC00908hsa-miR-130b-3p, hsa-miR-196b-5p
LINC00924hsa-miR-196a-5p
LINC00961hsa-miR-96-5p
LINC00982hsa-miR-130b-3p
LINC01105hsa-miR-130b-3p, hsa-miR-182-5p, hsa-miR-183-5p, hsa-miR-629-3p, hsa-miR-708-5p, hsa-miR-96-5p
LOC100499484-C9ORF174hsa-miR-130b-5p, hsa-miR-708-5p, hsa-miR-96-5p
LOC148709hsa-miR-3065-3p, hsa-miR-326, hsa-miR-497-5p
LOC285629hsa-miR-30c-2-3p
LOC399815hsa-miR-30a-3p, hsa-miR-30d-3p
LOC642846hsa-miR-3065-3p
LOC90246hsa-miR-130b-5p
LOC93429hsa-miR-3065-3p, hsa-miR-30c-2-3p, hsa-miR-326
MEIS3P1hsa-miR-130b-5p
MGC27382hsa-miR-130b-5p
MIR31HGhsa-miR-30c-2-3p, hsa-miR-511-5p
MIR9-3HGhsa-miR-144-3p, hsa-miR-326, hsa-miR-511-5p
MIR99AHGhsa-miR-182-5p, hsa-miR-196a-5p, hsa-miR-31-5p
MSL3P1hsa-miR-30c-2-3p
NAPSBhsa-miR-205-5p
NCF1Bhsa-miR-182-5p
OR7E91Phsa-miR-218-5p, hsa-miR-490-3p
PART1hsa-miR-3065-3p, hsa-miR-30d-3p, hsa-miR-490-3p, hsa-miR-497-5p
PGM5P2hsa-miR-182-5p, hsa-miR-196a-5p, hsa-miR-9-5p
PRR26hsa-miR-130b-5p, hsa-miR-96-5p
PSMG3-AS1hsa-miR-130b-5p, hsa-miR-205-5p, hsa-miR-629-3p, hsa-miR-708-5p, hsa-miR-9-5p
PVT1hsa-miR-326
PWARSNhsa-miR-196b-5p, hsa-miR-31-5p
SCARNA12hsa-miR-3065-3p
SFTA1Phsa-miR-9-5p
SLC6A10Phsa-miR-3065-3p, hsa-miR-326, hsa-miR-497-5p
SNHG4hsa-miR-3065-3p, hsa-miR-497-5p
SOX2-OThsa-miR-30a-3p, hsa-miR-30d-3p, hsa-miR-490-3p
TCAM1Phsa-miR-30c-2-3p, hsa-miR-326, hsa-miR-511-5p
TMPO-AS1hsa-miR-326
TPRXLhsa-miR-30c-2-3p
TPTEP1hsa-miR-708-5p
TRHDE-AS1hsa-miR-9-5p
UMODL1-AS1hsa-miR-210-3p
WDFY3-AS2hsa-miR-31-5p
WWC2-AS2hsa-miR-130b-5p, hsa-miR-205-5p
Based on the above data (Tables 1 and 2), we constructed the lncRNA-miRNA-mRNA ceRNA network with Cytoscape 3.0. To obtain the most reliable results, a maximal information coefficient (MIC) algorithm was employed to screen the pair-wise relationships based on the expression levels of the lncRNA, miRNA, and mRNA in the TCGA database (MIC>0.15, MIC-P2>0.15) [22]. In all, 83 lncRNAs, 26 miRNAs, and 85 mRNAs were involved in the ceRNA network (Figure 3).
Figure 3

The lncRNA-miRNA-mRNA ceRNA network

Blue balls surrounded by green rings = down-regulated lncRNAs; red balls surrounded by green rings = up-regulated lncRNAs; blue diamonds = down-regulated miRNAs; red diamonds = up-regulated miRNAs; blue balls = down-regulated mRNAs; red balls = up-regulated mRNAs.

The lncRNA-miRNA-mRNA ceRNA network

Blue balls surrounded by green rings = down-regulated lncRNAs; red balls surrounded by green rings = up-regulated lncRNAs; blue diamonds = down-regulated miRNAs; red diamonds = up-regulated miRNAs; blue balls = down-regulated mRNAs; red balls = up-regulated mRNAs.

Functional assessment

Furthermore, to identify functions associated with the differentially expressed mRNAs in the ceRNA network, we analyzed these mRNAs with DAVID bioinformatics resources. This process revealed enrichment of 81 KEGG pathways and 433 GO terms (P-value<0.05 and enrichment score>2) (Supplementary Table 4). KEGG pathway analyses identified the most significant pathways as Transcriptional dysregulation in cancer (path ID: 05202) and Protein digestion and absorption (path ID: 04974) (Figure 4, Table 3). Five of the top 15 pathways were tumor-related pathways including Transcriptional dysregulation in cancer, Pathways in cancer, Proteoglycans in cancer, PI3K-Akt signaling pathway, and MAPK signaling pathway. There were also tumor-related pathways such as microRNAs in cancer, Wnt signaling pathway, Hippo signaling pathway, NF-kappa B signaling pathway, Ras signaling pathway, Basal cell carcinoma, and Glioma and Small cell lung cancer in the remaining significantly differentially changed pathways. The top GO terms were Ciliary neurotrophic factor-mediated signaling pathway (GO: 0070120) and Synaptic transmission (GO: 0007268) (Figure 4, Table 3).
Figure 4

Top 15 enrichment of KEGG pathways and GO terms for differentially expressed mRNAs in ceRNA network

Table 3

Top 15 KEEG pathways and GO terms enriched by the coding genes involved in ceRNA network

CategoryTermNo. of genes-lgP
KEGG pathwaysTranscriptional misregulation in cancer910.674
Protein digestion and absorption79.609
Pathways in cancer108.926
Proteoglycans in cancer88.629
Signaling pathways regulating pluripotency of stem cells78.210
GABAergic synapse67.868
Cytokine-cytokine receptor interaction87.720
Retrograde endocannabinoid signaling67.506
Serotonergic synapse67.235
PI3K-Akt signaling pathway86.832
Tight junction66.693
MAPK signaling pathway76.457
Oxytocin signaling pathway66.344
ErbB signaling pathway56.160
Morphine addiction56.062
GOciliary neurotrophic factor-mediated signaling pathway410.121
synaptic transmission97.745
negative regulation of neuron apoptotic process67.667
positive regulation of transcription from RNA polymerase II promoter117.644
positive regulation of cell proliferation97.490
negative regulation of apoptotic process96.882
collagen catabolic process56.575
innate immune response96.384
extracellular matrix disassembly56.371
brain development66.0965
positive regulation of smooth muscle cell proliferation45.831
axon guidance75.819
response to estrogen stimulus45.376
positive regulation of osteoblast differentiation45.248
regulation of blood pressure45.158

Construction of the LUSC-specific lncRNA-based prognostic signature

The 83 specific lncRNAs in the ceRNA network were further analyzed according to clinical features including gender, tumor stage, TNM staging system, lymph node metastasis, and patient outcome assessment at diagnosis in TCGA database. There were 25 LUSC-specific lncRNAs, the expression levels of which were significantly differentially expressed in clinical feature comparisons (P<0.05; Table 4). Seventeen lncRNAs were differentially expressed in gender, three in tumor stage, three in TNM staging system, six in lymphatic metastasis, and four were differentially expressed in patient outcome assessment.
Table 4

The correlations between LUSC specific lncRNAs from ceRNA network and clinical features

ComparisonsDown-regulatedUp-regulated
Gender (Female vs. Male)CMAHP, LINC01105, LINC00261, NAPSB, UMODL1-AS1, KRTAP5-AS1, KRTAP5-AS1, GVINP1, MIR99AHGLOC399815, KTN1-AS1, TPRXL, KC6, SLC6A10P, LINC00173, DDX12P, FAR2P1
Tumor stage (I-II vs. III-IV)TCAM1P, KTN1-AS1, CYP2D7
TNM staging system (T1 + T2 vs. T3 + T4)CMAHPKTN1-AS1, TCAM1P
Lymphatic metastasis (No vs. Yes)CYP2B7P, PGM5P2, LOC100499484-C9ORF174LOC399815, TMPO-AS1, PART1
Patient outcome assessment (Dead vs. Alive)MIR99AHG, PRR26, CCL15-CCL14CYP2D7
Univariate Cox proportional hazards regression showed that 22 of the 83 differentially expressed lncRNAs in the ceRNA network were identified to have a significant prognostic value (Table 5). A multivariate Cox proportional hazards regression analysis indicated that only two lncRNAs showed a significant prognostic value for LUSC: KRTAP5-AS1 and SOX2-OT (Figure 5). The risk score for predicting the OS was constructed with the formula: Risk score=expFMO6P * (-0.401) + expPRR26 * 0.399.
Table 5

Prognostic value of the differentially expressed lncRNAs by univariate cox regression analysis

LncRNAEstimateStdErrChiSqPHazard ratio ( 95%CI)
ABCC130.3670.1535.7380.017*1.444 (1.069-1.950)
ABHD11-AS1-0.3050.1530.39800.046*0.737 (0.546-0.995)
C1orf220-0.3150.1544.2080.040*0.729 (0.504-0.986)
CCL15-CCL140.3630.1535.6200.018*1.437 (1.065-1.939)
CHIAP20.4590.1548.8630.003*1.583 (1.170-2.141)
DDX12P-0.3200.1544.3490.037*0.726 (0.537-0.981)
DIRC3-0.4370.1538.1370.004*0.646 (0.478-0.872)
FMO6P-0.4860.15410.0410.002*0.615 (0.455-0.831)
KC6-0.3060.1534.0160.045*0.736 (0.546-0.993)
KIAA00870.4940.15410.3480.001*1.639 (1.213-2.214)
KRTAP5-AS10.4090.1547.0890.008*1.505 (1.114-2.035)
KTN1-AS1-0.4850.1549.9230.002*0.616 (0.455-0.833)
LINC002610.3900.1546.3970.011*1.478 (1.092-2.000)
LINC00472-0.2990.1523.8590.049*0.742 (0.550-0.999)
LINC009610.4350.1548.0020.005*1.545 (1.143-2.088)
PART1-0.4660.1539.2690.002*0.627 (0.465-0.847)
PRR260.5050.15510.6570.001*1.657 (1.224-2.244)
PVT1-0.3340.1524.8030.028*0.716 (0.531-0.965)
SOX2-OT-0.4730.1549.4870.002*0.623 (0.461-0.842)
TCAM1P0.3660.1535.6890.017*1.442 (1.067-1.948)
TMPO-AS1-0.3700.1535.8280.016*0.691 (0.512-0.933)
UMODL1-AS10.3020.1533.9050.048*1.352 (1.002-1.824)

*: P<0.05.

Figure 5

Two differentially expressed lncRNAs (FMO6P and PRR26)

(A) The expression levels of two lncRNAs in the LUSC tissues compared with adjacent normal tissues. (B) Kaplan-Meier curves showing the relationship between the two lncRNAs and OS. The cases were divided into under- and over-expression groups by the mean lncRNAs level. (C) ROC curves of the two lncRNAs to distinguish LUSC tissue from adjacent normal tissues.

*: P<0.05.

Two differentially expressed lncRNAs (FMO6P and PRR26)

(A) The expression levels of two lncRNAs in the LUSC tissues compared with adjacent normal tissues. (B) Kaplan-Meier curves showing the relationship between the two lncRNAs and OS. The cases were divided into under- and over-expression groups by the mean lncRNAs level. (C) ROC curves of the two lncRNAs to distinguish LUSC tissue from adjacent normal tissues. Meanwhile, based on the individual inflection point of the prognostic risk score, LUSC patients were divided into low-risk and high-risk groups (Figure 6). The risk score widely predicted 5-year survival of LUSC patients, as the area under ROC curve (AUC) was 0.694 (Figure 7A). Furthermore, K-M curves confirmed that low-risk was more positively correlated with OS (P<0.001, Figure 7B).
Figure 6

Risk score analysis of the differentially expressed lncRNA signature of LUSC

Survival status and duration of cases (Top); risk score of lncRNA signature (Middle); low and high score groups for the 2 lncRNAs (Bottom).

Figure 7

The two-differentially expressed lncRNA signature of LUSC for outcome

(A) The risk score shown by the time-dependent ROC curve for predicting 5-year survival. (B) The Kaplan-Meier test of the risk score for the OS.

Risk score analysis of the differentially expressed lncRNA signature of LUSC

Survival status and duration of cases (Top); risk score of lncRNA signature (Middle); low and high score groups for the 2 lncRNAs (Bottom).

The two-differentially expressed lncRNA signature of LUSC for outcome

(A) The risk score shown by the time-dependent ROC curve for predicting 5-year survival. (B) The Kaplan-Meier test of the risk score for the OS. Moreover, the prognostic value of different clinical features was also evaluated. The univariate Cox proportional hazards regression showed that some clinical features predicted poorer survival of patients with LUSC (Table 6).However, when analyzed by a multivariate Cox regression model, gender, cancer neoplasms, and risk score were the independent prognostic indictors of LUSC (Table 6). The Kaplan-Meier curves of the above clinical features are shown in Figure 8. The results revealed that Tumor stage, T stage, M stage, residual tumor, cancer neoplasm, and primary therapy outcome were associated with OS (P=0.002, P=0.004, P=0.009, P=0.005, P=0.005, and P<0.001, respectively).
Table 6

The predictive values of related clinical features and risk score

VariablesPatient N=429Univariate analysisMultivariate analysis
HR(95% CI)PHR(95% CI)P
RaceWhite2921(reference)1(reference)
Asian291.414(0.859-2.327)0.1731.061(0.589-1.912)0.844
GenderFemale1111(reference)1(reference)
Male3111.235(0.863-1.766)0.2492.663(1.296-5.471)0.008*
Age<=652481(reference)1(reference)
>651691.344(0.983-1.838)0.0860.996(0.588-1.689)0.989
Tumor stageI2021(reference)1(reference)
II1351.015(0.710-1.452)0.9331.225(0.591-2.541)0.585
III781.610(1.106-2.343)0.013*0.474(0.181-1.242)0.129
IV73.464(1.393-8.613)0.007*0.966(0.213-4.381)0.964
T stageT1951(reference)1(reference)
T22431.255(0.849-1.855)0.2550.859(0.473-1.558)0.617
T3631.771(1.081-2.899)0.023*1.830(0.832-4.028)0.133
T4212.470(1.302-4.687)0.006*1.865(0.580-6.001)0.296
N stageN02661(reference)1(reference)
N11161.080(0.770-1.514)0.6561.186(0.454-3.097)0.728
N2341.378(0.834-2.277)0.2110.707(0.098-5.086)0.731
N352.713(0.665-11.067)0.164
M stageM04151(reference)
M173.119(1.274-7.633)0.013*
Atomic neoplasm subdivisionLeft lung1791(reference)
Right lung2201.050(0.773-1.426)0.7540.868(0.499-1.509)0.617
Bronchia90.395(0.097-1.609)0.1951.760(0.167-18.498)0.638
Primary therapy outcomeComplete remission2591(reference)1(reference)
Stable disease143.166(1.523-6.579)0.002*2.881(0.908-9.143)0.073
Progressive disease224.901(2.876-8.351)<0.001*1.778(0.829-3.814)0.139
Partial remission63.579(1.304-9.823)0.013*0.747(0.167-3.336)0.702
RadiotherapyNO3181(reference)1(reference)
YES451.427(0.915-2.225)0.1170.818(0.321-2.084)0.674
Neoplasm cancerTumor free2761(reference)1(reference)
With tumor874.229(2.994-5.976)<0.001*3.116(1.884-5.154)<0.001*
Residual tumorR03451(reference)1(reference)
R1+R2133.135(1.361-7.224)0.007*1.561(0.198-12.338)0.673
Risk scoreLow2121(reference)1(reference)
High2101.815(1.341-2.456)<0.001*3.116(1.884-5.154)0.001*

HR: hazard ratio; CI: confidence interval; *: P<0.05.

Figure 8

The prognostic value of different clinical features for OS of LUSC patients

Kaplan-Meier curves of seven independent prognostic indictors. SD, stable disease; PD, progressive disease; CR, complete remission; PR, partial response.

HR: hazard ratio; CI: confidence interval; *: P<0.05.

The prognostic value of different clinical features for OS of LUSC patients

Kaplan-Meier curves of seven independent prognostic indictors. SD, stable disease; PD, progressive disease; CR, complete remission; PR, partial response. In addition, we assessed the relationship between the risk score based on the differentially expressed lncRNAs signature and various clinical features, and the risk score showed prognostic value for predicting the status of tumor stage, T stage, metastasis, and neoplasm (Figure 9). The expression patterns of these two differentially expressed lncRNAs in the low-score and high-score groups were given in Figure 10.
Figure 9

The predictive value of the risk score for clinical features

ROC curve predicting different clinical features.

Figure 10

The expression level of the two lncRNAs in the low- and high-risk groups

The difference in the expression level of FMO6P and PRR26 between the low-risk and high-risk groups. *P<0.05.

The predictive value of the risk score for clinical features

ROC curve predicting different clinical features.

The expression level of the two lncRNAs in the low- and high-risk groups

The difference in the expression level of FMO6P and PRR26 between the low-risk and high-risk groups. *P<0.05.

DISCUSSION

To define lncRNAs significantly related to OS, we constructed a lncRNA-miRNA-mRNA ceRNA network according to the specific criteria in a large sample of LUSC patients, based on information from the TCGA database. A univariate Cox proportional hazards regression with the significance level set at 0.05 was first performed on 83 differentially expressed lncRNAs from the ceRNA network. A total of 22 lncRNAs were identified as associated with OS. A multivariate Cox proportional hazards regression analysis indicated that FMO6P and PRR26 showed a significant prognostic value for LUSC patients’’ survival. We then developed a risk score by combining the two lncRNAs and found that this two-lncRNA signature independently predicted OS in LUSC patients, which was further validated in LUSC patients. To our knowledge, this is the first study combining a ceRNA network with TCGA data to assess the survival of LUSC patients by constructing a lncRNA-related risk score. Lung cancer produces the most lethal solid malignancies, and the role of lncRNAs in the genesis and development— and thus diagnosis and prognosis —of lung cancer has been widely studied [23-26]. In addition to the metastatic behavior of lung cancer, lack of precise and accurate biomarkers for diagnosis and prognosis may also contribute strongly to the low survival rate (< 15%). Therefore, there is a pressing need for potential and reliable prognostic biomarkers that predict lung cancer outcomes. In previous studies, expression of miRNA and mRNA have been examined to reveal characteristics associated with lung cancer outcome [27-30]. Differential expression of lncRNAs has rapidly emerged in various studies of tumorigenesis and cancer progression, and the dysregulated lncRNAs play key roles in cancer prognosis. There is accumulating evidence that lncRNAs might be a pivotal component in the ceRNA network by modulating other RNA transcripts [31-33]. CHIAP2 was detected in the lung cancer-related ceRNA network [34]. Snhg1 promoted cell proliferation by acting as a sponge for the tumor suppressor miR-338 in esophageal cancer cells [35]. Wang et al. [36] affirmed that TUG1 might affect ROCK1 expression, which could mediate migration/invasion to influence prognosis, by working as a ceRNA manner by targeting miR-335-5p. The up-regulation of AFAP1-AS1 was associated with poor prognosis in NSCLC patients [37]. Therefore, potential connections among lncRNA, miRNA, and mRNA might exist in the genesis and development of LUSC. In the present study, we first used the TCGA database to construct a ceRNA network to reveal a novel regulatory network in LUSC. Some cancer-specific lncRNAs, such as ABCC13, HOTAIR, LINC00472, and FER1L4 have also been reported to act as potential diagnostic and prognostic biomarkers in cancers [38-41]. In the present study, we sought to determine whether the LUSC-specific lncRNAs in the ceRNA network were indirectly related to mRNA signal pathways. The results of this pathway analysis showed that there were a few pathways related to cancer. Hence, our results suggest that specific lncRNAs may play crucial roles in the genesis and development of LUSC. Based on public data from large-scale samples, the prognostic value of lncRNAs has been evaluated in various cancers, including lung adenocarcinoma [42], gastric cancer [43], urothelial carcinoma [44], colorectal cancer [45], intrahepatic cholangiocarcinoma [46], and breast cancer [47]. One recent study evaluated the prognostic value of miRNAs and mRNAs in LUSC. Using the TCGA database, Gao et al. [48] found that 12 of the 133 most significantly altered miRNAs were associated with OS of LUSC After a comprehensive analysis, a seven-miRNA signature for prediction of OS in LUSC patients was established. Similarly, multivariable Cox regression analysis indicated that up-regulated expression of ASCL2 was an independent prognostic factor for OS (HR=2.764, P<0.05) in LUSC patients [49]. However, the relationship between differentially expressed lncRNAs and survival in LUSC patients has not been studied in large-scale samples using comprehensive analysis. Unlike previous studies, we combined the ceRNA network with TCGA data to analyze the lncRNAs in LUSC. Since then, we determined that significantly differential expression of two novel lncRNAs (FMO6P and PRR26) could be a novel independent risk factor for LUSC. Moreover, the risk score based on these two lncRNAs could be a new indicator for the prognosis of LUSC patients. However, there is no report on the association between these two lncRNAs and disease. Moreover, no study has investigated the function of the above two lncRNAs. Here, we constructed a lncRNA-miRNA-mRNA network to discover the relevant genes of these two lncRNAs. The FMO6P-related genes were enriched in the Fanconi anemia pathway and cancer transcriptional misregulation. Furthermore, the PRR26-related genes were enriched in Transcriptional misregulation in cancer, the MAPK signaling pathway, in tight-junction Proteoglycans in cancer, and in Protein digestion and absorption, most of which are classical signaling pathways closely related to the genesis and progression of cancer. For example, the MAPK signaling pathway has been proposed to be associated with the occurrence, invasion, and metastasis of LUSC [50]. Therefore, functional enrichment analysis may elucidate the role of FMO6P and PRR26 in carcinogenesis of LUSC. Although the findings of the present study might have significant clinical implications, several limitations should be considered. First, a longer follow-up time is needed to validate our findings. Second, apart from data from the TCGA database, other experimental methods are required to verify the present findings. Third, the function of FMO6P and PRR26 in LUSC needs to be further studied. In conclusion, the present study identified a two-lncRNA signature as a potential outcome predictor for LUSC patients via analyzing the genome-wide lncRNA expression profiles from TCGA using a ceRNA network. Future functional investigations are required to explore the mechanisms underlying the roles of these lncRNAs in LUSC.

MATERIALS AND METHODS

TCGA dataset and patient information

RNA sequencing (RNA-Seq) data from 504 individuals with LUSC were obtained from the TCGA data portal (https://portal.gdc.cancer.gov/). Exclusion criteria were as follows: 1) histologic diagnosis ruled out LUSC; 2) another malignancy besides LUSC. Overall, 474 LUSC patients were included in this study, including data from 429 LUSC tissue samples and 45 non-tumorous adjacent-normal lung tissue samples up to February 12, 2017. Clinical data, including outcome and staging information on TCGA LUSC patients, were also downloaded from the Data Coordinating Center. Of those 474 patients, there were 159 LUSC patients with lymphatic metastasis and 270 LUSC patients with non-lymphatic metastasis. According to the TNM stage, 343 patients were identified as having well- or moderately differentiated LUSC (stage I-II), and the remaining 86 patients had poorly differentiated LUSC (stage III-IV). Since the data were obtained from the TCGA database, additional approval from the Ethics Committee was not needed. Data processing procedures met the requirements of the data access policies and NIH TCGA human subject protection (http://cancergenome.nih.gov/publications/publicationguidelines).

Identification of differentially expressed RNAs in LUSC samples

The expression of human RNA (lncRNA, mRNA, and miRNA) in LUSC samples was analyzed using RNASeqV2 and Illumina HiSeq 2000 miRNA sequencing platforms. Level 3 RNA expression data were collected from TCGA Data Portal and normalized by TCGA [51]. To detect the differential expression of RNA, samples were divided into tumor tissues vs. adjacent non-tumor tissues, stage I-II vs. stage III-IV, and lymphatic metastasis vs. non-lymphatic metastasis, respectively. The flow chart for bioinformatics analysis was given in Figure 11. Since many RNAs were not expressed in particular tissue types or showed little variation among patients in the TCGA database, only RNAs expressed in at least two normal or tumor samples, with at least 100 counts per million were retained in the profile. Expression differences were characterized by fold change and associated P-values. Fold change indicates the difference in expression of each RNA from LUSC to adjacent non-tumor tissues. Up-regulated and down-regulated RNA were assigned fold changes >2 and <0.5, respectively, with FDR-adjusted P<0.05.
Figure 11

Flow chart of bioinformatics analysis

Construction of the ceRNA network and functional assessment

Based on the relationship among lncRNA, mRNA, and miRNA, the ceRNA network was constructed in three steps: i) LUSC-specific RNA (lncRNA, mRNA, and miRNA) filtration: Up-regulated and down-regulated LUSC-specific RNAs were assigned fold changes >2 and <0.5, respectively, with P<0.05. To maximize data reliability, LUSC-specific lncRNAs not registered in GENCODE (http://www.gencodegenes.org/) were abandoned; ii) miRTarBase (http://mirtarbase.mbc.nctu.edu.tw/) and Targetscan (http://www.targetscan.org/) were used to predict the mRNAs targeted by miRNAs; iii) lncRNA-miRNA interactions were predicted using miRanda tools (http://www.microrna.org/microrna/home.do). Furthermore, miRNAs that negatively regulated expression of both lncRNAs and mRNAs were selected for construction of the ceRNA network. Cytoscape v3.0 was used to construct and visualize the ceRNA network [52]. To understand the underlying pathways and biological processes of differentially expressed genes in the ceRNA network, the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (http://david.abcc.ncifcrf.gov/) [53] was employed for functional enrichment analysis, in which KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways and GO (Gene Ontology) biological processes were interested at significance level of P<0.05 and an enrichment score>1.5. Based on the ceRNA network, LUSC-specific lncRNAs were selected, and the expression level of each lncRNA was log2 transformed for further analysis. The univariate Cox proportional hazards regression model with a significance level set at 0.05 was used to analyze LUSC-specific lncRNAs associated with OS. The prognostic risk score for predicting OS was calculated as previously reported: Risk score=explncRNA1 *βlncRNA1+explncRNA2*βlncRNA2+…explncRNA*βlncRNAn (exp: expression level, β: the regression coefficient derived from the multivariate Cox regression model) [54]. Utilizing the median risk score as the cutoff point, LUSC patients were divided into high-score and low-score groups [55]. Univariate Cox proportional hazards regression analyses were further conducted to investigate the effects of various clinical characteristics and the risk score on the OS of LUSC patients. We further analyzed clinical features including gender, tumor stage, TNM staging system, lymphatic metastasis, and patient outcome assessment with lncRNAs in the ceRNA network. The relationship between the LUSC-specific lncRNAs and clinical features was examined using Student’s t-test. Univariate Cox proportional hazards regression analyses were further performed to investigate the effects of various clinical features and risk score on the OS of LUSC. The hazard ratio (HR) and 95% confidence interval (CI) were assessed. HRs for a two-fold change in gene expression from univariate Cox regression analysis were used to identify LUSC-specific lncRNAs associated with OS. lncRNAs defined as having a protective signature showed HR<1 and those defined as high-risk had HR for death>1. A time-dependent receiver operating characteristic (ROC) curve analysis within five years as the defining point was performed with the R package “survival-ROC,” to evaluate the predictive value of the risk score for time-dependent outcomes [56]. Kaplan-Meier and log-rank methods (Mantel-Haenszel test) were used to test the equality of survival distributions in different groups subjected to comparison using IBM SPSS Statistics Version 21 software (SPSS Inc., Chicago, IL, USA). The operating characteristic curve was used to evaluate LUSC-specific lncRNAs for their sensitivity and specificity for detection of LUSC.
  56 in total

1.  Prognostic significance of long noncoding RNA Z38 as a candidate biomarker in breast cancer.

Authors:  Zhen-Lin Nie; Yi-Shan Wang; Yan-Ping Mei; Xin Lin; Guo-Xing Zhang; Hui-Ling Sun; Yi-Lian Wang; Yong-Xiang Xia; Shu-Kui Wang
Journal:  J Clin Lab Anal       Date:  2017-03-01       Impact factor: 2.352

2.  lncRNA-HIT promotes cell proliferation of non-small cell lung cancer by association with E2F1.

Authors:  L Yu; F Fang; S Lu; X Li; Y Yang; Z Wang
Journal:  Cancer Gene Ther       Date:  2017-04-21       Impact factor: 5.987

3.  Activation of LTBP3 gene by a long noncoding RNA (lncRNA) MALAT1 transcript in mesenchymal stem cells from multiple myeloma.

Authors:  Bingzong Li; Ping Chen; Jing Qu; Lei Shi; Wenyue Zhuang; Jinxiang Fu; Jun Li; Xiaohui Zhang; Yu Sun; Wenzhuo Zhuang
Journal:  J Biol Chem       Date:  2014-09-03       Impact factor: 5.157

Review 4.  Competing endogenous RNA interplay in cancer: mechanism, methodology, and perspectives.

Authors:  Dong-Liang Cheng; Yuan-Yuan Xiang; Li-juan Ji; Xiao-Jie Lu
Journal:  Tumour Biol       Date:  2015-01-22

5.  The hormone response element mimic sequence of GAS5 lncRNA is sufficient to induce apoptosis in breast cancer cells.

Authors:  Mark R Pickard; Gwyn T Williams
Journal:  Oncotarget       Date:  2016-03-01

6.  Therapeutic targeting using tumor specific peptides inhibits long non-coding RNA HOTAIR activity in ovarian and breast cancer.

Authors:  Ali R Özeş; Yinu Wang; Xingyue Zong; Fang Fang; Jay Pilrose; Kenneth P Nephew
Journal:  Sci Rep       Date:  2017-04-18       Impact factor: 4.379

7.  Long non-coding RNA 00312 regulated by HOXA5 inhibits tumour proliferation and promotes apoptosis in Non-small cell lung cancer.

Authors:  Qingqing Zhu; Tangfeng Lv; Ying Wu; Xuefei Shi; Hongbing Liu; Yong Song
Journal:  J Cell Mol Med       Date:  2017-03-24       Impact factor: 5.310

8.  Long intergenic non-protein coding RNA 00858 functions as a competing endogenous RNA for miR-422a to facilitate the cell growth in non-small cell lung cancer.

Authors:  Shao-Ping Zhu; Jun-Yu Wang; Xian-Guo Wang; Jin-Ping Zhao
Journal:  Aging (Albany NY)       Date:  2017-02-06       Impact factor: 5.682

9.  BRB-ArrayTools Data Archive for human cancer gene expression: a unique and efficient data sharing resource.

Authors:  Yingdong Zhao; Richard Simon
Journal:  Cancer Inform       Date:  2008-04-21

10.  Long non-coding RNA Fer-1-like protein 4 suppresses oncogenesis and exhibits prognostic value by associating with miR-106a-5p in colon cancer.

Authors:  Ben Yue; Bo Sun; Chenchen Liu; Senlin Zhao; Dongyuan Zhang; Fudong Yu; Dongwang Yan
Journal:  Cancer Sci       Date:  2015-09-21       Impact factor: 6.716

View more
  16 in total

1.  Systematic analysis identifies three-lncRNA signature as a potentially prognostic biomarker for lung squamous cell carcinoma using bioinformatics strategy.

Authors:  Jing Hu; Lutong Xu; Tao Shou; Qiang Chen
Journal:  Transl Lung Cancer Res       Date:  2019-10

2.  Prediction of Specific Subtypes and Common Markers of Non-Small Cell Lung Cancer Based on Competing Endogenous RNA Network.

Authors:  Yao Liu; Hao Wang; Wenhan Yang; Youhui Qian
Journal:  Med Sci Monit       Date:  2020-07-13

3.  MicroRNAs as biomarkers for the progression and prognosis of colon carcinoma.

Authors:  Hui Zhang; Zhuo Wang; Rong Ma; Jianzhong Wu; Jifeng Feng
Journal:  Int J Mol Med       Date:  2018-07-25       Impact factor: 4.101

4.  Integrated analysis of long noncoding RNA associated-competing endogenous RNA as prognostic biomarkers in clear cell renal carcinoma.

Authors:  Hang Yin; Xiaoyuan Wang; Xue Zhang; Yan Wang; Yangyang Zeng; Yudi Xiong; Tianqi Li; Rongjie Lin; Qian Zhou; Huan Ling; Fuxiang Zhou; Yunfeng Zhou
Journal:  Cancer Sci       Date:  2018-09-27       Impact factor: 6.716

5.  HOTTIP Functions as a Key Candidate Biomarker in Head and Neck Squamous Cell Carcinoma by Integrated Bioinformatic Analysis.

Authors:  Xiteng Yin; Weidong Yang; Junqi Xie; Zheng Wei; Chuanchao Tang; Chuanhui Song; Yufeng Wang; Yu Cai; Wenguang Xu; Wei Han
Journal:  Biomed Res Int       Date:  2019-03-26       Impact factor: 3.411

6.  Construction and Investigation of a lncRNA-Associated ceRNA Regulatory Network in Cholangiocarcinoma.

Authors:  Junyu Long; Jianping Xiong; Yi Bai; Jinzhu Mao; Jianzhen Lin; Weiyu Xu; Hui Zhang; Shuguang Chen; Haitao Zhao
Journal:  Front Oncol       Date:  2019-08-09       Impact factor: 6.244

7.  Comprehensive analysis of a long noncoding RNA-associated competing endogenous RNA network in colorectal cancer.

Authors:  Qiaowei Fan; Bingrong Liu
Journal:  Onco Targets Ther       Date:  2018-05-01       Impact factor: 4.147

8.  Identification of potential prognostic long non‑coding RNA signatures based on a competing endogenous RNA network in lung adenocarcinoma.

Authors:  Xiaojuan Wang; Yawen Ding; Bangming Da; Yan Fei; Gang Feng
Journal:  Oncol Rep       Date:  2018-09-21       Impact factor: 3.906

9.  Molecular characterization of papillary thyroid carcinoma: a potential three-lncRNA prognostic signature.

Authors:  Xin You; Sheng Yang; Jing Sui; Wenjuan Wu; Tong Liu; Siyi Xu; Yanping Cheng; Xiaoling Kong; Geyu Liang; Yongzhong Yao
Journal:  Cancer Manag Res       Date:  2018-10-08       Impact factor: 3.989

10.  Identification of key genes and evaluation of clinical outcomes in lung squamous cell carcinoma using integrated bioinformatics analysis.

Authors:  Yangfeng Shi; Yeping Li; Chao Yan; Hua Su; Kejing Ying
Journal:  Oncol Lett       Date:  2019-09-30       Impact factor: 2.967

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.