Literature DB >> 30272355

Identification of potential prognostic long non‑coding RNA signatures based on a competing endogenous RNA network in lung adenocarcinoma.

Xiaojuan Wang1, Yawen Ding1, Bangming Da1, Yan Fei1, Gang Feng1.   

Abstract

A number of experimental and computational studies have demonstrated the key roles of long non‑coding RNAs (lncRNAs) acting as competing endogenous RNAs (ceRNAs) in the tumorigenesis of lung adenocarcinoma (LUAC). However, there remains a requirement for prognostic candidate biomarkers acting as ceRNAs for the prediction of overall survival in patients with LUAC. The main goal of the present study was to identify novel lncRNAs associated with LUAC overall survival and assess their prognostic values. The study analyzed coding RNA and ncRNA expression profiles of patients with LUAC by retrieving existing RNA‑sequencing datasets from The Cancer Genome Atlas database, and 2,507 differentially expressed mRNAs, 1,633 lncRNAs and 113 miRNAs were screened from patients with LUAC compared with those of adjacent normal samples (P<0.01 and |logFC|>2). Of these LUAC‑specific RNAs, 134 lncRNAs, 21 miRNAs and 34 mRNAs were used to build an lncRNA‑mRNA‑miRNA ceRNA network, among which 8 lncRNAs and 9 mRNAs were associated with overall survival in patients with LUAC by acting as ceRNAs. Next, an lncRNA‑based prognostic signature was constructed by risk scoring approach based on the expression levels of 9 prognosis‑associated lncRNAs using Cox's regression analysis. Moreover, the prognostic capacity of the 9‑lncRNA signature was independent of known clinical prognostic factors. These results provide novel insight into the potential of lncRNA ceRNAs to be candidate biomarkers associated with LUAC overall survival.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30272355      PMCID: PMC6196588          DOI: 10.3892/or.2018.6719

Source DB:  PubMed          Journal:  Oncol Rep        ISSN: 1021-335X            Impact factor:   3.906


Introduction

Lung adenocarcinoma (LUAC) is the leading cause of cancer-associated mortality worldwide and is one major subtype of non-small cell lung cancer, defined by distinct pathological characteristics, including mixed subtype, acinar, papillary and lepidic patterns, and the solid predominant subtype with mucin production (1,2). As the most common type of lung cancer, accounting for 40% of all non-small cell lung cancer cases as determined by the World Health Organization in 2012, the incidence of LUAC is on the rise mainly in women and non-smokers (3,4). The 5-year overall survival rate is ~15%, but has not improved in recent years. Since approximately two-thirds of LUAC patients are diagnosed at advanced cancer stages, and local or distant tumor recurrence can frequently present following surgical resection, the prognosis is poor for the majority of patients. Therefore, identifying LUAC at earlier pathological stages can greatly reduce overall mortality rates. Given that adenocarcinoma is more difficult to detect by clinical approaches, including bronchoscopy, sputum cytology and computed tomography, the major obstacle in LUAC management is the lack of an adequate method for its early detection and prognosis. Non-coding RNAs (ncRNAs) have become increasingly relevant targets of study due to their specialized and well-adapted biological roles in tumor development (5). Generally, ncRNAs can be divided into two major classes based on their size: Small ncRNAs and long ncRNAs (lncRNAs). Small ncRNAs consist of several subtypes, including microRNAs (miRNAs/miRs), ribosomal RNAs, small nucleolar RNAs and transfer ribonucleic acids (6). An ever-increasing body of evidence demonstrates the key role of miRNAs in tumor biology contributing to tumorigenesis by modulating oncogenic and tumor suppressor pathways (7–9). However, research on lncRNAs is in its infancy compared with miRNA research. Importantly, lncRNAs have been implicated in several biological processes from pluripotency to immune responses, and are predicted to be involved in more complex mechanisms such as tumor regulation (10,11). One of the best-studied lncRNAs, X-inactive specific transcript, is involved in the development of several cancer types through recruitment of chromatin-modifying complexes to inactivate an entire chromosome in the majority of cells (12). Since ncRNAs serve various important roles in tumor development, interactions between miRNAs and lncRNAs have become an area of focus for the identification of putative ncRNA biomarkers for tumor prognosis. As our understanding of the transcriptome space has expanded and the development of RNA-sequencing technology has taken place, a novel hypothesis known as the competing endogenous RNA (ceRNA) hypothesis has emerged in recent years (13,14). One lncRNA, hepatocellular carcinoma upregulated lncRNA, has been shown to be one of the most clearly overexpressed ncRNAs in hepatocellular carcinoma, and contains miR-372-binding sites to reduce miR-372 expression and activity (15). Another lncRNA and ceRNA, papillary thyroid carcinoma susceptibility candidate 3, has been identified to be downregulated in thyroid cancer and mediates the expression of miR-574-5p (16). In addition to lncRNA ceRNAs, certain miRNAs and mRNAs also have ceRNA capacity. Several lncRNA ceRNAs have been found to be involved in the diagnosis and prognosis of patients with lung tumors (17,18). Nevertheless, the prognostic value of lncRNA ceRNAs in LUAC has not yet been fully investigated. In the current study, to identify LUAC-specific lncRNAs involved in ceRNA crosstalk, RNA-sequencing data and clinical data were obtained from The Cancer Genome Atlas (TCGA) database and an lncRNA-mRNA-miRNA ceRNA network was constructed. Combined with survival analysis, analyses of these data identified a 9-lncRNA signature (LASiglnc-9) with prognostic value to predict overall survival in patients with LUAC.

Materials and methods

Data source and patient information

All RNA expression data and patient clinical data were obtained from TCGA Data Portal (https://portal.gdc.cancer.gov), which is open-access and publicly available. LUAC-related RNA-sequencing data were downloaded with the key words ‘lung adenocarcinoma’ and ‘RNA-seq’. A total of 594 LUAC patients were included and sample exclusion criteria were follows: i) Patients who were not histologically diagnosed with LUAC; ii) patients who suffered from one or more malignancies besides LUAC; and iii) samples without complete data. Gene expression profiles for 535 tumor samples and 59 adjacent non-tumor samples, and miRNA expression data for 521 LUAC samples and 46 adjacent normal samples were obtained. In addition, clinical data for 482 LUAC patients, including 260 male and 222 female patients, were also downloaded from TCGA Data Coordinating Center. There were 170 patients with lymphatic metastasis and 312 patients with non-lymphatic metastasis. Additionally, 164 patients presented with distant organ metastases and 318 patients presented with non-distant metastasis. Patients were classified as stage I–II (well and moderately differentiated LUAC, n=377) and stage III–IV (poorly differentiated LUAC, n=105) according to guidelines from the Union for International Cancer Control (19).

Differential expression analysis of LUAC data

To determine the differential expression of mRNAs, lncRNAs and miRNAs between tumor and adjacent normal tissues in LUAC samples, a Bioconductor package edgeR (version 3.6) (20) was used for the gene differential expression analysis. A P-value of <0.01 and |logFC|>2 were set as the cut-off criteria. Volcano plots were drawn using gplots package (version 3.0.1; http://cran.r-project.org/web/packages/gplots/index.html). Heatmaps were constructed using pheatmap package (version 1.0.8; http://cran.r-project.org/web/packages/pheatmap/index.html).

ceRNA network construction and functional annotation

Considering the important role of interactions between lncRNAs, mRNAs and miRNAs in tumorigenesis and development, the ceRNA networks of LUAC were constructed based on three steps: i) LUAC-specific lncRNAs with an absolute P-value of <0.01 and |logFC|>2 were retained; ii) miRcode online tool (http://www.mircode.org) was applied to predict potential target miRNAs of differentially expressed lncRNAs and to predict lncRNA-miRNA interactions; and iii) potential mRNAs targeted by miRNAs were retrieved from miRDB (http://www.mirdb.org/index.html), miRTarBase (http://mirtarbase.mbc.nctu.edu.tw/php/index.php) and TargetScan (http://www.targetscan.org/vert_71/). Finally, miRNAs that are negatively regulated by lncRNAs and mRNAs were selected to construct the ceRNA network. To visualize the lncRNA-mRNA-miRNA ceRNA network, cytoscape v3.5.1 (21) was used for network construction. To further study the biological roles of differentially expressed mRNAs targeted by lncRNAs and miRNAs in the ceRNA network, the Database for Annotation, Visualization and Integrated Discovery (DAVID, http://david.abcc.ncifcrf.gov/) was used. Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) biological processes were annotated at significance levels of P<0.05.

Survival analysis

Kaplan-Meier survival analysis and a log-rank test were used to evaluate the association between expression levels of differentially expressed mRNAs, lncRNAs and miRNAs in the ceRNA network and the overall survival of the patients. To obtain more detail on the role of lncRNAs in LUAC, the univariate Cox's proportional hazards regression model with a significant level set at 0.01 was applied to analyze differentially expressed lncRNAs and mRNAs from the ceRNA network that were associated with overall survival. Next, the selected differentially expressed mRNAs and lncRNAs were fit into a multivariate Cox regression analysis to build the lncRNA-based prognostic signature and lncRNA-mRNA-based prognostic signature. The prognostic risk score for predicting overall survival was calculated using the following formula: Risk score = exp1*β1 + exp2*β2+…+ expn*βn, where exp indicates expression level and β is the regression coefficient. The linear combination of expression levels of LUAC-specific mRNAs or lncRNAs with estimated regression coefficients was obtained from the aforementioned multivariate Cox regression analysis (22). LUAC patients were divided into high-risk and low-risk groups using the median risk score (0.959 for the LASiglnc-9 signature; 0.923 for the LASiglnc2-m3 signature). The time-dependent receiver operating characteristic (ROC) curves were drawn using the R package ‘survival-ROC’ to compare the specificity and sensitivity of the risk prediction of the survival rate for specific lncRNAs and mRNAs in the model. Meanwhile, univariate and multivariate Cox's analyses were applied for prognostic prediction of risk score and clinical features, including age, gender, stage of pathology and Tumor-Node-Metastasis (TNM) staging system (23). Hazard ratios (HRs) and 95% confidence intervals (CIs) were assessed using the Cox regression model. All statistical analyses were conducted with R software (version 3.4.1).

Results

Identification of differentially expressed RNAs in LUAC from RNA-seq data

In the present study, RNA-seq data, including gene and miRNA expression data, was retrieved from TCGA data portal for the purpose of finding biomarkers associated with tumor prognosis. Compared with adjacent normal samples, the LUAC samples contained a total of 2,507 differentially expressed mRNAs (1,977 upregulated and 527 downregulated mRNAs), 1,633 differentially expressed lncRNAs (1,425 upregulated and 208 downregulated lncRNAs) and 113 differentially expressed miRNAs (88 upregulated and 23 downregulated miRNAs). The differentially expressed lncRNAs, miRNAs and mRNAs showed clear separation in the heat maps (Fig. 1A) and volcano plots (Fig. 1B).
Figure 1.

Heat maps and volcano plots of differentially expressed lncRNAs, mRNAs and miRNAs in patients with LUAC. (A) The hierarchical clustering heat maps of differentially expressed lncRNAs, mRNAs and miRNAs between LUAC and adjacent normal samples. (B) Volcano plot of LUAC-specific lncRNAs, mRNAs and miRNAs. lncRNA, long non-coding RNA; miRNA, microRNA; LUAC, lung adenocarcinoma; FDR, false discovery rate; FC, fold-change.

miRNA target prediction and ceRNA network

To predict the lncRNAs targeted by miRNAs, the miRcode online tool was used and 134 lncRNAs, including 115 upregulated and 19 downregulated lncRNAs, were selected to build the ceRNA network (Table I). Next, the miRDB, miRTarBase and TargetScan online tools were used to predict mRNAs targeted by miRNAs. The targeting associations between 21 miRNAs (17 upregulated and 4 downregulated miRNAs; Table II) and 34 mRNAs (25 upregulated and 9 downregulated mRNAs; Table III) were obtained and selected for ceRNA network construction.
Table I.

Differentially expressed lncRNAs in competing endogenous RNA network of lung adenocarcinoma.

lncRNAlogFCP-valueFDR
DSCAM-AS18.004.74×10−122.80×10−11
AL160271.16.915.96×10−102.64×10−9
HOTAIR6.778.85×10−201.28×10−18
AC061975.66.535.32×10−218.45×10−20
CLDN10-AS16.4942.04×10−306.67×10−29
POU6F2-AS26.181.27×10−141.02×10−13
RMRP5.914.63×10−81.58×10−7
NOVA1-AS15.841.20×10−161.23×10−15
MUC25.814.96×10−143.74×10−13
LINC003925.801.09×10−84.04×10−8
AC020907.15.799.50×10−467.61×10−44
ERVMER61-15.702.43×10−101.13×10−9
UCA15.696.01×10−219.50×10−20
LINC004915.603.86×10−153.34×10−14
LINC005015.281.00×10−171.15×10−16
LINC002215.196.98×10−103.06×10−9
AL513123.15.171.40×10−151.29×10−14
NAALADL2-AS25.071.22×10−161.25×10−15
MIR137HG4.975.03×10−154.30×10−14
LINC003934.803.61×10−151.43×10−8
ERVH48-14.744.28×10−164.12×10−15
AL356133.24.721.42×10−96.02×10−9
LINC005184.489.96×10−147.17×10−13
DLX6-AS14.443.14×10−163.07×10−15
LINC004604.439.38×10−191.22×10−17
LINC003554.364.32×10−101.95×10−9
LINC004664.291.57×10−161.58×10−15
LINC004834.257.95×10−135.15×10−12
POU6F2-AS14.162.36×10−121.44×10−11
LINC004614.047.80×10−241.59×10−22
AC087269.13.819.38×10−221.59×10−20
AC084262.13.741.64×10−182.07×10−17
AC010145.13.731.07×10−52.58×10−5
LINC004733.656.95×10−82.32×10−7
MYCNOS3.629.13×10−125.17×10−11
LINC001603.587.19×10−211.13×10−19
HOTTIP3.561.13×10−73.65×10−7
AC080129.13.481.78×10−97.43×10−9
AC006372.13.432.63×10−78.13×10−7
LINC005253.421.30×10−232.55×10−22
LINC005243.411.04×10−105.14×10−10
WASIR23.402.06×10−359.20×10−34
H193.353.20×10−111.68×10−10
AC022148.13.342.47×10−131.71×10−12
LINC002003.335.05×10−51.10×10−4
KIF25-AS13.321.98×10−109.34×10−10
LINC005363.304.92×10−112.51×10−10
LINC003083.211.15×10−73.74×10−7
FER1L6-AS13.214.93×10−51.08×10−4
SAMSN1-AS13.181.87×10−111.02×10−10
AC026320.13.162.03×10−65.53×10−6
ABCA9-AS13.165.38×10−102.40×10−9
STEAP2-AS13.142.57×10−193.58×10−18
LINC004703.082.05×10−87.33×10−8
C20orf1973.047.90×10−191.03×10−17
GRM7-AS33.036.73×10−71.96×10−6
LSAMP-AS13.026.99×10−82.33×10−7
AL354707.12.982.42×10−297.35×10−28
FNDC1-IT12.962.65×10−142.06×10−13
C2orf482.941.70×10−284.95×10−27
LINC004882.944.10×10−59.08×10−5
CACNA1C-IT32.912.60×10−55.93×10−5
CHODL-AS12.903.14×10−79.61×10−7
LINC000512.901.61×10−64.42×10−6
AP002478.12.871.54×10−96.49×10−9
AC112721.12.851.18×10−149.56×10−14
LINC003372.859.70×10−221.64×10−20
AP000553.12.822.69×10−287.68×10−27
TDRG12.776.14×10−61.55×10−5
E2F3-IT12.762.17×10−65.87×10−6
AL021395.12.708.27×10−61.75×10−4
PVT12.663.20×10−493.03×10−47
TBL1XR1-AS12.661.99×10−98.25×10−9
HNF1A-AS12.651.65×10−107.91×10−10
AL139002.12.656.36×10−41.16×10−3
LINC003192.621.46×10−96.19×10−9
DPYD-AS22.572.80×10−56.35×10−5
DSCR102.545.21×10−51.14×10−4
IGF2-AS2.542.15×10−65.82×10−6
LINC004402.489.96×10−52.08×10−4
LPP-AS12.452.82×10−45.46×10−4
VCAN-AS12.451.41×10−85.15×10−8
LINC005192.457.77×10−167.32×10−15
AL353803.12.411.09×10−84.04×10−8
IL20RB-AS12.408.01×10−72.31×10−6
ARHGEF3-AS12.391.13×10−42.34×10−4
CHL1-AS12.381.43×10−106.93×10−10
ATG10-AS12.376.38×10−41.17×10−3
EGOT2.334.72×10−154.05×10−14
C11orf442.332.95×10−79.05×10−7
SOX21-AS12.291.91×10−109.02×10−10
GRM5-AS12.274.77×10−91.86×10−8
U52111.12.267.60×10−261.82×10−24
AC007731.12.255.46×10−51.19×10−4
AC012640.12.236.87×10−177.26×10−16
FOXP1-IT12.234.71×10−71.40×10−6
AL117190.12.225.40×10−81.83×10−7
C1orf2202.197.28×10−404.06×10−38
AC092535.12.181.18×10−127.50×10−12
LINC004852.151.02×10−42.12×10−4
LINC003302.146.66×10−92.55×10−8
AL391152.12.145.84×10−81.97×10−7
ZBTB20-AS32.103.01×10−34.95×10−3
SYNPR-AS12.082.06×10−131.44×10−12
AL139385.12.074.99×10−133.31×10−12
AC110921.12.071.13×10−42.33×10−4
MEG32.061.96×10−109.25×10−10
HECW1-IT12.049.40×10−41.67×10−3
ANO1-AS22.032.62×10−55.96×10−5
ARHGAP26-AS12.021.51×10−74.80×10−7
LINC001842.017.45×10−113.74×10−10
AL365356.12.011.74×10−75.48×10−7
C10orf912.012.52×10−131.74×10−12
AC016773.12.013.08×10−311.06×10−29
AP000525.12.006.49×10−123.75×10−11
HHATL-AS1−2.006.24×10−144.62×10−13
AGAP11−2.071.06×10−354.79×10−34
RMST−2.102.26×10−172.51×10−16
AC025431.1−2.122.42×10−152.15×10−14
C5orf64−2.121.67×10−409.91×10−39
TTTY16−2.143.44×10−91.37×10−8
LINC00472−2.173.92×10−442.89×10−42
AC004832.1−2.208.05×10−221.37×10−20
MED4-AS1−2.286.93×10−631.04×10−60
SRGAP3-AS2−2.302.17×10−141.70×10−13
LINC00211−2.312.04×10−369.54×10−35
MYO16-AS1−2.365.18×10−175.57×10−16
AP003064.2−2.469.03×10−272.33×10−25
ADAMTS9-AS1−2.774.41×10−751.20×10−72
NAV2-AS2−2.788.94×10−446.27×10−42
AC105206.1−2.965.33×10−271.41×10−25
AL109754.1−2.962.39×10−431.63×10−41
AP000438.1−3.012.80×10−726.44×10−70
LINC00163−3.432.89×10−809.25×10−78

lncRNA, long non-coding RNA; FC, fold-change; FDR, false discovery rate.

Table II.

Differentially expressed miRNAs in the competing endogenous RNA network of lung adenocarcinoma.

miRNAlogFCP-valueFDR
hsa-mir-3727.085.63×10−92.36×10−8
hsa-mir-1225.911.51×10−64.54×10−6
hsa-mir-3735.503.48×10−69.72×10−6
hsa-mir-2105.071.11×10−586.33×10−57
hsa-mir-1374.423.31×10−132.06×10−12
hsa-mir-314.372.23×10−171.81×10−16
hsa-mir-301b3.601.99×10−222.06×10−21
hsa-mir-2152.953.54×10−91.52×10−8
hsa-mir-1922.815.71×10−112.98×10−10
hsa-mir-2052.736.86×10−92.78×10−8
hsa-mir-962.721.42×10−403.74×10−39
hsa-mir-4892.551.06×10−73.76×10−7
hsa-mir-5032.481.45×10−211.43×10−20
hsa-mir-216b2.441.53×10−54.05×10−5
hsa-mir-1872.388.58×10−114.38×10−10
hsa-mir-1832.364.29×10−339.46×10−32
hsa-mir-1822.051.31×10−292.25×10−28
hsa-mir-195−2.274.49×10−907.68×10−88
hsa-mir-143−2.754.79×10−886.56×10−86
hsa-mir-184−2.914.74×10−287.20×10−27
hsa-mir-144−3.424.43×10−981.01×10−95

FC, fold-change; FDR, false discovery rate; miRNA/miR, microRNA.

Table III.

Differentially expressed mRNA in the competing endogenous RNA network of lung adenocarcinoma.

mRNAlogFCP-valueFDR
HOXC136.961.73×10−221.34×10−21
SALL15.816.30×10−173.21×10−16
HOXA104.224.90×10−223.67×10−21
NPTX13.764.78×10−162.26×10−15
PSAT13.523.76×10−481.06×10−46
ELAVL23.471.09×10−155.03×10−15
PBK3.413.11×10−437.08×10−42
CCNE13.311.55×10−423.43×10−41
CEP553.281.30×10−555.03×10−54
SLC7A113.092.38×10−231.96×10−22
CCNB13.008.18×10−553.01×10−53
RET2.971.80×10−136.93×10−13
COL1A12.919.49×10−321.27×10−30
E2F72.763.91×10−315.04×10−30
CLSPN2.754.81×10−409.63×10−39
TBX182.692.39×10−107.10×10−10
KCNQ52.654.31×10−202.80×10−19
KIF232.616.02×10−431.35×10−41
CBX22.521.99×10−251.86×10−24
CDC25A2.371.98×10−373.51×10−36
CHEK12.261.69×10−433.89×10−42
MCM42.244.41×10−471.17×10−45
COL5A22.205.53×10−286.09×10−27
PFKP2.172.44×10−333.57×10−32
MIXL12.005.19×10−172.66×10−16
PROK2−2.021.49×10−241.32×10−23
SLC1A1−2.171.86×10−505.75×10−49
OSCAR−2.203.65×10−672.25×10−65
BDNF−2.312.27×10−323.15×10−31
TGFBR3−2.531.67×10−871.77×10−85
SELE−2.902.85×10−631.50×10−61
RS1−3.701.04×10−941.30×10−92
TMEM100−4.311.71×10−1437.55×10−141
SERTM1−4.743.04×10−702.05×10−68

FC, fold-change; FDR, false discovery rate.

Subsequently, the interactions between 21 miRNAs and 134 lncRNAs were assessed, as well as those between 11 miRNAs and 34 mRNAs (data not shown). Based on these targeting associations, the lncRNA-miRNA-mRNA ceRNA network was constructed using Cytoscape version 3.5.1. According to the expression levels of differentially expressed mRNAs, lncRNAs and miRNAs, two ceRNA networks, namely overexpression and underexpression networks, were constructed (Fig. 2).
Figure 2.

lncRNA-mRNA-miRNA ceRNA network of (A) underexpressed and (B) overexpressed lncRNAs, mRNAs and miRNAs. In network A, blue circles represent underexpressed mRNAs, green rectangles represent underexpressed lncRNAs, red diamonds represent overexpressed miRNAs and pink diamonds represent underexpressed miRNAs. In network B, red circles represent overexpressed mRNAs, pink rectangles represent overexpressed lncRNAs, green diamonds represent overexpressed miRNAs and blue diamonds represent underexpressed miRNAs. lncRNA, long non-coding RNA; miRNA, microRNA.

Functional enrichment analysis

To further predict putative disease prognosis-related biomarkers and the biological processes and pathways to which they belong, functional enrichment analysis of lncRNAs in the ceRNA networks was performed for GO terms and KEGG pathways. Differentially expressed mRNAs targeted by lncRNAs in the ceRNA networks were analyzed using the DAVID database. In total, 2,507 differentially expressed mRNAs were identified, including 1,977 upregulated and 527 downregulated mRNAs from LUAC tissues, when compared with adjacent normal samples based on P-values of <0.01 and |logFC|>2. Functional annotation indicated that upregulated mRNAs were involved in 23 GO terms, most significantly in ‘DNA replication’, ‘G1/S transition of the mitotic cell cycle’ and ‘cell cycle regulation’. These genes were mainly enriched in ‘cell cycle’ and ‘p53 signaling pathways’. By contrast, downregulated genes were found to be associated with GO terms of ‘BMP signaling pathway’, ‘integral component of membrane’, ‘extracellular region’ and ‘perinuclear region of cytoplasm’ (Table IV).
Table IV.

Gene ontology and KEGG pathway analysis of differentially expressed mRNA in the competing endogenous RNA network of lung adenocarcinoma.

CategoryTermCount%P-valueGenes
Upregulated
  BP_DirectGO:0006260~DNA replication4.000.090.00CLSPN, CHEK1, MCM4, CDC25A
GO:0000082~G1/S transition of mitotic cell cycle3.000.070.01CCNE1, MCM4, CDC25A
GO:0051726~regulation of cell cycle3.000.070.01CCNB1, CCNE1, CDC25A
GO:0001501~skeletal system development3.000.070.02HOXA10, COL1A1, COL5A2
GO:0000086~G2/M transition of mitotic cell cycle3.000.070.02CCNB1, CHEK1, CDC25A
GO:0031572~G2 DNA damage checkpoint2.000.050.03CLSPN, CHEK1
GO:0045893~positive regulation of transcription, DNA-templated4.000.090.04CCNE1, RET, SALL1, COL1A1
GO:0006997~nucleus organization2.000.050.04CHEK1, CEP55
GO:0000281~mitotic cytokinesis2.000.050.04KIF23, CEP55
GO:0000077~DNA damage checkpoint2.000.050.04CLSPN, CHEK1
GO:0006270~DNA replication initiation2.000.050.04CCNE1, MCM4
GO:0045944~positive regulation of transcription from RNA polymerase II promoter5.000.120.05HOXC13, E2F7, SALL1, HOXA10, MIXL1
GO:0007067~mitotic nuclear division3.000.070.05PBK, CEP55, CDC25A
GO:0048565~digestive tract development2.000.050.05CCNB1, MIXL1
  CC_DirectGO:0005654~nucleoplasm11.000.250.00KIF23, CCNB1, CLSPN, CCNE1, E2F7, SALL1, CHEK1, ELAVL2, CBX2, MCM4, CDC25A
GO:0005634~nucleus15.000.350.00KIF23, E2F7, PFKP, CHEK1, CBX2, PBK, MCM4, MIXL1, CDC25A, CCNB1, CCNE1, HOXC13, SALL1, HOXA10, TBX18
GO:0005813~centrosome4.000.090.02KIF23, CCNB1, CHEK1, CEP55
GO:0000792~heterochromatin2.000.050.03SALL1, CBX2
  MF_DirectGO:0005515~protein binding20.000.460.01KIF23, CLSPN, RET, E2F7, ELAVL2, CHEK1, CBX2, PBK, CEP55, MCM4, CDC25A, SLC7A11, CCNB1, CCNE1, KCNQ5, HOXC13, SALL1, HOXA10, COL1A1, TBX18
GO:0003677~DNA binding7.000.160.03CLSPN, HOXC13, E2F7, SALL1, CBX2, MCM4, TBX18
GO:0043565~sequence-specific DNA binding4.000.090.04HOXC13, SALL1, HOXA10, MIXL1
GO:0001077~transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding3.000.070.04HOXC13, HOXA10, MIXL1
GO:0016301~kinase activity3.000.070.05CCNE1, RET, CHEK1
  KEGG_Pathwayhsa04110:Cell cycle5.000.120.00CCNB1, CCNE1, CHEK1, MCM4, CDC25A
hsa04115:p53 signaling pathway3.000.070.00CCNB1, CCNE1, CHEK1
Downregulated
  BP_DirectGO:0030509~BMP signaling pathway2.000.130.03TGFBR3, TMEM100
  CC_DirectGO:0016021~integral component of membrane7.000.460.01BDNF, SERTM1, OSCAR, TGFBR3, TMEM100, SLC1A1, SELE
GO:0005576~extracellular region4.000.260.03PROK2, BDNF, OSCAR, TGFBR3
GO:0048471~perinuclear region of cytoplasm3.000.200.03BDNF, TMEM100, SELE

KEGG, Kyoto Encyclopedia of Genes and Genomes; BP, biological process; CC, cell component; MF, molecular function.

Determination and analysis of predictive prognostic signature

Since the selected differentially expressed mRNAs, lncRNAs and miRNAs in the ceRNA network exhibited distinct expression patterns in patients with LUAC, these coding and non-coding ceRNAs were analyzed using Kaplan-Meier and log-rank test methods to predict the prognosis of such patients. A total of 8 differentially expressed lncRNA ceRNAs were identified, including AP000525.1, AP002478.1, LINC00518, MED4-antisense 1 (AS1), NAV2-AS2, STEAP2-AS1, SYNPR-AS1 and urothelial cancer-associated 1, as well as 9 differentially expressed mRNA ceRNAs, including cyclin B1 (CCNB1), centrosomal protein 55 (CEP55), checkpoint kinase 1 (CHEK1), E2F transcription factor 7 (E2F7), kinesin family member 23 (KIF23), minichromosome maintenance complex component 4, PDZ binding kinase, phosphofructokinase platelet and retinoschisin 1 (RS1), which were associated with overall survival (Figs. 3 and 4). Subsequent to univariate Cox's proportional hazards regression model analysis for differentially expressed lncRNAs in the ceRNA networks, 19 lncRNAs were selected to have a significant prognostic value (data not shown), but 22 lncRNAs and mRNAs from the ceRNA networks were identified by integrated univariate Cox's model analysis as aberrantly expressed lncRNAs and mRNAs (data not shown). Based on the criterion of a P-value of <0.01, the selected lncRNA and mRNA ceRNAs were used to build lncRNA- or lncRNA-mRNA-based prognostic signatures using a multivariate Cox's regression model. The results showed that 9 lncRNA ceRNAs were included in a lncRNA-based prognostic signature (termed LASiglnc-9), and two lncRNA and three mRNA ceRNAs were included in a lncRNA-mRNA-based prognostic signature (termed LASiglnc2-m3) (Fig. 5). The prognostic risk score for predicting overall survival was calculated as: exp1*β1 + exp2*β2+…+expn*βn. The median was used as the cutoff of risk score, and LUAC patients were divided into high-risk and low-risk groups based on this categorization (Fig. 5). Differentially expressed lncRNAs and mRNAs included in the two models are shown in Fig. 5, these include ABCA9-AS1, MED4-AS1, C5orf64, AP000438.1, LINC00319, LINC00518, C20orf197, LINC00460, LINC00519, CCNB1, KIF23 and E2F7. The time-dependent ROC curves analysis for LASiglnc-9 achieved an area under the curve (AUC) of 0.701 for the 5-year survival of LUAC patients (Fig. 6A) and the survival rate of the low-risk group was higher than that of high-risk group (P<0.001; Fig. 6B). The time-dependent ROC curve analysis of LASiglnc2-m3 achieved an AUC of 0.627 (Fig. 6C) and the survival rate was similar to that of LASiglnc-9 (Fig. 6D). These results suggest that the accuracy of LASiglnc-9 is higher than that of LASiglnc2-m3 for predicting LUAC prognosis functioned as ceRNAs.
Figure 3.

Kaplan-Meier survival curves for 8 lncRNAs associated with overall survival of patients with lung adenocarcinoma. Horizontal axis, overall survival time in years; vertical axis, survival function.

Figure 4.

Kaplan-Meier survival curves for 9 mRNAs associated with overall survival of lung adenocarcinoma. Horizontal axis, overall survival time in years; vertical axis, survival function. CCNB1, cyclin B1; CEP55, centrosomal protein 55; CHEK1, checkpoint kinase 1; E2F7, E2F transcription factor 7; KIF23, kinesin family member 23; MCM4, minichromosome maintenance complex component 4; PBK, PDZ binding kinase; PFKP, phosphofructokinase platelet; RS1, retinoschisin 1.

Figure 5.

Risk score analysis of two prognostic signatures associated with overall survival in patients with LUAC. (A) 9-lncRNA signature LASiglnc-9 and (B) LASiglnc2-m3 signature. Survival status and duration of cases (top panels); risk score of lncRNA signature (middle panels); and heat map of LUAC-specific lncRNAs and mRNAs (bottom panels). CCNB1, cyclin B1; KIF23, kinesin family member 23; E2F7, E2F transcription factor 7; LUAC, lung adenocarcinoma.

Figure 6.

Two prognostic signatures, LASiglnc-9 and LASiglnc2-m3, lung adenocarcinoma outcome. (A) The ROC curve for 5-year overall survival prediction using the LASiglnc-9 signature. (B) The Kaplan-Meier curve of the risk score for the overall survival using the LASiglnc-9 signature; the log-rank test was used to compare the difference between low- and high-risk groups. (C) The ROC curve for predicting 5-year survival using the LASiglnc2-m3 signature. (D) The Kaplan-Meier curve of the risk score for the overall survival using the LASiglnc2-m3 signature. ROC, receiver operating characterstic; AUC, area under the curve.

To further study the value of LASiglnc-9 for LUAC prognosis, the expression pattern of 9 lncRNAs of tumor patients in two risk groups was analyzed and presented in Fig. 7. Of these 9 lncRNAs, the expression of 5 lncRNAs (LINC00460, LINC00519, LINC00518, ABCA9-AS1 and LINC00319) was higher in the high-risk group than that in the low-risk group (P<0.001), while the expression of the other 4 lncRNAs (AP000438.1, MED4-AS1, C5orf64 and C20orf197) was lower in the high-risk group than that in the low-risk group (P<0.001).
Figure 7.

Expression patterns of 9 lncRNAs in high- and low-risk groups. ****P<0.001 for high- vs. low-risk groups. lncRNA, long non-coding RNA.

Independence of predictive capacity of LASiglnc-9 from clinical factors

Kaplan-Meier curve analysis for clinical factors, including age, gender, stage of pathology, and T, N and M stages, revealed that stage of pathology (P<0.001), T stage (P=9×10−5) and N stage (P<0.001) were associated with overall survival in LUAC patients (Fig. 8). Univariate Cox's regression model analysis showed that stage of pathology (HR, 2.82; 95% CI, 1.94–4.09; P<0.001), T stage (HR, 2.49; 95% CI, 1.55–4.00; P<0.001), N stage (HR, 2.78; 95% CI, 1.92–4.01; P<0.001) and risk score (HR, 0.39; 95% CI, 0.26–0.58; P<0.001) were significantly associated with overall survival (P<0.001). However, T stage (HR, 1.46; 95% CI, 0.85–2.50; P=0.170) was not associated with overall survival in LUAC patients upon multivariate regression analysis (Table V). These results suggest that the stage of pathology, N stage and the risk score based on LASiglnc-9 function as independent prognostic factors.
Figure 8.

Prognostic value of different clinical factors for overall survival of patients with lung adenocarcinoma. Kaplan-Meier curves of six prognostic indicators.

Table V.

Predictive values of clinical features and risk score.

Univariate analysisMultivariate analysis


VariablesPatients, nHR (95% CI)P-valueHR (95% CI)P-value
Age (<60/≥60 years)131/3511.04 (0.70–1.56)0.831.17 (0.77–1.77)0.47
Gender (male/female)260/2220.88 (0.61–1.27)0.490.82 (0.57–1.20)0.31
Pathological stage (I–II/III–IV)377/1052.82 (1.94–4.09)0.001.68 (1.02–2.77)0.04
T stage (T1-T2/T3-T4)417/652.49 (1.55–4.00)0.001.46 (0.85–2.50)0.17
N stage (N0/NX)312/1702.78 (1.92–4.01)0.001.80 (1.15–2.83)0.01
M stage (M0/MX)318/1640.87 (0.58–1.30)0.490.90 (0.59–1.36)0.61
Risk score (low/high)238/2440.39 (0.26–0.58)0.000.51 (0.34–0.76)0.00

HR, hazard ratio; CI, confidence interval.

Discussion

Although clinical management of lung cancer has improved over the years through a variety of technologies that reduce patient mortality rate, an ever-increasing number of patients remain in danger of tumor recurrence or mortality (1). This is mainly due to the fact that the majority of lung cancer cases are diagnosed at advanced stages where surgical resection is not a good choice for tumor cure. Moreover, clinicopathological factors, including tumor stage, lymph node status, tumor grade and size, and lymphatic and vascular invasion appear to be associated with LUAC prognosis, but do not appear to be sufficient for predicting treatment outcomes in LUAC patients (24). A growing number of studies are focusing on microarray technology and high-throughput sequencing with the hope of identifying molecular signatures, including protein-coding genes, lncRNAs or miRNAs, that can assist in predicting survival, metastasis and the prognosis of patients (17,25). Furthermore, with a greater understanding of RNA crosstalk and interaction in the scientific community, the integrated analysis of an lncRNA-associated ceRNA network is becoming more widely used to predict prognostic signatures in various cancer types, including LUAC (26). Although several lncRNAs and miRNAs have been associated with LUAC prognosis (17,25), their expression patterns and prognostic values have not been thoroughly studied and they cannot be considered to be valid prognostic biomarkers at this time. In the current study, RNA-sequencing and clinical data were retrieved from TCGA database and then analyzed and screened for differentially expressed mRNAs, lncRNAs and miRNAs between LUAC patient tissues and adjacent normal tissues. With LUAC-specific dysregulated lncRNAs, miRNAs and mRNAs, the lncRNA-mRNA-miRNA ceRNA network was constructed, which provides more insight into the detection of key RNAs associated with LUAC prognosis. Kaplan-Meier and log-rank analyses revealed 8 differentially expressed lncRNAs and 9 mRNAs associated with overall survival from exhibiting as ceRNA in patients with LUAC. Next, an lncRNA-based prognostic signature, LASiglnc-9, was constructed, which contains 9 lncRNAs, as well as an lncRNA-mRNA-based prognostic signature, LASiglnc2-m3, which contains 2 lncRNAs and 3 mRNAs based on the differentially expressed RNAs that were mapped into the ceRNA network. Of these, LASiglnc-9 showed that it may be able to more accurately predict the overall survival of patients with LUAC compared with LASiglnc2-m3. Furthermore, it was found that the predictive ability of LASiglnc-9 is certainly independent from clinicopathological factors, including stage of pathology (HR, 2.82; 95% CI, 1.94–4.09; P<0.001), T stage (HR, 2.49; 95% CI, 1.55–4.00; P<0.001), N stage (HR, 2.78; 95% CI, 1.92–4.01; P<0.001) and risk score (HR, 0.39; 95% CI, 0.26–0.58; P<0.001) through Cox's regression analysis. These findings show that LASiglnc-9 may be a candidate biomarker for LUAC prognosis prediction based on mechanisms derived from the ceRNA networks. The lncRNA ABCA9-AS1, 1 of 9 prognosis-related lncRNAs, is targeted by hsa-mir-195 in the present ceRNA network of downregulated lncRNAs and mRNAs. It is well known that hsa-mir-195 is implicated in various cancer types, including hepatocellular carcinoma (27), esophageal squamous cell carcinoma (28) and glioblastoma (29). Notably, a previous study demonstrated that serum mir-195 was predictive of the recurrence risk of adrenocortical cancer (30). Moreover, target prediction analysis in the present study showed that hsa-mir-195 may regulate the expression of several mRNAs in the ceRNA network, including RS1, transmembrane protein 100, osteoclast-associated immunoglobulin-like receptor, transforming growth factor β receptor 3, E2F7, phosphoserine aminotransferase 1, spalt like transcription factor 1, CEP55, KIF23, ret proto-oncogene, cell division cycle 25A, chromobox 2, cyclin E1, homeobox A10 (HOXA10), CHEK1 and claspin. GO and KEGG enrichment analysis for mRNAs co-expressed with lncRNAs and miRNAs indicated that the majority of the implicated genes are significantly involved in cell cycle-related biological processes mediating tumor cell proliferation. Another lncRNA MED4-AS1, which is targeted by hsa-mir-143 and hsa-mir-144, was overexpressed in the low-risk group. Several genes, including collagen type I α1 chain (COL1A1), COL5A2, T-box 18, potassium voltage-gated channel subfamily Q member 5 and HOXA10, were predicted to be regulated by hsa-mir-143 and hsa-mir-144, and are clearly enriched in cell proliferation-associated GO terms. As studies on the roles and mechanisms of action of lncRNAs are in their infancy, functional interpretation of their co-expressed mRNAs within a ceRNA network is considered to be an effective computational strategy. The present study found that ABCA9-AS1 and MED4-AS1 may be involved in the ‘skeletal system’, ‘protein binding’, ‘DNA binding’, ‘cell cycle’ and ‘p53 signaling pathway’. The skeletal system served a vital role in body support and movement. Meanwhile, collagen, as one component of the skeletal system, has been proven to promote tumor initiation and progression (31). It is widely accepted that the p53 tumor suppressor inhibits tumor growth by mediating cell-cycle arrest, apoptotic cell death and cellular senescence triggered by diverse cellular stresses (32). As a result, we hypothesize that dysregulation of these 9 lncRNAs associated LUAC prognosis contributes to the poor outcome of patients with LUAC by mediating known tumor-associated biological processes and pathways acting as ceRNAs regulating gene expression. Currently, the TNM staging system is the most widely used system in predicting the survival of patients with LUAC. However, there are several limitations to the system. For example, not all stage III–IV patients experienced worse survival times compared with stage I–II patients, and patients who were in the same stage experienced variable survival times. Thus, the genetic predictive markers are required to assist doctors in forming more accurate estimates in clinical practice. In the present study, the identified 9-lncRNA signature showed prognostic value in LUAC patients. Even in the same pathological stage, the 9-lncRNA signature can classify patients into high- and low-risk groups with lncRNA expression level, suggesting that this lncRNA signature can improve the accuracy of survival prediction. Therefore, this result may aid doctors in selecting the corresponding therapeutic schedule for patients at different pathological stages, which can improve the overall survival of patients with LUAC. However, there are certain limitations to the present study. First, the limited available lncRNA and miRNA expression profiles only identified a fraction of the lncRNAs that may be associated with LUAC prognosis. Second, the predictive value of lncRNA signatures remains to be verified by molecular and clinical experiments in future studies. Therefore, larger cohorts and experimental studies are required to validate this signature to further investigate the functional roles of LASiglnc-9 in LUAC prognosis. In summary, the present study identified a 9-lncRNA signature that is closely associated with the tumor prognosis of patients with LUAC by use of lncRNAs profiles and construction of ceRNA networks, and by performing survival analysis. The present study not only indicates the predictive ability of lncRNA ceRNAs as potential biomarkers for LUAC diagnosis and prognosis, but also provides novel insight into the molecular mechanism underlying LUAC with further experimental validation.
  31 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

Review 2.  Expression and function of a large non-coding RNA gene XIST in human cancer.

Authors:  Sarah M Weakley; Hao Wang; Qizhi Yao; Changyi Chen
Journal:  World J Surg       Date:  2011-08       Impact factor: 3.352

3.  Unique microRNA molecular profiles in lung cancer diagnosis and prognosis.

Authors:  Nozomu Yanaihara; Natasha Caplen; Elise Bowman; Masahiro Seike; Kensuke Kumamoto; Ming Yi; Robert M Stephens; Aikou Okamoto; Jun Yokota; Tadao Tanaka; George Adrian Calin; Chang-Gong Liu; Carlo M Croce; Curtis C Harris
Journal:  Cancer Cell       Date:  2006-03       Impact factor: 31.743

Review 4.  The multilayered complexity of ceRNA crosstalk and competition.

Authors:  Yvonne Tay; John Rinn; Pier Paolo Pandolfi
Journal:  Nature       Date:  2014-01-16       Impact factor: 49.962

5.  Suppression of non-small cell lung tumor development by the let-7 microRNA family.

Authors:  Madhu S Kumar; Stefan J Erkeland; Ryan E Pester; Cindy Y Chen; Margaret S Ebert; Phillip A Sharp; Tyler Jacks
Journal:  Proc Natl Acad Sci U S A       Date:  2008-02-28       Impact factor: 11.205

6.  Serum miR-483-5p and miR-195 are predictive of recurrence risk in adrenocortical cancer patients.

Authors:  O Chabre; R Libé; G Assie; O Barreau; J Bertherat; X Bertagna; J-J Feige; N Cherradi
Journal:  Endocr Relat Cancer       Date:  2013-07-05       Impact factor: 5.678

7.  Downregulation of miR-195 via cyclosporin A in human glioblastoma cells.

Authors:  Sunde Yilaz Susluer; Cigir Biray Avci; Yavuz Dodurga; Zeynep Ozlem Dogan Sigva; Nezih Oktar; Cumhur Gunduz
Journal:  J BUON       Date:  2015 Sep-Oct       Impact factor: 2.533

8.  let-7 regulates self renewal and tumorigenicity of breast cancer cells.

Authors:  Fengyan Yu; Herui Yao; Pengcheng Zhu; Xiaoqin Zhang; Qiuhui Pan; Chang Gong; Yijun Huang; Xiaoqu Hu; Fengxi Su; Judy Lieberman; Erwei Song
Journal:  Cell       Date:  2007-12-14       Impact factor: 41.582

9.  CREB up-regulates long non-coding RNA, HULC expression through interaction with microRNA-372 in liver cancer.

Authors:  Jiayi Wang; Xiangfan Liu; Huacheng Wu; Peihua Ni; Zhidong Gu; Yongxia Qiao; Ning Chen; Fenyong Sun; Qishi Fan
Journal:  Nucleic Acids Res       Date:  2010-04-27       Impact factor: 16.971

10.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

View more
  6 in total

1.  Integrative analyses of noncoding RNAs reveal the potential mechanisms augmenting tumor malignancy in lung adenocarcinoma.

Authors:  Jou-Ho Shih; Hsin-Yi Chen; Shin-Chih Lin; Yi-Chen Yeh; Roger Shen; Yaw-Dong Lang; Dung-Chi Wu; Chien-Yu Chen; Ruey-Hwa Chen; Teh-Ying Chou; Yuh-Shan Jou
Journal:  Nucleic Acids Res       Date:  2020-02-20       Impact factor: 16.971

2.  Comprehensive analysis of the LncRNAs, MiRNAs, and MRNAs acting within the competing endogenous RNA network of LGG.

Authors:  Yiming Ding; Hanjie Liu; Chuanbao Zhang; Zhaoshi Bao; Shuqing Yu
Journal:  Genetica       Date:  2022-01-07       Impact factor: 1.082

3.  Competing endogenous RNA network identifies mRNA biomarkers for overall survival of lung adenocarcinoma: two novel on-line precision medicine predictive tools.

Authors:  Jinsong Lin; Shubiao Lu; Zhijian Jiang; Chongjing Hu; Zhiqiao Zhang
Journal:  PeerJ       Date:  2021-05-07       Impact factor: 2.984

4.  Identification of LINC02310 as an enhancer in lung adenocarcinoma and investigation of its regulatory network via comprehensive analyses.

Authors:  Wenyuan Zhao; Jun Wang; Qingxi Luo; Wei Peng; Bin Li; Lei Wang; Chunfang Zhang; Chaojun Duan
Journal:  BMC Med Genomics       Date:  2020-12-11       Impact factor: 3.063

5.  Identification of candidate RNA signatures in triple-negative breast cancer by the construction of a competing endogenous RNA network with integrative analyses of Gene Expression Omnibus and The Cancer Genome Atlas data.

Authors:  Ping Yan; Lingfeng Tang; Li Liu; Gang Tu
Journal:  Oncol Lett       Date:  2020-01-10       Impact factor: 2.967

6.  Comprehensive analysis of prognostic biomarkers in lung adenocarcinoma based on aberrant lncRNA-miRNA-mRNA networks and Cox regression models.

Authors:  Yan Yao; Tingting Zhang; Lingyu Qi; Ruijuan Liu; Gongxi Liu; Jia Wang; Qi Song; Changgang Sun
Journal:  Biosci Rep       Date:  2020-01-31       Impact factor: 3.840

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.