Literature DB >> 31211495

Survival analysis and functional annotation of long non-coding RNAs in lung adenocarcinoma.

Abbas Salavaty1, Zahra Rezvani1, Ali Najafi2.   

Abstract

Long non-coding RNAs (lncRNAs) are a subclass of non-protein coding transcripts that are involved in several regulatory processes and are considered as potential biomarkers for almost all cancer types. This study aims to investigate the prognostic value of lncRNAs for lung adenocarcinoma (LUAD), the most prevalent subtype of lung cancer. To this end, the processed data of The Cancer Genome Atlas LUAD were retrieved from GEPIA and circlncRNAnet databases, matched with each other and integrated with the analysis results of a non-small cell lung cancer plasma RNA-Seq study. Then, the data were filtered in order to separate the differentially expressed lncRNAs that have a prognostic value for LUAD. Finally, the selected lncRNAs were functionally annotated using a bioinformatic and systems biology approach. Accordingly, we identified 19 lncRNAs as the novel LUAD prognostic lncRNAs. Also, based on our results, all 19 lncRNAs might be involved in lung cancer-related biological processes. Overall, we suggested several novel biomarkers and drug targets which could help early diagnosis, prognosis and treatment of LUAD patients.
© 2019 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.

Entities:  

Keywords:  bioinformatics; functional annotation; lung adenocarcinoma; prognostic lncRNAs; systems biology

Mesh:

Substances:

Year:  2019        PMID: 31211495      PMCID: PMC6652661          DOI: 10.1111/jcmm.14458

Source DB:  PubMed          Journal:  J Cell Mol Med        ISSN: 1582-1838            Impact factor:   5.310


Highlights

Nineteen lncRNAs are presented as novel prognostic biomarkers for LUAD. The plasma abundance of SNHG6 could be used as a diagnostic and/or prognostic biomarker in LUAD. LncRNAs could involve in LUAD development through influencing the hsa04080 KEGG pathway.

INTRODUCTION

Lung cancer is the number one cause of cancer‐related death among both men and women worldwide.1 Non‐small cell lung cancer (NSCLC) accounts for approximately 85% of lung cancer cases.2 NSCLC is histologically divided into three subtypes of which lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) account for ~50% and ~40% of the cases respectively.3 Unfortunately, most of the NSCLC patients are diagnosed at advanced stages and have a very poor prognosis, which consequently results in a low overall survival (OS) rate (15%).4, 5 In this regard, LUAD is one of the most aggressive and deadliest types of cancer with less than 5 years of OS.6 A variety of factors such as cigarette smoking, exposure to second‐hand smoke, air pollution, cooking fumes, asbestos and radon put individuals at the risk of LUAD. Besides, immunologic dysfunction, genetic susceptibility as well as some diseases including asthma and tuberculosis infections would enhance the risk of LUAD.7 Additionally, it is reported that the carcinogenesis of LUAD varies between men and women as well as between smokers and never smokers.8 Long non‐coding RNAs (lncRNAs) refer to a class of non‐protein coding RNAs that are more than 200 nucleotides and are differentially accumulated in the nucleus and cytoplasm.9 LncRNAs play various regulatory roles in the cell including regulation of development, stem cell pluripotency, cell growth and apoptosis and are frequently dysregulated in different cancers.9, 10 HOTAIR, as an example, is a well‐known oncogenic lncRNA that is up‐regulated in several cancers.11 The lncRNA linc00665 has recently been represented as an oncogenic factor in LUAD.12 However, there are hundreds of lncRNAs that their exact roles in different cancers are yet to be discovered and/or experimentally validated. RAB6C‐AS1, for instance, is a poorly known lncRNA that is presented as a potential candidate biomarker for prostate and brain cancers but its implications in the carcinogenesis of these cancers are not still neither computationally nor experimentally examined and validated.13 LncRNAs are also involved in the tumourigenesis and progression of lung cancer through aberrant regulation of gene expression at the transcriptomic, epigenomic and genomic levels.14 Additionally, epigenetic and RNA deregulations are considered as a potential hallmark of LUAD.15 Altogether, discovering the functional roles of lncRNAs in LUAD would greatly enhance our knowledge of the aetiology of LUAD and lead to the advent of novel promising biomarkers and drug targets for this deadly disease. As lncRNAs play essential roles in the progression of different cancers, they have the potential to be used as diagnostic and prognostic biomarkers.16 Also, the presence of several circulating transcripts has been reported in the plasma and serum of cancer patients which could be used for diagnostic purposes.17 Moreover, different circulatory non‐coding RNAs (ncRNAs) including lncRNAs are being constantly represented as biomarkers for cancer diagnosis, prognosis and monitoring of treatment response.5, 18 Thus, lncRNAs are potential factors for the prediction of OS and disease‐free survival (DFS) periods of cancer patients. Today, lncRNAs are being regarded as potential diagnostic factors and therapeutic targets for NSCLC.19 The lncRNA LINC00578, as an example, is represented as a promising biomarker and therapeutic target for LUAD.20 In another study, TG et al introduced three lncRNAs including HCP5, SNHG12 and LINC00472 as potential biomarkers for LUAD management.21 Also, several lncRNAs such as LHFPL3‐AS2, LINC01105, LINC00092, LINC00908 and FAM83AAS1 have been reported as prognostic factors for LUAD.22 Furthermore, the diagnostic value of circulating lncRNAs as plasma signatures for the early detection of lung cancer has been confirmed.23 A growing number of computational models are being constantly developed for the identification of lncRNA‐disease associations and characterization of functional roles of lncRNAs in diseases including lung cancer. KATZLDA, as an example, is a robust computational model for the prediction of lncRNA‐disease associations.24 In another study, Chen et al proposed a kind of top‐down model. They assumed that similar diseases tend to be associated with functionally similar lncRNAs and accordingly, developed a computational model named LRLSLDA.25 Generally speaking, identification of lncRNA‐disease associations is achieved based on two different approaches; using known lncRNA‐disease associations, as in machine learning‐based and network‐based models, or using models based on the known disease‐related genes/miRNAs. Functional similarity calculation method, which is based on the assumption that functionally similar lncRNAs are associated with similar diseases, is commonly applied in both of the aforementioned approaches but usually in combination with other methods.26 Various information resources are used for the calculation of lncRNA functional similarity which could be summarized into four categories: lncRNA expression similarity, GO term‐based lncRNA functional similarity, miRNA/mRNA‐lncRNA interaction‐based functional similarity and lncRNA‐disease association‐based functional similarity.27 In the current study, a GO term‐based lncRNA functional similarity method was used to functionally interrogate the lncRNA‐LUAD associations. In the context of the prediction of functional roles of lncRNAs in diseases, several computational models have thus far been proposed that could be classified into four major categories, including gene coexpression‐based models, lncRNA‐miRNA/mRNA/protein interaction‐based models, sequence alignment‐based models and integrative features‐based models which incorporates sequence‐derived and experimental features of lncRNAs.27 In this study, we applied a coexpression‐based model for the prediction of functional roles of lncRNAs in LUAD. It is frequently reported that lncRNAs are differentially expressed (DE) in cancer tissues.28 Also, according to the guilt by association principle, if a gene shows an expression correlation with the expression profiles of a set of genes involved in a specific function, that gene is possibly involved in the same function.29 Therefore, identification of the coexpressed genes (CEGs) of DE‐lncRNAs can help functional annotation of lncRNAs in cancer. Moreover, CEGs can have common regulatory sequences and might be interacting partners of the same complex and/or involve in the same pathway.30 Actually, dysregulated lncRNAs interact with other macromolecules and consequently drive various cancer manifestations.31 Hence, identification of the CEGs of DE‐lncRNAs in cancer assists in the characterization of oncogenic or tumour suppressive functions of lncRNAs and recognition of pathways they are involved in. This is a common methodology that can be used for the functional annotation of poorly known genes in the context of different diseases including cancer.32 There are several notable variances in the gene expression profile and molecular features between LUSC and LUAD and consequently different therapeutic strategies and regiments are administrated to these two major subtypes of NSCLC.33 Thus, NSCLC studies should precisely target either LUAD or LUSC so as to get more specific results. In this study, we systematically analysed the prognostic value of lncRNAs for LUAD and annotated their functional roles using a bioinformatic and systems biology approach. A schematic outline of the implemented methodology is given in Figure 1.
Figure 1

Schematic outline of the research protocol

Schematic outline of the research protocol

METHODS

Data preparation

All of The Cancer Genome Atlas (TCGA) LUAD DE‐lncRNAs (1 < |Log2FC|; adjusted P‐value (adjp) < 0.05) were retrieved from the circlncRNAnet (Table S1).34 The circlncRNAnet is an integrated web‐based resource for mapping functional networks of long or circular forms of ncRNAs. TCGA LUAD is one of the projects conducted by TCGA Research Network and comprises 483 LUAD tumour samples and 59 normal lung samples. Also, all TCGA LUAD DEGs were obtained from GEPIA (Table S2).35 GEPIA is a web server specialized for analysing the RNA‐seq data of 9736 tumours and 8587 normal samples from the TCGA and the GTEx projects. In the context of the differential expression analysis of genes in LUAD, GEPIA has added 288 normal lung samples from GTEx projects to the normal samples of TCGA LUAD so as to make a higher balance between the number of normal and cancer samples. Then, common DE‐lncRNAs between circlncRNAnet and GEPIA with the same expression dysregulation (either over‐ or underexpression) in both databases were selected. Moreover, a total of six plasma RNA‐seq data samples, including three normal and three NSCLC plasma samples, were retrieved from the PRJNA286036 study at the European Nucleotide Archive and analysed using an Australian Galaxy server (GVL QLD, GVL 4.0.1; https://galaxy-qld.genome.edu.au/galaxy).

RNA‐seq data analysis

We applied the following pipeline with this exact sequence of steps for analysing the plasma RNA‐seq data obtained from the European Nucleotide Archive; reads were mapped to the hg19 reference genome using STAR36; lncRNA transcripts were assembled using Cufflinks37 according to the GTF (UCSC compatible) GRCh37/hg19 Version 5.0 full database annotation file downloaded from the LNCipedia38, 39; all Cufflinks' GTF output files and the LNCipedia GTF file were merged using Cuffmerge37; read counts were calculated using the SAM/BAM to count matrix tool based on the HTSeq code40; Differential_Count tool was used to analyse the matrix of the read counts for differentially expressed genes according to the DESeq2 method41; the Benjamini‐Hochberg method was used for multiple hypothesis correction; finally, DE‐lncRNAs with adjp under 0.05 were extracted. As the documents of PRJNA286036 study have not mentioned the exact NSCLC subtype of cancer samples, the extracted lncRNAs were queried across circlncRNAnet TCGA LUAD and cBioPortal42, 43 TCGA LUAD, Provisional, datasets to find and select the ones with significant alterations in LUAD.

Data filtration

All of the prepared data were filtered so as to make our downstream analyses more specific. First, the list of lncRNAs selected from the intersection of data retrieved from circlncRNAnet and GEPIA databases was combined with the list of lncRNAs outputted from the RNA‐seq data analysis. This combined list was named as the gene library (gene library = (circlncRNAnet DE‐lncRNAs ⋂ GEPIA DEGs) ⋃ (NSCLC plasma DE‐lncRNAs ⋂ LUAD altered/DE‐lncRNAs)). Then, the lncRNAs without RefSeq sequences in the gene library were filtered out in order to lay a firm foundation for our downstream analyses. To this purpose, a list of all RefSeq lncRNAs was retrieved from HGNC BioMart (Table S3) 44 on 29 January 2018 according to the following options; Filter by genes with RefSeqs accession; Status: Approved; Locus group: non‐coding RNA; Locus type: RNA, long non‐coding.

Survival analysis

Prognostic value of those RefSeq lncRNAs that remained in the last step of the data filtration process was investigated using the GEPIA web server. To this end, the prognostic value of all of the remained RefSeq lncRNAs was analysed across the TCGA LUAD dataset with the default options of GEPIA web server. Lastly, lncRNAs with significant prognostic value (Logrank test P‐value < 0.05) for OS and/or DFS were selected and named as LUAD Prognostic lncRNAs (LUAD Prognostic lncRNAs = (gene library ⋂ RefSeq‐lncRNAs) ⋂ LUAD survival‐associated lncRNAs). Also, using the GEPIA web server, the expression of LUAD Prognostic lncRNAs (LAProLncRs) was analysed across the LUAD tumour samples compared with normal controls to illustrate their expression dysregulations. Additionally, a multivariate Cox regression analysis with adjustments for the clinicopathological features of patients, including tumour stage, gender and smoking history was done to figure out if any of the prognostic lncRNAs in LUAD could be considered as an independent prognostic factor or not. For this purpose, the Kaplan‐Meier plotter online software (http://kmplot.com/analysis) 45 was used to perform a multivariate Cox regression analysis on a LUAD microarray study (GSE31210). In this step, a microarray dataset rather than an RNA‐Seq one was used to obtain more reliable results.

Coexpression analysis

The coexpression analysis was done for each lncRNA independently. Different resources of TCGA LUAD processed data were integrated in order to identify high‐confident CEGs. To this purpose, first, all of the significantly CEGs with each lncRNA were retrieved from both circlncRNAnet and GEPIA databases and their shared genes were outputted. Then, the intersection of CEGs with the list of all LUAD DEGs was queried ((circlncRNAnet CEGs ⋂ GEPIA CEGs) ⋂ GEPIA DEGs) so as to separate differentially expressed CEGs (DECEGs) (Table S4). Finally, the coexpression networks of lncRNAs with one another and with other DEGs were reconstructed using Cytoscape v3.5.1.46

Functional analysis

A coexpression‐based model was applied for the prediction of functional roles of lncRNAs in LUAD. First, the DECEGs of each LAProLncR were used to perform a gene set enrichment analysis for gene ontology‐biological process (GO‐BP) terms via Enrichr web server.47, 48 It should be noted that because SNHG6 was not significantly differentially expressed in LUAD, not only its DECEGs, but all of its CEGs were used for the gene set enrichment analysis. Then, the FuncPred database29 was used to investigate the association of LAProLncRs with GO‐BP terms in normal lung tissue based on the tissue‐specific and evolutionary conserved expression data. Finally, the first ranked GO‐BP terms of Enrichr (according to the highest combined score) and FuncPred (according to the lowest FDR) as well as their intersection were selected as the most remarkable GO‐BP terms and were illustrated as a network using the Cytoscape software. Furthermore, considering the clustered lncRNAs in the lncRNA‐GO‐BP network, the LncPath R package (https://CRAN.R-project.org/package=LncPath) was used to interrogate the synergistic function of lncRNAs across the KEGG pathways. At last, the DECEGs of synergic lncRNAs were mapped onto the predicted pathway using KEGG Mapper49 and the resulted pathway was imported into Cytoscape by means of KEGGScape app50 and enhanced manually. LncPath conducts a random walk strategy followed by applying a weighted Kolmogorov‐Smirnov statistic to evaluate the pathways related to the lncRNA sets based on their CEGs.

Clinicopathological and demographic analysis

The differential expression of LAProLncRs among different LUAD stages was analysed using the GEPIA web server. Also, the impact of smoking habit and gender on the expression of prognostic lncRNAs in LUAD was investigated using the Lung Cancer Explorer (http://lce.biohpc.swmed.edu/lungcancer). Lung Cancer Explorer is an online database that provides the exploration of gene expression data from several public lung cancer datasets.

Statistical and topological analysis

All of the statistical analyses, except Cox regression analysis, were done using R statistical software (R Development Core Team (2014), freely available at http://www.r-project.org). The multivariate Cox regression analysis was done by the Kaplan‐Meier plotter web server. The Pearson correlation coefficient (R)>0.3 was considered as the significant threshold throughout the study. Also, the P‐value < 0.05 was considered statistically significant in all of the analyses. In the context of graph topology, two network metrics including betweenness centrality (a measure of node centrality based on shortest paths) and degree (the number of edges incident to each node) were coincidently employed to determine the hub nodes, whenever possible.

RESULTS

Selection of 168 lncRNAs as the gene library

After filtration of the TCGA LUAD DE‐lncRNAs, only 164 lncRNAs remained. Also, RNA‐seq data analysis resulted in 109 circulating lncRNAs with significant differential abundance (2 < |Log2FC|, adjp < 0.05) in NSCLC plasma samples compared with normal ones (Table S5). Interestingly, all 109 lncRNAs had lower abundance in NSCLC plasma samples compared with normal samples. Filtration of these 109 lncRNAs through circlncRNAnet and cBioPortal databases indicated that four of the 109 circulating lncRNAs were significantly amplified/overexpressed in TCGA LUAD samples (Data not shown). Altogether, 168 lncRNAs were selected for downstream analyses (Table S6).

Presentation of 19 lncRNAs as candidate LUAD biomarkers

Among all lncRNAs in our gene library, only 62 lncRNAs came out as RefSeq lncRNAs after filtration through HGNC RefSeq lncRNAs (Table S7). Subsequently, survival analyses using the GEPIA web server demonstrated that 19 of the 62 RefSeq lncRNAs had significant prognostic values (Logrank test P‐value < 0.05) for LUAD (Table 1). Remarkably, one of these lncRNAs, namely SNHG6, was of the lncRNAs with differential abundance between plasma samples of NSCLC patients and healthy controls. Also, Kaplan‐Meier plots illustrated that the association of these 19 lncRNAs with OS/DFS of patients is in accordance with the dysregulation of these lncRNAs in TCGA LUAD cancer samples (Figure 2 and Figure S1). Actually, while down‐regulated lncRNAs had higher expression levels in patients with higher percentages of OS/DFS, up‐regulated lncRNAs had lower expression levels in those patients. Furthermore, the expression analysis of these lncRNAs using the GEPIA web server demonstrated obvious differences in the expression of these 19 lncRNAs between normal and cancer samples (Figure 3).
Table 1

LncRNAs with prognostic value in lung adenocarcinoma (LUAD)

lncRNA symbolGene descriptionPrognostic valueLogrank test P‐value
ADAMTS9‐AS2ADAMTS9 antisense RNA 2OS0.00072
C8orf34‐AS1C8orf34 antisense RNA 1OS0.028
CADM3‐AS1CADM3 antisense RNA 1OS0.0016
FAM83A‐AS1FAM83A antisense RNA 1DFS0.0024
FAM83A antisense RNA 1OS3.9e‐05
FENDRRFOXF1 adjacent non‐coding developmental regulatory RNAOS0.0026
LANCL1‐AS1LANCL1 antisense RNA 1OS0.014
LINC00092long intergenic non‐protein coding RNA 92OS0.033
LINC00467long intergenic non‐protein coding RNA 467OS0.0038
LINC00857long intergenic non‐protein coding RNA 857OS0.032
LINC00891long intergenic non‐protein coding RNA 891OS0.0013
LINC00968long intergenic non‐protein coding RNA 968OS0.0021
LINC00987long intergenic non‐protein coding RNA 987OS0.0023
LINC01506long intergenic non‐protein coding RNA 1506OS0.035
MAFG‐AS1MAFG antisense RNA 1 (head to head)OS0.013
MIR497HGmir‐497‐195 cluster host geneOS0.037
RAMP2‐AS1RAMP2 antisense RNA 1DFS0.029
RHOXF1‐AS1RHOXF1 antisense RNA 1OS0.038
RHOXF1 antisense RNA 1DFS0.019
SNHG6small nucleolar RNA host gene 6OS0.014
TBX5‐AS1TBX5 antisense RNA 1OS0.017

Abbreviations: DFS, Disease‐Free Survival; OS, Overall Survival.

Figure 2

Association of lncRNAs with OS in LUAD. The association of (A) ADAMTS9‐AS2, (B) C8orf34‐AS1, (C) CADM3‐AS1, (D) FAM83A‐AS1, (E) FENDRR, (F) LANCL1‐AS1, (G) LINC00092, (H) LINC00467, (I) LINC00857, (J) LINC00891, (K) LINC00968, (L) LINC00987, (M) LINC01506, (N) MAFG‐AS1, (O) MIR497HG, (P) RHOXF1‐AS1, (Q) TBX5‐AS1, (R) SNHG6, lncRNAs with the OS of LUAD patients. TPM is a unit of transcript expression and the abbreviation of Transcripts per Million. The plots were achieved using GEPIA web server

Figure 3

Altered expression of 19 LAProLncRs. The altered expression of (A) ADAMTS9‐AS2, (B) C8orf34‐AS1, (C) CADM3‐AS1, (D) FAM83A‐AS1, (E) FENDRR, (F) LANCL1‐AS1, (G) LINC00092, (H) LINC00467, (I) LINC00857, (J) LINC00891, (K) LINC00968, (L) LINC00987, (M) LINC01506, (N) MAFG‐AS1, (O) MIR497HG, (P) RAMP2‐AS1, (Q) RHOXF1‐AS1, (R) SNHG6, (S) TBX5‐AS1, lncRNAs in LUAD tumour samples. FC and TPM are the abbreviations of Fold‐Change and Transcripts per Million respectively. Box plots were achieved using GEPIA web server

LncRNAs with prognostic value in lung adenocarcinoma (LUAD) Abbreviations: DFS, Disease‐Free Survival; OS, Overall Survival. Association of lncRNAs with OS in LUAD. The association of (A) ADAMTS9‐AS2, (B) C8orf34AS1, (C) CADM3AS1, (D) FAM83AAS1, (E) FENDRR, (F) LANCL1AS1, (G) LINC00092, (H) LINC00467, (I) LINC00857, (J) LINC00891, (K) LINC00968, (L) LINC00987, (M) LINC01506, (N) MAFGAS1, (O) MIR497HG, (P) RHOXF1AS1, (Q) TBX5AS1, (R) SNHG6, lncRNAs with the OS of LUAD patients. TPM is a unit of transcript expression and the abbreviation of Transcripts per Million. The plots were achieved using GEPIA web server Altered expression of 19 LAProLncRs. The altered expression of (A) ADAMTS9‐AS2, (B) C8orf34AS1, (C) CADM3AS1, (D) FAM83AAS1, (E) FENDRR, (F) LANCL1AS1, (G) LINC00092, (H) LINC00467, (I) LINC00857, (J) LINC00891, (K) LINC00968, (L) LINC00987, (M) LINC01506, (N) MAFGAS1, (O) MIR497HG, (P) RAMP2AS1, (Q) RHOXF1AS1, (R) SNHG6, (S) TBX5AS1, lncRNAs in LUAD tumour samples. FC and TPM are the abbreviations of Fold‐Change and Transcripts per Million respectively. Box plots were achieved using GEPIA web server

LAProLncRs are coexpressed with several other genes

The coexpression analysis of lncRNAs indicated that they were significantly coexpressed (PCC > 0.3) with several other DEGs in LUAD (Figure 4A). The coexpression analysis of lncRNAs with other LUAD DEGs also demonstrated that overexpressed and underexpressed lncRNAs do not have common CEGs and tend to cluster in separate modules. According to the topological analysis of the coexpression network, ATIC and JAM2 were identified as the hub nodes in the overexpressed and underexpressed modules respectively. Moreover, 16 of the 19 lncRNAs were significantly coexpressed (PCC > 0.3) with each other of which FENDRR was the one with the highest degree and betweenness centrality in the lncRNAs coexpression network (Figure 4B).
Figure 4

Gene coexpression networks. (A). The network of LAProLncRs and their DECEGs; the yellow nodes represent LAProLncRs. (B) The gene coexpression network of LAProLncRs. The intensity of violet node colour is proportional to betweenness centrality score; stronger violet colour indicates higher betweenness centrality score. Red and Blue node borders are indicative of overexpression and underexpression respectively. The width of node border indicates the GEPIA Log2FC; wider border indicates greater Log2FC. The size of violet nodes and their label font size indicate the node degree; bigger violet node and label font size are indicative of higher node degree. The edge colour and width shows the GEPIA PCC; darker and wider edge is indicative of higher GEPIA PCC. PCC is the abbreviation of Pearson Correlation Coefficient. The networks were reconstructed using the Cytoscape software

Gene coexpression networks. (A). The network of LAProLncRs and their DECEGs; the yellow nodes represent LAProLncRs. (B) The gene coexpression network of LAProLncRs. The intensity of violet node colour is proportional to betweenness centrality score; stronger violet colour indicates higher betweenness centrality score. Red and Blue node borders are indicative of overexpression and underexpression respectively. The width of node border indicates the GEPIA Log2FC; wider border indicates greater Log2FC. The size of violet nodes and their label font size indicate the node degree; bigger violet node and label font size are indicative of higher node degree. The edge colour and width shows the GEPIA PCC; darker and wider edge is indicative of higher GEPIA PCC. PCC is the abbreviation of Pearson Correlation Coefficient. The networks were reconstructed using the Cytoscape software

LAProLncRs are involved in several regulatory biological processes

The gene set enrichment analysis of the DECEGs of LAProLncRs indicated that these lncRNAs might be involved in several regulatory biological processes including cancer‐related ones (Figure 5; Table 2; Table S8). As depicted in the lncRNA‐GO‐BP network, some lncRNAs and their associated GO terms were grouped in modules and might work in common biological processes. While C8orf34AS1 and LINC00467 in Module A were mostly associated with the cellular lipid catabolic processes, SNHG6 and CADM3AS1 were related to the regulation of protein translation, targeting and localization. On the other hand, lncRNAs in Module B were connected with the biological adhesion processes, cell and tissue migration, apoptosis and signalling pathways including Wnt and Notch signalling pathways. Non‐modulated lncRNAs were associated with cancer‐related biological processes as well; LINC00857 was associated with DNA strand elongation and positive regulation of cell proliferation; MAFGAS1 with the regulation of DNA protection, repair and recombination; LINC01506 with the regulation of immune system and responses; RHOXF1AS1 with apoptotic and catabolic processes; LANCL1AS1 with the regulation of lipid metabolic processes and angiogenesis; MIR497HG with the regulation of endothelium development and migration; and FAM83AAS1 and LINC00987 were associated with development and morphogenesis. According to the topological analysis of the lncRNA‐GO‐BP network, GO:0006414 (translational elongation) and GO:0051058 (negative regulation of small GTPase mediated signal transduction) were identified as the hub nodes of Modules A and B respectively. Furthermore, the pathway analysis of Module A and B of the lncRNA‐GO‐BP network demonstrated that the lncRNAs in Module B could have significant synergistic functions (P‐value:0.012; FDR:0.024) in the neuroactive ligand‐receptor interaction pathway (Table 3 and Figure S2).
Figure 5

The lncRNA‐GO‐BP association network. The yellow nodes represent LAProLncRs. The intensity of pink node colour indicates the betweenness centrality score; stronger pink colour indicates higher betweenness centrality score. The node height is proportional to node degree; Bigger node indicates higher node degree. The edge colour is representative of the source database of predicted GO term and the reference tissue type. Please refer to Table 2 for finding the description of GO IDs. The network was reconstructed using the Cytoscape software

Table 2

GO terms associated with LAProLncRs

GO‐BP IDGO‐BP description
GO:0000184nuclear‐transcribed mRNA catabolic process, nonsense‐mediated decay
GO:0000959mitochondrial RNA metabolic process
GO:0002474antigen processing and presentation of peptide antigen via MHC class I
GO:0003012muscle system process
GO:0006119oxidative phosphorylation
GO:0006259DNA metabolic process
GO:0006285base‐excision repair, AP site formation
GO:0006302double‐strand break repair
GO:0006310DNA recombination
GO:0006401RNA catabolic process
GO:0006412translation
GO:0006414translational elongation
GO:0006415translational termination
GO:0006418tRNA aminoacylation for protein translation
GO:0006520cellular amino acid metabolic process
GO:0006613cotranslational protein targeting to membrane
GO:0006614SRP‐dependent cotranslational protein targeting to membrane
GO:0006631fatty acid metabolic process
GO:0006768biotin metabolic process
GO:0007160cell‐matrix adhesion
GO:0007219Notch signalling pathway
GO:0007411axon guidance
GO:0007517muscle organ development
GO:0008284positive regulation of cell proliferation
GO:0008637apoptotic mitochondrial changes
GO:0009062fatty acid catabolic process
GO:0009127purine nucleoside monophosphate biosynthetic process
GO:0009128purine nucleoside monophosphate catabolic process
GO:0009158ribonucleoside monophosphate catabolic process
GO:0009168purine ribonucleoside monophosphate biosynthetic process
GO:0009169purine ribonucleoside monophosphate catabolic process
GO:0010565regulation of cellular ketone metabolic process
GO:0010631epithelial cell migration
GO:0015671oxygen transport
GO:0016042lipid catabolic process
GO:0016054organic acid catabolic process
GO:0018196peptidyl‐asparagine modification
GO:0019058viral life cycle
GO:0019083viral transcription
GO:0022616DNA strand elongation
GO:0030099myeloid cell differentiation
GO:0030336negative regulation of cell migration
GO:0030513positive regulation of BMP signalling pathway
GO:0030879mammary gland development
GO:0031290retinal ganglion cell axon guidance
GO:0031589cell‐substrate adhesion
GO:0031623receptor internalization
GO:0035050embryonic heart tube development
GO:0035136forelimb morphogenesis
GO:0035338long‐chain fatty‐acyl‐CoA biosynthetic process
GO:0042262DNA protection
GO:0042273ribosomal large subunit biogenesis
GO:0042742defense response to bacterium
GO:0043116negative regulation of vascular permeability
GO:0043312neutrophil degranulation
GO:0043534blood vessel endothelial cell migration
GO:0043542endothelial cell migration
GO:0043624cellular protein complex disassembly
GO:0044242cellular lipid catabolic process
GO:0044282small molecule catabolic process
GO:0045047protein targeting to ER
GO:0046395carboxylic acid catabolic process
GO:0046949fatty‐acyl‐CoA biosynthetic process
GO:0048736appendage development
GO:0048738cardiac muscle tissue development
GO:0048844artery morphogenesis
GO:0050919negative chemotaxis
GO:0051056regulation of small GTPase mediated signal transduction
GO:0051058negative regulation of small GTPase mediated signal transduction
GO:0051189prosthetic group metabolic process
GO:0051895negative regulation of focal adhesion assembly
GO:0060173limb development
GO:0060271cilium morphogenesis
GO:0060828regulation of canonical Wnt signalling pathway
GO:0061621canonical glycolysis
GO:0070286axonemal dynein complex assembly
GO:0070972protein localization to endoplasmic reticulum
GO:0072011glomerular endothelium development
GO:0072599establishment of protein localization to endoplasmic reticulum
GO:0090051negative regulation of cell migration involved in sprouting angiogenesis
GO:0090130tissue migration
GO:0090132epithelium migration
GO:0098542defense response to other organism
GO:2000352negative regulation of endothelial cell apoptotic process
GO:2000738positive regulation of stem cell differentiation
GO:2001223negative regulation of neuron migration

Abbreviations: BP, Biological Process; GO, Gene Ontology.

Table 3

LAProLncRs and their coexpressed genes with synergistic function in the neuroactive ligand‐receptor interaction pathway

DECEGLncRNA symbolPCCDysregulation in LUAD
CHRM1FENDRR0.9Down‐regulated
ADRA1AADAMTS9‐AS20.54Down‐regulated
ADRB2ADAMTS9‐AS20.59Down‐regulated
ADRB1FENDRR0.76Down‐regulated
ADRB2FENDRR0.83Down‐regulated
ADRB2LINC000920.58Down‐regulated
ADRB2LINC008910.65Down‐regulated
ADRB2LINC009680.76Down‐regulated
EDNRBADAMTS9‐AS20.56Down‐regulated
EDNRBFENDRR0.92Down‐regulated
EDNRBTBX5‐AS10.73Down‐regulated
EDNRBLINC000920.59Down‐regulated
EDNRBLINC008910.6Down‐regulated
EDNRBLINC009680.79Down‐regulated
NMUR1FENDRR0.88Down‐regulated
NMUR1LINC000920.65Down‐regulated
NMUR1LINC009680.69Down‐regulated
NMUR1RAMP2‐AS10.4Down‐regulated
NMUR1TBX5‐AS10.74Down‐regulated
PTGIRTBX5‐AS10.79Down‐regulated
S1PR1ADAMTS9‐AS20.58Down‐regulated
S1PR1FENDRR0.87Down‐regulated
S1PR1LINC000920.58Down‐regulated
S1PR1LINC009680.79Down‐regulated
RXFP1FENDRR0.88Down‐regulated
RXFP1RAMP2‐AS10.4Down‐regulated
RXFP1LINC009680.74Down‐regulated
CALCRLFENDRR0.93Down‐regulated
CALCRLTBX5‐AS10.74Down‐regulated
CALCRLLINC000920.58Down‐regulated
CALCRLLINC009680.73Down‐regulated
VIPR1FENDRR0.91Down‐regulated
GRIA1ADAMTS9‐AS20.59Down‐regulated
GRIK4ADAMTS9‐AS20.53Down‐regulated
GRIA1FENDRR0.82Down‐regulated
GRIA1TBX5‐AS10.78Down‐regulated
GRIK4TBX5‐AS10.72Down‐regulated
GRIA1LINC000920.67Down‐regulated
GRIK4LINC000920.58Down‐regulated
GRIA1LINC008910.66Down‐regulated
GRIK4LINC008910.59Down‐regulated
GRIA1LINC009680.84Down‐regulated
GRIK4LINC009680.74Down‐regulated

Abbreviations: DECEGs, differentially expressed coexpressed genes; LUAD, lung adenocarcinoma; PCC, Pearson correlation coefficient.

The lncRNA‐GO‐BP association network. The yellow nodes represent LAProLncRs. The intensity of pink node colour indicates the betweenness centrality score; stronger pink colour indicates higher betweenness centrality score. The node height is proportional to node degree; Bigger node indicates higher node degree. The edge colour is representative of the source database of predicted GO term and the reference tissue type. Please refer to Table 2 for finding the description of GO IDs. The network was reconstructed using the Cytoscape software GO terms associated with LAProLncRs Abbreviations: BP, Biological Process; GO, Gene Ontology. LAProLncRs and their coexpressed genes with synergistic function in the neuroactive ligand‐receptor interaction pathway Abbreviations: DECEGs, differentially expressed coexpressed genes; LUAD, lung adenocarcinoma; PCC, Pearson correlation coefficient.

The expression of some of the LAProLncRs is associated with clinicopathological and demographic features

The analyses demonstrated that the expression of nine of the 19 LAProLncRs was significantly associated (P‐value < 0.05) with different stages of LUAD (Figure 6). Also, the violin plots in Figure 6 indicated that the association of these nine lncRNAs with different stages of LUAD is in accordance with the dysregulation of these lncRNAs in TCGA LUAD cancer samples; while up‐regulated lncRNAs usually had higher expression levels in patients with higher tumour stages, down‐regulated lncRNAs had lower expression levels in those patients. Likewise, the expression of four of the 19 LAProLncRs was significantly influenced (P‐value < 0.05) by the smoking habit in LUAD and this association was in accordance with the dysregulation of these lncRNAs in TCGA LUAD cancer samples (Figure 7A‐D). Furthermore, the expression of three of the 19 LAProLncRs was significantly associated (P‐value < 0.05) with the gender of LUAD patients (Figure 7E‐G). However, multivariate Cox regression analyses indicated that while the expression level of most of the LAProLncRs and the stage of patients are independent prognostic factors, gender and smoking history do not have independent prognostic value for LUAD (Table 4).
Figure 6

The expression‐stage plot of LAProLncRs. (A) The expression‐stage plot of ADAMTS9‐AS2 lncRNA. (B) The expression‐stage plot of FAM83A‐AS1 lncRNA. (C) The expression‐stage plot of LANCL1‐AS1 lncRNA. (D) The expression‐stage plot of LINC00092 lncRNA. (E) The expression‐stage plot of LINC00857 lncRNA. (F) The expression‐stage plot of LINC00891 lncRNA. (G) The expression‐stage plot of MIR497HG lncRNA. (H) The expression‐stage plot of SNHG6 lncRNA. (I) The expression‐stage plot of TBX5‐AS1 lncRNA. The plots were achieved by the GEPIA web server

Figure 7

The impact of smoking habit and gender on the expression of LAProLncRs. (A) The impact of smoking habit on ADAMTS9‐AS2 expression in LUAD. (B) The impact of smoking habit on FAM83A‐AS1 expression in LUAD. (C) The impact of smoking habit on MAFG‐AS1 expression in LUAD. (D) The impact of smoking habit on SNHG6 expression in LUAD. (E) The impact of gender on LINC00092 expression in LUAD. (F) The impact of gender on SNHG6 expression in LUAD. (G) The impact of gender on LINK00467 expression in LUAD. The plots were obtained from the Lung Cancer Explorer

Table 4

Prognostic value of LAProLncRs with adjustments for clinicopathological features of patients

LncRNADysregulation (Up/Down)Overall survivala Multivariate analysis
CovariateHR95%CI P‐value
ADAMTS9‐AS2DownLowerLncRNA expression0.230.11‐0.50.0002
Stage2.81.39‐5.630.0039
Gender1.10.42‐2.870.8426
Smoking history0.90.34‐2.380.832
FENDRRDownLowerLncRNA expression0.10.01‐0.740.0242
Stage3.421.73‐6.730.0004
Gender1.070.44‐2.580.8819
Smoking history0.930.38‐2.280.88
LINC00092DownLowerLncRNA expression0.260.11‐0.620.0022
Stage3.41.71‐6.770.0005
Gender1.190.46‐3.080.7208
Smoking history1.150.43‐3.090.7758
LINC00467UpHigherLncRNA expression0.420.17‐1.030.058
Stage3.61.81‐7.160.0003
Gender1.070.43‐2.660.8839
Smoking history0.720.29 ‐ 1.840.4969
LINC00857UpHigherLncRNA expression2.781.4‐5.510.0034
Stage3.741.9‐7.380.0001
Gender1.170.45‐3.060.7474
Smoking history0.90.34‐2.390.8268
LINC00968DownLowerLncRNA expression0.290.12‐0.70.0055
Stage2.61.27‐5.340.0092
Gender1.220.5‐2.970.6559
Smoking history1.070.43‐2.660.8775
MAFG‐AS1UpHigherLncRNA expression1.710.8‐3.680.1693
Stage4.182.12‐8.230
Gender1.050.42‐2.610.9143
Smoking history0.750.3‐1.90.5491
MIR497HGDownLowerLncRNA expression0.620.32‐1.220.1637
Stage3.751.9‐7.420.0001
Gender1.150.45‐2.920.7695
Smoking history0.820.32‐2.090.6722
RAMP2‐AS1DownLowerLncRNA expression0.330.15‐0.730.006
Stage3.61.82‐7.120.0002
Gender1.240.48‐3.230.6542
Smoking history0.910.34‐2.390.8417
SNHG6Upb HigherLncRNA expression0.520.26‐1.040.0638
Stage3.881.97‐7.650.0001
Gender1.250.5‐3.140.6297
Smoking history0.870.34‐2.190.7626
TBX5‐AS1DownLowerLncRNA expression0.210.09‐0.490.0004
Stage2.571.27‐5.190.0086
Gender1.320.52‐3.340.5632
Smoking history1.130.43‐2.930.8041

Abbreviations: CI, Confidence Interval; HR, Hazard Ratio.

Expression level of the lncRNA in LUAD patients with lower overall survival (OS) compared to patients with higher OS.

Higher expression in tumour vs normal samples but with no significant differential expression.

The expression‐stage plot of LAProLncRs. (A) The expression‐stage plot of ADAMTS9‐AS2 lncRNA. (B) The expression‐stage plot of FAM83AAS1 lncRNA. (C) The expression‐stage plot of LANCL1AS1 lncRNA. (D) The expression‐stage plot of LINC00092 lncRNA. (E) The expression‐stage plot of LINC00857 lncRNA. (F) The expression‐stage plot of LINC00891 lncRNA. (G) The expression‐stage plot of MIR497HG lncRNA. (H) The expression‐stage plot of SNHG6 lncRNA. (I) The expression‐stage plot of TBX5AS1 lncRNA. The plots were achieved by the GEPIA web server The impact of smoking habit and gender on the expression of LAProLncRs. (A) The impact of smoking habit on ADAMTS9‐AS2 expression in LUAD. (B) The impact of smoking habit on FAM83AAS1 expression in LUAD. (C) The impact of smoking habit on MAFGAS1 expression in LUAD. (D) The impact of smoking habit on SNHG6 expression in LUAD. (E) The impact of gender on LINC00092 expression in LUAD. (F) The impact of gender on SNHG6 expression in LUAD. (G) The impact of gender on LINK00467 expression in LUAD. The plots were obtained from the Lung Cancer Explorer Prognostic value of LAProLncRs with adjustments for clinicopathological features of patients Abbreviations: CI, Confidence Interval; HR, Hazard Ratio. Expression level of the lncRNA in LUAD patients with lower overall survival (OS) compared to patients with higher OS. Higher expression in tumour vs normal samples but with no significant differential expression.

DISCUSSION AND CONCLUSION

The aim of this investigation was to systematically examine the diagnostic and predictive value of lncRNAs for LUAD and to annotate their functions by employing a bioinformatic and systems biology approach. Recently, the focus of cancer investigations has shifted from protein‐coding genes to non‐coding transcripts especially lncRNAs given their diverse regulatory roles on gene expression at the transcriptional and post‐transcriptional levels. Although the association of tens of lncRNAs with LUAD has been previously reported, most of them are not annotated and their functions in LUAD development have not been deciphered. Dysregulation of eight of the 19 LAProLncRs including ADAMTS9‐AS2,51 FENDRR,52 LINC00968,53 RAMP2AS1,54 SNHG6,55 LINC00092,22 FAM83A‐AS122 and TBX5‐AS156 in LUAD has been previously demonstrated. Though, the prognostic value of most of these lncRNAs for LUAD patients has not been evaluated. Similarly, the differential abundance of lncRNAs in the plasma of normal and LUAD specimens, which could be used as signatures for diagnosis and prognosis of LUAD, has not yet been examined. Altogether, the detection and characterization of LAProLncRs could help early diagnosis, prognosis and treatment of patients with this deadly disease. In this study, we addressed all of the above issues. Circulating lncRNAs have recently emerged as novel cancer biomarkers for diagnostic and prognostic purposes. For instance, the circulating lncRNAs NEAT1, ANRIL and SPRY4‐IT1 have been suggested as new diagnostic biomarkers for NSCLC.5 It has been reported that the dysregulation of lncRNAs in plasma is in accordance with their dysregulation in the source tumour tissue.16 Thus, according to our RNA‐Seq data analysis, dysregulated lncRNAs in NSCLC plasma samples could be used for diagnostic purposes and might play important roles in NSCLC development. However, there is a delicate point that is yet ignored regarding the abundance of circulating RNAs. Circulating RNAs originate as a result of different cellular events especially apoptosis and escape from the enzymatic degradation via absorption by extracellular vesicles including apoptotic bodies.57, 58 Also, it is well known that cancer cells evade apoptosis by employing different strategies.59 Consequently, circulating RNAs might have a lower abundance in the plasma of cancer specimens relative to normal ones. Altogether, further studies on the diagnostic and prognostic value of all 109 circulating lncRNAs might represent new potential circulating lncRNA biomarkers as tools for early diagnosis, prognosis and monitoring of treatment response for NSCLC patients. According to our results, one of the 19 LAProLncRs, namely SNHG6, was also dysregulated in NSCLC plasma samples and could possibly be used as a diagnostic and/or prognostic biomarker for non‐invasive detection and treatment monitoring of LUAD. In addition, other 18 LAProLncRs could be used as diagnostic and/or prognostic biomarkers in clinical practice as well. Notably, the association of six of the 19 LAProLncRs with lung cancer has been previously confirmed by microarray studies (Table 5). However, there is no report about the other 13 LAProLncRs.
Table 5

The association of six lncRNAs with lung cancer based on microarray studies

lncRNA symbolCancer subtypeMethodsReference
ADAMTS9‐AS2NSCLCMicroarray 60
FENDRRLUADqRT‐PCR and RNA‐FISH 52
NSCLCMicroarray 60
LUSCRNA‐seq and Microarray 33
LINC00857Lung cancerMicroarray 61
LINC00968NSCLCMicroarray 60
LUSCRNA‐seq and Microarray 33
LINC00987LUADMicroarray 62
MAFG‐AS1NSCLCMicroarray 60

Abbreviations: NSCLC: Non‐Small Cell Lung Cancer; LUSC: Lung Squamous Cell carcinoma; LUAD: Lung Adenocarcinoma; qRT‐PCR: Quantitative Reverse Transcription‐Polymerase Chain Reaction; RNA‐FISH: Fluorescent In situ Hybridization Targeting Ribonucleic Acid Molecules

The association of six lncRNAs with lung cancer based on microarray studies Abbreviations: NSCLC: Non‐Small Cell Lung Cancer; LUSC: Lung Squamous Cell carcinoma; LUAD: Lung Adenocarcinoma; qRT‐PCR: Quantitative Reverse Transcription‐Polymerase Chain Reaction; RNA‐FISH: Fluorescent In situ Hybridization Targeting Ribonucleic Acid Molecules Considering the guilt by association principle, CEGs might share common features. Although application of this principle could readily help prediction of functions and features of unknown genes in normal cells and tissues, employing this principle to annotate unknown genes in diseased cells is somewhat challenging. Regarding the association of genes and cancer progression, non‐differentially expressed genes might not play an active and direct role in the development of tumours. In other words, DE‐lncRNAs have common features with their DECEGs and not all of their CEGs. This is a key point that has been ignored in almost all of the studies by far, which may mislead the authors and result in incorrect outcomes and interpretations. Accordingly, we only considered those CEGs of LAProLncRs that were differentially expressed as well, namely DECEGs, in all of the analyses. The results of coexpression analyses illustrate that the overexpressed and underexpressed LAProLncRs are independently clustered with their DECEGs in two big modules. This implies that overexpressed and underexpressed LAProLncRs are involved in different biological processes and networks. Also, hub nodes in the coexpression networks might be driving cancer genes and potential drug targets for the treatment of LUAD. The genes ATIC and JAM2 were identified as the hub nodes in the overexpressed and underexpressed LAProLncRs modules respectively. According to the IGDB.NSCLC database,63 the dysregulation of ATIC and JAM2 in LUAD is confirmed by microarray studies as well. ATIC can be translocated with ALK, a potential target for the treatment of NSCLC.64 Also, Pemetrexed, an approved drug for unresectable and metastatic non‐squamous NSCLC, is an antifolate that inhibits the products of ATIC and some other genes.65 Also, polymorphisms in ATIC, rs12995526 for instance, could impact on the therapeutic efficacy of Pemetrexed‐treated patients with LUAD.66 Moreover, inhibition of ATIC or its knockdown by small interfering‐RNA (siRNA) is a novel chemoradiosensitization strategy which might enhance the treatment efficacy of LUAD patients.67 JAM2 is a multifunctional transmembrane protein and is involved in the regulation of diverse cellular processes such as cell growth, proliferation, angiogenesis and tumour metastasis. It is reported that JAM2 is down‐regulated in NSCLC68 and LUAD.69 Also, Tian et al demonstrated that JAM2, ADARB1, FENDRR and some other LUAD DEGs might synergistically function in the tumourigenesis of stage I LUAD.70 Furthermore, Glen et al reported that JAM2 could be targeted for the treatment of NSCLC.71 In addition, according to the topological features of the coexpression network of LAProLncRs with each other, FENDRR is the most important node and might essentially contribute to the tumourigenesis of LUAD. The gene set enrichment analysis demonstrated that most of the LAProLncRs and their associated GO‐BP terms are clustered in two modules. The LAProLncRs in Module A are mostly related to protein and lipid regulatory processes especially lipid catabolic processes. Specific lipids play important roles in endoplasmic reticulum stress, intracellular oncogenic signalling and the relation between cancer cells and cells of the tumour microenvironment.72 Also, it has been shown that the aberrant lipid metabolism promotes prostate cancer73 and blocking of the lipid catabolism decreases prostate tumour growth.72 In addition, GO:0006414 (translational elongation) is the hub GO‐BP term in Module A and is common among CADM3AS1, LINC00467 and SNHG6. Translation elongation factors play significant roles in cancer development in a cancer‐specific manner. Also, their overexpression predicts poor prognosis in lung cancer.74 The LAProLncRs in Module B of the lncRNA‐GO‐BP network are associated with well‐known cancer‐related biological processes and signalling pathways and GO:0051058 (negative regulation of small GTPase mediated signal transduction) was identified as the hub GO‐BP term in this module. Members of the Rab family of small GTPase superfamily are essential factors in tumourigenesis75 and their up‐regulation is associated with poor prognosis and aggressiveness of lung, breast, ovarian, renal and other cancers.76 Actually, they play essential roles in the regulation of metabolism, cell‐cell adhesion and cell proliferation and migration,77 which are concordant with other GO‐BP terms in Module B. Non‐modulated LAProLncRs are connected with cancer‐related GO‐BP terms as well. These data provide insights into the functional roles of LAProLncRs in LUAD tumourigenesis. In addition to hub GO‐BP terms, the shared terms between LUAD and normal lung tissues (blue edges in Figure 5) should also be considered with a higher priority in future studies. Also, the pathway enrichment analysis demonstrated that the LAProLncRs in Module B might synergistically function in the neuroactive ligand‐receptor interaction pathway. The neuroactive ligand‐receptor interaction is a methylation‐enriched pathway78 which its association with LUSC,79 osteosarcoma,80 breast cancer,81 colon cancer,82 pancreatic cancer83 and hepatocellular carcinoma78 has been previously reported. However, further studies are required to decode the precise functional roles of Module B LAProLncRs in this pathway as well as the association of this pathway with LUAD. The clinicopathological and demographic analyses indicated that the expression level of some of the LAProLncRs is associated with cancer stage, sex and smoking habits in LUAD patients. Besides, the tumour stage is negatively correlated with survival period of NSCLC patients.84 This implies that the expression level of LAProLncRs could be used as an additional signature for distinguishing between different stages and consequently predicting the survival period of LUAD patients. Also, results of the association analysis of smoking habit with the expression of LAProLncRs are consistent with the results of differential expression analysis. Actually, while down‐regulated LAProLncRs have a lower expression level in smoker LUAD patients, up‐regulated LAProLncRs have a higher expression level in such patients compared with non‐smoker LUAD patients. Furthermore, based on the results of demographic analyses, men might be more vulnerable to LUAD. On the other hand, adjustment of the survival analysis of the expression of LAProLncRs for clinicopathological features of LUAD patients demonstrated that while gender and smoking history are not independent prognostic factors, tumour stage of the patients and the expression level of most of the LAProLncRs have independent prognostic value in LUAD (Table 4). Collectively, we conducted the most comprehensive systematic analysis and functional annotation, by far, on the prognostic lncRNAs of LUAD and presented 19 lncRNAs as novel LAProLncRs. Several novel biomarkers and drug targets were suggested which might open up new avenues for the early diagnosis, prognosis and treatment of LUAD patients. Also, our research lays the groundwork for the design of the next studies. However, we faced several limitations in this study that should be noticed in future studies. As we used available online tools with default options in several steps of the project, investigation of the expression level and coexpression of LAProLncRs and their CEGs in different contexts such as age, gender, smoking habit and tumour stage and simultaneous consideration of all of these conditions was not possible. Also, the number of normal and NSCLC plasma samples that we had access to was too low and consequently our results regarding the differential abundance of lncRNAs in the blood might not be robust enough. Additionally, further in silico, in vitro and in vivo assays are required to evaluate the potential of LAProLncRs as biomarkers and/or drug targets for LUAD patients.

CONFLICT OF INTEREST

The authors have no conflict of interest.

AUTHOR CONTRIBUTIONS

AS: Designed the bioinformatic and systems biology analyses, performed all of the analyses and wrote the first manuscript draft. ZR: Supervised the whole study and revised the final version of the manuscript. AN: Designed the bioinformatic and systems biology analyses, supervised the whole study and revised the final version of the manuscript. All authors read and approved the final manuscript. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  84 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Effective similarity measures for expression profiles.

Authors:  Golan Yona; William Dirks; Shafquat Rahman; David M Lin
Journal:  Bioinformatics       Date:  2006-04-04       Impact factor: 6.937

Review 3.  Long non-coding RNAs in cancer invasion and metastasis.

Authors:  Xiao-han Shen; Peng Qi; Xiang Du
Journal:  Mod Pathol       Date:  2014-06-13       Impact factor: 7.842

4.  Regulatory interactions between long noncoding RNA LINC00968 and miR-9-3p in non-small cell lung cancer: A bioinformatic analysis based on miRNA microarray, GEO and TCGA.

Authors:  Dong-Yao Li; Wen-Jie Chen; Jun Shang; Gang Chen; Shi-Kang Li
Journal:  Oncol Lett       Date:  2018-04-12       Impact factor: 2.967

Review 5.  Evading apoptosis in cancer.

Authors:  Kaleigh Fernald; Manabu Kurokawa
Journal:  Trends Cell Biol       Date:  2013-08-16       Impact factor: 20.808

6.  Identification and validation of long noncoding RNA biomarkers in human non-small-cell lung carcinomas.

Authors:  Hui Yu; Qinghua Xu; Fang Liu; Xun Ye; Jialei Wang; Xia Meng
Journal:  J Thorac Oncol       Date:  2015-04       Impact factor: 15.609

7.  SNHG6 functions as a competing endogenous RNA to regulate E2F7 expression by sponging miR-26a-5p in lung adenocarcinoma.

Authors:  Rui Liang; Guodong Xiao; Meng Wang; Xiang Li; Yuan Li; ZengQian Hui; Xin Sun; Sida Qin; Boxiang Zhang; Ning Du; Dapeng Liu; Hong Ren
Journal:  Biomed Pharmacother       Date:  2018-09-01       Impact factor: 6.529

8.  Insights into pancreatic cancer etiology from pathway analysis of genome-wide association study data.

Authors:  Peng Wei; Hongwei Tang; Donghui Li
Journal:  PLoS One       Date:  2012-10-04       Impact factor: 3.240

9.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

10.  Folic-acid metabolism and DNA-repair phenotypes differ between neuroendocrine lung tumors and associate with aggressive subtypes, therapy resistance and outcome.

Authors:  Robert Fred Henry Walter; Fabian Dominik Mairinger; Robert Werner; Claudia Vollbrecht; Thomas Hager; Kurt Werner Schmid; Jeremias Wohlschlaeger; Daniel Christian Christoph
Journal:  Oncotarget       Date:  2016-04-12
View more
  5 in total

1.  Survival analysis and functional annotation of long non-coding RNAs in lung adenocarcinoma.

Authors:  Abbas Salavaty; Zahra Rezvani; Ali Najafi
Journal:  J Cell Mol Med       Date:  2019-06-18       Impact factor: 5.310

2.  Systematic construction and validation of an immune prognostic model for lung adenocarcinoma.

Authors:  Chenghan Luo; Mengyuan Lei; Yixia Zhang; Qian Zhang; Lifeng Li; Jingyao Lian; Shasha Liu; Liping Wang; Guofu Pi; Yi Zhang
Journal:  J Cell Mol Med       Date:  2019-11-28       Impact factor: 5.310

3.  The construction and analysis of ceRNA network and patterns of immune infiltration in lung adenocarcinoma.

Authors:  Jinglong Li; Wenyao Liu; Xiaocheng Dong; Yunfeng Dai; Shaosen Chen; Enliang Zhao; Yunlong Liu; Hongguang Bao
Journal:  BMC Cancer       Date:  2021-11-16       Impact factor: 4.430

4.  Development and validation of the potential biomarkers based on m6A-related lncRNAs for the predictions of overall survival in the lung adenocarcinoma and differential analysis with cuproptosis.

Authors:  Chen Gao; Ning Kong; Fan Zhang; Liuzhi Zhou; Maosheng Xu; Linyu Wu
Journal:  BMC Bioinformatics       Date:  2022-08-08       Impact factor: 3.307

5.  Characterization of a non-coding RNA-associated ceRNA network in metastatic lung adenocarcinoma.

Authors:  Feifei Fan; Yu Ping; Li Yang; Xiaoran Duan; Nomathamsanqa Resegofetse Maimela; Bingjie Li; Xiangnan Li; Jing Chen; Kai Zhang; Liping Wang; Shasha Liu; Xuan Zhao; Hongmin Wang; Yi Zhang
Journal:  J Cell Mol Med       Date:  2020-08-29       Impact factor: 5.310

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.