Literature DB >> 28977863

Clinical roles of the aberrantly expressed lncRNAs in lung squamous cell carcinoma: a study based on RNA-sequencing and microarray data mining.

Wen-Jie Chen1, Rui-Xue Tang2, Rong-Quan He3, Dong-Yao Li1, Liang Liang4, Jiang-Hui Zeng1, Xiao-Hua Hu3, Jie Ma3, Shi-Kang Li1, Gang Chen2.   

Abstract

Lung squamous cell carcinoma (LUSC) accounts for a significant proportion of lung cancer and there have been few therapeutic alternatives for recurrent LUSC due to the lack of specific driver molecules. To investigate the prospective role of lncRNAs in the tumorigenesis and progression of LUSC, the aberrantly expressed lncRNAs were calculated based on The Cancer Genome Atlas RNA-seq data. Of 7589 lncRNAs with 504 LUSC cases, 884 lncRNAs were identified as being aberrantly expressed (|log2 fold change| >2 and adjusted P<0.05) by DESeq R. The top 10 lncRNAs with the highest diagnostic value were SFTA1P,LINC00968, LINC00961, LINC01572,RP1-78O14.1, FENDRR, LINC01314,LINC01272, GATA6-AS1, and MIR3945HG. In addition to the significant roles in the carcinogenesis of LUSC, several lncRNAs also played vital parts in the survival and progression of LUSC. SFTA1P, LINC01272, GATA6-AS1 and MIR3945HG were closely related to the survival time of LUSC. Furthermore, LINC01572 and LINC01314 could distinguish the LUSC at early stage from that at advanced stage. The prospective molecular assessment of key lncRNAs showed that a certain series of genes could be involved in the regulation network. Furthermore, the OncoPrint from cBioPortal indicated that 14% (69/501) LUSC cases with genetic alterations could be obtained, including amplification, deep deletion and mRNA upregulation. More interestingly, the cases with genetic alterations had a poorer survival as compared to those without alterations. Overall, the study propounds a potentiality for interpreting the pathogenesis and development of LUSC with lncRNAs, and provides a novel platform for searching for more capable diagnostic biomarkers for LUSC.

Entities:  

Keywords:  LUSC; TCGA; biomarker; lncRNAs; tumorigenesis

Year:  2017        PMID: 28977863      PMCID: PMC5617423          DOI: 10.18632/oncotarget.18058

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

Lung cancer is the one of the leading causes of cancer deaths in the world. Among all lung cancers, more than 85% are categorized as non-small cell lung cancer (NSCLC), of which lung squamous cell carcinoma (LUSC) accounts for an approximate proportion of 30% [1-6]. Different from lung adenocarcinoma (LUAD), LUSC starts in squamous cells, which are slim, flat cells from histology, which look like fish scales. More importantly, the genetic and epigenetic profiles in the process of tumorigenesis and development vary strikingly between LUAD and LUSC [7-10]. There is a wide range of pivotal molecules verified for LUAD, which leads to great therapeutic improvement for recurrent or unresectable LUAD. Instead, there have been few therapeutic alternatives for recurrent LUSC due to the lack of specific driver molecules or mutations [11-15]. Hence, accurate indicators in the tumorigenesis and development of LUSC are urgently required. To date, a number of prospective markers for LUSC have been identified; however, the pathogenesis of LUSC is sophisticated. Furthermore, sensitive and specific markers are lacking to identify LUSC in the early stage. Long non-coding RNAs (lncRNAs) have arisen as new master regulators of initiation, progression, and response to specific therapies in a broad variety of solid and hematological neoplasms [16-18]. LncRNAs have also been demonstrated to gain various functions in tumorigenesis of lung cancer. However, most of the studies concerned the general NSCLC, but few focused on LUSC [19]. Thus, identification of LUSC-related lncRNAs, and investigation of their clinical roles and molecular mechanisms are essential for understanding the development and progression of LUSC. The Cancer Genome Atlas (TCGA) database of LUSC has facilitated the analysis on the high throughput data of various genomic alterations, including non-coding RNAs. The aberrantly expressed genes were identified for LUSC based on TCGA data and those genes that highly mutated were highlighted [20]. The clinical role of the most significantly altered microRNAs was also studied in TCGA LUSC cohort [21]. Most recently, the lncRNA alteration frequencies, but not the expression levels, were investigated by cBioPortal with 504 cases of LUSC, as well as LUAD from TCGA database [22]. Another study also compared the lncRNA profiling in LUAD and LUSC with data from TCGA and Gene Expression Omnibus (GEO). However, the concern of this study was the distinct lncRNA expression pattern between LUAD and LUSC. Furthermore, only the paired tissue samples of RNA-sequencing (RNA-Seq) from TCGA (16 pairs) were analyzed. Even the authors validated their findings with microarray data from GEO (GSE19188), only a small number of cases were involved [23]. Thus, in the current study, we calculated the 884 aberrantly expressed lncRNAs from 7589 lncRNAs in 502 LUSC cases. We further selected the top 10 lncRNAs to evaluate their clinicopathological value and potential mechanism for LUSC.

RESULTS

Aberrantly expressed lncRNAs based on TCGA data in LUSC

The expression level of each lncRNA transformed with log2 was calculated by DESeq R. Following the calculating criteria, we achieved 884 aberrantly expressed lncRNAs (Figure 1) in LUSC, including 669 highly and 215 lowly expressed lncRNAs. All the aberrantly expressed lncRNAs were sent for ROC analysis and we listed the top 75 lncRNAs obtaining over 0.95 for the area under ROC curve (AUC) (Table 1), which demonstrated that these lncRNAs might play essential roles in the occurrence of LUSC and had high diagnostic value for LUSC patients.
Figure 1

Volcano plot of the aberrantly expressed lncRNAs between LUSC and para-tumorous lung tissues

Red dots indicate high expression and green dots indicate low expression of lncRNAs. Black dots show the lncRNAs with expression of |log2FC|<2. The X axis represents an adjusted FDR and the Y axis represents the value of log2FC. Aberrantly expressed lncRNAs were calculated by DESeq R. Altogether, 669 high and 215 low expressed lncRNAs were achieved. This volcano plot was conducted by the ggplot2 package of R language.

Table 1

Analysis results of 75 lncRNAs gaining the most significant diagnostic value for LUSC (AUC >0.95)

LncRNAAUCFCLog2FCP-valueAdjusted P-value
SFTA1P0.9984150.041652365-4.5854577853.1E-1001.3E-96
LINC009680.9973980.04726163-4.4031868051.18E-385.55E-36
LINC009610.9965850.100222391-3.3187232361.07E-181.01E-16
LINC015720.9963419.6269530563.2670792556.51E-079.65E-06
RP1-78O14.10.9951220.054413873-4.1998816716E-362.53E-33
FENDRR0.9941050.05863219-4.092163256.64E-741.4E-70
LINC013140.9939830.047958874-4.3820583922.62E-421.85E-39
LINC012720.9921940.122249416-3.0321005234.58E-422.76E-39
GATA6-AS10.9917880.105033789-3.2510745851.82E-128.09E-11
MIR3945HG0.9914630.050535778-4.3065510636.68E-281.34E-25
LINC006070.9909750.121229481-3.0441875182.48E-121.05E-10
PCAT190.9907720.128341572-2.9619395412.91E-318.19E-29
AC018647.30.990690.077664175-3.6866069223.88E-121.56E-10
RP11-108L7.150.9902848.6873130323.1189100233.92E-050.00038
AC006273.40.988170.125905728-2.9895841731.1E-071.86E-06
LINC007020.9873570.115641102-3.1122738341.09E-191.18E-17
AC109642.10.9872750.091201867-3.4547928383.57E-321.16E-29
LINC011970.9860560.155580728-2.6842647296.63E-091.42E-07
CTB-193M12.50.9854055.1562488862.3663219037.43E-112.45E-09
LINC005110.98512116.331220574.0295607144.14E-277.29E-25
RP11-672A2.40.9847960.1040007-3.2653348591.55E-138.08E-12
RP11-434D9.10.9828030.073633018-3.7635033584.61E-163.04E-14
LINC002610.982560.095008585-3.3957983011.02E-113.81E-10
C14orf1320.9808110.188757476-2.4053943116.34E-281.34E-25
FAM83H-AS10.9806498.2124324063.0378095912.14E-181.89E-16
Z83851.40.9790636.0002544722.5850236873.61E-086.63E-07
RP11-532F6.30.9772750.195894607-2.3518504112.53E-095.97E-08
SLC2A1-AS10.97658310.247419123.3571886997.82E-102.12E-08
RP11-161I6.20.97631965.932397746.0429156438.46E-176.38E-15
LINC012900.9750790.187150523-2.4177290147.79E-069.09E-05
RP11-796E10.10.97487654.560869655.7697947355.57E-091.23E-07
RP11-513N24.10.9744290.174994013-2.5146225343E-063.8E-05
RP11-401P9.40.9741440.176841689-2.4994696762.8E-085.26E-07
AC068831.160.97400235.543458965.1515121818.37E-071.19E-05
AC007405.40.9737780.145766461-2.7782692814.91E-091.09E-07
LINC004720.9734940.215066069-2.2171481647.91E-071.14E-05
OGFRP10.9734536.1080439222.6107104366.7E-067.99E-05
RP5-1159O4.20.9730060.207307461-2.2701560582.37E-050.000245
RP11-560J1.20.9728236.1698015612.6252240890.0002370.001837
CTD-2527I21.150.97219397.724162716.6106434141.59E-212.03E-19
RP11-540A21.20.9721126.6737446232.7384964821.68E-050.000178
CASC90.971827190.55301927.5740486572.21E-482.33E-45
RP11-12G12.70.9714615.0659398342.3408299438.1E-102.15E-08
RP11-613D13.80.9710140.069766914-3.8413131621.81E-181.66E-16
RP11-245D16.40.9707296.4893515732.6980743292.55E-050.000261
RP11-473M20.90.9705670.227433967-2.136480362.47E-073.9E-06
RP4-758J18.130.9703234.0080780312.0029105961.49E-050.000162
LINC005190.97026264.450338316.0101160268.24E-281.58E-25
RP11-435O5.20.9682094.2290371762.0803292434.79E-050.00045
RP11-396C23.20.9674368.2428770493.0431479761.3E-061.77E-05
RP11-284N8.30.9662570.199304014-2.3269573271.85E-181.66E-16
RP11-236L14.20.9660950.205476098-2.282957514.32E-050.000413
PVT10.9658515.3934695492.4312136391.1E-114.05E-10
AC005537.20.9645736.05381935.1720801921.28E-169.04E-15
AC006273.50.9603630.163091113-2.6162499259.01E-102.33E-08
CTD-2626G11.20.9599150.137024821-2.867490851.73E-151.03E-13
CTD-2245E15.30.959550.188508673-2.407297199.76E-081.67E-06
RP11-344B5.20.9590620.248593902-2.0081371851.43E-061.93E-05
RP11-624L4.10.95889913.30363483.7337485661.33E-126.18E-11
CTA-989H11.10.9569075.5578999992.4745398774.67E-050.000439
RP11-353N14.20.95678515.466875693.9511098966.61E-067.91E-05
CARMN0.9557280.249734448-2.0015332553.89E-087.05E-07
AC006129.10.9554030.174046529-2.5224550543.18E-063.99E-05
RP11-776H12.10.95532255.445504715.7929985922.66E-203.03E-18
RP11-244M2.10.95507827.414098454.7768461241.56E-159.48E-14
RP13-463N16.60.95446893.402911456.5453956163.44E-131.71E-11
RP11-546J1.10.9538995.8244591362.5421240860.0033030.01729
MIR100HG0.9537770.20365128-2.2958272141.89E-050.0002
RP11-1038A11.30.9533727.313915954.771564264.97E-184.28E-16
RP11-429J17.70.9528015.7634018122.5269206050.0003580.002668
RP11-357P18.20.9519470.123202366-3.0208981385.65E-091.24E-07
RP5-899E9.10.9518250.246349573-2.0212211260.0001070.000915
RP4-616B8.50.9505246.5465081942.7107256010.0008940.005807
LINC009240.9501590.185515247-2.4303903342.2E-062.86E-05
RP11-7F17.30.9500370.203290027-2.2983886538.14E-069.4E-05

FC: fold change

Volcano plot of the aberrantly expressed lncRNAs between LUSC and para-tumorous lung tissues

Red dots indicate high expression and green dots indicate low expression of lncRNAs. Black dots show the lncRNAs with expression of |log2FC|<2. The X axis represents an adjusted FDR and the Y axis represents the value of log2FC. Aberrantly expressed lncRNAs were calculated by DESeq R. Altogether, 669 high and 215 low expressed lncRNAs were achieved. This volcano plot was conducted by the ggplot2 package of R language. FC: fold change

Clinical value of the top 10 aberrantly expressed lncRNAs in LUSC

The top 10 aberrantly expressed lncRNAs (Table 2) were selected for further analysis, including Surfactant associated 1 (SFTA1P), LINC00968, LINC00961, LINC01572, RP1-78O14.1, FOXF1 adjacent non-coding developmental regulatory RNA (FENDRR), LINC01314, LINC01272, GATA6-AS1, and MIR3945HG. The level of LINC01572 was remarkably higher in the LUSC than that in the para-tumorous lung tissues. On the contrary, the other nine lncRNAs were all obviously downregulated in LUSC tissues (Figure 2). All these 10 aberrantly expressed lncRNAs showed high diagnostic values to distinguish LUSC from non-cancerous lung tissues with AUC all more than 0.99 (Figure 3). Survival analyses showed that SFTA1P, LINC01272, GATA6-AS1 and MIR3945HG were significantly related to the survival time of LUSC (Figure 4). Further, the multivariate cox analysis showed that SFTA1P might be an independent prognostic indicator for LUSC (P=0.019, Supplementary Table 1). When concerning the relationship between these 10 lncRNAs and the progression of LUSC, several lncRNAs were closely related to some clinical parameters of LUSC (Table 3, Figure 5). Especially, the level of LINC01572 and LINC01314 could distinguish the LUSC patients in early-stage from the advanced-stage. Original data of FGFR1 was extracted from TCGA platform. Significantly positive correlations were noted between FGFR1 and ten-lncRNA (Figure 6).
Table 2

Characteristics of top 10 LncRNAs by the AUC size ranking

LncRNAEnsembleLocationRegulationFCAUCCIP-value
SFTA1PENSG0000022538310p14Down0.0416523650.99840.996, 1.000<0.001
LINC00968ENSG000002464308q12.1Down0.047261630.99740.995, 1.000<0.001
LINC00961ENSG000002353879p13.3Down0.1002223910.99660.993, 1.000<0.001
LINC01572ENSG0000026100816q22.2Up9.6269530560.99630.992, 1.000<0.001
RP1-78O14.1ENSG0000025789412q21.2Down0.0544138730.99510.990, 1.000<0.001
FENDRRENSG0000026838816q24.1Down0.058632190.99410.989, 0.999<0.001
LINC01314ENSG0000025941715q25.1Down0.0479588740.99400.989, 0.999<0.001
LINC01272ENSG0000022439720q13.13Down0.1222494160.99220.985, 0.999<0.001
GATA6-AS1ENSG0000026601018q11.2Down0.1050337890.99180.985, 0.998<0.001
MIR3945HGENSG000002512304q35.1Down0.0505357780.99150.983, 0.999<0.001

FC: fold change; AUC: area under the curve

CI: confidence interval

Figure 2

Different expression of the top 10 lncRNAs between LUSC and para-tumorous lung tissues

Red column indicates LUSC tissues, and green column indicates lung para-tumorous tissue (pT). The X axis indicates tissue types. The Y axis represents normalized expression of lncRNAs. This figure was drawn by ggplot2 package of R language. *: P<0.05, **: P<0.01, ***: P<0.001.

Figure 3

ROC curves of the top 10 lncRNAs sorted by AUC in LUSC

Red represents sensitive curve, green indicates identify line. The X axis shows false positive rate, presented as “1-Specificity”. The Y axis indicates true positive rate, shown as “Sensitivity”. These curves were provided by GraphPad Prism 6.

Figure 4

K-M curves of the top 10 lncRNAs in LUSC

Red line represents high level of a lncRNA, and green line represents low level. The X axis indicates overall survival time (day), and the Y axis indicates the survival rate. These curves were conducted by GraphPad Prism 6.

Table 3

Relationship between the expression of the top 10 lncRNAs and clinicopathological factors in LUSC from TCGA

LncRNA\factorDimension (small/large)Smoking (no/yes)T (T1/2 vs. T3/4)N (no/yes)M (no/yes)Pathological stage (I/II vs III/IV)Targeted molecular therapy (no/yes)
tPtPtPtPtPtPtP
SFTA1P-2.2360.026-1.0970.2731.6810.093-2.6700.0081.1820.2380.0200.984-2.5420.011
LINC00968-2.7520.006-2.5490.0111.1380.256-0.2690.7880.9500.3430.9890.323-2.9100.044
LINC00961-3.1690.002-1.8060.0721.9030.0581.6350.1030.4160.678-0.5530.581-0.2090.835
LINC015722.4080.0162.4330.015-0.0960.9243.0120.0031.9590.051-2.7170.0072.1230.034
RP1-78O14.1-3.597<0.0011.0200.3080.0870.930-2.2500.0250.6440.5201.1370.246-2.6340.009
FENDRR-1.0580.290-1.9910.0471.8120.071-0.5880.5360.6030.5471.1330.258-1.4970.135
LINC01314-1.0360.301-0.2010.8412.0660.039-3.880<0.0010.4930.6231.9910.047-2.3350.020
LINC01272-3.3330.0010.0700.994-0.6720.502-1.1890.2351.4300.1530.1310.896-1.3670.172
GATA6-AS10.4240.6720.9960.3200.3430.732-0.6230.534-0.3360.7370.7610.447-1.7160.087
MIR3945HG-1.7300.0841.1610.246-0.1180.907-1.5800.115-0.5170.6051.3710.171-1.8690.062

T: tumor stage; N: lymph node; M: metastasis

Figure 5

Association between the expression of key lncRNAs and clinicopathological features in LUSC

Statistical significance differences of several key lncRNAs were noted in various clinicopathological features: tumor stage (T1/T2 vs. T3/T4), lymph node metastasis (no vs. yes), pathological stage (I/II vs. III/IV), smoking status (no smoking vs. current smoking), targeted molecular therapy (no vs. yes). The X axis indicates different lncRNAs, and the Y axis indicates the normalized expression (log2). The plots were conducted by ggplot2 package of R language. *: P<0.05, **: P<0.01, ***: P<0.001.

Figure 6

Correlation between FGFR1 expression and lncRNAs in LUSC

The expression of these lncRNAs were positively correlated with FGFR1 expression based on TCGA dataset.

FC: fold change; AUC: area under the curve CI: confidence interval

Different expression of the top 10 lncRNAs between LUSC and para-tumorous lung tissues

Red column indicates LUSC tissues, and green column indicates lung para-tumorous tissue (pT). The X axis indicates tissue types. The Y axis represents normalized expression of lncRNAs. This figure was drawn by ggplot2 package of R language. *: P<0.05, **: P<0.01, ***: P<0.001.

ROC curves of the top 10 lncRNAs sorted by AUC in LUSC

Red represents sensitive curve, green indicates identify line. The X axis shows false positive rate, presented as “1-Specificity”. The Y axis indicates true positive rate, shown as “Sensitivity”. These curves were provided by GraphPad Prism 6.

K-M curves of the top 10 lncRNAs in LUSC

Red line represents high level of a lncRNA, and green line represents low level. The X axis indicates overall survival time (day), and the Y axis indicates the survival rate. These curves were conducted by GraphPad Prism 6. T: tumor stage; N: lymph node; M: metastasis

Association between the expression of key lncRNAs and clinicopathological features in LUSC

Statistical significance differences of several key lncRNAs were noted in various clinicopathological features: tumor stage (T1/T2 vs. T3/T4), lymph node metastasis (no vs. yes), pathological stage (I/II vs. III/IV), smoking status (no smoking vs. current smoking), targeted molecular therapy (no vs. yes). The X axis indicates different lncRNAs, and the Y axis indicates the normalized expression (log2). The plots were conducted by ggplot2 package of R language. *: P<0.05, **: P<0.01, ***: P<0.001.

Correlation between FGFR1 expression and lncRNAs in LUSC

The expression of these lncRNAs were positively correlated with FGFR1 expression based on TCGA dataset.

Potential molecular mechanism of the top 10 aberrantly expressed lncRNAs in LUSC

The co-expressed genes of all these ten key lncRNAs were determined by the WGCNA. As a result, 120 genes were revealed to be co-expressed with SFTA1P, and 47 genes were discovered to have co-expressed relationship with LINC01272, as well as the other key lncRNAs (46 genes for RP1-78O14.1, 18 for LINC00968, 8 for LINC00961, 4 for LINC01314, and 2 for GATA6-AS1 and 1 for MIR3945HG). Whereas the WGCNA showed no gene being co-expressed with FENDRR or LINC01572 (Figure 7).
Figure 7

Prospective gene networks of the 10 top differentially expressed lncRNAs

To explore the regulation network of the key lncRNAs, the co-expressed genes of those key down-regulated lncRNAs were screened out by WGCNA. Red diamonds showed the key lncRNAs and blue balls are for key lncRNAs co-expressed mRNAs.

Prospective gene networks of the 10 top differentially expressed lncRNAs

To explore the regulation network of the key lncRNAs, the co-expressed genes of those key down-regulated lncRNAs were screened out by WGCNA. Red diamonds showed the key lncRNAs and blue balls are for key lncRNAs co-expressed mRNAs. The OncoPrint from cBioPortal showed that 14% (69/501) cases with genetic alterations could be obtained (Figure 8A), except RP1-78O14.1, whose data were not available in cBioPortal. And only SFTA1P, LINC00968, LINC00961, and FENDRR had genetic alterations, including amplification, deep deletion and mRNA upregulation. More interestingly, the cases with genetic alterations had a poorer survival as compared to those without alterations (P=0.0359, Figure 8B). CBioPortal also provided the probable co-occurrence of these top 10 lncRNAs. As Table 4 showed, there was a tendency towards co-occurrence between SFTA1P and LINC00961 in LUSC.
Figure 8

The genetic alterations and their prognostic value of the lncRNAs in LUSC

(A) Genetic alterations. Red represents amplification, blue represents deep deletion and pink represents mRNA up-regulation. Genetic alterations were found in 69 of 501 LUSC patients (14%). The aberrant expression threshold was defined as z-score ± 2.0 from the TCGA RNA Seq V2 data. This OncoPrint was conducted by cBioPortal. (B) K-M curve between groups with alterations and without alterations. Red line represents cases with alterations, and blue line represents cases without. The X axis indicates overall survival time (days), and the Y axis indicates the survival rate. Kaplan-Meier test was performed. These curves were generated by cBioPortal.

Table 4

Results of mutual exclusivity and co-occurrence analysis by cBioPortal.

Gene AGene BP-valueLog odds ratioAssociation
SFTA1PLINC009680.515821057-InfinityTendency towards mutual exclusivity
SFTA1PLINC009610.047459771.2549926238226372Tendency towards co-occurrence(Significant)
SFTA1PLINC015721InfinityTendency towards co-occurrence
SFTA1PFENDRR0.96007984-InfinityTendency towards mutual exclusivity
SFTA1PLINC013141InfinityTendency towards co-occurrence
SFTA1PLINC012721InfinityTendency towards co-occurrence
SFTA1PGATA6-AS11InfinityTendency towards co-occurrence
SFTA1PMIR3945HG1InfinityTendency towards co-occurrence
LINC00968LINC009610.297586666-InfinityTendency towards mutual exclusivity
LINC00968LINC015721InfinityTendency towards co-occurrence
LINC00968FENDRR0.968063872-InfinityTendency towards mutual exclusivity
LINC00968LINC013141InfinityTendency towards co-occurrence
LINC00968LINC012721InfinityTendency towards co-occurrence
LINC00968GATA6-AS11InfinityTendency towards co-occurrence
LINC00968MIR3945HG1InfinityTendency towards co-occurrence
LINC00961LINC015721InfinityTendency towards co-occurrence
LINC00961FENDRR0.928143713-InfinityTendency towards mutual exclusivity
LINC00961LINC013141InfinityTendency towards co-occurrence
LINC00961LINC012721InfinityTendency towards co-occurrence
LINC00961GATA6-AS11InfinityTendency towards co-occurrence
LINC00961MIR3945HG1InfinityTendency towards co-occurrence
LINC01572FENDRR1InfinityTendency towards co-occurrence
LINC01572LINC013141InfinityTendency towards co-occurrence
LINC01572LINC012721InfinityTendency towards co-occurrence
LINC01572GATA6-AS11InfinityTendency towards co-occurrence
LINC01572MIR3945HG1InfinityTendency towards co-occurrence
FENDRRLINC013141InfinityTendency towards co-occurrence
FENDRRLINC012721InfinityTendency towards co-occurrence
FENDRRGATA6-AS11InfinityTendency towards co-occurrence
FENDRRMIR3945HG1InfinityTendency towards co-occurrence
LINC01314LINC012721InfinityTendency towards co-occurrence
LINC01314GATA6-AS11InfinityTendency towards co-occurrence
LINC01314MIR3945HG1InfinityTendency towards co-occurrence
LINC01272GATA6-AS11InfinityTendency towards co-occurrence
LINC01272MIR3945HG1InfinityTendency towards co-occurrence
GATA6-AS1MIR3945HG1InfinityTendency towards co-occurrence

The query contains 5 gene pairs with mutually exclusive alterations (none significant), and 31 gene pairs with co-occurrent alterations (1 significant).

Log odds ratio > 0: Association towards co-occurrence

Log odds ratio <= 0: Association towards mutual exclusivity

P-value < 0.05: Significant association

P-value: Derived from Fisher Exact Test

Log odds ratio: Quantifies how strongly the presence or absence of alterations in gene A are associated with the presence or absence of alterations in gene B in the selected tumors

The genetic alterations and their prognostic value of the lncRNAs in LUSC

(A) Genetic alterations. Red represents amplification, blue represents deep deletion and pink represents mRNA up-regulation. Genetic alterations were found in 69 of 501 LUSC patients (14%). The aberrant expression threshold was defined as z-score ± 2.0 from the TCGA RNA Seq V2 data. This OncoPrint was conducted by cBioPortal. (B) K-M curve between groups with alterations and without alterations. Red line represents cases with alterations, and blue line represents cases without. The X axis indicates overall survival time (days), and the Y axis indicates the survival rate. Kaplan-Meier test was performed. These curves were generated by cBioPortal. The query contains 5 gene pairs with mutually exclusive alterations (none significant), and 31 gene pairs with co-occurrent alterations (1 significant). Log odds ratio > 0: Association towards co-occurrence Log odds ratio <= 0: Association towards mutual exclusivity P-value < 0.05: Significant association P-value: Derived from Fisher Exact Test Log odds ratio: Quantifies how strongly the presence or absence of alterations in gene A are associated with the presence or absence of alterations in gene B in the selected tumors As a result, the STA1P co-expressed genes were most enriched in lysosome and LINC01272 co-expressed genes were most significantly involved in integral component of membrane. Meanwhile, the most enriched GO terms for mRNAs co-expressed with RP1-78O14.1 was actomyosin structure organization. The result was shown in Table 5. Additionally, we also analyzed the most enriched GO terms within all the mRNAs co-expressed with these lncRNAs. Consequently, plasma membrane was revealed to be the most GO terms and the result was showed in Table 6.
Table 5

Significant GO terms based the co-expressed genes with each lncRNA

CategoryTermCount%P-valueFold enrichmentBonferroniBenjaminiFDR
SFTA1P
GOTERM_CC_DIRECTGO:0005764∼lysosome67.897.53E-048.200.060.060.81
GOTERM_CC_DIRECTGO:0005886∼plasma membrane2431.580.0026271.800.210.112.81
GOTERM_CC_DIRECTGO:0031225∼anchored component of membrane45.260.00559110.930.390.155.90
GOTERM_CC_DIRECTGO:0016021∼integral component of membrane2532.890.0222221.500.860.3921.65
GOTERM_MF_DIRECTGO:0009055∼electron carrier activity33.950.03026810.820.990.9930.54
GOTERM_BP_DIRECTGO:0016337∼single organismal cell-cell adhesion33.950.0377369.591.001.0040.73
GOTERM_BP_DIRECTGO:0045730∼respiratory burst22.630.03878549.681.001.0041.61
GOTERM_CC_DIRECTGO:0043197∼dendritic spine33.950.0403919.270.970.5236.08
GOTERM_MF_DIRECTGO:0052890∼oxidoreductase activity, acting on the CH-CH group of donors, with a flavin as acceptor22.630.04438943.281.000.9641.63
GOTERM_BP_DIRECTGO:0043149∼stress fiber assembly22.630.0446243.061.000.9946.25
GOTERM_MF_DIRECTGO:0003995∼acyl-CoA dehydrogenase activity22.630.04727940.581.000.9043.69
GOTERM_BP_DIRECTGO:0033539∼fatty acid beta-oxidation using acyl-CoA dehydrogenase22.630.05330635.881.000.9952.53
GOTERM_BP_DIRECTGO:0019370∼leukotriene biosynthetic process22.630.05905532.291.000.9856.30
GOTERM_CC_DIRECTGO:0031674∼I band22.630.07073526.861.000.6654.90
GOTERM_BP_DIRECTGO:0046686∼response to cadmium ion22.630.07327625.831.000.9964.47
GOTERM_MF_DIRECTGO:0004857∼enzyme inhibitor activity22.630.08684521.641.000.9665.95
GOTERM_MF_DIRECTGO:0000062∼fatty-acyl-CoA binding22.630.08684521.641.000.9665.95
LINC01272
GOTERM_CC_DIRECTGO:0016021∼integral component of membrane2457.140.0001052.070.010.010.10
GOTERM_BP_DIRECTGO:0050900∼leukocyte migration511.900.00013118.600.030.030.17
GOTERM_CC_DIRECTGO:0005886∼plasma membrane2150.000.0001342.270.010.000.13
GOTERM_BP_DIRECTGO:0050776∼regulation of immune response511.900.00055212.750.110.060.70
GOTERM_CC_DIRECTGO:0005887∼integral component of plasma membrane1023.810.0030223.140.160.052.94
GOTERM_BP_DIRECTGO:0007165∼signal transduction819.050.0105453.130.900.5412.62
GOTERM_BP_DIRECTGO:0007169∼transmembrane receptor protein tyrosine kinase signaling pathway37.140.01796114.180.980.6320.60
GOTERM_MF_DIRECTGO:0005164∼tumor necrosis factor receptor binding24.760.05846132.341.001.0048.36
GOTERM_BP_DIRECTGO:0002376∼immune system process24.760.06039131.301.000.9454.75
GOTERM_BP_DIRECTGO:0045087∼innate immune response49.520.0639234.221.000.9156.87
GOTERM_BP_DIRECTGO:0001525∼angiogenesis37.140.0824176.111.000.9366.54
RP1-78O14.1
GOTERM_BP_DIRECTGO:0031032∼actomyosin structure organization28.330.01913195.680.790.7918.65
GOTERM_BP_DIRECTGO:0006821∼chloride transport28.330.02822364.580.900.6926.35
GOTERM_MF_DIRECTGO:0008092∼cytoskeletal protein binding28.330.03909546.890.840.8431.40
GOTERM_CC_DIRECTGO:0019898∼extrinsic component of membrane28.330.06543327.780.870.8743.81
GOTERM_MF_DIRECTGO:0005200∼structural constituent of cytoskeleton28.330.08749420.460.990.8857.91
Table 6

Significant GO terms based the all the mRNAs co-expressed with lncRNAs

CategoryTermCount%P-ValueFold enrichmentBonferroniBenjaminiFDR
GOTERM_CC_DIRECTGO:0005886∼plasma membrane4534.351151.82E-051.825690.0021290.0021290.020817
GOTERM_CC_DIRECTGO:0016021∼integral component of membrane5239.694662.27E-051.6839080.0026570.0013290.025982
GOTERM_BP_DIRECTGO:0050900∼leukocyte migration64.5801530.0007058.4268990.3090010.3090011.017256
GOTERM_BP_DIRECTGO:0045730∼respiratory burst32.2900760.00247139.54160.7265450.477073.523245
GOTERM_CC_DIRECTGO:0005887∼integral component of plasma membrane1813.740460.0039282.1268320.3690210.1422944.397623
GOTERM_BP_DIRECTGO:0001525∼angiogenesis64.5801530.009484.6102320.9932010.81055912.89588
GOTERM_BP_DIRECTGO:0031032∼actomyosin structure organization32.2900760.01055219.038550.9961470.75085314.25337
GOTERM_CC_DIRECTGO:0005764∼lysosome64.5801530.0111474.4387430.7305760.27954212.022
GOTERM_MF_DIRECTGO:0009055∼electron carrier activity43.0534350.0138577.8975440.9548530.95485316.29635
GOTERM_MF_DIRECTGO:0005102∼receptor binding75.3435110.0139983.5236920.9562570.79085116.44801
GOTERM_BP_DIRECTGO:0050776∼regulation of immune response53.8167940.0197274.8131160.9999710.87607525.08463
GOTERM_BP_DIRECTGO:0008277∼regulation of G-protein coupled receptor protein signaling pathway32.2900760.02130213.180530.9999870.84748926.8108
GOTERM_CC_DIRECTGO:0072557∼IPAF inflammasome complex21.5267180.02928566.877060.9691170.5011828.79674
GOTERM_CC_DIRECTGO:0031225∼anchored component of membrane43.0534350.0296295.9183240.9703710.44372929.08439
GOTERM_MF_DIRECTGO:0004046∼aminoacylase activity21.5267180.03295359.231580.9994120.91622734.7645
GOTERM_MF_DIRECTGO:0001665∼alpha-N-acetylgalactosaminide alpha-2,6-sialyltransferase activity21.5267180.03295359.231580.9994120.91622734.7645
GOTERM_BP_DIRECTGO:0007165∼signal transduction139.9236640.0350751.91861310.93093840.40209
GOTERM_CC_DIRECTGO:0031256∼leading edge membrane21.5267180.04644741.798170.9961690.54838941.9262
GOTERM_BP_DIRECTGO:0046470∼phosphatidylcholine metabolic process21.5267180.05630234.2693910.97753156.82828
GOTERM_BP_DIRECTGO:0032868∼response to insulin32.2900760.0573157.67225110.96782157.49532
GOTERM_MF_DIRECTGO:0016811∼hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides21.5267180.05959232.308130.9999990.96695954.30836
GOTERM_BP_DIRECTGO:0009312∼oligosaccharide biosynthetic process21.5267180.06175631.1539910.96457260.3076
GOTERM_BP_DIRECTGO:0045444∼fat cell differentiation32.2900760.0666427.04165510.96257263.20096
GOTERM_BP_DIRECTGO:0007171∼activation of transmembrane receptor protein tyrosine kinase activity21.5267180.0671828.5578210.95200763.50671
GOTERM_CC_DIRECTGO:0005856∼cytoskeleton64.5801530.0697662.7039240.9997890.65273556.23642
GOTERM_BP_DIRECTGO:0007166∼cell surface receptor signaling pathway53.8167940.074313.12676910.95550767.34839
GOTERM_MF_DIRECTGO:0052890∼oxidoreductase activity, acting on the CH-CH group of donors, with a flavin as acceptor21.5267180.08037923.6926310.97577765.63741
GOTERM_BP_DIRECTGO:0070374∼positive regulation of ERK1 and ERK2 cascade43.0534350.0808123.91650110.95731670.51973
GOTERM_CC_DIRECTGO:0005925∼focal adhesion64.5801530.0830082.5656160.999960.67584462.84951
GOTERM_BP_DIRECTGO:0043149∼stress fiber assembly21.5267180.08326422.8462610.9520271.63928
GOTERM_MF_DIRECTGO:0003995∼acyl-CoA dehydrogenase activity21.5267180.08550522.2118410.9633868.00047
GOTERM_BP_DIRECTGO:0033539∼fatty acid beta-oxidation using acyl-CoA dehydrogenase21.5267180.09907419.0385510.96718677.96049
GOTERM_BP_DIRECTGO:0001574∼ganglioside biosynthetic process21.5267180.09907419.0385510.96718677.96049

Validation of the expression and ROC of the eight lncRNAs with GEO data

One study was screened out from GEO datasets (GSE30219). The expression level of eight key lncRNAs, SFTA1P, LINC00968, LINC00961, RP1-78O14.1, FENDRR, LINC01314 and LINC01272, could be extracted from the dataset, among which the remarkably lower expression of SFTA1P, LINC00968, LINC00961, RP1-78O14.1, FENDRR, LINC01314 and LINC01272 could be observed, while predominantly higher expression of GATA6-AS1 was found in LUSC tissues (Table 7). The ROC curves of eight lncRNAs all indicated favorable diagnostic value of LUSC (Figure 9).
Table 7

Validation of expression and diagnostic value of eight lncRNAs in LUSC based on GEO dataset (GSE30219)

VariablepTLUSCT-testROC
nMeanSDnMeanSDtPAUCSE95% CIP
FENDRR145.2149150.663845824.2950790.1883727.254<0.00010.9220.04370.850 - 0.967<0.0001
GATA6-AS1145.8463850.939914825.97200013.297005.972<0.00010.9030.06130.826 - 0.954<0.0001
LINC00961146.2858010.370772825.6729970.2556157.722<0.00010.9000.05550.822 - 0.952<0.0001
LINC00968146.8245951.210060823.5566480.4496969.988<0.00010.9950.00460.952 - 1.000<0.0001
LINC01272144.6936690.253514824.3517410.2860884.574<0.00010.8170.06190.725 - 0.889<0.0001
LINC01314144.7015640.272653824.4859060.1555802.8810.01200.7530.09180.655 - 0.8360.0058
RP1-78O14.1145.1663601.060565823.3471130.3988676.342<0.00010.8630.08830.778 - 0.925<0.0001
SFTA1P147.9481371.428409825.1202260.7150067.254<0.00010.9170.05610.843 - 0.964<0.0001

pT: para-noncancerous tissue; LUSC: lung squamous cell carcinoma

Figure 9

Validation of ROC results of eight lncRNAs in LUSC based on GEO dataset

Blue represents sensitive curve, red indicates identify line. The X axis shows false positive rate, presented as “100%- Specificity%”. The Y axis indicates true positive rate, shown as “Sensitivity”. These curves were performed by GraphPad Prism 6.

pT: para-noncancerous tissue; LUSC: lung squamous cell carcinoma

Validation of ROC results of eight lncRNAs in LUSC based on GEO dataset

Blue represents sensitive curve, red indicates identify line. The X axis shows false positive rate, presented as “100%- Specificity%”. The Y axis indicates true positive rate, shown as “Sensitivity”. These curves were performed by GraphPad Prism 6.

Validation based on clinical samples of LUSC

We performed real time RT-qPCR to confirm the expression of LINC00968 and FENDRR in the 12 paired clinical samples. In these patients, the mean expression level of LINC00968 was notably lower in LUSC tissues (0.3343±0.08582) than that of non-cancerous lung tissues (0.8258±0.1469; P=0.0085, Figure 10A). Moreover, the AUC of LINC00968 was 0.778 (P=0.0021, Figure 10B). However, there was no significant correlation between LINC00968 and the tumorigeneses of LUSC (P=0.508, Figure 10C). Meanwhile, the expression trend of FENDRR was similar to that of LINC00968 (P=0.0015, Figure 10D). The AUC of FENDRR is 0.882 (P=0.0015, Figure 10E). And we also assessed the relationship between FENDRR and the tumorigeneses of LUSC (P=0.031, Figure 10F).
Figure 10

Validation of LINC00968 and FENDRR based on 12 paired clinical samples of LUSC

(A) The expression of LINC00968 between para-tumorous lung tissues (pT) and LUSC (RT-qPCR); (B) ROC curve of LINC00968; (C) The correlation of LINC00968 between para-tumorous lung tissues (pT) and LUSC; (D) The expression of FENDRR between para-tumorous lung tissues (pT) and LUSC (RT-qPCR); (E) ROC curve of FENDRR; (F) The correlation of FENDRR between para-tumorous lung tissues (pT) and LUSC. pT: para-noncancerous tissues.

Validation of LINC00968 and FENDRR based on 12 paired clinical samples of LUSC

(A) The expression of LINC00968 between para-tumorous lung tissues (pT) and LUSC (RT-qPCR); (B) ROC curve of LINC00968; (C) The correlation of LINC00968 between para-tumorous lung tissues (pT) and LUSC; (D) The expression of FENDRR between para-tumorous lung tissues (pT) and LUSC (RT-qPCR); (E) ROC curve of FENDRR; (F) The correlation of FENDRR between para-tumorous lung tissues (pT) and LUSC. pT: para-noncancerous tissues.

Further analysis for the key lncRNAs expression in 22 types of cancers based on TCGA

Based on the results derived from GEPIA, down-regulation of SFTA1P was found in the lung adenocarcinoma (LUAD) and rectal adenocarcinoma (READ), while the expression of SFTA1P was significantly up-regulated in clear cell kidney carcinoma (KIRC). As shown in the figures, the consistent results were found in breast cancer (BRCA), LUAD and thymoma (THYM), revealing that LINC00968 level was significant lower in these cancers compared with para-noncancerous tissues. consistent with the result in LUSC, the lower expression of LINC00961 was demonstrated in BRCA, kidney chromophobe (KICH), kidney renal papillary cell carcinoma (KIRP) and LUAD. Additionally, lower RP1-78O14.1 expression was also revealed in several types of cancers including cervical squamous cell carcinoma (CESC), KIRC, KIRP and LUAD. Moreover, the significance of FENDRR down-regulation was reached in the bladder urothelial carcinoma (BLCA), colon adenocarcinoma (COAD), LUAD, Prostate adenocarcinoma (PRAD) and READ. Meanwhile, the result also showed the down-regulation of LINC01314 in cholangiocarcinoma (CHOL), esophageal carcinoma (ESCA), KICH, KIRC, KIRP, LUAD and pheochromocytoma and paraganglioma (PCPG), together with the up-regulation in the thyroid carcinoma (THCA). Interestingly, though lower expression of LINC01272 was found in LUAD, the result revealed a significant trend of up-regulation for LINC01272 in CESC, COAD, ESCA, KIRC, KIRP, READ, stomach adenocarcinoma(STAD) and uterine corpus endometrial carcinoma(UCEC). In the support of the result, GATA6-AS1 might act as a tumor suppressor in the several cancers including BLCA, CESC, ESCA, LUAD, pheochromocytoma and paraganglioma (PCPG) and UCEC. Nevertheless, MIR3945HG was only significantly lower in LUAD and there was no significant difference of LINC01572 expression between cancer tissues and para-noncancerous tissues among these 22 cancer types. All the details were presented in the Figure 11, which were derived from GEPIA.
Figure 11

Comparisons of lncRNAs expression between cancer tissues and non-cancerous tissues among 22 types of cancers involved in TCGA based on GEPIA

(A) SFTA1P; (B) LINC00968; (C) LINC00961; (D) LINC01572; (E) RP1-78O14.1; (F) FENDRR (G) LINC01314; (H) LINC01272; (I) GATA6-AS1; (J) MIR3945HG. Y axis indicates the log2 (TPM + 1) for lncRNA expression. Green bar shows the tumor tissues and red bas indicates the non-cancerous tissues. These figures were derived from GEPIA. *: P<0.05. TPM: Transcripts per Kilobase Million.

Comparisons of lncRNAs expression between cancer tissues and non-cancerous tissues among 22 types of cancers involved in TCGA based on GEPIA

(A) SFTA1P; (B) LINC00968; (C) LINC00961; (D) LINC01572; (E) RP1-78O14.1; (F) FENDRR (G) LINC01314; (H) LINC01272; (I) GATA6-AS1; (J) MIR3945HG. Y axis indicates the log2 (TPM + 1) for lncRNA expression. Green bar shows the tumor tissues and red bas indicates the non-cancerous tissues. These figures were derived from GEPIA. *: P<0.05. TPM: Transcripts per Kilobase Million.

DISCUSSION

There are marked variances in the aberrant gene profiling and molecular characteristics between LUAD and LUSC, which result in the altered therapeutic regimens administered to the two NSCLC subtypes [24-29]. Development in molecular biology has extended our awareness in decoding a wide scale of genomic unevenness that gradually leads normal lung cells to a cancerous state. In LUAD patients, EGFR-activating somatic mutations in exons 18/19/20/21 modify the sensitivity (namely exon 21 L858R, exon 19 deletion) or resistance (namely exon 20 T790M and/or insertion) to tyrosine kinase inhibitor (TKI) mediated targeted therapeutic strategies. However, as the second most frequent subtype in NSCLC, the treatment possibilities for LUSC remain very inadequate. In the current study, we focused on the aberrantly expressed lncRNAs in LUSC based on TCGA RNA-seq data. Ten lncRNAs with the highest diagnostic value (SFTA1P, LINC00968, LINC00961, LINC01572, RP1-78O14.1, FENDRR, LINC01314, LINC01272, GATA6-AS1, and MIR3945HG) were selected for further investigation of their clinical roles in LUSC. Furthermore, these lncRNAs could play essential roles in LUSC via lncRNA-mRNA networks, as well as genetic alterations, including amplification, deep deletion and mRNA upregulation. EGFR mutations are extremely rare (<5%) in LUSC [30]; nonetheless, other genetic alterations, like overexpression and gene amplification are much common in LUSC, which play pivotal roles in the biological process and disease development of LUSC [31]. This could be explained by the use of cetuximab in the FLEX phase III studies [32], and necitumumab in the SQUIRE study [33, 34]. Except the recently approved molecular target drug nivolumab [35-39], there have been no other recommendations specifically for LUSC as approved by US Food and Drug Administration. The recent molecular advances in lncRNAs could open up a new research area for the clinical setting of LUSC. Single lncRNA in LUSC has been studied by some groups [40-43]; however, the studies based on high throughput RNA-seq data have been rarely reported. Most recently, Liu et al [22] investigated the altered lncRNAs between LUSC and LUAD. CBioPortal was used to examine lncRNA alteration frequencies, as well as the capacity to evaluate overall survival from TCGA database. In LUSC, 624 lncRNAs were observed to gain alteration rates > 1% and 64 > 10%. Two lncRNAs, including IGF2BP2-AS1 and DGCR5 were related to better overall survival in LUSC. This study [22] focused on the genetic alteration of lncRNAs in LUSC. Similarly, Wei et al [23] also compared the lncRNA transcriptional fingerprints between LUSC and LUAD based on transcriptome analysis with TCGA and GEO. They found that there were 117 dysregulated lncRNAs in LUSC, including 56 up-regulated and 61 down-regulated lncRNAs. Among our top 10 lncRNAs, only LINC00968 was mentioned in the 117 dysregulated lncRNAs identified by Wei et al [23]. Only 16 cases of paired LUSC tissue samples were examined in the study of Wei et al [23], and this could partially explained the distinction of aberrantly expressed lncRNAs found between Wei et al [23] and our current study. The top 10 lncRNAs (SFTA1P, LINC00968, LINC00961, LINC01572, RP1-78O14.1, FENDRR, LINC01314, LINC01272, GATA6-AS1, and MIR3945HG) had extremely high diagnostic values for LUSC, since the AUCs were all over 0.99. The differential expression levels and diagnostic potency of eight among these 10 lncRNAs could also be confirmed with independent data from GEO, which further supports the findings based on TCGA. We also performed real time RT-qPCR to verify the expression level of two lncRNAs (LINC00968 and FENDRR) with clinical sample in house. Besides, some lncRNAs may also play vital parts in the survival and progression in LUSC, which make them potential novel master regulators for LUSC. Some of these lncRNAs have been reported in other diseases. Among these 10 top aberrantly expressed lncRNAs, only the role and function of FENDRR have been well documented by several studies. FENDRR was first identified as a tissue-specific lncRNA, which was a crucial modulator of the growth of heart and body wall in mice [44]. FENDRR can bind to Proteasome component 2 (PRC2) and TrxG/MLL complexes to act as a regulator of chromatin signatures that define relevant gene activity [44]. Molecular data also suggests that FENDRR plays important part at target regulatory elements via dsDNA/RNA triplex formation, and thus directly raises PRC2 residence at these sites. FENDRR can connect epigenetic mechanisms with gene regulatory networks in embryogenesis in the mouse [45]. Furthermore, multiple knockout mouse models also unveil that FENDRR is requisite for life and brain development [46]. The clinical role and molecular mechanism of FENDRR in cancers also received much attention [47]. Decreased expression of FENDRR in infantile hemangioma was detected by both microarray analysis and qPCR [48]. Down-regulation of FENDRR was found in gastric cancer and moreover, FENDRR was closely related to the poor prognosis in gastric cancer. As for the mechanism, FENDRR can modulate the metastasis of gastric cancer cells via influencing fibronectin1 expression [49]. Most recently, high throughput microarray assay and quantitative reverse transcription-polymerase chain reaction (qRT-PCR) were conducted to confirm that FENDRR was significantly down-regulated in human Xuanwei lung cancer (XWLC) as compared to that in para normal lung tissues [50]. In the support of this study, we speculated that down-regulation of FENDRR might play a vital role in lung cancer based on TCGA dataset and our validation based on a small size of patients by real time RT-qPCR. SFTA1P was first mentioned by a genome-wide association (GWAS) study which investigated the susceptibility genes in the risk for dental caries. SNP rs11256676 in Phenotypes DMFS5mand of Chr. 10p14 was discovered and its function was unknown in 2013 [51]. Interestingly, SFTA1P was later reported to be predominately up-regulated in lung adenocarcinoma and one of the most remarkable enriched functions was surfactant homeostasis by array-based transcriptional survey in 2014 [52]. On the contrary, SFTA1P was found to be down-regulated in LUSC tissues in the current study, which indicates the distinct role of SFTA1P in LUAD and LUSC. Additionally, two lncRNAs, MIR3945HG V1 and MIR3945HG V2, were identified as novel candidate diagnostic markers for tuberculosis [53]. But LINC01314, LINC00968, LINC00961, LINC01572, GATA6-AS1, RP1-78O14.1 and LINC01272 are absolutely new lncRNAs, since no publications were available by far. The clinical role of these novel lncRNAs needs further verification in LUSC. The exact mechanisms of these aberrantly expressed lncRNAs in LUSC remain unknown. An emerging signature tune in the non-coding RNA world goes to the crosstalk between lncRNAs and mRNAs. We then predicted the prospective regulation of lncRNA co-expressed mRNA. Several lncRNAs might exert their functions via co-expressing with mRNA. Even none of WGCNA has been verified in LUSC, it is quite likely to perform in-depth studies to reveal the pathogenesis of LUSC based on aberrantly expressed lncRNAs. Furthermore, the genetic alterations can also regulate the function of certain lncRNA, and thus influence the clinical outcome [54-57]. The roles of lncRNA genetic alterations in LUSC have not been well established. Only several studies explored single lncRNAs and their genetic variants in lung cancer. For instance, among the advanced lung cancer patients, cases with rs3200401 CT and CT + TT genotypes in MALAT1 had clearly better prognosis than those with the MALAT1 rs3200401 CC genotype [58]. SNP rs114020893 of NEXN-AS1 at 1p31.1 might also contribute to lung cancer susceptibility [59]. In the current study, gene amplification, deep deletion and mRNA upregulation were detected in SFTA1P, LINC00968, LINC00961 and FENDRR and these genetic alterations of the lncRNAs showed a close correlation with survival of LUSC. However, the clinical potential of these genetic alterations needs to be confirmed with larger sample size and the exact mechanism of these genetic alterations also required in vitro and in vivo verification. Overall, we show a signature of aberrantly expressed lncRNAs in LUSC tissues and the top 10 of them have great clinical value to act as diagnostic biomarkers, and indicators to evaluate the survival and progression of LUSC. However, other precise detecting methods, like real time RT-qPCR or FISH are required to validate the diagnostic potentials of these novel lncRNAs. Also, more in-depth experiments are necessary to explore the underlying mechanism of these lncRNAs in LUSC.

MATERIALS AND METHODS

TCGA dataset of LUSC

High throughput data of RNA-Seq diagnosed with LUSC were downloaded from TCGA on November 9, 2016 [22, 23, 60]. These RNA-seq data from Illumina HiSeq RNASeq platform included 504 LUSC and 49 adjacent non-cancerous lung tissues. Since the TCGA data were a community resource project, additional approval by the ethics committee of our hospital was not mandatory. Also, the present study adhered to the TCGA publication guidelines and data access policies.

Exploration of the aberrantly expressed lncRNAs in LUSC

The RNA-Seq data of LUSC with 60,483 mRNAs covers 7589 lncRNAs, as described by NCBI (https://www.ncbi.nlm.nih.gov/) or Ensembl (http://asia.ensembl .org/). The R language package DESeq [61, 62] was subsequently used for the calculation of aberrantly expressed lncRNAs (adjusted P<0.05 and the absolute log2 fold change >2), respectively. The lncRNAs of which expression was less than 1 in more than 10% of samples were excluded and the expression level of each lncRNA was log2 transformed for the downstream analysis.

Clinical role of the top 10 aberrantly expressed lncRNAs in LUSC

The receiver operating characteristic (ROC) curve was used to assess the diagnostic effectiveness of all aberrantly expressed lncRNAs in LUSC and the top 10 were then selected for further evaluation. All expression data were presented as the mean ± standard deviation (SD). The different expression levels of the top 10 aberrantly expressed lncRNAs between LUSC and non-cancerous lung tissues, as well as between different clinical groups were assessed by Student’s t test. Pearson correlation test (SPSS Inc., Chicago, IL, USA) was performed for the relationship between FGFR1 and each lncRNA in LUSC. The prognostic roles of these lncRNAs were examined with the Kaplan–Meier method, and the log-rank test was conducted to contradistinguish survival time. The univariate and multivariate cox analyses of these lncRNAs were also performed. A P-value < 0.05 represented statistical significance. The statistical analyses were all carried out by SPSS 22.0. To explore the regulation network of the key lncRNAs, the co-expressed genes of those key lncRNAs were screened out by weighted gene co-expression network analysis (WGCNA) [63-65]. Finally, the lncRNA co-expression network was established based on WGCNA and finally visualized by Cytoscape software. Additionally, we also performed the GO analyses for the co-expression genes for six lncRNAs based on the Database for Annotation, Visualization and Integrated Discovery (DAVID, https://david.ncifcrf.gov/). It could be assumed that the elevated expression of these lncRNAs in LUSC could be caused by genetic alterations, including amplification, deletion, or point mutations. Consequently, cBioPortal was used to summarize the possible genetic alterations for these the top 10 aberrantly expressed lncRNAs in LUSC, which were presented as OncoPrint. The clinical values of the genetic alterations were also evaluated.

Validation of the aberrant expression and clinical value of lncRNAs in LUSC based on GEO datasets

Data from Gene Expression Omnibus database (GEO, http://www.ncbi.nlm.nih.gov/geo) was used to validate the results from TCGA. Search strategy was as following: (cancer OR carcinoma OR squamous cell carcinoma OR SqCC OR SCC OR tumor OR tumor OR malignanc* OR neoplas*) AND (lung OR pulmonary OR respiratory OR respiration OR aspiration OR bronchi OR bronchioles OR alveoli OR pneumocytes OR “air way”). We only retained the original study that analyzed gene expression profiling between human LUSC tissues and normal control tissues. Independent sample T-test (SPSS 22.0 Inc., Chicago, IL, USA) was used for the statistical analysis of the differentially expressed level of these lncRNAs between LUSC and para-carcinoma lung tissues. The ROC curve analysis was used to validate the diagnostic value of the lncRNAs for LUSC patients based on GEO dataset. To further verify the data from TCGA and GEO, we conducted real time RT-qPCR to detect the level of lncRNA LINC00968 and FENDRR with clinical LUSC samples (n=12) from the First Affiliated Hospital of Guangxi Medical University as previously reported [66-69]. The Ethical Committee of First Affiliated Hospital of Guangxi Medical University, China approved the present study. All participating patients provided informed consent and agreement for the research use of the clinical samples. GAPDH was used as internal reference with the primers as follows: Forward-5’-GCTCTCTGCTCCTCCTGTTC-3’, Reverse-5’-ACGACCAAATCCGTTGACTC-3’. The primers were listed as follows: LINC00968, Forward-5’-CCACTCCTTTAGTCGTTGTGC-3’; Reverse-5’- GGTCCCTCATTCCTATCCC-3’; FENDRR, Forward-5’- TAAAATTGCAGATCCTCCG-3’; Reverse-5’-AACGTTCGCATTGGTTTAGC-3’. Paired-samples t test was performed to compare the difference of lncRNAs between LUSC and non-cancerous lung tissues with SPSS 22.0. ROC curves were used to assess the effect of lncRNAs to discriminate the LUSC from non-cancerous lung tissue.

Analysis for the expression pattern of the lncRNAs in all tumors involved in TCGA based on GEPIA

We also showed the expression levels of the lncRNAs between cancer tissues and para-noncancerous tissues with the assistance of GEPIA (http://gepia.cancer-pku.cn), which could analyze the RNA sequencing expression data of 23 types of cancers and normal samples from the TCGA according to the standard processing pipeline.
  69 in total

1.  GWAS of dental caries patterns in the permanent dentition.

Authors:  J R Shaffer; E Feingold; X Wang; M Lee; K Tcuenco; D E Weeks; R J Weyant; R Crout; D W McNeil; M L Marazita
Journal:  J Dent Res       Date:  2012-10-11       Impact factor: 6.116

2.  Sp1 cooperates with Sp3 to upregulate MALAT1 expression in human hepatocellular carcinoma.

Authors:  Ziling Huang; Lanshan Huang; Siqiao Shen; Jia Li; Huiping Lu; Weijia Mo; Yiwu Dang; Dianzhong Luo; Gang Chen; Zhenbo Feng
Journal:  Oncol Rep       Date:  2015-09-08       Impact factor: 3.906

3.  Weight loss at the time of diagnosis is not associated with prognosis in patients with advanced-stage non-small cell lung cancer.

Authors:  Cem Sahin; Muhyettin Omar; Hasan Tunca; Serdar Kalemci; Burak Ozseker; Gulhan Akbaba; Ozgur Tanriverdi
Journal:  J BUON       Date:  2015 Nov-Dec       Impact factor: 2.533

4.  The tissue-specific lncRNA Fendrr is an essential regulator of heart and body wall development in the mouse.

Authors:  Phillip Grote; Lars Wittler; David Hendrix; Frederic Koch; Sandra Währisch; Arica Beisaw; Karol Macura; Gaby Bläss; Manolis Kellis; Martin Werber; Bernhard G Herrmann
Journal:  Dev Cell       Date:  2013-01-28       Impact factor: 12.270

5.  Monitoring PD-L1 positive circulating tumor cells in non-small cell lung cancer patients treated with the PD-1 inhibitor Nivolumab.

Authors:  Chiara Nicolazzo; Cristina Raimondi; MariaLaura Mancini; Salvatore Caponnetto; Angela Gradilone; Orietta Gandini; Maria Mastromartino; Gabriella Del Bene; Alessandra Prete; Flavia Longo; Enrico Cortesi; Paola Gazzaniga
Journal:  Sci Rep       Date:  2016-08-24       Impact factor: 4.379

6.  Microarray analysis of long noncoding RNA and mRNA expression profiles in human macrophages infected with Mycobacterium tuberculosis.

Authors:  Xiaofan Yang; Jiahui Yang; Jinli Wang; Qian Wen; Hui Wang; Jianchun He; Shengfeng Hu; Wenting He; Xialin Du; Sudong Liu; Li Ma
Journal:  Sci Rep       Date:  2016-12-14       Impact factor: 4.379

7.  Identification of differentially expressed genes between lung adenocarcinoma and lung squamous cell carcinoma by gene expression profiling.

Authors:  Chaojing Lu; Hezhong Chen; Zhengxiang Shan; Lixin Yang
Journal:  Mol Med Rep       Date:  2016-06-22       Impact factor: 2.952

8.  Integrative analyses of transcriptome sequencing identify novel functional lncRNAs in esophageal squamous cell carcinoma.

Authors:  C-Q Li; G-W Huang; Z-Y Wu; Y-J Xu; X-C Li; Y-J Xue; Y Zhu; J-M Zhao; M Li; J Zhang; J-Y Wu; F Lei; Q-Y Wang; S Li; C-P Zheng; B Ai; Z-D Tang; C-C Feng; L-D Liao; S-H Wang; J-H Shen; Y-J Liu; X-F Bai; J-Z He; H-H Cao; B-L Wu; M-R Wang; D-C Lin; H P Koeffler; L-D Wang; X Li; E-M Li; L-Y Xu
Journal:  Oncogenesis       Date:  2017-02-13       Impact factor: 7.485

9.  Comprehensive characterization of cancer subtype associated long non-coding RNAs and their clinical implications.

Authors:  Weihong Zhao; Jiancheng Luo; Shunchang Jiao
Journal:  Sci Rep       Date:  2014-10-13       Impact factor: 4.379

10.  Genetic Polymorphisms in Long Noncoding RNA H19 Are Associated With Susceptibility to Breast Cancer in Chinese Population.

Authors:  Zongjiang Xia; Rui Yan; Fujiao Duan; Chunhua Song; Peng Wang; Kaijuan Wang
Journal:  Medicine (Baltimore)       Date:  2016-02       Impact factor: 1.889

View more
  38 in total

1.  Studies on host-foodborne bacteria in intestinal three-dimensional cell culture model indicate possible mechanisms of interaction.

Authors:  Marita Gimenez Pereira; Otávio Guilherme Gonçalves de Almeida; Hevelin Regiane Augusto da Silva; Marília Harumi Ishizawa; Elaine Cristina Pereira De Martinis
Journal:  World J Microbiol Biotechnol       Date:  2021-01-18       Impact factor: 3.312

2.  SPAG5 promotes proliferation and suppresses apoptosis in bladder urothelial carcinoma by upregulating Wnt3 via activating the AKT/mTOR pathway and predicts poorer survival.

Authors:  J Y Liu; Q H Zeng; P G Cao; D Xie; F Yang; L Y He; Y B Dai; J J Li; X M Liu; H L Zeng; X J Fan; L Liu; Y X Zhu; L Gong; Y Cheng; J D Zhou; J Hu; H Bo; Z Z Xu; K Cao
Journal:  Oncogene       Date:  2018-04-17       Impact factor: 9.867

3.  An innovative systematic approach introduced the involved lncRNA-miR-mRNA network in cell cycle and proliferation after conventional treatments in breast cancer patients.

Authors:  Maryam Mohsenikia; Solmaz Khalighfard; Ali Mohammad Alizadeh; Vahid Khori; Maziar Ghandian Zanjan; Mohammadreza Zare; Ramesh Omranipour; Elham Patrad; Hengamesadat Razavi; Ziba Veisi Malekshahi; Zahra Bagheri-Hosseinabadi
Journal:  Cell Cycle       Date:  2022-05-15       Impact factor: 5.173

4.  Expression of exportin-1 in diffuse large B-cell lymphoma: immunohistochemistry and TCGA analyses.

Authors:  Bin Luo; Lanshan Huang; Yongyao Gu; Chunyao Li; Huiping Lu; Gang Chen; Zhigang Peng; Zhenbo Feng
Journal:  Int J Clin Exp Pathol       Date:  2018-12-01

5.  Long non-coding RNA GATA6-AS1 upregulates GATA6 to regulate the biological behaviors of lung adenocarcinoma cells.

Authors:  Honggang Kang; Dan Ma; Jing Zhang; Jun Zhao; Mengxiang Yang
Journal:  BMC Pulm Med       Date:  2021-05-15       Impact factor: 3.320

6.  Long non-coding RNA LINC00968 reduces cell proliferation and migration and angiogenesis in breast cancer through up-regulation of PROX1 by reducing hsa-miR-423-5p.

Authors:  Xianfu Sun; Tao Huang; Chengjuan Zhang; Shengze Zhang; Yingjie Wang; Qiang Zhang; Zhenzhen Liu
Journal:  Cell Cycle       Date:  2019-06-29       Impact factor: 5.173

Review 7.  Modes of Interaction of KMT2 Histone H3 Lysine 4 Methyltransferase/COMPASS Complexes with Chromatin.

Authors:  Agnieszka Bochyńska; Juliane Lüscher-Firzlaff; Bernhard Lüscher
Journal:  Cells       Date:  2018-03-02       Impact factor: 6.600

8.  A Combined RNA Signature Predicts Recurrence Risk of Stage I-IIIA Lung Squamous Cell Carcinoma.

Authors:  Li Sun; Juan Li; Xiaomeng Li; Xuemei Yang; Shujun Zhang; Xue Wang; Nan Wang; Kanghong Xu; Xinquan Jiang; Yi Zhang
Journal:  Front Genet       Date:  2021-06-14       Impact factor: 4.599

Review 9.  The Role of Sex-specific Long Non-coding RNAs in Cancer Prevention and Therapy.

Authors:  Hye Kyung Song; Sun Young Kim
Journal:  J Cancer Prev       Date:  2021-06-30

10.  Decreased AGO2 and DCR1 in PBMCs from War Veterans with PTSD leads to diminished miRNA resulting in elevated inflammation.

Authors:  M Bam; X Yang; E E Zumbrun; J P Ginsberg; Q Leyden; J Zhang; P S Nagarkatti; M Nagarkatti
Journal:  Transl Psychiatry       Date:  2017-08-29       Impact factor: 6.222

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.