Xi-Wen Wang1, Qi-Qiang Guo2, Yang Wei1, Kai-Ming Ren1, Fu-Shuang Zheng1, Jun Tang1, Hong-Yan Zhang1, Jun-Gang Zhao1. 1. Department of The First Thoracic Surgery, Shengjing Hospital of China Medical University, Shenyang, Liaoning 110004, P.R. China. 2. Key Laboratory of Medical Cell Biology, Ministry of Education, Institute of Translational Medicine, China Medical University, Liaoning Province Collaborative Innovation Center of Aging Related Disease Diagnosis and Treatment and Prevention, Shenyang, Liaoning 110122, P.R. China.
Abstract
The competing endogenous RNA (ceRNA) network is crucial for the development and progression of tumors, including non‑small cell lung cancer (NSCLC). However, what type of ceRNA network regulates NSCLC has not been clarified. The present study aimed to elucidate the long non‑coding RNA (lncRNA)/microRNA (miRNA)/mRNA ceRNA network in NSCLC, particularly for the significance of lncRNAs in NSCLC. NSCLC‑specific differentially expressed lncRNAs, miRNAs and mRNAs in the Cancer Genome Atlas (TCGA) were analyzed and their relationship was analyzed by a ceRNA network. Their potential functions of differentially expressed mRNAs were analyzed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). Furthermore, the expression levels of four selected lncRNAs in TCGA were determined and their associated survival of patients was examined. In addition, the expression profiles of these four lncRNAs in 48 NSCLC specimens and cell lines, their cellular distribution and associated clinical parameters were examined. We successfully constructed a ceRNA network, including 113 lncRNAs, 12 miRNAs and 36 mRNAs differentially expressed between NSCLC and non‑tumor tissues. LINC00525, MED4‑AS1, STEAP2‑AS1 and SYNPR‑AS1 lncRNAs were selected and validated for their association with the survival of NSCLC patients. The expression of these lncRNAs was upregulated in 48 NSCLC tissues and was varying in NSCLC cells. While LINC00525 was mainly expressed in the cytoplasm, MED4‑AS1 was in both the nucleus and cytoplasm of A549 cells. In addition, the expression of LINC00525 was significantly associated with smoking history (P<0.05); MED4‑AS1 was significantly associated with women, poor differentiation and lymph node metastasis (P<0.05); STEAP2‑AS1 was significantly associated with women (P<0.01); and SYNPR‑AS1 was significantly associated with women and adenocarcinoma (P<0.05). These lncRNAs may be valuable biomarkers for prognosis of NSCLC and the ceRNA network may provide new insights in the pathogenesis of NSCLC.
The competing endogenous RNA (ceRNA) network is crucial for the development and progression of tumors, including non‑small cell lung cancer (NSCLC). However, what type of ceRNA network regulates NSCLC has not been clarified. The present study aimed to elucidate the long non‑coding RNA (lncRNA)/microRNA (miRNA)/mRNA ceRNA network in NSCLC, particularly for the significance of lncRNAs in NSCLC. NSCLC‑specific differentially expressed lncRNAs, miRNAs and mRNAs in the Cancer Genome Atlas (TCGA) were analyzed and their relationship was analyzed by a ceRNA network. Their potential functions of differentially expressed mRNAs were analyzed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). Furthermore, the expression levels of four selected lncRNAs in TCGA were determined and their associated survival of patients was examined. In addition, the expression profiles of these four lncRNAs in 48 NSCLC specimens and cell lines, their cellular distribution and associated clinical parameters were examined. We successfully constructed a ceRNA network, including 113 lncRNAs, 12 miRNAs and 36 mRNAs differentially expressed between NSCLC and non‑tumor tissues. LINC00525, MED4‑AS1, STEAP2‑AS1 and SYNPR‑AS1 lncRNAs were selected and validated for their association with the survival of NSCLCpatients. The expression of these lncRNAs was upregulated in 48 NSCLC tissues and was varying in NSCLC cells. While LINC00525 was mainly expressed in the cytoplasm, MED4‑AS1 was in both the nucleus and cytoplasm of A549 cells. In addition, the expression of LINC00525 was significantly associated with smoking history (P<0.05); MED4‑AS1 was significantly associated with women, poor differentiation and lymph node metastasis (P<0.05); STEAP2‑AS1 was significantly associated with women (P<0.01); and SYNPR‑AS1 was significantly associated with women and adenocarcinoma (P<0.05). These lncRNAs may be valuable biomarkers for prognosis of NSCLC and the ceRNA network may provide new insights in the pathogenesis of NSCLC.
Non-small cell lung cancer (NSCLC) is a common lung cancer with a high incidence. In the past decades, although great advances have been made in NSCLC diagnosis, surgical resection, targeted therapies, radiotherapies, immunotherapies and chemotherapies (1–5) the 5-year overall survival and recurrence rate of NSCLCpatients is still not satisfactory. The poor outcomes may stem from the few biomarkers available for precise prognosis of NSCLC and incompletely understanding the pathogenesis of NSCLC. Therefore, identification of new prognostic markers and understanding of the molecular mechanisms underlying the pathogenesis of NSCLC would be of great significance in the management of NSCLCpatients.Long non-coding RNAs (lncRNAs), a class of non-coding RNAs with >200 nucleotides in length (6). are important regulators of many biological activities, such as transcription, RNA protein modification, and RNA interaction (7). lncRNAs demonstrate their own characteristics: i) The sequence is less conserved, only ~12% of lncRNAs can be found in other organisms other than humans, indicating their specificity in humans; ii) some lncRNAs have a unique subcellular location and may have a novel subcellular composition; iii) lncRNAs can regulate gene expression from chromatin remodeling, transcriptional regulation and post-transcriptional processing; iv) lncRNAs can be recognized by other complementary nucleic acids and direct the protein to specific sequence sites, which makes lncRNAs more abundant in cancers. Previous studies have revealed that altered expression of lncRNAs is associated with the development and progression of humanmalignant tumors, including lung and liver cancer, osteosarcoma, glioblastoma and as well as other types of cancer (8–11). Furthermore, lncRNAs can act as oncogenes or tumor suppressor genes, modulating the progression of NSCLC and affecting the overall survival (OS) of NSCLCpatients (12,13).Multiple RNA transcripts, such as mRNAs, lncRNAs, pseudogenes, and circular RNAs, can act as competing endogenous RNAs (ceRNAs) to mutually regulate their function by competitive binding of miRNA response elements (MREs) (14). Abnormal expression of any transcript may cause a failure to control a regulatory network, ultimately leading to the development and progression of cancer (15). This type of regulatory network is widely found in various cancer diseases, such as hepatocellular carcinoma, pheochromocytoma, and ovarian cancer (16–18). Previous studies have revealed that upregulated HOTAIR expression was associated with metastasis and poor prognosis of NSCLC, and HOTAIR could promote the proliferation, survival, invasion, metastasis, and drug resistance of NSCLC cells (19) while MALAT1 altered the expression of metastasis-associated genes in NSCLC (20). However, little is known on what type of ceRNA network regulates the progression of NSCLC.In the present study, the differentially expressed lncRNAs, miRNAs and related mRNAs in NSCLC were extracted from The Cancer Genome Atlas (TCGA) to construct a ceRNA network. Subsequently, the expression of four selected lncRNAs, their subcellular localization, and associated clinical parameters in NSCLC tissues were analyzed. The present findings indicated that these lncRNAs may be valuable biomarkers for prognosis and the ceRNA network may provide new insights into the pathogenesis of NSCLC.
Materials and methods
RNA sequencing data and analysis in TCGA
Differentially expressed RNAs were screened from RNA-seq data and miRNA-seq data in TCGA (cancergenome.nih.gov) separately, since lncRNAs and mRNAs were included in the RNA-seq database and miRNAs were included in the miRNA-seq database (21). Then, the data of RNA expression in NSCLC were extracted from these two databases and normalized by ‘DESeq2’ and ‘edgeR’ package. The obtained P-values were corrected by false discovery rate (FDR). Differentially expressed mRNAs, lncRNAs and miRNAs were considered when the RNA transcript between NSCLC and non-tumor tissues reached a P-value of <0.01, FDR <0.01 and |logFC (fold change)|>2. Their expression levels were expressed as Fragments Per Kilobase Million (FPKM).
Construction of a ceRNA network
A ceRNA network was established by three steps: i) By miRcode matching, individual miRNAs that interacted with the differentially expressed lncRNAs and overlapped with the differentially expressed miRNAs were used for establishment of the lncRNA/miRNA relationship; ii) the mRNAs targeted by miRNAs in the lncRNA/miRNA network were predicted by miRDB (http://mirdb.org/), miRTarBase (http://mirtarbase.mbc.nctu.edu.tw/php/index.php), and TargetScan databases (http://www.targetscan.org/vert_72/), and they were overlapped with the previously selected differentially expressed mRNAs to construct the miRNA-mRNA relationship network; iii) finally, the ceRNA network was constructed, based on the aforementioned two relationship networks using the ‘ggalluvial’ R package.
Prognostic analysis of lncRNAs in the network
Individual subjects in the TCGA database were stratified, according to the levels of specific lncRNAs, and their survival data were extracted. The survival difference of subjects with varying levels of individual lncRNAs in the ceRNA network was analyzed using ‘survminer’ R package and ‘surv_cutpoint’ function.
Bioinformatics
The potential functions and pathway enrichments of the differentially expressed genes identified in the ceRNA network were analyzed by gene ontology (GO) (http://geneontology.org/) (22) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (https://www.kegg.jp/) (23) using the R language, based on a P-value of <0.05. The diagnostic values of these differentially expressed RNAs were analyzed by receiver operating characteristic (ROC) curve.
Clinical specimens
Surgical NSCLC specimens were obtained from 48 individual patients (23 males and 25 females; age range, 45–81 years), who underwent NSCLC surgery at the Affiliated Shengjing Hospital, China Medical University, Shenyang, China from May 2017 to July 2018. All samples were frozen at −80°C immediately after excision during surgery. The NSCLC specimens were staged pathologically by the TNM of the UICC (Union for International Cancer Control) (24). The experimental protocol was approved by the Shengjing Hospital Ethics Committee (2018PS170K) and written informed consent was obtained from all patients.
Cell culture
Humannon-tumor lung epithelial BEAS-2B cells, 293 cells, NSCLCA549, NCI-H460, NCI-H1299 and NCI-H292 cells (<25 passages) were obtained from the Type Culture Collection of the Chinese Academy of Sciences (Shanghai, China). BEAS-2B and 293 cells were cultured in Dulbecco's modified Eagle's medium (DMEM)/F12 containing 10% heat-inactivated fetal bovine serum (FBS), 100 IU/ml penicillin and 100 µg/ml streptomycin at 37°C in 5% CO2. NSCLC cells were maintained in 10% FBS RPMI-1640 medium.
RNA extraction and quantitative RT-PCR
Total RNA was extracted from individual specimens and cells using TRIzol reagent and reversely-transcribed into cDNA using HisScript™ QRT SuperMix (Vazyme Biotech Co., Ltd., Nanjing, China). The relative levels of RNA transcripts to the control were determined by qRT-PCR on Agilent Technologies Stratagene Mx3000P using ChamQ™ Universal SYBR qPCR Master Mix (Vazyme Biotech Co., Ltd.) and specific primers are presented in Table SI. The data were normalized and analyzed by 2−ΔΔCq method (25).
Subcellular fractionation
The cells were harvested and after being washed, and then they were treated with 500 µl of cell disruption buffer (PARIS kit; cat. no. AM1921; Invitrogen; Thermo fisher Scientific, Inc.) on ice for 10 min, followed by centrifugation at 500 × g. Their supernatants were collected as the cytoplasmic samples. The pellets were washed and solved in an equal volume of nuclear lysate buffer, followed by centrifugation at 500 × g. The supernatants were used as the nuclear samples. Subsequently, the RNAs in the nuclear and cytoplasmic samples were extracted and subjected to qRT-PCR using primers presented in Table SI.
Statistical analysis
Data are expressed as the mean ± SD. Difference between two groups was analyzed by unpaired Student's t-test. Difference of multiple comparisons was analyzed by ANOVA (parametric) followed by Dunnett's post-test. The category data were analyzed by Chi-square test. All statistical analyses were performed using SPSS 22.0 software (IBM Corp.) and GraphPad Prism version 7.0 (GraphPad Software, Inc.). A two-tailed P-value of <0.05 was considered to indicate a statistically significant difference.
Results
Screening of differentially expressed RNAs of NSCLC in TCGA
To screen differentially expressed RNAs, RNA-seq data of 535 NSCLC and 59 paracancerous lung samples and miRNA-seq data of 521 NSCLC and 46 paracancerous lung samples from the TCGA database were extracted. Based on statistical significance of P<0.01, FDR<0.01 and |logFC|>2, these data were analyzed by heatmaps and volcanoes (Fig. 1). There were 2,502 differentially expressed mRNAs (Fig. 1A), of which, 1,975 were upregulated and 527 downregulated (Fig. 1D). Furthermore, there were 1,685 differentially expressed lncRNAs (Fig. 1B), of which, 1,486 were upregulated and 199 downregulated (Fig. 1E). Moreover, there were 120 differentially expressed miRNAs (Fig. 1C), of which, 104 were upregulated and 16 downregulated (Fig. 1F).
Figure 1.
Analysis of differentially expressed RNAs in NSCLC tissues by heatmaps and volcano maps. The expression levels of mRNAs, lncRNAs and miRNAs in NSCLC and adjacent non-tumor tissues was extracted from TCGA and analyzed by (A-C) heatmap and (D-F) volcano map, respectively. Data are the distribution of individual NSCLC patients. The green dots, downregulated RNAs; the red dots, upregulated RNAs. NSCLC, non-small cell lung cancer; lncRNAs, long non-coding RNAs; miRNAs, microRNAs; TCGA, The Cancer Genome Atlas.
Construction of ceRNA network
The miRcode matching indicated that 144 lncRNAs and 22 miRNAs effectively generated 570 pairs of associations (Table SII). Furthermore, using the miRDB, miRTarBase and TargetScan databases, we predicted that these 22 miNRAs targeted 722 mRNAs. After overlapping the 2,502 differentially expressed mRNAs with 722 mRNAs, 12 miRNAs formed 44 miRNA-mRNA association pairs (Table SIII). Accordingly, a ceRNA network was constructed and contained 113 DElncRNAs, 12 DEmiRNAs, and 36 DEmRNAs (Fig. 2). Further analysis of the network revealed that it contained 161 nodes and 372 edges. The degrees of the network were detailed in Table SIV.
Figure 2.
The ceRNA network consists of 113 differentially expressed lncRNAs, 12 miRNAs, and 36 mRNAs. ceRNA, competing endogenous RNA; lncRNAs, long non-coding RNAs; miRNAs, microRNAs.
GO and KEGG pathway analyses
The potential biological functions of mRNA in ceRNA networks were analyzed by GO and KEGG. The results revealed that mRNAs in the ceRNA network were mainly involved in the cell cycle G1/S phase transition and DNA replication (Fig. 3A) and participated in the processes of the cell cycle, cellular senescence and p53 signaling (Fig. 3B).
Figure 3.
Potential functions of the differentially expressed lncRNAs analyzed by GO and KEGG analyses. The potential functions of the differentially expressed lncRNAs in NSCLC were analyzed by GO and KEGG analyses. (A) The GO analysis of biological processes, cellular components, molecular functions. (B) The KEGG analysis of potential pathways. Data are expressed as the enrichment values of lncRNAs for the specific function. lncRNAs, long non-coding RNAs; GO, gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; NSCLC, non-small cell lung cancer.
Related lncRNAs in the ceRNA network
After stratification of those subjects, based on the differentially expressed lncRNAs, 103 out of 113 lncRNAs in the network were significantly associated with prognosis in this population (Table SIII). Since LINC00525, MED4-AS1, STEAP2-AS1, SYNPR-AS1 were novel lncRNAs in NSCLC and their differential fold and P-values were the most significant (Table SV), their functional association with survival was further analyzed (Fig. 4). The higher levels of LINC00525 or STEAP2-AS1 were significantly associated with a shorter survival of NSCLCpatients while higher levels of MED4-AS1 or SYNPR-AS1 were significantly associated with a longer survival of NSCLCpatients. Hence, these lncRNAs had opposite functions in regulating the development and progression of NSCLC.
Figure 4.
Kaplan-Meier analysis of the survival of patients with varying levels of lncRNA expression in NSCLC. Individual NSCLC patients were stratified, based on the cut-off value of each lncRNA and their overall survival was estimated by the Kaplan-Meier analysis. The difference between the groups was analyzed by log-rank test. (A and C) The upregulated LINC00525 and STEAP2-AS1 expression was associated with a shorter survival of NSCLC patients. (B and D) The upregulated MED4-AS1 and SYNPR-AS1 expression was associated with a longer survival of NSCLC patients. lncRNA, long non-coding RNA; NSCLC, non-small cell lung cancer.
Expression and ROC curve analysis of four lncRNAs in TCGA
To quantify lncRNA expression, four lncRNAs in tumor and adjacent non-tumor tissues of individual subjects in TCGA were quantified as FPKM values. The FPKM values of LINC00525, STEAP2-AS1 and SYNPR-AS1 in NSCLC tissues were significantly greater while the FPKM values of MED4-AS1 in NSCLC tissues were significantly less than that in the adjacent non-tumor tissues (Fig. 5A-D). Further ROC analysis revealed that the area under the curve (AUC) values of LINC00525, MED4-AS1, STEAP2-AS1, and SYNPR-AS1 were 0.841, 0.962, 0.816, and 0.749, respectively.
Figure 5.
Expression levels of four selected lncRNAs. The expression levels of four selected lncRNAs in NSCLC and their adjacent non-tumor tissues in TCGA were compared and the potential diagnostic values of these lncRNAs were determined by ROC analysis. Data are expressed as individual values and the AUC values of individual lncRNAs. (A-D) Scatter plots of the expression of individual lncRNAs. (E-H) The ROC analysis of individual lncRNAs. lncRNAs, long non-coding RNAs; NSCLC, non-small cell lung cancer; TCGA, The Cancer Genome Atlas; ROC, receiver operaring characteristic; AUC, area under the curve. ****P<0.0001.
Validation of four lncRNAs in NSCLC samples and cells
To clarify the clinical significance of these four lncRNAs, 48 pairs of NSCLC and adjacent tissue specimens were collected. The relative expression levels of four lncRNAs in NSCLC and adjacent tissues were determined by qRT-PCR. The relative levels of four lncRNA transcripts in NSCLC tissues were significantly higher than that in the adjacent tissues (Fig. 6A-D; P<0.05). Analyses of different cell lines indicated various expression levels of these four lncRNAs (Fig. 6E-H). In comparison with that in non-tumorBEAS-2B cells, LINC00525, MED4-AS1 and STEAP2-AS1 were significantly upregulated in A549 cells. Significantly upregulated LINC00525 expression was also detected in NCI-H460 cells. SYNPR-AS1 revealed no significant difference between groups. Further analysis revealed that LINC00525 was mainly expressed in the cytoplasm while MED4-AS1 was detected in both the nucleus and cytoplasm of A549 cells (Fig. 7A and B).
Figure 6.
Expression levels of four selected lncRNAs in 48 NSCLC and their adjacent non-tumor tissues and different cell lines. The expression levels of individual lncRNAs in 48 NSCLC and their adjacent non-tumor tissues and different cell lines were determined by quantitative RT-PCR. Data are expressed as the mean values of individual specimens or the mean value ± SD of each group of cells from three separate experiments. (A-D) The differential expression of individual lncRNAs in 48 pairs of NSCLC and their adjacent non-tumor tissues. The statistical test was performed using unpaired Student's t-test. (E-H) The expression levels of individual lncRNAs in the indicated cells. The statistical test was performed using ANOVA (parametric) test. lncRNAs, long non-coding RNAs; NSCLC, non-small cell lung cancer. *P<0.05, **P<0.01, ***P<0.001, ****P<0.0001.
Figure 7.
Distribution of LINC00525 and MED4-AS1 in A549 cells. The nucleus and cytoplasm of A549 cells were extracted and the levels of LINC00525 and MED4-AS1 in these cellular compartments were determined by quantitative RT-PCR. Data are presented as the mean values of individual lncRNAs in each compartment of A549 cells from three independent experiments. (A and B) The nuclear and cytoplasmic distribution of LINC00525 and MED4-AS1 in A549 cells. U1 snRNA, nuclear control; GAPDH, cytoplasmic control. The statistical test was performed using Chi-square test. lncRNAs, long non-coding RNAs.
Higher expression levels of four lncRNAs are associated with clinical characteristics
Finally, we analyzed the potential association between higher expression levels of individual lncRNAs and clinical parameters in 48 NSCLCpatients. Notably, higher expression levels of LINC00525 were significantly associated with smoker history (P=0.036; Table I). Higher expression levels of MED4-AS1 were significantly associated with females (P=0.02), poor differentiation (P=0.046) and lymph node metastasis (P=0.043) (Table II). Higher expression levels of STEAP2-AS1 were significantly associated with females (P=0.009; Table III) and higher expression levels of SYNPR-AS1 were significantly associated with females (P=0.043) and adenocarcinoma (P=0.035) (Table IV) in this population. After the P-value was adjusted for sex, the expression of MED4-AS1 was markedly associated with poor differentiation (adj.P=0.055) and lymph node metastasis (adj.P=0.07). However, their expression levels were not significantly associated with other parameters assessed.
Table I.
The association between LINC00525 expression and the clinical characteristics of NSCLC patients.
Relative LINC00525 expression
Characteristics
N
Low
High
P-value
Age (years)
>65
9
5
4
0.712
≤65
39
19
20
Sex
Male
23
13
10
0.386
Female
25
11
14
Differentiation
Well, moderate
36
17
19
0.505
Poor
12
7
5
Tumor size (maximum diameter)
>3 cm
22
10
12
0.562
≤3 cm
26
14
12
Histological tumor type
Squamous cell carcinoma
17
9
8
0.763
Adenocarcinoma
31
15
16
Smoking history
Smokers
20
6
14
0.036[a]
Never smoked
28
17
11
Lymph node metastasis
Positive
21
9
12
0.383
Negative
27
15
12
TNM stage
Stage I
20
12
8
0.242
Stages II, III or IV
28
12
16
P<0.05. Data were analyzed by Chi-square test. The high and low groups were divided by the median expression value of LINC00525 (1.405). NSCLC, non-small cell lung cancer.
Table II.
The association between MED4-AS1 expression and the clinical characteristics of NSCLC patients.
Relative MED4-AS1 expression
Characteristics
N
Low
High
P-value
Adj.P-value
Age (years)
>65
11
6
5
0.731
0.708
≤65
37
18
19
Sex
Male
22
15
7
0.020[a]
Female
26
9
17
Differentiation
Well, moderate
36
21
15
0.046[a]
0.055
Poor
12
3
9
Tumor size (maximum diameter)
>3 cm
22
8
14
0.141
0.109
≤3 cm
26
15
11
Histological tumor type
Squamous cell carcinoma
16
8
8
0.999
0.097
Adenocarcinoma
32
16
16
Smoking history
Smokers
19
12
7
0.14
0.477
Never smoked
29
12
17
Lymph node metastasis
Positive
23
8
15
0.043[a]
0.07
Negative
25
16
9
TNM stage
Stage I
19
12
7
0.14
0.596
Stages II, III or IV
29
12
17
P<0.05. Data were analyzed by Chi-square test. The high and low groups were divided by the median expression value of MED4-AS1 (1.085). NSCLC, non-small cell lung cancer.
Table III.
The association between STEAP2-AS1 expression and clinical characteristics of NSCLC patients.
Relative STEAP2-AS1 expression
Characteristics
N
Low
High
P-value
Adj.P-value
Age (years)
>65
10
7
3
0.155
0.151
≤65
38
17
21
Sex
Male
23
16
7
0.009[a]
Female
25
8
17
Differentiation
Well, moderate
36
18
18
0.999
0.839
Poor
12
6
6
Tumor size (maximum diameter)
>3 cm
23
9
14
0.149
0.19
≤3 cm
25
15
10
Histological tumor type
Squamous cell carcinoma
17
10
7
0.365
0.388
Adenocarcinoma
31
14
17
Smoking history
Smokers
19
9
10
0.768
0.115
Never smoked
29
15
14
Lymph node metastasis
Positive
21
9
12
0.383
0.489
Negative
27
15
12
TNM stage
1 stage
20
12
8
0.242
0.74
2.3.4 stage
28
12
16
P<0.05. Data were analyzed by Chi-square test. The high and low groups were divided by the median expression value of STEAP2-AS1 (1.240). NSCLC, non-small cell lung cancer.
Table IV.
The association between SYNPR-AS1 expression and the clinical characteristics of NSCLC patients.
Relative SYNPR-AS1 expression
Characteristics
N
Low
High
P-value
Adj.P-value
Age (years)
>65
10
5
5
0.999
0.964
≤65
38
19
19
Sex
Male
23
15
8
0.043[a]
Female
25
9
16
Differentiation
Well, moderate
36
16
20
0.182
0.129
Poor
12
8
4
Tumor size (maximum diameter)
>3 cm
23
13
10
0.386
0.279
≤3 cm
25
11
14
Histological tumor type
Squamous cell carcinoma
17
12
5
0.035[a]
0.245
Adenocarcinoma
31
12
19
Smoking history
Smokers
19
11
8
0.376
0.962
Never smoked
29
13
16
Lymph node metastasis
Positive
23
10
13
0.386
0.578
Negative
25
14
11
TNM stage
Stage I
19
10
9
0.768
0.984
Stages I, II or III
29
14
15
P<0.05. Data were analyzed by Chi-square test. The high and low groups were divided by the median expression value of SYNPR-AS1 (1.545). NSCLC, non-small cell lung cancer.
Discussion
NSCLC is a common lethal cancer in the world, and the current 5-year survival rate of NSCLC remains low (26). In the present study, 2,502 differentially expressed mRNAs, 1,685 lncRNAs, and 120 miRNAs were identified in NSCLC tissues from TCGA, respectively. Further GO and KEGG analyses indicated that the differentially expressed mRNAs may participate in cellular processes (27), molecular binding (28), signal transduction and survival (29), which are important for the progression of malignant tumors. Using these differentially expressed RNAs, a ceRNA network was constructed, which reflected the relationships among different RNA transcripts. Hence, such findings may provide new insights into the pathogenesis of NSCLC.lncRNAs are abundant in different species of animals and can regulate various biological processes and malignancies. Previous studies have revealed that tumor-associated lncRNAs not only bind to transcription factors and proteins, but also affect tumor-associated mRNAs by competitive binding to miRNAs (30,31). In fact, LINC00858 was revealed to act as a ceRNA to effectively sponge miR-422a, modulating the expression of KLK4 to impair NSCLC cell proliferation, migration and invasion (32). Similarly, ROR acted as a ceRNA of miR-145 to promote the FSCN1-mediated metastasis of esophageal squamous cell carcinoma (33). In addition, DLG1-AS1, along with miR-107 and ZHX1, formed a ceRNA network to regulate cervical cancer progression (34). MALAT1 may act as a sponger of miR-124 and attenuate the miR-124-downregulated FoxQ1 expression, regulating the progression and metastasis of bladder transitional cell carcinoma (35). In addition, the MEG3/miR-181a/HOXA11 ceRNA network and the lncRNA/miRNA/mRNA may reflect the regulatory communication among ceRNAs in multiple myeloma and colorectal carcinoma (36,37). The network construction visually demonstrated that ceRNAs are key regulators in the communication among different RNA transcripts. It may reveal the intrinsic regulatory network mechanism of NSCLC at the transcriptional level. In addition, by analyzing and validating the core factors in clinical specimen data and survival data within this network, novel biomarkers may be identified to serve as diagnostic and prognostic indicators for NSCLC. Thus, the lncRNA/miRNA/mRNA ceRNA network we constructed in NSCLC extended previous findings.In the present study, 103 lncRNAs were identified in the network that were significantly associated with prognosis, including H19, HOTAIR, MEG3, UCA1 (38–41) and other star molecules were associated with NSCLC cell proliferation, invasion and apoptosis. Among them, LINC00525, MED4-AS1, STEAP2-AS1 and SYNPR-AS1 were new lncRNAs and their altered expression was significantly associated with the survival of NSCLCpatients. Notably, these lncRNAs were associated with clinical parameters in 48 NSCLCpatients. Accordingly, these lncRNAs may be valuable biomarkers for prognosis of NSCLCpatients. Notably, while the expression of MED4-AS1 and SYNPR-AS1 was upregulated in NSCLC tissues they were positively associated with OS of NSCLCpatients. Such data indicated that these lncRNAs may be the tumor-stimulated compensatory lncRNAs that may, through a complex regulatory network, inhibit the progression, recurrence and metastasis of NSCLC (42). We are interested in further investigating how these lncRNAs regulate the progression of NSCLC.Previous studies have revealed that BCAR4, XIST and SNHG3 as well as other lncRNAs are significantly elevated in NSCLC specimens (43–45) and they regulate the progression of NSCLC. In the present study, it was revealed that LINC00525, STEAP2-AS1 and SYNPR-AS1 transcripts were significantly increased in NSCLC in the TCGA and 48 NSCLC tissues we collected. Notably, the expression of MED4-AS1 was downregulated in NSCLC from the TCGA, but upregulated in 48 NSCLC specimens we collected. The discrepancy may stem from the different genetic backgrounds of patients with varying stages of NSCLC. In addition, it was revealed that the frequency of female NSCLCpatients with increased MED4-AS1, STEAP2-AS1 or SYNPR-AS1 expression was significantly higher than male patients. We are interested in further examining whether these upregulated lncRNAs can regulate the expression of sex-determining region Y-box 2 (SOX2) transcription factor (46). Moreover, it was revealed that these four lncRNAs were variably expressed in different NSCLC cells. While LINC00525 was mainly present in the cytoplasm, MED4-AS1 was present in both the nucleus and cytoplasm of A549 cells. The inconsistent distribution may reflect different functions of these lncRNAs in A549 cells, supporting the notion that lncRNAs regulate the expression of many oncogenic or tumor suppressive genes at transcriptional or post-transcriptional levels (47).Heavy smoking is a risk factor of NSCLC. A previous study revealed that cigarette smoke exposure (CSE) increased LINC00152 expression in airway epithelial cells in a dose-dependent manner, and promoted cyclin D1 expression and G1/S transition, leading to metastasis and proliferation of NSCLC (48). In the present study, it was revealed that upregulated LINC00525 expression was positively associated with a smoking history. In fact, it was revealed in a previous study that >50% of NSCLC cases occured in non-smoking women, and these cases were closely associated to EGFR and ALK mutations (49). In addition, upregulated MED4-AS1 expression was positively associated with poor differentiation and lymphatic node metastasis, indicating that it may promote invasion and metastasis of NSCLC. SYNPR-AS1 was positively associated with adenocarcinoma. Notably, the present study had limitations of small sample size and the lack of functional studies of individual lncRNAs. This study mainly predicted potential new biomarkers for NSCLC. Further investigations are warranted in validating the diagnostic and prognostic values of these lncRNAs and how these lncRNAs synergistically regulate the progression of NSCLC by co-expression of two or multiple lncRNAs.In conclusion, the present data indicated that there were numerous differentially expressed lncRNAs, and along with the differentially expressed miRNAs and mRNAs, a ceRNA network was constructed. These differentially expressed mRNAs were involved in many biological processes. The expression of four selected lncRNAs was significantly associated with OS of NSCLCpatients and their expression was validated in 48 NSCLC specimens we collected. These lncRNAs were significantly associated with some clinical parameters, and LINC00525 and MED4-AS1 were differentially present in the nucleus and cytoplasm of A549 cells. Therefore, these four lncRNAs may be valuable biomarkers for prognosis of NSCLC and may provide new insights in the pathogenesis of NSCLC.
Authors: G Franci; G Manfroni; R Cannalire; T Felicetti; O Tabarrini; A Salvato; M L Barreca; L Altucci; V Cecchetti Journal: Cell Prolif Date: 2015-12 Impact factor: 6.831