Literature DB >> 29127420

Genome-wide analyses of long noncoding RNA expression profiles in lung adenocarcinoma.

Zhenzi Peng1, Jun Wang1, Bin Shan2, Fulai Yuan1, Bin Li1, Yeping Dong1, Wei Peng1, Wenwen Shi1, Yuanda Cheng3, Yang Gao3, Chunfang Zhang3, Chaojun Duan4,5.   

Abstract

LncRNAs have emerged as a novel class of critical regulators of cancer. We aimed to construct a landscape of lncRNAs and their potential target genes in lung adenocarcinoma. Genome-wide expression of lncRNAs and mRNAs was determined using microarray. qRT-PCR was performed to validate the expression of the selected lncRNAs in a cohort of 42 tumor tissues and adjacent normal tissues. R and Bioconductor were used for data analysis. A total of 3045 lncRNAs were differentially expressed between the paired tumor and normal tissues (1048 up and 1997 down). Meanwhile, our data showed that the expression NONHSAT077036 was associated with N classification and clinical stage. Further, we analyzed the potential co-regulatory relationship between the lncRNAs and their potential target genes using the 'cis' and 'trans' models. In the 25 related transcription factors (TFs), our analysis of The Cancer Genome Atlas database (TCGA) found that patients with lower expression of POU2F2 and higher expression of TRIM28 had a shorter overall survival time. The POU2F2 and TRIM28 co-expressed lncRNA landscape characterized here may shed light into normal biology and lung adenocarcinoma pathogenesis, and be valuable for discovery of biomarkers.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29127420      PMCID: PMC5681506          DOI: 10.1038/s41598-017-15712-y

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Lung cancer is the leading cause of death among all types of cancer[1]. Lung cancer has two histological types: small cell lung cancer (SCLC) and non-small lung cancer (NSCLC). The most common histological type of NSCLC is lung adenocarcinoma (ADC) that accounts for approximately 50% of NSCLC[2]. In general, ADC, originated from distal airways, is less frequently associated with chronic inflammation and smoking than squamous cell carcinoma (SCC)[3]. There is still no definitive biomarker for diagnosis and prognosis of ADC although several studies have identified a large number of genes associated with tumor initiation and progression. LncRNAs have emerged as important regulators of physiology and disease. However, little is known about the mechanism of their functions[4]. lncRNAs are noncoding RNAs that are longer than 200 nucleotides in length. Recent RNA-SEQ based transcriptome studies revealed that 68% transcripts of human are lncRNAs and approximately 80% of them are unannotated[5]. lncRNAs are categorized into four groups: intronic, exonic, overlapping, and intergenic according to their locations in the genome[6]. lncRNAs are also classified as ‘cis’ or ‘trans’ lncRNAs according to their modes of regulation of transcription. lncRNAs can act as decoys, scaffolds, sponges, and guide of the protein and RNA molecules in cells[7]. To delineate genome-wide ADC-associated lncRNAs expression, we employed locus-by-locus lncRNA and mRNA microarray probes to identify the lncRNAs and mRNAs that are differentially expressed between the ADC tumor tissues and the matched adjacent normal lung tissues. Our results establish a link between lncRNAs and clinical features of ADC, namely age, gender, smoking index, differentiation, TNM stage, and clinical stage. Using the ‘cis’ and ‘trans’ mode, we reveal potential co-regulatory relationship between the lncRNAs and their target genes. We also construct a “TFs-lncRNAs” two-element network and a “TFs-lncRNAs-mRNA” three-element network in ADC using hyper geometric distribution. We also identify potential cis-regulatory target genes of the differentially expressed lncRNAs in ADC by screening the co-expressed genes located near the differentially expressed lncRNAs. We identify the TFs that potentially regulate the differentially expressed lncRNAs by combining analysis of the TCGA database and hyper geometric distribution.

Results

LncRNAs and mRNAs expression profile in ADC

To construct the lncRNA landscape in lung adenocarcinoma, we analyzed lncRNA and mRNA expression profile of 6 lung adenocarcinoma and their matched adjacent normal lung tissues using microarray. In order to avoid complicating factors derived from the heterogeneous nature of ADC clinical characteristics, we chose all female patients who have no smoking history and are between 45 and 55 year old. A volcano plot was used to provide an overview of the dysregulated lncRNAs in our microarray data sets (Fig. 1A). Principal component analysis (PCA) (Fig. 1B) and hierarchical clustering analysis (HCA) were applied to establish and cluster the lncRNA expression profile (Fig. 1C) and the mRNA expression profile (Fig. 1D). This difference distinguished the ADC group from the adjacent normal tissue group. From the microarray data, we identified 3045 lncRNAs and 2602 mRNAs that were differentially expressed between 6 paired ADC tumor tissues and adjacent normal lung tissues (Fold Change ≥2.0 or ≤0.5, p  < 0.05 and False discovery rate (FDR) <0.05=)[8]. Among these, 1048 lncRNAs and 694 mRNAs were upregulated in all six ADC samples, and 1997 lncRNAs and 1908 mRNAs were downregulated (see Supplementary Table S1, S2). Among them 57 upregulated lncRNAs and 329 downregulated lncRNAs were altered greater than 6.0-fold. The most upregulated lncRNAs was NONHSAT097328 (Fold change: 26.42) and the most downregulated was NR_038190.1 (Fold change: 117.37).
Figure 1

LncRNAs and mRNAs expression profiles in ADC. (A) Volcano Plot of the differentially expressed lncRNAs in ADC tumor tissues and adjacent normal lung tissues. The red points in the plot represent differentially expressed lncRNAs with statistical significance. (B) Principal Components Analysis. The B group (red plots) represents adjacent normal lung tissues and C group (blue plots) represents the ADC tumor tissues. (C) Hierarchical Clustering shows a distinguishable lncRNA expression profile and (D) mRNA expression profile.

LncRNAs and mRNAs expression profiles in ADC. (A) Volcano Plot of the differentially expressed lncRNAs in ADC tumor tissues and adjacent normal lung tissues. The red points in the plot represent differentially expressed lncRNAs with statistical significance. (B) Principal Components Analysis. The B group (red plots) represents adjacent normal lung tissues and C group (blue plots) represents the ADC tumor tissues. (C) Hierarchical Clustering shows a distinguishable lncRNA expression profile and (D) mRNA expression profile.

Validation of differential lncRNA expression

To evaluate our microarray performance, we measured the expression of 9 lncRNAs in all 42 paired ADC tumor tissues and adjacent normal lung tissues using qRT-PCR (Fig. 2A). The result showed that the expression patterns of these lncRNAs were consistent with microarray data in ADC tumor tissues and adjacent normal tissues.
Figure 2

Validation of differential lncRNA expression and function. (A) Expression level of the selected lncRNAs in ADC tissues and adjacent normal tissues. qRT-PCR were used to verify differentially expressed lncRNAs (P ≤ 0.05). Gene expression was normalized to 36B4 expression. The blue points represent cancer tissues and the red points represent adjacent normal tissues. (B) Frequency distribution of lncRNAs enrichment on TFs. The X-axis is frequency distribution and Y-axis is the TFs name. (C–F) Go and Pathway analysis of lncRNAs co-expressed genes. The top 10 enriched terms were calculated as −log10 (P-value).

Validation of differential lncRNA expression and function. (A) Expression level of the selected lncRNAs in ADC tissues and adjacent normal tissues. qRT-PCR were used to verify differentially expressed lncRNAs (P ≤ 0.05). Gene expression was normalized to 36B4 expression. The blue points represent cancer tissues and the red points represent adjacent normal tissues. (B) Frequency distribution of lncRNAs enrichment on TFs. The X-axis is frequency distribution and Y-axis is the TFs name. (C–F) Go and Pathway analysis of lncRNAs co-expressed genes. The top 10 enriched terms were calculated as −log10 (P-value). To further investigate an association between lncRNAs expression and clinicopathological features in ADC, the patients were categorized by age, TNM stages, and differentiation. Based on the expression in paired tumor and adjacent normal tissues, we found NONHSAT077036 expression was associated with N classification and clinical stage (Table 1 P = 0.002, P = 0.008). Then we further investigated whether the association between NONHSAT077036 and TNM staging was specific to ADC. We measured the expression of NONHSAT077036 in 30 paired SCC tumor tissues and adjacent normal lung tissues using qRT-PCR. The data showed that NONHSAT077036 was not associated with N classification and clinical stage in SCC (Table 2 P = 0.253, P = 0.448). This indicated that NONHSAT077036 was only associated with ADC but not with SCC.
Table 1

P-value of lncRNAs expression with clinicopathological features in ADC patients.

LncRNAs No.Age (years)GenderSmoking IndexDifferentiationT classificationN classificationClinical Stage
NONHSAT0770360.8090.550.0560.2420.1010.0020.008
NONHSAG0034400.960.940.4610.640.8940.3990.919
NONHSAT0908790.4970.5910.1390.8720.8470.2590.815
NONHSAT0479100.1140.0480.1170.6820.7050.6070.823
NONHSAT0041370.960.940.4610.3790.8480.3990.919
NONHSAT0753390.1880.550.2440.9460.6460.8530.906
NONHSAT0592040.960.2890.2330.390.4820.7010.21
NR_002165.10.7480.1270.7550.9660.7630.7010.925
NONHSAT0722070.9570.8240.1020.4360.2850.0910.269
Table 2

P-value of lncRNAs expression with clinicopathological features in SCC patients.

LncRNAs No.Age (years)GenderSmoking IndexDifferentiationT classificationN classificationClinical Stage
NONHSAT0770360.5690.6120.6880.1390.5740.2530.448
NONHSAG0034400.870.0140.9990.2750.0410.5890.035
NONHSAT0908790.4240.0260.1190.3780.9990.0840.378
NONHSAT0479100.6880.0430.2610.1540.1470.8450.083
NONHSAT0041370.1460.6450.0390.4950.4660.1570.347
NONHSAT0753390.5960.180.4570.4220.1420.5480.105
NONHSAT0592040.0340.5020.3520.4220.1420.5480.704
NR_002165.10.2890.180.4570.4220.8340.4890.704
NONHSAT0722070.710.8140.4730.990.0850.9690.683
P-value of lncRNAs expression with clinicopathological features in ADC patients. P-value of lncRNAs expression with clinicopathological features in SCC patients. To gain further insight into the role of NONHSAT077036 in ADC, we examined its sequence and structure across species. Generally, lncRNAs are not as conserved as protein coding genes. Therefore, it is difficult to predict lncRNA function based on evolutionary conservation (Fig. 3A). However, NONHSAT077036 showed strong sequence conservation from zebrafish to human, which suggests important functions of NONHSAT077036. Intriguingly NONHSAT077036 exhibited sequence similarity to H19 (132–159, 28 bp), which is also elevated in lung cancer and promotes cancer cell proliferation[9] (Fig. 3B). We then predicted the target genes of NONHSAT077036 using co-expression analysis (Fig. 3C). Among these predicted target genes, many are critical regulators of tumorigenesis. For instance, there is a significant negative correlation between NONHSAT077036 and TOP2A (r = −0.89) that is an oncogene involved in G2 checkpoint in response to DNA damage[10].There is a positive correlation between NONHSAT077036 and CCBE1 (r = 0.89) that reported to regulate extracellular matrix remodeling and migration[11]. These correlations suggest regulation of proliferation and invasion by NONHSAT077036 in ADC.
Figure 3

Function prediction of lncRNAs NONHSAT077036. (A) Sequence conservation of NONHSAT077036 from humans to zebrafish. (B) Sequence similarity between NONHSAT077036 and H19. (C) Potential target genes of NONHSAT077036.

Function prediction of lncRNAs NONHSAT077036. (A) Sequence conservation of NONHSAT077036 from humans to zebrafish. (B) Sequence similarity between NONHSAT077036 and H19. (C) Potential target genes of NONHSAT077036.

Identification of potentially functional lncRNAs in ADC

To predict potential functions of the differentially expressed lncRNAs, we calculated the correlation value of lncRNAs and mRNAs. Differentially expressed lncRNAs were divided into two subsets: the upregulated and downregulated lncRNAs. Top 200 most differentially expressed lncRNAs in each subset were selected for further analysis. LncRNA NR_038190.1 was the most significantly downregulated lncRNAs. The analysis pertaining to it was shown as a representative result. The top 30 correlations between lncRNA NR_038190.1 and its target genes were showed in Table 3.
Table 3

Target genes of NR_038190.1.

Gene Symbol p-valueCorrelationGene Symbol p-valueCorrelation
TBX40.0001010.891002PIK3C30.0001070.889654
RAPGEF40.0001010.891000PTPLAD20.0001070.889621
TTC280.0001020.890780SYNGR10.0001070.889601
SH2D1B0.0001020.890723ZAK0.0001080.889386
ITGA80.0001020.890706APPBP20.0001090.889201
PIP5K1B0.0001020.890704SPATA130.0001100.888938
C1orf1450.0001020.890619CBFA2T30.0001100.888911
ANXA30.0001030.890460SNX10.0001100.888899
GIMAP10.0001030.890407CCNDBP10.0001110.888839
FAM83A0.000103−0.890375NHSL10.0001120.888607
BDNF0.0001040.890349FIGF0.0001120.888513
PLEKHH20.0001040.890150WNT7A0.0001130.888409
FAM162B0.0001050.889993KIDINS2200.0001130.888351
PRKG20.0001050.889954JPH40.0001130.888299
CTDSP10.0001060.889779F100.0001150.887960
Target genes of NR_038190.1. We further analyzed the enrichment of GO (http://geneontology.org/) and KEGG pathway (http://www.kegg.jp/kegg/) terms associated with the lncRNAs that were differentially expressed between ADC and normal tissues. DAVID functional annotation software (https://david.ncifcrf.gov/home.jsp)[12] was used to analyze all co-expressed genes. Top 200 upregulated lncRNA genes and 200 downregulated lncRNA genes were subjected to GO and KEGG pathway analyses. We selected the top 200 reliability prediction terms (according to the p-value and enrichment) for co-expressed and aberrant lncRNA genes, respectively. The top 200 terms in the GO terms were highly enriched for cell adhesion, proliferation, migration (ontology: molecular function), extracellular region part, adheren junction (ontology: cellular component) and cytoskeletal protein binding, growth factor binding (ontology: molecular function). Top 200 terms in the KEGG pathway were associated with pathways in cancer. The most significant top 10 GO terms and KEGG pathway are shown in Fig. 2C–F. Besides, we calculated the enrichment of functional terms of co-expressed genes for each differentiated lncRNAs (see Supplementary Table S3, S4).

LncRNAs target prediction

To explore how lncRNAs function in lung adenocarcinoma, we predicted the cis- and trans-regulated genes of the differentially expressed lncRNAs using co-expression network analysis. The co-expressed genes within 300 kb upstream or downstream from a selected lncRNA were identified as potential “cis” genes of a given lncRNA (p-value of correlation <=0.05). We predicted the cis-regulated genes at the top of differentially expressed lncRNAs (see Supplementary Table S5). In ADC tissue and adjacent normal lung tissue controls, 56 upregulated lncRNAs had 92 ‘cis’ genes and 35 downregulated lncRNAs had 42 ‘cis’ genes. 8 upregulated lncRNAs had at least 3 ‘cis’ genes. The maximal number of cis genes assigned to a differentially expressed lncRNA was 5. The cis relationship of 6 significantly dysregulated (up.58_NONHSAT053536, up.106_NONHSAT024969, up.128_NONHSAG018334, up.129_ENST00000522875, up.190_NONHSAT083792, up.195_ENST00000518528) are shown in Fig. 4A–F.
Figure 4

Cis-regulation genes of representative lncRNAs in the chromosome. The X-axis represents lncRNA position in chromosome, the Y-axis represents correlation coefficient of lncRNA and potential “cis” genes. The red line represents the genome width of lncRNA and blue point represents the position of potential “cis” genes.

Cis-regulation genes of representative lncRNAs in the chromosome. The X-axis represents lncRNA position in chromosome, the Y-axis represents correlation coefficient of lncRNA and potential “cis” genes. The red line represents the genome width of lncRNA and blue point represents the position of potential “cis” genes. Among all these potential cis-regulated target genes, further analysis showed that the most highly related categories were the processes that affect cell growth, differentiation, and migration. The most correlated genes were related to cell cycle. Among the ‘cis’ genes, EGR1, EHF, PLCE1, PKNOX2, MLST8, FGF17, FOSB, PTPRQ, NKTR, and CAV2 are known to function in cell growth and differentiation. CDK7, CCNF, CDKN3, CDC20, TUBG1, FEN1, and GRK5 are critical regulators of cell cycle. PEA15 and CAV2 are regulators of apoptosis among ‘cis’ genes. JAM3, CD36, and RAB21 are believed to bear a critical role in adhesive processes and cell migration. These cis-regulated lncRNAs are potential regulators of ADC. It is noteworthy that lncRNAs also act in ‘trans’ to regulate TFs mediated chromatin remodeling and transcription[13]. We intended to discover which TFs might interact with the differentially expressed lncRNAs using hyper geometric distribution that can calculate the overlap of TFs target genes and chromatin regulators with the co-expressed lncRNA genes. As showed in Fig. 2B, the lncRNAs were significantly regulated by 25 TFs: SIN3A, POU2F2, TRIM28, etc. Early studies have demonstrated that these TFs are master regulators of cancer[14-20]. We further analyzed the relationship between the top 25 lncRNA related TFs and overall survival in TCGA database. We observed that patients with lower expression of POU2F2 and higher expression of TRIM28 had shorter overall survival time (Fig. 5A,C), suggesting that POU2F2 and TRIM28 are biomarkers for poor prognosis of ADC. Based on the results of the lncRNA co-expression analysis, we generated “POU2F2-lncRNAs” and “TRIM28-lncRNAs” two-element network by Cytoscape software (Fig. 5B,D). Then we added target co-expressed genes and generated “POU2F1-lncRNAs-mRNAs” and “TRIM28-lncRNAs-mRNAs” three-element relationship tables (see Supplementary Table S6).
Figure 5

TFs and lncRNAs co-expressed network. (A,C) Kaplan-Meier survival analysis of POU2F2 and TRIM28 expression of ADC patients in TCGA database. (B,D) “POU2F2-lncRNAs” and “TRIM28-lncRNAs” two-element network. Green points represent up-regulated lncRNAs. Red points represent down-regulated lncRNAs. The size of each dot is proportional to the magnitude of the change of a given lncRNA.

TFs and lncRNAs co-expressed network. (A,C) Kaplan-Meier survival analysis of POU2F2 and TRIM28 expression of ADC patients in TCGA database. (B,D) “POU2F2-lncRNAs” and “TRIM28-lncRNAs” two-element network. Green points represent up-regulated lncRNAs. Red points represent down-regulated lncRNAs. The size of each dot is proportional to the magnitude of the change of a given lncRNA.

Discussion

Aberrant expression lncRNAs is recognized as a hallmark feature in pathogenesis and progression of various diseases, including lung cancer. ADC is the largest histological phenotypes of lung cancer. However, the genome-wide expression profile and classification and function of lncRNAs have not been examined in ADC. Unraveling the functions and mechanisms of these lncRNAs can substantially improve our understanding of ADC. Integration of lncRNAs and target genes profiling is also a promising approach to identify effective biomarkers of ADC. Therefore, we screened the genome-wide expression profile of lncRNAs and mRNAs in 6 paired ADC tumor tissues and adjacent normal lung tissues in this study. Microarray data were further validated by qRT-PCR in another 42 paired tissues. We predicted the function of selected lncRNAs according to co-expression genes and Gene Ontology (GO) biological process. In addition, we predicted ‘cis’ and ‘trans’ regulated modes of the 200 tops differentially expressed lncRNAs to find out how they might regulate ADC progression. Several dysregulated lncRNAs reported in the previous studies were also found in our study. The overlapping lncRNA expression profiles between our current study and the previous studies validate the significance of our study. For instance, lncRNA PVT1 is upregulated in various human cancers[21,22], including lung cancer[23]. In our data, 13 probes were designed to measure PVT1 and all of them were upregulated. The average fold change was 5.45. More importantly, we showed that PVT1 may be regulated by 5 TFs: SIN3A, KAT2A, E2F4, E2F1 and GATA2. Another example is the tumor suppressor FENDRR. FENDRR inhibits cell proliferation and migration and is downregulated in cancer cell lines and cancerous tissues[24]. In support of this view, our results also showed a 22.57-fold decrease of FENDRR expression in ADC tissues. Recent reports indicated that FENDER overexpression suppressed invasion and migration by downregulating fibronectin1 expression. Our results also showed a significant inverse correlation between all 5 FENDRR probes and fibronectin1 (Correlation valued ≥0.7 or ≤−0.7). However, functions of the novel differentially expressed lncRNAs identified in our study remain to be characterized experimentally. It is of note that lung cancer is a highly heterogeneous disease. Our analysis showed that NONHSAT077036 expression was associated clinicopathological features only in ADC patients (P < 0.05=, but not other histologic subtypes of non-small cell lung cancer. However, additional studies are needed to verify the significance of NONHSAT077036 in ADC. Functional analysis of NONHSAT077036 needs to be carried out in cell and animal based models of ADC. We also performed Gene Ontology and pathway analysis of co-expressed genes of 400 lncRNAs. Our data indicate that the most related categories include cell adhesion, proliferation, migration, growth factor binding, etc. These GO terms are well-established critical factors in tumorigenesis and tumor progression. Little is known about the exact function of lncRNAs, although evidence so far indicates that lncRNAs participate in various biological processes. We analyzed the aberrantly expressed lncRNAs in ADC in “cis” and “trans” regulated mechanisms. Our data indicated that cis-regulated target genes participate in initiation and progression of ADC. For instance, EGR1 belongs to the EGR family whose activation is involved in differentiation and mitogenesis. In addition, EGR1 supports FGF-dependent angiogenesis during neovascularization and tumor growth[25]. Another study indicates that EGR1 upregulates the expression of lincRNA H19 in liver cancer[26]. In addition, EHF encodes an ETS transcription factor that is expressed in epithelial-specific manner. It has been reported that EHF is silenced by epigenetic mechanisms during NSCLC (non-small cell lung cancer) development[27]. EHF in ovarian cancer cells regulates cell proliferation and G1 phase checkpoint[28]. It is noteworthy that our bioinformatical analysis of the lncRNAs was largely based on a correlation between the expression patterns of lncRNAs and mRNAs. Further studies are needed to experimentally validate the link between lncRNAs and mRNAs identified in the current study. It is generally accepted that lncRNAs can directly interact with gene promoters and TFs. Many of these lncRNAs recruit protein factors to enhancers and regulate the activity of enhancers[29]. Transcriptional processes are also controlled by lncRNAs via their interaction with primary coding transcripts. A survey of correlation between lncRNAs and TFs also helps us reveal its function. TCGA database contains a large number of clinical information about ADC patients. We carried out survival analysis of 25 related ‘trans’ mode TFs in TCGA database and found that the patients with lower expression of POU2F2 have a shorter overall survival. These findings suggest that POU2F2 may be a biomarker for a poor prognosis of ADC. To date, the role of POU2F2 is controversial. Oct-2, encoded by the gene POU2F2, is a B-cell restricted transcription factor. Emerging evidence indicates that POU2F2 is essential to the later stages of B-cell differentiation[30]. Mice with deletion of POU2F2 die shortly after birth[31]. Recent evidence indicates that POU2F2 mediates metastasis induced by ROB01 in gastric cancer[32]. In our study, POU2F2 appears to be a tumor suppressor in ADC. Moreover the POU2F2-lncRNAs” two-element network modulates the expression of 53 lncRNAs. Deciphering the functions of POU2F2 in ADC needs further investigations. We identified another lncRNA related TF, TRIM28 in ADC in our study. The prevailing view is that knockdown of TRIM28 expression impairs cell proliferation in NSCLC cell lines. In addition, patients with elevated expression of TRIM28 suffered shorter tumor-specific survival[33]. Our data support this view because the patients with elevated expression of TRIM28 have a shorter overall survival. Moreover the TRIM28-lncRNAs” two-element network modulates the expression of 129 lncRNAs. It is currently well accepted that molecular networks of multiple genes and pathways, instead of a single gene or a pathway, underlie pathogenesis of cancer. Network analysis has provided an efficient method to model biological processes. It should also be emphasized that dynamic feedback motifs will help us to obtain a unified view of various cellular processes[34,35]. Thus, it is necessary to integrate omics data (gene regulatory networks, cell signaling networks and metabolic networks) in network analysis. Network analysis is an effective approach in predicting potential lncRNA–disease associations. There are a wide range of computational models and web servers that have been developed for this purpose. Chen et al. introduced state-of-the-art computational and FMLNCSIM models to identify disease-related lncRNAs from experimental validation. They developed an effective computational models to construct lncRNA functional similarity and the similarity scores (Long non-coding RNAs and complex diseases: from experimental results to computational models) (FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model)[36-40]. An important limitation of our work is that we have not distinguished different sub-clones. Cancer is a highly heterogeneous disease although one clone can become the dominant population in the tumor at diagnosis[41]. There are many distinct sub-clones that coexist in a tumor and are likely derived from different clonal backup[42,43]. Thus, a drug targeting only one sub-clone within a tumor may have limited effect. Tumor genome-wide analysis is needed to model cancer cells by constructing networks for individual clones. Our findings reported in the current study warrant further investigations of the mechanisms of the differentially expressed lncRNAs to understand their clinical significance in ADC. The lncRNA profile we established in ADC will lay the foundation for a better understanding of the impacts of lncRNAs in ADC patients. The ADC-associated lncRNAs identified in our study are promising biomarkers with potential in tumor diagnosis, classification, prognosis and therapeutic evaluation.

Materials and Methods

Samples

All lung adenocarcinoma patients were diagnosed at the Thoracic Surgery Department of Xiangya Hospital, Central South University. The cohort of 42 lung adenocarcinoma patients provided written informed consent in compliance with the code of ethics of the World Medical Association. The collection and usage of the clinical specimens were approved by the Xiangya Hospital Medical Research Ethics Committee. All experimental methods were performed in accordance with the relevant guidelines and regulations of Xiangya Hospital Medical Research Ethics Committee and the Scientific Research Project 201403216 (Histopathological Application). The tumor-node-metastasis was classified based on the criteria of the eighth tumor-node-metastasis (TNM) staging system. All tissues were stored at −80 °C after initial freezing in liquid nitrogen. The collection of clinical data of these patients include sex, age, smoking status, differentiation and TNM stage (see Supplementary S7).

RNA extraction and array data production

Sample preparation and microarray hybridization were performed by OE Biotech Corporation, Shanghai, P.R. China. Briefly, total RNA was extracted and 200ng RNA was purified from each sample (RNasey Mini Kit (Qiagen p/n 74104). The Agilent 2100 bioanalyzer and RNA LabChip® kits was used to assess RNA quality. RNA Integrity Number (RIN) ≥7 and 28 S/18 S ≥0.7 was used for synthesizing double-stranded cDNA (see Supplementary Table S8). Then cDNA was labeled by Cy3-dCTP and hybridized to the OE Biotech Human 4 × 180 K lncRNAs chip, which contained 46,506 lncRNAs probes and 30,656 mRNAs probes collected from eight databases, including Agilent ncRNA, GencodeV13, lncRNAdb, H-invDB, RefSeq, NONCODE v3.0, UCR and UCSC lncRNAs Transcripts.

qRT-PCR validation

qRT-PCR was used to validate our microarray data. Briefly, total RNA was extracted using Trizol reagent (Invitrogen) and then reverse-transcribed using GoScript™ Reverse Transcription System (Promega) in accordance with the manufacturer’s protocol. Real-time PCR was performed using All-in-One™ qPCR Mix (GeneCopoeia). Specific primers are listed in Table 4. Each sample was normalized by the internal control gene of 36B4. The results represent means of 3 repetitions and were quantified by the 2−ΔΔct method. The mRNA levels of lncRNAs between tumor and non-tumor tissues were compared using T-test (P < 0.05) using R. The mRNA levels of lncRNAs between cancer and non-cancer tissues were compared using T-test (P < 0.05) using R. Pearson × 2 test was used to analyze the relationship between lncRNAs expression and clinicopathologic parameters in SPSS software (version 20.0, Chicago, IL).
Table 4

Primers Used for qRT-PCR Analysis of lncRNAs.

LncRNAs No.PositionPrimer sequence
NONHSAT077036ForwardTGAAGAAGTAACAAGCCTGTCT
ReverseTGGTCTTGATCATCACCGTCT
NONHSAG003440ForwardGGAGGAGTGTGGAGGTTCAA
ReverseTACATGCCTGGGTCAGCTAC
NONHSAT090879ForwardATATACTACAGTGCGTTGTTGTCC
ReverseAGCAGTTGGATGACAGAGAATAG
NONHSAT047910ForwardCAAGTCCCAGAATCCTCCAG
ReverseAGGCTTACAGGAAATGTGCAG
NONHSAT004137ForwardAGCCAGTCTAGTGGACAGAGA
ReverseCCTGCATTGAATAATCACAAGACCA
NONHSAT075339ForwardGACTGGGTTTATTACCCTCTCCT
ReverseTAAGACTGCCTCTGCCCTTC
NONHSAT059204ForwardGAGTGTGACCTAGCGCAGAA
ReverseGAGCACACCTTCCAAGCAC
NR_002165.1ForwardATGGCTAGAAGTGACCCCAG
ReverseTGCCCAGCCTAGACTTCTC
FR407620ForwardCACCTCCCTCAAACCTGTCT
ReverseGCCAGAATTGCTTGCCTCAT
NONHSAT072207ForwardTTGGGAGTGTGCATGAGGTA
ReverseTTTGGTTACATGTCGGCAGT
Primers Used for qRT-PCR Analysis of lncRNAs.

Bioinformatics analysis

Data analysis including heat map, volcano plot, PCA and survival was carried out using R by gplots, lattice, MASS, ggplot2, hash and survival packages (https://www.R-project.org/)[44].

Differential expression analysis

Differentially expressed lncRNAs and mRNAs were identified using paired t-test (Fold Change ≥2.0 or ≤0.5, p < 0.05 and FDR < 0.05). The microarray data have been uploaded in NCBI Gene Expression Omnibus (GEO) and the GEO accession number is GSE85716. Red indicates high expression and green indicates low expression in tumor tissues.

lncRNA co-expression analysis

We evaluated potential co-expression between lncRNAs and mRNAs using Pearson Correlation. A positive correlation between a lncRNA and a mRNA was defined as a Pearson Correlation greater than 0.7 and a p-value less than 0.05. Hypergeometric cumulative distribution function was used to calculate the enrichment of co-expressed mRNAs. The False Discovery rate was determined using the method as previously described. The ontology of co-expressed genes was categorized by gene annotation and summary information obtained from DAVID database[12]. Annotations of the lncRNAs co-expressed mRNAs were determined using GO analysis on cellular component, molecular function, biological processes and specific pathways.

Prediction of lncRNAs function

We searched for an lncRNA co-expressed genes within a 300 kb window of each lncRNA in the top 200 up-regulated and downregulated lncRNAs (P < 0.05=. The co-expressed genes on both sides of an lncRNA were defined as potentially ‘cis’ regulated genes by a given lncRNA. To examine which genes were potentially ‘trans’ regulated by lncRNAs we determined which TF might interact with the lncRNAs of interest using Jemboss software. TF target gene sets were obtained from Encyclopedia of DNA Elements (ENCODE). Hyper geometric distribution was used to identify the overlap of TFs target genes and co-expressed genes of lncRNAs. p-value was used to measure the enrichment of differentially expressed genes in the term. The TF and lncRNAs relationship networks were drawn using Cytoscape software. The TFs survival analysis was performed using the RNA-Seq and survival data extracted from the TCGA database (https://portal.gdc.cancer.gov/). Supplymentary information Dataset 1 Dataset 2 Dataset 3 Dataset 4 Dataset 5 Dataset 6 Dataset 7 Dataset 8
  42 in total

Review 1.  Long non-coding RNAs: a new frontier in the study of human diseases.

Authors:  Xuefei Shi; Ming Sun; Hongbing Liu; Yanwen Yao; Yong Song
Journal:  Cancer Lett       Date:  2013-06-18       Impact factor: 8.679

2.  Linkage and sequence analysis indicate that CCBE1 is mutated in recessively inherited generalised lymphatic dysplasia.

Authors:  Fiona Connell; Kamini Kalidas; Pia Ostergaard; Glen Brice; Tessa Homfray; Lesley Roberts; David J Bunyan; Sally Mitton; Sahar Mansour; Peter Mortimer; Steve Jeffery
Journal:  Hum Genet       Date:  2009-11-13       Impact factor: 4.132

Review 3.  Dynamic modeling and analysis of cancer cellular network motifs.

Authors:  Mathieu Cloutier; Edwin Wang
Journal:  Integr Biol (Camb)       Date:  2011-06-15       Impact factor: 2.192

Review 4.  Modular regulatory principles of large non-coding RNAs.

Authors:  Mitchell Guttman; John L Rinn
Journal:  Nature       Date:  2012-02-15       Impact factor: 49.962

5.  Long non-coding RNA PVT1 as a novel biomarker for diagnosis and prognosis of non-small cell lung cancer.

Authors:  Di Cui; Cai-Hua Yu; Min Liu; Qing-Qing Xia; Yu-Feng Zhang; Wei-Long Jiang
Journal:  Tumour Biol       Date:  2015-10-21

6.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals.

Authors:  Mitchell Guttman; Ido Amit; Manuel Garber; Courtney French; Michael F Lin; David Feldser; Maite Huarte; Or Zuk; Bryce W Carey; John P Cassady; Moran N Cabili; Rudolf Jaenisch; Tarjei S Mikkelsen; Tyler Jacks; Nir Hacohen; Bradley E Bernstein; Manolis Kellis; Aviv Regev; John L Rinn; Eric S Lander
Journal:  Nature       Date:  2009-02-01       Impact factor: 49.962

7.  Knockdown of EHF inhibited the proliferation, invasion and tumorigenesis of ovarian cancer cells.

Authors:  Zhongping Cheng; Jing Guo; Li Chen; Ning Luo; Weihong Yang; Xiaoyan Qu
Journal:  Mol Carcinog       Date:  2015-08-10       Impact factor: 4.784

8.  The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.

Authors:  Thomas Derrien; Rory Johnson; Giovanni Bussotti; Andrea Tanzer; Sarah Djebali; Hagen Tilgner; Gregory Guernec; David Martin; Angelika Merkel; David G Knowles; Julien Lagarde; Lavanya Veeravalli; Xiaoan Ruan; Yijun Ruan; Timo Lassmann; Piero Carninci; James B Brown; Leonard Lipovich; Jose M Gonzalez; Mark Thomas; Carrie A Davis; Ramin Shiekhattar; Thomas R Gingeras; Tim J Hubbard; Cedric Notredame; Jennifer Harrow; Roderic Guigó
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

Review 9.  The non-coding RNAs of the H19-IGF2 imprinted loci: a focus on biological roles and therapeutic potential in Lung Cancer.

Authors:  Imad J Matouk; David Halle; Michal Gilon; Abraham Hochberg
Journal:  J Transl Med       Date:  2015-04-09       Impact factor: 5.531

10.  FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model.

Authors:  Xing Chen; Yu-An Huang; Xue-Song Wang; Zhu-Hong You; Keith C C Chan
Journal:  Oncotarget       Date:  2016-07-19
View more
  12 in total

1.  Heat shock protein B8 promotes proliferation and migration in lung adenocarcinoma A549 cells by maintaining mitochondrial function.

Authors:  Ling-Ling Yu; Yuan Wang; Zu-Ke Xiao; Sheng-Song Chen
Journal:  Mol Cell Biochem       Date:  2020-09-14       Impact factor: 3.396

Review 2.  Genome-wide analysis reveals the emerging roles of long non-coding RNAs in cancer.

Authors:  Xiaoxia Ren
Journal:  Oncol Lett       Date:  2019-11-22       Impact factor: 2.967

3.  Comprehensive Analysis of Candidate Diagnostic and Prognostic Biomarkers Associated with Lung Adenocarcinoma.

Authors:  Jingyuan Li; Xingyuan Liu; Zan Cui; Guanying Han
Journal:  Med Sci Monit       Date:  2020-06-24

4.  The long noncoding RNA LINC00312 induces lung adenocarcinoma migration and vasculogenic mimicry through directly binding YBX1.

Authors:  Zhenzi Peng; Jun Wang; Bin Shan; Bin Li; Wei Peng; Yeping Dong; Wenwen Shi; Wenyuan Zhao; Dan He; Minghao Duan; Yuanda Cheng; Chunfang Zhang; Chaojun Duan
Journal:  Mol Cancer       Date:  2018-11-23       Impact factor: 27.401

5.  Microarray expression profile of long non-coding RNAs in human lung adenocarcinoma.

Authors:  Zhiyi Yang; Hongli Li; Zhaoyan Wang; Yuling Yang; Jie Niu; Yuanyuan Liu; Zhiliang Sun; Chonggao Yin
Journal:  Thorac Cancer       Date:  2018-08-27       Impact factor: 3.500

6.  LINC81507 act as a competing endogenous RNA of miR-199b-5p to facilitate NSCLC proliferation and metastasis via regulating the CAV1/STAT3 pathway.

Authors:  Wei Peng; Dan He; Bin Shan; Jun Wang; Wenwen Shi; Wenyuan Zhao; Zhenzi Peng; Qingxi Luo; Minghao Duan; Bin Li; Yuanda Cheng; Yeping Dong; Faqing Tang; Chunfang Zhang; Chaojun Duan
Journal:  Cell Death Dis       Date:  2019-07-11       Impact factor: 8.469

7.  Prognostic Signature for Lung Adenocarcinoma Patients Based on Cell-Cycle-Related Genes.

Authors:  Wei Jiang; Jiameng Xu; Zirui Liao; Guangbin Li; Chengpeng Zhang; Yu Feng
Journal:  Front Cell Dev Biol       Date:  2021-03-18

Review 8.  Mechanistic Insights Into the Interaction Between Transcription Factors and Epigenetic Modifications and the Contribution to the Development of Obesity.

Authors:  Qi Huang; Chaoyang Ma; Li Chen; Dan Luo; Rui Chen; Fengxia Liang
Journal:  Front Endocrinol (Lausanne)       Date:  2018-07-06       Impact factor: 5.555

9.  Uncovering association networks through an eQTL analysis involving human miRNAs and lincRNAs.

Authors:  Paulo R Branco; Gilderlanio S de Araújo; Júnior Barrera; Guilherme Suarez-Kurtz; Sandro José de Souza
Journal:  Sci Rep       Date:  2018-10-09       Impact factor: 4.379

Review 10.  Long non-coding RNAs: How to regulate the metastasis of non-small-cell lung cancer.

Authors:  Cheng Fang; Lixin Wang; Chenyuan Gong; Wenbin Wu; Chao Yao; Shiguo Zhu
Journal:  J Cell Mol Med       Date:  2020-02-12       Impact factor: 5.310

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.