Literature DB >> 31179185

Functional analysis of lncRNAs based on competitive endogenous RNA in tongue squamous cell carcinoma.

Yidan Song1, Yihua Pan1, Jun Liu1.   

Abstract

BACKROUND: Tongue squamous cell carcinoma (TSCC) is the most common malignant tumor in the oral cavity. An increasing number of studies have suggested that long noncoding RNA (lncRNA) plays an important role in the biological process of disease and is closely related to the occurrence and development of disease, including TSCC. Although many lncRNAs have been discovered, there remains a lack of research on the function and mechanism of lncRNAs. To better understand the clinical role and biological function of lncRNAs in TSCC, we conducted this study.
METHODS: In this study, 162 tongue samples, including 147 TSCC samples and 15 normal control samples, were investigated and downloaded from The Cancer Genome Atlas (TCGA). We constructed a competitive endogenous RNA (ceRNA) regulatory network. Then, we investigated two lncRNAs as key lncRNAs using Kaplan-Meier curve analysis and constructed a key lncRNA-miRNA-mRNA subnetwork. Furthermore, gene set enrichment analysis (GSEA) was carried out on mRNAs in the subnetwork after multivariate survival analysis of the Cox proportional hazards regression model.
RESULTS: The ceRNA regulatory network consists of six differentially expressed miRNAs (DEmiRNAs), 29 differentially expressed lncRNAs (DElncRNAs) and six differentially expressed mRNAs (DEmRNAs). Kaplan-Meier curve analysis of lncRNAs in the TSCC ceRNA regulatory network showed that only two lncRNAs, including LINC00261 and PART1, are correlated with the total survival time of TSCC patients. After we constructed the key lncRNA-miRNA -RNA sub network, the GSEA results showed that key lncRNA are mainly related to cytokines and the immune system. High expression levels of LINC00261 indicate a poor prognosis, while a high expression level of PART1 indicates a better prognosis.

Entities:  

Keywords:  TSCC; ceRNA; lncRNA

Year:  2019        PMID: 31179185      PMCID: PMC6544013          DOI: 10.7717/peerj.6991

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


Introduction

Tongue squamous cell carcinoma (TSCC) is one of the most lethal malignancies of oral cancers, and it is the most common oral cancer (Mannelli et al., 2018; Kamali et al., 2017). TSCC is highly invasive and prone to recurrence and early metastasis, often endangering the lives of patients, and treatment is more difficult. Therefore, exploring the molecular mechanism of the occurrence and development of TSCC, studying the molecular markers of TSCC and finding effective molecular therapeutic targets play an important role in improving the survival rate of patients with TSCC. In recent years, a regulatory network composed of long noncoding RNAs (lncRNAs), microRNAs (miRNAs) and messenger RNAs (mRNAs) has gained interest in the study of molecular biological mechanisms involved in the process of tumor occurrence and progression. lncRNA is a noncoding RNA with a length greater than 200 nucleotides. Studies have shown that lncRNA plays an important role in many physiological activities, such as the dosage compensation effect, epigenetic regulation, cell cycle regulation and cell differentiation regulation (Peng, Koirala & Mo, 2017; Koch, 2017). In recent years, increasing evidence has shown that lncRNA has important potential application prospects in the treatment and diagnosis of malignant tumors and other human diseases (Fan et al., 2018; Zhang et al., 2018c). miRNA is a kind of endogenous small RNA with a length of approximately 20–24 nucleotides that plays a variety of important regulatory roles in cells. miRNA is involved in the regulation of gene expression after transcription in cells. It can act on RNA and render it untranslatable, which leads to the silencing of corresponding genes (Ye et al., 2008). Researchers have shown that miRNA is directly related to many important physiological and biochemical processes and diseases, including tumor occurrence and metastasis (Wang, Hu & Liu, 2015; Amirkhah et al., 2015). Some RNAs, pseudogenes, long-chain noncoding RNA (lncRNA) and cyclic RNA (circRNA) contain some binding sites for miRNAs, which can competently bind to the same miRNAs through miRNA response elements (MREs), thus eliminating or alleviating the inhibition of miRNAs on target genes and regulating the expression of target genes. This mechanism is referred to as the competitive endogenous RNA (ceRNA) hypothesis (Qi et al., 2015). An increasing number of studies have shown that in the process of tumorigenesis, lncRNA can be used as ceRNA, which can inhibit or promote tumorigenesis by competitive binding of the same tumorigenic or antitumorigenic miRNAs (Karreth & Pandolfi, 2013). For example, a study showed that lncRNA Gas5 acts as a ceRNA to regulate PTEN expression by sponging miR-222-3p in papillary thyroid carcinoma (Zhang, Ye & Zhao, 2017). In addition, it has been showed that lncRNA SNHG1 can antagonize the effect of miR-145a-5p on the downregulation of NUAK1 expression in nasopharyngeal carcinoma cells (Lan & Liu, 2019). In order to diagnose and treat oral tumors more accurately, more and more scholars are studying the mechanism of lncRNA in the development of oral tumors. One study explored the role of lncRNA in OSCC by conducting a series of analyses of data downloaded from GEO databases and conducting related experiments to verify the conclusions. Finally, the authors found that lncRNA H1, as a ceRNA, can promote the development of tumors by inhibiting the expression of miR-138 and EZH2, which is helpful for the treatment of OSCC (Hong et al., 2018). In addition, some scholars have found that lncRNA OIP5-AS1 is closely related to undifferentiated oral tumors, which can regulate downstream target genes. Moreover, lncRNA OIP5-AS1 can be transactivated by stemness-associated transcription factors in cancer, resulting in poor prognosis (Arunkumar et al., 2018). There are an increasing number of studies on the construction of ceRNA networks for cancer research, such as gastric, lung, renal and kidney networks (Xia et al., 2014; Wang et al., 2018; Qu et al., 2016). However, until now, there has been nearly no research on lncRNAs and miRNAs related to TSCC based on high-throughput sequencing and a large-scale sample size. The purpose of this study was to analyze the expression of lncRNAs in TSCC and their correlation with clinical indicators based on the competitive endogenous RNA (ceRNA) theory and to explore the possible molecular mechanism of lncRNAs in TSCC by gene set enrichment analysis (GSEA). We aimed to provide new indicators for the early diagnosis and prognosis of TSCC and provide new targets for the treatment of TSCC.

Materials and Methods

Selection of datasets and data collection

Gene expression data of TSCC were downloaded from The Cancer Genome Atlas (TCGA) (https://gdc-portal.nci.nih.gov/). TCGA is a project co-supervised by the National Cancer Institute and the National Human Genome Research Institute. It aims to apply high-throughput genome analysis technology to help people have a better understanding of cancer and improve the ability to prevent, diagnose and treat cancer. As the largest cancer gene information database at present, TCGA not only contains many cancer types, but also contains abundant multigroup data. We downloaded RNA-Seq V2 of tongue samples from TCGA, and this data is raw count, which comes from 162 tongue samples. Meanwhile, the downloaded miRNAs-seq came from 169 tongue samples.

RNA sequence data processing

We downloaded all RNA sequences of 162 tongue samples from TCGA. In addition, we organized the downloaded data into gene expression matrix files and divided the 162 samples into 147 tumor samples and 15 normal control samples by the programming code written in Perl software. And we used the same method to convert the miRNA seq into miRNA matrix files and divided the 169 samples into 154 tumor samples and 15 normal control samples. And we also count the clinical data of the samples. The results are shown in the Table 1.
Table 1

Clinical variables in TSCC samples.

CovariatesTypeStat
fustatdead58(39.5%)
fustatalive89 (60.5%)
gendermale102(69.4%)
genderfemale45(30.6%)
age≤6569(46.9%)
age>6578(53.1%)
stageI13(8.8 %)
stageII23(15.6%)
stageIII32(21.8%)
stageIV73(49.7%)
stageNA6(4.1%)
In this study, we mainly used programming code written in Perl software and R language to analyze the RNA data.

Analysis of DEGs, DElncRNAs, and DEmiRNAs in TSCC

Ensembl (version 89; https://uswest.ensembl.org/info/website/archives/index.html) (Aken et al., 2016) is a software system capable of automatic annotation and maintenance of eukaryotic genomes. It is co-operated by the Wellcome Foundation of Sanger Institute and the European Institute of Bioinformatics, a division of the European Molecular Biology Laboratory. We used the Ensembl database to screen for RNA and lncRNA and exclude the RNA and lncRNA that do not exist in the database. Before conducting differential expression analysis, we processed the data and deleted outliers samples. Using “WGCNA” package in R software (R Core Team, 2018), clustering analysis was carried out on the expression data of mRNA, lncRNA and miRNA respectively, and outlier samples were deleted, the results are shown in supplemental files. Next, we obtained differentially expressed mRNAs (DEmRNAs), lncRNAs (DElncRNAs) and miRNAs (DEmiRNAs) using the “edgeR” package in R software. The P-value was obtained using the false discovery rate (FDR) to correct the statistical significance of multiple tests. We specified fold changes (log2 absolute) > 2 and an FDR adjusted to P < 0.01 as statistically significant.

Construction of a ceRNA regulatory network

miRcode database is a software to predict microRNA targets. Currently, it covers a complete set of transcripts annotated by GENCODE, including 10,419 registered lncRNAs. When predicting the binding sites of miRNAs, we used the miRNA family defined by Target Scan v6. The miRNAs under the same family have the same binding site type. For the predicted region of the binding sites of miRNAs, the results are filtered and screened according to the gene type, the conservativeness of binding sites and the distribution of transcripts. First, the DElncRNAs and DEmiRNAs were matched in the miRcode database (Jeggari, Marks & Larsson, 2012). Unmatched lncRNAs and miRNAs were deleted. StarBase is used to annotate the suffix of miRNA. miRTarBase (Chou et al., 2016), miRDB (Wong & Wang, 2015) and TargetScan (Fromm et al., 2015) databases are all websites that can predict miRNA target genes. In order to make the results more accurate, we used the three databases to predict the target genes of DEmiRNAs. After that, the “Venn Diagram” package in R software was used to screen the target genes of DEmiRNAs by crossing the differentially expressed genes with the target genes, in order to prepare for the construction of ceRNA network. In order to measure the binding, a correlation analysis is necessary. We used the code in R software to detect the correlation between miRNAs and mRNAs, and between miRNAs and lncRNAs. P < 0.05 was considered statistically significant. The results of correlation analysis were provided in the supplemental files. Then, we obtained the ceRNA regulatory network of lncRNA-miRNA-mRNA and mapped it with Cytoscape software (http://www.cytoscape.org/).

Survival analysis

We used the “survival” package in R software for survival analysis of all RNAs in the ceRNA network. The expression of lncRNAs, mRNAs and miRNAs correlated with overall survival was identified by the univariate Cox proportional hazards regression model. The expression of RNAs was sequenced from low to high. The first 50% of the samples were used as the low expression group, and the second 50% as the high expression group. Then, we used the log-rank test to compare the significant differences in the univariate analysis between subgroups for the overall survival rates. P < 0.05 was considered statistically significant.

Construction of the key lncRNA–miRNA–mRNA subnetwork

We used lncRNAs after survival analysis as key lncRNAs and extracted their linked mRNAs and miRNAs to reconstruct the subnetwork in Cytoscape software.

Multivariate survival analysis of cox proportional risk regression model

The Cox proportional hazards regression model plays a key role in the clinical diagnosis of cancer and can be used to detect the risk of key genes. We first downloaded gene expression data from TCGA and then merged the data with Perl script to obtain relevant survival information. Then, mRNAs in the key lncRNA-miRNA-mRNA subnetwork were regarded as key genes. Next, we performed multivariate survival analysis of the key genes using the “survival” package in R software, and the risk level of key genes was obtained. We provided the multi-variate analysis in the Supplemental Files.

Gene Set Enrichment Analysis (GSEA)

GSEA connects expression profile chip data with biological significance through statistical analysis, which enables researchers to interpret chip results more easily and reasonably. To further explore the role of the ceRNA regulatory network in the occurrence and development of TSCC, after multivariate analysis, the key genes were divided into two groups according to the risk level as follows: h represents the high gene expression group and l represents the low gene expression group. Then, the results were imported into the GSEA software. This analysis used two gene sets as follows: the KEGG gene set and the gene ontology gene set, including biological processes, cellular components and molecular functions. FDRs (false discovery rates) <25% for the gene sets indicated significantly enriched gene sets using the GSEA software. In the GSEA results, the gene set with FDR < 25% is the enrichment set by default. Because it is possible to generate meaningful hypotheses from these functional genes to facilitate further research. In most cases, the choice of FDR less than 25% is appropriate.

Results

DERNAs between primary tumor and control samples

Before the differential expression analysis, the clustering analysis were performed to check the outlier samples. Finally, one cancer sample was deleted from the mRNA data, four control samples were deleted from the lncRNA data, and one control sample was deleted from the miRNA data, the results are shown in Supplemental Files. Based on the data downloaded from TCGA, using a logFC >2 and an FDR <0.01 as the cut-off criterion, we screened 1,590 DEmRNAs (Fig. 1), including 608 upregulated mRNAs and 982 downregulated mRNAs. Similarly, 627 DElncRNAs (Fig. 1) and 75 DEmiRNAs (Fig. 1) were obtained, including 401 upregulated and 226 downregulated lncRNAs and 41 upregulated and 34 downregulated miRNAs. Tables 2–4 show the first 25 upregulated RNAs and 25 downregulated RNAs and their corresponding logFC values, P-values, and FDR values. The complete list of DERNAs is provided in supplemental files.
Figure 1

Heat maps of DERNAs.

(A) Heat map of DEmRNAs. (B) Heat map of DElncRNAs. (C) Heat map of DEmiRNAs. The red and green colors indicate higher expression levels and lower expression levels, respectively. The rows represent the DERNAs and the columns represent the samples.

Table 2

Top 25 differentially expressed mRNAs in TSCC samples.

Top 25 up-regulated mRNAsTop 25 down-regulated mRNAs
mRNADescriptionlogFClogCPMPValueFDRmRNADescriptionlogFClogCPMPValueFDR
HOXC8homeobox C85.2837270.3129364.53E–192.79E–17PRB3proline rich protein BstNI subfamily 3−9.703884.8723747.81E–981.40E–93
BMP1bone morphogenetic protein 12.1980726.6386353.16E—181.81E–16PRH1proline rich protein HaeIII subfamily 1−7.722762.5602883.04E–942.72E–90
MFAP2microfibril associated protein 23.1888616.0420715.08E–182.81E–16LCN1lipocalin 1−11.39978.0894833.38E–792.02E–75
PLAUplasminogen activator, urokinase3.0058718.327128.47E–184.60E–16PRR4proline rich 4−8.315665.8658822.35E–761.05E–72
C1QTNF6C1q and TNF related 63.2648845.6732754.53E–172.27E–15OMGoligodendrocyte myelin glycoprotein−4.8649−0.113531.75E–736.28E–70
CD276CD276 molecule2.093347.1654637.14E–173.48E–15RSPH4Aradial spoke head component 4A−5.170190.1794781.19E–653.57E–62
SERPINH1serpin family H member 12.5114468.511677.17E–173.48E–15KRT85keratin 85−7.193770.5801092.80E–627.16E–59
COL13A1serpin family H member 12.5291882.4746651.74E–168.05E–15RSPH1radial spoke head component 1−5.363050.8502171.37E–613.06E–58
HOXC6homeobox C64.4610241.2266072.93E–161.34E–14VWA3Bvon Willebrand factor A domain containing 3B−4.966490.3213751.12E–592.24E–56
TGFBItransforming growth factor beta induced3.85130810.078351.11E–154.78E–14ZG16Bzymogen granule protein 16B−8.050157.9963961.76E–573.16E–54
HOXC4homeobox C43.7776351.1895561.40E–156.00E–14C6orf58chromosome 6 open reading frame 58−8.421694.314571.14E–561.86E–53
HSD17B6hydroxysteroid 17-beta dehydrogenase 62.4054521.2779055.23E–152.07E–13ZMYND10zinc finger MYND-type containing 10−5.11361.2088376.75E–541.01E–50
LAMC2laminin subunit gamma 23.83681510.614565.85E–152.29E–13DNAH9dynein axonemal heavy chain 9−4.739831.3436832.57E–533.54E–50
GRIN2Dglutamate ionotropic receptor NMDA type subunit 2D4.3310323.5466289.64E–153.69E–13C9orf24chromosome 9 open reading frame 24−4.84518−0.090584.11E–535.26E–50
PDPNpodoplanin2.6953337.364532.52E–149.09E–13CFAP43cilia and flagella associated protein 43−4.378670.3835934.26E–525.09E–49
IL11interleukin 113.8813453.00413.16E–141.12E–12ECT2Lepithelial cell transforming 2 like−4.08998−1.005662.97E–513.33E–48
CA9carbonic anhydrase 95.9629565.0206823.36E–141.18E–12PIFOprimary cilia formation−4.332820.8345539.28E–519.51E–48
P3H1prolyl 3-hydroxylase 12.1821975.662454.70E–141.63E–12FAM166Bfamily with sequence similarity 166 member B−3.57818−0.506849.55E–519.51E–48
LBX2ladybird homeobox 22.707215−0.432991.27E–134.15E–12LRRC46leucine rich repeat containing 46−4.076710.7961687.59E–507.16E–47
ZNF114zinc finger protein 1143.9904542.2371091.33E–134.32E–12TEKT1tektin 1−5.72580.0949674.01E–493.59E–46
SNX10sorting nexin 102.1391524.0119671.34E–134.32E–12WDR38WD repeat domain 38−5.66533−0.239151.25E–481.07E–45
DNMT3BDNA methyltransferase 3 beta2.40993.4450321.42E–134.57E–12PRB4proline rich protein BstNI subfamily 4−9.41497−0.082093.81E–473.10E–44
TMEM132Atransmembrane protein 132A2.0711827.0534221.75E–135.51E–12EFHBEF-hand domain family member B−4.48575−0.845045.50E–474.28E–44
HOXC9homeobox C93.9086230.6660572.46E–137.54E–12ERICH3glutamate rich 3−6.914520.2135281.66E–461.24E–43
ARTNartemin3.2780993.3464972.76E–138.34E–12LPOlactoperoxidase−6.764341.465613.06E–462.20E–43
Table 4

Top 25 differentially expressed miRNAs in TSCC samples.

Top 25 up-regulated miRNAsTop 25 down-regulated miRNAs
miRNAlogFClogCPMPValueFDRmiRNAlogFClogCPMPValueFDR
hsa-mir-6155.7307991.4784396.70E–172.22E–15hsa-mir-299−2.997713.8639241.11E–326.98E–30
hsa-mir-4552.5389418.8449845.06E–151.38E–13hsa-mir-135a-2−5.113780.1878233.80E–321.19E–29
hsa-mir-196b4.3947137.0971621.72E–133.72E–12hsa-mir-381−3.592267.8480884.55E–319.53E–29
hsa-mir-196a-24.9703255.1322452.79E–125.17E–11hsa-mir-135a-1−4.56572−0.168311.59E–282.51E–26
hsa-mir-5033.1846723.7769451.55E–112.57E–10hsa-mir-30a−2.1002613.639233.18E–234.01E–21
hsa-mir-196a-14.9068094.9809261.94E–112.97E–10hsa-mir-885−4.103731.6504394.81E–225.04E–20
hsa-mir-2242.1417377.1219619.49E–101.15E–08hsa-mir-378c−2.495894.0566659.94E–228.93E–20
hsa-mir-450a-12.0453142.3943191.39E–091.62E–08hsa-mir-411−2.725644.4015951.26E–219.90E–20
hsa-mir-8772.4356121.5849912.04E–092.33E–08hsa-mir-34c−2.758735.7222923.57E–212.50E–19
hsa-mir-19103.996050.6588712.55E–092.82E–08hsa-mir-375−3.9609911.453322.19E–191.15E–17
hsa-mir-301a2.094083.7467883.37E–083.17E–07hsa-mir-378a−2.2499710.817576.76E–193.04E–17
hsa-mir-46524.6677141.654686.24E–085.45E–07hsa-mir-376c−2.273533.5012451.03E–184.31E–17
hsa-mir-13053.3640840.5615891.34E–071.11E–06hsa-mir-495−2.180823.3247043.04E–181.19E–16
hsa-mir-2102.33144910.024083.90E–072.89E–06hsa-mir-486-2−2.339257.1900989.14E–183.35E–16
hsa-mir-77052.1027990.655856.32E–074.51E–06hsa-mir-486-1−2.354317.2051339.59E–183.35E–16
hsa-mir-9372.5494172.1381078.21E–075.80E–06hsa-mir-29c−2.0837211.166761.91E–166.01E–15
hsa-mir-78542.596544−0.022231.05E–067.28E–06hsa-mir-378d-2−2.580360.4009492.85E–168.52E–15
hsa-mir-39402.1883970.7897221.10E–067.53E–06hsa-mir-34b−2.525333.097593.37E–159.63E–14
hsa-mir-301b2.4098461.0032521.36E–069.09E–06hsa-mir-337−2.109215.0589.56E–152.51E–13
hsa-mir-5452.274835−0.210261.89E–061.22E–05hsa-mir-379−2.0183610.572291.61E–144.06E–13
hsa-mir-12933.0205344.2963482.21E–061.39E–05hsa-mir-1258−2.55350.1595422.30E–145.57E–13
hsa-mir-312.6201647.4368083.09E–061.85E–05hsa-mir-410−2.264374.4668493.02E–147.04E–13
hsa-mir-47132.832803−0.297419.36E–065.26E–05hsa-mir-378d-1−2.554160.1462159.49E–121.66E–10
hsa-mir-46583.809692−0.471281.54E–058.33E–05hsa-mir-499a−2.975422.1031452.54E–113.80E–10
hsa-mir-548f-14.2849690.0728041.87E–059.87E–05hsa-mir-99a−2.019389.0033635.18E–117.24E–10

Heat maps of DERNAs.

(A) Heat map of DEmRNAs. (B) Heat map of DElncRNAs. (C) Heat map of DEmiRNAs. The red and green colors indicate higher expression levels and lower expression levels, respectively. The rows represent the DERNAs and the columns represent the samples.

Construction of the ceRNA regulatory network

To better understand the role of DElncRNAs in TSCC and further elaborate on the interaction between DElncRNAs and DEmiRNAs, a ceRNA regulatory network map related to lncRNA-miRNA-mRNA was constructed using 627 DElncRNAs and 75 DEmiRNAs obtained from the abovementioned studies. In the ceRNA regulatory network of TSCC, the interaction between 627 DEIncRNAs and 75 DERNAs was predicted based on human gene data in the downloaded microcode database using the Perl program. A total of 45 pairs of interacting lncRNAs and microRNAs were obtained. According to the microTarBase, TargetScan and microRNADB databases, 99 DEmiRNAs targeting mRNAs were searched. Then, eight differentially expressed and targeted mRNAs were screened out by comparing the targeted mRNAs with TSCC DEmRNAs from TCGA. Cytoscape was used to construct a ceRNA regulatory network including six DEmiRNAs, 29 DElncRNAs and six DEmRNAs in TSCC based on the abovementioned RNAs (Fig. 2).
Figure 2

DElncRNA—mediated ceRNA regulatory network in tongue squamous cellcarcinoma.

The nodes highlighted in red indicate downregulated expression, and the nodes highlighted in green indicate upregulated expression. IncRNAs, miRNAs and mRNAs are represented by diamonds, rounded rectangles, and ellipses, respectively.

DElncRNA—mediated ceRNA regulatory network in tongue squamous cellcarcinoma.

The nodes highlighted in red indicate downregulated expression, and the nodes highlighted in green indicate upregulated expression. IncRNAs, miRNAs and mRNAs are represented by diamonds, rounded rectangles, and ellipses, respectively.

Correlation analysis of survival in the ceRNA network

To further determine whether the expression of genes in the regulatory network is related to the prognosis of TSCC patients, Kaplan–Meier curve analysis of lncRNAs in the TSCC ceRNA regulatory network showed that only two lncRNAs, including LINC00261 and PART1, were correlated with the overall survival time of TSCC patients (P < 0.05), as shown in Fig. 3. Moreover, the expression levels of LINC00261 were negatively correlated with the overall survival rate, while the expression levels of PART1 were positively correlated with the overall survival rate.
Figure 3

Kaplan-Meiercurve analysis of DElncRNA (A: LINC00261 and B: PART1) inTSCC patients.

Blue represents low expression and red represents high expression.

Kaplan-Meiercurve analysis of DElncRNA (A: LINC00261 and B: PART1) inTSCC patients.

Blue represents low expression and red represents high expression.

The key lncRNA-miRNA-mRNA sub-network

LncRNAs and mRNAs appear in the coexpression patterns of ceRNA. Thus, we used two lncRNAs, including LINC00261 and PART1, as the key lncRNAs. We extracted their linked miRNAs and mRNAs and reconstructed a new subnetwork using Cytoscape software. As shown in Fig. 4, The lncRNA PART1-miRNA-mRNA subnetwork was composed of one lncRNA node, three miRNA nodes, and four mRNA nodes.
Figure 4

The subnetwork of PART1.

The nodes highlighted in red indicate downregulated expression, and the nodes highlighted in green indicate upregulated expression. IncRNAs, miRNAs and mRNAs are represented by diamonds, rounded rectangles, and ellipses, respectively.

The subnetwork of PART1.

The nodes highlighted in red indicate downregulated expression, and the nodes highlighted in green indicate upregulated expression. IncRNAs, miRNAs and mRNAs are represented by diamonds, rounded rectangles, and ellipses, respectively.

Functional prediction of lncRNAs based on the key lncRNA-miRNA-mRNA subnetwork

In the network, one or more mRNAs are centered on a lncRNA and connected with it. We can better speculate the function of lncRNA according to the function of the mRNA linked to the lncRNA. Therefore, after multifactor analysis of the mRNAs in the constructed subnetwork, the subnetwork was divided into a high-expression group and a low-expression group according to the risk value, and GSEA analysis was then carried out to explore the biological pathways involved in lncRNA. The GSEA results showed that in the high-expression group of PART1(Fig. 5), six significantly enriched gene sets were screened, including one leukocyte migration gene set, two leukocyte chemotaxis gene sets, one immune system process gene set, one innate immune response gene set and one tyrosine peptide phosphorylation gene set.
Figure 5

Gene setenrichment analysis of the subnetwork.

Gene set enrichment analysis (GSEA) of mRNAs in the subnetwork demonstrated that PART1 is enriched in genes that participate in leukocyte chemotaxis (A), immune system process (B) and peptidyl tyrosine phosphorylation (C).

Gene setenrichment analysis of the subnetwork.

Gene set enrichment analysis (GSEA) of mRNAs in the subnetwork demonstrated that PART1 is enriched in genes that participate in leukocyte chemotaxis (A), immune system process (B) and peptidyl tyrosine phosphorylation (C).

Discussion

In recent decades, TSCC is the most common oral squamous cell carcinoma, and its treatment is difficult. Currently, basic research and clinical treatment of tongue cancer are the focuses of head and neck cancer surgery. The overall clinical treatment of tongue cancer has improved in recent decades. The 5-year survival rate is 50–60% (Sano & Myers, 2007; Torre et al., 2015), but it has been stagnant. Neoadjuvant therapies, such as biotherapy and targeted therapy, have been gradually applied in clinical practice. It has been shown that ceRNAs alter gene expression through a mechanism mediated by microRNAs, thereby affecting cell function, which may lead to cancer (Qi et al., 2015). However, only a few studies have been performed on ceRNA in TSCC. More than 10,000 kinds of lncRNAs have been shown to have potential ceRNA characteristics (Tay, Rinn & Pandolfi, 2014). In recent years, an increasing number of studies have confirmed that lncRNAs, as a competitive platform for miRNAs and mRNAs, is involved in the regulation of the cell cycle and cell death in various malignant tumors, such as breast, gastric, liver, lung and esophageal cancers (Luan et al., 2017; Zhang et al., 2018a; Wang et al., 2017a; Jiao et al., 2016). LncRNA effects the invasion and metastasis of tumors, thus playing an important role in the occurrence and development of tumors. In the study of TSCC, many lncRNAs have been shown to be associated with the occurrence, development and prognosis of TSCC. In a study published in 2018, a high level of lncRNA KCNQ1OT1 expression was shown to be closely correlated with poor prognosis in TSCC. Furthermore, KCNQ1OT1 was shown to promote TSCC proliferation (Zhang et al., 2018b). Another study showed that lncRNA MALAT1 expression is upregulated in TSCC tissues by investigating the expression and function of MALAT1 in TSCC tissues and cells, and MALAT1 was shown to be related to cervical lymph node metastasis. Finally, MALAT1 was shown to induce cell migration, invasion and EMT and inhibit apoptosis by modulating the Wnt/ β-catenin signaling pathway (Liang et al., 2017). These results may provide new insights for the treatment of TSCC. Therefore, to study the functional roles of lncRNAs as ceRNA in the occurrence and development of TSCC and the regulatory mechanism of lncRNAs, we used data downloaded from TCGA to create a lncRNA-miRNA-mRNA network and carried out a series of analyses, which is very relevant for the treatment of TSCC. In our study, two lncRNAs, including LINC00261 and PART1, were shown to be closely related to overall survival. Patients with high expression levels of LINC00261 had a poor prognosis, while PART1 had the opposite effect. The results of other studies are different from this study. A study (Yu et al., 2017) showed that patients with low expression levels of LINC00261 had a higher recurrence rate than those with high expression levels of LINC00261 in gastric cancer, suggesting that patients with low expression levels of LINC00261 have a poor prognosis. We hypothesize that the results are different because of the small number of samples in the present study. Currently, there are few studies on the two kinds of key lncRNAs in TSCC, but many scholars have indicated that these key lncRNAs have an impact on the occurrence and development of other diseases. LINC00261 plays an important role in gastric, lung and colon cancers (Liu, Xiao & Xu, 2017; Wang et al., 2017c). Many studies have proven that LINC00261 can inhibit cell proliferation and invasion, thus promoting the development of cancer (Sha et al., 2017; Wang et al., 2017b). There are fewer studies on PART1 than LNC00261, and only a few studies have investigated its role in prostate cancer and esophageal squamous cell carcinoma (Sun et al., 2018; Kang et al., 2018). A 2018 study showed that lncRNA PART1 modulates the toll-like receptor pathway to influence cell proliferation and apoptosis in prostate cancer cells. PART1 can be used as a novel biomarker and target in the treatment of prostate cancer (Sun et al., 2018). The results of enrichment analysis of the lncRNA-miRNA-mRNA subnetwork showed that PART1 is mainly related to cytokines and the immune system and could regulate the immune response and migration of cytokines, thus affecting the occurrence and development of tumors. However, little research has been performed on the effects of these two lncRNAs on immunity and cytokines.

Conclusions

Based on the ceRNA theory, we constructed a ceRNA regulatory network, which for the first time enabled an overall perspective and analysis of the lncRNA-associated ceRNA-mediated genes in the development of TSCC at a system-wide level. Our research showed that lncRNAs affect the development of TSCC. Furthermore, the constructed key lncRNA-miRNA-mRNA subnetwork also showed that PART1 can provide new indicators for the early diagnosis and prognosis of TSCC and provide new targets for its treatment. However, the molecular mechanism of the two lncRNAs in TSCC remains unclear. Although the GSEA results provided some research directions, further experiments are needed to verify it.

Clustering analysis of RNAs

One cancer sample was deleted from the mRNA data, four control samples were deleted from the lncRNA data, and one control sample was deleted from the miRNA data. Click here for additional data file.

Correlation analysis of RNAs

A relationship of miRNA-mRNA and miRNA-lncRNA with a P value <0.05 is considered significant. Click here for additional data file.

Expression value of RNAs

Expression value of mRNAs, lncRNAs and miRNAs in HNSCC samples. Click here for additional data file.

Expression value of lncRNA in the ceRNA network

The rank of expression value of LINC00261 and PART1. Click here for additional data file.

Multi-variate analysis of PART1

Individual factors and multi-variate analysis of PART1 Click here for additional data file.

Raw data

The clinical data, RNAseq data and miRNA data downloaded from TCGA. Click here for additional data file.

Venn diagram of mRNAs involved in the ceRNA regulatory network

As shown in the figure, the number of mRNAs expressed in the red area is only the difference in expression. The blue area presents only the target number of mRNAs that are differentially expressed, while the purple area in the middle represents the number of mRNAs, which is both the differential expression and the target. Click here for additional data file.
Table 3

Top 25 differentially expressed lncRNAs in TSCC samples.

Top 25 up-regulated lncRNAsTop 25 down-regulated lncRNAs
lncRNADescriptionlogFClogCPMPValueFDRlncRNADescriptionlogFClogCPMPValueFDR
LINC02081S-phase cancer associated transcript 16.559078.2880381.91E–176.25E–15LINC01482Long intergenic non-protein coding RNA 1482−4.741875.1524611.46E–461.05E–42
HOXC-AS2HOXC cluster antisense RNA 24.7692576.8224291.98E–166.23E–14LINC01829long intergenic non-protein coding RNA 1829−6.365226.7235023.81E–391.38E–35
ZFPM2-AS1ZFPM2 antisense RNA 16.6652318.9954236.58E–161.83E–13AC027130.1Antisense−5.976614.4604281.08E–362.61E–33
GSECG-quadruplex forming sequence containing lncRNA3.1695518.7556992.53E–155.70E–13LINC02160long intergenic non-protein coding RNA 2160−6.549274.0192982.85E–355.15E–32
AL358334.2novel transcript5.3903878.4907213.08E–156.73E–13RMSTrhabdomyosarcoma 2 associated transcript−5.266883.8453942.79E–324.03E–29
AC114956.2novel transcript4.3102338.0047712.70E–145.13E–12AC010425.1novel transcript−5.485853.4766031.74E–262.10E–23
FOXD2-AS1adjacent opposite strand RNA 12.5830998.6601591.58E–132.60E–11LINC01798long intergenic non-protein coding RNA 1798−3.585624.3246268.21E–268.46E–23
AC114956.1novel transcript4.2150666.5157975.15E–137.91E–11LINC01797long intergenic non-protein coding RNA 1797−6.209783.3906371.23E–241.11E–21
HOXA10-ASHOXA10 antisense RNA5.7424755.8450171.66E–122.36E–10FOXCUTFOXC1 upstream transcript−4.077185.6438346.56E–245.26E–21
LINC02577long intergenic non-protein coding RNA 25776.0022439.0228751.04E–111.20E–09SALRNA1senescence associated long non-coding RNA 1−3.083555.2202631.34E–209.64E–18
LINC00460long intergenic non-protein coding RNA 4606.4957027.9742142.49E–112.68E–09AC005165.1uncharacterized LOC646588−4.487576.8729834.67E–203.06E–17
AC134312.5novel transcript4.4865357.8510424.56E–114.28E–09AL451062.1Antisense−4.067725.6876061.08E–196.49E–17
LBX2-AS1LBX2 antisense RNA 12.218219.5367116.89E–116.14E–09LINC00844long intergenic non-protein coding RNA 844−4.339443.6907382.61E–191.45E–16
AC005041.3novel transcript, antisense to LBX22.3062355.5504638.96E–117.79E–09LINC01405long intergenic non-protein coding RNA 1405−4.172118.6809842.83E–191.45E–16
LINC01633long intergenic non-protein coding RNA 16335.1830344.9127571.17E–109.92E–09AC116345.1novel transcript−3.553725.0325573.01E–191.45E–16
AL513123.1novel transcript4.390585.8962681.55E–101.26E–08AL359715.1novel transcript−2.789235.7064724.44E–192.00E–16
AL139147.1novel transcript, antisense to SGIP17.2618965.8739381.82E–101.44E–08AC009041.2novel transcript−2.814537.7846022.09E–188.87E–16
U62317.1uncharacterized LOC1053730983.30761210.067092.88E–102.19E–08NKAIN3-IT1NKAIN3 intronic transcript−4.269225.389213.62E–181.40E–15
HOTAIRHOX transcript antisense RNA4.5765176.2019583.63E–102.73E–08HSD52uncharacterized LOC729467−3.841833.8994363.69E–181.40E–15
LINC01615long intergenic non-protein coding RNA 16154.4112599.6312774.36E–103.21E–08LINC02343long intergenic non-protein coding RNA 2343−5.547293.4667086.86E–182.48E–15
SLC12A5-AS1SLC12A5 and MMP9 antisense RNA 13.9544356.1830394.91E–103.54E–08AL161668.4novel transcript−3.75744.7383857.66E–182.63E–15
AC113346.1novel transcript5.6843135.7924915.03E–103.60E–08SOX9-AS1SOX9 antisense RNA 1−2.540647.0264632.56E–167.70E–14
LINC00941long intergenic non-protein coding RNA 9414.330158.9134835.20E–103.68E–08ZNF710-AS1ZNF710 antisense RNA 1−2.324019.9707436.09E–161.76E–13
RNF144A-AS1RNF144A antisense RNA 13.3915817.8767587.03E–104.83E–08DRAICdownregulated RNA in cancer, inhibitor of cell invasion and migration−2.843975.0172427.74E–162.07E–13
LINC01614long intergenic non-protein coding RNA 16147.7714868.1666191.24E–098.26E–08AC002401.3novel transcript, antisense to PDK2−3.19333.7638921.10E–152.84E–13
  39 in total

1.  Identification of critical TF-miRNA-mRNA regulation loops for colorectal cancer metastasis.

Authors:  C Wang; D Z Hu; J Z Liu
Journal:  Genet Mol Res       Date:  2015-05-22

Review 2.  MicroRNA-mRNA interactions in colorectal cancer and their role in tumor progression.

Authors:  Raheleh Amirkhah; Ulf Schmitz; Michael Linnebacher; Olaf Wolkenhauer; Ali Farazmand
Journal:  Genes Chromosomes Cancer       Date:  2015-03       Impact factor: 5.006

Review 3.  ceRNA cross-talk in cancer: when ce-bling rivalries go awry.

Authors:  Florian A Karreth; Pier Paolo Pandolfi
Journal:  Cancer Discov       Date:  2013-09-26       Impact factor: 39.397

4.  Global cancer statistics, 2012.

Authors:  Lindsey A Torre; Freddie Bray; Rebecca L Siegel; Jacques Ferlay; Joannie Lortet-Tieulent; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2015-02-04       Impact factor: 508.702

Review 5.  Metastasis of squamous cell carcinoma of the oral tongue.

Authors:  Daisuke Sano; Jeffrey N Myers
Journal:  Cancer Metastasis Rev       Date:  2007-12       Impact factor: 9.264

Review 6.  The multilayered complexity of ceRNA crosstalk and competition.

Authors:  Yvonne Tay; John Rinn; Pier Paolo Pandolfi
Journal:  Nature       Date:  2014-01-16       Impact factor: 49.962

7.  miRcode: a map of putative microRNA target sites in the long non-coding transcriptome.

Authors:  Ashwini Jeggari; Debora S Marks; Erik Larsson
Journal:  Bioinformatics       Date:  2012-06-19       Impact factor: 6.937

8.  miRDB: an online resource for microRNA target prediction and functional annotations.

Authors:  Nathan Wong; Xiaowei Wang
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 16.971

9.  Long noncoding RNA associated-competing endogenous RNAs in gastric cancer.

Authors:  Tian Xia; Qi Liao; Xiaoming Jiang; Yongfu Shao; Bingxiu Xiao; Yang Xi; Junming Guo
Journal:  Sci Rep       Date:  2014-08-15       Impact factor: 4.379

10.  The effect of central loops in miRNA:MRE duplexes on the efficiency of miRNA-mediated gene regulation.

Authors:  Wenbin Ye; Qing Lv; Chung-Kwun Amy Wong; Sean Hu; Chao Fu; Zhong Hua; Guoping Cai; Guoxi Li; Burton B Yang; Yaou Zhang
Journal:  PLoS One       Date:  2008-03-05       Impact factor: 3.240

View more
  9 in total

Review 1.  Long non‑coding RNA PART1: dual role in cancer.

Authors:  Rui Ran; Chao-Yang Gong; Zhi-Qiang Wang; Wen-Ming Zhou; Shun-Bai Zhang; Yong-Qiang Shi; Chun-Wei Ma; Hai-Hong Zhang
Journal:  Hum Cell       Date:  2022-07-21       Impact factor: 4.374

2.  Long Intergenic Non-Protein Coding RNA 519 Promotes the Biological Activities of Tongue Squamous Cell Carcinoma by Sponging microRNA-876-3p and Consequently Upregulating MACC1.

Authors:  Dejun Liu; Jing Zhao; Huiling Wang; Hui Li; Yanjie Li; Wangsen Qin
Journal:  Onco Targets Ther       Date:  2020-11-20       Impact factor: 4.147

3.  LncRNA PART1 Exerts Tumor-Suppressive Functions in Tongue Squamous Cell Carcinoma via miR-503-5p.

Authors:  Xiqun Zhao; Yanqing Hong; Qingyuan Cheng; Lixin Guo
Journal:  Onco Targets Ther       Date:  2020-10-07       Impact factor: 4.147

Review 4.  Long non‑coding RNAs are novel players in oral inflammatory disorders, potentially premalignant oral epithelial lesions and oral squamous cell carcinoma (Review).

Authors:  Kaiying Zhang; Wei Qiu; Buling Wu; Fuchun Fang
Journal:  Int J Mol Med       Date:  2020-06-03       Impact factor: 4.101

5.  Copy Number Variation Pattern for Discriminating MACROD2 States of Colorectal Cancer Subtypes.

Authors:  ShiQi Zhang; XiaoYong Pan; Tao Zeng; Wei Guo; Zijun Gan; Yu-Hang Zhang; Lei Chen; YunHua Zhang; Tao Huang; Yu-Dong Cai
Journal:  Front Bioeng Biotechnol       Date:  2019-12-19

6.  Excavating novel diagnostic and prognostic long non-coding RNAs (lncRNAs) for head and neck squamous cell carcinoma: an integrated bioinformatics analysis of competing endogenous RNAs (ceRNAs) and gene co-expression networks.

Authors:  Liu Yang; Pingan Lu; Xiaohui Yang; Kaiguo Li; Xuxia Chen; Song Qu
Journal:  Bioengineered       Date:  2021-12       Impact factor: 3.269

7.  Long non‑coding RNA PRNCR1 exerts oncogenic effects in tongue squamous cell carcinoma in vitro and in vivo by sponging microRNA‑944 and thereby increasing HOXB5 expression.

Authors:  Cong Lin; Yanan Zou; Ruijing Li; Daofeng Liu
Journal:  Int J Mol Med       Date:  2020-04-16       Impact factor: 5.314

8.  The relevance between the immune response-related gene module and clinical traits in head and neck squamous cell carcinoma.

Authors:  Yidan Song; Yihua Pan; Jun Liu
Journal:  Cancer Manag Res       Date:  2019-08-06       Impact factor: 3.989

9.  KCNQ1OT1 aggravates cell proliferation and migration in bladder cancer through modulating miR-145-5p/PCBP2 axis.

Authors:  Jingyu Wang; Hao Zhang; Jie Situ; Mingzhao Li; Hua Sun
Journal:  Cancer Cell Int       Date:  2019-12-03       Impact factor: 5.722

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.