Literature DB >> 31059106

Signature microRNAs and long noncoding RNAs in laryngeal cancer recurrence identified using a competing endogenous RNA network.

Zhengyi Tang1, Ganguan Wei2, Longcheng Zhang2, Zhiwen Xu1.   

Abstract

The aim of the present study was to identify novel microRNA (miRNA) or long noncoding RNA (lncRNA) signatures of laryngeal cancer recurrence and to investigate the regulatory mechanisms associated with this malignancy. Datasets of recurrent and nonrecurrent laryngeal cancer samples were downloaded from The Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus database (GSE27020 and GSE25727) to examine differentially expressed miRNAs (DE‑miRs), lncRNAs (DE‑lncRs) and mRNAs (DEGs). miRNA‑mRNA and lncRNA‑miRNA networks were constructed by investigating the associations among these RNAs in various databases. Subsequently, the interactions identified were combined into a competing endogenous RNA (ceRNA) regulatory network. Feature genes in the miRNA‑mRNA network were identified via topological analysis and a recursive feature elimination algorithm. A support vector machine (SVM) classifier was established using the betweenness centrality values in the miRNA‑mRNA network, consisting of 32 optimal feature‑coding genes. The classification effect was tested using two validation datasets. Furthermore, coding genes in the ceRNA network were examined via pathway enrichment analyses. In total, 21 DE‑lncRs, 507 DEGs and 55 DE‑miRs were selected. The SVM classifier exhibited an accuracy of 94.05% (79/84) for sample classification prediction in the TCGA dataset, and 92.66 and 91.07% in the two validation datasets. The ceRNA regulatory network comprised 203 nodes, corresponding to mRNAs, miRNAs and lncRNAs, and 346 lines, corresponding to the interactions among RNAs. In particular, the interactions with the highest scores were HLA complex group 4 (HCG4)‑miR‑33b, HOX transcript antisense RNA (HOTAIR)‑miR‑1‑MAGE family member A2 (MAGEA2), EMX2 opposite strand/antisense RNA (EMX2OS)‑miR‑124‑calcitonin related polypeptide α (CALCA) and EMX2OSmiR‑124‑γ‑aminobutyric acid type A receptor γ2 subunit (GABRG2). Gene enrichment analysis of the genes in the ceRNA network identified that 11 pathway terms and 16 molecular function terms were significantly enriched. The SVM classifier based on 32 feature coding genes exhibited high accuracy in the classification of laryngeal cancer samples. miR‑1, miR‑33b, miR‑124, HOTAIR, HCG4 and EMX2OS may be novel biomarkers of recurrent laryngeal cancer, and HCG4miR‑33b, HOTAIRmiR‑1‑MAGEA2 and EMX2OSmiR‑124‑CALCA/GABRG2 may be associated with the molecular mechanisms regulating recurrent laryngeal cancer.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31059106      PMCID: PMC6522811          DOI: 10.3892/mmr.2019.10143

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Laryngeal cancer is the commonest cancer in the larynx. Squamous cell carcinoma is the predominant physiological type of laryngeal cancer, as it originates from the glottic region of the larynx. At present, to the best of the authors' knowledge, effective treatment for laryngeal squamous cell carcinoma (LSCC) primarily relies on radiotherapy and surgery. Tobacco and alcohol consumption are the principal risk factors for the development of laryngeal cancer (1). The annual incidence of laryngeal cancer is ~5 per 100,000 individuals and the 5-year survival rate is 60.6% in the USA (2,3). It was estimates that 23,400 new cases of laryngeal cancer occurred in China in 2014 (4). Patients with upper-airway malignancies exhibit an increased risk of developing locoregional cancer recurrence and second primary malignancies, and the larynx and nasopharynx are among the most frequent sites of locoregional cancer recurrence (5,6). Although the available therapeutic strategies exhibit promising results, the clinical outcomes of patients with LSCC remain poor (4). Understanding of the pathogenesis of the disease may contribute to the development of novel and more effective therapeutic strategies. Numerous studies have aimed to investigate the molecular mechanisms underlying laryngeal cancer development (7–9). Cell cycle proteins have been identified to serve important roles in the carcinogenesis of laryngeal cancer, and the upregulation of cellular tumor antigen p53 (p53), p21 and cyclin dependent kinase 1 in the surgical margin of early cancer is associated with local tumor recurrence (10). Recurrence rates in patients with p27- and phosphatase and tensin homolog (PTEN)-negative carcinoma were identified to be increased compared with patients with increased expression levels of these factors (11). Multiple types of RNA, including long noncoding RNAs (lncRNAs), circular RNAs, microRNAs (miRNAs), pseudogenes and protein-coding mRNAs may serve as key competing endogenous RNAs (ceRNAs) to regulate the expression levels of various mRNAs in mammalian cells (12,13). miR-196a may promote tumor progression in numerous cancer types, and its expression level was identified to be increased in laryngeal cancer (14). miR-221 may increase the cell proliferation rate by inhibiting apoptotic protease activating factor-1 in laryngeal cancer (15), whereas miRNA-299-3p targets the transcript of human telomerase reverse transcriptase (16). NF-κB-interacting lncRNA was identified to inhibit tumor cell viability and to promote apoptosis (17). A previous study analyzed an lncRNA expression dataset from the Gene Expression Omnibus (GEO) database, and identified that two lncRNAs, RP11-169K16.4 and RP11-107E5.3, were associated with the prediction of recurrence of laryngeal cancer (18). Although various previous studies have investigated the molecular mechanisms underlying laryngeal cancer, the etiology of this malignancy, and in particular of its recurrence, remains unclear. Notably, the regulatory interactions among noncoding and coding RNAs is only partially understood. The present study investigated novel potential molecular biomarkers involved in the recurrence of laryngeal cancer by constructing a ceRNA regulatory network consisting of miRNAs, lncRNAs and mRNAs.

Materials and methods

Training and validation datasets and data processing

Head and neck squamous cell carcinoma (HNSCC) mRNA samples were downloaded from The Cancer Genome Atlas (TCGA) database (https://gdc-portal.nci.nih.gov/) on August 8th, 2016. The samples of laryngeal cancer were selected according to the following criteria: i) Laryngeal origin; ii) recurrence status was recorded; and iii) matched mRNAs and miRNAs were barcoded. In total, 84 primary laryngeal cancer samples, including 19 recurrent and 65 nonrecurrent samples, were selected from 501 HNSCC tissues. The training dataset was constructed using the RNA sequencing (RNA-seq) data downloaded from the TCGA database. The gene expression datasets GSE27020 and GSE25727 were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/) and used as validation datasets. GSE27020 dataset was obtained using the GPL96 platform (Affymetrix human genome U133A array; Affymetrix, Inc.; Thermo Fisher Scientific, Inc., Waltham, MA, USA) and included 34 recurrent and 75 non-recurrent samples. Patients included in this dataset exhibited primary squamous cell laryngeal carcinoma and underwent surgical tumor removal (19). The raw data of this dataset in CIMFast Event Language format were downloaded and processed with gene expression background correction and were normalized using the oligo package version 1.46.0 (http://www.bioconductor.org/packages/release/bioc/html/oligo.html) in R (version 3.1.0; R Foundation, Vienna, Austria; https://www.R project.org/) (20). GSE25727 dataset was generated using the GPL8432 Illumina platform (Illumina, Inc., San Diego, CA, USA) and included 17 recurrent and 39 non-recurrent samples. All the patients were treated locally with surgery or radiotherapy, and none of them received chemotherapy (21). The data from this dataset were downloaded as text files, and the probe identification numbers were converted into gene symbols. In the case of multiple probes corresponding to one gene, the average value was considered as the expression value of this gene using the limma package 3.22.1 (22).

Analysis of differentially expressed lncRNAs, mRNAs and miRNAs

In the training dataset, the mRNAs and lncRNAs were annotated based on the information recorded in the HUGO Gene Nomenclature Committee (http://www.genenames.org/) database. Subsequently, the differentially expressed RNAs, including differentially expressed mRNAs (DEGs), differentially expressed lncRNAs (DE-lncRs) and differentially expressed miRNAs (DE-miRs), between recurrent and nonrecurrent laryngeal cancer samples were analyzed using the EdgeR package version 3.8.5, which adopted an overdispersed Poisson model and an empirical Bayesian approach to improve the reliability of the prediction (23). The thresholds for DEG and DE-lncR selection were based on a false discovery rate (FDR; adjusted P-value) <0.05, whereas for DE-miRs, P<0.05 was considered to indicate a statistically significant difference. Hierarchical cluster analysis was used to analyze sample similarity.

Analysis DE-lncRs associated with recurrence

Samples in the training dataset were classified into recurrent and nonrecurrent groups. The DE-lncRs that were significantly differentially expressed between recurrent and nonrecurrent samples were used to perform univariate Cox regression analysis, which was used to select the lncRNAs associated with recurrence. Kaplan-Meier analysis was performed to examine the association between the upregulated or downregulated lncRNAs, and recurrence and survival status.

Prediction of miRNA-regulated lncRNAs and mRNAs

The regulatory interactions among miRNAs and lncRNAs were investigated using the miRcode (http://www.mircode.org/) and starBase (http://starbase.sysu.edu.cn/) databases. All miRNA-regulated mRNAs were collected from the miRTarBase (http://mirtarbase.mbc.nctu.edu.tw) database. Associations among coding genes were examined using BioGRID (http://thebiogrid.org/), HPRD (http://www.hprd.org/) and DIP (http://dip.doe-mbi.ucla.edu/) databases. The identified DE-lncRs, DEGs and DE-miRs between recurrent and nonrecurrent samples were included in these regulatory interactions to obtain the lncRNA-miRNA and miRNA-mRNA regulatory networks. These two networks were combined to construct the ceRNA regulatory network, containing the associations among lncRNAs, miRNAs and mRNAs. The ceRNA regulatory network was constructed to investigate the regulatory mechanism underlying the associations among various RNAs. The network was visualized using Cytoscape software version 3.6.1 (24).

Analysis of feature coding genes in the miRNA-mRNA regulatory network

Topological structure analysis was conducted to identify the feature coding genes in the miRNA-mRNA network. The betweenness centrality (BC) of genes, corresponding to the importance of a certain node, or gene, in the network, was calculated using the following formula (25): Where v, s and t represent the nodes in the network, σst is the number of shortest paths from s to t, and σst (v) is the number of shortest paths from s to t, going through node v. BC values exhibited a range between 0 and 1, and the BC value was correlated with the importance of that node in the network. Therefore, the nodes exhibiting interactions with numerous mRNAs, miRNAs or lncRNAs, were more important in the construction of the network. The DEGs in the miRNA-mRNA network presenting the 100 highest BC values were selected as candidate feature-coding genes.

Construction of a support vector machine (SVM) classifier for various samples

The candidate genes that were significantly differentially expressed between recurrent and nonrecurrent samples were selected, and the unsupervised clustering classification method was used to validate the sample classification performance of these feature-coding genes (26). The 100 DEGs exhibiting the highest BC values were used to identify the optimal feature-coding gene set using the recursive feature elimination (RFE) algorithm (27). Through the iterative random feature combination, classification assessment and determination of the performance of various samples (28), the optimal feature-coding gene set was obtained. Subsequently, the set of optimal feature-coding genes was used to construct an SVM classifier, which considered the expression levels of the feature genes within the samples as the feature value to classify and distinguish the samples. The recurrence status of the samples was predicted using the SVM classifier. The robustness of the classifier were validated using the GSE27020 and GSE25727 datasets. The classification performance was evaluated using sensitivity (Se), specificity (Sp), positive predictive value (PPV), negative predictive value (NPV) and area under the receiver operating characteristic curve (AUC).

Molecular function and pathway enrichment analysis

The candidate feature coding genes selected from the ceRNA network were analyzed using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (29) to investigate the enriched molecular functions and signaling pathways in recurrent laryngeal cancer. Fisher's exact test (Fisher's noncentral hypergeometric distribution) was used to perform the enrichment analysis with the following formula (30): Where ‘N’ is the total number of genes, ‘M’ is the number of genes in the enriched pathway, ‘K’ is the number of DEGs, and ‘p’ is the probability that at least one DEG belongs to the functional pathway. Multiple hypothesis testing correction was used to identify the categories of GO molecular functions and KEGG pathways.

Results

Screening of DE-lncRs, DEGs and DE-miRs

Using the aforementioned methods for the analysis of the RNA-seq data from the TCGA database, 853 lncRNAs, 18,924 mRNAs and 1,047 human miRNAs were identified. A total of 21 DE-lncRs and 507 DEGs were identified between recurrent and nonrecurrent samples (FDR<0.05). In addition, 55 DE-miRs were identified (P<0.05). The results of the heat maps constructed for DEGs, DE-lncRs and DE-miRs suggested that the differentially expressed RNAs clustered according to the two sample types (Fig. 1). Numerous DEGs (507) were identified between recurrent and nonrecurrent samples.
Figure 1.

Heat map of DE-lncRNAs, DEGs and DE-miRNAs in recurrent and nonrecurrent laryngeal cancer samples. Heat map corresponding to (A) DE-lncRNAs, (B) DEGs and (C) DE-miRNAs. Yellow represents recurrent samples and blue represents nonrecurrent samples. Red indicates upregulated genes and green indicates downregulated genes. DE, differentially expressed; lncRNAs, long noncoding RNAs; DEGs, differentially expressed coding genes; miRNAs, microRNAs.

DE-lncRs associated with recurrence

Following the selection of DE-lncRs between recurrent and nonrecurrent samples, univariate Cox regression analysis was performed to identify the lncRNAs associated with recurrence. In total, five DE-lncRs, including testis-specific transcript, Y-linked 15 (TTTY15), EMX2 opposite strand/antisense RNA (EMX2OS), family with sequence similarity 138 member F, human leukocyte antigen complex group 4 (HCG4) and HOX transcript Antisense RNA (HOTAIR), were identified to be significantly associated with recurrence-free survival time and survival ratio (P<0.05). Survival analysis was conducted and the Kaplan-Meier survival curves (Fig. 2) suggested that the increased expression levels of the five DE-lncRs were associated with poor prognosis.
Figure 2.

KM survival curves for five differentially expressed lncRNAs associated with laryngeal cancer recurrence. Blue curves represent samples with downregulated lncRNAs, and red curves represent samples with upregulated lncRNAs. KM curves corresponding to (A) HCG4, (B) EMX2OS, (C) FAM138F, (D) HOTAIR and (E) TTTY15. KM, Kaplan-Meier; HCG4, HLA complex group 4; EMX2OS, EMX2 opposite strand/antisense RNA; FAM138F, family with sequence similarity 138 member F; HOTAIR, HOX transcript antisense RNA; TTTY15, testis-specific transcript, Y-linked 15; lncRNA, long noncoding RNA.

lncRNA-miRNA, miRNA-mRNA and mRNA-mRNA regulatory networks

By investigating the miRcode and starBase databases, a total of 268 lncRNA-miRNA regulatory interactions were screened. Among these 268 interactions, 14 interactions were present among DE-miRs and DE-lncRNAs. This lncRNA-miRNA regulatory network comprised four DE-lncRs associated with recurrence and seven DE-miRs (Fig. 3).
Figure 3.

Regulatory network of differentially expressed miR-lncRNA. Diamonds represent miRs and squares represent lncRNAs. Green represents downregulated genes and pink represents upregulated genes. HCG4, HLA complex group 4; HOTAIR, HOX transcript antisense RNA; TTTY15, testis-specific transcript, Y-linked 15; lncRNA, long noncoding RNA; miR, microRNA.

According to the information in the miRTarBase database, the DEGs regulated by the seven DE-miRs were investigated. A total of 55, 59, 34, 7, 17, 69 and 0 interacting DEGs were reported for hsa-miR-1, hsa-miR-124, hsa-miR-133a, hsa-miR-184, hsa-miR-208a, hsa-miR-33b and hsa-miR-499, respectively. Therefore, an miRNA-mRNA regulatory network was established comprising six miRNAs and 193 mRNAs (Fig. 4). Additionally to the DEGs, 22 coding genes exhibiting interactions with at least five DEGs were included in the network, as determined using the BioGRID, HPRD and DIP databases.
Figure 4.

miRNA-mRNA regulatory network corresponding to genes differentially expressed between recurrent and nonrecurrent samples. Circles represent mRNAs and diamonds represent miRNAs. Green represents downregulated genes, pink represents upregulated genes, and white circles represent genes associated with at least five miRNA-interacting genes.

Feature coding gene identification

The 100 coding genes, including 86 DEGs and 14 DEG-associated genes, exhibiting the highest BC values in the miRNA-mRNA network were examined. The 86 DEGs were used to perform clustering analysis, and the results suggested that the DEGs clustered according to the two types of samples (Fig. 5). Similar results were identified when all the DEGs were considered for the clustering analysis (Fig. 2). The present results suggested that the 86 DEGs may be used as feature-coding genes for sample grouping.
Figure 5.

Heat map of 86 feature coding genes identified by the betweenness centrality values in the microRNA-mRNA network. Yellow represents recurrent samples and blue represents nonrecurrent samples. Red indicates upregulated genes and green indicates downregulated genes.

SVM classifier

The RFE algorithm was used to refine the list of feature-coding genes, and 32 optimal feature-coding genes exhibiting the highest predictive accuracy (94.05%) for sample grouping were selected. The 32 coding genes are presented in Table I. The SVM classifier constructed using the 32 feature-coding genes presented high accuracy (94.05%, 79/84) in separating the recurrent samples from the nonrecurrent samples.
Table I.

Basic information of 32 optimal feature-coding genes, among the genes significantly differentially expressed between recurrent and nonrecurrent cancer samples.

Gene symbolBetweenness centralityNumber of interactionsP-valueLog2 fold-change
SHISA60.24040.028  0.544
MAGEA20.21550.047−0.394
TRDN0.15660.002  0.601
RGS70.14940.001  1.331
CPNE40.14240.020  0.657
CALCA0.11130.004  1.214
GABRG20.10120.035  0.599
TECRL0.08330.043  0.846
MYLK20.07330.005  0.543
MYO3A0.05830.048  0.365
ASB100.05730.001  1.261
WT10.03940.015  0.712
CALCB0.03820.006  0.986
OPCML0.03820.011  0.693
KY0.03820.011  0.502
CYP3A430.03820.027−2.312
EPHA60.03820.0011.216
SLC6A200.03820.001  0.583
NCAM20.03620.026  0.451
FLRT30.03320.022  0.312
TMC50.03320.034−0.334
AADACL30.03320.011  0.983
ANKRD10.02660.005  0.520
CLDN220.02020.005−1.662
FGF50.01820.015  0.593
SCN5A0.01620.009  0.475
PTPRR0.01620.005  0.561
LDHC0.01520.000−1.366
OLFM30.01220.036  1.283
HRASLS50.01220.042−0.560
SPINK60.01020.011−0.533
SPESP10.00820.004  0.499
By examining the two validation datasets, the SVM classifier was able to distinguish 101 samples (30 recurrent and 71 non-recurrent) and 51 samples (16 recurrent and 35 non-recurrent) from the GSE27020 and GSE25727 datasets (Fig. 6) with an accuracy of 92.66 and 91.07%, respectively. The scatter plot of sample classifications is presented in Fig. 7. The five indicators (Se, Sp, PPV, NPV and AUC) presented high scores (Table II). AUC exhibited the highest score among the five indicators (Fig. 8). The present results suggested that the SVM classifier exhibited high accuracy and reliability in grouping samples from various datasets.
Figure 6.

Classification effect of the support vector machine classifier. Classification effect was investigated on the two validation datasets, (A) GSE27020 and (B) GSE25727. Yellow represents recurrent samples and blue represents nonrecurrent samples. Red indicates upregulated genes and green indicates downregulated genes.

Figure 7.

Scatter plot of samples classification using the SVM classifier. (A) Analysis of The Cancer Genome Atlas dataset. (B) GSE27020 dataset. (C) GSE25727 dataset. Blue represents nonrecurrent samples and red represents recurrent samples. SVM, support vector machine.

Table II.

Classifying parameters of the support vector machine classifier for the three datasets analyzed.

DatasetNumber of samplesAccuracySensitivitySpecificityPositive predictive valueNegative predictive valueArea under receiver operating characteristic curve
TCGA  840.94100.9470.9380.8180.9840.986
GSE270201090.92660.8820.9470.8820.9470.946
GSE25727  560.91070.9410.8970.80.9720.921

TCGA, The Cancer Genome Atlas.

Figure 8.

AUC of the support vector machine classifier on sample classification. (A) Sample grouping of The Cancer Genome Atlas dataset. (B) Sample grouping of the GSE27020 dataset. (C) Sample grouping of the GSE25727 dataset. AUC, area under the receiver operating characteristic curve.

The ceRNA regulatory network comprised 203 nodes and 346 lines and was constructed by combining the lncRNA-miRNA regulatory network and the miRNA-mRNA regulatory network (Fig. 9). In the network, a node indicated a transcript (mRNA, miRNA or lncRNA), whereas a line indicated the association between two nodes. The coding genes in the ceRNA network were significantly enriched in 11 pathway categories, including ‘neuroactive ligand-receptor interaction’, ‘salivary secretion’ and ‘tight junction’ (Table III), and 16 molecular function categories, including ‘muscle contraction’, ‘muscle system process’ and ‘blood circulation’ (Table IV). The multiple hypotheses testing correction results for the GO molecular function and KEGG pathway categories were not statistically significant (data not shown).
Figure 9.

Competing endogenous RNA regulatory network of all DE-miRNAs, DE-lncRNAs and DEGs. Circles represent mRNAs, diamonds represent miRNAs and squares represent lncRNAs. Green represents downregulated genes, pink represents upregulated genes and white circles represent genes associated with at least five miRNA-interacting genes. miRNA, microRNA; DE, differentially expressed; lncRNA, long noncoding RNA.

Table III.

Significantly enriched pathways in the competing endogenous RNA regulatory network.

KEGG pathway IDKEGG pathway nameGene countP-valueDifferentially expressed genes
hsa04530Tight junction50.001MYH1, CLDN6, MYH4, CTNNA2, CLDN10
hsa04970Salivary secretion40.001HTN3, ATP1B4, ATP1A2, STATH
hsa04080Neuroactive ligand-receptor interaction60.004GABRA2, GABRG2, PTH2R, OPRK1, CHRNA1, GRIK2
hsa04918Thyroid hormone synthesis30.0076528, ATP1B4, ATP1A2
hsa04964Proximal tubule bicarbonate reclamation20.008ATP1B4, ATP1A2
hsa04960Aldosterone-regulated sodium reabsorption20.020ATP1B4, ATP1A2
hsa05033Nicotine addiction20.020GABRA2, GABRG2
hsa04973Carbohydrate digestion and absorption20.025ATP1B4, ATP1A2
hsa04670Leukocyte transendothelial migration30.026CLDN6, CTNNA2, CLDN10
hsa04978Mineral absorption20.032ATP1B4, ATP1A2
hsa04514Cell adhesion molecules (CAMs)30.041CLDN6, NCAM2, CLDN10

KEGG, Kyoto Encyclopedia of Genes and Genomes.

Table IV.

Significantly enriched molecular functions in the competing endogenous RNA regulatory network.

GO term IDGO term nameGene countP-valueGenes
GO:0006936Muscle contraction114.17×10−7TRDN, ACTC1, MYH1, MYL1, MYLK2, MYH4, SMPX, TTN, CHRNA1, SCN5A, CASQ2
GO:0003012Muscle system process119.88×10−7TRDN, ACTC1, MYH1, MYL1, MYLK2, MYH4, SMPX, TTN, CHRNA1, SCN5A, CASQ2
GO:0008015Blood circulation101.89×10−5CALCA, CALCB, ACTC1, MYL1, MYLK2, CARTPT, ATP1A2, TAC4, SCN5A, EPO
GO:0003013Circulatory system process101.89×10−5CALCA, CALCB, ACTC1, MYL1, MYLK2, CARTPT, ATP1A2, TAC4, SCN5A, EPO
GO:0007267Cell-cell signaling169.03×10−5FGF5, GABRG2, GABRA2, GRIK2, OPRK1, MYLK2, ATP1A2, CXCL6, IL22, CTNNA2, CALCA, PNOC, SLC1A6, CARTPT, SLC30A8, CHRNA1
GO:0006811Ion transport181.32×10−4GABRG2, SLC5A5, GABRA2, GRIK2, ATP1B4, KCNA6, ATP1A2, TCN1, BEST3, SLC1A6, SLC5A8, LTF, ANO5, ANO4, CHRNA1, SLC30A8, SCN5A, ADD2
GO:0007268Synaptic transmission111.44×10−4GABRG2, GABRA2, PNOC, GRIK2, OPRK1, SLC1A6, MYLK2, CARTPT, ATP1A2, CHRNA1, CTNNA2
GO:0030182Neuron differentiation132.09×10−4NCAM2, SLITRK4, SOX1, OPCML, FOXA2, PKHD1, EMX2, MDGA2, PTPRR, POU4F1, OLFM3, NEFL, CTNNA2
GO:0019226Transmission of nerve impulse115.20×10−4GABRG2, GABRA2, PNOC, GRIK2, OPRK1, SLC1A6, MYLK2, CARTPT, ATP1A2, CHRNA1, CTNNA2
GO:0055082Cellular chemical homeostasis103.60×10−3CALCA, CALCB, XIRP1, GRIK2, SLC1A6, LTF, CARTPT, ATP1A2, SLC30A8, CHRNA1
GO:0019725Cellular homeostasis101.29×10−2CALCA, CALCB, XIRP1, GRIK2, SLC1A6, LTF, CARTPT, ATP1A2, SLC30A8, CHRNA1
GO:0048878Chemical homeostasis102.24×10−2CALCA, CALCB, XIRP1, GRIK2, SLC1A6, LTF, CARTPT, ATP1A2, SLC30A8, CHRNA1
GO:0007155Cell adhesion122.54×10−2IBSP, FLRT3, CALCA, NCAM2, AMBN, OPCML, FAT3, PKHD1, CLDN6, CLDN22, CLDN10, CTNNA2
GO:0022610Biological adhesion122.56×10−2IBSP, FLRT3, CALCA, NCAM2, AMBN, OPCML, FAT3, PKHD1, CLDN6, CLDN22, CLDN10, CTNNA2
GO:0050877Neurological system process173.28×10−2GABRG2, GABRA2, MYO3A, GRIK2, OPRK1, MYLK2, ATP1A2, TAS1R1, CTNNA2, CALCA, NCAM2, PNOC, SLC1A6, CARTPT, POU4F1, CHRNA1, NEFL
GO:0042592Homeostatic process123.94×10−2CALCA, CALCB, XIRP1, GRIK2, PKHD1, SLC1A6, LTF, CARTPT, ATP1A2, SLC30A8, CHRNA1, EPO

GO, Gene Ontology.

ceRNA regulatory network of 32 optimal feature-coding genes

The ceRNA regulatory network corresponding to the 32 optimal feature coding genes was established and consisted of six DE-miRs, four DE-lncRs and 32 feature-coding genes (Fig. 10). In the ceRNA network, the interactions between HCG4-miR-33b, HOTAIR-miR-1-MAGE family member A2 (MAGEA2), EMX2OS-miR-124-calcitonin related polypeptide α (CALCA) and EMX2OS-miR-124-γ-aminobutyric acid type A receptor γ2 subunit (GABRG2) exhibited the highest scores. The Kaplan-Meier survival curves of these six DE-miRs suggested a significant association between recurrence-free survival times and the expression levels of the miRNAs examined, including hsa-miR-33b, hsa-miR-124, hsa-miR-133a and hsa-miR-208a (P<0.05; Fig. 11).
Figure 10.

Competing endogenous RNA regulatory network of DE-miRNAs, DE-lncRNAs, and 32 optimal feature coding genes. Circles represent mRNAs, diamonds represent miRNAs and squares represent lncRNAs. Green represents downregulated genes, pink represents upregulated genes and white circles represent genes associated with at least five miRNA-interacting genes. DE, differentially expressed; miRNA, microRNA; lncRNA, long noncoding RNA.

Figure 11.

KM survival curves of the differentially expressed miRs in the competing endogenous RNA regulatory network constructed using 32 optimal feature coding genes. Blue curves represent samples with downregulated miRs and red curves represent samples with upregulated miRs. (A) hsa-miR-1. (B) hsa-miR-33b. (C) hsa-miR-124. (D) hsa-miR-133a. (E) hsa-miR-184. (F) hsa-miR-208a. KM, Kaplan-Meier; miR, microRNA.

Discussion

By analyzing the RNA-seq data downloaded from TCGA, the DE-miRs, DEGs and DE-lncRs between recurrent and nonrecurrent laryngeal cancer samples were identified. Based on the associations identified in various databases, numerous networks (miRNA-mRNA, lncRNA-mRNA and ceRNA networks) were constructed to examine the potential interactions among feature-coding genes, miRNAs and lncRNAs. The aim of the present study was to identify the regulatory mechanisms underlying the recurrence of laryngeal cancer. In the ceRNA network, the interactions between HCG4-miR-33b, HOTAIR-miR-1- MAGEA2, EMX2OS-miR-124- CALCA and EMX2OS-miR-124-GABRG2 exhibited the highest scores. According to the BC values in the miRNA-mRNA network and the RFE algorithm, 32 optimal feature-coding genes, including MAGEA2, CALCA and GABRG2 were identified to be associated with recurrent laryngeal cancer samples. Since the 32 feature-coding genes were identified by analyzing the miRNA-mRNA network, all 32 coding genes were associated with DE-miRs. The SVM classifier constructed using these feature-coding genes exhibited high accuracy in grouping samples from the TCGA dataset (94.05%) and from the two validation datasets (GSE27020: 92.66%; GSE25727: 91.07%). miR-206 serves an important role in the inhibition of tumor proliferation and metastasis in numerous cancer types (31–33). In laryngeal cancer, miR-206 may serve as a tumor suppressor by targeting vascular endothelial growth factor (34). miR-1, a homolog of miR-206 (35), was identified as a DE-miR in the present ceRNA network and may serve a role in the control of laryngeal cancer recurrence. The present bioinformatics analyses identified that miR-1 may be associated with the lncRNA HOTAIR in laryngeal cancer. In LSCC, HOTAIR is upregulated and associated with the risk of lymphatic metastasis, and with poor prognosis (36,37). In addition, the oncogenic role of HOTAIR may be associated with PTEN methylation (38). To the best of the authors' knowledge, no previous study has investigated the regulatory association between miR-1 and HOTAIR in laryngeal cancer. However, miR-1 was demonstrated to be a direct downstream target of HOTAIR in thyroid cancer cells, and miR-1 inhibition may promote HOTAIR-mediated tumor progression (39). In bladder cancer cells, the increased expression of miR-1 is negatively associated with HOTAIR (40). In the present ceRNA network, MAGEA2 was identified to be a potential target of miR-1. In HNSCC, MAGEA2 is upregulated and promotes the growth of normal oral keratinocytes, partly via the p53 pathway (41). Collectively, the interaction HOTAIR-miR-1-MAGEA2 may serve a regulatory role in the progression of laryngeal cancer. miR-33b belongs to the miR-33 family and regulates cholesterol homeostasis, and lipid and glucose metabolism (42). A previous study observed that miR-33b may inhibit the metastasis of breast cancer (43). In gastric cancer, curcumin exerts its inhibitory activity by upregulating the expression level of miR-33b (44). In laryngeal cancer, a previous study identified that miR-33a may exert antiproliferative effects by targeting pim-1 proto-oncogene, serine/threonine kinase (45). In the present ceRNA network, miR-33b was associated with the lncRNA HCG4, which was identified to be a candidate gene in multiple sclerosis (46); however, its role in laryngeal cancer remains unclear. The present results suggested that HCG4 was a DE-lncR associated with the recurrence of laryngeal cancer, which interacted with miR-33b. Therefore, these two molecular factors may be associated with and may serve important functional roles during the recurrence of laryngeal cancer. miR-124 has been identified to serve a role in numerous cancer types (47,48). Hypermethylation of miR-124 may result in the development of colon, breast and lung cancer (49). The downregulation of miR-124 expression may suppress nasopharyngeal carcinoma (50). Furthermore, the expression level of miR-124 is associated with recurrence in patients with lung cancer (51). In HNSCC, miR-124 serves as a tumor suppressor gene (52) and inhibits the epithelial-restricted with serine box/epidermal growth factor receptor signaling pathway (53). In the present study, miR-124 was identified to be differentially expressed between recurrent and nonrecurrent laryngeal cancer samples. Therefore, miR-124 may serve as a tumor suppressor gene in laryngeal cancer and may be associated with laryngeal cancer recurrence. The present ceRNA network results suggested that EMX2OS and TTTY15 were two lncRNAs that may interact with miR-124. EMX2OS was previously identified to be associated with brain diseases (54). In comparison with healthy controls, patients with myalgic encephalomyelitis/chronic fatigue syndrome exhibited an increase in the expression level of EMX2OS (55). EMX2OS is expressed in the central nervous system, similarly to empty spiracles homeobox 2 (54). The gene fusion zinc finger DHHC-type containing 2-TTTY15 was identified as one of four fusion genes in a patient with acute myeloid leukemia, by whole genome sequencing (56). The fusion gene TTTY15-ubiquitin specific peptidase 9 Y-linked (USP9Y) exhibited high prevalence among Chinese patients with prostate cancer, and the TTTY15-USP9Y fusion is an independent predictor of prostate cancer (57). To the best of the authors' knowledge, no previous studies have reported the association between TTTY15 or EMX2OS and laryngeal cancer. Therefore, these lncRNAs may represent novel biomarkers or may exhibit gene fusion in laryngeal cancer. In addition, miR-184 expression is upregulated in the majority of HNSCC tumors (58). In the present study, miR-124 and miR-184 were associated with the lncRNA EMX2OS, suggesting a potential role EMX2OS in the regulation of laryngeal cancer. Among the genes interacting with miR-124, the present study identified that CALCA and GABRG2 were associated with recurrent laryngeal carcinoma. A previous study demonstrated that methylation of CALCA was identified to be detected in 70% of patients with recurrent thyroid cancer (59). Therefore, CALCA and GABRG2 may serve important roles in the recurrence of laryngeal cancer, and the regulatory mechanism underlying laryngeal cancer recurrence may involve the EMX2OS-miR-124-CALCA/GABRG2 axis. The SVM classifier is an effective classification method. The SVM classifier has been used to classify various cancer samples, in cancer subtypes including melanoma (60), metastatic breast cancer (61) and lung cancer (62). The SVM classifier was constructed to identify the feature-coding genes that may be able to discriminate recurrent from nonrecurrent laryngeal cancer samples. In the present study, the SVM classifier comprising 32 feature coding genes exhibited high accuracy in sample classification. Specifically, the SVM classifier exhibited high accuracy in the two validation datasets, suggesting that this method was a reliable classification method. Although the present bioinformatics results were tested using validation datasets, the present study exhibited two limitations. Since the samples were derived from TCGA and GEO databases, the sample size may be limited. Furthermore, the present results require validation in vitro and in vivo. Therefore, further experiments are required to be performed in the future to confirm the present study. Nevertheless, the present results may provide useful insights for improving the understanding of the pathogenesis of laryngeal cancer recurrence. In conclusion, an SVM classifier composed of 32 feature-coding genes classified recurrent and nonrecurrent laryngeal cancer samples with high accuracy. miR-1, miR-33b, miR-124, HOTAIR, HCG4 and EMX2OS may represent a signature of noncoding RNAs in recurrent laryngeal cancer. The interactions HCG4-miR-33b, HOTAIR-miR-1-MAGEA2 and EMX2OS-miR-124-CALCA/GABRG2 may be important in the regulation of the molecular mechanisms underlying the development of recurrent laryngeal cancer.
  62 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Universal behavior of load distribution in scale-free networks.

Authors:  K I Goh; B Kahng; D Kim
Journal:  Phys Rev Lett       Date:  2001-12-12       Impact factor: 9.161

3.  Laryngeal cancer in the United States: changes in demographics, patterns of care, and survival.

Authors:  Henry T Hoffman; Kimberly Porter; Lucy H Karnell; Jay S Cooper; Randall S Weber; Corey J Langer; Kie-Kian Ang; Greer Gay; Andrew Stewart; Robert A Robinson
Journal:  Laryngoscope       Date:  2006-09       Impact factor: 3.325

4.  Enrichment or depletion of a GO category within a class of genes: which test?

Authors:  Isabelle Rivals; Léon Personnaz; Lieng Taing; Marie-Claude Potier
Journal:  Bioinformatics       Date:  2006-12-20       Impact factor: 6.937

5.  Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data.

Authors:  Benilton Carvalho; Henrik Bengtsson; Terence P Speed; Rafael A Irizarry
Journal:  Biostatistics       Date:  2006-12-22       Impact factor: 5.899

6.  The dilemma of follow-up in head and neck cancer patients.

Authors:  I Haas; U Hauser; U Ganzer
Journal:  Eur Arch Otorhinolaryngol       Date:  2001-05       Impact factor: 2.503

7.  Detection of serum deoxyribonucleic acid methylation markers: a novel diagnostic tool for thyroid cancer.

Authors:  Shuiying Hu; Marge Ewertz; Ralph P Tufano; Mariana Brait; Andre Lopes Carvalho; Dingxie Liu; Anthony P Tufaro; Shehzad Basaria; David S Cooper; David Sidransky; Paul W Ladenson; Mingzhao Xing
Journal:  J Clin Endocrinol Metab       Date:  2005-11-01       Impact factor: 5.958

8.  Glottic carcinoma--patterns of failure and salvage treatment after curative radiotherapy in 861 consecutive patients.

Authors:  Lars Vendelbo Johansen; Cai Grau; Jens Overgaard
Journal:  Radiother Oncol       Date:  2002-06       Impact factor: 6.280

9.  SNP mapping and candidate gene sequencing in the class I region of the HLA complex: searching for multiple sclerosis susceptibility genes in Tasmanians.

Authors:  R K Burfoot; C J Jensen; J Field; J Stankovich; M D Varney; L J Johnson; H Butzkueven; D Booth; M Bahlo; B D Tait; B V Taylor; T P Speed; R Heard; G J Stewart; S J Foote; T J Kilpatrick; J P Rubio
Journal:  Tissue Antigens       Date:  2007-10-30

10.  Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data.

Authors:  Xuegong Zhang; Xin Lu; Qian Shi; Xiu-Qin Xu; Hon-Chiu E Leung; Lyndsay N Harris; James D Iglehart; Alexander Miron; Jun S Liu; Wing H Wong
Journal:  BMC Bioinformatics       Date:  2006-04-10       Impact factor: 3.169

View more
  9 in total

1.  NLR, PLR, LMR and MWR as diagnostic and prognostic markers for laryngeal carcinoma.

Authors:  Pingdong Li; Haiyang Li; Shuo Ding; Jing Zhou
Journal:  Am J Transl Res       Date:  2022-05-15       Impact factor: 3.940

2.  Development and validation of a novel metabolic signature for predicting prognosis in patients with laryngeal cancer.

Authors:  Wenfei Li; Min Fu; Kun Zhao; Fang Han; Ning Bu; Zhanqiu Wang
Journal:  Eur Arch Otorhinolaryngol       Date:  2020-10-27       Impact factor: 2.503

3.  Regulation of laryngeal squamous cell cancer progression by the lncRNA RP11-159K7.2/miR-206/DNMT3A axis.

Authors:  Xin Wang; Boyu Yu; Qianqian Jin; Junyi Zhang; Bingrui Yan; Like Yang; Yushan Li; Qiuying Li; Peng Wang; Chuanhui Sun; Ming Liu; Linli Tian; Yanan Sun
Journal:  J Cell Mol Med       Date:  2020-05-04       Impact factor: 5.310

4.  Liquid lncRNA Biopsy for the Evaluation of Locally Advanced and Metastatic Squamous Cell Carcinomas of the Head and Neck.

Authors:  Izabela Łasińska; Tomasz Kolenda; Kacper Guglas; Magda Kopczyńska; Joanna Sobocińska; Anna Teresiak; Norbert Oksza Strzelecki; Katarzyna Lamperska; Andrzej Mackiewicz; Jacek Mackiewicz
Journal:  J Pers Med       Date:  2020-09-16

5.  LINC00488 stimulates the progression of esophageal cancer by targeting microRNA-485-5p.

Authors:  Hongbo Xu; Yan Ye
Journal:  Oncol Lett       Date:  2020-12-04       Impact factor: 3.111

6.  Identification of LOC101927355 as a Novel Biomarker for Preeclampsia.

Authors:  Reyna Peñailillo; Lara J Monteiro; Stephanie Acuña-Gallardo; Felipe García; Victoria Velásquez; Paula Correa; Pilar Díaz; Patricia P Valdebenito; Cristina Navarro; Roberto Romero; Mario Sánchez; Sebastián E Illanes; Gino Nardocci
Journal:  Biomedicines       Date:  2022-05-27

7.  Clinicopathological and prognostic significance of the long non-coding RNA HOTAIR high expression in head and neck squamous cell carcinoma: a systematic review and meta-analysis.

Authors:  Gang Yang; Tianyu Huang; Faming Pan; Yuanyin Wang; Ye Wu; Jinian Wang; Zhipeng Pan; Yuting Chen
Journal:  Transl Cancer Res       Date:  2022-08       Impact factor: 0.496

8.  Identification of 12 immune-related lncRNAs and molecular subtypes for the clear cell renal cell carcinoma based on RNA sequencing data.

Authors:  Weimin Zhong; Bin Chen; Hongbin Zhong; Chaoqun Huang; Jianqiong Lin; Maoshu Zhu; Miaoxuan Chen; Ying Lin; Yao Lin; Jiyi Huang
Journal:  Sci Rep       Date:  2020-09-02       Impact factor: 4.379

Review 9.  Exosomal noncoding RNAs in cholangiocarcinoma: Laboratory noise or hope?

Authors:  Konstantinos Laschos; Dimitra Ioanna Lampropoulou; Gerasimos Aravantinos; Maria Piperis; Dimitrios Filippou; George Theodoropoulos; Maria Gazouli
Journal:  World J Gastrointest Surg       Date:  2020-10-27
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.