Literature DB >> 30431092

Screening and clinical significance of tumor markers in head and neck squamous cell carcinoma through bioinformatics analysis.

Lei Zhao1, Weiwei Chi2, Huan Cao1, Weina Cui1, Wenxia Meng1, Wei Guo3, Baoshan Wang1.   

Abstract

In order to identify potential diagnostic and prognostic biomarkers, and treatment targets for head and neck squamous cell carcinoma (HNSCC), the present study obtained the gene expression profiles in HNSCC through public data mining, and core genes were identified using a series of bioinformatics analysis methods and databases. A total of nine hub genes (SPP1, ITGA6, TMPRSS11D, MMP1, LAMC2, FAT1, ACTA1, SERPINE1 and CEACAM1) were identified to be significantly correlated with HNSCC. Furthermore, overall survival analysis demonstrated that the expression values of hub genes were associated with overall survival in HNSCC. Furthermore, certain of the identified genes, including, TMPRSS11D, ACTA1 and CEACAM1, have not been thoroughly investigated in HNSCC previously. Taken together, the nine hub genes obtained by screening in the present study may serve as potential tumor markers and important prognostic indicators for HNSCC.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30431092      PMCID: PMC6297764          DOI: 10.3892/mmr.2018.9639

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Head and neck squamous cell carcinoma (HNSCC), a common malignant tumor of the head and neck region, is primarily comprised of lip, oral cavity, larynx, nasopharynx and other pharynx carcinomas. In 2012, the number of new HNSCC cases reported worldwide was ~686,000, while the HNSCC-associated mortality cases were 375,000 (1). Currently, smoking and alcohol consumption are deemed to be risk factors for HNSCC development; furthermore, human papilloma virus (HPV) infection is considered to have an important role in the occurrence and prognosis of HNSCC (2). In spite of the application of surgery, chemoradiation and multimodal treatment approaches, the prognosis of HNSCC remains poor due to local recurrence and metastasis, and the 5-year overall survival is ~50% (1). Furthermore, poor prognosis is partially attributed to the lack of understanding of the molecular mechanism underlying the development of this cancer. The oncogenesis and progression of HNSCC is a complicated process involving multiple molecules, including microRNA-98 (3), Twist family BHLH transcription factor 1 (4) and Mastermind-like 1 (4). With the application of immunotherapy, multimodal therapeutic approaches may be improved in the future (2). Thus, identifying specific tumor markers and novel molecular targets for the treatment of HNSCC is important. Huang et al (5) has recently proposed that lysine (K)-specific demethylase 5B (KDM5B) is overexpressed in HNSCC tissues as compared with its levels in adjacent noncancerous tissues, and may be a significant prognostic biomarker of HNSCC on account of the association between KDM5B and overall survival times. A previous study by Trivedi et al (6) suggested an association between the expression of several tumor markers (including pemphigus vulgaris antigen, parathyroid hormone-related peptide and tumor-associated calcium signal transducer 1) and the metastasis of HNSCC to the lymph nodes. Furthermore, numerous established or emerging biomarkers associated with HNSCC have previously been explored, including hypoxia-inducible factor 1, carbonic anhydrase IX, programmed death ligand-1 and cytotoxic T-lymphocyte antigen 4, which are involved in immune checkpoints, hypoxia and radiation sensitivity (7). Tumor markers vary widely due to the multiple anatomical sites and histological types of HNSCC; thus, the identification of more valuable tumor markers of HNSCC is required. The approach for screening tumor markers and molecular targets has improved with the application of microarray and transcriptional sequencing technologies (8,9). Microarray analysis and high-throughput sequencing technology provide valuable information and have been successfully used in marker screening for numerous tumors (10,11). The transcriptome data containing large amounts of information is stored in a public database for sharing. Data mining and bioinformatics analysis allow for transcriptome data to be fully utilized by researchers and enable more reliable biological information to be obtained (12–14). Biological and medical research has improved through the generation and development of bioinformatics (15). Numerous biological websites and software are available for use in bioinformatics analysis, including The Cancer Genome Atlas (TCGA) (16), Gene Expression Omnibus (GEO) 2R (17), The Database for Annotation Visualization and Integrated Discovery (DAVID) (18), Gene Expression Profiling Interactive Analysis (GEPIA) (19). These websites and software have been successfully applied in numerous biological studies (13,14). Despite the extensive application of bioinformatics analysis in biomarker screening, investigations regarding HNSCC are limited. Demokan et al (20) have demonstrated that guanine nucleotide-binding protein γ-7 (GNG7) is downregulated in HNSCC through expression profile screening, and the expression level of GNG7 was then validated by quantitative methylation-specific polymerase chain reaction. A recent study also identified differentially expressed miRNAs and mRNAs in laryngeal squamous cell carcinoma through data mining and bioinformatics analysis (21). Therefore, the present study aimed to identify novel diagnostic, prognostic or predictive biomarkers for HNSCC, and highlight the core genes associated with HNSCC through bioinformatics analysis. In the present study, three mRNA expression profiles associated with HNSCC were retrieved and filtered from the National Center for Biotechnology Information (NCBI)/GEO datasets, which are public and freely available databases (17,22). The differentially expressed genes (DEGs) in each data series were analyzed using GEO2R. The intersection of the three sets of DEGs extracted from the three data series included 19 mRNAs, which were defined as the hub genes and deemed to be highly associated with HNSCC. Through bioinformatics analysis, validation and survival analysis of a large sample size based on TCGA, a total of nine hub genes were identified as potential tumor markers and important prognostic indicators for HNSCC.

Materials and methods

Data mining

Transcriptome expression profiles of HNSCC patients were retrieved from the NCBI-GEO datasets (https://www.ncbi.nlm.nih.gov/gds/) and used for further screening. Three data series were selected for investigation in the present study, including GSE6631, GSE58911 and GSE83519. The screening criteria were set as follows: The mRNA expression profiles selected were derived from matched-pairs sample, while data derived from non-paired sample or cell lines were excluded. The series GSE6631 and GSE83519 included 22 paired samples each, and GSE58911 contained 15 paired samples. These data series were downloaded (retrieval date, January 7, 2018), and the DEGs between the HNSCC and matched control samples included in the three data series were filtered using GEO2R (www.ncbi.nlm.nih.gov/geo/info/geo2r.html), an NCBI-GEO integration tool. The default parameters used for DEG identification was a false discovery rate (Benjamini-Hochberg procedure) and P<0.05 (17,22). Then, MultiExperiment Viewer (23) and R package ‘ggplot2’ were applied for cluster analysis and visualization of DEGs. With these software, the overall distribution of DEGs in cancer tissues and paired normal tissues may be presented using a hierarchical clustering graph. The distribution of each DEG in different samples and the distribution of all DEGs in a given sample may also be distinguished using a clustering graph. The distribution tendency of DEGs may be presented directly using a volcano diagram.

Screening of hub genes

Three sets of DEGs derived from the three data series were further filtered according to rigorous screening criteria, including a fold change (FC) of ≥2 and P≤0.05. The newly acquired sets of DEGs were used in subsequent analyses. The intersections of the three sets of DEGs originating from the data series (GSE6631, GSE58911 and GSE83519) were presented using a Venn diagram, which was implemented with the online software VENN DIAGRAMS (http://bioinformatics.psb.ugent.be/beg/tools/venn-diagrams). Subsequently, the common genes of the three DEG sets were extracted and defined as hub genes if they were highly correlated with HNSCC.

Validation of relative expression levels of hub genes

In order to confirm and increase the reliability of the data analysis, the relative expression levels of hub genes in HNSCC were validated using GEPIA (http://gepia.cancer-pku.cn/index.html; retrieval date, January 9, 2018), an online analysis software based on the TCGA and Genotype-Tissue Expression (GTEx) databases, using |logFC|≥1 and P≤0.05 as the cut-off criteria (19). The results are presented as box plots.

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses

The GO project describes genes and gene products on the basis of three different aspects, namely biological processes, molecular functions and cellular components, in a species-independent manner. GO has been widely used for genomics and proteomics analyses in biomedical research (24,25). KEGG, which primarily consists of various functions, including KEGG PATHWAY and KEGG GENES, is used for understanding high-level functions from molecular-level data (26). The KEGG PATHWAY, a manually reference database for pathway mapping, represents the current knowledge on molecular interactions, reactions and association networks (26). To gain an improved understanding of the hub genes, GO enrichment and KEGG pathway analyses were performed using DAVID, an online integration tool (retrieval date, January 9, 2018) (18,27).

Construction and analysis of protein-protein interaction (PPI) network

For a more in-depth understanding of the associations among hub genes, the PPI network of hub genes was structured via online database STRING, according to the provided manual (retrieval date, January 9, 2018) (28). PPIs may be evaluated and integrated via the PPI networks (28). Furthermore, the expression correlation of key nodes was verified using GEPIA (retrieval date, January 9, 2018).

Extraction and analysis of potential tumor markers

In order to identify tumor markers associated with prognosis, the hub genes were further filtered according to the score of nodes and the strength of interaction in the PPI network. Next, a log-rank test of overall survival (200 months was the longest follow-up period) associated with these selected hub genes was performed using GEPIA (retrieval date, January 9, 2018).

Results

DEGs derived from three data series associated with HNSCC

Initially, eight data series (including GSE83519, GSE58911, GSE6631, GSE23036, GSE13397, GSE40185, GSE10774 and GSE7073), which were able to be analyzed with GEO2R, were obtained by the retrieval of mRNA expression profiles associated with HNSCC in NCBI-GEO datasets. Among them, five data series were excluded out according to the screening criteria, including the cell line-based series GSE40185, GSE10774 and GSE7073, as well as the unpaired tissue sample-based series GSE23036 and GSE13397. The remaining three data series (GSE83519, GSE58911 and GSE6631) that met the screening criteria were used in the subsequent investigation. GEO2R is a NCBI integration analysis tool for expression profile data and may be used to screen DEGs of two or more groups of sample sources (17,22). Three sets of DEGs derived from the three included data series were acquired by GEO2R analysis, and these are listed in Table I.
Table I.

Differentially expressed genes in the three data series.

Differentially expressed genes

SeriesSamplesUpregulatedDownregulatedTotal
GSE663122 paired66119185
GSE5891115 paired204450654
GSE8351922 paired1,1661,1002,266
In order to present the distribution of DEGs in each sample, the microarray data associated with the three NCBI-GEO datasets were downloaded, and the relative gene expression values of DEGs were extracted from the microarray data. Based on this, clustering analysis was performed using MultiExperiment Viewer, an open-source genomic analysis software (23). Subsequently, a volcano diagram was developed using the R package ‘ggplot2’. The hierarchical clustering graphs and volcano diagrams are presented in Fig. 1. Hierarchical clustering graphs and volcano diagrams of DEGs of the three data series presented that the DEGs are both upregulated and downregulated in cancerous tissues, but mainly downregulated in cancerous tissues, which suggested that the inactivation of tumor suppressor genes and the activation of oncogenes played important roles in the development of HNSCC, especially the inactivation of tumor suppressors.
Figure 1.

Visual presentation of the differentially expressed genes (|logFC|≥1; P≤0.05) included in the three data series examined in the present study. Hierarchical clustering graphs of genes in the (A) GSE6631, (B) GSE58911 and (C) GSE83519 data series are shown. Rows represent the mRNAs and columns represent the samples. Red and green represent upregulated and downregulated genes in cancerous tissues compared with paired adjacent normal samples, respectively. Volcano plots for (D) GSE6631, (E) GSE58911 and (F) GSE83519 data series are displayed. The x-axis represents the log2 of FC and the y-axis represents the negative log10 of the P-value. Red and grey foci on the graphs represent differently and non-differentially expressed mRNAs, respectively, in head and neck squamous cell carcinoma. FC, fold change.

Nineteen hub genes are screened from the DEGs of the three data series

The common DEGs from the three data series may be associated with HNSCC and were defined as the hub genes. A total of 19 hub genes were extracted using the VENN DIAGRAMS drawing tool (Fig. 2A), and were then visualized as hierarchical clustering graphs for each data series (Fig. 2B-D). Among the 19 hub genes, 11 hub genes (including SPP1, COL4A1, COL1A1, FN1, ITGA6, MMP1, MMP11, LAMC2, FAT1, SERPINE1 and MMP9) were overexpressed in the HNSCC samples compared with the paired corresponding control samples. Conversely, 8 hub genes (including GPD1 L, MYH7, ATP2A1, HOPX, CEACAM1, TMPRSS11D, ABLIM1 and ACTA1) were downregulated in the cancerous samples compared with the controls.
Figure 2.

Hub genes derived from the three data series. (A) Venn diagram demonstrating the intersections of data series GSE6631, GSE58911 and GSE83519. Intersections represent the differentially expressed genes in two or three data series. Hierarchical clustering graph represents the hub genes derived from the data series (B) GSE6631. Hierarchical clustering graphs represent the hub genes derived from the data series (C) GSE58911 and (D) GSE83519.

Relative expression levels of hub genes in HNSCC samples against paired corresponding normal samples are validated using GEPIA

Although the screening of hub genes performed in the current study was vigorous and highly reliable, the relative expression levels of 19 hub genes in tissues were further validated through the online analysis tool GEPIA, which is based on TCGA and GTEx databases (tumor, 519 cases; normal, 44 cases in GEPIA). The results demonstrated that the relative expression trend of the 19 hub genes in GEPIA were consistent with the expression profiles (Fig. 3), supporting the reliability of the data analysis.
Figure 3.

Relative expression levels of hub genes. (A) Upregulated and (B) downregulated genes were identified using Gene Expression Profiling Interactive Analysis (GEPIA) based on The Cancer Genome Atlas database. Red and black boxes represent the relative expression levels of genes in the tumor and normal samples, respectively. The y-axis represents the relative expression levels of genes in terms of log2 (TPM+1) (tumor samples, 519; normal samples, 44 from GEPIA; P≤0.05). TPM, transcripts per million.

Nineteen hub genes are involved in multiple GO terms and KEGG pathways

GO and KEGG pathway analyses of 19 hub genes were further performed using the online analytical tool DAVID (threshold, count ≥3; EASE score, ≤0.05), in order to achieve a preliminary understanding of the biological functions in which these hub genes may participate (Table II). The results of GO analysis indicated that the main biological processes of the selected hub genes consisted of extracellular matrix disassembly or organization, leukocyte migration, collagen catabolic process, cell adhesion, proteolysis, positive regulation of cell migration and angiogenesis. The primary cellular component terms included the extracellular region, extracellular matrix, stress fiber, extracellular space, proteinaceous extracellular matrix, filopodium, extracellular exosome, lamellipodium and perinuclear region of the cytoplasm. Finally, the primary molecular function terms included serine-type endopeptidase activity, metalloendopeptidase activity, actin binding and calcium ion binding. According to the GO terms and the distribution of hub genes, the role of a protein in HNSCC may be preliminarily hypothesized. For instance, MMP9 may participate in extracellular matrix disassembly (GO:0022617) via exertion of metalloendopeptidase activity (GO:0004222) in the extracellular space (GO:0005615). Furthermore, KEGG pathway analysis indicated that the screened hub genes were primarily involved in extracellular matrix-receptor interaction (hsa04512), focal adhesion (hsa04510), the PI3K-Akt signaling pathway (hsa04151), pathways in cancer (hsa05200), small cell lung cancer (hsa05222) and amoebiasis (hsa05146).
Table II.

GO and KEGG pathway analysis of hub genes.

TermCountP-valueGenesFold enrichmentFDR
GOTERM_BP_DIRECT
  GO:0022617-extracellular matrix disassembly61.36×10−8MMP9, MMP11, FN1, MMP1, LAMC2, SPP169.772853191.78×10−5
  GO:0030198-extracellular matrix organization73.87×10−8FN1, SERPINE1, LAMC2, SPP1, COL4A1, ITGA6, COL1A131.563909775.06×10−5
  GO:0050900-leukocyte migration61.48×10−7MMP9, FN1, MMP1, CEACAM1, ITGA6, COL1A143.465056081.94×10−4
  GO:0030574-collagen catabolic process55.64×10−7MMP9, MMP11, MMP1, COL4A1, COL1A169.046052637.37×10−4
  GO:0007155-cell adhesion75.67×10−6FN1, LAMC2, CEACAM1, SPP1, FAT1, ITGA6, COL1A113.478270840.007408
  GO:0006508-proteolysis40.015348MMP9, MMP11, TMPRSS11D, MMP17.07031578918.30441
  GO:0030335-positive regulation of cell migration30.016282LAMC2, ITGA6, COL1A114.4096109819.31187
  GO:0001525-angiogenesis30.023357FN1, SERPINE1, CEACAM111.8895444926.57648
GOTERM_CC_DIRECT
  GO:0005576-extracellular region107.46×10−6MMP9, MMP11, FN1, TMPRSS11D, SERPINE1, MMP1, LAMC2, SPP1, COL4A1, COL1A15.9575024520.007625
  GO:0031012-extracellular matrix67.87×10−6MMP11, FN1, SERPINE1, MMP1, COL4A1, COL1A119.442389760.008049
  GO:0001725-stress fiber30.001279MYH7, ABLIM1, ACTA153.286549711.300056
  GO:0005615-extracellular space70.001379MMP9, FN1, SERPINE1, LAMC2, SPP1, COL1A1, ACTA14.9844879461.401185
  GO:0005578-proteinaceous extracellular matrix40.002179MMP9, MMP11, FN1, MMP114.315789472.205664
  GO:0030175-filopodium30.002199FAT1, ITGA6, ACTA140.527798372.225839
  GO:0070062-extracellular exosome90.003242MMP9, FN1, TMPRSS11D, SERPINE1, CEACAM1, SPP1, FAT1, GPD1L, ACTA13.0709430993.265211
  GO:0030027-lamellipodium30.010687ABLIM1, FAT1, ACTA117.9842105310.40403
  GO:0048471-perinuclear region of cytoplasm40.021939ATP2A1, LAMC2, SPP1, FAT16.17815069120.29296
GOTERM_MF_DIRECT
  GO:0004252-serine-type endopeptidase activity40.00235MMP9, MMP11, TMPRSS11D, MMP113.936842112.240613
  GO:0004222-metalloendopeptidase activity30.006336MMP9, MMP11, MMP123.587796935.938097
  GO:0003779-actin binding30.034758MYH7, ABLIM1, CEACAM19.58784551328.8746
  GO:0005509-calcium ion binding40.038688MMP11, ATP2A1, MMP1, FAT14.95661748531.61489
KEGG_PATHWAY
  hsa04512: ECM-receptor interaction62.08×10−7FN1, LAMC2, SPP1, COL4A1, ITGA6, COL1A136.657824931.93×10−4
  hsa04510: Focal adhesion61.50×10−5FN1, LAMC2, SPP1, COL4A1, ITGA6, COL1A115.481702760.013864
  hsa04151: PI3K-Akt signaling pathway61.78×10−4FN1, LAMC2, SPP1, COL4A1, ITGA6, COL1A19.2441471570.165019
  hsa05200: Pathways in cancer63.29×10−4MMP9, FN1, MMP1, LAMC2, COL4A1, ITGA68.1150910160.304375
  hsa05222: Small cell lung cancer43.65×10−4FN1, LAMC2, COL4A1, ITGA625.013574660.33712
  hsa05146: Amoebiasis46.98×10−4FN1, LAMC2, COL4A1, COL1A120.058055150.644143

GO, Gene Ontology; BP, biological process; CC, cellular component; MF, molecular function; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Construction and analysis of the PPI network are implemented via STRING based on hub genes

In order to analyze the interaction of hub genes, a PPI network was constructed using the online tool STRING. In total, 19 nodes and 29 edges were obtained (average node degree, 3.05; PPI enrichment P-value, 7.55×10−15; meaning of network edges, medium confidence score of 0.4), indicating a possibly high correlation among these hub genes (Fig. 4A).
Figure 4.

PPI networks of hub genes. (A) PPI network, and (B) k-means clustering of the PPI network are displayed. Nodes represent the proteins; colored nodes represent the query proteins and first layer of interactors; white nodes represent the second layer of interactors; edges represent the protein-protein associations; and the line thickness indicates the strength of data. (C) Associations among the expression of hub genes in the PPI networks produced using Gene Expression Profiling Interactive Analysis based on The Cancer Genome Atlas database (P≤0.05). PPI, protein-protein interaction.

To confirm the possibility of interactions in the PPI network, certain nodes of the PPI network were selected according to the following aspects: Edge thickness (line thickness indicates the strength of data support) (Fig. 4A), the results of the PPI clustering (k-means clustering; number of clustering=3; Fig. 4B) and the principle that the expression correlation means possible function interdependency. Correlation analysis of the gene expression levels were further verified using GEPIA, which is based on a large sample size from TCGA and GTEx databases. The selected correlations consisted of CEACAM1-FN1, MYH7-ACTA1, MMP11-MMP1 and ACTA1-LAMC2, and all the correlations were verified using GEPIA (Fig. 4C). The aforementioned evidence presumes that correlations among expression levels suggest that associations exist on the functional level, and it is further suggested that these genes may be associated with the development or progression of HNSCC.

Nine hub genes are positively or negatively associated with overall survival, and may be used as potential tumor markers and prognostic indicators

Based on the screening of tumor markers, overall survival analysis of hub genes was performed using GEPIA. The results suggested that nine hub genes (including SPP1, ITGA6, TMPRSS11D, MMP1, LAMC2, FAT1, ACTA1, SERPINE1 and CEACAM1) exhibited positive or negative associations with the overall survival of patients with HNSCC, and these hub genes may be used as potential tumor markers and prognostic indicators (Fig. 5). Nevertheless, the remaining hub genes had no effect on the prognosis of patients and may be involved in other processes of HNSCC development, which may be attributed to the interaction of a variety of factors in the occurrence of cancer. The nine hub genes validated by the online database GEPIA, correlated with the prognosis of patients. Furthermore, studies regarding the nine hub genes in HNSCC are limited according to literature retrieval in NCBI/PubMed (https://www.ncbi.nlm.nih.gov/pubmed; key words, hub gene and HNSCC, ignore language, publication data and article types). To the best of our knowledge, no previous studies regarding the effects of TMPRSS11D and ACTA1 in HNSCC have been performed to date.
Figure 5.

Association of hub gene expression with overall survival. Curves were produced using Gene Expression Profiling Interactive Analysis based on The Cancer Genome Atlas database (P≤0.05).

Discussion

Microarray and high-throughput sequencing technology provides valuable research information through profile screening of small sample sizes. Gene expression profile microarrays provide data from different samples, allowing for the identification of DEGs using bioinformatics analysis that can be applied in further studies (10,11,29). Microarray data are stored in public databases for sharing and re-mining, including GEO-datasets, TCGA and Oncomine (16,22,30). Through data mining and bioinformatics analysis, researchers are able to use public databases as valuable references for biological research (11,17,31). In the present study, three data series associated with HNSCC (GSE83519, GSE58911 and GSE6631) were identified and then deep mining of genomics data was performed with the aid of bioinformatics software. The three included data series were based on paired samples and the target genes were identified as the common intersection of the three data series. The three data series examined in the present study were all derived from paired tumor and control samples of HNSCC patients, thus increasing the reliability of the study. Furthermore, the common DEGs identified demonstrated the same distribution tendency among the three data series. These common DEGs may be regarded as results from three experiment repetitions and an enlargement of the sample size, which increases the reliability of the gene expression trend. Comprehensive analysis of the above-mentioned databases confirmed the reliability of data with regard to experimental reproducibility and sample size. Furthermore, through screening and database validation, the intersection of the selected DEGs was found to include 19 mRNAs, which were defined as the hub genes and considered to be highly correlated with HNSCC. The 19 hub genes were differentially expressed in HNSCC simultaneously, suggesting that these genes may independently or dependently be involved in the occurrence or development of HNSCC. Thus, GO and KEGG pathway analyses were adopted to perform functional enrichment of hub genes, and these analyses categorized target genes in order to identify genes of interest. Functional enrichment analysis suggested that the highly correlated hub genes may participate in a variety of biological processes. Additionally, a PPI network was constructed based on the principle that proteins with associated expression levels may interact with one another. The PPI network and the network clustering results based on hub genes identified in the present study indicated that MMP11, CEACAM1, MYH7 and ACTA1 were located in the first layer of the network. These genes were suggested to be involved in the PPI network through higher binding, as indicated by the edge thickness, which included CEACAM1-FN1-SERPINE1, MYH7-ACTA1-LAMC2 and MMP11-MMP1-MMP9. In order to further confirm these associations, the correlations among connective nodes were verified based on large sample sizes using GEPIA, following the principle that proteins with associated expression levels may exhibit interactions, including CEACAM1-FN1, MYH7-ACTA1, MMP11-MMP1 and ACTA1-LAMC2. The results suggested that each protein in the primary line of the network exhibited a positive or negative correlation. The results further demonstrated that the hub genes participated in HNSCC through protein-protein interactions. The bioinformatics analysis conducted in the present study provided important references for subsequent research; however, it also has certain limitations. For instance, genes that were differentially expressed among the three data sets were screened as hub genes, which suggested that these genes may have potential research value. However, useful information may be lost in the screening process. For example, TP53, a routine tumor suppressor, was screened out as it was only included in GSE83519, but not in other two databases. However, as a conventional tumor suppressor, TP53 has been demonstrated to be highly mutated and hold important biological significance in HNSCC (32). The principle aim of present study was to screen a number of genes or proteins and provide a reference for clinical diagnosis, treatment and prognosis of HNSCC. In line with the purpose of screening tumor markers and prognostic indicators, analysis of the association of survival with the expression of the 19 hub genes was performed using GEPIA. The results indicated that nine of the hub genes (SPP1, ITGA6, TMPRSS11D, MMP1, LAMC2, FAT1, ACTA1, SERPINE1 and CEACAM1) were identified as significant prognostic indicators of HNSCC. To the best of our knowledge, few studies have investigated these nine hub genes in HNSCC according to literature retrieval using NCBI-PubMed. Previous studies have revealed that the expression level of SPP1 in the plasma is negatively associated with the survival time of the patients with HNSCC (33), and this was consistent with the results of the present study analysis; however, these results remain controversial; another study concerning HNSCC found no correlation between SPP1 expression level and prognosis of HNSCC (34). In addition, it has been reported that ITGA6 and LAMC2 promote the migration and invasion of HNSCC cells (35), and that MMP1 serves a role in the invasiveness of HNSCC (36). Furthermore, the mutation status of FAT1 is associated with the prognosis of HPV-negative HNSCC (37). A previous study also reported that TMPRSS11D expression was reduced in squamous cell carcinogenesis (38), whereas a different study revealed that the expression of this gene was significantly higher in non-small cell lung carcinoma tissues compared with that in adjacent normal tissues (39). Nevertheless, previous studies focusing on TMPRSS11D in HNSCC have not been identified. Notably, the analysis of the present study suggested that the expression level of TMPRSS11D was positively associated with the survival time. Thus, it is necessary to perform further research regarding the role of TMPRSS11D in HNSCC. A recent study reported that CEACAM1 promotes the growth of HNSCC tumors (40). However, to the best of our knowledge, no further studies on the underlying mechanisms have been performed, and there appears to be only one other study reporting that CEACAM1 is overexpressed in oral tumors and correlated with carcinogenesis (41). In addition, previous data analysis of colon cancer demonstrated that ACTA1 may be associated with the methylation status of the oncogenome (42), but no further research has been performed, particularly in HNSCC. Furthermore, a study reported that SERPINE1 promotes cell migration, and overexpression of SERPINE1 was revealed to be associated with poor survival in HNSCC (43). All of the aforementioned studies demonstrate that the mRNAs or proteins identified through bioinformatics analysis in the present study are involved in the biological process of HNSCC, which was in accordance with our hypothesis. Notably, no studies investigating TMPRSS11D and ACTA1 in HNSCC have been identified. Although the aforementioned literature analysis is only based on PubMed search, which is an authoritative and comprehensive database, it also reflects the research status of these genes to a certain extent. Therefore, the present study also provided a practical and feasible direction for further research. Through rigorous statistical analysis, bioinformatics analysis is able to provide theoretical biological mechanisms and promote the development of biological research (13,15). However, further verification is required to confirm the results presented in the current study. In the initial analysis, the sample size of the three enrolled databases was small. If data from the TCGA or Oncomine databases, which contain more samples, are reasonably integrated in the initial analysis, the validity of the results of the present study will be increased. In conclusion, in the current study, a novel set of core genes associated with HNSCC that may serve as potential biomarkers was identified based on a series of bioinformatics analyses and a large sample database. However, further validation is required to verify these results.
  43 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  ONCOMINE: a cancer microarray database and integrated data-mining platform.

Authors:  Daniel R Rhodes; Jianjun Yu; K Shanker; Nandan Deshpande; Radhika Varambally; Debashis Ghosh; Terrence Barrette; Akhilesh Pandey; Arul M Chinnaiyan
Journal:  Neoplasia       Date:  2004 Jan-Feb       Impact factor: 5.715

3.  p120-Catenin Downregulation and PIK3CA Mutations Cooperate to Induce Invasion through MMP1 in HNSCC.

Authors:  Michal Kidacki; Heather L Lehman; Michelle V Green; Joshua I Warrick; Douglas B Stairs
Journal:  Mol Cancer Res       Date:  2017-06-21       Impact factor: 5.852

4.  Prognostic significance of plasma osteopontin in patients with locoregionally advanced head and neck squamous cell carcinoma treated on TROG 02.02 phase III trial.

Authors:  Annette M Lim; Danny Rischin; Richard Fisher; Hongbin Cao; Kathleen Kwok; Daniel Truong; Grant A McArthur; Richard J Young; Amato Giaccia; Lester Peters; Quynh-Thu Le
Journal:  Clin Cancer Res       Date:  2011-11-17       Impact factor: 12.531

5.  CEACAM1 is overexpressed in oral tumors and related to tumorigenesis.

Authors:  Fu-Fang Wang; Bing-Xin Guan; Jing-Yan Yang; Hai-Tao Wang; Cheng-Jun Zhou
Journal:  Med Mol Morphol       Date:  2016-07-27       Impact factor: 2.309

6.  MAML1 and TWIST1 co-overexpression promote invasion of head and neck squamous cell carcinoma.

Authors:  Sima Ardalan Khales; Ehsan Ebrahimi; Eisa Jahanzad; Sahar Ardalan Khales; Mohammad Mahdi Forghanifard
Journal:  Asia Pac J Clin Oncol       Date:  2018-01-15       Impact factor: 2.601

7.  Integrated Analysis of Long Noncoding RNA and mRNA Expression Profile in Advanced Laryngeal Squamous Cell Carcinoma.

Authors:  Ling Feng; Ru Wang; Meng Lian; Hongzhi Ma; Ning He; Honggang Liu; Haizhou Wang; Jugao Fang
Journal:  PLoS One       Date:  2016-12-29       Impact factor: 3.240

8.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

9.  Oncogenic long noncoding RNA landscape in breast cancer.

Authors:  Shouping Xu; Dejia Kong; Qianlin Chen; Yanyan Ping; Da Pang
Journal:  Mol Cancer       Date:  2017-07-24       Impact factor: 27.401

10.  Tumour-suppressive microRNA-29s inhibit cancer cell migration and invasion by targeting laminin-integrin signalling in head and neck squamous cell carcinoma.

Authors:  T Kinoshita; N Nohata; T Hanazawa; N Kikkawa; N Yamamoto; H Yoshino; T Itesako; H Enokida; M Nakagawa; Y Okamoto; N Seki
Journal:  Br J Cancer       Date:  2013-10-03       Impact factor: 7.640

View more
  6 in total

1.  Identification and Validation of Potential Pathogenic Genes and Prognostic Markers in ESCC by Integrated Bioinformatics Analysis.

Authors:  Lu Tang; Yuqiao Chen; Xiong Peng; Yuan Zhou; Hong Jiang; Guo Wang; Wei Zhuang
Journal:  Front Genet       Date:  2020-12-10       Impact factor: 4.599

2.  Clinical significance and potential mechanisms of miR-223-3p and miR-204-5p in squamous cell carcinoma of head and neck: a study based on TCGA and GEO.

Authors:  Lei Zhao; Congzhe Tian; Erbin Xiao; Jinduo Du; Jingwei Liang; Xianghong Chen; Weiwei Chi
Journal:  Open Med (Wars)       Date:  2020-08-03

Review 3.  The remodelling of actin composition as a hallmark of cancer.

Authors:  Rahul Suresh; Roberto J Diaz
Journal:  Transl Oncol       Date:  2021-03-21       Impact factor: 4.243

Review 4.  Cancer-Associated Fibroblasts: Mechanisms of Tumor Progression and Novel Therapeutic Targets.

Authors:  Ralf-Peter Czekay; Dong-Joo Cheon; Rohan Samarakoon; Stacie M Kutz; Paul J Higgins
Journal:  Cancers (Basel)       Date:  2022-02-27       Impact factor: 6.639

5.  Integrated Analysis Reveals ENDOU as a Biomarker in Head and Neck Squamous Cell Carcinoma Progression.

Authors:  Chengzhi Xu; Yunbin Zhang; Yupeng Shen; Yong Shi; Ming Zhang; Liang Zhou
Journal:  Front Oncol       Date:  2021-02-05       Impact factor: 6.244

6.  miR-154 Influences HNSCC Development and Progression through Regulation of the Epithelial-to-Mesenchymal Transition Process and Could Be Used as a Potential Biomarker.

Authors:  Weronika Tomaszewska; Joanna Kozłowska-Masłoń; Dawid Baranowski; Anna Perkowska; Sandra Szałkowska; Urszula Kazimierczak; Patricia Severino; Katarzyna Lamperska; Tomasz Kolenda
Journal:  Biomedicines       Date:  2021-12-13
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.