Literature DB >> 33545929

Prediction and analysis of hub genes between glioblastoma and low-grade glioma using bioinformatics analysis.

Baowei Xu1.   

Abstract

ABSTRACT: Gliomas are an intractable tumor in the central nervous system. The present study aimed to identify the differentially expressed genes (DEGs) between glioblastoma multiforme (GBM) and low-grade gliomas (LGG) in order to investigate the mechanisms of different grades of gliomas. The Cancer Genome Atlas (TCGA) database was used to identify DEGs between GBM and LGG, and 2641 genes have been found differentially expressed. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were used to determine the related functions and pathways of DEGs. Protein-protein interaction (PPI) network extracted a total of 444 nodes and 1953 interactions, and identified the top 6 hub genes in gliomas. The microarray data of the datasets GSE52009 and GSE4412, which were obtained from Gene Expression Omnibus (GEO) database, were used to externally validate DEGs expression levels. Gene Expression Profiling Interactive Analysis (GEPIA) database which was based on TCGA was used to explore the survival of hub genes in LGG and GBM. Additionally, the Oncomine database and Chinese Glioma Genome Atlas (CGGA) database were used to validate the mRNA expression level and prognostic value of hub genes. Gene Set Enrichment Analysis (GSEA) identified further hub genes-related pathways. In summary, through biological information and survival analysis, 6 hub genes may be new biomarkers for diagnosis and for guiding the choice of treatment strategies for different grades of gliomas.
Copyright © 2021 the Author(s). Published by Wolters Kluwer Health, Inc.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33545929      PMCID: PMC7837950          DOI: 10.1097/MD.0000000000023513

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.889


Introduction

Gliomas are the most common primary tumor of the central nervous system.[ According to the histopathological characteristics, gliomas can be divided into 4 grades from low to high malignant degree.[ Low-grade gliomas (LGG) (astrocytomas, oligodendrogliomas, and oligoastrocytomas) are well-differentiated and have a 5-year survival rate of nearly 60%.[ Despite many treatment strategies, glioblastoma multiforme (GBM) is one of the most lethal brain tumors with a median overall survival (OS) of 14.6 months.[ Unfortunately, most LGG will progress to GBM within 5 to 10 years. In this process, the expression of many genes and molecular pathways change.[ So far, O-6-methylguanine-DNA methyltransferase (MGMT) promoter methylation, EGFR alterations and isocitrate dehydrogenase 1 (IDH1) and isocitrate dehydrogenase 2 (IDH2) mutations have been identified as markers in molecular classification of glioblastomas.[ Therefore, it is significant to explore new genes and molecular markers for the treatment of different grades of gliomas. Gene expression profiling analyses have greatly promoted clinical oncology research, mainly in the following aspects: searching for tumor related genes, exploring molecular diagnosis, comparing therapeutic effects and predicting prognosis and recurrence of tumors.[ In recent years, the application of microarray and high-throughput sequencing technology have enlarged gene expression profiling analysis to an unprecedented scale. At the same time, multiple databases and methods are used to study and verify the differentially expressed genes (DEGs), which makes bioinformatics analysis more accurate and reliable.[ In addition, through the comparative analysis of DEGs, especially the study of signaling pathways and interaction networks, we can explore the occurrence, development and transformation mechanisms of malignant tumors.[ In this study, TCGA database was used to identify the DEGs between GBM and LGG. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were used to determine the related functions and pathways of DEGs. Protein–protein interaction (PPI) network extracted important nodes and interactions in gliomas, and identified top 6 hub genes. Subsequently, 2 clinical microarray data (GSE52009 and GSE4412) were obtained from Gene Expression Omnibus (GEO) database. The clinical value of the hub genes was further explored and verified by Gene Expression Profiling Interactive Analysis (GEPIA) and Gene Set Enrichment Analysis (GSEA) based on TCGA database.[ In order to further improve the reliability of this study, Oncomine database and Chinese Glioma Genome Atlas (CGGA) database were introduced for external validation of the mRNA expression level and prognostic value of hub genes.[

Material and methods

Datasets and identification of DEGs

The mRNA expression profiles and clinical data of LGG and GBM patients were obtained from the TCGA data portal (https://portal.gdc.cancer.gov/repository), which grades II and III of gliomas were included in the TCGA-LGG project, whereas grade IV was separated in TCGA-GBM project. Moreover, the datasets from the National Center of Biotechnology Information (NCBI) GEO database (http://www.ncbi.nlm.nih.gov/geo/) was divided into LGG and GBM according to the TCGA standard. The gene expression level and clinical data from TCGA were used for identification of DEGs, and Gene expression profile of GSE52009 and GSE4412 from GEO was used to externally validate the hub gene expression level. The GSE52009, which was based on GPL6480 Agilent-014850 Whole Human Genome Microarray was uploaded by Jiang et al, contained 92 LGG samples and 24 GBM samples. The GSE4412, which was based on GPL96 Affymetrix Human Genome U133A Array was uploaded by Nelson SF, included 26 LGG samples and 59 GBM samples.[ To clarify the DEGs between LGG and GBM, the fold change (FC) >2 (|log2FC| ≥1) and false discovery rate (FDR) < 0.05 were used as the threshold to identify DEGs. Heat map and volcano plot were generated using the R software package pheatmap and ggplot2, respectively.

Functional and pathway enrichment analysis

GO analysis was used to determine the functions including biological pathways (BP), cellular component (CC), and molecular function (MF).[ KEGG pathway enrichment analysis was used to systematically describe relevant pathways for the genes.[ The clusterProfiler package was used to perform GO analysis and KEGG pathway enrichment analysis. False discovery rate (FDR) of <0.05 was used as the cut-off value.

PPI network analysis

The online Search Tool the Retrieval of Interacting Genes (STRING) database (http://www.string-db.org) was used to demonstrate the interaction between various proteins,[ and Cytoscape software (Version 3.7.2, http://www.cytoscape.org/) was used to reconstruct a PPI network. The mapped network was detected the possible relationship among DEGs. Due to the large number of DEGs while FC >2, we identified PPI network using FC>4. The cut-off criterion of interaction confidence score was set as >0.4. The important nodes and hub genes in the network were predicted and explored by CytoHubba, a plugin of Cytoscape, and the top 10 genes were selected from the results of each method. The final top 6 hub genes in different methods were selected for further analysis.

Oncomine database analysis

The expression level of hub genes was identified in the Oncomine database (https://www.oncomine.org), and analyzed the mRNA expression differences between LGG and GBM.[ The threshold was determined according to the following values: P value of .05, fold change of 2.0.

GEPIA database analysis

GEPIA database was used to perform survival analysis which is based on TCGA database (http://gepia.cancer-pku.cn/).[ Survival analyses were carried out to achieve Kaplan–Meier plots. The log-rank test was performed during Kaplan–Meier survival analysis. The two-tailed P value was used in this study, and a P value of less than .05 considered statistically significant.

CGGA database analysis

Chinese Glioma Genome Atlas (CGGA) data (http://www.cgga.org.cn/), a Chinese glioma database which included over 2000 samples from the Chinese cohort, was used for external validation the mRNA expression level and prognostic value of hub genes.[ The results were downloaded from the CGGA website.

GSEA analysis

GSEA was conducted to further identify hub genes-related pathways. GSEA version 4.0.3 software (http://www.broadinstitute.org/gsea) was used to analyze data. The expression level of hub genes was used as the phenotype annotation, and patients in TCGA cohort were divided into low and high categories. The Molecular Signatures Database (MSigDB) of c2 (c2.cp.kegg.v7.1.symbols.gmt) was used to assess the functional differences between the low and high hub gene expression groups.[ The number of permutations was set to 1000 and the phenotype labels were high and low expression. FDR <0.25 and P < .05 were set as the cut-off criteria to confirm significant gene sets.

Results

Identification of DEGs between LGG and GBM

Gene expression profiles from the TCGA database were conducted to analyze DEGs. A total of 529 LGG and 169 GBM samples was involved in our further analysis. Differentially expressed genes between GBM and LGG were identified as explained in the methods. The results revealed that a total of 2641 genes were identified to be differentially expressed, including 1428 up-regulated genes and 1213 down-regulated genes (Fig. 1A,B).
Figure 1

Identification of expression differences between LGG and GBM. (A) Heat-map overview of the differentially expressed genes. (B) Volcano plot of the differential mRNA expression analysis.

Identification of expression differences between LGG and GBM. (A) Heat-map overview of the differentially expressed genes. (B) Volcano plot of the differential mRNA expression analysis.

Functional and pathway enrichment analysis of DEGs

Total, up-regulated and down-regulated DEGs were used for GO and KEGG analysis, respectively. GO analysis showed genes associated with BP were mainly involved in regulation of trans-synaptic signaling in total DEGs, extracellular structure organization in up-regulated DEGs, modulation of chemical synaptic transmission in down-regulated DEGs. Genes associated with CC were mainly involved in synaptic membrane in total DEGs, collagen-containing extracellular matrix in up-regulated DEGs, and synaptic membrane in down-regulated DEGs. And genes associated with MF were mainly involved in channel activity in total DEGs, extracellular matrix structural constituent in up-regulated DEGs, channel activity in down-regulated DEGs (Fig. 2A, C, E). The detailed results were presented in Table 1.
Figure 2

Functional enrichment analyses of (A) total, (C) up-regulated and (E) down-regulated DEGs (GBM vs LGG). Top 10 biological process, cellular component, and molecular function terms for the DEGs., DEGs. KEGG pathways enriched for the (B) total, (D) up-regulated and (F) down-regulated DEGs (GBM vs LGG).

Table 1

Gene Ontology functional enrichment analyses of DEGs associated with LGG and GBM.

CategoryDescriptionCount%P value
Total DEGs
 BPGO:0099177Regulation of trans-synaptic signaling1476.114.98E-26
 BPGO:0050804Modulation of chemical synaptic transmission1466.076.77E-26
 BPGO:0050808Synapse organization1345.578.23E-23
 BPGO:0042391Regulation of membrane potential1325.482.76E-19
 BPGO:0043062Extracellular structure organization1315.446.19E-20
 CCGO:0097060Synaptic membrane1636.461.50E-37
 CCGO:0062023Collagen-containing extracellular matrix1435.672.25E-29
 CCGO:0043025Neuronal cell body1405.551.53E-18
 CCGO:0045211Postsynaptic membrane1254.956.90E-30
 CCGO:0098978Glutamatergic synapse1234.883.15E-25
 MFGO:0015267Channel activity1315.499.84E-16
 MFGO:0022803Passive transmembrane transporter activity1315.491.00E-15
 MFGO:0022838Substrate-specific channel activity1265.289.84E-16
 MFGO:0005216Ion channel activity1235.159.84E-16
 MFGO:0046873Metal ion transmembrane transporter activity1164.862.48E-11
Up regulated DEGs
 BPGO:0043062Extracellular structure organization1118.381.23E-31
 BPGO:0030198Extracellular matrix organization1068.011.25E-33
 BPGO:0042119Neutrophil activation1017.633.85E-19
 BPGO:0002446Neutrophil mediated immunity1007.551.19E-18
 BPGO:0043312Neutrophil degranulation987.401.33E-18
 CCGO:0062023Collagen-containing extracellular matrix1218.756.04E-42
 CCGO:0005788Endoplasmic reticulum lumen785.641.86E-21
 CCGO:0031983Vesicle lumen745.351.63E-16
 CCGO:0060205Cytoplasmic vesicle lumen735.283.16E-16
 CCGO:0034774Secretory granule lumen715.132.94E-16
 MFGO:0005201Extracellular matrix structural constituent614.648.74E-25
 MFGO:0030246Carbohydrate binding463.501.09E-05
 MFGO:0005539Glycosaminoglycan binding453.423.37E-07
 MFGO:0061134Peptidase regulator activity433.276.38E-07
 MFGO:0019838Growth factor binding403.046.69E-12
Down regulated DEGs
 BPGO:0050804Modulation of chemical synaptic transmission12611.629.72E-51
 BPGO:0099177Regulation of trans-synaptic signaling12611.629.72E-51
 BPGO:0042391Regulation of membrane potential11110.247.42E-39
 BPGO:0050808Synapse organization1059.695.55E-37
 BPGO:0050890Cognition787.207.79E-28
 CCGO:0097060Synaptic membrane14712.881.69E-71
 CCGO:0098793Presynapse12611.045.85E-46
 CCGO:0045211Postsynaptic membrane11510.086.77E-58
 CCGO:0098978Glutamatergic synapse1109.642.65E-49
 CCGO:0099572Postsynaptic specialization1079.387.56E-47
 MFGO:0015267Channel activity10810.061.05E-33
 MFGO:0022803Passive transmembrane transporter activity10810.061.08E-33
 MFGO:0022838Substrate-specific channel activity1069.881.09E-34
 MFGO:0005216Ion channel activity1059.795.81E-35
 MFGO:0046873Metal ion transmembrane transporter activity989.131.81E-28
Functional enrichment analyses of (A) total, (C) up-regulated and (E) down-regulated DEGs (GBM vs LGG). Top 10 biological process, cellular component, and molecular function terms for the DEGs., DEGs. KEGG pathways enriched for the (B) total, (D) up-regulated and (F) down-regulated DEGs (GBM vs LGG). Gene Ontology functional enrichment analyses of DEGs associated with LGG and GBM. KEGG pathway enrichment analysis was conducted to further investigate the pathways among these DEGs. As shown in Figure 2B, D, F, the significantly enriched key pathways were neuroactive ligand-receptor interaction, cytokine-cytokine receptor interaction and neuroactive ligand-receptor interaction in total, up-regulated and down-regulated DEGs, respectively. The detailed results were presented in Table 2.
Table 2

Top 10 most KEGG of total, up regulated and down regulated DEGs.

DescriptionCount%P value
Total DEGs
 hsa04080Neuroactive ligand-receptor interaction867.518.47E-07
 hsa04010MAPK signaling pathway685.940.000258
 hsa05205Proteoglycans in cancer595.158.47E-07
 hsa04510Focal adhesion585.078.47E-07
 hsa04360Axon guidance514.451.64E-05
 hsa04145Phagosome504.371.85E-07
 hsa04020Calcium signaling pathway504.37.000157
 hsa04514Cell adhesion molecules (CAMs)474.108.47E-07
 hsa04724Glutamatergic synapse443.841.49E-08
 hsa05322Systemic lupus erythematosus443.848.47E-07
Up regulated DEGs
 hsa05165Human papillomavirus infection328.243.63E-11
 hsa04060Cytokine-cytokine receptor interaction307.067.45E-09
 hsa04145Phagosome286.627.45E-09
 hsa04510Focal adhesion456.622.81E-08
 hsa05205Proteoglycans in cancer226.475.05E-08
 hsa05166Human T-cell leukemia virus 1 infection296.475.05E-08
 hsa05169Epstein-Barr virus infection336.325.11E-08
 hsa05322Systemic lupus erythematosus255.885.24E-08
 hsa05152Tuberculosis445.881.16E-07
 hsa04110Cell cycle434.852.06E-07
Down regulated DEGs
 hsa04080Neuroactive ligand-receptor interaction3313.559.99E-17
 hsa04020Calcium signaling pathway418.825.36E-12
 hsa04724Glutamatergic synapse268.602.97E-11
 hsa04024cAMP signaling pathway248.601.33E-10
 hsa04723Retrograde endocannabinoid signaling407.967.85E-10
 hsa04727GABAergic synapse307.311.25E-09
 hsa05032Morphine addiction317.105.96E-09
 hsa04713Circadian entrainment207.101.09E-08
 hsa04261Adrenergic signaling in cardiomyocytes306.674.99E-08
 hsa04360Axon guidance186.676.05E-08
Top 10 most KEGG of total, up regulated and down regulated DEGs.

Hub genes screening from PPI network

Protein interaction analysis could clarify the important protein and biological modules. PPI network construction was conducted using STRING tool and Cytoscape software. A total of 444 nodes and 1953 interactions were screened from the PPI network (Fig. 3A). CytoHubba contained several algorithms (Table 3), and the top 10 hub genes using MCC, DEGREE, and EPC algorithm were shown in Figure 3B–D. We chose the top 6 hub nodes as hub genes for further analysis, including TIMP metallopeptidase inhibitor 1 (TIMP1), fibronectin 1 (FN1), collagen type I alpha 1 chain (COL1A1), periostin (POSTN), matrix metallopeptidase 9 (MMP9), C-X-C motif chemokine ligand 8 (CXCL8).
Figure 3

PPI network analysis and hub genes identification. (A) Protein-protein interaction networks of the DEGs. The hub genes depended on (B) MMC, (C) DEGREE and (D) EPC algorithms.

Table 3

Hub genes for DEGs ranked in cytoHubba using different methods.

Rank methods in cytoHubba
CatalogMCCMNCDegreeEPCBottleNeckEcCentricityClosenessRadialityBetweennessStress
Hub gene top 10CXCL8FN1FN1FN1CXCL8CXCL8CXCL8CXCL8CXCL8CXCL8
GABBR1CXCL8CXCL8MMP9FN1FN1FN1FN1FN1FN1
HTR1AMMP9MMP9CXCL8MMP9MMP9MMP9MMP9MMP9MMP9
ANXA1TIMP1TIMP1COL1A2GRM5GRM5TIMP1TIMP1GRM5GRM5
SAA1COL1A1GRIN1TIMP1POSTNPOSTNCOL1A1GRM5GRIN1GRIN1
BDKRB2COL1A2COL1A1COL1A1TIMP1TIMP1GRM5ANXA1LOXGABBR1
HRH3GRIN1COL1A2POSTNKCNJ9MDM2LOXCXCL10KCNJ9KCNJ9
CXCL10POSTNPOSTNCOL3A1MDM2NTRK1POSTNLOXTIMP1LOX
GALSERPINE1LOXLOXLOXGRIN1SERPINE1SERPINE1MDM2HTR1A
SSTR2SPP1SERPINE1THBS1NTRK1SNAI2SPP1SPP1SDC1HRH3
PPI network analysis and hub genes identification. (A) Protein-protein interaction networks of the DEGs. The hub genes depended on (B) MMC, (C) DEGREE and (D) EPC algorithms. Hub genes for DEGs ranked in cytoHubba using different methods.

Gene expression level in TCGA and GEO databases

To demonstrate the mRNA expression level in LGG and GBM, we first visualized the expression level in TCGA. As shown in Figure 4A, hub genes were significantly higher in GBM. We further analyzed 6 hub genes using GSE52009 and GSE4412 datasets. Similarly, hub gene mRNA expression levels were higher in GBM tissues, compared to LGG tissues (Fig. 4B,C).
Figure 4

Top 6 hub genes mRNA expression levels in TCGA and GEO database between LGG and GBM. (A) TCGA; (B) GSE52009; (C) GSE4412. Differences between groups were analyzed by the Student t test, ∗P < 0.05; ∗∗P < 0.01, ∗∗∗P < 0.001.

Top 6 hub genes mRNA expression levels in TCGA and GEO database between LGG and GBM. (A) TCGA; (B) GSE52009; (C) GSE4412. Differences between groups were analyzed by the Student t test, ∗P < 0.05; ∗∗P < 0.01, ∗∗∗P < 0.001.

The mRNA expression levels of hub genes in Oncomine database

Oncomine, an online microarray database, was used to validate the expression levels of hub genes in LGG and GBM. There we showed six representative datasets for analyzing hub genes. As shown in Figure 5A, TIMP1 in GBM was high expression in Sun, Liang, Bredel, and Freije datasets. FN1 was more highly expressed in GBM from vandenBoom, Nutt, Sun, and Bredel datasets (Fig. 5B). Up-regulation of COL1A1 was found in GBM based on Sun, vandenBoom, Bredel, and Freije datasets (Fig. 5C). Moreover, POSTN expression level was decreased in LGG tissues from Liang, Sun, Nutt, and Freije datasets (Fig. 5D). Compared to GBM, MMP9 was reduced in LGG, demonstrated by Liang, Bredel, Nutt, and Sun datasets (Fig. 5E). As for CXCL8, Liang, vandenBoom, Freije, and Sun datasets showed high expression levels in GBM, compared to LGG (Fig. 5F).
Figure 5

Top 6 hub gene mRNA expression levels in Oncomine database. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8; (G) The hub genes in human cancers, the number in the colored cell represents the number of studies meeting thresholds. The more stressed red (over-expression) or blue (under-expression) indicates a more highly significant relationship. Differences between groups were analyzed by the Student t test or one way ANOVA.

Top 6 hub gene mRNA expression levels in Oncomine database. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8; (G) The hub genes in human cancers, the number in the colored cell represents the number of studies meeting thresholds. The more stressed red (over-expression) or blue (under-expression) indicates a more highly significant relationship. Differences between groups were analyzed by the Student t test or one way ANOVA. Besides, we used Oncomine dataset to analyze the hub genes expression level differences between tumor and normal tissues. As shown in Figure 5G, the database contained a total of 357, 367, 306, 367, 363, and 370 unique analyses for TIMP1, FN1, COL1A1, POSTN, MMP9, and CXCL8, respectively. In all types of tumors, TIMP1 was ranked with the top 10% of all genes showing statistically significant differences in 121 studies, with 113 studies demonstrated higher expression levels in tumor than normal tissues. Up-regulation of FN1 was found in tumor in 131 studies. 122 significant unique analyses revealed that the mRNA expression level of COL1A1 was higher in tumor. Compared to normal tissues, POSTN was up-regulated in tumors among 95 studies, while reduced expression was found in 18 studies. Higher expression of MMP9 was found in most cancers. As for CXCL8, only 67 studies were listed, and 57 studies showed high level of CXCL8 in tumor tissues. And all hub genes were up-regulated among brain and CNS cancer. Altogether, the transcriptional expression levels of hub genes were significantly up-regulated in GBM, compared to LGG and tumor tissues, compared to normal tissues.

The Kaplan–Meier plotter of hub genes

The website, http://gepia.cancer-pku.cn/, could provide the prognostic data of the hub genes based on TCGA database. And we found that expression of TIMP1 (HR=3, P = 7.8 × 10–9) was associated with worse OS for LGG patients, as well as FN1 (HR = 1.8, P = .0022), COL1A1 (HR = 2, P = .00028), POSTN (HR = 1.7, P = .0065), not MMP9 (HR = 1.4, P = .093) and CXCL8 (HR = 1.3, P = .13) (Fig. 6A–F). Only FN1 and CXCL8 expression levels were associated with worse OS for GBM patients (HR = 1.5, P = .028, HR = 1.4, P = .049), respectively. Furthermore, POSTN expression had a trend for evaluating the prognosis of GBM patients (Fig. 6G–L).
Figure 6

Kaplan-Meier plotters and log-rank tests for the prognostic value of the hub genes in LGG (A-F, A, TIMP1; B, FN1; C, COL1A1; D, POSTN; E, MMP9; F, CXCL8) and GBM (G-L, G, TIMP1; H, FN1; I, COL1A1; J, POSTN; K, MMP9; L, CXCL8).

Kaplan-Meier plotters and log-rank tests for the prognostic value of the hub genes in LGG (A-F, A, TIMP1; B, FN1; C, COL1A1; D, POSTN; E, MMP9; F, CXCL8) and GBM (G-L, G, TIMP1; H, FN1; I, COL1A1; J, POSTN; K, MMP9; L, CXCL8).

Validation in the CGGA database

Next, we used CGGA database, which is based on a Chinese glioma patient information, to validate hub genes expression levels and prognostic value. This database has 3 datasets, including mRNAseq-325, mRNAseq-693 and mRNA array-301. mRNAseq-325 contained 325 samples which involved 182 LGG, 139 GBM and 4 unclear patients (accession number: PRJCA001746, platform: Illumina HiSeq2000 or 2500), mRNAseq-693 contained 693 samples which involved 443 LGG, 249 GBM, and 1 unclear patients (accession number: PRJCA001747, platform: Illumina HiSeq) and mRNA array-301 contained 301 patients which involved 174 LGG, 124 GBM and 2 unclear patients (platform: Agilent Whole Human Genome). We found that 6 hub genes were associated with tumor grade, except for POSTN in grade II and III (Fig. 7).
Figure 7

The hub genes mRNA expression levels in CGGA database. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8. Differences between groups were analyzed by the Student t test, ∗P < 0.05; ∗∗P < 0.01, ∗∗∗P < 0.001, ns, not significant.

The hub genes mRNA expression levels in CGGA database. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8. Differences between groups were analyzed by the Student t test, ∗P < 0.05; ∗∗P < 0.01, ∗∗∗P < 0.001, ns, not significant. As shown in Figure 8, the Kaplan–Meier survival analysis showed that all 6 hub genes were significantly associated with OS, a higher expression level in patients with shorter OS. Furthermore, we evaluated the prognostic value of hub genes in different tumor grade groups. Statistically significant difference was found in patients with grade III and IV group, and no difference was found in tumor grade II group for TIMP1, as well as FN1 and MMP9 (Fig. 8A, B, E). For COL1A1, it only showed the prognostic value in patients with tumor grade II (Fig. 8C). POSTN could predict patients with tumor grade II and III, which was similar with TCGA database (Fig. 8D). The expression level of CXCL8 showed the statistically significant difference in patients with tumor grade IV, which met the same result in TCGA database (Fig. 8F).
Figure 8

Kaplan-Meier plotters and log-rank tests for the prognostic value of hub genes in CGGA database. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8.

Kaplan-Meier plotters and log-rank tests for the prognostic value of hub genes in CGGA database. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8.

Identification of hub genes associated biological pathways

GSEA analysis was performed to determine the biologic characteristics of hub genes using TCGA database. As shown in Figure 9A, TIMP1 regulated biology process mainly associated with sugar metabolism, glutathione metabolism, and leukocyte transendothelial migration, indicating that TIMP1 might regulate metabolism and cell metastasis. Similar results were obtained in COL1A1, POSTN, MMP9, and CXCL8 group (Fig. 9C–F). It was interesting to find that FN1 might function in cell cycle-related pathways, including glycan biosynthesis, cell cycle, and cell apoptosis (Fig. 9B).
Figure 9

Signaling pathways associated with hub genes predicted by GSEA. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8.

Signaling pathways associated with hub genes predicted by GSEA. (A) TIMP1; (B) FN1; (C) COL1A1; (D) POSTN; (E) MMP9; (F) CXCL8.

Discussion

The formation of tumor and the increase of malignant degree are caused by the accumulation of many gene alterations.[ As a common malignant tumor in the brain, up to 70% of LGG transform into HGG within 10 years. This transformation process involves changes in many genes and molecular pathways.[ This study is conducted to determine the DEGs between LGG and GBM through bioinformatics analysis. And further explore and study the molecular functional pathways involved in these DEGs, as well as their impact on tumor prognosis. Therefore, it is possible to use these DEGs as biomarkers and therapeutic targets in clinical practice, which can help the diagnosis and treatment of gliomas, and improve the prognosis of GBM. The 5-year survival rate and overall survival time of patients with LGG and GBM are significantly different.[ GBM grows rapidly, has high invasiveness, is easy to cause neurological symptoms, not sensitive to chemotherapy, and easy to relapse after treatment.[ GBM can be divided into primary and secondary GBM, which are difficult to be differentiated histologically. Secondary GBM develops from LGG. Existing studies have shown that IDH mutation is a clear molecular marker for the diagnosis of secondary GBM, and it is more accurate than clinical and pathological diagnostic criteria.[ In addition, TP53 mutation and 19q deletion are more common gene changes in secondary GBM.[ Although there are some differences between primary and secondary GBM, they still have similar gene expression profiles.[ However, there are great differences between LGG and GBM in terms of biological behavior, histology, and gene expression profiles.[ Therefore, it is necessary to use the existing biological information analysis to further explore the differences between LGG and GBM, and help clinical diagnosis and treatment. We used GO analysis to further clarify the functions of DEGs in biological pathways, cellular component and molecular function. Down-regulated DEGs are mainly related to the synaptic membrane structure and the activity of membrane channel, and participate in the regulation of trans-synaptic signaling. This result revealed that the down-regulated DEGs between LGG and GBM control the biological behavior of gliomas by regulating signals among different neurons. However, the up-regulated DEGs increased the malignant degree of glioblastoma mainly by changing the extracellular matrix, cytokines, and microenvironment. The above results showed that up-regulated and down-regulated DEGs changed the molecular functions, signal transduction and biological behavior of LGG and GBM through different biological pathways. The results of KEGG pathway analysis are consistent with GO analysis. The significantly enriched key pathways were cytokine-cytokine receptor interaction and neuroactive ligand-receptor interaction in up-regulated and down-regulated DEGs, respectively. PPI network analysis was used to reveal the interactions between proteins encoded by DEGs. We identified PPI network using FC>4 to focus on DEGs with more clinical significance. Similarly, we set the cut-off criterion of interaction confidence to be greater than 0.4. The important nodes and hub DEGs in the PPI network were predicted and explored by CytoHubba. Finally, we screened out the 6 most important DEGs from PPI network analysis, and made a detailed and in-depth study on them. The 6 hub genes we focused on have been described above, as shown below: TIMP1, FN1, COL1A1, POSTN, MMP9, and CXCL8. The expression level of TIMP1 increased significantly in malignant tumors, which can inhibit the apoptosis of tumor cells. The high expression of this gene correlates with a poor prognosis of the patients.[ In malignant gliomas, TIMP1 is related to the decomposition of extracellular matrix and can promote the invasion and movement of tumor cells.[ Fibronectin-1 encoded by FN1 is an important cell adhesion molecule in the extracellular matrix. Fibronectin-1 has the functions of regulating cell adhesion, growth and differentiation, promoting cell migration and proliferation, as well as ion exchange and information transfer.[ FN1 plays an important role in the invasion and metastasis of malignant tumors, and it is also one of the current research hotspots. At present, there is a few studies focusing on FN1 and gliomas.[ The COL1A1 gene provides instructions for making part of a large molecule called type I collagen.[ A recent study has shown that this gene is related to the poor prognosis of malignant gliomas.[ By knocking down of COL1A1 in invasive gliomas, the progression of tumor can be reduced and the survival rate of experimental animals can be significantly improved.[ POSTN encodes an unstructured extracellular matrix protein called periostin. The protein can interact with multiple integrins and participate in cell proliferation, cell migration, and epithelial-mesenchymal transition.[ This gene is expressed in the process of inflammation and many kinds of cancers, and is related to the resistance to antiangiogenic therapy and poor prognosis of gliomas.[ The overexpression of MMP9 can also promote the invasion and progression of gliomas, and is related to poor prognosis.[ High expression of CXCL8 in gliomas with release more cytokines, stimulate inflammation, promote cell proliferation, and lead to tumor progression.[ The mechanism by which CXCL8 works is related to inflammatory stimulation, tumor angiogenesis, and JAK/STAT1/HIF-1α/Snail pathway activation.[ Oncomine database is one of the largest cancer gene chip databases in the world, and is committed to the data standardization and analysis of gene expression profile data of tumor samples.[ Oncomine database can be used to analyze gene expression differences, predict co-expressed genes, and classify according to the clinical information such as tumor staging and tissue types.[ CGGA database is the largest functional genomics database of gliomas in Asia. Its information covers matched samples of different pathological types and malignant degrees. In this study, the expression level and prognostic value of hub genes were validated by the above 2 databases.[ Through multiple databases and multiple methods of interactive verification, we can improve the reliability and accuracy of bioinformatics analysis in this study, and reduce the research deviation caused by single database analysis. However, there are several limitations in this study. The results above, identified in TCGA database and validated in GEO, Onocomine database, and CGGA datasets, were not validated using new tissue samples. Second, the functions of hub genes were not functionally tested, which would be conducted in our further studies.

Conclusion

This study carried out a systematic bioinformatics analysis of DEGs between LGG and GBM. The results provide potential biomarkers and therapeutic targets for gliomas. However, further experimental verification is needed to directly determine the role of these DEGs in glioma.

Author contributions

BX designed the study, collected the data, performed the bioinformatics analysis, analyzed the data and wrote the manuscript. Writing – original draft: Baowei Xu.
  39 in total

1.  Gene expression profiling of gliomas strongly predicts survival.

Authors:  William A Freije; F Edmundo Castro-Vargas; Zixing Fang; Steve Horvath; Timothy Cloughesy; Linda M Liau; Paul S Mischel; Stanley F Nelson
Journal:  Cancer Res       Date:  2004-09-15       Impact factor: 12.701

2.  Expression of FAS-L Differs from Primary to Relapsed Low-grade Gliomas and Predicts Progression-free Survival.

Authors:  Jan-Michael Werner; Saskia Kuhl; Pantelis Stavrinou; Gabriele Röhn; Boris Krischek; Tobias Blau; Roland Goldbrunner; Marco Timmer
Journal:  Anticancer Res       Date:  2017-12       Impact factor: 2.480

3.  Role of fibronectin-stimulated tumor cell migration in glioma invasion in vivo: clinical significance of fibronectin and fibronectin receptor expressed in human glioma tissues.

Authors:  T Ohnishi; S Hiraga; S Izumoto; H Matsumura; Y Kanemura; N Arita; T Hayakawa
Journal:  Clin Exp Metastasis       Date:  1998-11       Impact factor: 5.150

Review 4.  The role of interleukin-8 and its receptors in gliomagenesis and tumoral angiogenesis.

Authors:  Daniel J Brat; Anita C Bellail; Erwin G Van Meir
Journal:  Neuro Oncol       Date:  2005-04       Impact factor: 12.300

5.  Prognostic value of O-6-methylguanine-DNA methyltransferase (MGMT) protein expression in glioblastoma excluding nontumour cells from the analysis.

Authors:  R H Dahlrot; J Dowsett; S Fosmark; A Malmström; R Henriksson; H Boldt; K de Stricker; M D Sørensen; H S Poulsen; M Lysiak; P Söderkvist; J Rosell; S Hansen; B W Kristensen
Journal:  Neuropathol Appl Neurobiol       Date:  2017-06-28       Impact factor: 8.090

6.  Plasma levels of tissue inhibitor of matrix metalloproteinase-1 correlate with diagnosis and prognosis of glioma patients.

Authors:  Yi Lin; Jiang-Fei Wang; Guang-Zu Gao; Guo-Zhen Zhang; Fei-Long Wang; Yun-Jie Wang
Journal:  Chin Med J (Engl)       Date:  2013-11       Impact factor: 2.628

7.  T11TS inhibits glioma angiogenesis by modulation of MMPs, TIMPs, with related integrin αv and TGF-β1 expressions.

Authors:  Manoj Kumar Singh; Debanjan Bhattacharya; Suhnrita Chaudhuri; Sagar Acharya; Pankaj Kumar; Pranjal Santra; Anjan Kumar Basu; Swapna Chaudhuri
Journal:  Tumour Biol       Date:  2013-11-19

8.  Expression of lncRNAs in Low-Grade Gliomas and Glioblastoma Multiforme: An In Silico Analysis.

Authors:  Brian J Reon; Jordan Anaya; Ying Zhang; James Mandell; Benjamin Purow; Roger Abounader; Anindya Dutta
Journal:  PLoS Med       Date:  2016-12-06       Impact factor: 11.069

9.  The Identification of Key Genes and Pathways in Glioma by Bioinformatics Analysis.

Authors:  Mingfa Liu; Zhennan Xu; Zepeng Du; Bingli Wu; Tao Jin; Ke Xu; Liyan Xu; Enmin Li; Haixiong Xu
Journal:  J Immunol Res       Date:  2017-12-06       Impact factor: 4.818

10.  RelB, a good prognosis predictor, links cell-cycle and migration to glioma tumorigenesis.

Authors:  Feng Shen; Qing Guo; Qi Hu; Ailiang Zeng; Weining Wu; Wei Yan; Yongping You
Journal:  Oncol Lett       Date:  2018-01-29       Impact factor: 2.967

View more
  2 in total

1.  Analysis of the key prognostic genes and potential traditional Chinese medicine therapeutic targets in glioblastoma based on bioinformatics and network pharmacology methods.

Authors:  Zhiyu Xia; Peng Gao; Yu Chen; Lei Shu; Lei Ye; Hongwei Cheng; Xingliang Dai; Yangchun Hu; Zhongyong Wang
Journal:  Transl Cancer Res       Date:  2022-05       Impact factor: 0.496

2.  Glioblastoma gene network reconstruction and ontology analysis by online bioinformatics tools.

Authors:  Natalya V Gubanova; Nina G Orlova; Arthur I Dergilev; Nina Y Oparina; Yuriy L Orlov
Journal:  J Integr Bioinform       Date:  2021-11-16
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.