Literature DB >> 36197270

Bioinformatic identification of differentially expressed genes associated with hepatocellular carcinoma prognosis.

Xu Huang1, Xu Wang2, Ge Huang3, Ruotao Li4, Xingkai Liu1, Lidong Cao5, Junfeng Ye1, Ping Zhang1.   

Abstract

Hepatocellular carcinoma (HCC) is still a significant global health problem. The development of bioinformatics may provide the opportunities to identify novel therapeutic targets. This study bioinformatically identified the differentially expressed genes (DEGs) in HCC and associated them with HCC prognosis using data from published databases. The DEGs downloaded from the Gene Expression Omnibus (GEO) website were visualized using the Venn diagram software, and then subjected to the GO and KEGG analyses, while the protein-protein interaction network was analyzed using Cytoscape software with the Search Tool for the search tool for the retrieval of interacting genes and the molecular complex detection plug-in. Kaplan-Meier curves and the log rank test were used to associate the core PPI network genes with the prognosis. There were 57 upregulated and 143 downregulated genes in HCC samples. The GO and pathway analyses revealed that these DEGs are involved in the biological processes (BPs), molecular functions (MFs), and cell components (CCs). The PPI network covered 50 upregulated and 108 downregulated genes, and the core modules of this PPI network contained 34 upregulated genes. A total of 28 of these upregulated genes were associated with a poor HCC prognosis, 27 of which were highly expressed in HCC tissues. This study identified 28 DEGs to be associated with a poor HCC prognosis. Future studies will investigate their possible applications as prognostic biomarkers and potential therapeutic targets for HCC.
Copyright © 2022 the Author(s). Published by Wolters Kluwer Health, Inc.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36197270      PMCID: PMC9509045          DOI: 10.1097/MD.0000000000030678

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.817


1. Introduction

Hepatocellular carcinoma (HCC) is the fifth most commonly diagnosed human cancer in the world,[ accounting for approximately 841,080 new annual cases and approximately 781,631 cancer-related deaths globally.[ In China, there were 461,600 new HCC cases and 422,100 HCC-related deaths in 2015.[ Clinically, HCC patients are usually diagnosed at an intermediate to an advanced stage of disease, thus making total surgical tumor resection or percutaneous ablation unlikely; thus, liver transplantation followed by radiotherapy and transcatheter arterial chemoembolization has become the best treatment of choice.[ Unfortunately, HCC has a high recurrence rate even after a successful surgical resection of tumor lesion(s), which is likely due to the insensitivity of HCC to chemotherapy and radiotherapy after surgery.[ Therefore, further studies of the underlying mechanisms of HCC development could help us to identify and evaluate the novel HCC biomarkers for prediction of treatment outcomes and prognosis of HCC patients. In recent years, the high-throughput sequencing and bioinformatics research have sped up the analytical process and understanding of gene alterations in human diseases, including cancer.[ Using microarray technology, researchers have revealed many differentially expressed genes (DEGs) involved in HCC initiation, development, progression, and metastasis. For example, Ally et al have shown that the wingless and Int-1 signaling pathway as well as mouse double minute 4 homolog, MET, vascular endothelial growth factor, myeloid cell leukemia protein 1, isocitrate dehydrogenase 1, and telomerase reverse transcriptase are aberrantly expressed or activated, suggesting that these genes could be potential therapeutic targets for HCC; whereas aberrant expression of cytotoxic T-lymphocyte-associated protein 4, programmed cell death protein 1 and programmed death-ligand 1, which are all important immune checkpoint proteins, was not observed in the control tissues.[ In addition, Mou et al have identified differentially expressed miRNAs, like hsa-mir-221, and its interactions with estrogen receptor alpha and C-X-C motif chemokine 12 as potential targets for early HCC detection and treatment.[ Moreover, many other bioinformatics studies have been conducted recently, further improving the bioinformatics methods and providing a better understanding of the underlying HCC mechanisms.[ Therefore, in this study, we searched and downloaded the GSE101685, GSE62232, and GSE112790 datasets from the NCBI Gene Expression Omnibus (GEO) website and identified DEGs and then performed GO and KEGG pathway enrichment analyses. Thereafter, we also established the protein–protein interaction (PPI) network using molecular complex detection for these DEGs and then associated the core DEGs with the HCC prognosis using the Kaplan–Meier plotter and re-analyzed these core DEGs. This study could provide a novel insight into HCC for the future development of novel biomarkers and therapeutic targets for HCC.

2. Methods

2.1. Collection of public gene expression data

This study searched and downloaded 3 gene expression profiles from HCC and the corresponding normal specimens (GSE101685, GSE62232, and GSE112790) from the GEO database (https://www.ncbi.nlm.nih.gov/gds), data of which obtained from analysis of the GPL570 platform ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array; Affymetrix, Santa Clara, CA, USA) in 24 cases of HCCs versus 8 normal liver, 81 HCCs versus 10 normal liver, and 183 HCC versus 15 normal liver specimens, respectively, resulting in a total of 321 samples (288 HCC and 32 nontumor liver tissue samples).

2.2. Identification of DEGs

DEGs in HCC versus normal liver tissue specimens were identified by using the GEO2R online tools.[ Specifically, the raw data were first analyzed using the online Venn diagram software (http://bioinformatics.psb.ugent.be/webtools/Venn/) to identify DEGs from these 3 datasets. If the DEGs met the criteria of |log fold change (FC)| ≥ 2.0 and adjusted P-value < .05, these genes were considered as upregulated genes, whereas the DEGs with a log FC value < 0 were considered as a downregulated ones.

2.3. The GO and KEGG analyses

The GO term analysis was applied to define genes and their products, mRNA, or proteins in order to evaluate the unique biological signaling of the high-throughput transcriptome or genomic data.[ The KEGG database was used to integrate the currently known PPI network information for the metabolism, genetic information processing, environmental information processing, cellular processes, organismal systems, human diseases, and drug development[ using the database for annotation, visualization and integrated discovery 6.8 (https://david.ncifcrf.gov), an online bioinformatics tool designed to interpret biological function annotations of genes or proteins.[ In our study, we utilized the database for annotation, visualization and integrated discovery to identify the enriched biological processes (BPs), molecular functions (MFs), and cellular components (CCs) as well as pathways with P < .05 as the cutoff criterion for these DEGs in HCC.

2.4. The PPI network construction and analysis

We used an online Search Tool of the search tool for the retrieval of interacting genes (the Retrieval of Interacting Genes) to assess the potential interactions of these DEG-related proteins.[ The cutoff criteria were as follows: required confidence level ≥ 0.4, while the maximum interactor number = 0.[ We then used Cytoscape software to visualize the PPI network using the molecular complex detection, version 1.31 functions (the degree cutoff value = 2; the maximum depth = 100; k-core = 2; and node score cutoff = 0.2 as significance).

2.5. Analysis of HCC prognosis and GEPIA confirmation of DEGs

We conducted HCC survival analysis using the European Genome-phenome Archive, The Cancer Genome Atlas, and GEO databases[ using the Kaplan–Meier plotter (www.kmplot.com), which is capable of assessing the effects of 54,000 genes for association with the prognosis of 21 cancer types in the HGU133 Plus 2.0 array, with mean follow-up time of up to 69 months.[ After that, we performed GEPIA via an interactive website (http://gepia.cancer-pku.cn)[ and further assessed these DEGs values in HCC.

2.6. Statistical analysis

The data were downloaded and analyzed using the R statistical software (version 4.0.2[;). All other analyses were incorporated directly in the above subtitles. Overall, a P value <.05 was considered statistically significant.

3. Results

3.1. Identification of HCC-related DEGs

In this study, we searched and downloaded 3 gene expression profiles (GSE101685, GSE62232, and GSE112790 containing 288 HCC and 32 normal liver samples) from the GEO database and obtained 723, 385, and 444 DEGs, respectively. We then used the cutoff value of |log FC| ≥ 2.0 or < 0 and P < .05 to identify a total of 200 DEGs (57 upregulated and 143 downregulated genes; Fig. 1 and Table 1).
Figure 1.

The Venn diagram of 200 DEGs. Illustration of the 200 DEGs from the 3 datasets (GSE101685, GSE62232, and GSE112790), which downloaded from PubMed database analyzed by using Venn diagram software (http://bioinformatics.psb.ugent.be/webtools/Venn/). The various colors represent different datasets. (A) The 57 upregulated DEGs (log FC > 0). (B) The 143 downregulated DEGs (log FC < 0). DEGs = the differentially expressed genes, FC = fold change.

Table 1

Identification of DEGs in HCC tissue samples versus normal livers.

ExpressionGenes
Upregulated DEGs(n = 57)CDK1, TYMS, FAM72A///FAM72D///FAM72B///FAM72C, SPINK1, UBD///GABBR1, RKAA2, CAP2, DTL, RACGAP1, FAM83D, CTHRC1, UHRF1, RRM2, ZWINT, CCNB1, NDC80, TOP2A, KIAA0101, ASPM, HELLS, FLVCR1, HMMR, CCNA2, CD24, TTK, CDKN3, AKR1B10, PBK, NCAPG, GINS1, GPC3, CDKN2C, SULT1C2, CCL20, ROBO1, SPP1, CENPU, PRR11, LOC101930489///MIR4435-2HG///LINC00152, NEK2, ANLN, ACSL4, APOBEC3B, BIRC5, KIF20A, AURKA, UBE2T, DUXAP10, CRNDE, NUSAP1, NQO1, BUB1B, MAD2L1, COL15A1, DLGAP5, ECT2, PRC1
Downregulated DEGs(n = 143)HBA2///HBA1, MT1G, CYP4A22///CYP4A11, LECT2, TUBE1, CYP26A1, BBOX1, PLG, CYP2A6, SOCS2, LINC01093, CYP2C8, CXCL14, SLC22A1, IGF1, SULT1E1, CYP39A1, SPP2, HAO2, LINC01554, FAM134B, MT1F, SLC25A47, MFSD2A, FLJ22763, HHIP, APOA5, ADH1B, KCND3, KCNN2, SLCO1B3, SLC10A1, SLC1A2, GSTZ1, PRG4, LY6E, ASPA, CYP1A2, INS-IGF2///IGF2, MT1E, CNDP1, MAN1C1, BCO2, FOLH1B///FOLH1, FCN3, ACSM3, GBA3, CYP2C19, PDGFRA, ANXA10, TTC36, LOC100287413///GLYATL1, CLEC4G, CYP2B6, GYS2, KBTBD11, FOLH1B, KMO, LPA, GHR, CLEC1B, MIR675///H19, CXCL2, LIFR, CLRN3, CYP2C9, CFHR3, MARCO, BHMT, CYP2A7, CYP2E1, EXPH5, MT1H, LCAT, CTH, CLEC4M, VNN1, LYVE1, ESR1, HSD11B1, RSPO3, IGFBP3, FOS, LOC101928916///NNMT, PLAC8, ALDOB, HAMP, DNASE1L3, DCN, NAT2, BCHE, CPEB3, IL1RAP, RDH16, AKR1D1, CYP8B1, CXCL12, GNMT, TMEM27, HPGD, CRHBP, DNAJC12, MFAP3L, MME, AVPR1A, WDR72, THRSP, CYP4A11, IDO2, HGFAC, IGFALS, MT1X, MT2A, ADGRG7, S100A8, C7, CYP3A43, PZP, FBP1, AADAT, ADH4, GPM6A, OIT3, HGF, MOGAT2, MT1M, MAGI2-AS3, CYP3A4, GLYAT, CYP2B7P///CYP2B6, CETP, GLS2, SRD5A2, ADRA1A, ECM1, APOF, HBB, MT1HL1, C9, SRPX, FCN2, OAT, LINC00844

The upregulated DEGs were identified by the criteria of log2 fold change (FC) ≥ 2.0 and adjusted P value <.05, and the number of the DEGs was 57. The downregulated DEGs were identified by the criteria of log fold change (FC) ≤ 2.0 and adjusted P value <.05, and the number of the DEGs was 143.

DEGs = the differentially expressed genes

Identification of DEGs in HCC tissue samples versus normal livers. The upregulated DEGs were identified by the criteria of log2 fold change (FC) ≥ 2.0 and adjusted P value <.05, and the number of the DEGs was 57. The downregulated DEGs were identified by the criteria of log fold change (FC) ≤ 2.0 and adjusted P value <.05, and the number of the DEGs was 143. DEGs = the differentially expressed genes The Venn diagram of 200 DEGs. Illustration of the 200 DEGs from the 3 datasets (GSE101685, GSE62232, and GSE112790), which downloaded from PubMed database analyzed by using Venn diagram software (http://bioinformatics.psb.ugent.be/webtools/Venn/). The various colors represent different datasets. (A) The 57 upregulated DEGs (log FC > 0). (B) The 143 downregulated DEGs (log FC < 0). DEGs = the differentially expressed genes, FC = fold change.

3.2. DEGs GO terms and KEGG pathways

These 200 DEGs were analyzed for the GO terms, that is, the upregulated DEGs were enriched in the BPs terms for different cell processes, like cell division or mitotic cytokinesis. In contrast, the downregulated DEGs were enriched in other cell pathways, like the epoxygenase P450 pathway, the oxidation-reduction process,. In the MFs, the upregulated DEGs were enriched in protein kinase binding or protein kinase activity, whereas the downregulated DEGs were enriched in oxygen binding, heme binding, and steroid hydroxylase activity. For the CC GO term, the upregulated DEGs were enriched in the midbody, cytoplasm, spindle microtubule, and centrosome; whereas the downregulated DEGs were enriched in the organelle membrane, extracellular region, endoplasmic reticulum membrane, and extracellular exosome (Table 2).
Table 2

GO terms of the DEGs in HCC tissue samples.

ExpressionCategoryTermCount% P FDR
Upregulated DEGsGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_CC_DIRECTGOTERM_CC_DIRECTGOTERM_CC_DIRECTGOTERM_CC_DIRECTGOTERM_CC_DIRECTGO:0051301–cell divisionGO:0000281–mitotic cytokinesisGO:0000910–cytokinesisGO:0051988–regulation of attachment of spindle microtubules to kinetochoreGO:0007067–mitotic nuclear divisionGO:0007052–mitotic spindle organizationGO:0019901–protein kinase bindingGO:0004672–protein kinase activityGO:0008017–microtubule bindingGO:0003682–chromatin bindingGO:0030496–midbodyGO:0005737–cytoplasmGO:0005876–spindle microtubuleGO:0005634–nucleusGO:0005813–centrosome844353554682541969.334.664.663.505.833.505.835.834.666.999.3329.14.6722.16.996.73E-077.09E-051.40E-041.68E-044.36E-04.0039.99E-058.04E-04.001.0021.79E-081.32E-061.51E-049.04E-04.0039.15E-040.0960.1900.2290.5913.6560.1050.8431.3672.4301.95E-050.0010.1640.9812.747
Downregulated DEGsGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_BP_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_MF_DIRECTGOTERM_CC_DIRECTGOTERM_CC_DIRECTGOTERM_CC_DIRECTGO:0019373–epoxygenase P450 pathwayGO:0055114–oxidation-reduction processGO:0042738–exogenous drug catabolic processGO:0017144–drug metabolic processGO:0045926–negative regulation of growthGO:0071294–cellular response to zinc ionGO:0006805–xenobiotic metabolic processGO:0008202–steroid metabolic processGO:0019825–oxygen bindingGO:0020037–heme bindingGO:0005506–iron ion bindingGO:0004497–monooxygenase activityGO:0008392–arachidonic acid epoxygenase activityGO:0008395–steroid hydroxylase activityGO:0031090–organelle membraneGO:0005576–extracellular regionGO:0072562–blood microparticle92478771081216161288153495.7215.34.455.084.454.456.355.087.6310.210.27.635.085.089.5321.65.722.53E-133.94E-111.16E-107.07E-103.26E-093.26E-095.01E-092.33E-082.06E-144.59E-142.39E-132.52E-134.72E-124.52E-108.33E-166.75E-091.00E-053.93E-106.11E-081.80E-071.10E-065.06E-065.06E-067.77E-063.61E-052.85E-116.33E-113.30E-103.49E-106.52E-096.25E-071.02E-127.78E-060.012
GOTERM_CC_DIRECTGOTERM_CC_DIRECTGO:0005789–endoplasmic reticulum membraneGO:0070062–extracellular exosome193612.122.92.38E-052.29E-040.0270.263

The GO term analysis was applied to define genes and their products, mRNA, or proteins in order to evaluate the unique biological signaling of the high-throughput transcriptome or genomic data. The table was utilized DAVID to identify the enriched biological processes (BPs), molecular functions (MFs), and cellular components (CCs) as well as pathways with P < .05 as the cutoff criterion for these DEGs in HCC.

DAVID = the database for annotation, visualization and integrated discovery, FDR = false discovery rate, GO = gene ontology.

GO terms of the DEGs in HCC tissue samples. The GO term analysis was applied to define genes and their products, mRNA, or proteins in order to evaluate the unique biological signaling of the high-throughput transcriptome or genomic data. The table was utilized DAVID to identify the enriched biological processes (BPs), molecular functions (MFs), and cellular components (CCs) as well as pathways with P < .05 as the cutoff criterion for these DEGs in HCC. DAVID = the database for annotation, visualization and integrated discovery, FDR = false discovery rate, GO = gene ontology. Furthermore, the KEGG analysis revealed that the upregulated DEGs were enriched in chemical carcinogenesis, retinol metabolism, drug metabolism - cytochrome P450, metabolism of xenobiotics by cytochrome P450, and the cell cycle; however, the downregulated DEGs did not significantly associated with (formed) any signaling (P < .05; Table 3).
Table 3

KEGG pathways of the DEGs in HCC tissue samples.

Pathway IDNameCount% P Genes
hsa05204Chemical carcinogenesis124.932.43E-08CYP3A4, CYP3A43, CYP2C19, CYP2C9, CYP2C8, ADH4, NAT2, HSD11B1, ADH1B, CYP2A6, CYP2E1, CYP1A2
hsa00830Retinol metabolism104.114.02E-07CYP3A4, CYP2B6, CYP2C9, CYP2C8, ADH4, ADH1B, CYP26A1, CYP2A6, CYP1A2, RDH16
hsa00982Drug metabolism - cytochrome P450104.116.84E-07CYP3A4, CYP2C19, CYP2B6, CYP2C9, CYP2C8, ADH4, ADH1B, CYP2A6, CYP2E1, CYP1A2
hsa00980Metabolism of xenobiotics by cytochrome P45093.371.38E-05CYP3A4, CYP2B6, CYP2C9, ADH4, HSD11B1, ADH1B, CYP2A6, CYP2E1, CYP1A2
hsa04110Cell cycle72.880.01CCNB1, CDK1, MAD2L1, CDKN2C, TTK, BUB1B, CCNA2

The KEGG database was used to integrate the currently known PPI network information for the metabolism, genetic information processing, environmental information processing, cellular processes, organismal systems, human diseases, and drug development using DAVID 6.8 (https://david.ncifcrf.gov), an online bioinformatics tool designed to interpret biological function annotations of genes or proteins.

DAVID = the database for annotation, visualization and integrated discovery, HCC = hepatocellular carcinoma, KEGG = Kyoto Encyclopedia of Genes and Genomes.

KEGG pathways of the DEGs in HCC tissue samples. The KEGG database was used to integrate the currently known PPI network information for the metabolism, genetic information processing, environmental information processing, cellular processes, organismal systems, human diseases, and drug development using DAVID 6.8 (https://david.ncifcrf.gov), an online bioinformatics tool designed to interpret biological function annotations of genes or proteins. DAVID = the database for annotation, visualization and integrated discovery, HCC = hepatocellular carcinoma, KEGG = Kyoto Encyclopedia of Genes and Genomes.

3.3. Identification of DEGs-related PPI network

Since PPIs are key for gene-gene interactions in cells, we obtained the PPI network of these DEGs using the online search tool search tool for the retrieval of interacting genes and Cytoscape software. This PPI network contained a total of 158 nodes and 885 edges, covering 50 upregulated and 108 downregulated genes (Fig. 2A); however, we could not relate 42 of these 200 DEGs into this PPI network complex because they did not associate with this overall PPI network. Next, we analyzed the core modules of this PPI network using the MOCDE app and found them to include 34 nodes and 537 edges (Fig. 2B), showing that all 34 core nodes were from upregulated genes of the 158 core DEGs, but nor from the downregulated DEGs, with unknown reason.
Figure 2.

The PPI network. (A) The PPI network covers a total of 158 DEGs. Nodes indicate proteins; edges indicate the interaction of proteins. The blue nodes refer to downregulated DEGs; the red nodes refer to upregulated DEGs. (B) Module analysis of the DEGs using the Cytoscape software with a cutoff value of 2, the node score cutoff value = 0.2 (k-core = 2), and maximum depth = 100. DEGs = the differentially expressed genes, PPI = protein–protein interaction.

The PPI network. (A) The PPI network covers a total of 158 DEGs. Nodes indicate proteins; edges indicate the interaction of proteins. The blue nodes refer to downregulated DEGs; the red nodes refer to upregulated DEGs. (B) Module analysis of the DEGs using the Cytoscape software with a cutoff value of 2, the node score cutoff value = 0.2 (k-core = 2), and maximum depth = 100. DEGs = the differentially expressed genes, PPI = protein–protein interaction.

3.4. Association of these core DEGs with survival of HCC patients

We then performed Kaplan–Meier analysis (http://kmplot.com/analysis/) to associate these 34 main DEGs with HCC prognosis. The database is capable to assess the effect of 54k genes (i.e., mRNA, miRNA, or protein) on survival in 21 cancer types, including breast (n = 7830), ovarian (n = 2190), lung (n = 3452), and gastric (n = 1440) cancers. As shown in Figure 3 and Table 4, we found that 28 DEGs were significantly associated with a poor HCC prognosis, whereas the remaining 6 DEGs were not associated with the HCC prognosis (P > .05).
Figure 3.

The Kaplan–Meier survival curves of the 28 DEGs. Association of the 34 core genes with HCC prognosis as analyzed by using the online tool Kaplan–Meier plotter. The data demonstrated that 28 of 34 genes were associated with significantly poor HCC prognosis (P < .05). DEGs = the differentially expressed genes, HCC = hepatocellular carcinoma.

Table 4

Survival significance of the 34 key candidate genes in HCC.

CategoryGenes
Genes with a significantly worse survival (P < .05)ANLN, ASPM, AURKA, BIRC5, BUB1B, CCNB1, CDK1, CDKN3, DLGAP5, DTL, ECT2, FAM83D, GINS1, HELLS, KIAA0101, KIF20A, MAD2L1, NCAPG, PBK, PRC1, RACGAP1, RRM2, TOP2A, TTK, TYMS, UBE2T, UHRF1, ZWINT
Genes without a significantly worse survival (P > .05)CCNA2, NDC80, NEK2, NUSAP1, HMMR, CENPU

HCC = hepatocellular carcinoma.

Survival significance of the 34 key candidate genes in HCC. HCC = hepatocellular carcinoma. The Kaplan–Meier survival curves of the 28 DEGs. Association of the 34 core genes with HCC prognosis as analyzed by using the online tool Kaplan–Meier plotter. The data demonstrated that 28 of 34 genes were associated with significantly poor HCC prognosis (P < .05). DEGs = the differentially expressed genes, HCC = hepatocellular carcinoma. Next, we performed GEPIA of these 28 genes between cancer patients and healthy individuals (http://gepia.cancer-pku.cn/). We found that 27 of these 28 genes were highly expressed in HCC tissue samples versus normal livers (P < .05; Fig. 4 and Table 5). These 27 genes, particularly Cyclin B1, CDK1, MAD2L1, or BUB1B, were enriched in the cell cycle signaling (P = 3.09E-4; Fig. 5 and Table 6).
Figure 4.

The expression level of the 27 DEGs between the liver tissue and the normao tissue. Identification of 27 genes that are significantly highly expressed in HCC tissues. The 28 genes that are significantly associated with a poor survival of HCC patients were analyzed by using GEPIA of HCC tissues versus normal livers. The data showed that 27 of these 28 genes were highly expressed in HCC versus normal liver samples (*P < .05). The red color refers to HCC; the gray color refers to normal liver samples. DEGs = the differentially expressed genes, GEPIA = gene expression profiling interactive analysis, HCC = hepatocellular carcinoma.

Table 5

Validation of the 28 genes from the GEPIA data.

CategoryGenes
Highly expressed genes (P < .05)ANLN, ASPM, AURKA, BIRC5, BUB1B, CCNB1, CDK1, CDKN3, DLGAP5, DTL, ECT2, FAM83D, GINS1, HELLS, KIAA0101, KIF20A, MAD2L1, NCAPG, PBK, PRC1, RACGAP1, RRM2, TOP2A, TYMS, UBE2T, UHRF1, ZWINT
Nonhighly expressed genes (P > .05)TTK

GEPIA = gene expression profiling interactive analysis.

Figure 5.

The KEGG pathway enrichment analysis of the 27 selected genes. The data showed that level of Cyclin B1, CDK1, MAD2L1, and BUB1B was markedly enhanced in the cell cycle pathway. The red arrows point out the position of these genes. CycB, Cyclin B1; Mad2, MAD2L1; BubR1, BUB1B. BUB1B = mitotic checkpoint serine/threonine kinase B, CDK1 = Cyclin-dependent kinase 1, KEGG = Kyoto Encyclopedia of Genes and Genomes, MAD2L1 = Mitotic arrest deficient 2 like 1.

Table 6

Re-analysis of 27 selected genes via KEGG pathway enrichment.

Pathway IDNameCount% P Genes
cfa04110Cell cycle48.363.09E-04CCNB1, CDK1, MAD2L1, BUB1B
cfa04115p53 signaling pathway36.27.002CCNB1, CDK1, RRM2

KEGG = Kyoto Encyclopedia of Genes and Genomes.

Validation of the 28 genes from the GEPIA data. GEPIA = gene expression profiling interactive analysis. Re-analysis of 27 selected genes via KEGG pathway enrichment. KEGG = Kyoto Encyclopedia of Genes and Genomes. The expression level of the 27 DEGs between the liver tissue and the normao tissue. Identification of 27 genes that are significantly highly expressed in HCC tissues. The 28 genes that are significantly associated with a poor survival of HCC patients were analyzed by using GEPIA of HCC tissues versus normal livers. The data showed that 27 of these 28 genes were highly expressed in HCC versus normal liver samples (*P < .05). The red color refers to HCC; the gray color refers to normal liver samples. DEGs = the differentially expressed genes, GEPIA = gene expression profiling interactive analysis, HCC = hepatocellular carcinoma. The KEGG pathway enrichment analysis of the 27 selected genes. The data showed that level of Cyclin B1, CDK1, MAD2L1, and BUB1B was markedly enhanced in the cell cycle pathway. The red arrows point out the position of these genes. CycB, Cyclin B1; Mad2, MAD2L1; BubR1, BUB1B. BUB1B = mitotic checkpoint serine/threonine kinase B, CDK1 = Cyclin-dependent kinase 1, KEGG = Kyoto Encyclopedia of Genes and Genomes, MAD2L1 = Mitotic arrest deficient 2 like 1.

4. Discussion

HCC development and progression, like most other malignancies, involve the aberrant activation of oncogenes and the loss of tumor suppressor genes, resulting in abnormal BPs, MFs, and CCs as well as alterations of cell growth and apoptosis.[ In the current study, altered CCNB1, CDK1, MAD2L1, and BUB1B expression was shown in HCC versus normal liver tissues, and changes in expression of these genes resulted in tumor cell cycle progression and cell growth. Therefore, these genes were associated with a poor overall survival of HCC patients. A previous study conducted by Sarathi et al reported alteration of several genes in HCC, which associated with the stage of HCC, such as NDUFA4L2, CRHBP, and PIGU. However, their data were not shown in our current study, which might be due to the different RNA-Seq data. The heterogeneity of HCC might be the another reason and blocked the treatment in clinical.[ The cell cycle is a cell process that is strictly controlled by different cyclins and cyclin-dependent kinases (CDKs).[ Cyclin B1 (CCNB1) is a protein associated with cell mitosis and primarily expressed in the G2/M phase of the cell cycle. Zheng et al have shown that miRNA-200c expression was able to enhance the radiosensitivity of esophageal cancer by arrest of cells at cell cycle G2/M via downregulation of CCNB1 expression.[ Similarly, Pan et al have reported that eukaryotic translation initiation factor 3 subunit D silencing was able to suppress renal cancer cell tumorigenesis by induction of G2/M-phase arrest and downregulation of the CCNB1/CDK1 signaling pathway activity.[ Other studies have demonstrated that CCNB1 overexpression is associated with a poor prognosis of human epidermal growth factor receptor 2-positive breast cancer,[ colorectal cancer,[ nonmuscle-invasive bladder cancer,[ pancreatic cancer,[ and non-small cell lung cancer.[ Our current data further support these findings in other cancers. Indeed, Zhuang et al also have revealed that the upregulated CCNB1 expression in HCC tissues associated with a worse HCC prognosis.[ Furthermore, Gavet et al have previously demonstrated that the cyclinB1-Cdk1 complex, which functions as the master mitotic factor, was significantly activated in the late G2 phase to trigger the prophase of the cell cycle.[ CDK1, also known as cell division control protein 2 or CDC2, is one of the CDK family members that is frequently mutated in various tumor tissues; this mutation results in uncontrolled cell proliferation or unscheduled reentry into the cell cycle.[ Zhong et al have revealed that the BTB (BR-C, ttk and bab) or POZ (Pox virus and Zinc finger) domain-containing protein potassium channel tetramerisation domain containing 12 was able to activate CDK1 and Aurora A to induce the G2/M transition for cell growth and tumorigenesis.[ In addition, a possible reason for everolimus resistance in prostate cancer cells might be due to upregulation of the CDK1-cyclin B complex, which leads to tumor cell progression towards cell cycle G2/M phase.[ Meanwhile, knockdown of eukaryotic translation initiation factor 3 subunit D expression notably suppressed renal cell carcinoma cell tumorigenesis by inhibition of the cyclin B1/CDK1 signaling.[ Thus, CDK1 not only participates in cell-cycle progression but also regulates cell apoptosis by phosphorylation of caspases,[ indicating that CDK1 is a promoter of tumorigenesis in different tissues and cells and could be a target for future cancer therapy. Mitotic arrest deficient 2 like 1 (MAD2L1) is a component of the mitotic spindle assembly checkpoint that participates in cell mitosis.[ Foijer et al have demonstrated that deletion of MADL1 in mouse hepatocytes results in HCC development and progression because of chromosome instability,[ although another study conducted by Scintu et al has shown that the MAD2L1 level is not associated with intra-chromosomal instability in human ductal breast carcinoma.[ This discrepancy might be due to the different types of species and neoplasms. However, several studies have shown an association of high MAD1L2 expression with cancer risk and a poor prognosis.[ Additionally, Li et al have reported that miR-200c-5p expression leads to the reduction of HCC cell proliferation and metastasis by suppression of MAD1L2 expression.[ Our current data further support the oncogenic role of MAD1L2 in HCC. Furthermore, the mitotic checkpoint serine/threonine kinase B (BUB1B) is a mitotic checkpoint serine/threonine protein kinase that binds to centromeres during cell mitosis.[ BUB1B overexpression has been associated with the development of several human cancers[ and with a poor survival of patients. Fu et al also have reported that a high BUB1B expression enhanced the development of prostate cancer and was associated with a poor prognosis of patients.[ Moreover, mutations of the BUB1B gene are associated with mosaic variegated aneuploidy syndrome and contribute to a high risk of developing childhood cancers.[ However, the current study is just a proof-of-principle study and lacks experimental verification, although previous studies have been conducted to associate some of these molecules with HCC tumorigenesis. Our finding of the association of 4 genes (CCNB1, CDK1, MAD2L1, and BUB1B) with a poor HCC prognosis will need to be confirmed in the future. In conclusion, the current study identified a number of DEGs (especially Cyclin B1, CDK1, MAD2L1, or BUB1B) to be associated with a poor HCC prognosis. Future studies will investigate their functions in HCC and apply them as prognostic biomarkers and potential therapeutic targets for HCC.

5. Conclusions

We assessed DEGs in 288 HCC versus 32 normal liver tissues from 3 online datasets (GSE101685, GSE62232, and GSE112790) and revealed a total of 200 DEGs (57 upregulated and 143 downregulated genes). Our GO term and pathway analysis identified that these DEGs were involved in BPs, MFs, and CCs. The PPI network analysis covered 50 upregulated and 108 downregulated genes. The core modules of this PPI network contained 34 upregulated genes, 28 of which were associated with a poor HCC prognosis, and 27 of these 28 genes were highly expressed in HCC tissues. In conclusion, our current study revealed 200 DEGs, 28 of which were associated with a poor HCC prognosis. However, further studies are needed to confirm their involvement in HCC development and to verify them as prognostic markers and potential treatment targets for HCC.

Acknowledgments

We would like to thank the individuals who helped us with this work, especially Ms. Luodan Wang in inspiring the first author for his research over the past decade.

Author contributions

Conceptualization: Xu Huang, Ping Zhang. Data curation: Xu Huang, Xu Wang, Ge Huang. Formal analysis: Xu Huang, Xu Wang, Ruotao Li. Investigation: Xu Huang, Xingkai Liu; Lidong Cao. Methodology: Xu Huang, Xu Wang, Ge Huang, Ruotao Li, Ping Zhang. Supervision: Junfeng Ye; Ping Zhang; Validation: Junfeng Ye; Ping Zhang. Visualization: Xu Huang. Writing – original draft: Xu Huang, Ruotao Li, Lidong Cao. Writing – review & editing: Xu Wang, Ge Huang, Ping Zhang.
  50 in total

1.  DAVID: Database for Annotation, Visualization, and Integrated Discovery.

Authors:  Glynn Dennis; Brad T Sherman; Douglas A Hosack; Jun Yang; Wei Gao; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2003-04-03       Impact factor: 13.583

2.  GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor.

Authors:  Sean Davis; Paul S Meltzer
Journal:  Bioinformatics       Date:  2007-05-12       Impact factor: 6.937

3.  Defective Cyclin B1 Induction in Trastuzumab-emtansine (T-DM1) Acquired Resistance in HER2-positive Breast Cancer.

Authors:  MohammadA Sabbaghi; Gabriel Gil-Gómez; Cristina Guardia; Sonia Servitja; Oriol Arpí; Sara García-Alonso; Silvia Menendez; Montserrat Arumi-Uria; Laia Serrano; Marta Salido; Aura Muntasell; Maria Martínez-García; Sandra Zazo; Cristina Chamizo; Paula González-Alonso; Juan Madoz-Gúrpide; Pilar Eroles; Joaquin Arribas; Ignasi Tusquets; Ana Lluch; Atanasio Pandiella; Federico Rojo; Ana Rovira; Joan Albanell
Journal:  Clin Cancer Res       Date:  2017-08-18       Impact factor: 12.531

4.  Downregulation of Cyclin B1 mediates nagilactone E-induced G2 phase cell cycle arrest in non-small cell lung cancer cells.

Authors:  Le-Le Zhang; Zhe-Ling Feng; Min-Xia Su; Xiao-Ming Jiang; Xiuping Chen; Yitao Wang; Ao Li; Li-Gen Lin; Jin-Jian Lu
Journal:  Eur J Pharmacol       Date:  2018-04-19       Impact factor: 4.432

5.  MiR-200c-5p suppresses proliferation and metastasis of human hepatocellular carcinoma (HCC) via suppressing MAD2L1.

Authors:  Yuanhang Li; Weijun Bai; Jianjun Zhang
Journal:  Biomed Pharmacother       Date:  2017-06-10       Impact factor: 6.529

6.  Cancer-Specific requirement for BUB1B/BUBR1 in human brain tumor isolates and genetically transformed cells.

Authors:  Yu Ding; Christopher G Hubert; Jacob Herman; Philip Corrin; Chad M Toledo; Kyobi Skutt-Kakaria; Julio Vazquez; Ryan Basom; Bin Zhang; Jennifer K Risler; Steven M Pollard; Do-Hyun Nam; Jeffery J Delrow; Jun Zhu; Jeongwu Lee; Jennifer DeLuca; James M Olson; Patrick J Paddison
Journal:  Cancer Discov       Date:  2012-11-15       Impact factor: 39.397

7.  Cancer statistics in China, 2015.

Authors:  Wanqing Chen; Rongshou Zheng; Peter D Baade; Siwei Zhang; Hongmei Zeng; Freddie Bray; Ahmedin Jemal; Xue Qin Yu; Jie He
Journal:  CA Cancer J Clin       Date:  2016-01-25       Impact factor: 508.702

8.  Expression of mitotic checkpoint proteins BUB1B and MAD2L1 in salivary duct carcinomas.

Authors:  Young Hyeh Ko; Ji Hyeon Roh; Young-Ik Son; Man Ki Chung; Jeon Yeob Jang; Hayoung Byun; Chung-Hwan Baek; Han-Sin Jeong
Journal:  J Oral Pathol Med       Date:  2009-12-16       Impact factor: 4.253

9.  Identification and interaction analysis of key genes and microRNAs in hepatocellular carcinoma by bioinformatics analysis.

Authors:  Tong Mou; Di Zhu; Xufu Wei; Tingting Li; Daofeng Zheng; Junliang Pu; Zhen Guo; Zhongjun Wu
Journal:  World J Surg Oncol       Date:  2017-03-16       Impact factor: 2.754

10.  Overexpression of BUB1B contributes to progression of prostate cancer and predicts poor outcome in patients with prostate cancer.

Authors:  Xin Fu; Guo Chen; Zhi-Duan Cai; Cong Wang; Ze-Zhen Liu; Zhuo-Yuan Lin; Yong-Ding Wu; Yu-Xiang Liang; Zhao-Dong Han; Jun-Chen Liu; Wei-De Zhong
Journal:  Onco Targets Ther       Date:  2016-04-15       Impact factor: 4.147

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.