Literature DB >> 32443377

Bioinformatics analysis of key biomarkers and potential molecular mechanisms in hepatocellular carcinoma induced by hepatitis B virus.

Zhe Li, Jingyong Xu, Hongyuan Cui, Jinghai Song, Jian Chen, Junmin Wei.   

Abstract

BACKGROUND: Hepatocellular carcinoma (HCC) accounts for up to 90% of all primary hepatic malignancies; it is the sixth most common cancer and the second most common cause of cancer mortality worldwide. Numerous studies have shown that hepatitis B virus and its products, HBV integration, and mutation can induce HCC. However, the molecular mechanisms underpinning the regulation of HCC induced by HBV remain unclear.
METHODS: We downloaded 2 gene expression profiling datasets, of HBV and of HCC induced by HBV, from the gene expression omnibus (GEO) database. Differentially expressed genes (DEGs) between HCC and HBV were identified to explore any predisposing changes in gene expression associated with HCC. DEGs between HCC and adjacent healthy tissues were investigated to identify genes that may play a key role in HCC. Any overlapping genes among these DEGs were included in our bioinformatics analysis. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses of overlapping genes were performed using the Metascape online database; the protein-protein interaction (PPI) network was analyzed using the STRING online database; and we obtained the hub genes of the PPI network using Cytoscape software. An overall survival (OS) analysis of hub genes was performed using km-plotter and the gene expression profiling interactive analysis (GEPIA) online database. The expression levels of hub genes were determined using the TCGA and GEPIA databases. Finally, the relationships between hub genes and tumors were analyzed using the comparative toxicogenomics database (CTD).
RESULTS: We identified 113 overlapping genes from the 2 datasets. Using functional and pathway analyses, we found that the overlapping genes were mainly related to the AMPK signaling pathway and cellular responses to cadmium ions. C8A, SPP2, KLKB1, PROZ, C6, FETUB, MBL2, HGFAC, C8B, and ANGPTL3 were identified as hub genes and C8A, SPP2, PROZ, C6, HGFAC, and C8B were found to be significant for survival.
CONCLUSION: The DEGs re-analyzed between HCC and hepatitis B enable a systematic understanding of the molecular mechanisms of HCC reliant on hepatitis B virus.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32443377      PMCID: PMC7254842          DOI: 10.1097/MD.0000000000020302

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.889


Introduction

Hepatocellular carcinoma (HCC) is the fifth most common cancer and the third most common cause of cancer-related death and primary hepatic malignancy in the world. HCC results in more than 600,000 deaths per year because it invades the liver vasculature and undergoes metastasis.[ Numerous studies have identified risk factors contributing to the development of HCC; one of the most common risk factors is chronic viral hepatitis, especially hepatitis B.[ It has been reported that patients with early stage HCC with small, solitary tumors and preserved liver function have a high 10-year overall survival (OS) following resection, liver transplantation, or ablation.[ However, most patients who were diagnosed with HCC at an advanced stage, when it is not suitable for curative therapies, have a low overall survival (OS).[ Thus, it is necessary to identify appropriate molecular biomarkers for diagnosis of HCC at an early stage and potential targets for therapies. In recent years, bioinformatics analysis has been commonly used to screen and identify key biomarkers and potential molecular mechanisms of some cancers due to advances in microarrays based on high-throughput platforms, which could lead to improvements in 10-year OS and prognosis. The gene expression omnibus (GEO) online database is a public repository available worldwide for high-throughput microarray and next-generation sequence functional genomic data sets submitted by the research community. We downloaded and re-analyzed 2 original microarray datasets, GSE44074 and GSE55092, from the GEO database and found biomarkers and disease mechanisms that could prove valuable for future research. In our study, we searched for DEGs using GEO2R tools. The Kyoto Encyclopedia of Genes and Genomes (KEGG) and gene ontology (GO) analyses of overlapping genes among the 2 datasets were performed using the Metascape database. PPI networks were generated using the STRING online database and analyzed via Cytoscape. Cytoscape was also used to search for hub genes. The 10-year OS associated with hub genes were analyzed using the gene expression profiling interactive analysis (GEPIA) database. The identification of hub genes associated with malignancy was performed using the CTD database. Finally, microRNA (miRNA)-gene pairs of the hub genes were screened using the TargetScan database. The results of this will help us understand the molecular mechanisms underlying HCC induced by hepatitis B virus and improve early detection.

Material and methods

Access to public data

The GEO database is a publicly accessible, functional genomics data repository supporting MIAME-compliant data submissions. We obtained the gene expression profiles GSE44074 (GPL13536, Kanazawa Univ. Human Liver chip 10k) and GSE55092 (GPL570, [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array). The probes were transformed into homologous gene symbols using the platform's annotation information. GSE44074 contained 17 HCC specimens (hepatitis B virus-related) and 36 chronic hepatitis B specimens, while GSE55092 contained 39 specimens (hepatitis B virus-related) and 81 chronic hepatitis B specimens.

Identification of DEGs

GEO2R (http://www.ncbi.nlm.nih.gov/geo/geo2r/) is a web application for identifying DEGs by comparing samples from GEO series. In our study, DEGs between HCC and hepatitis B were searched for using GEO2R. The cut-off P value was .05 and | logFC | was more than 1.

GO and KEGG analysis of overlapping genes

GO is a comprehensive source of digital data relating to the functions of genes, spanning 3 aspects of biology: BP (biological processes), CC (cellular components), and MF (molecular functions). KEGG is a database for both the functional interpretation and the practical application of genomic information, which can be used to assign functional meanings to genes and genomes, both at the molecular level and above. Metascape (http://metascape.org/) is a widely used, web-based tool with functional annotations of DEGs. In our study, the GO and KEGG analyses of overlapping genes were performed using Metascape (P < .05).

Construction and analysis of the PPI network

STRING (http://string-db.org) is an online database that can predict and generate a protein–protein interaction (PPI) network after overlapping genes have been imported. Cytoscape is open-source software used to analyze PPI networks and search for hub genes. In our study, the PPI network was generated by STRING and analyzed using Cytoscape.

Survival analysis

GEPIA (http://gepia.cancer-pku.cn/) is an online tool based on the TCGA and GTEx databases and can provide key interactive and customizable functions, such as differential expression analysis, correlation analysis, patient survival analysis, profile plotting, and dimension reduction analysis. In our study, the effects of hub genes on the 10-year OS of patients with HCC were analyzed using GEPIA (P < .05).

Identification of hub genes associated with malignancy

The comparative toxicogenomics database (CTD) (http://ctdbase.org/) is a web-based tool that provides information about interactions between chemicals and gene products and their connection to diseases. The relationships between hub genes and tumors were analyzed using CTD.

Identification of miRNA-gene pairs

TargetScan (www.targetscan.org) is a web-based database that predicts the biological targets of miRNAs. In our study, the miRNA-gene pairs of the hub genes were screened using TargetScan.

Ethical approval

All data in this study were obtained from open, public databases, therefore ethical approval was not necessary.

Results

The screening of GSE44074 identified 394 DEGs, 247 genes which were up-regulated and 147 which were down-regulated (Fig. 1A). Similarly, we obtained 1809 DEGs from GSE55092, 1021 genes which were up-regulated and 788 which were down-regulated (Fig. 1C). There were 113 overlapping genes between the two datasets (Fig. 2A). The heat maps of the datasets are shown in Figure 1B and D.
Figure 1

The expression of differentially expressed genes (DEGs) in GSE44074 and GSE55092. (A, C) Volcano plots of DEGs between hepatitis B and hepatocellular carcinoma. (A) GSE44074 (C) GSE55092. (B, D) Heat maps of the DEGs between hepatitis B and hepatocellular carcinoma. Red and green colors indicate higher expression and lower expression, respectively. (B) GSE44074 (D) GSE55092. Hierarchical clustering showed separate groupings between hepatitis B and hepatocellular carcinoma.

Figure 2

(A) Overlapping genes between GSE44074 and GSE55092. (B) The protein–protein interaction (PPI) network complex for the common DEGs. (C, D) The top ten highly connected genes of the PPI network. (E) Gene ontology (GO) analyses of common DEGs using Metascape. (F) Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses of common DEGs using Metascape.

The expression of differentially expressed genes (DEGs) in GSE44074 and GSE55092. (A, C) Volcano plots of DEGs between hepatitis B and hepatocellular carcinoma. (A) GSE44074 (C) GSE55092. (B, D) Heat maps of the DEGs between hepatitis B and hepatocellular carcinoma. Red and green colors indicate higher expression and lower expression, respectively. (B) GSE44074 (D) GSE55092. Hierarchical clustering showed separate groupings between hepatitis B and hepatocellular carcinoma. (A) Overlapping genes between GSE44074 and GSE55092. (B) The protein–protein interaction (PPI) network complex for the common DEGs. (C, D) The top ten highly connected genes of the PPI network. (E) Gene ontology (GO) analyses of common DEGs using Metascape. (F) Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses of common DEGs using Metascape.

Functional and pathway enrichment analysis

GO analysis of the overlapping genes showed they were mainly enriched in cellular amino acid catabolic processes, vitamin binding, terpenoid metabolic processes, animal organ regeneration, monocarboxylic acid metabolic processes, responses to bacteria, complement activation lectin pathway, cellular responses to cadmium ions, peptidase inhibitor activity, responses to acid chemical, toxin biosynthetic processes, cellular responses to potassium ion starvation, cellular responses to interferon-gamma, DNA topological changes, cofactor metabolic processes, blood microparticles, and protein activation cascade (Fig. 2E). KEGG analysis of overlapping genes showed they were mainly enriched in the Citrate cycle (TCA cycle), AMPK signaling pathway, Leukocyte transendothelial migration, Caffeine metabolism, Chemical carcinogenesis, Glycine, Serine and threonine metabolism, Phenylalanine metabolism, and Complement and coagulation cascades (Fig. 2F).

PPI network construction and searching for hub genes

The PPI network of DEGs was constructed using the STRING database (Fig. 2B). The top ten hub genes were C8A, SPP2, KLKB1, PROZ, C6, FETUB, MBL2, HGFAC, C8B, and ANGPTL3 (Fig. 2C,D). All hub genes were down-regulated. The OS rate analysis of the hub genes showed that C8A, SPP2, PROZ, C6, HGFAC, and C8B were significant for survival (Fig. 3). All of these genes were positively associated with the OS of patients with HCC (P < .05 Table 1
Figure 3

Kaplan–Meier curves of 6 common DEGs (C8A, SPP2, PROZ, C6, HGFAC, and C8B). These genes were all correlated with OS in patients with hepatocellular carcinoma. Data were analyzed using the GEPIA online database. Patients with expression above and below the median are indicated by the red and black lines, respectively. (HR = hazard ratio). (A) C8A (B) SPP2 (C) PROZ (D) C6 (E) HGFAC (F) C8B.

Table 1

A summary of hepatocellular carcinoma microarray datasets from different GEO datasets.

Kaplan–Meier curves of 6 common DEGs (C8A, SPP2, PROZ, C6, HGFAC, and C8B). These genes were all correlated with OS in patients with hepatocellular carcinoma. Data were analyzed using the GEPIA online database. Patients with expression above and below the median are indicated by the red and black lines, respectively. (HR = hazard ratio). (A) C8A (B) SPP2 (C) PROZ (D) C6 (E) HGFAC (F) C8B. A summary of hepatocellular carcinoma microarray datasets from different GEO datasets.

Identification of hub genes

The identification of hub genes associated with malignancy was performed using the CTD database. The results are shown in Figure 4.
Figure 4

Relationship between hepatitis B and hepatocellular carcinoma as related to common DEGs based on the CTD database. (A) C8A (B) FETUB (C) SPP2 (D) MBL2 (E) KLKB1 (F) HGFAC (G) PROZ (H) C8B (I) C6 (J) ANGPTL3.

Relationship between hepatitis B and hepatocellular carcinoma as related to common DEGs based on the CTD database. (A) C8A (B) FETUB (C) SPP2 (D) MBL2 (E) KLKB1 (F) HGFAC (G) PROZ (H) C8B (I) C6 (J) ANGPTL3.

Prediction of miRNAs that regulate hub genes

The miRNAs that regulate hub genes were screened using TargetScan (Table 2).
Table 2

A summary of miRNAs that regulate hub genes.

A summary of miRNAs that regulate hub genes.

Discussion

HCC is a major public health problem and a leading cause of death worldwide. HCC is characterized by rapid progression, recurrence, and metastasis; it is also associated with a high degree of malignancy and a high mortality rate.[ There are many ways in which HCC is treated, such as radiotherapy, embolization, and chemotherapy.[ However, conventional treatment regimens cannot significantly prolong the median OS of patients with advanced HCC, and there is a high frequency of tumor recurrence following these treatments.[ Surgery at an early stage is the most effective method for the successful treatment of HCC.[ Recently, biotherapies such as molecule-targeted therapy and immunotherapy have attracted great attention with respect to their use in standardized treatments for HCC.[ Thus, it is necessary to identify appropriate molecular biomarkers to enable diagnosis at an early stage as well as potential targets for HCC therapies. HBV infection is the most common chronic viral infection in the world and is a historically challenging disease that primarily affects populations from developing or low-income countries.[ The most recent research indicates that approximately two billion people have been infected with HBV and more than 600,000 individuals worldwide die each year of HBV-related diseases, such as chronic hepatitis B, liver cirrhosis, and hepatocellular carcinoma.[ Numerous studies have shown that HBV infection is a major risk factor for the development of HCC, with 50% of newly diagnosed HCC cases attributed to HBV infection because of the direct and indirect oncogenic effects of the virus.[ Surgery at an early stage could improve the long-term survival of patients. However, the early diagnosis of HCC in patients with HBV is challenging. Thus, it is critical to develop ways to enable the early detection of HCC by exploring the underlying molecular mechanisms linking HCC with HBV and to improve the survival rate and prognosis of patients in this situation. In our study, we obtained two gene expression file datasets, GSE44074 and GSE55092, from the GEO database; we identified 113 overlapping genes between these datasets. Based on the GO analysis of the overlapping genes, monocarboxylic acid metabolic processes had the highest enrichment score; these processes are associated with aerobic glycolysis and lactate efflux. The enhancement of aerobic glycolysis and lactate efflux could provide tumor cells with a metabolic advantage and an invasive phenotype in the acidic microenvironment of a tumor.[ Cellular amino acid catabolic processes are associated with the metabolism of amino acids. Glutamine, glutamate, proline, and glycine are metabolic regulators involved in supporting cancer cell growth and metastasis.[ It has been reported that “vitamin binding” is associated with HCC, such as the vitamin B12 (B12)-binding protein, haptocorrin, which could be a potential biomarker in patients with HCC.[ Animal organ regeneration is associated with tissue remodeling and integrating growth, processes which underlie cancer metastasis.[ Peptidase inhibitor activity impacts tumor progression. For example, secretory leukocyte protease inhibitor (SLPI) could inhibit elastase, serine proteases, and matrix metalloproteinases, which could influence tumor invasion.[ Cellular responses to interferon-gamma (IFN-gamma) play a key role in innate and adaptive immune responses, and it has been shown that IFN-gamma could prevent the development of primary and transplanted tumors.[ Finally, the blood microparticle and protein activation cascade is associated with abnormalities in laboratory coagulation tests of patients with cancer because of tumor-shed procoagulant microparticles.[ For the KEGG analysis of overlapping genes, chemical carcinogenesis had the highest enrichment score, which could induce malignant tumors.[ The AMPK signaling pathway is involved in the process of tumor invasion and migration in the liver, because liver kinase B1 (also known as serine/threonine kinase 11) phosphorylates and activates the AMPK signaling pathway.[ The citrate cycle (TCA cycle) is a central pathway for oxidative phosphorylation in cells. It has been reported that cancer cells with deregulated oncogene and tumor suppressor expression rely on the TCA cycle for energy production and macromolecule synthesis.[ Leukocyte transendothelial migration could breach structural and cellular boundaries, which could promote cancer cell invasion and metastasis.[ Finally, glycine, serine, and threonine metabolism and phenylalanine metabolism, together, provide essential precursors for the synthesis of proteins that are crucial to cancer cell growth. Moreover, serine could regulate tumor homeostasis by affecting cellular antioxidative capacity.[ Protein–protein interaction (PPI) analysis was conducted using the STRING online database and provided detailed connections between overlapping genes. In our study, the top ten hub genes were C8A, SPP2, KLKB1, PROZ, C6, FETUB, MBL2, HGFAC, C8B, and ANGPTL3. C6 (complement C6), C8A (complement C8 alpha chain), and C8B (complement C8 beta chain) encode components of the complement cascade. The complement system is part of the innate immune system, which is generally recognized to be a protective mechanism against the formation of tumors in humans.[ SPP2 (secreted phosphoprotein 2) encodes a member of the cystatin superfamily. KLKB1 (kallikrein B1) encodes a glycoprotein that participates in the surface-dependent activation of blood coagulation, fibrinolysis, kinin generation, and inflammation. It has been reported that KLKB1 may function as a tumor suppressor gene in lung cancers.[ PROZ (protein Z) encodes a vitamin K-dependent glycoprotein that is synthesized in the liver and secreted into the plasma; it is associated with oral squamous cell carcinoma and pancreatic cancer.[ FETUB (fetuin B) encodes a member of the fetuin family and may potentially be useful in the treatment of diseases characterized by excess angiogenesis, such as cancer.[ MBL2 (mannose-binding lectin 2) encodes the soluble mannose-binding lectin. Numerous studies have shown that MBL2 is an important element in the innate immune system, which plays a key role in the immune surveillance against malignancies. Genetic variations in MBL2 could influence the risk of developing cancer. HGFAC (HGF activator) encodes a member of the peptidase S1 protein family. HGFAC has been shown to be involved in liver and kidney cancers, and the low expression of HGFAC is significantly associated with short OS time in patients with liver cancer.[ ANGPTL3 (angiopoietin-like 3) encodes a member of a family of secreted proteins that function in angiogenesis; this family of proteins plays an important mechanistic role in cancer initiation, progression, and eventual prognosis.[ In conclusion, we obtained genes that were overlapping between HCC and hepatitis B from two datasets, and which may play an important role in the development of HCC. These new biomarkers, such as HGFAC, could be used to diagnose HCC at an early stage. The GO and KEGG analyses revealed further molecular mechanisms underpinning the regulation of HCC induced by HBV. Together, these findings may lead to more effective prevention, detection, and treatment of HCC.

Author contributions

Conceptualization: Hongyuan Cui Data curation: Jinghai Song Formal analysis: Jian Chen Investigation: Junmin Wei Methodology: Hongyuan Cui, Zhe Li Writing – original draft: Zhe Li, Jingyong Xu Writing – review & editing: Zhe Li
  40 in total

Review 1.  The roles of IFN gamma in protection against tumor development and cancer immunoediting.

Authors:  Hiroaki Ikeda; Lloyd J Old; Robert D Schreiber
Journal:  Cytokine Growth Factor Rev       Date:  2002-04       Impact factor: 7.638

2.  A Panel of Three Biomarkers Identified by iTRAQ for the Early Diagnosis of Pancreatic Cancer.

Authors:  Xing Wu; Zi-Xiang Zhang; Xing-Yu Chen; Ya-Ling Xu; Ni Yin; Jian Yang; Dong-Ming Zhu; De-Chun Li; Jian Zhou
Journal:  Proteomics Clin Appl       Date:  2019-05-07       Impact factor: 3.494

Review 3.  [Current treatment in advanced hepatocellular carcinoma and prospects for immuno-oncology therapy].

Authors:  J M Xu
Journal:  Zhonghua Zhong Liu Za Zhi       Date:  2017-08-23

Review 4.  Pathology and pathogenesis of hepatocellular carcinoma.

Authors:  C Röcken; S Carl-McGrath
Journal:  Dig Dis       Date:  2001       Impact factor: 2.404

Review 5.  Linking DNA adduct formation and human cancer risk in chemical carcinogenesis.

Authors:  Miriam C Poirier
Journal:  Environ Mol Mutagen       Date:  2016-06-27       Impact factor: 3.216

Review 6.  Chemical carcinogenesis.

Authors:  Paula A Oliveira; Aura Colaço; Raquel Chaves; Henrique Guedes-Pinto; Luis F De-La-Cruz P; Carlos Lopes
Journal:  An Acad Bras Cienc       Date:  2007-12       Impact factor: 1.753

7.  Recurrence of hepatocellular cancer after resection: patterns, treatments, and prognosis.

Authors:  Parissa Tabrizian; Ghalib Jibara; Brian Shrager; Myron Schwartz; Sasan Roayaie
Journal:  Ann Surg       Date:  2015-05       Impact factor: 12.969

Review 8.  Hepatocellular carcinoma: clinical frontiers and perspectives.

Authors:  Jordi Bruix; Gregory J Gores; Vincenzo Mazzaferro
Journal:  Gut       Date:  2014-02-14       Impact factor: 23.059

Review 9.  Circulating predictive and diagnostic biomarkers for hepatitis B virus-associated hepatocellular carcinoma.

Authors:  Stijn Van Hees; Peter Michielsen; Thomas Vanwolleghem
Journal:  World J Gastroenterol       Date:  2016-10-07       Impact factor: 5.742

Review 10.  Treatment of hepatocellular carcinoma with portal vein tumor thrombus: advances and challenges.

Authors:  Jin-Fang Jiang; Yong-Cong Lao; Bao-Hong Yuan; Jun Yin; Xin Liu; Long Chen; Jian-Hong Zhong
Journal:  Oncotarget       Date:  2017-05-16
View more
  5 in total

1.  PROZ May Serve as a Prognostic Biomarker for Early Hepatocellular Carcinoma.

Authors:  Xiaocong Jiang; Ting Song; Xiuhua Pan; Xinyu Zhang; Yuhong Lan; Li Bai
Journal:  Int J Gen Med       Date:  2021-08-06

Review 2.  "Complimenting the Complement": Mechanistic Insights and Opportunities for Therapeutics in Hepatocellular Carcinoma.

Authors:  Astha Malik; Unmesha Thanekar; Surya Amarachintha; Reena Mourya; Shreya Nalluri; Alexander Bondoc; Pranavkumar Shivakumar
Journal:  Front Oncol       Date:  2021-02-24       Impact factor: 5.738

3.  Transcriptomic identification of HBx-associated hub genes in hepatocellular carcinoma.

Authors:  Zhengzhong Ni; Jun Lu; Weiyi Huang; Hanif Khan; Xuejun Wu; Danmei Huang; Ganggang Shi; Yongdong Niu; Haihua Huang
Journal:  PeerJ       Date:  2021-12-22       Impact factor: 2.984

4.  Serum KLKB1 as a Potential Prognostic Biomarker for Hepatocellular Carcinoma Based on Data-Independent Acquisition and Parallel Reaction Monitoring.

Authors:  Yi-Qun Che; Yue Zhang; Han-Bing Li; Di Shen; Wei Cui
Journal:  J Hepatocell Carcinoma       Date:  2021-10-12

5.  PROZ Associated with Sorafenib Sensitivity May Serve as a Potential Target to Enhance the Efficacy of Combined Immunotherapy for Hepatocellular Carcinoma.

Authors:  Yinkui Chen; Xiusheng Qiu; Donghao Wu; Xu Lu; Guanghui Li; Yongsheng Tang; Changchang Jia; Zhiyong Xiong; Tiantian Wang
Journal:  Genes (Basel)       Date:  2022-08-26       Impact factor: 4.141

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.