Literature DB >> 36187396

Dissecting novel mechanisms of hepatitis B virus related hepatocellular carcinoma using meta-analysis of public data.

Jihad Aljabban1, Michael Rohr2, Saad Syed3, Eli Cohen4, Naima Hashi5, Sharjeel Syed6, Kamal Khorfan7, Hisham Aljabban8, Vincent Borkowski9, Michael Segal9, Mohamed Mukhtar10, Mohammed Mohammed11, Emmanuel Boateng4, Mary Nemer9, Maryam Panahiazar12, Dexter Hadley13, Sajid Jalil14, Khalid Mumtaz14.   

Abstract

BACKGROUND: Hepatitis B virus (HBV) is a cause of hepatocellular carcinoma (HCC). Interestingly, this process is not necessarily mediated through cirrhosis and may in fact involve oncogenic processes. Prior studies have suggested specific oncogenic gene expression pathways were affected by viral regulatory proteins. Thus, identifying these genes and associated pathways could highlight predictive factors for HCC transformation and has implications in early diagnosis and treatment. AIM: To elucidate HBV oncogenesis in HCC and identify potential therapeutic targets.
METHODS: We employed our Search, Tag, Analyze, Resource platform to conduct a meta-analysis of public data from National Center for Biotechnology Information's Gene Expression Omnibus. We performed meta-analysis consisting of 155 tumor samples compared against 185 adjacent non-tumor samples and analyzed results with ingenuity pathway analysis.
RESULTS: Our analysis revealed liver X receptors/retinoid X receptor (RXR) activation and farnesoid X receptor/RXR activation as top canonical pathways amongst others. Top upstream regulators identified included the Ras family gene rab-like protein 6 (RABL6). The role of RABL6 in oncogenesis is beginning to unfold but its specific role in HBV-related HCC remains undefined. Our causal analysis suggests RABL6 mediates pathogenesis of HBV-related HCC through promotion of genes related to cell division, epigenetic regulation, and Akt signaling. We conducted survival analysis that demonstrated increased mortality with higher RABL6 expression. Additionally, homeobox A10 (HOXA10) was a top upstream regulator and was strongly upregulated in our analysis. HOXA10 has recently been demonstrated to contribute to HCC pathogenesis in vitro. Our causal analysis suggests an in vivo role through downregulation of tumor suppressors and other mechanisms.
CONCLUSION: This meta-analysis describes possible roles of RABL6 and HOXA10 in the pathogenesis of HBV-related HCC. RABL6 and HOXA10 represent potential therapeutic targets and warrant further investigation. ©The Author(s) 2022. Published by Baishideng Publishing Group Inc. All rights reserved.

Entities:  

Keywords:  Genomics; Hepatitis B virus; Hepatocellular carcinoma; Meta-analysis

Year:  2022        PMID: 36187396      PMCID: PMC9516659          DOI: 10.4251/wjgo.v14.i9.1856

Source DB:  PubMed          Journal:  World J Gastrointest Oncol


Core Tip: Hepatitis B virus (HBV) is a cause of hepatocellular carcinoma (HCC). Interestingly, this process is not necessarily mediated through cirrhosis and may in fact involve oncogenic processes. Prior studies have suggested specific oncogenic gene expression pathways were affected by viral regulatory proteins. Thus, identifying these genes and associated pathways could highlight predictive factors for HCC transformation, and has implications in early diagnosis and treatment. Our manuscript leverages big data to offer key insights to oncogenesis of HBV infection in HCC. We were able to dissect key genetic drivers to disease and namely demonstrate a newfound role for rab-like protein 6 and homeobox A10.

INTRODUCTION

Hepatitis B virus (HBV) is a major cause of liver disease, significantly contributing to global morbidity and mortality. Recent estimates show there are approximately 240-300 million people chronically infected with HBV worldwide[1,2]. Chronic HBV infection leads to cirrhosis in 30% of patients, of which 53% later develop hepatocellular carcinoma (HCC)[3]. The Global Burden of Disease Study estimated that HBV-related cirrhosis and liver cancer annually causes 786000 deaths worldwide[4]. HBV utilizes both direct and indirect means to promote HCC. For example, HBV-induced HCC, without cirrhosis suggests involvement of oncogenic pathways independent of chronic liver inflammation. Some studies have implicated viral regulatory proteins such as HBV X protein affecting gene expression pathways[5]. In addition, mutations resulting in growth advantages may be conferred to infected cells by virtue of host chromosomal HBV DNA integration - a phenomenon known as insertional mutagenesis[5]. In the decades since Garcia et al[6] and Wang et al[7] identified cyclin (CCN) A and retinoic acid receptor-beta genes, respectively, as targets of HBV integration, other points of vulnerability have been identified. Li et al[8] found HBV integration into telomerase reverse transcriptase-promoter genes results in sex hormone-dependent responsiveness, providing a possible explanation for the threefold male-to-female preponderance of HBV-related HCC[6-8]. Despite advancements in nucleoside nucleotide analog (NA)-based treatment, and adequate viral suppression resulting in undetectable HBV DNA, patients are still at risk for developing HCC. This may be due to NAs ability to suppress viremia but not eliminate infection and leading to oncogenesis. Importantly, chronic HBV infection, irrespective of cirrhosis represents a lifetime risk for HCC development 10-25 times greater than non-infected patients[1,5,9]. These data highlight the importance of identifying predictive factors for HCC transformation, non-resolving acute infections, and chronic disease development. Understanding the evolution of HCC following HBV infection, and genetic signatures of HBV oncogenicity, will pave the way for improved risk assessment, treatment options, and patient outcomes. In this meta-analysis, we aim to identify transcriptomic correlates of HCC development in patients with HBV infection.

MATERIALS AND METHODS

Search Tag Analyze Resource

We developed the Search Tag Analyze Resource (STARGEO) platform to utilize the wealth of genomics data featured in the National Center for Biotechnology Information’s Gene Expression Omnibus. STARGEO allows for meta-analysis of transcriptomic signatures between sample sets, such as between disease and normal tissue, through tagging of biological samples from public data. Briefly, through stargeo.org, we searched for studies that studied HBV-related HCC. We then manually curated samples through the “Tagging” interface built into STARGEO based on interactive regular expressions. We gathered liver tumor samples under the “HBV_HCC” tag and control samples from adjacent tumor samples under the “HCC_Control” tag. More information on STARGEO and can be found in our previous paper[10]. To investigate HBV-specific HCC, we tagged 155 tumor and 185 adjacent non-tumor samples as a control. Samples were paired 1:1 within each study. Data was sourced from the GSE19665, GSE44074, GSE55092, GSE62232, and GSE67764 series[10-15]. HCC patients in these studies had confirmed chronic HBV infection with no other co-morbid hepatic infections and had biopsied taken at time of diagnosis. Genetic analysis focused on hepatocytes. Stargeo.org mappings are based on mygene.info gene annotation service to map all probe identifiers to Entrez gene identifiers[16]. The mean difference of contrasts for expressions and the standard deviation of that mean difference were calculate for each gene in every study. Standard meta-analysis with fixed and random effects model were used to combine these estimates across studies to generate both meta P values and effects size across studies. Study weight percentages were calculated using the inverse variance method via the DerSimonian-Laird estimate[17]. We use Python to achieve the analyses explained above in stargeo.org. More information on this particular analysis of HBV-related HCC can be found on http://stargeo.org/analysis/669/. Individual genes can be searched and the number of patient samples in which we observed change in genetic expression is available, along with other information (see Figure 1). Lastly, to best contextualize our results we cited high quality articles in Reference Citation Analysis (https://www.referencecitationanalysis.com).
Figure 1

Screenshot from Fixed and random treatment effects are illustrated. MD: Mean difference; CI: Confidence interval. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Screenshot from Fixed and random treatment effects are illustrated. MD: Mean difference; CI: Confidence interval. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Ingenuity pathway analysis

In order to dissect potential mechanism of disease, potential biomarkers, and therapeutic targets, we extracted more than 21000 genes for our meta-analysis and analyzed the output using the ingenuity pathway analysis (IPA) tool[18]. Analysis was restricted to genes that showed statistical significance (P < 0.05) in both fixed and random effects models with an absolute experimental log ratio greater than 0.7 between experimental (HCC) and control samples. A total of 1035 genes were included in the IPA analysis. Top up- and downregulated genes determined by STARGEO are featured in Table 1. Genes analyzed by IPA are summarized in Supplementary Table 1 with P values and experimental log ratios.
Table 1

Summary of the most up and down-regulated genes from the meta-analysis of primary tumor samples from hepatocellular carcinoma patients, experimental log ratios indicating magnitude of change from control samples are shown

Top upregulated
Top downregulated
GPC32.426CLEC4G-4.212
ANLN2.398CLEC1B-4.122
CCNB12.239LINC01092-3.834
ASPM2.220SLCO1B3-3.788
CDK12.215CLEC4M-3.689
NEK22.121HAMP-3.657
EPS8L32.116STAB2-3.604
PBK2.017OIT3-3.575
DTL2.010MT1M-3.508
PRC11.993HHIP-3.417

GPC3: Glypican 3; ANLN: Anillin; CCNB1: Cycline B1; ASPM: Abnormal spindle-like microcephaly-associated protein; CDK1: Cyclin-dependent kinase 1; NEK2: NIMA related kinase 2; EPS8L3: Epidermal growth factor receptor kinase substrate 8-like protein 3; PBK: PDZ binding kinase; DTL: Denticleless E3 ubiquitin protein ligase homolog; PRC1: Protein regulator of cytokinesis 1; CLEC4G: C-type lectin domain family 4 member G; CLEC1B: C-type lectin domain family 1 member B; LINC01092: Long intergenic non-protein coding RNA 1092; SLCO1B3: Solute carrier organic anion transporter family member 1B3; CLEC4M: C-type lectin domain family 4 member M; HAMP: Hepcidin; STAB2: Stabilin 2; OIT3: Oncoprotein induced transcript 3; MT1M: Metallothionein 1M; HHIP: Hedgehog interacting protein.

Summary of the most up and down-regulated genes from the meta-analysis of primary tumor samples from hepatocellular carcinoma patients, experimental log ratios indicating magnitude of change from control samples are shown GPC3: Glypican 3; ANLN: Anillin; CCNB1: Cycline B1; ASPM: Abnormal spindle-like microcephaly-associated protein; CDK1: Cyclin-dependent kinase 1; NEK2: NIMA related kinase 2; EPS8L3: Epidermal growth factor receptor kinase substrate 8-like protein 3; PBK: PDZ binding kinase; DTL: Denticleless E3 ubiquitin protein ligase homolog; PRC1: Protein regulator of cytokinesis 1; CLEC4G: C-type lectin domain family 4 member G; CLEC1B: C-type lectin domain family 1 member B; LINC01092: Long intergenic non-protein coding RNA 1092; SLCO1B3: Solute carrier organic anion transporter family member 1B3; CLEC4M: C-type lectin domain family 4 member M; HAMP: Hepcidin; STAB2: Stabilin 2; OIT3: Oncoprotein induced transcript 3; MT1M: Metallothionein 1M; HHIP: Hedgehog interacting protein. HCC is a complex disease that is a result of several pathological drivers. We used IPA Upstream Regulator analysis to elucidate upstream transcription regulators that best reflect our observed genetic expression dataset[18]. The P values are based on the degree of overlap of known effector targets and our gene list submission. The activation z-score illustrates the upstream regulator activation state, the magnitude of which represents likely activation states of upstream regulators.

Survival analysis

The association between rab-like protein 6 (RABL6) expression in HCC patients and survival was found using GPEIA2[19]. GPEIA2 uses samples from The Cancer Genome Atlas (TCGA) HCC cohort and analyzes survival based on the Kaplan-Meier survival method. Expression was split into the top 75% and bottom 25% of RABL6 expression. There were no interactions with human subjects or interventions involved in this study. Additionally, as all content presented is sourced from publicly available data and no patient-protected information was used. Therefore, no institutional review board approval was deemed necessary.

RESULTS

Top canonical pathways and gene candidates from HBV-HCC analysis

IPA analysis demonstrated liver X receptor (LXR)/retinoid x receptor (RXR) activation, lipopolysaccharide-interleukin-1 (LPS-IL-1) mediated RXR inhibition, acetone degradation, melatonin degradation, and farnesoid X receptor (FXR)/RXR activation as top canonical pathways in HBV-associated HCC (see Supplementary Table 2 for more information on top canonical pathways). FXR (NR1H4) was downregulated in our analysis suggesting its inhibition (Supplementary Table 2). Top upregulated genes in our analysis are implicated in several signaling processes (Table 1). For example, glypican-3 (GPC3), a cell surface glycoprotein that interacts with Wnt/β-catenin, Yes associated protein, and Hedgehog pathways[20]. Additionally, Akt signaling was activated through the epidermal growth factor receptor kinase substrate 8-like protein 3, a substrate for the epidermal growth factor receptor[21]. Furthermore, we found upregulation of the serine/threonine kinase PDZ-binding kinase (PBK), a mitotic kinase related to mitogen-activated protein kinase kinase (MAPKK)[22]. Other top regulated genes are involved in cell cycle division and control including anilin, CCNB1, assembly factor for microtubules, cyclin dependent kinase 1, and protein regulator of cytokinesis 1. Moreover, we found upregulation of the serine/threonine kinase NIMA related kinase a mediator of centrosome separation in mitosis and meiosis, and deniticleless, an adaptor of the E3-ubiquitin ligase that targets p21 to drive cell division[23]. Top downregulated genes are involved in liver metabolism and other processes. Several members of the C-type lectin family CLEC4G, CLEC1B, and CLEC4M were found among the most repressed genes. C-type lectins participate in adhesion and can act as signaling receptors for inflammation and immune-related processes[24]. Other downregulated genes are indicative of impaired liver function including the liver specific anion transporter solute carrier organic anion transporter family member 1B3, iron regulator hepcidin (HAMP), and the metallothionen (MT1M)[25]. Lastly, we found downregulation of Hh-interacting protein (HHIP), a negative regulator of Hedgehog signaling[26].

Network analysis of HBV-HCC

To elucidate top disease functions from our results we employed the IPA Network analysis function[18]. IPA ranks networks from the Global Molecular Network based on the number of focus genes from given networks that match with our analysis. Significance is given by the p-score [p-score = -log10(P value)]. We identified 25 networks with most being involved in cancer, cell cycle, gastrointestinal disease, and other disease functions. The top 6 networks are summarized in Table 2 and the top network is illustrated in Figure 2 (see Supplementary Table 3 for all networks).
Table 2

Top disease and functions identified by ingenuity pathway analysis network analysis

Top disease and function
P-score
Focus genes
Genes in network
Cell cycle; cellular assembly and organization; DNA replication, recombination and repair4633BUB1, BUB1B, C4, C4BP, CENPA, CENPH, CENPK, CENPL, CENPM, CENPW, CNDP1, DSN1, ESR1, FCN2, FCN3, GGT5, HIST1H2BF, HJURP, HPS5, KNL1, LILRB5, MASP1, MASP2, MBL2, MND1, MPC1, NDC80, NUF2, OIP5, OVOS2, SLC1A4, TENM1, TUFT1, WHRN, XK
Cancer; cell death and survival; organismal injury and abnormalities4231ANKS6, Ap1, AURKA, B9D1, BCKDHB, BOLA2/BOLA2B, CA5A, CBX5, CCT3, CDK1, CDKN2A, CEMIP, EPB41L5, estrogenreceptor, ETFRF1, EZH2, H2AFX, HIST1H2AM, HMGA1, KIF11, MCM2, MFAP4, MKI67, NAT2, NGFR, NT5DC2, PRKDC, Rnr, SETDB1, Smad2/3, TCF19, TK1, TMEM131L, TUBE1, ZSWIM5
Cancer; cell cycle; cellular movement4231ATAD2, ATPase, BMP, BMP5, CCBE1, CEP55, CTH, DTL, ECT2, GORASP2, IGF2BP1, IL12 (family), IL18R1, IL1RAP, IPO9, KIF14, KIF23, LUM, MAP1LC3, MSH2, NAAA, NUP62, OLA1, PBLD, PLSCR4, RAD54B, SIGLEC1, STAU2, TEX37, TRAF5, TRIP13, VSIG2, VWA8, WDYHV1, XPO5
Cell morphology, cellular assembly and organization, DNA replication, recombination, and repair3930ACAA2, ARMC6, BCHE, C4BPA, CDCA3, CDCA8, CENPE, CENPF, Ciap, CRNDE, ENO3, Enolase, FOXM1, KALRN, KIF20A, KIF2C, KIF4A, LRAT, MAGEA3/MAGEA6, MZT1, NAV1, NEB, PRC1, RAS, RASGRP2, RASSF4, SESTD1, SGO2, SRD5A1, SRD5A2, Steroid 5 alpha-Reductase, TARBP1, TGM3, transglutaminase, TRIO
Cancer; organismal injury and abnormalities; reproductive system disease3930ANGPTL6, BMPER, Cysteine Protease, DPF3, EGLN, FNIP2, GCDH, GDF2, Granzyme B-Perforin-SRGN, GREM2, HMGCL, HOXA13, KIF15, LYVE1, MS4A7, MT1G, NOSTRIN, PCDH9, PDE7B, PLVAP, RNF125, RNF165, RRAGD, SERPINB9, SESN3, SLC7A2, Smad1/5/8, SPARCL1, SRGN, STC1, TPX2, TRIM16, Vegf, VSIG4, ZFP
Cancer, gastrointestinal disease, hepatic system disease3729AKR1D1, ALDH, ALDH1A3, ALDH6A1, ALDH8A1, ANK3, CA2, COBLL1, CYP39A1, ENAH, ESM1, FOS, GBA, GLS2, GPM6A, GPSM2, GRHPR, GUCY1A1, HIST1H3H, histone-lysine N-methyltransferase, HOOK1, MECOM, NCKAP1L, NSD2, PALM2, PXMP2, Rab11, RAB11FIP4, sGC, SLC1A1, SMYD3, Sos, TSKU, UXS1

P-score indicates statistical significance [p-score = -log10(P value)] and the number of focus genes indicates the number of genes in our analysis that are a part of the respective network. Genes that are labeled red are upregulated in our analysis and genes that are green are downregulated. Genes in black are a part of the network but were not featured in our results. For P values and experimental log ratios of genes see Supplementary Table 1. TENM1: Teneurin transmembrane protein 1; GGT5: Gamma-glutamyltransferase5; SLC1A4: Solute carrier family 1 member 4; OVOS2: Alpha-2-macroglobulin like 1 pseudogene; XK: X-linked Kx blood group; WHRN: Whirlin; MPC1: Mitochondrial pyruvate carrier 1; LILRB5: Leukocyte immunoglobulin like receptor B5; CNDP1: Carnosine dipeptidase 1; CENPM: Centromere protein M; CENPL: Centromere protein L; ESR1: Estrogen receptor 1; C4BP: C4b-binding protein; TUFT1: Tuftelin 1; DSN1: MIND kinetochore complex component; KNL1: Kinetochore scaffold 1; BUB1: Mitotic checkpoint serine/threonine-protein kinase BUB1; HPS5: Hermansky-Pudlak syndrome 5 protien; FCN3: Ficolin 3; MASP1: MBL associated serine protease 1; MBL2: Mannose binding lectin 2; HIST1H2BF: Histone H2B type 1; OIP5: Opa interacting protein 5; NDC80: Kinetochore protein NDC80; BUB1B: BUB1 mitotic checkpoint serine/threonine kinase B; MND1: Meiotic nuclear divisions 1; CENPW: Centromere protein W; CENPH: Centromere protein H; CENPK: Centromere protein K; CENPA: Centromere protein A; ANKS6: Ankyrin repeat and SAM domain-containing protein 6; Ap1: Activator protein 1; AURKA: Aurora A kinase; B9D1: B9 domain containing 1; BCKDHB: 2-Oxoisovalerate dehydrogenase subunit beta; BOLA2/BOLA2B: BolA family member 2; CA5A: Carbonic anhydrase 5A; CBX5: Chromobox protein homolog 5; CCT3: Chaperonin containing TCP1 complex; CDK1: Cyclin-dependent kinase 1; CDKN2A: Cyclin dependent kinase inhibitor 2A; CEMIP: Cementum protein 1; EPB41L5: Erythrocyte membrane protein band 4.1 like 5; ETFRF1: Electron transfer flavoprotein regulatory factor 1; EZH2: Enhancer of zeste 2 polycomb repressive complex 2 subunit; H2AFX: H2A histone family, member X; HIST1H2AM: Histone cluster 1, H2am; HMGA1: High mobility group AT-hook 1; KIF11: Kinesin family member 11; MCM2: Mini-chromosome maintenance complex component 2; MFAP4: Microfibril associated protein 4; MKI67: Marker of proliferation Ki-67; NAT2: N-acetyltransferase 2; NGFR: Nerve growth factor receptor; NT5DC2: 5’-nucleotidase domain containing 2; PRKDC: Protein kinase, DNA-activated, catalytic subunit; Rnr: Ribonucleotide reductase; SETDB1: SET domain bifurcated histone lysine methyltransferase 1; Smad2/3: SMAD family member 2/3; TCF19: Transcription factor 19; TK1: Thymidine kinase 1; TMEM131L: Transmembrane 131 like; TUBE1: Tubulin epsilon 1; ZSWIM5: Zinc finger SWIM-type containing 5; ATAD2: ATPase family AAA domain containing 2; BMP: Bone morphogenetic protein; BMP5: Bone morphogenetic protein 5; CCBE1: Collagen and calcium binding EGF domains 1; CEP55: Centrosomal protein 55; CTH: Cystathionine gamma-lyase; DTL: Denticleless E3 ubiquitin protein ligase homolog; ECT2: Epithelial cell transforming 2; GORASP2: Golgi reassembly stacking protein 2; IGF2BP1: Insulin like growth factor 2 mRNA binding protein 1; IL12 (family): Interleukin-12; IL18R1: Interleukin-18 receptor 1; IL1RAP: Interleukin-1 receptor accessory protein; IPO9: Importin 9; KIF14: Kinesin-like protein 14; KIF23: Kinesin-like protein 23; LUM: Lumican; MAP1LC3: Microtubule-associated protein 1 light chain 3 beta; MSH2: MutS homolog 3; NAAA: N-acylethanolamine acid amidase; NUP62: Nucleoporin 62; OLA1: Obg like ATPase 1; PBLD: Phenazine biosynthesis like protein domain containing; PLSCR4: Phospholipid scramblase 4; RAD54B: RAD 54 homolog B; SIGLEC1: Sialic acid binding Ig like lectin 1; STAU2: Staufen double-stranded RNA binding protein 2; TEX37: Testis expressed 37; TRAF5: TNF receptor associated factor 5; TRIP13: Thyroid hormone receptor interactor 13; VSIG2: V-set and immunoglobulin domain containing 2; VWA8: Von Willebrand factor A domain containing 8; WDYHV1: WDYHV motif containing 1; XPO5: Exportin 5; ACAA2: Acetyl-CoA acyltransferase 2; ARMC6: Armadillo repeat containing 6; BCHE: Butyrylcholinesterase; C4BPA: Complement component 4 binding protein alpha; CDCA3: Cell division cycle associated 3; CDCA8: Cell division cycle associated 8; CENPE: Centromere protein E; CENPF: Centromere protein F; CRNDE: Colorectal neoplasia differentially expressed; ENO3: Enolase 3; FOXM1: Forkhead box M1; KALRN: Kalirin RhoGEF kinase; KIF20A: Kinesin family member 20A; KIF2C: Kinesin family member 2C; KIF4A: Kinesin family member 4A; LRAT: Lecithin-retinol acyltransferase; MAGEA3/MAGEA6: MAGE family member A3/A6; MZT1: Mitotic spindle organizing protein 1; NAV1: Neuron navigator 1; NEB: Nebulin; PRC1: Protein regulator of cytokinesis 1; RAS: RAS GTPase; RASGRP2: RAS guanyl releasing protein 2; RASSF4: Ras association domain family member 4; SESTD1: SEC14 and spectrin domain containing 1; SGO2: Shugoshin 2; SRD5A1: steroid 5 alpha-reductase 1; SRD5A2: Steroid 5 alpha-reductase 5; TARBP1: TAR RNA binding protein 1; TGM3: Transglutaminase 3; TRIO: Trio Rho guanine nucleotide exchange factor; ANGPTL6: Angiopoietin like 6; BMPER: BMP-binding endothelial regulator; DPF3: Double PHD fingers 3; EGLN: Endoglin; FNIP2: Folliculin interacting protein 2; GCDH: Glutaryl-CoA dehydrogenase; GDF2: Growth differentiation factor 2; GREM2: Gremlin 2; HMGCL: 3-hydroxy-3-methylglutaryl-CoA lyase; HOXA13: Homeobox A13; KIF15: Kinesin family member 15; LYVE1: Lymphatic vessel endothelial hyaluronan receptor 1; MS4A7: Membrane spanning 4-domains A7; MT1G: Metallothionein 1G; NOSTRIN: Nitric oxide synthase trafficking; PCDH9: Protocadherin 9; PDE7B: Phosphodiesterase 7B; PLVAP: Plasmalemma vesicle associated protein; RNF125: Ring finger protein 125; RNF165: Ring finger protein 165; RRAGD: Ras related GTP binding D; SERPINB9: Serpin family B member 9; SESN3: Sestrin 3; SLC7A2: Solute carrier family 7 member 2; Smad1/5/8: SMAD family member 1/5/8; SPARCL1: SPARC like 1; SRGN: Serglycin; STC1: Stanniocalcin; TPX2: TPX2 microtubule nucleation factor; TRIM16: Tripartite motif containing 16; Vegf: Vascular endothelial growth factor; VSIG4: V-set and immunoglobulin domain containing 4; ZFP: Zinc finger protein; AKR1D1: Aldo-keto reductase family 1 member D1; ALDH: Aldehyde dehydrogenase; ALDH1A3: Aldehyde dehydrogenase 1 family member A3; ALDH6A1: Aldehyde dehydrogenase 6 family member A1; ALDH8A1: Aldehyde dehydrogenase 8 family member A1; ANK3: Ankyrin 3; CA2: Carbonic anhydrase 2; COBLL1: Cordon-bleu WH2 repeat protein like 1; CYP39A1: Cytochrome P450 family 39 subfamily A member 1; ENAH: ENAH actin regulator; ESM1: Endothelial cell specific molecule 1; FOS: Fos proto-oncogene, AP-1 transcription factor subunit; GBA: Glucosylceramidase beta; GLS2: Glutaminase 2; GPM6A: Glycoprotein M6A; GPSM2: G protein signaling modulator 2; GRHPR: Glyoxylate and hydroxypyruvate reductase; GUCY1A1: Guanylate cyclase 1, soluble, alpha 1; HIST1H3H: Histone cluster 1 H3 family member h; histone-lysine N-methyltransferase; HOOK1: Hook microtubule tethering protein 1; MECOM: MDS1 and EVI1 complex locus; NCKAP1L: NCK associated protein 1 like; NSD2: Nuclear receptor binding SET domain protein 2; PALM2: Paralemmin 2; PXMP2: Peroxisomal membrane protein 2; Rab11: Rab 11 protein; RAB11FIP4: RAB11 family interacting protein 4; sGC: Soluble guanylyl cyclase; SLC1A1: Solute carrier family 1 member 1; SMYD3: SET and MYND domain containing 3; Sos: Son of Sevenless; TSKU: Tsukushi, small leucine rich proteoglycan; UXS1: UDP-glucuronate decarboxylase 1.

Figure 2

Top network (cell cycle; cellular assembly and organization; DNA replication, recombination and repair) identified by ingenuity pathway analysis Network analysis. Legend illustrates class of the gene. Red indicates upregulation and green downregulation, with shade depicting magnitude of change. Solid and dashed lines depict direct and indirect, respectively, relationship between genes. Figure was generated using ingenuity pathway analysis. TENM1: Teneurin transmembrane protein 1; GGT5: Gamma-glutamyltransferase5; SLC1A4: Solute carrier family 1 member 4; OVOS2: Alpha-2-macroglobulin like 1 pseudogene; XK: X-linked Kx blood group; WHRN: Whirlin; MPC1: Mitochondrial pyruvate carrier 1; LILRB5: Leukocyte immunoglobulin like receptor B5; CNDP1: Carnosine dipeptidase 1; CENPM: Centromere protein M; CENPL: Centromere protein L; ESR1: Estrogen receptor 1; C4BP: C4b-binding protein; TUFT1: Tuftelin 1; DSN1: MIND kinetochore complex component; KNL1: Kinetochore scaffold 1; BUB1: Mitotic checkpoint serine/threonine-protein kinase BUB1; HPS5: Hermansky-Pudlak syndrome 5 protien; FCN3: Ficolin 3; MASP1: MBL associated serine protease 1; MBL2: Mannose binding lectin 2; HIST1H2BF: Histone H2B type 1; OIP5: Opa interacting protein 5; NDC80: Kinetochore protein NDC80; BUB1B: BUB1 mitotic checkpoint serine/threonine kinase B; MND1: Meiotic nuclear divisions 1; CENPW: Centromere protein W; CENPH: Centromere protein H; CENPK: Centromere protein K; CENPA: Centromere protein A. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Top network (cell cycle; cellular assembly and organization; DNA replication, recombination and repair) identified by ingenuity pathway analysis Network analysis. Legend illustrates class of the gene. Red indicates upregulation and green downregulation, with shade depicting magnitude of change. Solid and dashed lines depict direct and indirect, respectively, relationship between genes. Figure was generated using ingenuity pathway analysis. TENM1: Teneurin transmembrane protein 1; GGT5: Gamma-glutamyltransferase5; SLC1A4: Solute carrier family 1 member 4; OVOS2: Alpha-2-macroglobulin like 1 pseudogene; XK: X-linked Kx blood group; WHRN: Whirlin; MPC1: Mitochondrial pyruvate carrier 1; LILRB5: Leukocyte immunoglobulin like receptor B5; CNDP1: Carnosine dipeptidase 1; CENPM: Centromere protein M; CENPL: Centromere protein L; ESR1: Estrogen receptor 1; C4BP: C4b-binding protein; TUFT1: Tuftelin 1; DSN1: MIND kinetochore complex component; KNL1: Kinetochore scaffold 1; BUB1: Mitotic checkpoint serine/threonine-protein kinase BUB1; HPS5: Hermansky-Pudlak syndrome 5 protien; FCN3: Ficolin 3; MASP1: MBL associated serine protease 1; MBL2: Mannose binding lectin 2; HIST1H2BF: Histone H2B type 1; OIP5: Opa interacting protein 5; NDC80: Kinetochore protein NDC80; BUB1B: BUB1 mitotic checkpoint serine/threonine kinase B; MND1: Meiotic nuclear divisions 1; CENPW: Centromere protein W; CENPH: Centromere protein H; CENPK: Centromere protein K; CENPA: Centromere protein A. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material). Top disease and functions identified by ingenuity pathway analysis network analysis P-score indicates statistical significance [p-score = -log10(P value)] and the number of focus genes indicates the number of genes in our analysis that are a part of the respective network. Genes that are labeled red are upregulated in our analysis and genes that are green are downregulated. Genes in black are a part of the network but were not featured in our results. For P values and experimental log ratios of genes see Supplementary Table 1. TENM1: Teneurin transmembrane protein 1; GGT5: Gamma-glutamyltransferase5; SLC1A4: Solute carrier family 1 member 4; OVOS2: Alpha-2-macroglobulin like 1 pseudogene; XK: X-linked Kx blood group; WHRN: Whirlin; MPC1: Mitochondrial pyruvate carrier 1; LILRB5: Leukocyte immunoglobulin like receptor B5; CNDP1: Carnosine dipeptidase 1; CENPM: Centromere protein M; CENPL: Centromere protein L; ESR1: Estrogen receptor 1; C4BP: C4b-binding protein; TUFT1: Tuftelin 1; DSN1: MIND kinetochore complex component; KNL1: Kinetochore scaffold 1; BUB1: Mitotic checkpoint serine/threonine-protein kinase BUB1; HPS5: Hermansky-Pudlak syndrome 5 protien; FCN3: Ficolin 3; MASP1: MBL associated serine protease 1; MBL2: Mannose binding lectin 2; HIST1H2BF: Histone H2B type 1; OIP5: Opa interacting protein 5; NDC80: Kinetochore protein NDC80; BUB1B: BUB1 mitotic checkpoint serine/threonine kinase B; MND1: Meiotic nuclear divisions 1; CENPW: Centromere protein W; CENPH: Centromere protein H; CENPK: Centromere protein K; CENPA: Centromere protein A; ANKS6: Ankyrin repeat and SAM domain-containing protein 6; Ap1: Activator protein 1; AURKA: Aurora A kinase; B9D1: B9 domain containing 1; BCKDHB: 2-Oxoisovalerate dehydrogenase subunit beta; BOLA2/BOLA2B: BolA family member 2; CA5A: Carbonic anhydrase 5A; CBX5: Chromobox protein homolog 5; CCT3: Chaperonin containing TCP1 complex; CDK1: Cyclin-dependent kinase 1; CDKN2A: Cyclin dependent kinase inhibitor 2A; CEMIP: Cementum protein 1; EPB41L5: Erythrocyte membrane protein band 4.1 like 5; ETFRF1: Electron transfer flavoprotein regulatory factor 1; EZH2: Enhancer of zeste 2 polycomb repressive complex 2 subunit; H2AFX: H2A histone family, member X; HIST1H2AM: Histone cluster 1, H2am; HMGA1: High mobility group AT-hook 1; KIF11: Kinesin family member 11; MCM2: Mini-chromosome maintenance complex component 2; MFAP4: Microfibril associated protein 4; MKI67: Marker of proliferation Ki-67; NAT2: N-acetyltransferase 2; NGFR: Nerve growth factor receptor; NT5DC2: 5’-nucleotidase domain containing 2; PRKDC: Protein kinase, DNA-activated, catalytic subunit; Rnr: Ribonucleotide reductase; SETDB1: SET domain bifurcated histone lysine methyltransferase 1; Smad2/3: SMAD family member 2/3; TCF19: Transcription factor 19; TK1: Thymidine kinase 1; TMEM131L: Transmembrane 131 like; TUBE1: Tubulin epsilon 1; ZSWIM5: Zinc finger SWIM-type containing 5; ATAD2: ATPase family AAA domain containing 2; BMP: Bone morphogenetic protein; BMP5: Bone morphogenetic protein 5; CCBE1: Collagen and calcium binding EGF domains 1; CEP55: Centrosomal protein 55; CTH: Cystathionine gamma-lyase; DTL: Denticleless E3 ubiquitin protein ligase homolog; ECT2: Epithelial cell transforming 2; GORASP2: Golgi reassembly stacking protein 2; IGF2BP1: Insulin like growth factor 2 mRNA binding protein 1; IL12 (family): Interleukin-12; IL18R1: Interleukin-18 receptor 1; IL1RAP: Interleukin-1 receptor accessory protein; IPO9: Importin 9; KIF14: Kinesin-like protein 14; KIF23: Kinesin-like protein 23; LUM: Lumican; MAP1LC3: Microtubule-associated protein 1 light chain 3 beta; MSH2: MutS homolog 3; NAAA: N-acylethanolamine acid amidase; NUP62: Nucleoporin 62; OLA1: Obg like ATPase 1; PBLD: Phenazine biosynthesis like protein domain containing; PLSCR4: Phospholipid scramblase 4; RAD54B: RAD 54 homolog B; SIGLEC1: Sialic acid binding Ig like lectin 1; STAU2: Staufen double-stranded RNA binding protein 2; TEX37: Testis expressed 37; TRAF5: TNF receptor associated factor 5; TRIP13: Thyroid hormone receptor interactor 13; VSIG2: V-set and immunoglobulin domain containing 2; VWA8: Von Willebrand factor A domain containing 8; WDYHV1: WDYHV motif containing 1; XPO5: Exportin 5; ACAA2: Acetyl-CoA acyltransferase 2; ARMC6: Armadillo repeat containing 6; BCHE: Butyrylcholinesterase; C4BPA: Complement component 4 binding protein alpha; CDCA3: Cell division cycle associated 3; CDCA8: Cell division cycle associated 8; CENPE: Centromere protein E; CENPF: Centromere protein F; CRNDE: Colorectal neoplasia differentially expressed; ENO3: Enolase 3; FOXM1: Forkhead box M1; KALRN: Kalirin RhoGEF kinase; KIF20A: Kinesin family member 20A; KIF2C: Kinesin family member 2C; KIF4A: Kinesin family member 4A; LRAT: Lecithin-retinol acyltransferase; MAGEA3/MAGEA6: MAGE family member A3/A6; MZT1: Mitotic spindle organizing protein 1; NAV1: Neuron navigator 1; NEB: Nebulin; PRC1: Protein regulator of cytokinesis 1; RAS: RAS GTPase; RASGRP2: RAS guanyl releasing protein 2; RASSF4: Ras association domain family member 4; SESTD1: SEC14 and spectrin domain containing 1; SGO2: Shugoshin 2; SRD5A1: steroid 5 alpha-reductase 1; SRD5A2: Steroid 5 alpha-reductase 5; TARBP1: TAR RNA binding protein 1; TGM3: Transglutaminase 3; TRIO: Trio Rho guanine nucleotide exchange factor; ANGPTL6: Angiopoietin like 6; BMPER: BMP-binding endothelial regulator; DPF3: Double PHD fingers 3; EGLN: Endoglin; FNIP2: Folliculin interacting protein 2; GCDH: Glutaryl-CoA dehydrogenase; GDF2: Growth differentiation factor 2; GREM2: Gremlin 2; HMGCL: 3-hydroxy-3-methylglutaryl-CoA lyase; HOXA13: Homeobox A13; KIF15: Kinesin family member 15; LYVE1: Lymphatic vessel endothelial hyaluronan receptor 1; MS4A7: Membrane spanning 4-domains A7; MT1G: Metallothionein 1G; NOSTRIN: Nitric oxide synthase trafficking; PCDH9: Protocadherin 9; PDE7B: Phosphodiesterase 7B; PLVAP: Plasmalemma vesicle associated protein; RNF125: Ring finger protein 125; RNF165: Ring finger protein 165; RRAGD: Ras related GTP binding D; SERPINB9: Serpin family B member 9; SESN3: Sestrin 3; SLC7A2: Solute carrier family 7 member 2; Smad1/5/8: SMAD family member 1/5/8; SPARCL1: SPARC like 1; SRGN: Serglycin; STC1: Stanniocalcin; TPX2: TPX2 microtubule nucleation factor; TRIM16: Tripartite motif containing 16; Vegf: Vascular endothelial growth factor; VSIG4: V-set and immunoglobulin domain containing 4; ZFP: Zinc finger protein; AKR1D1: Aldo-keto reductase family 1 member D1; ALDH: Aldehyde dehydrogenase; ALDH1A3: Aldehyde dehydrogenase 1 family member A3; ALDH6A1: Aldehyde dehydrogenase 6 family member A1; ALDH8A1: Aldehyde dehydrogenase 8 family member A1; ANK3: Ankyrin 3; CA2: Carbonic anhydrase 2; COBLL1: Cordon-bleu WH2 repeat protein like 1; CYP39A1: Cytochrome P450 family 39 subfamily A member 1; ENAH: ENAH actin regulator; ESM1: Endothelial cell specific molecule 1; FOS: Fos proto-oncogene, AP-1 transcription factor subunit; GBA: Glucosylceramidase beta; GLS2: Glutaminase 2; GPM6A: Glycoprotein M6A; GPSM2: G protein signaling modulator 2; GRHPR: Glyoxylate and hydroxypyruvate reductase; GUCY1A1: Guanylate cyclase 1, soluble, alpha 1; HIST1H3H: Histone cluster 1 H3 family member h; histone-lysine N-methyltransferase; HOOK1: Hook microtubule tethering protein 1; MECOM: MDS1 and EVI1 complex locus; NCKAP1L: NCK associated protein 1 like; NSD2: Nuclear receptor binding SET domain protein 2; PALM2: Paralemmin 2; PXMP2: Peroxisomal membrane protein 2; Rab11: Rab 11 protein; RAB11FIP4: RAB11 family interacting protein 4; sGC: Soluble guanylyl cyclase; SLC1A1: Solute carrier family 1 member 1; SMYD3: SET and MYND domain containing 3; Sos: Son of Sevenless; TSKU: Tsukushi, small leucine rich proteoglycan; UXS1: UDP-glucuronate decarboxylase 1.

Analysis of potential drivers of HBV-HCC pathogenesis

To investigate genes that most greatly influenced our gene dataset and oncogenesis we used IPA to identify top upstream regulators (see methods). We searched for genes that were the identified as the most activated up-regulators in IPA and were upregulated in our dataset (see Supplementary Table 4). The most activated upstream regulator was RABL6, a member of the Ras family of GTPases[27]. We also found activation of transcription factors T-box transcription factor 2 (TBX2) involved in hepatocyte proliferation, migration, and invasion[28]. We also noted activation of E2F transcription factor 1 (E2F1), which has a mixed but predominantly proliferative role in HCC[29]. Additionally, we found activation of the transcription factor forkhead box M1 (FOXM1), which promotes cell turnover through CCNB1 and CCND1 upregulation and through other mechanisms[30,31]. Other activated transcription factors implicated in tumor activity included RUNX transcription factor 1, melanocyte inducing transcription factor, and homeobox A10 (HOXA10)[32-35]. Moreover, other top upstream regulators demonstrated varied biologic activity. For example, E1A binding protein P400 (EP400), a component of the NuA4 histone acetyltransferase complex, is associated with epigenetic activity[36]. Other upstream regulators are proteins involved in signaling pathways including actin like 6a, an actin binding protein involved in Notch1/SOX2 signaling, LIN9-MYBL2 [interacts with the tumor suppressors such as retinoblastoma (Rb) protein], SHC adaptor protein 1 (SHC1, a signaling adaptor for growth factor receptors), protein inhibitor of activated STAT 4 (PIAS4, a protein inhibitor of STAT), and Tumor necrosis factor (TNF) receptor associated factor 2 [TRAF2, role in TNF and nuclear factor-kappaB (NF-κB)][37-42]. We also found activation of RNA binding proteins including ELAVL1, a regulator of ferroptosis in hepatic stellate cells, as well as oncogenic members of the negative elongation factor family NELFE, NELFA, and NELFCD[43]. Lastly, we found activation of oncogenesis-promoting kinases such as mitogen activated kinase 4 and neurotrophic receptor kinase 2 (NTRK2), which aid in cell adhesion[44,45]. We next focused on inhibited upstream regulators (Supplementary Table 5), several of which are implicated in the innate immune response including the innate receptor DExD/H-Box helicase 58 (DDX58), or RIG-I, and the pattern recognition and toll-like receptors (TLR)2, TLR3, TLR4, and TLR9[46,47]. There was predicted inhibition and downregulation of inflammatory mediators NF-κB and interleukins interleukin (IL)-5, IL-12, IL-18, and IL-33. We also found downregulation and predicted inhibition of the immune co-stimulatory molecule CD86, as well as forkhead head transcription factor FOXO3, a mediator of the antioxidant response and autophagy[48]. Several other inhibited upstream regulators are well-described tumor suppressors such as TP53, CDKN1A, Rb gene, and Rb transcriptional suppressor type 2[49]. Additionally, we found predicted inhibition of hepatocyte nuclear factors HNF4 and HNF41A, and the LXR NR1H3, which have been shown to exhibit anti-tumor activity[50-53]. We also found downregulation and predicted inhibition of the transcription factor CCAAT enhancer binding protein delta (CEBPD), a regulator of apoptosis and potential tumor suppressor[54]. Lastly, we observed downregulation and predicted inhibition of the SAM domain, SH3 domain, and nuclear localization signals 1 (SAMSN1), a lung cancer tumor suppressor that is hypermethylated in HCC[55,56].

Causal networks of HBV-HCC

To elucidate the pathologic potential of the upstream regulators described above, we assessed downstream effector genes through IPA. We focused on upstream regulators with high activation z-scores. We first investigated RABL6 as it is the most activated upstream regulator in our analysis and its target genes play important roles in HBV-HCC (see Supplementary Table 4). Most of the activated genes are involved in promoting cell division. For example, the mitotic spindle checkpoint genes BUB1 and BUB1B, several cyclins including CCNA2, CCNB2, and CCNE2, and the M-phase inducer CDC25C were all upregulated[57]. Likewise, we observed upregulation of centromere protein F, helicase RAD54B, and topoisomerase TOP2A. We also found upregulation of the mitogen PBK, NEK2 (a regulator of mitotic progression), and the kinase TTK, all of which promote HCC cell proliferation and migration via Akt signaling[21,22,58]. Similarly, we found upregulation of minichromosome maintenance family (MCMs) members MCM2 and MCM10, which also promote cell division[59-61]. There are also downstream genes of RABL6 with recently described roles in cancer. For example, we found upregulation of the ubiquitin-conjugating enzyme E2C. Knockdown of this gene has been shown to suppress cellular proliferation, migration, and invasion in HCC[62]. Lastly, RABL6 may mediate HCC progression through downregulation of enhancer of zeste homolog 2, which regulates histone and DNA methylation and silences tumor suppressors[63,64]. Thus, our analysis suggests RABL6 promotes HCC through several pro-oncogenic mechanisms. A summary of the downstream effects of RABL6 are presented in Figure 3.
Figure 3

Ingenuity pathway analysis of rab-like protein 6 signaling in hepatitis B-related hepatocellular carcinoma tumors. Genes are implicated in several disease potential disease processes including inflammation, cell division, Akt signaling, and more. Legend illustrates relationship between genes. See Figure 2 legend for identification of shapes. RABL6: RAB, member RAS oncogene family like 6; TP53: Tumor protein p53; E2F4: E2F transcription factor 4; FOXO1: Forkhead box O1; FOXM1: Forkhead box M1; MCM10: Mini-chromosome maintenance 10; BUB1: Mitotic checkpoint serine/threonine-protein kinase BUB1; TTK: TTK protein kinase; UBE2C: Ubiquitin-conjugating enzyme E2 C; TOP2A: DNA topoisomerase IIα; CCNA2: Cyclin A2; CENPF: Centromere protein F; EZH2: Enhancer of zeste homolog 2; NEK2: Serine/threonine-protein kinase Nek2. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Ingenuity pathway analysis of rab-like protein 6 signaling in hepatitis B-related hepatocellular carcinoma tumors. Genes are implicated in several disease potential disease processes including inflammation, cell division, Akt signaling, and more. Legend illustrates relationship between genes. See Figure 2 legend for identification of shapes. RABL6: RAB, member RAS oncogene family like 6; TP53: Tumor protein p53; E2F4: E2F transcription factor 4; FOXO1: Forkhead box O1; FOXM1: Forkhead box M1; MCM10: Mini-chromosome maintenance 10; BUB1: Mitotic checkpoint serine/threonine-protein kinase BUB1; TTK: TTK protein kinase; UBE2C: Ubiquitin-conjugating enzyme E2 C; TOP2A: DNA topoisomerase IIα; CCNA2: Cyclin A2; CENPF: Centromere protein F; EZH2: Enhancer of zeste homolog 2; NEK2: Serine/threonine-protein kinase Nek2. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material). Since RABL6 has not been described in HCC, we conducted survival analysis using the GEP1A2 platform. GEP1A2 is a website that uses patient samples from TCGA, which is used for bioinformatic analysis on genes of interest among differ cancer types[19]. We examined prognosis based on quartile expression of RABL6 (upper 75% vs lower 25%) in HCC patients. We found a statistically lower chance of survival with higher expression of RABL6. The upstream regulators TBX2, E2F1, FOXM1, and EP400 share similar activated genes to those described for RABL6 (see Supplementary material), such as HOXA10, and thus may act synergistically to promote HBV-HCC. This prompted us to next study activated genes downstream of HOXA10 due to its stark upregulation and high activation z-score (see Supplementary Table 6). While role of HOXA10 is not well-defined in HCC, knockdown model has been recently shown to inhibit HCC cell tumorigenesis[35]. Additionally, another study by Shao et al[65] demonstrated the involvement of HOXA10 in the renewal and survival of liver tumor initiation cells. IPA identified several genes downstream of HOXA10 that may explain its pathogenic activity. For example, HOXA10 may induce tumor progression through downregulation of the tumor suppressor gene NDRG2 as well as glutathione S-transferase A3 or GSTA3, whose inactivity results in hepatocyte oxidative stress and liver injury[66,67]. We also found upregulation of dickkopf-1, a negative regulator of Wnt signaling and negative prognosticator for HCC[68]. Insulin-like growth factor binding protein-3 is a potential mediator of growth suppression signals and is downregulated in our dataset[69]. The hepatic enzyme CYP2E1 was similarly downregulated and is known to be repressed in HCC and linked with a poor prognosis[70]. Lastly, xanthine dehydrogenase, a rate-limiting enzyme in purine metabolism, was downregulated and its suppression has been linked to enhanced cancer stem-cell activity in HCC[71]. A suggested model of the potential multifactorial role HOXA10 and its interplay in HCC is shown in Figure 4.
Figure 4

Ingenuity pathway analysis of homeobox A10 activity in hepatitis B-related hepatocellular carcinoma. Homeobox A10 signaling has potential implications on tumor suppression, liver metabolism, and other disease-related activity. Genes and location are shown above. Legend illustrates relationship between genes and gene classification. COL15A1: Collagen alpha-1(XV) chain; IGFBP3: Insulin-like growth factor binding protein 3; DKK1: Dickkopf-related protein 1; CFD: Complement factor D; LEPR: Leptin receptor; BCHE: Butyrylcholinesterase; THBD: Thrombomodulin; SFRP4: Secreted frizzled-related protein 4; ALPL: Alkaline phosphatase; CYP2E1: Cytochrome P450 2E1; TDO2: Tryptophan 2,3-dioxygenase; NDRG2: N-myc downstream-regulated gene family member 2; GSTA3: Glutathione S-transferase A3; PHGDH: Phosphoglycerate dehydrogenase; XDH: Xanthine dehydrogenase; INMT: Indolethylamine N-methyltransferase; SCD: Stearoyl-CoA desaturase; HBG1: Hemoglobin subunit gamma 1; ADH4: Alcohol dehydrogenase 4; HOXA10: Homeobox A10; THRSP: Thyroid hormone-inducible hepatic protein. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Ingenuity pathway analysis of homeobox A10 activity in hepatitis B-related hepatocellular carcinoma. Homeobox A10 signaling has potential implications on tumor suppression, liver metabolism, and other disease-related activity. Genes and location are shown above. Legend illustrates relationship between genes and gene classification. COL15A1: Collagen alpha-1(XV) chain; IGFBP3: Insulin-like growth factor binding protein 3; DKK1: Dickkopf-related protein 1; CFD: Complement factor D; LEPR: Leptin receptor; BCHE: Butyrylcholinesterase; THBD: Thrombomodulin; SFRP4: Secreted frizzled-related protein 4; ALPL: Alkaline phosphatase; CYP2E1: Cytochrome P450 2E1; TDO2: Tryptophan 2,3-dioxygenase; NDRG2: N-myc downstream-regulated gene family member 2; GSTA3: Glutathione S-transferase A3; PHGDH: Phosphoglycerate dehydrogenase; XDH: Xanthine dehydrogenase; INMT: Indolethylamine N-methyltransferase; SCD: Stearoyl-CoA desaturase; HBG1: Hemoglobin subunit gamma 1; ADH4: Alcohol dehydrogenase 4; HOXA10: Homeobox A10; THRSP: Thyroid hormone-inducible hepatic protein. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material). Next, we investigated the downstream signaling of PIAS4 given its significant upregulation and activation z-score (see Supplementary Table 4). PIAS4 involvement in HCC has been recently described[41]. Downstream of PIAS4, we found upregulation of lymphoid enhancer factor 1 and downregulation of protocadherin 9, both of which promote epithelial-mesenchymal transition in HCC[72-74]. We also found downregulation of fatty acid binding protein 1, which has been shown to reduce oxidative stress, a major contributor to HCC development[75]. Its loss may also lead to microsatellite instability in colorectal carcinomas and may have similar effects in HCC[76]. Furthermore, we found downregulation of albumin, with evidence showing albumin itself suppressing HCC cellular proliferation[77]. These results suggest a mechanistic role for PIAS4 in HCC progression (Figure 5).
Figure 5

Protein inhibitor of activated STAT 4 and SH3 domain, and nuclear localization signals 1 potential role in homeobox A10 have only been recently described and much remains to be understood. Ingenuity pathway analysis analysis demonstrated activation of protein inhibitor of activated STAT 4, and activation of downstream genes implicated in epithelial-mesenchymal transition through LEF1 and protocadherin 9. Our analysis also demonstrated inhibition of SH3 domain, and nuclear localization signals 1 with downstream effects on viral recognition and regulation of cell survival. Legend illustrates relationship between genes. See legend of Figure 3 for classification of genes. ALB: Albumin; FABP1: Fatty acid binding protein 1; MARCO: Macrophage receptor with collagenous structure; FGL2: Fibrinogen like 2; SAMSN1: SAM domain, SH3 domain and nuclear localization signals 1; USP12: Ubiquitin-specific protease 12; PIAS4: Protein inhibitor of activated STAT protein gamma; PCDH9: Protocadherin 9; LEF1: Lymphoid enhancer-binding factor 1; TLR3: Toll-like receptor 3. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Protein inhibitor of activated STAT 4 and SH3 domain, and nuclear localization signals 1 potential role in homeobox A10 have only been recently described and much remains to be understood. Ingenuity pathway analysis analysis demonstrated activation of protein inhibitor of activated STAT 4, and activation of downstream genes implicated in epithelial-mesenchymal transition through LEF1 and protocadherin 9. Our analysis also demonstrated inhibition of SH3 domain, and nuclear localization signals 1 with downstream effects on viral recognition and regulation of cell survival. Legend illustrates relationship between genes. See legend of Figure 3 for classification of genes. ALB: Albumin; FABP1: Fatty acid binding protein 1; MARCO: Macrophage receptor with collagenous structure; FGL2: Fibrinogen like 2; SAMSN1: SAM domain, SH3 domain and nuclear localization signals 1; USP12: Ubiquitin-specific protease 12; PIAS4: Protein inhibitor of activated STAT protein gamma; PCDH9: Protocadherin 9; LEF1: Lymphoid enhancer-binding factor 1; TLR3: Toll-like receptor 3. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material). Lastly, we investigated how inhibition of SAMSN1 may contribute to HCC. As mentioned previously, SAMSN1 inhibition is linked to malignant HCC tumorigenesis[55]. From our IPA analysis, inhibition of SAMSN1 has immunologic implications including downregulation of the pattern recognition receptor TLR3 and macrophage receptor with collagenous structure. These changes are known to negatively impact HCC prognosis[46,78]. Furthermore, we found downregulation of the de-ubiquitinase USP12, which complexes with WD repeat protein WDR48 to suppress Akt signaling and tumor cell survival[79]. Our suggested model of molecular mechanisms linking SAMSN1 inhibition with HCC is showed in Figure 5.

DISCUSSION

Despite robust vaccination strategies in some countries, hepatitis B infection remains a leading global cause of liver cancer[1,3]. Therapeutic options for HBV-related HCC remain poor owing to an overall lack of understanding of pathways involved in HBV oncogenesis. Current literature suggests HCC development is a result of aberrant activation of cellular signaling processes such as Wnt/FZD/β-catenin, PI3K/Akt/mTOR, IRS1/IGF, and Ras/Raf/MAPK[80]. Even with such knowledge, directed therapy for HBV-related HCC cases requires a more detailed understanding of the interactome. In this meta-analysis, we found potential underlying cellular pathways that define HBV-related HCC and its disease mechanisms. Our results build upon known contributors to HBV-related HCC including LXR/FXR/RXR signaling, Akt signaling, and immunological changes within the tumor microenvironment that are favorable for both HBV infection and tumor progression. We also illustrate the possible activities of upstream regulators, whose role in HBV-related HCC are not well described, such as RABL6, HOXA10, PIAS4, and SAMSN1. We began our analysis by studying the top canonical pathways identified by IPA, which included LXR/RXR activation, LPS-IL-1 mediated RXR inhibition, acetone degradation, melatonin degradation, and FXR/RXR activation. LXR heterodimerizes with RXR and bind to LXR response elements, directly regulating gene expression[81]. LXR primarily regulates expression of genes essential for lipid metabolism and is a known HCC tumor-suppressor[82]. Similarly, the bile acid regulator FXR (NR1H4) suppresses hepatocarcinogenesis and was starkly downregulated in our analysis[83]. In addition, inhibition of the acetone degradation pathway in HCC suggests alteration hepatic ketone metabolism, suggesting an increase in acetone levels may serve as a disease biomarker[84]. Lastly, we saw predicted inhibition of the melatonin degradation pathway, which would presumably lead to a rise in melatonin. Melatonin has been demonstrated to inhibit HCC progression through let7i-3p-mediated RAF1 suppression[85]. Overall, the top canonical pathways above reinforce the roles of LXR/RXR/FXR signaling in HCC pathogenesis and suggests a role for melatonin degradation in HBV-related HCC. Top up- and downregulated genes identified in our analysis are implicated in oncogenic cellular signaling and other pathologic processes. For example, GPC3, the most upregulated gene in our analysis, functions as a co-receptor for Wnt proteins[86]. Wnt signaling is vital for hepatobiliary function and cell differentiation. Therefore it’s no surprise that aberrations in activity are major contributors to HCC tumorigenesis and other liver disorders[87]. GPC3 is also implicated in hedgehog signaling[20], another important regulator of cell growth and differentiation; overactivation of which is associated with multiple cancer types including HCC[88]. In vitro studies suggest GPC3 mediates cell proliferation through hedgehog signaling[89]. Additionally, the negative hedgehog regulator HHIP was one of the most downregulated genes in our dataset[26]. This change may act synergistically with GPC3 to further promote hedgehog signaling and cellular proliferation. Lastly, we found the kinase PBK as one of the most upregulated genes. This kinase is related to the MAPKK family and has been recently been shown to promote HCC metastasis through ETV4a-uPar signaling[22,90]. ETV4 is part of the ETS family of transcription factors and directly regulates cell division to promote pancreatic cancer and other cancer types[91,92]. In addition to activation of cellular signaling pathways described above, our results also highlight changes in the tumor microenvironment and the immune response that may confer advantages to HBV infection and tumor progression. Our most downregulated genes included several members of the C-type lectin family including CLEC4G, CLEC1B, and CLEC4M. C-type lectins function in both the innate and adaptive immune response and downregulation of has been demonstrated in HCC[24,93]. Lower expression of CLEC1B is associated with poorer outcomes in HCC[94]. In addition, we found downregulation of pattern recognition receptors, like TLR3 and TLR2, implying impairment of the innate immune response. For example, TLR3, a receptor that recognizes viral components and double-stranded RNA[95], has been associated with control of HBV infection and apoptosis of HCC cells[46]. Likewise, TLR2, whose activity limits HCC cellular proliferation, was downregulated[47]. We also noted downregulation of the innate immune receptor DDX58 or RIG-I. Elevated RIG-I expression limits HCC cellular proliferation and invasion[96]. Moreover, HBV limits RIG-I signaling through induction of the miRNA miR146a[97]. Lastly, other immunologic changes of note were repression of CD86 and IL-18. CD86 is a co-stimulatory molecule that has been well described as an anti-tumor response inducer through stimulation of cytotoxic T cells and other means[98]. While the role of IL-18 in cancer is unclear, expression of IL-18 exhibits anti-tumor effects through the recruitment of tumor-infiltrating T cells[99]. Thus, downregulation of IL-18 in the tumor microenvironment could be a prime contributor to HCC tumor progression. Our analysis revealed FOXM1, E2F1, and EP400 as activated top upstream regulators in HBV-related HCC, each of which play a prominent role in facilitating cancer proliferation. FOXM1 has previously been shown to promote HCC progression via expression of genes KIF4A and CCNB1[22]. Additionally, FOXM1 promotes tumor cell proliferation via increasing expression of CCNB1 and CCND1, and decreasing expression of cell cycle checkpoint molecules p27 and p21[100]. E2F1 is a transcription factor that has been shown to have both proliferative and apoptotic effects in HCC although the proliferative effects seem to be more prominent[29]. E2F1 activates MYBL2 (another upregulated transcription factor in our analysis) and is involved in cell cycle progression[101]. EP400 is a component of NuA4 histone acetyltransferase complex and is associated with activation of various genes. Recent studies have revealed it to be a critical transcription factor associated with greater HCC relapse and lower overall survival[36]. In addition, our results showed RABL6, ESR1, NR0B2, and CEBPD as top upstream regulators that negatively regulate HCC tumor suppression. RABL6 is a member of Ras GTPase family that is overexpressed in HCC[102]. Survival analysis suggests overexpression leads to a poorer prognosis (Figure 6). ESR1 was downregulated in our analysis, concordant with prior findings implicating ESR1 as a potential HCC tumor suppressor gene[103]. Moreover, our results showed downregulation of NR0B2 and CEBPD. NR0B2 is a nuclear receptor and tumor suppressor; downregulation of which is associated with HCC and renal cell carcinoma[104,105]. Similarly, CEBPD has also been posited as a candidate HCC tumor suppressor gene primarily through modulating IL-1 signaling[106].
Figure 6

Survival analysis comparing high and low rab-like protein 6 expression in survival of hepatocellular carcinoma patients. RABL6: Rab-like protein 6; HR: Hazards ratio. Plot generated using GPEIA2. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material).

Survival analysis comparing high and low rab-like protein 6 expression in survival of hepatocellular carcinoma patients. RABL6: Rab-like protein 6; HR: Hazards ratio. Plot generated using GPEIA2. Citation: Figures produced from IPA are available under an open-access CC-BY 4.0 license for purposes of publication. The authors have obtained the permission for figure using from the QIAGEN Digital Insights (Supplementary material). Interestingly, our results also revealed that HBV-related HCC progression may be intrinsically linked with repression of inflammatory and innate immune responses. Our analysis showed stark inhibition of the NF-κB pathway, a Myc-dependent driver of HCC tumorigenesis[107]. Indeed, other studies have proposed the NF-κB pathway is an important mediator of hepatic fibrosis and disease progression, especially when inhibition is pronounced[108]. While our meta-analysis allows for robust results of larger datasets, it has certain limitations. Annotations of public samples are limited and can introduce confounding variables to our analysis. For one, samples were taken from patients at different stages of HBV-related HCC. There may be significant differences in genetic aberrations based on tumor stage and grade. While samples were taken at the time of diagnosis, patient characteristics, such as ethnicity, comorbidities, and medications, were not clarified and may affect the results. Additionally, there may be differences in how samples were processed and how omics were performed between the studies included in our analysis. Of note, the studies in our meta-analysis did not clarify if changes in gene expression were attributable to HBV-DNA integration to the hepatocyte genome. Analysis focused on hepatocytes and the genes identified have links to HBV infection epiphonema as detailed above so a portion of the changes we have seen are associated to HBV. As explained, analysis focused on hepatocytes so gene changes would not be related to infiltrates. We did identify several genes involved in cell cycle regulation, but cellularity is important to function of these genes and could not be described in this approach. For future directions, we aim to validate our top genetic candidates using patient samples and control for such factors as stage of disease. We also hope to compare results between different etiologies of HCC such as HCV and alcohol. Doing so will elucidate the similarities and differences between these etiologies, allowing for a greater understanding of the oncogenic process while aiding the development of directed therapeutics for patient-specific treatment.

CONCLUSION

HBV is a leading cause of HCC and treatment options are still limited. In this meta-analysis based on public data, we studied the pathogenesis of HBV and pave the way for novel therapeutic avenues. We illustrated genetic changes that contribute to pro-oncogenic signaling through such pathways as the Akt, hedgehog, ETV4, and Wnt pathways. We also illustrated changes in the tumor microenvironment and immune response that are contributory to HBV infection and tumor progression. Additionally, we clarify the role of key upstream regulators such as RABL6, HOXA10, PIAS4, and SAMNS1 and describe how their downstream effects contribute to disease. These observations need to be further confirmed in prospective studies on oncogenesis. There is also need for investigating HBV-related cirrhosis and progressive changes of HBV-related HCC to assess the stepwise activity that define HBV oncogenesis.

ARTICLE HIGHLIGHTS

Research background

Hepatitis B virus (HBV) is a major cause of hepatocellular carcinoma (HCC) through several mechanisms including cirrhosis and direct oncogenic phenomena.

Research motivation

Studying HBV related HCC will offer novel insights to viral hepatic oncogenesis. It will also potentially lead to more directed therapy in HBV-related HCC.

Research objectives

Identify genetic changes and pathways that define HBV-related HCC. Identify novel therapeutic targets.

Research methods

Used our novel Search Tag Analyze Resource platform to mine liver biopsies from HBV-related HCC patients and used ingenuity pathway analysis to study the results of our meta-analysis.

Research results

Our meta-analysis highlighted several genes and pathways with oncogenic potential. Of note, we describe two potential novel mediators of oncogenesis in rab-like protein 6 (RABL6) and homeobox A10 (HOXA10).

Research conclusions

This meta-analysis describes possible roles of RABL6 and HOXA10 in the pathogenesis of HBV-related HCC. RABL6 and HOXA10 represent potential therapeutic targets and warrant further investigation.

Research perspectives

The next steps to our research is to validate RABL6 and HOXA10 relevance in HBV-related HCC using clinical samples and establish its mechanistic underpinnings in an animal model.
  107 in total

1.  Hepatocyte nuclear factor 4 alpha suppresses the development of hepatocellular carcinoma.

Authors:  Bei-Fang Ning; Jin Ding; Chuan Yin; Wei Zhong; Kun Wu; Xin Zeng; Wen Yang; Yue-Xiang Chen; Jun-Ping Zhang; Xin Zhang; Hong-Yang Wang; Wei-Fen Xie
Journal:  Cancer Res       Date:  2010-09-28       Impact factor: 12.701

2.  Meta-analysis in clinical trials.

Authors:  R DerSimonian; N Laird
Journal:  Control Clin Trials       Date:  1986-09

3.  Oncogenic Activation of the RNA Binding Protein NELFE and MYC Signaling in Hepatocellular Carcinoma.

Authors:  Hien Dang; Atsushi Takai; Marshonna Forgues; Yotsowat Pomyen; Haiwei Mou; Wen Xue; Debashish Ray; Kevin C H Ha; Quaid D Morris; Timothy R Hughes; Xin Wei Wang
Journal:  Cancer Cell       Date:  2017-07-10       Impact factor: 31.743

4.  Costimulation of antitumor immunity by the B7 counterreceptor for the T lymphocyte molecules CD28 and CTLA-4.

Authors:  L Chen; S Ashe; W A Brady; I Hellström; K E Hellström; J A Ledbetter; P McGowan; P S Linsley
Journal:  Cell       Date:  1992-12-24       Impact factor: 41.582

Review 5.  Tumor suppressors and cell metabolism: a recipe for cancer growth.

Authors:  Russell G Jones; Craig B Thompson
Journal:  Genes Dev       Date:  2009-03-01       Impact factor: 11.361

6.  A hepatitis B virus pre-S-retinoic acid receptor beta chimera transforms erythrocytic progenitor cells in vitro.

Authors:  M Garcia; H de Thé; P Tiollais; J Samarut; A Dejean
Journal:  Proc Natl Acad Sci U S A       Date:  1993-01-01       Impact factor: 11.205

7.  PIAS4, upregulated in hepatocellular carcinoma, promotes tumorigenicity and metastasis.

Authors:  Qiang Liu; Baoyong Zhou; Rui Liao; Xiong Zhou; Xiong Yan
Journal:  J Cell Biochem       Date:  2020-01-14       Impact factor: 4.429

8.  ACTL6A expression promotes invasion, metastasis and epithelial mesenchymal transition of colon cancer.

Authors:  Zhijun Zeng; Hao Yang; Shuai Xiao
Journal:  BMC Cancer       Date:  2018-10-22       Impact factor: 4.430

9.  HOXA10 knockdown inhibits proliferation, induces cell cycle arrest and apoptosis in hepatocellular carcinoma cells through HDAC1.

Authors:  Yi Zhang; Jie Chen; Shan-Shan Wu; Meng-Jiao Lv; Yong-Sheng Yu; Zheng-Hao Tang; Xiao-Hua Chen; Guo-Qing Zang
Journal:  Cancer Manag Res       Date:  2019-07-26       Impact factor: 3.989

10.  Albumin suppresses human hepatocellular carcinoma proliferation and the cell cycle.

Authors:  Shunsuke Nojiri; Takashi Joh
Journal:  Int J Mol Sci       Date:  2014-03-24       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.