Literature DB >> 29805517

Identification of key candidate genes and pathways in hepatocellular carcinoma by integrated bioinformatical analysis.

Tonghai Xing1, Tingmang Yan2, Qiang Zhou1.   

Abstract

Hepatocellular carcinoma (HCC) is one of the most common malignant neoplasms worldwide, however the underlying mechanisms and gene signatures of HCC are unknown. In the present study the profile datasets of four cohorts were integrated to elucidate the pathways and candidate genes of HCC. The expression profiles GSE25097, GSE45267, GSE57957 and GSE62232 were downloaded from the Gene Expression Omnibus database, including 436 HCC and 94 normal liver tissues. A total of 185 differentially expressed genes (DEGs) were identified in HCC, including 92 upregulated genes and 92 downregulated genes. Gene ontology (GO) was performed, which revealed that the upregulated DEGs were primarily enriched in cell division, mitotic nuclear division, mitotic cytokinesis and G1/S transition of the mitotic cell cycle. Pathway enrichment was analyzed based on the Kyoto Encyclopedia of Genes and Genomes database to assess the functional relevance of DEGs. The most significant module was selected from protein-protein interactions and 15 important hub genes were identified. The sub-networks of hub genes were involved in cell division, p53 signaling, and T lymphotropic virus type I infection signaling pathways. In conclusion, the present study revealed that the identified DEG candidate genes may promote the understanding of the cause and molecular mechanisms underlying the development of HCC and that these candidates and signal pathways may be potential targets of clinical therapy for HCC.

Entities:  

Keywords:  bioinformatical analysis; differentially expressed genes; hepatocellular carcinoma

Year:  2018        PMID: 29805517      PMCID: PMC5958738          DOI: 10.3892/etm.2018.6075

Source DB:  PubMed          Journal:  Exp Ther Med        ISSN: 1792-0981            Impact factor:   2.447


Introduction

Hepatocellular carcinoma (HCC) is one of the most common malignant neoplasms worldwide (1). Its incidence is particularly high in China (2) on account of hepatitis B (HBV) and C (HCV) infection. However, in the western world, HCC is primarily due to non-alcoholic fatty liver disease (NAFLD) (3,4). The aggressive nature of this malignancy combined with the limited therapeutic options result in poor prognosis (1,5). Unfortunately, HCC has a high recurrence rate even with successful surgical removal because of the insensitivity of HCC to chemotherapy agents and radiotherapy (6). Although there have been extensive previous studies on the molecular mechanism of HCC formation and progression, the causes of HCC are not yet clear. Hence, owing to the high morbidity and mortality in HCC, it is very important to determine the underlying molecular mechanisms and pathological processes and identify suitable molecular biomarkers for early HCC diagnosis. While HCC is a highly heterogeneous tumor, with different genetic compositions and multifaceted molecular pathogenesis (7,8), it is also a slow process during which plenty of genomic alterations accumulate and change the hepatocellular phenotype, leading to cellular intermediates and multiple monoclonal that evolve into HCC (9,10). Microarray technology has become an indispensable tool to monitor genome wide expression levels of genes in a given organism. Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) are two public functional genomics data repositories accepting data from diverse microarray platforms (11). Nowadays, gene expression studies by microarray have been performed to uncover molecular variations in HCC (12–14). Pathway analysis has shown several important cellular signaling pathway alterations to be linked with the main pathogenic mechanism (8). A small number of hub genes (proteins) were identified as key modulators in HCC metastasis by using protein-protein interaction (PPI) network. However, the most significantly dysregulated genes from previous studies are inconsistent because of the sample heterogeneity in independent studies, small sample size, and different statistical methods. In the present study, four original microarray datasets GSE25097, GSE45267, GSE57957, and GSE62232 were downloaded from the NCBI-Gene Expression Omnibus database (NCBI-GEO) (available online: https://www.ncbi.nlm.nih.gov/geo/), and included 436 HCC and 94 normal liver tissues. The microarray data was normalized and preprocessed using the robust multi-array average algorithm (RMA). The differentially expressed genes (DEGs) were obtained using the limma package. Venny was applied to filter the overlapping DEGs among groups. Gene ontology (GO) and pathway enrichment analysis were also performed for annotation and visualization with DAVID and KEGG PATHWAY (Available online: http://www.genome.jp/kegg). We used the STRING online database (Available online: http://string-db.org) to annotate functional interactions of DEGs. The most significant module was selected from PPI, and 15 important hub genes were identified. The sub-networks of hub genes were involved in cell division, p53 signaling and HTLV-I infection signaling pathways.

Materials and methods

Microarray data information and data preprocessing

The microarray data from GSE25097, GSE45267, GSE57957, and GSE62232 were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo/). The microarray data of GSE25097 was based on GPL10687 platforms (Rosetta/Merck Human RSTA Affymetrix 1.0 microarray, Custom CDF; Affymetrix Inc., Santa Clara, CA, USA). The microarray data of GSE45267 was based on GPL570 platforms (HG-U133_Plus_2, Affymetrix Human Genome U133 Plus 2.0 Array). The microarray data of GSE57957 was based on GPL10558 platforms (Illumina HumanHT-12 v4.0 expression beadchip). The microarray data of GSE62232 was based on GPL570 platforms (HG-U133_Plus_2, Affymetrix Human Genome U133 Plus 2.0 Array). We chose these four datasets for integrated analysis, because they represented different racial populations. The microarray data was preprocessed using the RMA with the Affy and limma package in Bioconductor (v1.46.1; http://www.bioconductor.org/). Background correction, normalization, and calculation of expression were all included in the preprocessing stage. The microarray data probe was transformed to gene symbols with Bioconductor Annotation Data software packages. If several probes were mapped to one gene symbol, the mean value was set as the final expression value of this gene. The DEGs in every group were analyzed using the limma package. In Bioconductor. The DEG P-values were calculated using the unpaired or paired Student's t-test provided by the limma package. P<0.01 and [logFC]>1 were used as cut-off criterion for DEGs. Hierarchical clustering analysis of the DEGs was then performed and visualized using g-plots in the R package. Venny is an interactive tool used to compare lists with Venn diagrams. The intersections of upregulated and downregulated genes in different sample groups were respectively analyzed using Venny v2.0.

GO and pathway enrichment analysis

Candidate DEG functions and pathway enrichment were analyzed using multiple online databases; among them, DAVID is a website with gene annotation, visualization, and integrated discovery function, and can thus provide gene biological meaning. GO analysis and pathway analysis were carried out using the KEGG PATHWAY (available online: http://www.genome.jp/kegg), Reactome (available online: http://www.reactome.org), and GO website, with P<0.01 as the cut-off criterion (15,16).

Integration of protein-protein interaction (PPI) network, modular analysis and significant candidate genes and pathway identification

First, online database STRING (available online: http://string-db.org) was employed to develop DEGs-encoded proteins and protein-protein interaction network (PPI). Second, the Cytoscape software (17) was utilized to construct a protein interaction relationship network and analyze the interaction relationship of the candidate DEGs encoding proteins in colon cancer. Third, the Network Analyzer plug-in was used to calculate the node degree, i.e., the numbers of inter-connections to filter the hub genes of PPI. The corresponding proteins in the central nodes might be the core proteins and key candidate genes that have important physiological regulatory functions. The cBio Cancer Genomics Portal (http://cbioportal.org) is an open platform for exploring multidimensional cancer genomics data by encapsulating molecular profiling data obtained from cancer tissue and cell lines into readily understandable genetic, epigenetic, gene expression, and proteomic events. Complex cancer genomic profiles can be easily accessed using the query interface of the portal enabling researchers to explore and compare genetic alterations across samples. The underlying data thus obtained can be linked to clinical outcomes to facilitate novel discovery in biological systems. Through use of the portal search function, identified DEG candidate genes are classified as altered or not altered. The genomics datasets are then presented using OncoPrint as heatmaps-a visually appealing display of alterations in gene arrays across tumor samples.

Results

Identification of DEGs in HCC

NCBI-GEO is a public functional genomics data repository accepting data from diverse microarray platforms, from which gene expression profiles of HCC and normal or adjacent liver tissue from the GSE25097, GSE45267, GSE57957, and GSE62232 databases were obtained. The microarray data of GSE25097 had 268 HCC tissues and 6 normal liver tissues (18); the GSE62232 data had 81 HCC tissues and 10 normal liver tissues (12); the GSE45267 data included 48 HCC tissues and 39 healthy liver tissues; and the GSE57957 data had 39 pairs of HCC tissues and matched paraneoplastic tissue (19). Using P<0.01 and [logFC]>1 as cut-off criterion, we extracted 1072, 1850, 4055, and 1876 DEGs from the expression profile datasets GSE25097, GSE45267, GSE57957, and GSE62232, respectively. In total, 185 overlapping DEGs were identified in HCC (Fig. 1). However, when we determined the intersections of upregulated and downregulated genes in four sample groups using Venny, one gene was excluded because of it's contradictory expression in different groups. So, we ultimately identified 92 upregulated genes and 92 downregulated genes (Table I).
Figure 1.

Identification of 185 commonly changed DEGs from the four cohort profile data sets (GSE25097, GSE45267, GSE57957 and GSE62232). Different color areas represent different datasets. The cross areas indicate the commonly changed DEGs. DEGs were identified with classical t-test; statistically significant DEGs were defined with P<0.01 and [logFC] >1 as the cut-off criteria. DEGs, differentially expressed genes.

Table I.

Up and downregulated DEGs.

DEGsGenes name
UpregulatedTPX2 GPSM2 IGSF3 CCDC34 ZIC2 ANLN SMYD3 RACGAP1 NUP37 SULT1C2 AURKA ATP6V1C1 TTC13 KIF4A THY1 SQLE MCM5 TMEM106C MELK ZWINT OIP5 PTTG1 TARBP1 STMN1 UBE2T CKS2 TP53I3 SLC38A6 TBCE CCNB2 MCM7 PRC1 CEP55 CCNE2 MTHFD1L MCM2 MCM4 CNIH4 DLGAP5 RAD51AP1 RAP2A COL4A1 GPC3 CDKN2C PLVAP PIGC C8orf33 DTNA NEK2 FANCD2 RFX5 COL15A1 GSTA4 ERMP1 MCM3 PCNA CENPA CDC20 RFC4 BUB1 PBK RFXANK TRIP13 PDCD2 P4HA2 SMC2 TMEM9 ASPM CDCA3 ATAD2 PEA15 UBE2C STXBP6 MICB MDK TOP2A TUBG1 RNASEH2A CAP2 FAM83D HMMR MCM6 GMNN KIF20A SAE1 CDKN2A TTK CDKN3 E2F3 NCAPG FDPS NUSAP1
DownregulatedCYP26A1 IGF1 C1RL SORL1 CYP2C19 GHR CLEC1B STARD5 SHBG ATOH8 CPEB3 CRHBP QKI DBH SRD5A2 ADRA1A PLSCR4 RCAN1 CLEC4M ESR1 GCH1 FOXO1 APOF PDE7B GREM2 TTC36 CXCL2 LYVE1 ASPG NAAA NAT2 NCOR1 GCKR OIT3 CETP SRPX MT1F KCNN2 GSTZ1 OLFML3 CNDP1 CCBE1 MASP1 FOSB MARCO LCAT RSPO3 HAMP STAB2 HGFAC SLC4A4 PZP C1QTNF1 CXCL14 MBNL2 EFHD1 IGFALS MT1M ANTXR2 ECM1 FCN2 CHST4 MAN1C1 ST3GAL6 KBTBD11 PCK1 RND3 IL1RAP TMEM27 MT1X AADAT ACADS RNF165 CYP1A2 LPA LIFR EXPH5 COLEC10 PAMR1 CXCL12 LARP1B ANGPTL6 MSRA SOCS2 SARDH LY6E FCN3 CLEC4G VIPR1 LHX2 DHODH PTH1R

184 DEGs were identified from four profile datasets, including 92 upregulated genes and 92 downregulated genes in the hepatocellular carcinoma tissues, compared to normal liver tissues. The upregulated genes were listed from the largest to the smallest of fold changes, and downregulated genes were listed from the smallest to largest of fold changes. DEGs, differentially expressed genes.

DEGs GO analysis in HCC

DEGs GO analysis was conducted with the online softwares DAVID and Gene Ontology. The DEGs were classified into three functional groups: Biological process group, molecular function group, and cellular component group (Fig. 2A). As shown in Fig. 2B and Table II, cell division, mitotic nuclear division, and G1/S transition of mitotic cell cycle are the most abundant groups in the ‘biological process’ category, with each accounting for about 20% of upregulated genes. In contrast, downregulated genes were mainly enriched in signal transduction and proteolysis. For the molecular function category, protein binding (71.7%) was the predominant group among upregulated genes, followed by ATP binding (25%) and DNA binding (18.5%), while the downregulated genes were mainly signed to the calcium ion binding and serine-type endopeptidase activity groups. For the cellular component group, upregulated genes were mainly enriched in the nucleoplasm, nucleus, and cytoplasm, and the downregulated genes were mainly enriched in the extracellular region, extracellular space, and extracellular exosome.
Figure 2.

GO analysis and significantly enriched GO in terms of DEGs in HCC. (A) GO analysis classified the DEGs into three groups (molecular function, biological process and cellular component). (B) Significant Enriched GO Terms of DEGs in HCC based on their functions. GO, gene ontology; DEG, differentially expressed gene; HCC, hepatocellular carcinoma.

Table II.

The significant enriched analysis of DEGs in hepatocellular carcinoma.

A, Upregulated

TermDescriptionCountP-value
GO:0051301Cell division182.36×10−12
GO:0007067Mitotic nuclear division153.38×10–11
GO:0000281Mitotic cytokinesis81.12×10−10
GO:0000082G1/S transition of mitotic cell cycle111.18×10–10
GO:0042555MCM complex62.90×10−10
GO:0005654Nucleoplasm375.64×10–09
GO:0006270DNA replication initiation71.32×10−08
GO:0003678DNA helicase activity61.27×10–07
GO:0006268DNA unwinding involved in DNA replication51.38×10−07
GO:0030496Midbody92.03×10–07
GO:0006260DNA replication91.39×10−06
GO:0005524ATP binding234.82×10–06
GO:0005829Cytosol355.16×10−06
GO:0005515Protein binding661.16×10–05
GO:0005874Microtubule101.92×10−05

B, Downregulated

TermDescriptionCountP-value

GO:0005576Extracellular region288.07×10–09
GO:0005615Extracellular space198.81×10−05
GO:0005581Collagen trimer69.21×10–05
GO:0004252Serine-type endopeptidase activity83.22×10−04
GO:0001867Complement activation, lectin pathway35.48×10–04
GO:0071276Cellular response to cadmium ion30.003
GO:0071294Cellular response to zinc ion30.004
GO:0045926Negative regulation of growth30.004
GO:0006508Proteolysis90.004
GO:0034364High-density lipoprotein particle30.005

DEGs, differentially expressed genes; GO, gene ontology.

Signaling pathway enrichment analysis

To investigate functional and signaling pathway enrichment of the gene signatures, we performed a pathway analysis using online websites of DAVID and KEGG (http://www.genome.jp/kegg/) pathways and GO. The upregulated DEGs were mainly enriched in DNA replication, cell cycle, HTLV-I infection, oocyte meiosis, and p53 signaling pathway, while the downregulated DEGs were mainly enriched in metabolic pathways, caffeine metabolism, mineral absorption, thyroid hormone signaling pathway, and AMPK signaling pathway (Table III, Fig. 3).
Table III.

Signaling pathway enrichment analysis of DEGs function in hepatocellular carcinoma.

A, Upregulated DEG

PathwayNameGene countP-valueGenes
KEGG Pathway: hsa04110Cell cycle167.01×10−17E2F3, TTK, CDC20, PTTG1, MCM2, MCM3, MCM4, MCM5, MCM6, CCNE2, CDKN2A, CCNB2,MCM7, CDKN2C, BUB1, PCNA
KEGG Pathway: hsa03030DNA replication91.03×10−11RFC4, MCM7, PCNA, MCM2, MCM3, RNASEH2A, MCM4, MCM5, MCM6
Reactome: R-HSA-69052Switching of origins to a post-replicative state63.34×10–11MCM7, MCM2, MCM3, MCM4, MCM5, MCM6
Reactome: R-HSA-69300Removal of licensing factors from origins73.49×10−10MCM7, GMNN, MCM2, MCM3, MCM4, MCM5, MCM6
Reactome: R-HSA-176974Unwinding of DNA64.30×10–09MCM7, MCM2, MCM3, MCM4, MCM5, MCM6
Reactome: R-HSA-68867Assembly of the pre-replicative complex61.60×10−08MCM7, MCM2, MCM3, MCM4, MCM5,MCM6
Reactome: R-HSA-176187Activation of ATR in response to replication stress76.30×10–08RFC4, MCM7, MCM2, MCM3, MCM4, MCM5, MCM6
Reactome: R-HSA-68962Activation of the pre-replicative complex61.13×10−06MCM7, MCM2, MCM3, MCM4, MCM5, MCM6
Reacto me: R-HSA-68949Orc1 removal from chromatin65.12×10–05MCM7, MCM2, MCM3, MCM4, MCM5, MCM6
Reactome: R-HSA-2500257Resolution of Sister Chromatid Cohesion66.32×10−04CCNB2, CENPA, ZWINT, BUB1, NUP37, CDC20
Reactome: R-HSA-2467813Separation of Sister Chromatids76.54×10–04CENPA, ZWINT, BUB1, NUP37, CDC20, PTTG1, UBE2C
Reactome: R-HSA-2514853Condensation of Pro-metaphase Chromosomes30.001CCNB2, NCAPG, SMC2
Reactome: R-HSA-4615885SUMOylation of DNA replication proteins40.002PCNA, NUP37, AURKA, TOP2A
KEGG Pathway: hsa05166HTLV-I infection70.003E2F3, CDKN2A, CDKN2C, PCNA, FDPS CDC20, PTTG1
KEGG Pathway: hsa04114Oocyte meiosis50.003CCNE2, BUB1, AURKA, CDC20, PTTG1

B, Downregulated DEGs

PathwayNameGene countP-valueGenes

Reactome: R-HSA-2855086Ficolins bind to repetitive carbohydrate structures on the target cell surface33.56×10−04MASP1, FCN3, FCN2
Reactome: R-HSA-166662Lectin pathway of complement activation35.32×10−04MASP1, FCN3, FCN2
Reactome: R-HSA-5661231Metallothioneins bind metals30.001MT1M, MT1X, MT1F
Reactome: R-HSA-1251985Nuclear signaling by ERBB430.009ESR1, CXCL12, NCOR1
Reactome: R-HSA-166663Initial triggering of complement40.01MASP1, FCN3, FCN2, COLEC10
KEGGPathway: hsa01100Metabolic pathways150.01AADAT, CNDP1, CYP2C19, ACADS, NAT2, CYP26A1, CYP1A2, DBH, MAN1C1, GCH1, PCK1, ST3GAL6, DHODH, GSTZ1, SARDH

DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Figure 3.

Significantly enriched pathway in terms of DEGs in colorectal cancer. DEGs functional and signaling pathway enrichment were conducted using online websites of KEGG PATHWAY, Reactomen, and Gene Ontology analysis. DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Key candidate genes and pathways identification with DEGs protein-protein interaction network (PPI) and modular analysis

Based on the analysis in the STRING database (Available online: http://string-db.org) (20) and Cytoscape software (17), relevant protein-protein interactions (PPI) were obtained and visualized, containing 184 nodes and 1166 edges (Fig. 4A). After a topological analysis, 15 genes with a node degree ≥36 (i.e., each node had more than 10 connections/interactions) were selected as representative hub genes. The most significant 15 node degree genes were TOP2A, PCNA, CCNB2, AURKA, CDKN3, BUB1, RFC4, CEP55, DLGAP5, MCM2, PRC1, RACGAP1, TPX2, CDC20, and MCM4. Based on the degree of importance, one significant module was chosen for further analysis through Cytotype MCODE from the PPI network complex. Pathway enrichment analysis showed that Module 1 consisted of 33 nodes and 507 edges (Fig. 4B, Table IV), which are mainly associated with DNA replication, cell cycle, switching of origins to a post-replicative state, and activation of ATR in response to replication stress by pathway enrichment analysis.
Figure 4.

DEGs PPI network complex and modular analysis. (A) Using the STRING online database, total of 184 DEGs (92 upregulated in blue standing for upregulation and 92 downregulated genes in red standing for downregulation) were filtered into the DEGs PPI network complex. (B) The most significant module consists of 33 nodes and 507 edges, which are mainly associated with DNA replication, cell cycle, switching of origins to a post-replicative state, and activation of ATR in response to replication stress. DEGs, differentially expressed genes; PPI, protein-protein interaction.

Table IV.

Pathway enrichment analysis of Module 1 genes function.

TermDescriptionCountP-value
KEGG:hsa04110Cell cycle116.64×10–15
KEGG:hsa03030DNA replication  83.50×10−13
R-HSA-69052Switching of origins to a post-replicative state  63.93×10–13
R-HSA-176974Unwinding of DNA  65.14×10−11
R-HSA-68867Assembly of the pre-replicative complex  61.94×10–10
R-HSA-176187Activation of ATR in response to replication stress  72.88×10−10
R-HSA-69300Removal of licensing factors from origins  63.98×10–10
R-HSA-68962Activation of the pre-replicative complex  61.26×10−08
R-HSA-68949Orc1 removal from chromatin  67.66×10–07
R-HSA-2467813Separation of Sister Chromatids  68.33×10−05
R-HSA-2500257Resolution of Sister Chromatid Cohesion  52.19×10–04
R-HSA-174178APC/C:Cdh1 mediated degradation of Cdc20 and other APC/C:Cdh1 targeted proteins in late mitosis/early G1  47.57×10−04
KEGG:hsa04114Oocyte meiosis  40.001
R-HSA-179409APC-Cdc20 mediated degradation of Nek2A  30.00179182

KEGG, Kyoto Encyclopedia of Genes and Genomes.

Validation of the DEGs in cBio portal dataset

To further confirm the validity of the identified DEGs, cBio portal, an online integrated data mining system, was used to analyze the genetic alteration of genes. Among the four HCC studies analyzed (12,21,22), alterations ranging from 15.2 to 68.9% were found including the gene sets submitted for analysis (Fig. 5A). The most pronounced genomic changes observed across samples from the TCGA study was presented using OncoPrint. Two genes, UBE2T and CCNE2, with the largest mutation frequency found based on the TCGA database in HCC were also identified as the hub genes in this study. The results showed low mutation frequency of the three hub genes-TOP2A, PCNA, and AURKA in HCC samples-investigated in the present study (Fig. 5B), but in the TCGA breast cancer, the three genes showed high alteration frequency (23,24) (Fig. 5C). Thus, the roles of TOP2A, PCNA, and AURKA as biomarkers in HCC progression and histological grading should be investigated more systematically.
Figure 5.

Validation of the DEGs in cBio portal Dataset. (A) Overview of changes in the identified DEGs in genomics data sets available in 4 different HCC studies. (B) OncoPrint: A visual summary of alteration across TCGA hepatocellular carcinoma based on a query of DEGs. (C) OncoPrint, A visual summary of alteration across TCGA breast cancer based on a query of the three genes-TOP2A, PCNA, and AURKA. Distinct genomic alterations including mutations and CNAs (exemplified by gene amplifications and homozygous deletions) are summarized and color coded presented by % changes in particular affected genes in individual tumor samples. Each row represents a gene, and each column represents a tumor sample. Red bars designate gene amplifications, blue bars represent homozygous deletions, and green squares indicate nonsynonymous mutations. DEGs, differentially expressed genes; HCC, hepatocellular carcinoma; CNS, copy number alterations; TCGA, The Cancer Genome Atlas.

Discussion

HCC is one of the most common malignant neoplasms worldwide. Advanced-stage HCC is notoriously difficult to treat. Although various causes and underlying mechanisms of HCC formation and progression have been revealed by several basic and clinical studies in the past several decades, the worldwide incidence and mortality of HCC is still very high. Different from a single genetic or cohort study, this study analyzed four original microarray datasets, including 436 HCC and 94 normal liver tissues. In total, 185 DEGs were identified in HCC, including 92 upregulated genes and 92 downregulated genes. GO was performed, which showed that the upregulated DEGs were mainly enriched in cell division, mitotic nuclear division, mitotic cytokinesis, and G1/S transition of the mitotic cell cycle. Pathway enrichment was analyzed based on the KEGG database to assess the functional relevance of DEGs. On the basis of the analysis in the STRING database and Cytoscape software, relevant PPIs were obtained and visualized that contained 184 nodes and 1166 edges. The most significant 15 hub genes were selected, including TOP2A, PCNA, CCNB2, AURKA, CDKN3, BUB1, RFC4, CEP55, DLGAP5, MCM2, PRC1, RACGAP1, TPX2, CDC20, and MCM4. Cell cycle-related gene TOP2A (DNA topoisomerase II alpha) encodes a DNA topoisomerase, an enzyme that controls and alters the topologic states of DNA during transcription. In many cancers including breast, ovarian, colon, and small-cell lung cancers, TOP2A has also been shown as a valuable prognostic marker for tumor grading, recurrences, and a predictor of poorer patient survival (25–27). In this study, TOP2A is the top degree hub gene and overexpressed in tumor tissue compared to adjacent non-tumoral or normal tissue. Consistent with our studies, Wong et al (28), used microarrays to examine 22 HCC cell lines; their array-based transcriptional mapping results showed that TOP2A was upregulated in 14/22 cell lines through DNA copy gains. Furthermore, TMA analysis of 172 liver tumors showed that TOP2A expressions significantly correlated with advanced histological grading, microvascular invasion, aggressive biological behavior, and chemotherapy resistance. High-grade TOP2A expressions showed significantly lower overall survival. Thus, TOP2A as a biomarker in histological grading and a target in chemotherapy resistance should be investigated more systematically. The second hub gene PCNA, proliferating cell nuclear antigen, encodes the protein which is found in the nucleus and is a cofactor of DNA polymerase delta and involved in the RAD6-dependent DNA repair pathway in response to DNA damage. Owing to its function in cell proliferation, PCNA plays an important role in cancer progression (29). Combined hepatocellular-cholangiocarcinoma (CHC) is a malignant subtype of primary liver tumor containing elements of both HCC and intrahepatic cholangiocarcinoma (ICC). In patients who underwent hepatectomy with curative intent, nontumor ductular reactions (DRs) produced by hepatic progenitor cells (HPCs) in nontumor liver tissue were an independent prognostic indicator for disease-free survival (DFS) and overall survival (OS). PCNA could label the index of the ductular reaction (PI-DR). A higher level of PI-DR contributes to intratumoral HPC activation (30), fibrosis, hepatocyte replication, and hepatic inflammation which is predictive of a high recurrence rate. Moreover, increased PI-DR expression was also associated with multicentric occurrence (MO) and microvascular invasion (MVI) of HCC (31). However, several studies found conflicting results, which showed that PCNA had little prognostic value in tumor tissue (32,33). Because one part of PCNA trimers was not engaged in DNA synthesis (34). Similar to the PCNA function in maintaining the stability of chromosomal replication and segregation in cellular mitosis, another gene-aurora kinase A (AURKA)-was found upregulated in our experiment. AURKA is a cell cycle regulated kinase that appears to be involved in microtubule formation. A clinical trial by Jeng et al (35), showed that AURKA was overexpressed in 137 (61%) of 223 patients with HCC. AURKA overexpression coincided with portal vein tumor invasion, regardless of tumor size. Furthermore, AURKA was found involved in p53 signaling pathway and interacted directly with p53 mutation (36). Aurora-A and p53 mutation had a synergistic effect promoting tumor progression and poor prognosis (35). In p53-altered (deleted or mutated) liver cancer, the tumor suppressor protein p19ARF was activated and mediated G2/M cell cycle arrest. MYC, a proto-oncogene, was overexpressed and directly bound to AURKA, which stabilized MYC to overcome G2 to M cell cycle arrest and promote tumor cell survival. A previous study found conformation-changing AURKA inhibitors could prevent formation of MYC-AURKA complexes and hence, degradation of MYC (37). Therefore, therapeutic strategy targeting MYC-AURKA complexes could be considered for this subtype of HCCs. Besides the cycle-related pathways and p53 signaling, T lymphotropic virus type I (HTLV-I) infection-related biological processes and pathways were also identified in our study. HTLV-1 infection, especially combined with HCV infection, increased HCC mortality (38). Tax, an HTLV-I oncoprotein, contributes to chromosome aneuploidy, cytokinesis failure, and multinucleated cells primarily by directly binding and activating the CDC20-associated anaphase promoting complex (APCCDC20) during S phase to delay mitotic progression and faulty mitosis (39). Other DEGs in our study associated with HTLV-I infection-related pathway were FDPS, CDC20, E2F, CDKN2A, CDK2N2C, PTTG1, and PCNA. Consistent with our studies, Jin et al (40), analyzed gene expression profiles GSE6222, GSE41804, and GSE51401 that contained 117 samples, including 54 cases and 63 controls from which 1347 DEGs were identified, including 2920 upregulated genes and 2231 downregulated genes. The top 10 hug genes were SPINK1, TOP2A, ASPM, GPC3, ANLN, SULT1C2, CCNB1, PEG10, CDKN3, and ECT2. The main pathway of the identified DEGs were those involved in cell cycle and oocyte meiosis, which were also identified in our study. However, Jin's study was based on a dataset only generated from Chinese and Japanese patients. Different from Yin's report, we analyzed four datasets generated from American, Chinese, Singaporean, and French patients, thereby representing different populations. Apart from cell cycle-related pathways, HTLV-I infection and p53 signaling pathway were identified, because the incidence of NAFLD and steatohepatitis was high in European and American patients (41,42). HCC is a group of complex and heterogeneous tumors. The mechanism mainly involves chromosomal and microsatellite instability (43), and the later involves the inactivation or mutation of DNA mismatch repair genes. Our study found chromosomal instability-related genes including BUB, PCNA, CDC20, and AURKA. The characteristics of HCC cannot be explained only by analysis of gene expression profiles, although gene expression profiles could reveal some of the underlying mechanism in cancer progression. Various factors should be explored, including gene mutation, methylation, miRNA, and lncRNA, which could likely participate in HCC carcinogenesis and chemotherapy resistance. For instance, TSLNC8, a long intergenic noncoding RNA on chromosome 8p12, was characterized as a novel tumor suppressor by modulating the IL-6/STAT3 signaling pathway and being inversely correlated with HCC embolus, nodules, and differentiation stage (44). A clinical trial based on 2079 cirrhotic patients with long term follow-up found that the incidence of HCC induced by the etiology of cirrhosis was different. Chronic viral hepatitis patients had a higher cumulative risk of HCC than those with primary biliary cirrhosis and NAFLD, while those with autoimmune liver diseases (AIH) had the lowest risk (45). Taken together, in our study we have identified 185 DEG candidate genes using integrated bioinformatical analysis, and found 15 mostly changed hub genes, which were significantly enriched in cell cycle process, DNA replication, p53 signaling, and HTLV-I infection-related biological processes and pathways. These findings could promote our understanding of the cause and molecular mechanisms underlying the development of HCC, and these candidate and signal pathways could be the targets of clinical therapy for HCC.
  45 in total

1.  Clinicopathological and prognostic significance of serum and tissue Dickkopf-1 levels in human hepatocellular carcinoma.

Authors:  Edmund Kwok-Kwan Tung; Carmen Ka-Man Mak; Sarwat Fatima; Regina Cheuk-Lam Lo; Heng Zhao; Chunsheng Zhang; Hongyue Dai; Ronnie Tung-Ping Poon; Man-Fung Yuen; Ching-Lung Lai; Jin-jun Li; John Moon-Ching Luk; Irene Oi-Lin Ng
Journal:  Liver Int       Date:  2011-08-04       Impact factor: 5.828

2.  Topoisomerase II alpha: prognostic predictor and cell cycle marker in surface epithelial neoplasms of the ovary and peritoneum.

Authors:  M J Costa; C L Hansen; J A Holden; D Guinee
Journal:  Int J Gynecol Pathol       Date:  2000-07       Impact factor: 2.762

3.  Background progenitor activation is associated with recurrence after hepatectomy of combined hepatocellular-cholangiocarcinoma.

Authors:  Xiong Cai; Jian Zhai; David E Kaplan; Yijun Zhang; Lining Zhou; Xutao Chen; Guangyang Qian; Qiudong Zhao; Yonghai Li; Lu Gao; Wenming Cong; Minghua Zhu; Zhenlin Yan; Lehua Shi; Dong Wu; Lixin Wei; Feng Shen; Mengchao Wu
Journal:  Hepatology       Date:  2012-06-25       Impact factor: 17.425

4.  Genome-wide survey of recurrent HBV integration in hepatocellular carcinoma.

Authors:  Wing-Kin Sung; Hancheng Zheng; Shuyu Li; Ronghua Chen; Xiao Liu; Yingrui Li; Nikki P Lee; Wah H Lee; Pramila N Ariyaratne; Chandana Tennakoon; Fabianus H Mulawadi; Kwong F Wong; Angela M Liu; Ronnie T Poon; Sheung Tat Fan; Kwong L Chan; Zhuolin Gong; Yujie Hu; Zhao Lin; Guan Wang; Qinghui Zhang; Thomas D Barber; Wen-Chi Chou; Amit Aggarwal; Ke Hao; Wei Zhou; Chunsheng Zhang; James Hardwick; Carolyn Buser; Jiangchun Xu; Zhengyan Kan; Hongyue Dai; Mao Mao; Christoph Reinhard; Jun Wang; John M Luk
Journal:  Nat Genet       Date:  2012-05-27       Impact factor: 38.330

5.  A unique metastasis gene signature enables prediction of tumor relapse in early-stage hepatocellular carcinoma patients.

Authors:  Stephanie Roessler; Hu-Liang Jia; Anuradha Budhu; Marshonna Forgues; Qing-Hai Ye; Ju-Seog Lee; Snorri S Thorgeirsson; Zhongtang Sun; Zhao-You Tang; Lun-Xiu Qin; Xin Wei Wang
Journal:  Cancer Res       Date:  2010-12-15       Impact factor: 12.701

Review 6.  Hepatocellular carcinoma in non-alcoholic fatty liver disease: an emerging menace.

Authors:  György Baffy; Elizabeth M Brunt; Stephen H Caldwell
Journal:  J Hepatol       Date:  2012-02-09       Impact factor: 25.083

Review 7.  Association between nonalcoholic fatty liver disease and risk for hepatocellular cancer, based on systematic review.

Authors:  Donna L White; Fasiha Kanwal; Hashem B El-Serag
Journal:  Clin Gastroenterol Hepatol       Date:  2012-10-04       Impact factor: 11.382

8.  c-erbB2 and topoisomerase IIalpha protein expression independently predict poor survival in primary human breast cancer: a retrospective study.

Authors:  Peter Fritz; Cristina M Cabrera; Jürgen Dippon; Andreas Gerteis; Wolfgang Simon; Walter E Aulitzky; Heiko van der Kuip
Journal:  Breast Cancer Res       Date:  2005-03-21       Impact factor: 6.466

9.  Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets.

Authors:  Kornelius Schulze; Sandrine Imbeaud; Eric Letouzé; Ludmil B Alexandrov; Julien Calderaro; Sandra Rebouissou; Gabrielle Couchy; Clément Meiller; Jayendra Shinde; Frederic Soysouvanh; Anna-Line Calatayud; Roser Pinyol; Laura Pelletier; Charles Balabaud; Alexis Laurent; Jean-Frederic Blanc; Vincenzo Mazzaferro; Fabien Calvo; Augusto Villanueva; Jean-Charles Nault; Paulette Bioulac-Sage; Michael R Stratton; Josep M Llovet; Jessica Zucman-Rossi
Journal:  Nat Genet       Date:  2015-03-30       Impact factor: 38.330

10.  Integration of gene ontology pathways with North American Rheumatoid Arthritis Consortium genome-wide association data via linear modeling.

Authors:  Jérémie Jp Lebrec; Tom Wj Huizinga; René Em Toes; Jeanine J Houwing-Duistermaat; Hans C van Houwelingen
Journal:  BMC Proc       Date:  2009-12-15
View more
  11 in total

1.  HFE promotes mitotic cell division through recruitment of cytokinetic abscission machinery in hepatocellular carcinoma.

Authors:  Pingping Dong; Ziqing Cai; Bingfeng Li; Yueqin Zhu; Alice K Y Chan; Michael W L Chiang; Chun Hang Au; Wing Kin Sung; Tan To Cheung; Chung Mau Lo; Kwan Man; Nikki P Lee
Journal:  Oncogene       Date:  2022-07-26       Impact factor: 8.756

2.  Identification of Key Genes for Hepatitis Delta Virus-Related Hepatocellular Carcinoma by Bioinformatics Analysis.

Authors:  Cheng Zhang; Shan Wu; Xiao-Dong Yang; Hui Xu; Tai Ma; Qi-Xing Zhu
Journal:  Turk J Gastroenterol       Date:  2021-02       Impact factor: 1.852

3.  Bioinformatics Analysis based on Multiple Databases Identifies Hub Genes Associated with Hepatocellular Carcinoma.

Authors:  Lu Zeng; Xiude Fan; Xiaoyun Wang; Huan Deng; Kun Zhang; Xiaoge Zhang; Shan He; Na Li; Qunying Han; Zhengwen Liu
Journal:  Curr Genomics       Date:  2019-08       Impact factor: 2.236

4.  Cytochrome P450 family members are associated with fast-growing hepatocellular carcinoma and patient survival: An integrated analysis of gene expression profiles.

Authors:  Zhao-Zhen Liu; Li-Na Yan; Chun-Nan Dong; Ning Ma; Mei-Na Yuan; Jin Zhou; Ping Gao
Journal:  Saudi J Gastroenterol       Date:  2019 May-Jun       Impact factor: 2.485

5.  Identification of crucial genes based on expression profiles of hepatocellular carcinomas by bioinformatics analysis.

Authors:  Ze-Kun Liu; Ren-Yu Zhang; Yu-Le Yong; Zhi-Yun Zhang; Can Li; Zhi-Nan Chen; Huijie Bian
Journal:  PeerJ       Date:  2019-08-08       Impact factor: 2.984

6.  Comprehensive investigation of key biomarkers and pathways in hepatitis B virus-related hepatocellular carcinoma.

Authors:  Xiwen Liao; Tingdong Yu; Chengkun Yang; Ketuan Huang; Xiangkun Wang; Chuangye Han; Rui Huang; Xiaoguang Liu; Long Yu; Guangzhi Zhu; Hao Su; Wei Qin; Jianlong Deng; Xianmin Zeng; Bowen Han; Quanfa Han; Zhengqian Liu; Xin Zhou; Junqi Liu; Yizhen Gong; Zhengtao Liu; Jianlv Huang; Lei Lu; Xinping Ye; Tao Peng
Journal:  J Cancer       Date:  2019-09-07       Impact factor: 4.207

7.  Identification and Validation of Key Genes in Hepatocellular Carcinoma by Bioinformatics Analysis.

Authors:  Jia Wang; Rui Peng; Zheng Zhang; Yixi Zhang; Yuke Dai; Yan Sun
Journal:  Biomed Res Int       Date:  2021-02-23       Impact factor: 3.411

Review 8.  System-Wide Pollution of Biomedical Data: Consequence of the Search for Hub Genes of Hepatocellular Carcinoma Without Spatiotemporal Consideration.

Authors:  Ankush Sharma; Giovanni Colonna
Journal:  Mol Diagn Ther       Date:  2021-01-21       Impact factor: 4.074

9.  Identification and Validation of Hub Genes with Poor Prognosis in Hepatocellular Carcinoma by Integrated Bioinformatical Analysis.

Authors:  Jiang Guo; Wei Li; Long Cheng; Xuesong Gao
Journal:  Int J Gen Med       Date:  2022-04-11

10.  Identification of Key Genes and Prognostic Value Analysis in Hepatocellular Carcinoma by Integrated Bioinformatics Analysis.

Authors:  Meng Wang; Licheng Wang; Shusheng Wu; Dongsheng Zhou; Xianming Wang
Journal:  Int J Genomics       Date:  2019-11-22       Impact factor: 2.326

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.