| Literature DB >> 36197270 |
Xu Huang1, Xu Wang2, Ge Huang3, Ruotao Li4, Xingkai Liu1, Lidong Cao5, Junfeng Ye1, Ping Zhang1.
Abstract
Hepatocellular carcinoma (HCC) is still a significant global health problem. The development of bioinformatics may provide the opportunities to identify novel therapeutic targets. This study bioinformatically identified the differentially expressed genes (DEGs) in HCC and associated them with HCC prognosis using data from published databases. The DEGs downloaded from the Gene Expression Omnibus (GEO) website were visualized using the Venn diagram software, and then subjected to the GO and KEGG analyses, while the protein-protein interaction network was analyzed using Cytoscape software with the Search Tool for the search tool for the retrieval of interacting genes and the molecular complex detection plug-in. Kaplan-Meier curves and the log rank test were used to associate the core PPI network genes with the prognosis. There were 57 upregulated and 143 downregulated genes in HCC samples. The GO and pathway analyses revealed that these DEGs are involved in the biological processes (BPs), molecular functions (MFs), and cell components (CCs). The PPI network covered 50 upregulated and 108 downregulated genes, and the core modules of this PPI network contained 34 upregulated genes. A total of 28 of these upregulated genes were associated with a poor HCC prognosis, 27 of which were highly expressed in HCC tissues. This study identified 28 DEGs to be associated with a poor HCC prognosis. Future studies will investigate their possible applications as prognostic biomarkers and potential therapeutic targets for HCC.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36197270 PMCID: PMC9509045 DOI: 10.1097/MD.0000000000030678
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Figure 1.The Venn diagram of 200 DEGs. Illustration of the 200 DEGs from the 3 datasets (GSE101685, GSE62232, and GSE112790), which downloaded from PubMed database analyzed by using Venn diagram software (http://bioinformatics.psb.ugent.be/webtools/Venn/). The various colors represent different datasets. (A) The 57 upregulated DEGs (log FC > 0). (B) The 143 downregulated DEGs (log FC < 0). DEGs = the differentially expressed genes, FC = fold change.
Identification of DEGs in HCC tissue samples versus normal livers.
| Expression | Genes |
|---|---|
| Upregulated DEGs | CDK1, TYMS, FAM72A///FAM72D///FAM72B///FAM72C, SPINK1, UBD///GABBR1, RKAA2, CAP2, DTL, RACGAP1, FAM83D, CTHRC1, UHRF1, RRM2, ZWINT, CCNB1, NDC80, TOP2A, KIAA0101, ASPM, HELLS, FLVCR1, HMMR, CCNA2, CD24, TTK, CDKN3, AKR1B10, PBK, NCAPG, GINS1, GPC3, CDKN2C, SULT1C2, CCL20, ROBO1, SPP1, CENPU, PRR11, LOC101930489///MIR4435-2HG///LINC00152, NEK2, ANLN, ACSL4, APOBEC3B, BIRC5, KIF20A, AURKA, UBE2T, DUXAP10, CRNDE, NUSAP1, NQO1, BUB1B, MAD2L1, COL15A1, DLGAP5, ECT2, PRC1 |
| Downregulated DEGs | HBA2///HBA1, MT1G, CYP4A22///CYP4A11, LECT2, TUBE1, CYP26A1, BBOX1, PLG, CYP2A6, SOCS2, LINC01093, CYP2C8, CXCL14, SLC22A1, IGF1, SULT1E1, CYP39A1, SPP2, HAO2, LINC01554, FAM134B, MT1F, SLC25A47, MFSD2A, FLJ22763, HHIP, APOA5, ADH1B, KCND3, KCNN2, SLCO1B3, SLC10A1, SLC1A2, GSTZ1, PRG4, LY6E, ASPA, CYP1A2, INS-IGF2///IGF2, MT1E, CNDP1, MAN1C1, BCO2, FOLH1B///FOLH1, FCN3, ACSM3, GBA3, CYP2C19, PDGFRA, ANXA10, TTC36, LOC100287413///GLYATL1, CLEC4G, CYP2B6, GYS2, KBTBD11, FOLH1B, KMO, LPA, GHR, CLEC1B, MIR675///H19, CXCL2, LIFR, CLRN3, CYP2C9, CFHR3, MARCO, BHMT, CYP2A7, CYP2E1, EXPH5, MT1H, LCAT, CTH, CLEC4M, VNN1, LYVE1, ESR1, HSD11B1, RSPO3, IGFBP3, FOS, LOC101928916///NNMT, PLAC8, ALDOB, HAMP, DNASE1L3, DCN, NAT2, BCHE, CPEB3, IL1RAP, RDH16, AKR1D1, CYP8B1, CXCL12, GNMT, TMEM27, HPGD, CRHBP, DNAJC12, MFAP3L, MME, AVPR1A, WDR72, THRSP, CYP4A11, IDO2, HGFAC, IGFALS, MT1X, MT2A, ADGRG7, S100A8, C7, CYP3A43, PZP, FBP1, AADAT, ADH4, GPM6A, OIT3, HGF, MOGAT2, MT1M, MAGI2-AS3, CYP3A4, GLYAT, CYP2B7P///CYP2B6, CETP, GLS2, SRD5A2, ADRA1A, ECM1, APOF, HBB, MT1HL1, C9, SRPX, FCN2, OAT, LINC00844 |
The upregulated DEGs were identified by the criteria of log2 fold change (FC) ≥ 2.0 and adjusted P value <.05, and the number of the DEGs was 57. The downregulated DEGs were identified by the criteria of log fold change (FC) ≤ 2.0 and adjusted P value <.05, and the number of the DEGs was 143.
DEGs = the differentially expressed genes
GO terms of the DEGs in HCC tissue samples.
| Expression | Category | Term | Count | % |
| FDR |
|---|---|---|---|---|---|---|
| Upregulated DEGs | GOTERM_BP_DIRECT | GO:0051301–cell division | 8 | 9.33 | 6.73E-07 | 9.15E-04 |
| Downregulated DEGs | GOTERM_BP_DIRECT | GO:0019373–epoxygenase P450 pathway | 9 | 5.72 | 2.53E-13 | 3.93E-10 |
| GOTERM_CC_DIRECT | GO:0005789–endoplasmic reticulum membrane | 19 | 12.1 | 2.38E-05 | 0.027 |
The GO term analysis was applied to define genes and their products, mRNA, or proteins in order to evaluate the unique biological signaling of the high-throughput transcriptome or genomic data. The table was utilized DAVID to identify the enriched biological processes (BPs), molecular functions (MFs), and cellular components (CCs) as well as pathways with P < .05 as the cutoff criterion for these DEGs in HCC.
DAVID = the database for annotation, visualization and integrated discovery, FDR = false discovery rate, GO = gene ontology.
KEGG pathways of the DEGs in HCC tissue samples.
| Pathway ID | Name | Count | % |
| Genes |
|---|---|---|---|---|---|
| hsa05204 | Chemical carcinogenesis | 12 | 4.93 | 2.43E-08 | CYP3A4, CYP3A43, CYP2C19, CYP2C9, CYP2C8, ADH4, NAT2, HSD11B1, ADH1B, CYP2A6, CYP2E1, CYP1A2 |
| hsa00830 | Retinol metabolism | 10 | 4.11 | 4.02E-07 | CYP3A4, CYP2B6, CYP2C9, CYP2C8, ADH4, ADH1B, CYP26A1, CYP2A6, CYP1A2, RDH16 |
| hsa00982 | Drug metabolism - cytochrome P450 | 10 | 4.11 | 6.84E-07 | CYP3A4, CYP2C19, CYP2B6, CYP2C9, CYP2C8, ADH4, ADH1B, CYP2A6, CYP2E1, CYP1A2 |
| hsa00980 | Metabolism of xenobiotics by cytochrome P450 | 9 | 3.37 | 1.38E-05 | CYP3A4, CYP2B6, CYP2C9, ADH4, HSD11B1, ADH1B, CYP2A6, CYP2E1, CYP1A2 |
| hsa04110 | Cell cycle | 7 | 2.88 | 0.01 | CCNB1, CDK1, MAD2L1, CDKN2C, TTK, BUB1B, CCNA2 |
The KEGG database was used to integrate the currently known PPI network information for the metabolism, genetic information processing, environmental information processing, cellular processes, organismal systems, human diseases, and drug development using DAVID 6.8 (https://david.ncifcrf.gov), an online bioinformatics tool designed to interpret biological function annotations of genes or proteins.
DAVID = the database for annotation, visualization and integrated discovery, HCC = hepatocellular carcinoma, KEGG = Kyoto Encyclopedia of Genes and Genomes.
Figure 2.The PPI network. (A) The PPI network covers a total of 158 DEGs. Nodes indicate proteins; edges indicate the interaction of proteins. The blue nodes refer to downregulated DEGs; the red nodes refer to upregulated DEGs. (B) Module analysis of the DEGs using the Cytoscape software with a cutoff value of 2, the node score cutoff value = 0.2 (k-core = 2), and maximum depth = 100. DEGs = the differentially expressed genes, PPI = protein–protein interaction.
Figure 3.The Kaplan–Meier survival curves of the 28 DEGs. Association of the 34 core genes with HCC prognosis as analyzed by using the online tool Kaplan–Meier plotter. The data demonstrated that 28 of 34 genes were associated with significantly poor HCC prognosis (P < .05). DEGs = the differentially expressed genes, HCC = hepatocellular carcinoma.
Survival significance of the 34 key candidate genes in HCC.
| Category | Genes |
|---|---|
| Genes with a significantly worse survival ( | ANLN, ASPM, AURKA, BIRC5, BUB1B, CCNB1, CDK1, CDKN3, DLGAP5, DTL, ECT2, FAM83D, GINS1, HELLS, KIAA0101, KIF20A, MAD2L1, NCAPG, PBK, PRC1, RACGAP1, RRM2, TOP2A, TTK, TYMS, UBE2T, UHRF1, ZWINT |
| Genes without a significantly worse survival ( | CCNA2, NDC80, NEK2, NUSAP1, HMMR, CENPU |
HCC = hepatocellular carcinoma.
Figure 4.The expression level of the 27 DEGs between the liver tissue and the normao tissue. Identification of 27 genes that are significantly highly expressed in HCC tissues. The 28 genes that are significantly associated with a poor survival of HCC patients were analyzed by using GEPIA of HCC tissues versus normal livers. The data showed that 27 of these 28 genes were highly expressed in HCC versus normal liver samples (*P < .05). The red color refers to HCC; the gray color refers to normal liver samples. DEGs = the differentially expressed genes, GEPIA = gene expression profiling interactive analysis, HCC = hepatocellular carcinoma.
Validation of the 28 genes from the GEPIA data.
| Category | Genes |
|---|---|
| Highly expressed genes ( | ANLN, ASPM, AURKA, BIRC5, BUB1B, CCNB1, CDK1, CDKN3, DLGAP5, DTL, ECT2, FAM83D, GINS1, HELLS, KIAA0101, KIF20A, MAD2L1, NCAPG, PBK, PRC1, RACGAP1, RRM2, TOP2A, TYMS, UBE2T, UHRF1, ZWINT |
| Nonhighly expressed genes ( | TTK |
GEPIA = gene expression profiling interactive analysis.
Figure 5.The KEGG pathway enrichment analysis of the 27 selected genes. The data showed that level of Cyclin B1, CDK1, MAD2L1, and BUB1B was markedly enhanced in the cell cycle pathway. The red arrows point out the position of these genes. CycB, Cyclin B1; Mad2, MAD2L1; BubR1, BUB1B. BUB1B = mitotic checkpoint serine/threonine kinase B, CDK1 = Cyclin-dependent kinase 1, KEGG = Kyoto Encyclopedia of Genes and Genomes, MAD2L1 = Mitotic arrest deficient 2 like 1.
Re-analysis of 27 selected genes via KEGG pathway enrichment.
| Pathway ID | Name | Count | % |
| Genes |
|---|---|---|---|---|---|
| cfa04110 | Cell cycle | 4 | 8.36 | 3.09E-04 | CCNB1, CDK1, MAD2L1, BUB1B |
| cfa04115 | p53 signaling pathway | 3 | 6.27 | .002 | CCNB1, CDK1, RRM2 |
KEGG = Kyoto Encyclopedia of Genes and Genomes.