| Literature DB >> 28108733 |
Cheng Zhen1, Caizhong Zhu1, Haoyang Chen1, Yiru Xiong1, Junyuan Tan1, Dong Chen1, Jin Li1.
Abstract
OBJECTIVE: To systematically explore the molecular mechanism for hepatocellular carcinoma (HCC) metastasis and identify regulatory genes with text mining methods.Entities:
Keywords: hepatocellular carcinoma; metastasis; text mining
Mesh:
Year: 2017 PMID: 28108733 PMCID: PMC5355149 DOI: 10.18632/oncotarget.14692
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
The top 20 HCC metastasis-related genes based on text mining
| Gene | Description | Count |
|---|---|---|
| vascular endothelial growth factor A | 252 | |
| alpha fetoprotein | 190 | |
| cadherin 1 (E-cadherin) | 154 | |
| matrix metallopeptidase 2 | 154 | |
| matrix metallopeptidase 9 | 153 | |
| mitogen-activated protein kinase 1 | 123 | |
| transforming growth factor beta 1 | 118 | |
| AKT serine/threonine kinase 1 | 110 | |
| catenin beta 1 | 100 | |
| protein tyrosine kinase 2 (FAK) | 93 | |
| secreted phosphoprotein 1 | 85 | |
| NME/NM23 nucleoside diphosphate kinase 1 | 82 | |
| nuclear factor kappa B subunit 1 | 76 | |
| MET proto-oncogene, receptor tyrosine kinase | 75 | |
| basigin (CD147) | 72 | |
| phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha | 71 | |
| hypoxia inducible factor 1 alpha subunit | 68 | |
| CD44 molecule | 67 | |
| fibronectin 1 | 65 | |
| hepatocyte growth factor | 65 |
Only the genes that co-appeared with metastasis-related phenotypes in the same sentence will be counted. If a gene appeared several times in one sentence, it would be treated once.
The most significant KEGG pathways related to HCC metastasis
| KEGG Pathway | Genes | |
|---|---|---|
| Focal adhesion | 98 | < 0.0001 |
| Adherens junction | 42 | < 0.0001 |
| Regulation of actin cytoskeleton | 71 | < 0.0001 |
| Cytokine-cytokine receptor interaction | 78 | < 0.0001 |
| MAPK signaling pathway | 74 | < 0.0001 |
| Toll-like receptor signaling pathway | 39 | < 0.0001 |
| Oxidative phosphorylation | 1 | < 0.0001 |
| Apoptosis | 32 | < 0.0001 |
| Cell cycle | 31 | < 0.0001 |
| Purine metabolism | 6 | < 0.0001 |
| Insulin signaling pathway | 40 | 0.0001 |
| Wnt signaling pathway | 40 | 0.0002 |
| Neuroactive ligand-receptor interaction | 22 | 0.0002 |
| TGF-beta signaling pathway | 27 | 0.0002 |
| Pyrimidine metabolism | 3 | 0.002 |
| Tight junction | 30 | 0.002 |
| Glycerophospholipid metabolism | 2 | 0.003 |
| Jak-STAT signaling pathway | 37 | 0.007 |
| Gap junction | 24 | 0.01 |
| Fatty acid metabolism | 2 | 0.01 |
Official symbols of all 1116 genes were treated as inputs. KEGG pathways were sorted by P value.
Figure 1The PPI network of HCC-metastasis related genes
All edges were treated as undirected and all interactions were based on experiments. Isolated nodes and self-loops were deleted. Network was built with input nodes only, excluding their neighbours.
The top 20 nodes in HCC metastasis-related PPI network
| Node | Description | Degree |
|---|---|---|
| ubiquitin C | 739 | |
| epidermal growth factor receptor | 157 | |
| MDM2 proto-oncogene | 153 | |
| tumor protein p53 | 152 | |
| amyloid beta precursor protein | 149 | |
| heat shock protein 90 alpha family class A member 1 | 149 | |
| E1A binding protein p300 | 135 | |
| small ubiquitin-like modifier 1 | 130 | |
| growth factor receptor bound protein 2 | 128 | |
| SRC proto-oncogene, non-receptor tyrosine kinase | 126 | |
| catenin beta 1 | 125 | |
| fibronectin 1 | 123 | |
| tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein zeta | 122 | |
| estrogen receptor 1 | 110 | |
| heat shock protein 90 alpha family class B member 1 | 107 | |
| histone deacetylase 1 | 100 | |
| AKT serine/threonine kinase 1 | 96 | |
| androgen receptor | 92 | |
| mitogen-activated protein kinase 1 | 88 | |
| cullin 7 | 87 | |
| v-myc avian myelocytomatosis viral oncogene homolog | 87 |
The degree of each node was calculated with CytoNCA. All edges were treated as undirected. †MYC was equal with CUL7.
Figure 2Genes unique to HBV or HCV-related-metastasis
Genes with low frequency (freq < 5) were excluded. As papers about HCV-metastasis were less than that of HBV (136 VS 262), all frequencies of HCV particular genes were normalized based on the number of papers (×1.926).
The KEGG pathways for HBV/HCV particular genes related to metastasis
| KEGG Pathway | Genes | For HBV Particular Genes | For HCV Particular Genes | |
|---|---|---|---|---|
| MAPK signaling pathway | 15 | 0.0003 | Y | |
| Tight junction | 8 | 0.004 | Y | |
| Adherens junction | 6 | 0.007 | Y | |
| Cytokine-cytokine receptor interaction | 7 | 0.0003 | Y | |
| TGF-beta signaling pathway | 4 | 0.0006 | Y | |
| ECM-receptor interaction | 3 | 0.005 | Y | |
| Jak-STAT signaling pathway | 4 | 0.005 | Y | |
| Gap junction | 3 | 0.007 | Y | |
| Cell cycle | 3 | 0.008 | Y | |
| Focal adhesion | 16/5 | < 0.0001/0.005 | Y | Y |
| Regulation of actin cytoskeleton | 14/5 | 0.0003/0.003 | Y | Y |
| Toll-like receptor signaling pathway | 7/3 | 0.006/0.008 | Y | Y |
As shown in the last two columns, pathways may belong to HBV/HCV-HCC metastasis particular genes, or shared by both.
Figure 3The co-occurrence between genes and metastasis-related phenotypes
For each phenotype the size of circle indicated the number of genes that arise with it in one sentence. The thickness of edge reflected the frequency of each co-occurrence relationship.
Figure 4Cluster analysis for genes that co-appeared with metastasis-related phenotypes
Data were linearly normalized. Hierarchical cluster analysis was performed based on maximum-linkage, using similarity metric of Euclidean distance.