| Literature DB >> 26366411 |
Yung-Hao Wong1, Chia-Chou Wu1, Chih-Lung Lin2, Ting-Shou Chen2, Tzu-Hao Chang3, Bor-Sen Chen1.
Abstract
Hepatocellular carcinoma (HCC) is a major liver tumor (~80%), besides hepatoblastomas, angiosarcomas, and cholangiocarcinomas. In this study, we used a systems biology approach to construct protein-protein interaction networks (PPINs) for early-stage and late-stage liver cancer. By comparing the networks of these two stages, we found that the two networks showed some common mechanisms and some significantly different mechanisms. To obtain differential network structures between cancer and noncancer PPINs, we constructed cancer PPIN and noncancer PPIN network structures for the two stages of liver cancer by systems biology method using NGS data from cancer cells and adjacent noncancer cells. Using carcinogenesis relevance values (CRVs), we identified 43 and 80 significant proteins and their PPINs (network markers) for early-stage and late-stage liver cancer. To investigate the evolution of network biomarkers in the carcinogenesis process, a primary pathway analysis showed that common pathways of the early and late stages were those related to ordinary cancer mechanisms. A pathway specific to the early stage was the mismatch repair pathway, while pathways specific to the late stage were the spliceosome pathway, lysine degradation pathway, and progesterone-mediated oocyte maturation pathway. This study provides a new direction for cancer-targeted therapies at different stages.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26366411 PMCID: PMC4558430 DOI: 10.1155/2015/391475
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Flowchart illustrating construction of network markers in two stages of liver cancer and the investigation of carcinogenesis mechanisms. We integrated NGS data, the GO database, and protein-protein interaction (PPI) information to construct the PPI network. These data were used for pool selection, and then the selected proteins and NGS data were used to contribute to the PPI network (PPIN) by a maximum-likelihood estimation and model order detection method, resulting in a liver cancer PPIN (CPPIN) and a noncancer PPIN (NPPIN) in the early and late stages of liver cancer. The two constructed PPINs were used to determine significant proteins of tumorigenesis by examining differences between the two PPI matrices of the two constructed PPINs. With the help of a differential PPI matrix (network) between CPPIN and NPPIN, a carcinogenesis relevance value (CRV) was computed for each protein, and significant proteins in carcinogenesis were determined based on p values of the CRVs of these proteins in the differential PPI matrix between CPPIN and NPPIN. These significant proteins were obtained for early and late stages of liver cancer.
Descriptive information on datasets extracted from the TCGA database used in this study. Cases are grouped by the type of cancer, and surrounding normal tissues came from human patients in the early or late stage of liver cancer.
| Cancer type | TCGA dataset number | Early stage sample# | Late stage sample# | Normal sample# | Platform |
|---|---|---|---|---|---|
| Liver cancer | Batch100 | 19 | 18 | 24 | Illumina |
Top 20 proteins of early-stage and late-stage liver cancer.
| Protein | CRV |
| Case_AvgExp | Control_AvgExp | log2FC |
|---|---|---|---|---|---|
| Top 20 proteins of early-stage liver cancer | |||||
| APP | 46.52 | 6.09 | 13724 | 14041 | −0.03 |
| KRTAP4-12 | 33.47 | 6.09 | 1 | 1 | −0.17 |
| ELAVL1 | 22.99 | 0.000329 | 1725 | 1319 | 0.39 |
| KRTAP10-1 | 17.17 | 0.000756 | 1 | 1 | 0.04 |
| KRTAP10-5 | 14.73 | 0.000981 | 1 | 1 | 0.04 |
| H2AFX | 14.08 | 0.001048 | 608 | 271 | 1.17 |
| CDK1 | 13.69 | 0.001079 | 335 | 34 | 3.32 |
| PRKDC | 12.91 | 0.001176 | 4011 | 1684 | 1.25 |
| CUL3 | 12.83 | 0.001188 | 1376 | 1606 | −0.22 |
| ESR1 | 12.81 | 0.001194 | 228 | 1537 | −2.76 |
| EZH2 | 12.22 | 0.001274 | 310 | 49 | 2.66 |
| CEP250 | 12.05 | 0.001298 | 716 | 268 | 1.42 |
| AURKB | 11.41 | 0.001383 | 141 | 14 | 3.35 |
| CDC20 | 10.89 | 0.001523 | 359 | 20 | 4.17 |
| E2F1 | 10.54 | 0.001584 | 476 | 43 | 3.46 |
| OTX1 | 10.29 | 0.001664 | 31 | 2 | 4.27 |
| C19orf66 | 10.2 | 0.001688 | 1551 | 3166 | −1.03 |
| SUMO1 | 9.86 | 0.00181 | 2422 | 2982 | −0.3 |
| MCM4 | 9.76 | 0.001859 | 1308 | 336 | 1.96 |
| GRB2 | 9.19 | 0.0022 | 3572 | 2608 | 0.45 |
|
| |||||
| Top 20 proteins of late stage liver cancer | |||||
| ESR1 | 34.1 | 4.22 | 245 | 1537 | −2.65 |
| ELAVL1 | 29.69 | 2.53 | 1791 | 1319 | 0.44 |
| UBD | 28.54 | 3.38 | 20926 | 1781 | 3.55 |
| YWHAZ | 27.31 | 4.22 | 13323 | 6408 | 1.06 |
| SIRT7 | 24 | 0.000122 | 456 | 225 | 1.02 |
| HDAC5 | 22.07 | 0.000152 | 1696 | 840 | 1.01 |
| KRTAP10-5 | 20.54 | 0.000224 | 1 | 1 | 0.06 |
| EZH2 | 19.05 | 0.00027 | 394 | 49 | 3 |
| ILF2 | 18.86 | 0.000283 | 4111 | 1737 | 1.24 |
| CEP250 | 16.12 | 0.000452 | 627 | 268 | 1.23 |
| PCNA | 15.95 | 0.00046 | 2290 | 904 | 1.34 |
| SUMO2 | 15.77 | 0.00049 | 3305 | 2112 | 0.65 |
| H2AFX | 15.43 | 0.000532 | 806 | 271 | 1.58 |
| HSP90AB1 | 14.43 | 0.0006 | 30949 | 14629 | 1.08 |
| HGS | 13.9 | 0.000659 | 2441 | 1167 | 1.06 |
| APP | 13.87 | 0.000659 | 15083 | 14041 | 0.1 |
| WHSC1 | 12.06 | 0.000861 | 1369 | 447 | 1.61 |
| SETDB1 | 11.7 | 0.000912 | 1203 | 497 | 1.28 |
| TRAF2 | 11.45 | 0.000938 | 665 | 277 | 1.26 |
| SMARCA4 | 11.18 | 0.000992 | 2696 | 1115 | 1.27 |
| SFN | 11.03 | 0.001026 | 1008 | 55 | 4.19 |
AvgExp means average expression.
log2FC means log2 fold change.
Figure 2The constructed cancer differential protein-protein interaction networks (DPPINs) for early, late, and total stages of liver cancer. This figure shows the DPPINs with edge and node information for the early, late, and total stages of liver cancer. The DPPIN is the difference between the cancer PPIN (CPPIN) and noncancer PPIN (NPPIN). The figures were created using Cytoscape.
The 27 identified significant proteins of core network marker in both early-stage and late-stage liver cancer (intersection).
| Common network marker of early-stage and late-stage liver cancer | |||||
|---|---|---|---|---|---|
| Protein | CRV |
| Cancer_AvgExp | Control_AvgExp | log2FC |
| APP | 46.52 | 0.00001 | 13724 | 14041 | −0.03 |
| APP | 13.87 | 0.00066 | 15083 | 14041 | 0.1 |
| ELAVL1 | 22.99 | 0.00033 | 1725 | 1319 | 0.39 |
| ELAVL1 | 29.69 | 0.00003 | 1791 | 1319 | 0.44 |
| KRTAP10-5 | 14.73 | 0.00098 | 1 | 1 | 0.04 |
| KRTAP10-5 | 20.54 | 0.00022 | 1 | 1 | 0.06 |
| H2AFX | 14.08 | 0.00105 | 608 | 271 | 1.17 |
| H2AFX | 15.43 | 0.00053 | 806 | 271 | 1.58 |
| CDK1 | 13.69 | 0.00108 | 335 | 34 | 3.32 |
| CDK1 | 10.31 | 0.00119 | 596 | 34 | 4.15 |
| ESR1 | 12.81 | 0.00119 | 228 | 1537 | −2.76 |
| ESR1 | 34.1 | 0.00000 | 245 | 1537 | −2.65 |
| EZH2 | 12.22 | 0.00127 | 310 | 49 | 2.66 |
| EZH2 | 19.05 | 0.00027 | 394 | 49 | 3 |
| CEP250 | 12.05 | 0.00130 | 716 | 268 | 1.42 |
| CEP250 | 16.12 | 0.00045 | 627 | 268 | 1.23 |
| AURKB | 11.41 | 0.00138 | 141 | 14 | 3.35 |
| AURKB | 7.68 | 0.00289 | 199 | 14 | 3.85 |
| CDC20 | 10.89 | 0.00152 | 359 | 20 | 4.17 |
| CDC20 | 8.94 | 0.00168 | 688 | 20 | 5.11 |
| E2F1 | 10.54 | 0.00158 | 476 | 43 | 3.46 |
| E2F1 | 8.85 | 0.00174 | 673 | 43 | 3.95 |
| OTX1 | 10.29 | 0.00166 | 31 | 2 | 4.27 |
| OTX1 | 6.25 | 0.00811 | 26 | 2 | 4.02 |
| SUMO1 | 9.86 | 0.00181 | 2422 | 2982 | −0.3 |
| SUMO1 | 8.63 | 0.00188 | 2624 | 2982 | −0.18 |
| MCM4 | 9.76 | 0.00186 | 1308 | 336 | 1.96 |
| MCM4 | 6.76 | 0.00535 | 1327 | 336 | 1.98 |
| HGS | 7.96 | 0.00312 | 2504 | 1167 | 1.1 |
| HGS | 13.9 | 0.00066 | 2441 | 1167 | 1.06 |
| KPNA2 | 7.86 | 0.00323 | 1927 | 689 | 1.48 |
| KPNA2 | 10.43 | 0.00116 | 2533 | 689 | 1.88 |
| UBC | 7.85 | 0.00323 | 34388 | 35490 | −0.05 |
| UBC | 6.63 | 0.00591 | 36931 | 35490 | 0.06 |
| SPRY2 | 7.5 | 0.00376 | 409 | 989 | −1.27 |
| SPRY2 | 7.17 | 0.00399 | 328 | 989 | −1.59 |
| TOPBP1 | 7.33 | 0.00406 | 799 | 346 | 1.21 |
| TOPBP1 | 6.45 | 0.00698 | 875 | 346 | 1.34 |
| SIRT7 | 7.11 | 0.00456 | 442 | 225 | 0.98 |
| SIRT7 | 24 | 0.00012 | 456 | 225 | 1.02 |
| WHSC1 | 6.67 | 0.00590 | 1217 | 447 | 1.44 |
| WHSC1 | 12.06 | 0.00086 | 1369 | 447 | 1.61 |
| MCM2 | 6.45 | 0.00666 | 1097 | 174 | 2.66 |
| MCM2 | 8.48 | 0.00200 | 1351 | 174 | 2.96 |
| AURKA | 6.4 | 0.00685 | 570 | 86 | 2.73 |
| AURKA | 9.25 | 0.00154 | 516 | 86 | 2.58 |
| COPS5 | 6.39 | 0.00691 | 1506 | 1184 | 0.35 |
| COPS5 | 10 | 0.00128 | 1708 | 1184 | 0.53 |
| PCNA | 6.29 | 0.00749 | 1652 | 904 | 0.87 |
| PCNA | 15.95 | 0.00046 | 2290 | 904 | 1.34 |
| BUB1B | 6.27 | 0.00762 | 179 | 12 | 3.89 |
| BUB1B | 7.47 | 0.00327 | 320 | 12 | 4.73 |
| DNMT1 | 6.02 | 0.00920 | 1376 | 455 | 1.6 |
| DNMT1 | 10.63 | 0.00110 | 1383 | 455 | 1.6 |
There are two rows with same proteins name while the upper row represents the early stage liver cancer and the lower raw represents the late stage liver cancer.
AvgExp means average expression.
log2FC means log2 fold change.
Top 30 proteins of total-stage liver cancer.
| Protein | CRV |
| Case_AvgExp | Control_AvgExp | log2FC |
|---|---|---|---|---|---|
| APP | 81.53 | <10−9 | 14385 | 14041 | 0.03 |
| ELAVL1 | 3.65 | 0.000187 | 1757 | 1319 | 0.41 |
| CCDC33 | 2.82 | 0.000464 | 3 | 1 | 1.7 |
| UBC | 25.19 | 0.000588 | 35625 | 35490 | 0.01 |
| HIST3H3 | 24.3 | 0.000626 | 1 | 1 | 0.02 |
| KRTAP10-1 | 23.96 | 0.000643 | 1 | 1 | 0.02 |
| ESR1 | 23.79 | 0.000652 | 236 | 1537 | −2.7 |
| UBD | 23.34 | 0.000682 | 14701 | 1781 | 3.04 |
| H2AFX | 17.28 | 0.001125 | 705 | 271 | 1.38 |
| KRTAP10-5 | 17.05 | 0.001133 | 1 | 1 | 0.05 |
| HGS | 15.2 | 0.001329 | 2473 | 1167 | 1.08 |
| TRAF2 | 14.28 | 0.001461 | 581 | 277 | 1.07 |
| HSP90AB1 | 14.16 | 0.001483 | 29756 | 14629 | 1.02 |
| PCNA | 13.58 | 0.001572 | 1962 | 904 | 1.12 |
| SMARCA4 | 13.21 | 0.001611 | 2546 | 1115 | 1.19 |
| TAF6 | 12.84 | 0.001636 | 1075 | 478 | 1.17 |
| SUMO2 | 12.6 | 0.001704 | 2968 | 2112 | 0.49 |
| UBQLN4 | 12.59 | 0.001713 | 1881 | 905 | 1.05 |
| COPS5 | 12.45 | 0.001743 | 1604 | 1184 | 0.44 |
| CEP250 | 12.41 | 0.001747 | 673 | 268 | 1.33 |
| EZH2 | 12.15 | 0.001802 | 351 | 49 | 2.84 |
| CDK1 | 11.69 | 0.001892 | 462 | 34 | 3.78 |
| TCF3 | 11.35 | 0.00199 | 980 | 415 | 1.24 |
| CDC20 | 11.28 | 0.002011 | 519 | 20 | 4.7 |
| MCM2 | 10.9 | 0.002126 | 1220 | 174 | 2.81 |
| GRB2 | 10.64 | 0.002233 | 3488 | 2608 | 0.42 |
| WHSC1 | 10.5 | 0.002288 | 1291 | 447 | 1.53 |
| MYOD1 | 10.5 | 0.002292 | 2 | 1 | 1.22 |
| AURKB | 10.48 | 0.002305 | 169 | 14 | 3.61 |
| SUMO1 | 10.36 | 0.002373 | 2520 | 2982 | −0.24 |
The intersection of the total-stage liver cancer with our previous result.
| Protein | CRV |
| CRV |
|
|---|---|---|---|---|
| BUB1B | 7.58 | 0.005833 | 5.5696 | 0.00064 |
| CDC20 | 11.28 | 0.002011 | 5.1507 | 0.00109 |
| CDK2 | 6.9 | 0.007708 | 14.069 |
<1 |
| CUL3 | 8.16 | 0.004478 | 12.9519 | <1 |
| E2F1 | 10.19 | 0.002497 | 3.9947 | 0.00862 |
| ESR1 | 23.79 | 0.000652 | 10.3758 | <1 |
| HDAC4 | 10.11 | 0.002531 | 5.8397 | 0.00048 |
| HGS | 15.2 | 0.001329 | 4.6929 | 0.00232 |
| MYC | 7.55 | 0.005884 | 10.7821 | <1 |
| PCNA | 13.58 | 0.001572 | 15.1438 | <1 |
| PRKDC | 8.19 | 0.004418 | 5.9369 | 0.00041 |
| SMARCA4 | 13.21 | 0.001611 | 7.7449 | 0.0001 |
| SUMO1 | 10.36 | 0.002373 | 15.8533 | <1 |
| TRAF2 | 14.28 | 0.001461 | 4.7703 | 0.00207 |
| UBC | 25.19 | 0.000588 | 137.284 | <1 |
Figure 3Overview of significant pathways in network marker of total stage of liver cancer. Among KEGG pathways identified via the DAVID tool (Table 4) showing significant associations with specific proteins of the early stage of liver cancer, these molecular pathways had p values of ≤0.05. This indicates that these pathways play important roles in the carcinogenesis mechanism in the early stage of liver cancer. The network markers of the early stage of liver cancer highlighted by stars show potentially targets in the pathways. Due to the different naming systems, the same proteins in these tables and in the text are labeled with different names.
Figure 4Overview of significant pathways of network markers of early-stage liver cancer. KEGG pathways accessed using the DAVID tool (Table 7) show significant associations with specific proteins in the early stage of liver cancer; these molecular pathways are labeled with p values of ≤0.05. These pathways were shown to play important roles in the carcinogenesis mechanism in the early stage of liver cancer. Network markers of the early stage of liver cancer as highlighted by stars show potential targets in the pathways. Due to different naming systems, the same proteins in the tables and text have different names. The proteins in the early-stage liver cancer network marker are enriched in “hsa03040:Mismatch repair” (Rank 9 in Table 7).
Figure 5Overview of significant pathways of network markers of late-stage liver cancer. KEGG pathways identified using the DAVID tool (Table 8) show significant associations with specific proteins in the late stage of liver cancer; these molecular pathways are labeled with p values of ≤0.05. These pathways were shown to play important roles in the carcinogenesis mechanism of the early stage of liver cancer. Proteins in network markers of the early stage of liver cancer as highlighted by stars are potential targets in the pathways. Due to different naming systems, the same proteins in the tables and in the text have different names. The proteins in the total stage liver cancer network marker are enriched in “hsa03040:Spliceosome” (Rank 5 in Table 8).
| Rank | Term | Count | Symbol |
|
|---|---|---|---|---|
| KEGG | ||||
| 1 | hsa04110:Cell cycle | 14 | E2F1, CDK1, PRKDC, CDC20, MCM2, MCM3, YWHAE, MCM4, CDK2, MCM7, GSK3B, PCNA, BUB1B, MYC | 5.96 |
| 2 | hsa05215:Prostate cancer | 7 | E2F1, HSP90AB1, GRB2, GSK3B, MAPK3, PTEN, CDK2 | 2.62 |
| 3 | hsa04114:Oocyte meiosis | 7 | CDK1, PPP1CA, MAPK3, CDC20, AURKA, YWHAE, CDK2 | 8.70 |
| 4 | hsa03030:DNA replication | 5 | MCM7, PCNA, MCM2, MCM3, MCM4 | 9.48 |
| 5 | hsa05200:Pathways in cancer | 10 | E2F1, HSP90AB1, TRAF2, GRB2, MSH2, GSK3B, MAPK3, MYC, PTEN, CDK2 | 2.73 |
| 6 | hsa05213:Endometrial cancer | 5 | GRB2, GSK3B, MAPK3, MYC, PTEN | 4.03 |
| 7 | hsa05210:Colorectal cancer | 5 | GRB2, MSH2, GSK3B, MAPK3, MYC | 0.002457 |
| 8 | hsa05222:Small cell lung cancer | 5 | E2F1, TRAF2, MYC, PTEN, CDK2 | 0.002457 |
| 9 | hsa05214:Glioma | 4 | E2F1, GRB2, MAPK3, PTEN | 0.008946 |
| 10 | hsa04722:Neurotrophin signaling pathway | 5 | GRB2, GSK3B, MAPK3, YWHAE, TP73 | 0.009832 |
| 11 | hsa04115:p53 signaling pathway | 4 | CDK1, PTEN, CDK2, TP73 | 0.011029 |
| 12 | hsa05220:Chronic myeloid leukemia | 4 | E2F1, GRB2, MAPK3, MYC | 0.014381 |
| 13 | hsa04914:Progesterone-mediated oocyte maturation | 4 | HSP90AB1, CDK1, MAPK3, CDK2 | 0.020705 |
| 14 | hsa04012:ErbB signaling pathway | 4 | GRB2, GSK3B, MAPK3, MYC | 0.021345 |
| 15 | hsa04540:Gap junction | 4 | CDK1, GRB2, MAPK3, TUBA1B | 0.022657 |
| 16 | hsa05219:Bladder cancer | 3 | E2F1, MAPK3, MYC | 0.033366 |
| 17 | hsa04510:Focal adhesion | 5 | PPP1CA, GRB2, GSK3B, MAPK3, PTEN | 0.047655 |
| 18 | hsa05223:Non-small cell lung cancer | 3 | E2F1, GRB2, MAPK3 |
|
| 19 | hsa05221:Acute myeloid leukemia | 3 | GRB2, MAPK3, MYC |
|
| 20 | hsa04910:Insulin signaling pathway | 4 | PPP1CA, GRB2, GSK3B, MAPK3 |
|
| 21 | hsa05218:Melanoma | 3 | E2F1, MAPK3, PTEN |
|
| 22 | hsa04662:B cell receptor signaling pathway | 3 | GRB2, GSK3B, MAPK3 |
|
|
| ||||
| BioCarte | ||||
| 1a | h_g1Pathway:Cell Cycle:G1/S Check Point | 4 | E2F1, CDK1, GSK3B, CDK2 | 0.012616 |
| 2a | h_p27Pathway:Regulation of p27 Phosphorylation during Cell Cycle Progression | 3 | E2F1, NEDD8, CDK2 | 0.020162 |
| 3a | h_p53Pathway:p53 Signaling Pathway | 3 | E2F1, PCNA, CDK2 | 0.033691 |
| 4a | h_her2Pathway:Role of ERBB2 in Signal Transduction and Oncology | 3 | GRB2, MAPK3, ESR1 | 0.049866 |
| 5a | h_ptenPathway:PTEN dependent cell cycle arrest and apoptosis | 3 | GRB2, MAPK3, PTEN |
|
| 6a | h_RacCycDPathway:Influence of Ras and Rho proteins on G1 to S Transition | 3 | E2F1, MAPK3, CDK2 |
|
| 7a | h_cellcyclePathway:Cyclins and Cell Cycle Regulation | 3 | E2F1, CDK1, CDK2 |
|
The significant pathways via DAVID Bioinformatics database are selected for the 74 significant proteins in carcinogenesis. Bold indicates p value > 0.05. Most of them are from the KEGG database, and we add several key pathways from BioCarta.
| GO:term |
| Corrected |
|
|
|
| Term name |
|---|---|---|---|---|---|---|---|
| (1) Biological processes | |||||||
| GO:0051320 | 9.1 | 1.9 | 6357 | 9 | 12 | 4 | S phase |
| GO:0000084 | 9.1 | 1.9 | 6357 | 9 | 12 | 4 | S phase of mitotic cell cycle |
| GO:0006267 | 3.3 | 7.1 | 6357 | 9 | 16 | 4 | Prereplicative complex assembly |
| GO:0065004 | 1.4 | 3.1 | 6357 | 9 | 68 | 5 | Protein-DNA complex assembly |
| GO:0000727 | 2.7 | 5.8 | 6357 | 9 | 26 | 4 | Double-strand break repair via break-induced replication |
| GO:0006270 | 3.7 | 8.0 | 6357 | 9 | 28 | 4 | DNA-dependent DNA replication initiation |
| GO:0022616 | 5.7 | 1.2 | 6357 | 9 | 31 | 4 | DNA strand elongation |
| GO:0006271 | 5.7 | 1.2 | 6357 | 9 | 31 | 4 | DNA strand elongation involved in DNA replication |
| GO:0000724 | 1.3 | 2.8 | 6357 | 9 | 38 | 4 | Double-strand break repair via homologous recombination |
| GO:0022402 | 2.5 | 5.3 | 6357 | 9 | 445 | 7 | Cell cycle process |
|
| |||||||
| (2) Cellular components | |||||||
| GO:0042555 | 2.7 | 1.3 | 6357 | 9 | 6 | 4 | MCM complex |
| GO:0005656 | 3.3 | 1.5 | 6357 | 9 | 16 | 4 | Prereplicative complex |
| GO:0031261 | 1.3 | 6.3 | 6357 | 9 | 22 | 4 | DNA replication preinitiation complex |
| GO:0031298 | 2.3 | 1.0 | 6357 | 9 | 25 | 4 | Replication fork protection complex |
| GO:0032993 | 2.9 | 1.3 | 6357 | 9 | 46 | 4 | Protein-DNA complex |
| GO:0043234 | 9.7 | 4.5 | 6357 | 9 | 1369 | 9 | Protein complex |
| GO:0044454 | 4.9 | 2.3 | 6357 | 9 | 217 | 5 | Nuclear chromosome part |
| GO:0044451 | 1.5 | 7.5 | 6357 | 9 | 275 | 5 | Nucleoplasm part |
| GO:0044428 | 1.6 | 7.7 | 6357 | 9 | 1251 | 8 | Nuclear part |
| GO:0044427 | 2.5 | 0.0011 | 6357 | 9 | 302 | 5 | Chromosomal part |
|
| |||||||
| (3) Molecular functions | |||||||
| GO:0043566 | 3.8 | 2.6 | 6357 | 9 | 85 | 6 | Structure-specific DNA binding |
| GO:0043138 | 5.6 | 3.7 | 6357 | 9 | 18 | 4 | 3′-5′ DNA helicase activity |
| GO:0004003 | 2.7 | 1.8 | 6357 | 9 | 26 | 4 | ATP-dependent DNA helicase activity |
| GO:0003682 | 3.3 | 2.2 | 6357 | 9 | 80 | 5 | Chromatin binding |
| GO:0003688 | 4.9 | 3.3 | 6357 | 9 | 30 | 4 | DNA replication origin binding |
| GO:0003678 | 1.4 | 9.9 | 6357 | 9 | 39 | 4 | DNA helicase activity |
| GO:0003697 | 2.4 | 1.6 | 6357 | 9 | 44 | 4 | Single-stranded DNA binding |
| GO:0016887 | 4.9 | 3.3 | 6357 | 9 | 278 | 6 | ATPase activity |
| GO:0043140 | 5.5 | 3.7 | 6357 | 9 | 13 | 3 | ATP-dependent 3′-5′ DNA helicase activity |
| GO:0008094 | 1.3 | 9.1 | 6357 | 9 | 67 | 4 | DNA-dependent ATPase activity |
R: number of genes in reference set.
T: number of genes in test set.
G: number of genes annotated by given term in reference set.
O: number of genes annotated by given term in test set.
| Rank | Term | Count | Symbol |
|
|---|---|---|---|---|
| KEGG | ||||
| 1 | hsa04110:Cell cycle | 11 | E2F1, CCNB1, CDK1, CDKN2A, PCNA, BUB1B, PRKDC, CDC20, MCM2, MCM4, MYC | 4.90 |
| 2 | hsa03030:DNA replication | 4 | POLD1, PCNA, MCM2, MCM4 | 5.24 |
| 3 | hsa05220:Chronic myeloid leukemia | 4 | E2F1, CDKN2A, GRB2, MYC | 0.004415 |
| 4 | hsa04114:Oocyte meiosis | 4 | CCNB1, CDK1, CDC20, AURKA | 0.01273 |
| 5 | hsa05219:Bladder cancer | 3 | E2F1, CDKN2A, MYC | 0.015099 |
| 6 | hsa05223:Non-small cell lung cancer | 3 | E2F1, CDKN2A, GRB2 | 0.024285 |
| 7 | hsa05214:Glioma | 3 | E2F1, CDKN2A, GRB2 | 0.03234 |
| 8 | hsa04115:p53 signaling pathway | 3 | CCNB1, CDK1, CDKN2A | 0.037212 |
| 9 | hsa03430:Mismatch repair | 2 | POLD1, PCNA | 0.09922 |
|
| ||||
| BioCarta | ||||
| 1a | h_cellcyclePathway:Cyclins and Cell Cycle Regulation | 4 | E2F1, CCNB1, CDK1, CDKN2A | 0.002246 |
| 2a | h_srcRPTPPathway:Activation of Src by Protein-tyrosine phosphatase alpha | 3 | CCNB1, CDK1, GRB2 | 0.004 |
| 3a | h_g2Pathway:Cell Cycle:G2/M Checkpoint | 3 | CCNB1, CDK1, PRKDC | 0.025668 |
| 4a | h_g1Pathway:Cell Cycle:G1/S Check Point | 3 | E2F1, CDK1, CDKN2A | 0.039621 |
| 5a | h_ptc1Pathway:Sonic Hedgehog (SHH) Receptor Ptc1 Regulates cell cycle | 2 | CCNB1, CDK1 |
|
The significant pathways via DAVID Bioinformatics database are selected for the 43 significant proteins in carcinogenesis. Bold indicates p value > 0.05. Most of them are from the KEGG database, and we add several key pathways from BioCarta.
| GO:term |
| Corrected |
|
|
|
| Term name |
|---|---|---|---|---|---|---|---|
| (1) Biological processes | |||||||
| GO:0051320 | 1.9 | 0.0028 | 6357 | 4 | 12 | 2 | S phase |
| GO:0000084 | 1.9 | 0.0028 | 6357 | 4 | 12 | 2 | S phase of mitotic cell cycle |
| GO:0006267 | 3.5 | 0.0052 | 6357 | 4 | 16 | 2 | Prereplicative complex assembly |
| GO:0000727 | 9.6 | 0.0142 | 6357 | 4 | 26 | 2 | Double-strand break repair via break-induced replication |
| GO:0006270 | 1.1 | 0.0165 | 6357 | 4 | 28 | 2 | DNA-dependent DNA replication initiation |
| GO:0022616 | 1.3 | 0.0203 | 6357 | 4 | 31 | 2 | DNA strand elongation |
| GO:0006271 | 1.3 | 0.0203 | 6357 | 4 | 31 | 2 | DNA strand elongation involved in DNA replication |
| GO:0000724 | 2.0 | 0.0306 | 6357 | 4 | 38 | 2 | Double-strand break repair via homologous recombination |
| GO:0000725 | 3.1 | 0.0470 | 6357 | 4 | 47 | 2 | Recombinational repair |
| GO:0022403 | 3.4 | 0.0515 | 6357 | 4 | 286 | 3 | Cell cycle phase |
|
| |||||||
| (2) Cellular components | |||||||
| GO:0042555 | 4.4 | 1.3 | 6357 | 4 | 6 | 2 | MCM complex |
| GO:0005656 | 3.5 | 0.0010 | 6357 | 4 | 16 | 2 | Prereplicative complex |
| GO:0031261 | 6.8 | 0.0020 | 6357 | 4 | 22 | 2 | DNA replication preinitiation complex |
| GO:0031298 | 8.8 | 0.0026 | 6357 | 4 | 25 | 2 | Replication fork protection complex |
| GO:0032993 | 3.0 | 0.0091 | 6357 | 4 | 46 | 2 | Protein-DNA complex |
| GO:0000151 | 5.7 | 0.0171 | 6357 | 4 | 63 | 2 | Ubiquitin ligase complex |
| GO:0043234 | 0.0021 | 0.0643 | 6357 | 4 | 1369 | 4 | Protein complex |
| GO:0033597 | 0.0025 | 0.0754 | 6357 | 4 | 4 | 1 | Mitotic checkpoint complex |
| GO:0031463 | 0.0031 | 0.0942 | 6357 | 4 | 5 | 1 | Cul3-RING ubiquitin ligase complex |
| GO:0043596 | 0.0062 | 0.1883 | 6357 | 4 | 10 | 1 | Nuclear replication fork |
|
| |||||||
| (3) Molecular functions | |||||||
| GO:0043138 | 4.5 | 0.0021 | 6357 | 4 | 18 | 2 | 3′-5′ DNA helicase activity |
| GO:0004003 | 9.6 | 0.0046 | 6357 | 4 | 26 | 2 | ATP-dependent DNA helicase activity |
| GO:0003688 | 1.2 | 0.0061 | 6357 | 4 | 30 | 2 | DNA replication origin binding |
| GO:0003678 | 2.1 | 0.0104 | 6357 | 4 | 39 | 2 | DNA helicase activity |
| GO:0003697 | 2.7 | 0.0133 | 6357 | 4 | 44 | 2 | Single-stranded DNA binding |
| GO:0008094 | 6.4 | 0.0310 | 6357 | 4 | 67 | 2 | DNA-dependent ATPase activity |
| GO:0003682 | 9.2 | 0.0443 | 6357 | 4 | 80 | 2 | Chromatin binding |
| GO:0043566 | 0.0010 | 0.0500 | 6357 | 4 | 85 | 2 | Structure-specific DNA binding |
| GO:0004842 | 0.0010 | 0.0500 | 6357 | 4 | 85 | 2 | Ubiquitin-protein ligase activity |
| GO:0070035 | 0.0010 | 0.0511 | 6357 | 4 | 86 | 2 | Purine NTP-dependent helicase activity |
R: number of genes in reference set.
T: number of genes in test set.
G: number of genes annotated by given term in reference set.
O: number of genes annotated by given term in test set.
| Rank | Term | Count | Symbol |
|
|---|---|---|---|---|
| KEGG | ||||
| 1 | hsa04110:Cell cycle | 16 | E2F1, CDK1, YWHAZ, YWHAB, CDC20, SFN, MCM2, MCM3, CDK4, MCM4, CDK2, MCM7, PCNA, YWHAQ, BUB1B, CCNA2 | 3.90 |
| 2 | hsa04114:Oocyte meiosis | 8 | CDK1, YWHAZ, YWHAB, YWHAQ, CDC20, AURKA, PPP1CC, CDK2 | 3.62 |
| 3 | hsa03030:DNA replication | 5 | MCM7, PCNA, MCM2, MCM3, MCM4 | 2.35 |
| 4 | hsa04115:p53 signaling pathway | 4 | CDK1, SFN, CDK4, CDK2 | 0.020496 |
| 5 | hsa03040:Spliceosome | 5 | SNRPB, THOC4, LSM2, SF3A2, SF3B4 | 0.022722 |
| 6 | hsa05222:Small cell lung cancer | 4 | E2F1, TRAF2, CDK4, CDK2 | 0.035432 |
| 7 | hsa04914:Progesterone-mediated oocyte maturation | 4 | HSP90AB1, CDK1, CCNA2, CDK2 | 0.037607 |
| 8 | hsa00310:Lysine degradation | 3 | SETDB1, WHSC1, EHMT2 |
|
| 9 | hsa05200:Pathways in cancer | 7 | E2F1, HSP90AB1, TRAF2, MSH2, BIRC5, CDK4, CDK2 |
|
|
| ||||
| BioCarte | ||||
| 1a | h_p53Pathway:p53 Signaling Pathway | 4 | E2F1, PCNA, CDK4, CDK2 | 0.003378 |
| 2a | h_cellcyclePathway:Cyclins and Cell Cycle Regulation | 4 | E2F1, CDK1, CDK4, CDK2 | 0.010334 |
| 3a | h_g1Pathway:Cell Cycle:G1/S Check Point | 4 | E2F1, CDK1, CDK4, CDK2 | 0.015615 |
| 4a | h_rbPathway:RB Tumor Suppressor/Checkpoint Signaling in response to DNA damage | 3 | CDK1, CDK4, CDK2 | 0.019987 |
| 5a | h_ranMSpathway:Role of Ran in mitotic spindle regulation | 3 | RAN, AURKA, KPNA2 | 0.019987 |
| 6a | h_g2Pathway:Cell Cycle:G2/M Checkpoint | 3 | CDK1, YWHAQ, BRCA1 |
|
| 7a | h_RacCycDPathway:Influence of Ras and Rho proteins on G1 to S Transition | 3 | E2F1, CDK4, CDK2 |
|
The significant pathways via DAVID Bioinformatics database are selected for the 74 significant proteins in carcinogenesis. Bold indicates p value > 0.05. Most of them are from the KEGG database, and we add several key pathways from BioCarta.
| GO:term |
| Corrected |
|
|
|
| Term name |
|---|---|---|---|---|---|---|---|
| (1) Biological processes | |||||||
| GO:0051320 | 1.5 | 3.6 | 6357 | 10 | 12 | 4 | S phase |
| GO:0000084 | 1.5 | 3.6 | 6357 | 10 | 12 | 4 | S phase of mitotic cell cycle |
| GO:0006267 | 5.5 | 1.3 | 6357 | 10 | 16 | 4 | Prereplicative complex assembly |
| GO:0000727 | 4.5 | 1.1 | 6357 | 10 | 26 | 4 | Double-strand break repair via break-induced replication |
| GO:0006270 | 6.2 | 1.5 | 6357 | 10 | 28 | 4 | DNA-dependent DNA replication initiation |
| GO:0022616 | 9.5 | 2.3 | 6357 | 10 | 31 | 4 | DNA strand elongation |
| GO:0006271 | 9.5 | 2.3 | 6357 | 10 | 31 | 4 | DNA strand elongation involved in DNA replication |
| GO:0000724 | 2.2 | 5.4 | 6357 | 10 | 38 | 4 | Double-strand break repair via homologous recombination |
| GO:0006260 | 5.1 | 1.2 | 6357 | 10 | 120 | 5 | DNA replication |
| GO:0000725 | 5.3 | 1.2 | 6357 | 10 | 47 | 4 | Recombinational repair |
|
| |||||||
| (2) Cellular components | |||||||
| GO:0042555 | 4.6 | 2.2 | 6357 | 10 | 6 | 4 | MCM complex |
| GO:0005656 | 5.5 | 2.6 | 6357 | 10 | 16 | 4 | Prereplicative complex |
| GO:0031261 | 2.2 | 1.0 | 6357 | 10 | 22 | 4 | DNA replication preinitiation complex |
| GO:0031298 | 3.8 | 1.8 | 6357 | 10 | 25 | 4 | Replication fork protection complex |
| GO:0032993 | 4.8 | 2.3 | 6357 | 10 | 46 | 4 | Protein-DNA complex |
| GO:0044454 | 2.3 | 0.0113 | 6357 | 10 | 217 | 4 | Nuclear chromosome part |
| GO:0044451 | 5.8 | 0.0280 | 6357 | 10 | 275 | 4 | Nucleoplasm part |
| GO:0044428 | 7.7 | 0.0370 | 6357 | 10 | 1251 | 7 | Nuclear part |
| GO:0044427 | 8.3 | 0.0400 | 6357 | 10 | 302 | 4 | Chromosomal part |
| GO:0043234 | 0.0013 | 0.0656 | 6357 | 10 | 1369 | 7 | Protein complex |
|
| |||||||
| (3) Molecular functions | |||||||
| GO:0043138 | 9.3 | 6.4 | 6357 | 10 | 18 | 4 | 3′-5′ DNA helicase activity |
| GO:0004003 | 4.5 | 3.1 | 6357 | 10 | 26 | 4 | ATP-dependent DNA helicase activity |
| GO:0003688 | 8.3 | 5.7 | 6357 | 10 | 30 | 4 | DNA replication origin binding |
| GO:0043566 | 9.0 | 6.2 | 6357 | 10 | 85 | 5 | Structure-specific DNA binding |
| GO:0003678 | 2.4 | 1.7 | 6357 | 10 | 39 | 4 | DNA helicase activity |
| GO:0003697 | 4.0 | 2.8 | 6357 | 10 | 44 | 4 | Single-stranded DNA binding |
| GO:0043140 | 7.9 | 5.4 | 6357 | 10 | 13 | 3 | ATP-dependent 3′-5′ DNA helicase activity |
| GO:0008094 | 2.2 | 1.5 | 6357 | 10 | 67 | 4 | DNA-dependent ATPase activity |
| GO:0003682 | 4.6 | 3.1 | 6357 | 10 | 80 | 4 | Chromatin binding |
| GO:0070035 | 6.1 | 4.2 | 6357 | 10 | 86 | 4 | Purine NTP-dependent helicase activity |
R: number of genes in reference set.
T: number of genes in test set.
G: number of genes annotated by given term in reference set.
O: number of genes annotated by given term in test set.