| Literature DB >> 30717315 |
Xueli Zhang1,2, Xiao-Feng Sun3, Bairong Shen4, Hong Zhang5.
Abstract
In order to find out the most valuable biomarkers and pathways for diagnosis, therapy and prognosis in colorectal cancer (CRC) we have collected the published CRC biomarkers and established a CRC biomarker database (CBD: http://sysbio.suda.edu.cn/CBD/index.html). In this study, we analysed the single and multiple DNA, RNA and protein biomarkers as well as their positions in cancer related pathways and protein-protein interaction (PPI) networks to describe their potential applications in diagnosis, therapy and prognosis. CRC biomarkers were collected from the CBD. The RNA and protein biomarkers were matched to their corresponding DNAs by the miRDB database and the PubMed Gene database, respectively. The PPI networks were used to investigate the relationships between protein biomarkers and further detect the multiple biomarkers. The Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway enrichment analysis and Gene Ontology (GO) annotation were used to analyse biological functions of the biomarkers. AI classification techniques were utilized to further verify the significances of the multiple biomarkers in diagnosis and prognosis for CRC. We showed that a large number of the DNA, RNA and protein biomarkers were associated with the diagnosis, therapy and prognosis in various degrees in the CRC biomarker networks. The CRC biomarkers were closely related to the CRC initiation and progression. Moreover, the biomarkers played critical roles in cellular proliferation, apoptosis and angiogenesis and they were involved in Ras, p53 and PI3K pathways. There were overlaps among the DNA, RNA and protein biomarkers. AI classification verifications showed that the combined multiple protein biomarkers played important roles to accurate early diagnosis and predict outcome for CRC. There were several single and multiple CRC protein biomarkers which were associated with diagnosis, therapy and prognosis in CRC. Further, AI-assisted analysis revealed that multiple biomarkers had potential applications for diagnosis and prognosis in CRC.Entities:
Keywords: DNA; RNA; cancer-related pathways; colorectal cancer; multiple-biomarkers; protein; single-biomarkers
Year: 2019 PMID: 30717315 PMCID: PMC6407036 DOI: 10.3390/cancers11020172
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Distributions and interactions of CRC diagnosis, therapy and prognosis biomarkers from the CBD. The numbers mentioned on the lines means the amounts of articles for the correlated biomarkers. (A) The CRC biomarkers were classified according to their functions of diagnosis, therapy and prognosis. (B) The biomarkers reported by more than 2 articles are presented. (C) The interactions of diagnosis, therapy and prognosis biomarkers.
Figure 2PPI networks of CRC protein biomarkers in diagnosis, therapy and prognosis. Distributions of protein biomarkers in diagnosis (A), therapy (B) and prognosis (C) of CRC are displayed. Top 10 most frequent protein biomarkers in related to the diagnosis, therapy and prognosis of the CRC are listed.
KEGG pathway enrichment results for CRC protein biomarkers.
| Pathway ID | Pathway Description | Counts | FDR |
|---|---|---|---|
|
| |||
| 03010 | Ribosome | 6 | 0.00157 |
| 05200 | Pathways in cancer | 8 | 0.00213 |
| 04066 | HIF-1 signalling pathway | 5 | 0.00281 |
| 04310 | Wnt signalling pathway | 5 | 0.00765 |
| 05206 | MicroRNAs in cancer | 5 | 0.00803 |
| 05131 | Shigellosis | 3 | 0.049 |
|
| |||
| 05200 | Pathways in cancer | 15 | 4.52 × 10−13 |
| 05219 | Bladder cancer | 7 | 6.28 × 10−10 |
| 05206 | MicroRNAs in cancer | 9 | 8.43 × 10−9 |
| 05161 | Hepatitis B | 8 | 1.56 × 10−7 |
| 05210 | Colorectal cancer | 6 | 3.78 × 10−7 |
| 04110 | Cell cycle | 7 | 9.35 × 10−7 |
| 05218 | Melanoma | 6 | 9.35 × 10−7 |
| 05215 | Prostate cancer | 6 | 2.7 × 10−6 |
| 05212 | Pancreatic cancer | 5 | 1.48 × 10−5 |
| 05220 | Chronic myeloid leukaemia | 5 | 2.48 × 10−5 |
|
| |||
| 05206 | MicroRNAs in cancer | 23 | 1.16 × 10−17 |
| 05219 | Bladder cancer | 13 | 1.47 × 10−14 |
| 05200 | Pathways in cancer | 26 | 3.98 × 10−13 |
| 04115 | p53 signalling pathway | 12 | 7.01 × 10−10 |
| 05166 | HTLV-I infection | 18 | 3.39 × 10−8 |
| 04060 | Cytokine-cytokine receptor interaction | 18 | 5.3 × 10−8 |
| 04151 | PI3K-Akt signalling pathway | 20 | 7.36 × 10−8 |
| 05215 | Prostate cancer | 11 | 1.15 × 10−7 |
| 05205 | Proteoglycans in cancer | 16 | 1.28 × 10−7 |
GO analysis results in biological process level for CRC protein biomarkers.
| Pathway ID | Pathway Description | Counts | FDR |
|---|---|---|---|
|
| |||
| Go:0042327 | Positive regulation of phosphorylation | 20 | 6.22 × 10−9 |
| Go:0045937 | Positive regulation of phosphate metabolic process | 21 | 6.22 × 10−9 |
| Go:0001934 | Positive regulation of protein phosphorylation | 19 | 1.42 × 10−8 |
| Go:0071822 | Protein complex subunit organization | 24 | 1.42 × 10−8 |
| Go:0042127 | Regulation of cell proliferation | 24 | 2.08 × 10−8 |
| Go:0042981 | Regulation of apoptotic process | 23 | 3.65 × 10−8 |
| Go:0048583 | Regulation of response to stimulus | 34 | 4.31 × 10−8 |
| Go:0043933 | Macromolecular complex subunit organization | 27 | 9.8 × 10−8 |
| Go:0043066 | Negative regulation of apoptotic process | 18 | 1.39 × 10−7 |
| Go:0008284 | Positive regulation of cell proliferation | 17 | 4.33 × 10−7 |
|
| |||
| GO:0060548 | Negative regulation of cell death | 20 | 7.29 × 10−11 |
| GO:0042981 | Regulation of apoptotic process | 21 | 5.27 × 10−9 |
| GO:0009628 | Response to abiotic stimulus | 19 | 8.77 × 10−9 |
| GO:0010941 | Regulation of cell death | 21 | 8.77 × 10−9 |
| GO:0043066 | Negative regulation of apoptotic process | 17 | 8.77 × 10−9 |
| GO:0031325 | Positive regulation of cellular metabolic process | 26 | 8.79 × 10−8 |
| GO:0010604 | Positive regulation of macromolecule metabolic process | 25 | 1.34 × 10−7 |
| GO:0009893 | Positive regulation of metabolic process | 28 | 1.89 × 10−7 |
| GO:0009605 | Response to external stimulus | 21 | 3.8 × 10−7 |
| GO:0048523 | Negative regulation of cellular process | 29 | 4.12 × 10−7 |
|
| |||
| GO:0042127 | Regulation of cell proliferation | 76 | 3.63 × 10−29 |
| GO:0006950 | Response to stress | 100 | 4.56 × 10−21 |
| GO:0048731 | System development | 101 | 1.33 × 10−20 |
| GO:0048522 | Positive regulation of cellular process | 111 | 5.31 × 10−20 |
| GO:0048523 | Negative regulation of cellular process | 105 | 5.31 × 10−20 |
| GO:0031325 | Positive regulation of cellular metabolic process | 88 | 6.82 × 10−20 |
| GO:0048518 | Positive regulation of biological process | 119 | 8.49 × 10−20 |
| GO:0010604 | Positive regulation of macromolecule metabolic process | 84 | 2.55 × 10−19 |
| GO:0048519 | Negative regulation of biological process | 107 | 7.7 × 10−19 |
| GO:0051247 | Positive regulation of protein metabolic process | 60 | 1.19 × 10−18 |
GO analysis results in molecular function level for CRC protein biomarkers.
| Pathway ID | Pathway Description | Counts | FDR |
|---|---|---|---|
|
| |||
| GO:0005515 | Protein binding | 44 | 2.81 × 10−10 |
| GO:0005102 | Receptor binding | 20 | 4.2 × 10−7 |
| GO:0042802 | Identical protein binding | 15 | 0.000526 |
| GO:0005488 | Binding | 53 | 0.00127 |
| GO:0001968 | Fibronectin binding | 3 | 0.0307 |
| GO:0005539 | Glycosaminoglycan binding | 6 | 0.0353 |
| GO:0003735 | Structural constituent of ribosome | 5 | 0.0358 |
| GO:0005126 | Cytokine receptor binding | 6 | 0.0358 |
| GO:0032403 | Protein complex binding | 9 | 0.0358 |
| GO:0019899 | Enzyme binding | 14 | 0.0365 |
|
| |||
| GO:0005515 | Protein binding | 36 | 3.52 × 10−10 |
| GO:0042802 | Identical protein binding | 16 | 8.85 × 10−7 |
| GO:0046983 | Protein dimerization activity | 13 | 1.15 × 10−5 |
| GO:0005488 | Binding | 42 | 0.000317 |
| GO:0019899 | Enzyme binding | 15 | 0.000317 |
| GO:0042803 | Protein homodimerization activity | 10 | 0.000445 |
| GO:0043566 | Structure-specific DNA binding | 7 | 0.00061 |
| GO:0046982 | Protein heterodimerization activity | 7 | 0.00061 |
| GO:0030983 | Mismatched DNA binding | 3 | 0.000839 |
| GO:0004861 | Cyclin-dependent protein serine/threonine kinase inhibitor activity | 3 | 0.00138 |
|
| |||
| GO:0005515 | Protein binding | 131 | 4.67 × 10−29 |
| GO:0005102 | Receptor binding | 45 | 1.18 × 10−11 |
| GO:0044877 | Macromolecular complex binding | 44 | 1.98 × 10−11 |
| GO:0005488 | Binding | 160 | 2.91 × 10−9 |
| GO:0042802 | Identical protein binding | 35 | 1.63 × 10−7 |
| GO:0019899 | Enzyme binding | 41 | 5.39 × 10−7 |
| GO:0032403 | Protein complex binding | 25 | 6.42 × 10−7 |
| GO:0003684 | Damaged DNA binding | 9 | 8.98 × 10−6 |
| GO:0043566 | Structure-specific DNA binding | 15 | 1.09 × 10−5 |
| GO:0019900 | Kinase binding | 19 | 9.05 × 10−5 |
GO analysis results in cellular component level for CRC protein biomarkers.
| Pathway ID | Pathway Description | Counts | FDR |
|---|---|---|---|
|
| |||
| GO:0005615 | Extracellular space | 20 | 1.49 × 10−6 |
| GO:0022627 | Cytosolic small ribosomal subunit | 6 | 1.49 × 10−6 |
| GO:0031982 | Vesicle | 33 | 1.49 × 10−6 |
| GO:0031988 | Membrane-bounded vesicle | 32 | 2.34 × 10−6 |
| GO:0005576 | Extracellular region | 36 | 2.74 × 10−6 |
| GO:0044421 | Extracellular region part | 32 | 7.96 × 10−6 |
| GO:0034774 | Secretory granule lumen | 6 | 1.67 × 10−5 |
| GO:0022626 | Cytosolic ribosome | 6 | 8.62 × 10−5 |
| GO:0030141 | Secretory granule | 9 | 8.62 × 10−5 |
| GO:0031093 | Platelet alpha granule lumen | 5 | 8.62 × 10−5 |
|
| |||
| GO:0005829 | Cytosol | 24 | 4.63 × 10−5 |
| GO:0044428 | Nuclear part | 26 | 4.63 × 10−5 |
| GO:0032991 | Macromolecular complex | 27 | 0.000117 |
| GO:0043233 | Organelle lumen | 26 | 0.000117 |
| GO:0043234 | Protein complex | 25 | 0.000117 |
| GO:0044427 | Chromosomal part | 11 | 0.000117 |
| GO:0031981 | Nuclear lumen | 23 | 0.000149 |
| GO:0005654 | Nucleoplasm | 21 | 0.000153 |
| GO:0005694 | Chromosome | 11 | 0.000164 |
| GO:0070013 | Intracellular organelle lumen | 24 | 0.000662 |
|
| |||
| GO:0005576 | Extracellular region | 96 | 1.33 × 10−10 |
| GO:0005615 | Extracellular space | 46 | 1.62 × 10−10 |
| GO:0044421 | Extracellular region part | 85 | 1.96 × 10−10 |
| GO:0005829 | Cytosol | 72 | 6.05 × 10−8 |
| GO:0005912 | Adherens junction | 23 | 1.13 × 10−7 |
| GO:0005924 | Cell-substrate adherens junction | 21 | 2.33 × 10−7 |
| GO:0043227 | Membrane-bounded organelle | 163 | 3.07 × 10−7 |
| GO:0009986 | Cell surface | 28 | 3.94 × 10−7 |
| GO:0005925 | Focal adhesion | 20 | 6.84 × 10−7 |
| GO:0031982 | Vesicle | 73 | 1.04 × 10−6 |
Figure 3Biomarkers in the Pathways in cancer. (A) Various cancer pathways involve in different cancer initiation and progression. (B) CRC biomarkers for diagnosis, therapy and prognosis biomarkers in the CBD were mapped in different colours in Pathways in cancer. The CRC biomarkers have been associated with apoptosis, cell proliferation, VEGF signalling pathway and Ras signalling pathway in the Pathways in cancer. Red: diagnosis biomarker; Blue: treatment biomarker; Purple: prognosis biomarker; Orange: diagnosis & treatment biomarker; Yellow: treatment & prognosis biomarker; Pink: diagnosis & treatment & prognosis biomarker.
Figure 4MiRNA in cancers. (A) MiRNAs involve in different types of cancers. (B) CRC Biomarkers in the miRNAs in cancer pathway. Different miRNAs and interactions among the miRNAs and a variety of genes, such as APC and K-ras have been involved in CRC initiation and progression process.
Figure 5Associations of DNA, RNA and protein biomarkers in diagnosis, therapy and prognosis of CRC. The RNA and protein biomarkers from our CBD were converted to their corresponding genes and the relationships between the overlapping genes were further analysed. There were two genes (IGFBP1 and PTPRG) from both RNA and protein biomarkers which were associated to CRC diagnosis and one gene (MYA6) was related to therapy. However, there were 24 genes which were associated with prognosis.
Biological functional analysis for overlapping DNA transferred by prognosis biomarkers.
| Pathway ID | Pathway Description | Counts | FDR |
|---|---|---|---|
|
| |||
| 05206 | MicroRNAs in cancer | 5 | 0.000171 |
| 04068 | FoxO signalling pathway | 3 | 0.0466 |
| 05200 | Pathways in cancer | 4 | 0.0466 |
|
| |||
| GO:0009887 | Organ morphogenesis | 9 | 0.00107 |
| GO:0010468 | Regulation of gene expression | 16 | 0.00107 |
| GO:0010557 | Positive regulation of macromolecule biosynthetic process | 11 | 0.00107 |
| GO:0010628 | Positive regulation of gene expression | 11 | 0.00107 |
| GO:2000112 | Regulation of cellular macromolecule biosynthetic process | 15 | 0.00107 |
| GO:0031328 | Positive regulation of cellular biosynthetic process | 11 | 0.00118 |
| GO:0048514 | Blood vessel morphogenesis | 6 | 0.00514 |
| GO:0010556 | Regulation of macromolecule biosynthetic process | 14 | 0.00588 |
| GO:0010604 | Positive regulation of macromolecule metabolic process | 12 | 0.00608 |
| GO:0001568 | Blood vessel development | 6 | 0.00631 |
Figure 6PPI network for the 24 overlapping prognosis genes. There were 13 genes which can been used to predict patients survival. The remaining genes worked in pairs or in groups to predict the prognosis.
Figure 7Diagnostic performance of multiple biomarkers for CRC. (A) The receiver operating (ROC) curves of all the 15 multiple biomarkers. (B) Distributions of AUC across biosignatures. The area under curve (AUC) statistics from 100 random training/testing divisions. The 15 multiple biomarkers were ranked.
Figure 8Kaplan-Meier survival curves of five multiple biomarkers with significant prognosis value. (A) Kaplan-Meier survival curves of multiple biomarker combined by PTEN and ZEB2. (B) Kaplan-Meier survival curves of multiple biomarker combined by STAT3 and S1PR1. (C) Kaplan-Meier survival curves of multiple biomarker combined by CASK and EPAS1. (D) Kaplan-Meier survival curves of multiple biomarker combined by KRAS, PTEN, STAT3, CD44, ZEB1, ZEB2 and S1PR1. (E) Kaplan-Meier survival curves of multiple biomarker combined by ZEB2 and ZEB1.