| Literature DB >> 25079297 |
Frank Emmert-Streib, Ricardo de Matos Simoes, Galina Glazko, Simon McDade, Benjamin Haibe-Kains, Andreas Holzinger, Matthias Dehmer, Frederick Campbell.
Abstract
Cancer is a complex disease that has proven to be difficult to understand on the single-gene level. For this reason a functional elucidation needs to take interactions among genes on a systems-level into account. In this study, we infer a colon cancer network from a large-scale gene expression data set by using the method BC3Net. We provide a structural and a functional analysis of this network and also connect its molecular interaction structure with the chromosomal locations of the genes enabling the definition of cis- and trans-interactions. Furthermore, we investigate the interaction of genes that can be found in close neighborhoods on the chromosomes to gain insight into regulatory mechanisms. To our knowledge this is the first study analyzing the genome-scale colon cancer network.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25079297 PMCID: PMC4158620 DOI: 10.1186/1471-2105-15-S6-S6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Overview of the histologies of the 289 colon cancer samples provided by Expression Project for Oncology (expO).
| Histology | Number of Samples |
|---|---|
| Adenocarcinoma | 218 |
| Mucinous Adenocarcinoma | 36 |
| Adenocarcinoma arising in a villous adenoma | 15 |
| Metastatic Papillary Serous Adenocarcinoma | 3 |
| Carcinoma in situ arising in a villous adenoma | 2 |
| Metastatic Mucinous Adenocarcinoma | 2 |
| Adenocarcinoma In situ | 1 |
| Clear cell adenocarcinoma | 1 |
| Colloid Carcinoma | 1 |
| Medullary Carcinoma | 1 |
| Metastatic Adenocarcinoma | 1 |
| Metastatic Papillary Serous Carcinoma | 1 |
| Metastatic Serous adenocarcinoma (papillary serous) | 1 |
| Signet Ring Cell Carcinoma | 1 |
| Undifferentiated Carcinoma | 1 |
| Missing | 4 |
Biological Process GPEA analysis showing the 50 most significant terms.
| GOID | GO-term | #Genes | #Interactions | p-value | GCC | CG |
|---|---|---|---|---|---|---|
| GO:0022403 | cell cycle phase | 853 | 938 | 5.8e-238 | 349 | 60/+ |
| GO:0000278 | mitotic cell cycle | 776 | 818 | 7.1e-221 | 343 | 54/+ |
| GO:0006414 | translational elongation | 108 | 156 | 3.0e-181 | 72 | 1 |
| GO:0006415 | translational termination | 91 | 130 | 9.0e-160 | 67 | 1 |
| GO:0006614 | SRP-dependent cotranslational protein targeting to membrane | 105 | 136 | 4.6e-153 | 67 | 2 |
| GO:0045047 | protein targeting to ER | 107 | 137 | 2.1e-152 | 67 | 2 |
| GO:0072599 | establishment of protein localization to endoplasmic reticulum | 108 | 137 | 2.6e-151 | 67 | 2 |
| GO:0006613 | cotranslational protein targeting to membrane | 107 | 136 | 7.4e-151 | 67 | 2 |
| GO:0000279 | M phase | 537 | 462 | 4.1e-149 | 196 | 33/+ |
| GO:0000087 | M phase of mitotic cell cycle | 374 | 321 | 3.6e-144 | 159 | 20/+ |
| GO:0070972 | protein localization to endoplasmic reticulum | 121 | 140 | 2.2e-142 | 67 | 2 |
| GO:0000184 | nuclear-transcribed mRNA catabolic process, nonsense-mediated decay | 118 | 137 | 6.0e-141 | 70 | 2 |
| GO:0000280 | nuclear division | 363 | 305 | 7.2e-138 | 155 | 20/+ |
| GO:0007067 | mitosis | 363 | 305 | 7.2e-138 | 155 | 20/+ |
| GO:0006413 | translational initiation | 153 | 155 | 7.4e-134 | 78 | 4 |
| GO:0048285 | organelle fission | 388 | 318 | 4.0e-133 | 161 | 20/+ |
| GO:0006412 | translation | 469 | 355 | 5.2e-115 | 183 | 16 |
| GO:0000956 | nuclear-transcribed mRNA catabolic process | 171 | 150 | 1.1e-113 | 73 | 7 |
| GO:0006612 | protein targeting to membrane | 154 | 139 | 7.9e-113 | 67 | 4 |
| GO:0019080 | viral genome expression | 152 | 137 | 7.7e-112 | 70 | 10/+ |
| GO:0019083 | viral transcription | 152 | 137 | 7.7e-112 | 70 | 10/+ |
| GO:0016071 | mRNA metabolic process | 614 | 463 | 4.2e-109 | 301 | 21 |
| GO:0006402 | mRNA catabolic process | 183 | 152 | 1.2e-107 | 73 | 7 |
| GO:0043624 | cellular protein complex disassembly | 157 | 131 | 5.9e-101 | 67 | 2 |
| GO:0043241 | protein complex disassembly | 162 | 132 | 9.1e-99 | 67 | 2 |
| GO:0006401 | RNA catabolic process | 210 | 157 | 5.1e-96 | 74 | 7 |
| GO:0072594 | establishment of protein localization to organelle | 212 | 156 | 7.8e-94 | 74 | 4 |
| GO:0022904 | respiratory electron transport chain | 111 | 97 | 4.0e-90 | 62 | 5 |
| GO:0019058 | viral infectious cycle | 228 | 158 | 7.2e-87 | 81 | 14/+ |
| GO:0032984 | macromolecular complex disassembly | 183 | 133 | 7.8e-87 | 67 | 7 |
| GO:0045333 | cellular respiration | 163 | 122 | 1.3e-86 | 80 | 9/+ |
| GO:0006259 | DNA metabolic process | 880 | 655 | 2.9e-85 | 334 | 75/+ |
| GO:0051301 | cell division | 480 | 310 | 2.2e-81 | 126 | 35/+ |
| GO:0022900 | electron transport chain | 151 | 105 | 2.0e-74 | 66 | 5 |
| GO:0006396 | RNA processing | 656 | 428 | 1.1e-73 | 249 | 18 |
| GO:0060337 | type I interferon-mediated signaling pathway | 73 | 62 | 3.2e-67 | 29 | 5 |
| GO:0071357 | cellular response to type I interferon | 73 | 62 | 3.2e-67 | 29 | 5 |
| GO:0034340 | response to type I interferon | 74 | 62 | 1.7e-66 | 29 | 5 |
| GO:0002682 | regulation of immune system process | 893 | 609 | 1.2e-63 | 265 | 83/+ |
| GO:0051320 | S phase | 148 | 89 | 2.7e-58 | 40 | 8 |
| GO:0045087 | innate immune response | 544 | 308 | 1.8e-56 | 151 | 25/+ |
| GO:0051325 | interphase | 405 | 218 | 8.8e-56 | 116 | 34/+ |
| GO:0022411 | cellular component disassembly | 295 | 156 | 3.7e-55 | 69 | 12 |
| GO:0016032 | viral reproduction | 701 | 419 | 1.5e-54 | 150 | 46/+ |
| GO:0044764 | multi-organism cellular process | 703 | 420 | 2.5e-54 | 150 | 46/+ |
| GO:0022415 | viral reproductive process | 547 | 305 | 4.6e-54 | 107 | 44/+ |
| GO:0051329 | interphase of mitotic cell cycle | 399 | 210 | 3.8e-53 | 114 | 34/+ |
| GO:0050776 | regulation of immune response | 564 | 313 | 2.2e-52 | 146 | 43/+ |
| GO:0030198 | extracellular matrix organization | 209 | 110 | 5.5e-52 | 54 | 11/+ |
| GO:0043062 | extracellular structure organization | 210 | 110 | 1.4e-51 | 54 | 11/+ |
Significant enrichment of cancer census genes is indicated by a '+' (column seven). GCC denotes the size of the giant connected component corresponding to the genes of a GO-term; CG number of census cancer genes in the GCC.
Figure 1CSPNN for the 6 colon cancer genes . Genes on shortest paths and next neighbor genes are shown in gray besides if they are present in the census cancer gene list (PRDM16 (blue)). In total, this network contains 107 genes, including 7 census cancer genes, and 184 interactions.
Figure 2A: Statistically significant chromosome cooperations are highlighted by a link. B: The table shows the Benjamini & Hochberg (BH) adjusted p-values for these links.
Figure 3A: Connection between the ensemble consensus rate and BC3Net. B: Integrated ensemble consensus rate (ECR) for cis-interactions (red) and trans-interactions (blue). C: Median values of the individual ensemble consensus sets ECSfor .
The 10 major hub genes of the colon cancer network.
| entrez | symbol | Description | degree | locus | most significant GO-term |
|---|---|---|---|---|---|
| 81137 | OR7E104P | olfactory receptor | 458 | chr13q21.31 | GO:0007268 synaptic transmission |
| 2623 | GATA1 | transcription factor | 321 | chrXp11.23 | GO:0007601 visual perception |
| 348808 | NPHP3-AS1 | antisense RNA | 262 | chr3q22.1 | GO:0050906 detection of stimulus involved in sensory perception |
| 285877 | POM121L12 | transmembrane protein | 247 | chr7p12.1 | GO:0007606 sensory perception of chemical stimulus |
| 283933 | ZNF843 | zinc finger protein | 231 | chr16p11.2 | GO:0030534 adult behavior |
| 60506 | NYX | extracellular matrix | 217 | chrXp11.4 | GO:0042749 regulation of circadian sleep/wake cycle |
| 387601 | SLC22A25 | anion transporter | 216 | chr11q12.3 | GO:0048511 rhythmic process |
| 284805 | C20orf203 | ORF | 212 | chr20q11.21 | GO:0006813 potassium ion transport |
| 6521 | SLC4A1 | anion transporter | 208 | chr17q21.31 | GO:0072529 pyrimidine-containing compound catabolic process |
| 163778 | SPRR4 | envelope precursor | 207 | chr1q21.3 | GO:0007608 sensory perception of smell |
The hub genes are described by their entrez gene id, gene symbol, short description, node degree, chromosomal location and the most significant GO-term based on a Gene Ontology enrichment analysis based on the direct interactions for each hub gene.
Figure 4A: Analysis procedure for a GPEA. B: Shown are the locations of the largest 146 network components corresponding to gene sets of 1 Mb windows (red dots) along the chromosomes. Blue dots indicate the location of cancer census genes. C: The top ranked largest network component corresponding to the positional gene set on chromosome 8 with 29 genes (red).
Chromosomal neighborhood-induced GPEA and GO analysis.
| chr | locus | start | Size | edges | pvalue | gcc | census | term |
|---|---|---|---|---|---|---|---|---|
| chr8 | q24.3 | 145000001 | 35 | 52 | 3.6e-86 | 29 | RECQL4 | |
| chr8 | q24.3 | 145500001 | 31 | 37 | 3.3e-59 | 24 | RECQL4 | |
| chr6 | p22.2/p22.1 | 26000001 | 45 | 40 | 3.5e-52 | 23 | nucleosome assembly (9) | |
| chr6 | p22.2 | 25500001 | 46 | 40 | 2.1e-51 | 24 | nucleosome assembly (9) | |
| chrX | q28 | 153000001 | 37 | 33 | 1.2e-45 | 18 | ||
| chr19 | q13.31 | 44000001 | 35 | 31 | 1.8e-43 | 15 | regulation of transcription, DNA- | |
| dependent (15) | ||||||||
| chr7 | p15.1/.2 | 27000001 | 17 | 21 | 4.4e-39 | 12 | HOXA9, HOXA11, HOXA13, JAZF1 | anterior/posterior pattern specification (9) |
| chr7 | p15.2 | 26500001 | 18 | 21 | 6e-38 | 12 | HOXA9, HOXA11, HOXA13 | anterior/posterior pattern specification (9) |
| chr8 | q24.3 | 144500001 | 30 | 26 | 1e-37 | 14 | ||
| chr6 | p21.1 | 42500001 | 32 | 26 | 3.3e-36 | 18 | meiosis (3) | |
| chr19 | q13.31/q13.32 | 44500001 | 28 | 24 | 2.7e-35 | 12 | BCL3, CBLC | regulation of transcription, DNA- dependent (12) |
| chr17 | q12/q21.1/.2 | 37500001 | 26 | 22 | 8.4e-33 | 13 | ERBB2, | |
| RARA | ||||||||
| chr17 | p13.1 | 7000001 | 56 | 32 | 3.3e-32 | 22 | TP53 | |
| chrX | q28 | 153500001 | 30 | 22 | 5.6e-30 | 14 | MTCP1 | |
| chr1 | q22 | 155000001 | 33 | 23 | 6.4e-30 | 20 | MUC1 | |
| chr17 | q11.2 | 26500001 | 34 | 23 | 2.6e-29 | 17 | ||
| chr8 | p11.21 | 42000001 | 16 | 16 | 3.3e-28 | 11 | HOOK3 | |
| chr6 | p21.31/.32 | 32500001 | 34 | 22 | 1.6e-27 | 7 | DAXX | antigen processing and presentation of exogenous peptide antigen via MHC class II (6) proteasomal ubiquitin-dependent protein catabolic process (3) |
| chr9 | q34.3 | 139500001 | 53 | 27 | 3.4e-26 | 15 | ||
| chrX | p11.23 | 48500001 | 28 | 19 | 1.5e-25 | 14 | WAS, GATA1, TFE3 | |
| chr17 | q21.32 | 46000001 | 26 | 18 | 7.2e-25 | 7 | embryonic skeletal system development (5) | |
| chr16 | p13.3 | 1500001 | 47 | 24 | 2.5e-24 | 15 | TSC2 | protein ubiquitination (4) |
| chr17 | q21.32/.33 | 46500001 | 27 | 18 | 3e-24 | 7 | embryonic skeletal system development (5) | |
| chr17 | p13.1 | 6500001 | 51 | 25 | 4e-24 | 20 | ||
| chr8 | q24.3 | 144000001 | 29 | 18 | 4e-23 | 7 | heterocycle metabolic process (6) | |
| chr6 | p21.33 | 31000001 | 54 | 25 | 6.8e-23 | 13 | ||
| chr6 | p21.32/.33 | 31500001 | 54 | 25 | 6.8e-23 | 14 | ||
| chr19 | q13.43 | 58000001 | 40 | 21 | 9.2e-23 | 14 | transcription, DNA-dependent (14) regulation of type I interferon- mediated signaling pathway (8) homophilic cell adhesion (8) cellular biosynthetic process (9) | |
| chr9 | p21.3 | 20500001 | 20 | 15 | 1e-22 | 8 | MLLT3 | |
| chr5 | q31.3 | 140000001 | 52 | 24 | 3e-22 | 8 | ||
| chr17 | q12 | 37000001 | 21 | 15 | 4.4e-22 | 13 | LASP1, ERBB2 | |
| chr8 | p11.22/.23 | 37500001 | 18 | 14 | 5.2e-22 | 8 | WHSC1L1, FGFR1 | |
| chr19 | q13.43 | 57500001 | 35 | 19 | 8.4e-22 | 10 | regulation of transcription, DNA-dependent (10) | |
| chr17 | q25.3 | 79500001 | 46 | 22 | 1e-21 | 20 | ASPSCR1 | proteasomal ubiquitin-dependent protein catabolic process (3) |
| chrX | p11.23 | 48000001 | 28 | 16 | 5.6e-20 | 10 | SSX1, WAS, GATA1, TFE3 |
Each row corresponds to a window gene set. These windows are indexed by the chromosome, locus and base start. The number of genes in these windows and the edges between them are given in column four and five. Column six gives the p-value of the GPEA analysis (p-val) and column nine shows the most significant GO term for the genes in the GCC.