| Literature DB >> 24550935 |
Frank Emmert-Streib1, Ricardo de Matos Simoes1, Paul Mullan2, Benjamin Haibe-Kains3, Matthias Dehmer4.
Abstract
In this study, we infer the breast cancer gene regulatory network from gene expression data. This network is obtained from the application of the BC3Net inference algorithm to a large-scale gene expression data set consisting of 351 patient samples. In order to elucidate the functional relevance of the inferred network, we are performing a Gene Ontology (GO) analysis for its structural components. Our analysis reveals that most significant GO-terms we find for the breast cancer network represent functional modules of biological processes that are described by known cancer hallmarks, including translation, immune response, cell cycle, organelle fission, mitosis, cell adhesion, RNA processing, RNA splicing and response to wounding. Furthermore, by using a curated list of census cancer genes, we find an enrichment in these functional modules. Finally, we study cooperative effects of chromosomes based on information of interacting genes in the beast cancer network. We find that chromosome 21 is most coactive with other chromosomes. To our knowledge this is the first study investigating the genome-scale breast cancer network.Entities:
Keywords: BC3Net; GPEA; breast cancer; computational genomics; gene regulatory network; statistical inference
Year: 2014 PMID: 24550935 PMCID: PMC3909882 DOI: 10.3389/fgene.2014.00015
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Overview of histological annotations of the 351 samples of the expO breast cancer gene expression compendium.
| Ductal carcinoma | 270 |
| Lobular carcinoma | 39 |
| Invasive ductal carcinoma | 7 |
| Mucinous carcinoma | 6 |
| Ductal and lobular carcinoma | 4 |
| Metaplastic carcinoma | 3 |
| Metaplastic squamous carcinoma | 3 |
| Cribiform carcinoma | 2 |
| Inflammatory carcinoma | 2 |
| Intraductal carcinoma | 2 |
| Intraductal papillary carcinoma | 2 |
| Medullary carcinoma, NOS | 2 |
| Adenoid cystic carcinoma | 1 |
| Colloid adenocarcinoma | 1 |
| Ductal carcinoma | 1 |
| Intracystic carcinoma | 1 |
| Invasive ductal carcinoma, mucinous type | 1 |
| Lobular carcinoma | 1 |
| Papillary carcinoma | 1 |
| Papillary carcinoma (predominantly micropapillary pattern) | 1 |
| Unknown | 1 |
GPEA analysis of the breast cancer gene regulatory network for GO biological processes.
| GO:0000278 | Mitotic cell cycle | 776 | 1031 | 2.3e-260 | 54/+ |
| GO:0006414 | Translational elongation | 108 | 218 | 6.7e-259 | 1 |
| GO:0022403 | Cell cycle phase | 853 | 1142 | 1.1e-257 | 60/+ |
| GO:0006415 | Translational termination | 91 | 191 | 3.5e-244 | 1 |
| GO:0006614 | SRP-dependent cotranslational protein targeting to membrane | 105 | 195 | 9.1e-227 | 2 |
| GO:0045047 | Protein targeting to ER | 107 | 196 | 3.6e-225 | 2 |
| GO:0072599 | Establishment of protein localization to endoplasmic reticulum | 108 | 196 | 1.4e-223 | 2 |
| GO:0006613 | Cotranslational protein targeting to membrane | 107 | 195 | 1.4e-223 | 2 |
| GO:0000184 | Nuclear-transcribed mRNA catabolic process, non-sense-mediated decay | 118 | 198 | 1.1e-211 | 2 |
| GO:0070972 | Protein localization to endoplasmic reticulum | 121 | 197 | 6.2e-206 | 2 |
| GO:0006413 | Translational initiation | 153 | 226 | 1.0e-204 | 4 |
| GO:0000087 | M phase of mitotic cell cycle | 374 | 415 | 2.3e-182 | 20/+ |
| GO:0000280 | Nuclear division | 363 | 391 | 1.0e-171 | 20/+ |
| GO:0007067 | Mitosis | 363 | 391 | 1.0e-171 | 20/+ |
| GO:0000279 | M phase | 537 | 573 | 1.7e-171 | 33/+ |
| GO:0006612 | Protein targeting to membrane | 154 | 200 | 1.1e-169 | 4 |
| GO:0000956 | Nuclear-transcribed mRNA catabolic process | 171 | 212 | 1.0e-166 | 7 |
| GO:0048285 | Organelle fission | 388 | 409 | 1.6e-166 | 20/+ |
| GO:0019080 | Viral genome expression | 152 | 193 | 6.4e-163 | 10/+ |
| GO:0019083 | Viral transcription | 152 | 193 | 6.4e-163 | 10/+ |
| GO:0043624 | Cellular protein complex disassembly | 157 | 196 | 1.8e-161 | 2 |
| GO:0043241 | Protein complex disassembly | 162 | 197 | 1.3e-157 | 2 |
| GO:0006402 | mRNA catabolic process | 183 | 213 | 4.3e-156 | 7 |
| GO:0032984 | Macromolecular complex disassembly | 183 | 201 | 1.8e-142 | 7 |
| GO:0006401 | RNA catabolic process | 210 | 218 | 1.1e-137 | 7 |
| GO:0072594 | Establishment of protein localization to organelle | 212 | 213 | 6.1e-131 | 4 |
| GO:0019058 | Viral infectious cycle | 228 | 218 | 1.4e-123 | 14/+ |
| GO:0051301 | Cell division | 480 | 419 | 1.3e-112 | 35/+ |
| GO:0016071 | mRNA metabolic process | 614 | 544 | 1.3e-107 | 21/+ |
| GO:0006412 | Translation | 469 | 376 | 2.3e-93 | 16/+ |
| GO:0002682 | Regulation of immune system process | 893 | 827 | 7.1e-91 | 83/+ |
| GO:0022411 | Cellular component disassembly | 295 | 229 | 8.5e-90 | 12/+ |
| GO:0001775 | Cell activation | 763 | 663 | 6.4e-88 | 74/+ |
| GO:0046649 | Lymphocyte activation | 471 | 367 | 2.8e-87 | 61/+ |
| GO:0045321 | Leukocyte activation | 556 | 439 | 2.3e-84 | 63/+ |
| GO:0050776 | Regulation of immune response | 564 | 435 | 8.7e-79 | 43/+ |
| GO:0022415 | Viral reproductive process | 547 | 416 | 3.0e-77 | 44/+ |
| GO:0007155 | Cell adhesion | 963 | 867 | 1.4e-74 | 41/+ |
| GO:0022610 | Biological adhesion | 965 | 869 | 2.1e-74 | 41/+ |
| GO:0060337 | Type I interferon-mediated signaling pathway | 73 | 71 | 8.5e-73 | 5/+ |
| GO:0071357 | Cellular response to type I interferon | 73 | 71 | 8.5e-73 | 5/+ |
| GO:0034340 | Response to type I interferon | 74 | 71 | 5.7e-72 | 5/+ |
| GO:0016032 | Viral reproduction | 701 | 546 | 1.6e-68 | 46/+ |
| GO:0044764 | Multi-organism cellular process | 703 | 547 | 4.0e-68 | 46/+ |
| GO:0006396 | RNA processing | 656 | 488 | 1.4e-63 | 18 |
| GO:0010564 | Regulation of cell cycle process | 440 | 295 | 2.9e-62 | 45/+ |
| GO:0002684 | Positive regulation of immune system process | 558 | 387 | 1.9e-59 | 41/+ |
| GO:0006259 | DNA metabolic process | 880 | 707 | 1.2e-56 | 75/+ |
| GO:0051249 | Regulation of lymphocyte activation | 296 | 183 | 1.5e-56 | 34/+ |
| GO:0045087 | Innate immune response | 544 | 368 | 1.8e-56 | 25/+ |
Shown are the 50 most significant terms (p-values are Bonferroni adjusted). CG is the number of cancer genes present in each term and the “+” indicates the significant enrichment of a GO-term with cancer genes. The total number of census genes in these 50 GO-terms is 256.
Figure 1Subnetwork of the breast cancer GRN for the biological process cell cycle (GO:0007049). This subnetwork includes 51 census cancer genes, e.g., BRCA1 and BRCA2. The census cancer genes are highlighted in red. Overall, the shown network consists of 728 genes and 1671 interactions.
Figure 2Shown is the 1st degree neighbors subnetwork, G. Note that we also included RAD50, despite the fact that RAD50 is not present in cancer census gene list.
Length of the shortest path between pairs of breast cancer genes (Bgene1 and Bgene2).
| CHEK2 | BRIP1 | 2 |
| BRCA1 | BRIP1 | 3 |
| BRCA2 | CHEK2 | 3 |
| BRCA2 | NBN | 3 |
| BRCA2 | BRIP1 | 3 |
| ATM | NBN | 3 |
| NBN | PALB2 | 3 |
| BRCA1 | BRCA2 | 4 |
| BRCA1 | CHEK2 | 4 |
| BRCA1 | PALB2 | 4 |
| BRCA2 | PTEN | 4 |
| BRCA2 | RAD50 | 4 |
| BRCA2 | PALB2 | 4 |
| TP53 | PALB2 | 4 |
| PTEN | CHEK2 | 4 |
| PTEN | NBN | 4 |
| PTEN | RAD50 | 4 |
| CHEK2 | ATM | 4 |
| CHEK2 | NBN | 4 |
| CHEK2 | RAD50 | 4 |
| CHEK2 | PALB2 | 4 |
| NBN | BRIP1 | 4 |
| RAD50 | BRIP1 | 4 |
| RAD50 | PALB2 | 4 |
| BRIP1 | PALB2 | 4 |
| BRCA1 | PTEN | 5 |
| BRCA1 | ATM | 5 |
| BRCA1 | NBN | 5 |
| BRCA1 | RAD50 | 5 |
| BRCA2 | TP53 | 5 |
| BRCA2 | ATM | 5 |
| TP53 | PTEN | 5 |
| TP53 | CHEK2 | 5 |
| TP53 | ATM | 5 |
| TP53 | NBN | 5 |
| TP53 | RAD50 | 5 |
| TP53 | BRIP1 | 5 |
| PTEN | ATM | 5 |
| PTEN | BRIP1 | 5 |
| PTEN | PALB2 | 5 |
| ATM | RAD50 | 5 |
| ATM | BRIP1 | 5 |
| NBN | RAD50 | 5 |
| BRCA1 | TP53 | 6 |
| ATM | PALB2 | 6 |
Figure 3(A) Statistically significant cooperations between chromosome pairs are highlighted by a link. The Benjamini and Hochberg (BH) adjusted p-values for these links are shown in the Table (B). The significance level for this analysis was α = 0.05, which guarantees that FDR ≤ α holds for the set of all significant links.
Census cancer genes ranked by degree in all-census-pair shortest path network.
| 81137 | 725 | OR7E104P | Olfactory receptor, family 7, subfamily E, member 104 pseudogene |
| 100132767 | 524 | ||
| 2623 | 462 | GATA1 | GATA binding protein 1 (globin transcription factor 1) |
| 387601 | 432 | SLC22A25 | Solute carrier family 22, member 25 |
| 353135 | 415 | LCE1E | Late cornified envelope 1E |
| 284187 | 384 | ||
| 256646 | 334 | C15orf55 | Chromosome 15 open reading frame 55 |
| 219409 | 334 | GSX1 | GS homeobox 1 |
| 9362 | 326 | CPNE6 | Copine VI (neuronal) |
| 283933 | 319 | ZNF843 | Zinc finger protein 843 |
| 9127 | 311 | P2RX6 | Purinergic receptor P2X, ligand-gated ion channel, 6 |
| 285877 | 303 | POM121L12 | POM121 transmembrane nucleoporin-like 12 |
| 9758 | 292 | FRMPD4 | FERM and PDZ domain containing 4 |
| 3166 | 292 | HMX1 | H6 family homeobox 1 |
| 60506 | 290 | NYX | Nyctalopin |
| 348808 | 288 | NPHP3-AS1 | NPHP3 antisense RNA 1 |
| 4584 | 287 | MUC3A | Mucin 3A, cell surface associated |
| 284805 | 283 | C20orf203 | Chromosome 20 open reading frame 203 |
| 163778 | 262 | SPRR4 | Small proline-rich protein 4 |
| 155 | 256 | ADRB3 | Adrenoceptor beta 3 |
| 286023 | 252 | FLJ40288 | Uncharacterized FLJ40288 |
| 64405 | 251 | CDH22 | Cadherin 22, type 2 |
| 1813 | 248 | DRD2 | Dopamine receptor D2 |
| 149647 | 245 | FAM71A | Family with sequence similarity 71, member A |
| 253868 | 238 | C20orf166-AS1 | C20orf166 antisense RNA 1 |