| Literature DB >> 27640171 |
Lingtao Su1,2, Xiangyu Meng3,4, Qingshan Ma5, Tian Bai1,2, Guixia Liu6,7.
Abstract
The importance of the construction of gene-gene interaction (GGI) network to better understand breast cancer has previously been highlighted. In this study, we propose a novel GGI network construction method called linear and probabilistic relations prediction (LPRP) and used it for gaining system level insight into breast cancer mechanisms. We construct separate genome-wide GGI networks for tumor and normal breast samples, respectively, by applying LPRP on their gene expression datasets profiled by The Cancer Genome Atlas. According to our analysis, a large loss of gene interactions in the tumor GGI network was observed (7436; 88.7 % reduction), which also contained fewer functional genes (4757; 32 % reduction) than the normal network. Tumor GGI network was characterized by a bigger network diameter and a longer characteristic path length but a smaller clustering coefficient and much sparse network connections. In addition, many known cancer pathways, especially immune response pathways, are enriched by genes in the tumor GGI network. Furthermore, potential cancer genes are filtered in this study, which may act as drugs targeting genes. These findings will allow for a better understanding of breast cancer mechanisms.Entities:
Keywords: Breast cancer; Gene–gene interaction; Network construction; TCGA dataset
Mesh:
Year: 2016 PMID: 27640171 PMCID: PMC5838217 DOI: 10.1007/s12539-016-0185-4
Source DB: PubMed Journal: Interdiscip Sci ISSN: 1867-1462 Impact factor: 2.233
Fig. 1Frequency distribution of −1, 0, 1 s in the discretized matrix D and in the two random matrices
Fig. 2Con and Re regulations, gene number and interaction number under different values. BRCA-T, BRCA-N and Random represent the tumor, normal and random samples, respectively. Gene number is the number of genes contained in the filtered interactions, interactions are sum of the Con and Re interactions, Con signifies the interactions that are forward-regulated, and Re denotes interactions that are reverse-regulated
Fig. 3Performance comparison of the LPRP and other methods using both simulated and real datasets. ACCURACY is defined as the number of known GGIs in the top 10,000, 100,000, 500,000 and 1,000,000 interactions. The top 10,000 is the number (10,000) of GGIs filtered in each method under the given thresholds. a The result based on the Dream5 network 4 datasets. b The result based on the normal BRCA gene expression datasets. MIC is not included due to its long running time
Fig. 4Performance comparison of the LPRP and other methods using syn.data real gene expression dataset (syn.data contained in minet R package, syn.data includes gene expression dataset and reference network). Panda [40, 41], Mrnet [42], CLR [43], ARACNE [16] and mrnetb are GGI network inference methods, all can be found in minet [44] R package
Fig. 5Performance comparison of the LPRP and other methods using simulated gene expression dataset (with 100 genes and 100 samples simulated by SynTren. SynTren can not only simulate gene expression but also give reference network)
Fig. 6Up- and down-regulated gene node degrees in the final tumor and normal GGI networks. Normal indicates a normal GGI network, tumor indicates a tumor GGI network, up degree gene indicates the degree of genes was larger in the tumor GGI network than in the normal GGI network, and down degree gene indicates that the degree of genes was smaller in the tumor GGI network than in the normal GGI network
Fig. 7Breast disease genes and their neighbors. In this figure, known breast disease-related genes are mapped to the final tumor GGI network, and their adjacent neighbors are filtered out
Functional annotation of clusters detected using MINE in the tumor GGI network
| Cluster | DAVID/SIGORA annotate | Gene | Benjamini | |
|---|---|---|---|---|
| 1 | Ribosome, translational elongation | 68 | 2.9E−125 | 1.1E−127 |
| 2 | Cell cycle, P53 signaling pathway, DNA replication | 73 | 9.5E−59 | 1.8E−61 |
| 3 | Regulation of lymphocyte activation, regulation of leukocyte activation, immune response, T cell receptor signaling pathway, Jak-STAT signaling pathway | 39 | 1.9E−21 | 2.3E−24 |
| 4 | Eextracellular matrix, proteinaceous extracellular matrix, cell adhesion, hydroxylation, extracellular region | 32 | 4.0E−36 | 4.4E−38 |
| 5 | Immune response, apoptosis, regulation of apoptosis, response to virus | 23 | 6.7E−9 | 3.3E−11 |
| 6 | Mitotic cell cycle, chromosome, centromeric region, intracellular non-membrane-bounded organelle, chemokine signaling pathway | 16 | 8.4E−11 | 4.1E−13 |
| 7 | Antigen processing and presentation of peptide antigen via MHC class I | 12 | 1.5E−13 | 1.5E−13 |
| 8 | Antigen processing and presentation of peptide or polysaccharide antigen via MHC class II, immune response | 8 | 1.4E−13 | 9.1E−16 |
| 9 | IgG binding, alternative splicing | 8 | 1.4E−8 | 1.5E−9 |
| 10 | SH2 domain, chemokine signaling pathway | 7 | 1.2E−4 | 2.4E−6 |
| 11 | Protein biosynthesis, RNA transport | 7 | 1.7E−4 | 1.8E−5 |
| 12 | 7 | |||
| 13 | Chemokine signaling pathway, response to wounding, Cytokine-cytokine receptor interaction | 6 | 1.7E−8 | 2.2E−10 |
| 14 | Epidermis development, epithelial cell differentiation, ectoderm development | 5 | 1.9E−7 | 6.8E−9 |
| 15 | Immune response | 5 | 2.6E−4 | 6.2E−6 |
| 16 | Chemokine signaling pathway, NOD-like receptor signaling pathway, sh3 domain | 5 | 1.7E−2 | 5.1E−4 |
| 17 | Cell cycle, DNA replication | 5 | 2.3E−6 | 3.8E−7 |
KEGG pathway enrichment analysis results
| ID | Pathway | |
|---|---|---|
| 1 | Cytokine–cytokine receptor interaction | 4.46E−200 |
| 2 | Metabolic pathways | 2.25E−34 |
| 3 | Jak-STAT signaling pathway | 3.24E−21 |
| 4 | Protein processing in endoplasmic reticulum | 1.18E−10 |
| 5 | ErbB signaling pathway | 8.19E−09 |
| 6 | Amino sugar and nucleotide sugar metabolism | 3.92E−06 |
| 7 | Histidine metabolism | 9.17E−06 |
| 8 | Caffeine metabolism | 1.05E−05 |
| 9 | Glycerophospholipid metabolism | 1.32E−05 |
| 10 | Asthma | 1.41E−05 |
| 11 | Vitamin B6 metabolism | 1.60E−05 |
| 12 | Sulfur relay system | 1.67E−05 |
| 13 | Small cell lung cancer | 2.35E−05 |
| 14 | Cysteine and methionine metabolism | 2.80E−05 |
| 15 | Glycosphingolipid biosynthesis—lacto and neolacto series | 3.14E−05 |
| 16 | Fc gamma R-mediated phagocytosis | 6.53E−05 |
| 17 | Base excision repair | 0.0001419 |
| 18 | Synthesis and degradation of ketone bodies | 0.0001611 |
| 19 | Protein digestion and absorption | 0.0001899 |
| 20 | Porphyrin and chlorophyll metabolism | 0.0002799 |
| 21 | GnRH signaling pathway | 0.0003071 |
| 22 | Osteoclast differentiation | 0.0004201 |
| 23 | Alanine, aspartate and glutamate metabolism | 0.0006986 |
| 24 | Long-term potentiation | 0.0007306 |
| 25 | Glycosylphosphatidylinositol(GPI)-anchor biosynthesis | 0.0007490 |
DAVID annotation results of the three clusters in Fig. 7
| Cluster | DAVID Annotate | Gene | Benjamini | |
|---|---|---|---|---|
| 1 | Extracellular matrix, cell adhesion, blood vessel development, EGF-like region, conserved site, cell migration, pathways in cancer | 77 | 1.0E−54 | 5.7E−57 |
| 2 | Disulfide bond, transmembrane protein, Chemokine signaling pathway, inflammatory. Response, immune response, apoptosis | 62 | 1.3E−14 | 5.9E−17 |
| 3 | Cell cycle, DNA repair, regulation of cell cycle process, pathways in cancer, apoptosis, immune response | 180 | 8.0E−41 | 2.7E−43 |