| Literature DB >> 22583750 |
Ricardo de Matos Simoes1, Shailesh Tripathi, Frank Emmert-Streib.
Abstract
BACKGROUND: The physical periphery of a biological cell is mainly described by signaling pathways which are triggered by transmembrane proteins and receptors that are sentinels to control the whole gene regulatory network of a cell. However, our current knowledge about the gene regulatory mechanisms that are governed by extracellular signals is severely limited.Entities:
Mesh:
Year: 2012 PMID: 22583750 PMCID: PMC3476434 DOI: 10.1186/1752-0509-6-38
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1The gene regulatory network is composed of the transcriptional regulatory network, protein network and a signaling network spanning the whole cell.
Figure 2Illustration of our simulation set-up to generate bootstrap data sets. Using the true underlying network G as reference, we estimate F-scores for each of the inferred networks. The colors of the arrows correspond to the boxplots in Figure 5. Data set D represents one data set of the whole ensemble.
Figure 5Comparing the F-scores for inferring a scale-free network. Blue: 300 independently simulated gene expression data sets. Red: BE 1. Green: BE 2. The data sets were generated with sample sizes ranging from 100, 200, 500 to 1000 samples
Figure 4Reference hierarchical network with 8 weakly connected MTM network modules. Each module consists of 25 genes. The modules are connected by 0, 3, 5, 10 and 15 edges, as indicated by the black, thick edges between the modules.
Figure 3Illustration of the usage of the bootstrap ensemble for the clustering of the network components.
Figure 6Simulated gene expression data are generated from a network with a defined hierarchical structure (). From the simulated data, the distance matrix D is estimated. The inferred hierarchy of the network modules is obtained from a hierarchical clustering of matrix D. The measures d1 and d2 evaluate the estimated hierarchical structure.
Figure 7Example for the functioning ofd1. The reference hierarchy RH describes network modules that are directly connected by a parent/child relation or by network modules that share the same parent network module. The scoring function evaluates each cluster from the lowest to the highest split s in the dendrogram. In the given example, the inferred dendrogram contains a total of one clustering error (split s4).
Figure 8Simulation study for the reconstruction of the hierarchical organization within a network. Shown are the results for 100 simulated gene expression data sets generated from networks with network modules connected by 0, 3, 5, 10 and 15 edges. A) Dendrogram clustering error d1. B) Graph-edit distance hierarchy error.
Figure 9Histogram of the size of the 463 separate network components of the inferred B-cell C3NET gene regulatory network inferred from the(left). Middle and right: DLBCL-C3NET gene regulatory networks with 556 separated network components for the Salaverria GSE22470 dataset and 593 separated network components for the Lenz GSE11318 dataset.
GO category Biological Process: Enrichment analysis of the genes in the giant connected component
| GO ID | Term | All | Sig | Exp | p-value |
|---|---|---|---|---|---|
| GO:0007186 | G-protein coupled receptor protein signaling pathway | 450 | 89 | 38.65 | 1.4e-14 |
| GO:0050877 | neurological system process | 567 | 99 | 48.70 | 1.4e-12 |
| GO:0032501 | multicellular organismal process | 2714 | 317 | 233.12 | 1.7e-12 |
| GO:0003008 | system process | 858 | 127 | 73.70 | 1.2e-10 |
| GO:0007267 | cell-cell signaling | 532 | 87 | 45.70 | 1.3e-09 |
| GO:0007166 | cell surface receptor linked signaling pathway | 1071 | 145 | 91.99 | 2.9e-09 |
| GO:0019226 | transmission of nerve impulse | 312 | 58 | 26.80 | 9e-09 |
| GO:0023033 | signaling pathway | 1127 | 148 | 96.80 | 1.6e-08 |
| GO:0023052 | signaling | 2504 | 279 | 215.08 | 3.4e-08 |
| GO:0023060 | signal transmission | 2154 | 246 | 185.02 | 4.6e-08 |
| GO:0023046 | signaling process | 2155 | 246 | 185.10 | 4.9e-08 |
| GO:0030001 | metal ion transport | 281 | 51 | 24.14 | 1.6e-07 |
| GO:0007268 | synaptic transmission | 274 | 50 | 23.54 | 1.8e-07 |
| GO:0070838 | divalent metal ion transport | 114 | 27 | 9.79 | 8.1e-07 |
| GO:0006816 | calcium ion transport | 112 | 26 | 9.62 | 2e-06 |
GO category Molecular function: Enrichment analysis of the genes in the giant connected component
| GO ID | Term | All | Sig | Exp | p-value |
|---|---|---|---|---|---|
| GO:0004930 | G-protein coupled receptor activity | 247 | 60 | 21.35 | 5.2e-14 |
| GO:0004872 | receptor activity | 797 | 121 | 68.89 | 1.1e-10 |
| GO:0004888 | transmembrane receptor activity | 547 | 88 | 47.28 | 3.3e-09 |
| GO:0004871 | signal transducer activity | 1118 | 150 | 96.63 | 4.6e-09 |
| GO:0060089 | molecular transducer activity | 1118 | 150 | 96.63 | 4.6e-09 |
| GO:0005509 | calcium ion binding | 528 | 85 | 45.64 | 6.2e-09 |
| GO:0046873 | metal ion transmembrane transporter activity | 166 | 35 | 14.35 | 4.8e-07 |
| GO:0042165 | neurotransmitter binding | 51 | 17 | 4.41 | 6.2e-07 |
| GO:0005261 | cation channel activity | 138 | 30 | 11.93 | 1.6e-06 |
| GO:0005216 | ion channel activity | 198 | 38 | 17.11 | 1.9e-06 |
| GO:0022836 | gated channel activity | 176 | 35 | 15.21 | 2e-06 |
| GO:0015267 | channel activity | 215 | 40 | 18.58 | 2.3e-06 |
| GO:0022803 | passive transmembrane transporter activity | 215 | 40 | 18.58 | 2.3e-06 |
| GO:0022857 | transmembrane transporter activity | 510 | 74 | 44.08 | 4.2e-06 |
| GO:0022838 | substrate-specific channel activity | 206 | 38 | 17.81 | 5e-06 |
GO category Cellular component: Enrichment analysis of the genes in the giant connected component
| GO ID | Term | All | Sig | right | p-value |
|---|---|---|---|---|---|
| GO:0016021 | integral to membrane | 2201 | 282 | 193.28 | 2.2e-14 |
| GO:0031224 | intrinsic to membrane | 2260 | 285 | 198.46 | 1.2e-13 |
| GO:0044459 | plasma membrane part | 1388 | 195 | 121.89 | 4e-13 |
| GO:0044425 | membrane part | 2739 | 327 | 240.53 | 1.1e-12 |
| GO:0005886 | plasma membrane | 2084 | 264 | 183.01 | 1.3e-12 |
| GO:0005887 | integral to plasma membrane | 900 | 131 | 79.03 | 9e-10 |
| GO:0031226 | intrinsic to plasma membrane | 915 | 132 | 80.35 | 1.4e-09 |
| GO:0016020 | membrane | 3462 | 372 | 304.02 | 4.3e-08 |
| GO:0034702 | ion channel complex | 125 | 31 | 10.98 | 6.7e-08 |
| GO:0005576 | extracellular region | 1077 | 136 | 94.58 | 3e-06 |
| GO:0034703 | cation channel complex | 80 | 21 | 7.03 | 3.4e-06 |
| GO:0034704 | calcium channel complex | 22 | 10 | 1.93 | 6.2e-06 |
| GO:0005903 | brush border | 31 | 11 | 2.72 | 3.6e-05 |
| GO:0005891 | voltage-gated calcium channel complex | 17 | 8 | 1.49 | 4e-05 |
| GO:0030054 | cell junction | 327 | 48 | 28.72 | 0.00024 |
Figure 10Hierarchical organization of the= 25 largest network components of the B-cell C3NET gene regulatory network inferred from theand the DLBCL gene regulatory network inferred from theSalaverria GSE22470 datasetand the. The numbers of the network components correspond to the leaves of the dendrogram. The GCC corresponds to number ‘1’. A) Clustering of the GO enrichment analysis, category Cellular Component. We show GO terms from the top ranked frequency counts of GO terms for the branches, indicted by the vertical color bars. B) Hierarchical clustering of the structural network components. Left and right: The red part of a dendrogram represents network components with Gene Ontology terms enriched with extracellular and membrane proteins. Green and blue network components are enriched with intracellular proteins.
Top 25 hub genes with a degree (deg) larger than 20 found in the B-cell lymphoma gene regulatory network - genes are described by their gene id, gene symbol, and, if available, one selected annotation term from GO (category Biological Process), bc refers to the betweenness centrality and the number in brackets to its rank with respect to the bc values
| Entrez | Symbol | deg/bc(rank) | Description |
|---|---|---|---|
| 778 | CACNA1F | 46/282019(1) | calcium channel, voltage-dependent, L type, alpha 1F subunit (calcium ion transport GO:0006816) |
| 2949 | GSTM5 | 38/14454(39) | glutathione S-transferase mu 5 (metabolic process GO:0008152) |
| 7275 | TUB | 37/16818(35) | tubby homolog (mouse) (response to stimulus GO:0050896) |
| 9080 | CLDN9 | 35/65406(6) | claudin 9 (calcium-independent cell-cell adhesion GO:0016338) |
| 3363 | HTR7 | 33/1436(406) | 5-hydroxytryptamine (serotonin) receptor 7 (adenylate cyclase-coupled) (signal transduction GO:0007165) |
| 1579 | CYP4A11 | 32/24724(25) | cytochrome P450, family 4, subfamily A, polypeptide 11 (long-chain fatty acid metabolic process GO:0001676) |
| 796 | CALCA | 32/46142(11) | calcitonin-related polypeptide alpha (endothelial cell proliferation GO:0001935) |
| 7546 | ZIC2 | 28/6978(95) | Zic family member 2 (odd-paired homolog, Drosophila) (cell differentiation GO:0030154) |
| 1993 | ELAVL2 | 28/9991(62) | ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B) ( NA) |
| 1549 | CYP2A7 | 27/33469(17) | cytochrome P450, family 2, subfamily A, polypeptide 7 (oxidation-reduction process GO:0055114) |
| 5554 | PRH1 | 27/8093(77) | proline-rich protein HaeIII subfamily 1 ( NA) |
| 2516 | NR5A1 | 27/248679(2) | nuclear receptor subfamily 5, group A, member 1 (cell-cell signaling GO:0007267) |
| 11222 | MRPL3 | 27/12748(45) | mitochondrial ribosomal protein L3 (translation GO:0006412) |
| 92017 | SNX29 | 26/159597(3) | sorting nexin 29 (cell communication GO:0007154) |
| 6534 | SLC6A7 | 25/46858(10) | solute carrier family 6 (neurotransmitter transporter, L-proline), member 7 (proline transport GO:0015824) |
| 115703 | ARHGAP33 | 25/28006(21) | Rho GTPase activating protein 33 (signal transduction GO:0007165) |
| 40 | ACCN1 | 24/5548(113) | amiloride-sensitive cation channel 1, neuronal (sodium ion transport GO:0006814) |
| 1943 | EFNA2 | 23/34712(16) | ephrin-A2 (cell-cell signaling GO:0007267) |
| 7047 | TGM4 | 23/147229(4) | transglutaminase 4 (prostate) (peptide cross-linking GO:0018149) |
| 343 | AQP8 | 22/17440(34) | aquaporin 8 (water transport GO:0006833) |
| 9127 | P2RX6 | 22/63918(7) | purinergic receptor P2X, ligand-gated ion channel, 6 (signal transduction GO:0007165) |
| 2797 | GNRH2 | 21/2976(202) | gonadotropin-releasing hormone 2 (signal transduction GO:0007165) |
| 5545 | PRB4 | 21/9246(66) | proline-rich protein BstNI subfamily 4 ( NA) |
| 4706 | NDUFAB1 | 21/11907(49) | NADH dehydrogenase (ubiquinone) 1, alpha/beta subcomplex, 1, 8kDa (electron transport chain GO:0022900) |
| 553 | AVPR1B | 21/5080(130) | arginine vasopressin receptor 1B (signal transduction GO:0007165) |
Figure 11The giant connected component consists of 884 genes. The ensemble support values of the individual edges are shown from red (high support) to blue (low support). The 7 largest hub genes of the giant connected component are highlighted, and a GO term from the category Biological Process is included: CACNA1F (46),CLDN9 (35), CALCA (32), NR5A1 (27), SNX29 (26), SLC6A7 (25) and TGM4 (23) and P2RX6 (22). Here, the number in bracket corresponds to the number of direct gene neighbors.
Network similarity analysis for cellular components between the B-cell C3NET gene regulatory network and the Lenz and Salaverria gene regulatory network for 30 from the 58 cellular component subnetworks with the highest correlation coefficient of the betweenness centrality, significant in both comparisons - the columns denote the size (number of genes) of a Gene Ontology term represented in the subnetworks, betw the average betweenness for the two comparisons, Spearman’s rank correlation coefficient and the FDR adjusted p-value
| GOID | Term | Size | betw | pval | |
|---|---|---|---|---|---|
| GO:0045178 | basal part of cell | 47 | 230 | 0.51 | 1.83e-03 |
| GO:0035097 | histone methyltransferase complex | 126 | 187 | 0.37 | 1.98e-02 |
| GO:0031967 | organelle envelope | 109 | 212 | 0.34 | 4.20e-03 |
| GO:0034361 | very-low-density lipoprotein particle | 67 | 183 | 0.33 | 2.45e-02 |
| GO:0055037 | recycling endosome | 82 | 167 | 0.31 | 1.38e-02 |
| GO:0034364 | high-density lipoprotein particle | 169 | 172 | 0.28 | 2.23e-04 |
| GO:0031461 | cullin-RING ubiquitin ligase complex | 429 | 183 | 0.25 | 1.17e-04 |
| GO:0030017 | sarcomere | 325 | 235 | 0.24 | 7.26e-03 |
| GO:0044422 | organelle part | 350 | 167 | 0.21 | 4.00e-04 |
| GO:0005680 | anaphase-promoting complex | 295 | 156 | 0.20 | 6.33e-04 |
| GO:0044441 | cilium part | 654 | 355 | 0.20 | 4.17e-07 |
| GO:0005788 | endoplasmic reticulum lumen | 368 | 245 | 0.20 | 6.16e-03 |
| GO:0044445 | cytosolic part | 625 | 354 | 0.19 | 3.01e-06 |
| GO:0005665 | DNA-directed RNA polymerase II, core complex | 640 | 264 | 0.19 | 5.20e-05 |
| GO:0005789 | endoplasmic reticulum membrane | 656 | 247 | 0.18 | 8.14e-06 |
| GO:0005903 | brush border | 2234 | 414 | 0.17 | 2.09e-06 |
| GO:0044432 | endoplasmic reticulum part | 983 | 410 | 0.17 | 1.43e-07 |
| GO:0005765 | lysosomal membrane | 1894 | 338 | 0.17 | 2.12e-04 |
| GO:0034703 | cation channel complex | 923 | 277 | 0.17 | 7.43e-07 |
| GO:0071944 | cell periphery | 290 | 198 | 0.16 | 1.10e-02 |
| GO:0035085 | cilium axoneme | 1586 | 261 | 0.16 | 2.62e-10 |
| GO:0005769 | early endosome | 1585 | 329 | 0.16 | 1.76e-09 |
| GO:0005773 | vacuole | 1585 | 285 | 0.16 | 2.62e-10 |
| GO:0005768 | endosome | 1585 | 324 | 0.16 | 1.56e-02 |
| GO:0044309 | neuron spine | 708 | 317 | 0.16 | 8.22e-05 |
| GO:0044440 | endosomal part | 3021 | 287 | 0.16 | 1.44e-05 |
| GO:0000313 | organellar ribosome | 1545 | 240 | 0.16 | 6.90e-04 |
| GO:0005669 | transcription factor TFIID complex | 1480 | 308 | 0.15 | 5.94e-09 |
| GO:0008305 | integrin complex | 1609 | 307 | 0.15 | 3.14e-09 |
| GO:0008180 | signalosome | 1272 | 245 | 0.15 | 1.56e-02 |