| Literature DB >> 20458373 |
Abstract
Perturbations of gene regulatory networks are essentially responsible for oncogenesis. Therefore, inferring the gene regulatory networks is a key step to overcoming cancer. In this work, we propose a method for inferring directed gene regulatory networks based on soft computing rules, which can identify important cause-effect regulatory relations of gene expression. First, we identify important genes associated with a specific cancer (colon cancer) using a supervised learning approach. Next, we reconstruct the gene regulatory networks by inferring the regulatory relations among the identified genes, and their regulated relations by other genes within the genome. We obtain two meaningful findings. One is that upregulated genes are regulated by more genes than downregulated ones, while downregulated genes regulate more genes than upregulated ones. The other one is that tumor suppressors suppress tumor activators and activate other tumor suppressors strongly, while tumor activators activate other tumor activators and suppress tumor suppressors weakly, indicating the robustness of biological systems. These findings provide valuable insights into the pathogenesis of cancer.Entities:
Keywords: cancer; decision rules; gene regulatory networks; machine learning; microarrays
Year: 2010 PMID: 20458373 PMCID: PMC2865768 DOI: 10.4137/grsb.s4509
Source DB: PubMed Journal: Gene Regul Syst Bio ISSN: 1177-6250
Figure 1.Network Type 1 constructed under α = 0.95.
Figure 2.Network Type 1 constructed under α = 0.85.
Figure 3.Network Type 1 constructed under α = 0.80.
Connection degrees of identified genes in Network Type 1.
| DES | 0 | 1 (1) | 2 (2) | 10 (5) | 14 (6) | 17 (8) | 22 (13) | 9 (5) |
| MYL9 | 0 | 0 | 2 (1) | 9 (3) | 12 (4) | 24 (13) | 27 (15) | 12 (5) |
| CSRP1 | 0 | 1 (0) | 2 (0) | 10 (4) | 20 (12) | 23 (13) | 27 (16) | 12 (6) |
| ACTA2 | 0 | 0 | 4 (3) | 9 (3) | 12 (3) | 18 (5) | 27 (13) | 10 (4) |
| SPARCL1 | 0 | 1 (1) | 4 (3) | 10 (3) | 15 (7) | 20 (8) | 27 (15) | 11 (5) |
| KCNMB1 | 0 | 0 | 6 (3) | 13 (4) | 14 (4) | 29 (15) | 29 (15) | 13 (6) |
| Mgp | 0 | 0 | 4 (0) | 11 (2) | 16 (4) | 26 (13) | 27 (13) | 12 (5) |
| SLC2A4 | 0 | 0 | 1 (0) | 5 (1) | 12 (2) | 24 (11) | 25 (11) | 10 (4) |
| myosin | 0 | 0 | 0 | 3 (0) | 7 (3) | 15 (10) | 19 (14) | 6 (4) |
| TPM3 | 0 | 1 (0) | 4 (2) | 13 (6) | 18 (9) | 22 (9) | 22 (9) | 11 (5) |
| 0 | 0 | 1 (1) | 3 (1) | 6 (2) | 21 (9) | 22 (9) | 8 (3) | |
| 0 | 1 (0) | 2 (0) | 14 (12) | 16 (13) | 21 (13) | 26 (15) | 11 (8) | |
| 0 | 0 | 0 | 3 (2) | 7 (5) | 14 (6) | 15 (6) | 6 (3) | |
| 0 | 0 | 2 (2) | 5 (3) | 17 (12) | 22 (13) | 27 (13) | 10 (6) | |
| 0 | 0 | 0 | 3 (1) | 7 (3) | 13 (5) | 18 (5) | 6 (2) | |
| 0 | 0 | 0 | 1 (1) | 5 (3) | 10 (4) | 15 (5) | 4 (2) | |
| 0 | 0 | 0 | 13 (11) | 14 (11) | 20 (12) | 25 (13) | 10 (7) | |
| 0 | 1 (1) | 2 (1) | 14 (13) | 17 (12) | 21 (12) | 26 (13) | 12 (7) |
The upregulated genes are formatted in boldface in table 1, 2 and 4.
Figure 4.Eight regulatory patterns.
Abbreviations: S, suppressor; Si, the ith suppressor; A, activator; Ai, the ith activator, i = 1, 2, …, n.
Values of n for eight regulatory patterns detected when α = 0.8.
| DES | 4 | 4 | 0 | 6 | ||||
| MYL9 | 4 | 4 | 0 | 4 | ||||
| CSRP1 | 4 | 4 | 5 | 7 | ||||
| ACTA2 | 4 | 5 | 0 | 3 | ||||
| SPARCL1 | 4 | 4 | 0 | 7 | ||||
| KCNMB1 | 4 | 6 | 0 | 4 | ||||
| Mgp | 4 | 8 | 0 | 4 | ||||
| SLC2A4 | 4 | 6 | 0 | 2 | ||||
| myosin | 3 | 1 | 1 | 2 | ||||
| TPM3 | 4 | 5 | 0 | 9 | ||||
| 1 | 3 | 1 | 1 | |||||
| 1 | 2 | 10 | 3 | |||||
| 1 | 1 | 0 | 5 | |||||
| 0 | 5 | 8 | 4 | |||||
| 1 | 3 | 0 | 3 | |||||
| 1 | 1 | 0 | 3 | |||||
| 1 | 2 | 10 | 1 | |||||
| 1 | 4 | 10 | 2 |
Properties of two modules detected in network Type 1 with α = 0.8.
| Genes contained in the module | PCBD1, TPM3, S100A11, SPARCL1, HNRNPA1, KCNMB1, ACTA2, IPL1, Mgp, SLC2A4, CSRP1 | HSPD1, IL8, DARS |
| Node number | 11 | 3 |
| Edge number | 66 | 3 |
| Clustering coefficient | 0.6 | 0.5 |
| Upregulated genes | PCBD1, S100A11, HNRNPA1, IPL1 | HSPD1, IL8, DARS |
| Downregulated genes | TPM3, SPARCL1, KCNMB1, ACTA2, Mgp, SLC2A4, CSRP1 | N/A |
“N/A” indicates that there is no related gene contained in the corresponding modules.
GO terms significantly enriched with two modules in Network Type 1 (α = 0.8).
| 1 | Molecular function | 48306 | Calcium-dependent protein binding | 0.00003 |
| 2 | Biological process | 42221 | Response to chemical stimulus | 0.005 |
| Molecular function | 5524 | ATP binding | 0.02 | |
| 32559 | Adenyl ribonucleotide binding | 0.02 | ||
| 30554 | Adenyl nucleotide binding | 0.02 |
GO terms shared by more than one gene with P ≤ 0.05 are identified.
Figure 5.Network Type 2 constructed under α = 0.85.
Figure 6.Network Type 2 constructed under α = 0.8.
Connection degrees of identified genes in Network Type 2.
| DES | 0 | 2 (2) | 4 (3) | 15 (10) | 33 (25) | 11 (8) |
| MYL9 | 0 | 0 | 7 (6) | 22 (17) | 40 (33) | 14 (11) |
| CSRP1 | 1 (1) | 3 (2) | 3 (2) | 9 (5) | 18 (13) | 7 (5) |
| ACTA2 | 0 | 0 | 11 (10) | 28 (22) | 30 (22) | 14 (11) |
| SPARCL1 | 1 (1) | 2 (2) | 6 (5) | 20 (13) | 36 (28) | 13 (10) |
| KCNMB1 | 0 | 0 | 13 (10) | 24 (15) | 40 (29) | 15 (9) |
| Mgp | 0 | 0 | 12 (8) | 30 (21) | 47 (36) | 18 (13) |
| SLC2A4 | 0 | 0 | 10 (9) | 27 (23) | 63 (54) | 20 (17) |
| myosin | 0 | 0 | 0 | 3 (1) | 10 (6) | 3 (1) |
| TPM3 | 0 | 0 | 6 (4) | 24 (18) | 53 (45) | 17 (13) |
| 0 | 0 | 1 (1) | 2 (1) | 8 (4) | 2 (1) | |
| 0 | 4 (3) | 67 (64) | 1369 (1367) | 1401 (1399) | 568 (567) | |
| 0 | 0 | 0 | 8 (7) | 28 (27) | 7 (7) | |
| 0 | 0 | 6 (6) | 57 (55) | 1752 (1747) | 363 (362) | |
| 0 | 0 | 0 | 7 (5) | 37 (33) | 9 (8) | |
| 0 | 0 | 2 (2) | 9 (9) | 22 (21) | 7 (6) | |
| 0 | 6 (6) | 57 (57) | 1772 (1770) | 1787 (1785) | 724 (724) | |
| 0 | 13 (13) | 84 (83) | 1569 (1568) | 1595 (1591) | 652 (651) |
Contrast in regulatory circumstances of two groups of genes.
| Average number of genes regulating upregulated genes | 2.75 | 26.625 | 597.75 | 825.875 | 290.75 |
| Average number of genes regulating downregulated genes | 0.6 | 5.7 | 14.5 | 29.1 | 9.8 |
| 0.1199 | 0.0679 | 0.0407 | 0.0178 | 0.0214 |
Properties of three modules detected in Network Type 2 with α = 0.8.
| Genes contained in the module | PCBD1, S100A11, Mgp, SPARCL1, SLC2A4, IPL1, HNRNPA1, TPM3, BCL3, MAOB, SDC2, SRF, PRDX6, VIP, CALD1, DELTA-CRYSTALLIN ENHANCER BINDING FACTOR | DES, KCNMB1, MYL9, ACTA2, CEBPD, CCND3, SRF | HSPD1, SRPK1, HNRNPM |
| Nodes number | 16 | 7 | 3 |
| Edges number | 88 | 14 | 4 |
| Clustering coefficient | 0.37 | 0.33 | 0.25 |
| Upregulated genes | PCBD1, S100A11, IPL1, HNRNPA1, SDC2 | N/A | HSPD1, SRPK1, HNRNPM |
| Downregulated genes | Mgp, SPARCL1, SLC2A4, TPM3, SRF, BCL3, MAOB, PRDX6, VIP, CALD1, DELTA-CRYSTALLIN ENHANCER BINDING FACTOR | DES, MYL9, ACTA2, KCNMB1, CCND3, CEBPD, SRF | N/A |
“N/A” indicates that there is no related gene contained in the corresponding modules.
GO terms significantly enriched with three modules in Network Type 2 (α = 0.8).
| 1 | Biological process | 51239 | Regulation of multicellular organismal process | 0.001 |
| 51170 | Nuclear import | 0.002 | ||
| 51098 | Regulation of binding | 0.003 | ||
| 45941 | Positive regulation of transcription | 0.003 | ||
| 10628 | Positive regulation of gene expression | 0.003 | ||
| 45935 | Positive regulation of nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | 0.003 | ||
| 10557 | Positive regulation of macromolecule biosynthetic process | 0.004 | ||
| 9891 | Positive regulation of biosynthetic process | 0.005 | ||
| 6913 | Nucleocytoplasmic transport | 0.005 | ||
| 51169 | Nuclear transport | 0.005 | ||
| 10604 | Positive regulation of macromolecule metabolic process | 0.006 | ||
| Molecular function | 48306 | Calcium-dependent protein binding | 0.00007 | |
| 2 | Cellular component | 44449 | Contractile fiber part | 0.0004 |
| 43292 | Contractile fiber | 0.0004 | ||
| 3 | Biological process | 6395 | RNA Splicing | 0.001 |
| 6394 | RNA processing | 0.003 | ||
| Molecular function | 166 | Nucleotide binding | 0.002 |
GO terms shared by more than one gene with P ≤ 0.05 are identified.
Colon cancer microarray dataset decision table.
| 1 | 8589.4163 | … | 500.425 | … | 28.70125 | Tumor |
| 2 | 9164.2537 | … | 335.69 | … | 16.77375 | Normal |
| … | … | … | … | … | … | … |
| 61 | 6234.6225 | … | 272.92875 | … | 23.265 | Tumor |
| 62 | 7472.01 | … | 2699.1925 | … | 39.63125 | Normal |
Discretized colon cancer microarray dataset decision table.
| 1 | ‘All’ | … | ‘(-inf-1696.2275)’ | … | ‘All’ | Tumor |
| 2 | ‘All’ | … | ‘(1696.2275-inf)’ | … | ‘All’ | Normal |
| … | … | … | … | … | … | … |
| 61 | ‘All’ | … | ‘(-inf-1696.2275)’ | … | ‘All’ | Tumor |
| 62 | ‘All’ | … | ‘(1696.2275-inf)’ | … | ‘All’ | Normal |
“ ‘All’ ” indicates that one gene has the same value in all samples; “ ‘(-inf-x)’ ” indicates “<=x”; “ ‘(x-inf)’ ” indicates “>x”.
Colon cancer microarray dataset decision table.
| 1 | 8589.4163 | … | 475.27885 | … | 28.70125 | Downregulation |
| 2 | 9164.2537 | … | 1648.4596 | … | 16.77375 | Upregulation |
| … | … | … | … | … | … | … |
| 61 | 6234.6225 | … | 191.33846 | … | 23.265 | Downregulation |
| 62 | 7472.01 | … | 1240.5846 | … | 39.63125 | Upregulation |
Discretized decision table of Table 9.
| 1 | ‘All’ | … | ‘(-inf-1048.3779)’ | … | ‘All’ | Downregulation |
| 2 | ‘All’ | … | ‘(1048.3779-inf)’ | … | ‘All’ | Upregulation |
| … | … | … | … | … | … | … |
| 61 | ‘All’ | … | ‘(-inf-1048.3779)’ | … | ‘All’ | Downregulation |
| 62 | ‘All’ | … | ‘(1048.3779-inf)’ | … | ‘All’ | Upregulation |