| Literature DB >> 23320896 |
Priyakshi Mahanta1, Hasin A Ahmed, Dhruba K Bhattacharyya, Jugal K Kalita.
Abstract
BACKGROUND: The development of high-throughput Microarray technologies has provided various opportunities to systematically characterize diverse types of computational biological networks. Co-expression network have become popular in the analysis of microarray data, such as for detecting functional gene modules.Entities:
Mesh:
Year: 2012 PMID: 23320896 PMCID: PMC3426802 DOI: 10.1186/1471-2105-13-S13-S4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of proximity measures
| Euclidian | Mutual | Yes | Yes | No |
| Pearson | Mutual | No | Yes | Yes |
| Spearman | Mutual | No | No | No |
| MSR | Aggregate | No | Yes | Yes |
| NMRS | Mutual | No | Yes | Yes |
The table 1 presents the comparison of different proximity measure.
Figure 1Example patterns used for evaluation of proximity measures The figure 1 presents the value of some example patterns that are used to demonstrate the superiority NMRS over other proximity measures viz. Euclidean distance, Pearson correlation coefficient and Spearman correlation coefficient.
Figure 2NMRS and Pearson correlation coefficient among considered example patterns The figure 2 presents NMRS and Pearson correlation coefficient of patterns b1-b8 with that of a.
Gene pattern
| 4 | 7 | 6 | 3 | 6 | 5 | 8 | 7 | 3 | |
|---|---|---|---|---|---|---|---|---|---|
| 10 | 13 | 12 | 9 | 12 | 11 | 14 | 13 | 9 | |
| 10.4286 | 12.5714 | 11.8571 | 9.7143 | 11.8571 | 11.1429 | 13.2857 | 12.5714 | 9.7143 | |
| 10.8571 | 12.1429 | 11.7143 | 10.4286 | 11.7143 | 11.2857 | 12.5714 | 12.1429 | 10.4286 | |
| 11.2857 | 11.7143 | 11.5714 | 11.1429 | 11.5714 | 11.4286 | 11.8571 | 11.7143 | 11.1429 | |
| 11.7143 | 11.2857 | 11.4286 | 11.8571 | 11.4286 | 11.5714 | 11.1429 | 11.2857 | 11.8571 | |
| 12.1429 | 10.8571 | 11.2857 | 12.5714 | 11.2857 | 11.7143 | 10.4286 | 10.8571 | 12.5714 | |
| 12.5714 | 10.4286 | 11.1429 | 13.2857 | 11.1429 | 11.8571 | 9.7143 | 10.4286 | 13.2857 | |
| 13 | 10 | 11 | 14 | 11 | 12 | 9 | 10 | 14 | |
The table 2 presents the random gene patterns for analysis of different proximity measures.
Symbolic representation
| The gene expression matrix | |
| i | |
| Signum threshold | |
| Co-expression network | |
| Set of vertices in G | |
| Set of edges in G | |
| Distance matrix | |
| NMRS distance between genes d | |
| Adjacency matrix | |
| 1 if v | |
| Set of connected region | |
| i | |
| Set of vertices in i | |
| Set of edges in i | |
| Adjacency matrix of the i | |
| i | |
| Set of network modules obtained from | |
| Topological Matrix value between vertices v | |
| Average TOM of the set of vertices V1 | |
| TOM for i | |
| Maximum spanning tree obtained from i | |
| Set of edges in |
The table 3 describes the various symbols that is used in ModuleMiner.
Datasets used for evaluating ModuleMiner
| Serial. No | Dataset | No. of Genes/ No. of Conditions | Source |
|---|---|---|---|
| 1 | Yeast Sporulation | 474/17 | |
| 2 | Yeast Diauxic Shift | 689/72 | Sample gene in expander |
| 3 | Subset of Yeast Cell Cycle | 384/17 | |
| 4 | Arabidopsis Thaliana | 138/8 | |
| 5 | Rat CNS | 112/9 |
The table 4 gives the description of various datasets used in ModuleMiner.
P-value of one of the network modules of Dataset 2
| 2.32E-28 | GO:0000788 | nuclear nucleosome |
| 5.12E-27 | GO:0000786 | nucleosome |
| 7.27E-23 | GO:0006334 | nucleosome assembly |
| 2.06E-20 | GO:0032993 | protein-DNA complex |
| 8.61E-19 | GO:0034728 | nucleosome organization |
| 1.14E-18 | GO:0065004 | protein-DNA complex assembly |
| 1.12E-17 | GO:0006333 | chromatin assembly or disassembly |
| 4.12E-16 | GO:0005694 | chromosome |
| 2.49E-14 | GO:0044454 | nuclear chromosome part |
| 1.70E-13 | GO:0031298 | replication fork protection complex |
| 9.47E-14 | GO:0006325 | chromatin organization |
| 6.78E-13 | GO:0044427 | chromosomal part |
| 2.32E-12 | GO:0034622 | cellular macromolecular complex assembly |
The table 5 gives the p value of one of the network modules of Dataset 2.
p-value of one of the network modules of Dataset 3
| 3.93E-25 | GO:0006281 | DNA repair |
| 1.03E-26 | GO:0006259 | DNA metabolic process |
| 1.23E-23 | GO:0006974 | response to DNA damage stimulus |
| 7.69E-27 | GO:0006260 | DNA replication |
| 6.94E-19 | GO:0007049 | cell cycle |
| 5.55E-16 | GO:0005634 | nucleus |
| 8.53E-18 | GO:0044454 | nuclear chromosome part |
| 1.51E-17 | GO:0022402 | cell cycle process |
| 3.53E-17 | GO:0000079 | regulation of cyclin-dependent protein kinase activity |
| 5.72E-15 | GO:0045859 | regulation of protein kinase activity |
| 5.16E-16 | GO:0005657 | replication fork |
The table 6 gives the p value of one of the network modules of Dataset 3.
Q-value of one of the network modules of Dataset 3
| DNA replication | 1.93E-21 |
| DNA repair | 1.93E-21 |
| response to DNA damage stimulus | 2.17E-20 |
| DNA-dependent DNA replication | 3.07E-19 |
| replication fork | 6.27E-19 |
| nuclear chromosome | 1.23E-17 |
| mitotic sister chromatid cohesion | 5.51E-17 |
| nuclear replication fork | 9.37E-17 |
| nuclear chromosome part | 2.00E-16 |
| sister chromatid cohesion | 5.13E-15 |
The table 7 gives the Q value of one of the network modules of Dataset 3.
Q-value of one of the network modules of Dataset 1
| cytosolic ribosome | 1.43E-52 |
| cytosolic part | 3.26E-48 |
| structural constituent of ribosome | 2.11E-44 |
| ribosomal subunit | 1.16E-42 |
| cytosolic large ribosomal subunit | 2.65E-36 |
| large ribosomal subunit | 1.47E-27 |
| preribosome | 2.96E-23 |
| cytosolic small ribosomal subunit | 3.71E-17 |
| 90S preribosome | 8.48E-16 |
The table 8 gives the Q value of one of the network modules of Dataset 1.
Q-value of one of the network modules of Dataset 1
| sporulation resulting in formation of a cellular spore | 1.53E-34 |
| sporulation | 1.53E-34 |
| anatomical structure formation involved in morphogenesis | 1.53E-34 |
| spore wall assembly | 3.43E-33 |
| ascospore wall assembly | 3.43E-33 |
| ascospore formation | 3.43E-33 |
| sexual sporulation | 3.43E-33 |
| spore wall biogenesis | 3.43E-33 |
| ascospore wall biogenesis | 3.43E-33 |
| sexual sporulation resulting in formation of a cellular spore | 3.43E-33 |
| cell development | 3.43E-33 |
| cell wall assembly | 8.88E-33 |
| reproductive process in single-celled organism | 2.59E-32 |
| cell differentiation | 8.40E-32 |
| fungal-type cell wall biogenesis | 6.93E-30 |
| reproductive developmental process | 1.40E-29 |
| reproductive process | 1.86E-25 |
| reproductive cellular process | 1.86E-25 |
| reproduction of a single-celled organism | 9.90E-25 |
| cell wall biogenesis | 1.25E-24 |
| sexual reproduction | 4.83E-24 |
| anatomical structure development | 5.45E-24 |
| anatomical structure morphogenesis | 5.45E-24 |
| M phase | 2.10E-23 |
| meiotic cell cycle | 1.62E-21 |
| meiosis | 2.74E-21 |
| M phase of meiotic cell cycle | 2.74E-21 |
The table 9 gives the Q value of one of the network modules of Dataset 1.
Q-value of one of the network modules of Dataset 4
| synaptic transmission | 1.29E-13 |
| glutamate receptor activity | 3.77E-11 |
| synapse | 6.68E-08 |
| regulation of synaptic transmission | 3.06E-07 |
| regulation of transmission of nerve impulse | 4.00E-07 |
| regulation of neurological system process | 7.07E-07 |
| regulation of system process | 5.38E-05 |
| synapse part | 8.11E-04 |
| cell projection part | 9.46E-04 |
The table 10 gives the Q value of one of the network modules of Dataset 4.
Q-value of one of the network modules of Dataset 5
| regulation of synaptic transmission | 6.438756E-7 |
| regulation of transmission of nerve impulse | 9.297736E-7 |
| regulation of neurological system process | 1.533111E-6 |
| intermediate filament cytoskeleton organization | 2.056912E-6 |
| intermediate filament-based process | 5.218967E-6 |
| neurofilament cytoskeleton | 1.109702E-5 |
| intermediate filament organization | 1.454524E-5 |
| synapse part | 2.543099E-5 |
| growth factor binding | 2.571707E-5 |
| intermediate filament | 2.938762E-5 |
| positive regulation of neurogenesis | 9.6019E-5 |
The table 11 gives the Q value of one of the network modules of Dataset 5.
The weightage of co-expression by Module Miner
| Dataset1 | C1 | 99.57% |
| C2 | 88.89% | |
| Dataset2 | C1 | 59.23% |
| C2 | 77.27% | |
| Dataset3 | C1 | 92.13% |
| C2 | 88.89% | |
| C3 | 92.33% | |
| C4 | 67.65% | |
| Dataset4 | C1 | 81.85% |
| Dataset5 | C1 | 76.62% |
The table 12 gives the percentage of co-expression on network modules produced by Module Miner.
Figure 3Visualization of co-expressed network The figure3 presents co-expressed network by GeneMANIA for Dataset1.
Figure 4Visualization of co-expressed network The figure 4 presents co-expressed network by GeneMANIA for Dataset2 and Dataset3.
Figure 5Visualization of co-expressed network The figure 5 presents co-expressed network by GeneMANIA for Dataset4 and Dataset5.