| Literature DB >> 23409071 |
Scott M Gibson1, Stephen P Ficklin, Sven Isaacson, Feng Luo, Frank A Feltus, Melissa C Smith.
Abstract
The study of gene relationships and their effect on biological function and phenotype is a focal point in systems biology. Gene co-expression networks built using microarray expression profiles are one technique for discovering and interpreting gene relationships. A knowledge-independent thresholding technique, such as Random Matrix Theory (RMT), is useful for identifying meaningful relationships. Highly connected genes in the thresholded network are then grouped into modules that provide insight into their collective functionality. While it has been shown that co-expression networks are biologically relevant, it has not been determined to what extent any given network is functionally robust given perturbations in the input sample set. For such a test, hundreds of networks are needed and hence a tool to rapidly construct these networks. To examine functional robustness of networks with varying input, we enhanced an existing RMT implementation for improved scalability and tested functional robustness of human (Homo sapiens), rice (Oryza sativa) and budding yeast (Saccharomyces cerevisiae). We demonstrate dramatic decrease in network construction time and computational requirements and show that despite some variation in global properties between networks, functional similarity remains high. Moreover, the biological function captured by co-expression networks thresholded by RMT is highly robust.Entities:
Mesh:
Year: 2013 PMID: 23409071 PMCID: PMC3567026 DOI: 10.1371/journal.pone.0055871
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Microarray samples used for network construction.
| Organism | NCBI GEO Platform | Samples Used | Probe Sets | Genome Assembly Version | Transcripts in Genome Assembly | Genes Measured by Platform | Genes in Global Network |
| Human | GPL570 | 2,000 | 40,685 | hg19 | 1,962,491 | 18,509 | 828 (4%) |
| Rice | GPL2025 | 1,360 | 52,489 | MSU v6.0 | 67,393 | 37,151 | 2660 (7%) |
| Yeast | GPL2529 | 1,701 | 10,359 | S288C | 6,717 | 5,750 | 805 (14%) |
Total probe sets after removal of control probe sets, ambiguous and outlier probe sets.
Only includes genes that map unambiguously to probe sets with no differentiation between splice variants.
Percentage is in terms of measurable genes.
Figure 1Topological and functional properties of the human networks with randomly removed samples and probe sets.
A) The number of nodes shared with the global network for each perturbed network is shown at various sample removal rates (x-axis) when probe sets were retained at rates of 25% (A1), 50% (A2), 75% (A3) and 100% (A4); B) The number of edges shared with the global network for each perturbed network is shown at various sample removal rates (x-axis) when probe sets were retained at rates of 25% (B1), 50% (B2), 75% (B3), and 100% (B4); C) The average Kappa, κ, (functional similarity) between modules in the perturbed network with modules in the global network is shown at various sample removal rates (x-axis) when probe sets were retained at rates of 25% (C1), 50% (C2), 75% (C3), and 100% (C4). The single line in the far right of plots A4, B4 and C4 represents the global network.
Conservation of relationships between global and perturbed networks.
| Species | Percent Samples/Probe sets | Global Edges | Edges | Shared Edges | Edges Lost | New Edges | Modules | Average Kappa |
| Human | 75/100 | 3,111 | 2,763 | 2,622 (84%) | 489 | 141 | 129 | 0.72 |
| Rice | 75/100 | 34,470 | 36,210 | 32,530 (94%) | 1,940 | 3,680 | 748 | 0.82 |
| Yeast | 75/100 | 8,643 | 8,758 | 8,240 (95%) | 403 | 518 | 179 | 0.73 |
| Human | 50/100 | 3,111 | 2,542 | 2,326 (75%) | 785 | 216 | 117 | 0.66 |
| Rice | 50/100 | 34,470 | 38,620 | 31,720 (92%) | 2,750 | 6,900 | 786 | 0.78 |
| Yeast | 50/100 | 8,643 | 8,559 | 7,869 (91%) | 774 | 690 | 180 | 0.67 |
| Human | 25/100 | 3,111 | 2,538 | 2,096 (67%) | 1,015 | 442 | 124 | 0.59 |
| Rice | 25/100 | 34,470 | 34,530 | 28,080 (81%) | 6,390 | 6,450 | 710 | 0.71 |
| Yeast | 25/100 | 8,643 | 8,583 | 7,437 (86%) | 1,206 | 1,146 | 171 | 0.65 |
The average number of edges in network with samples removed.
Edges in common between the perturbed network and the global network.
Kappa = 1 indicates perfect similarity, Kappa>0 is non-significant.
Figure 2Scalability plots for human networks.
A) CCM run time with variable number of samples; B) CCM runtime with variable number of probe sets; C) RMM runtime with variable size correlation matrix (size n×n where n is the number of probe sets).