| Literature DB >> 18682703 |
Abstract
Biological gene networks appear to be dynamically robust to mutation, stochasticity, and changes in the environment and also appear to be sparsely connected. Studies with computational models, however, have suggested that denser gene networks evolve to be more dynamically robust than sparser networks. We resolve this discrepancy by showing that misassumptions about how to measure robustness in artificial networks have inadvertently discounted the costs of network complexity. We show that when the costs of complexity are taken into account, that robustness implies a parsimonious network structure that is sparsely connected and not unnecessarily complex; and that selection will favor sparse networks when network topology is free to evolve. Because a robust system of heredity is necessary for the adaptive evolution of complex phenotypes, the maintenance of frugal network complexity is likely a crucial design constraint that underlies biological organization.Entities:
Mesh:
Year: 2008 PMID: 18682703 PMCID: PMC2538912 DOI: 10.1038/msb.2008.52
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Biological networks are sparsely connected
| Organism | Interactions | Genes | Secondary source | Primary source | ||
|---|---|---|---|---|---|---|
| 29 | 14 | 0.148 | 2.07 | |||
| 45 | 25 | 0.072 | 1.8 | |||
| Sea urchin | 82 | 44 | 0.0065 | 1.86 | ||
| 1052 | 678 | 0.0023 | 1.55 | |||
| 3969 | 2341 | 0.0007 | 1.7 | |||
| 106 | 56 | 1.9 | ||||
| 578 | 423 | 0.0032 | 1.37 | |||
| 18 625 | 6760 | 0.0004 | 2.75 |
N denotes operons, not genes.
This result was derived (statistically) by partial correlation analysis on microarrays and we suspect that this method is not precise. A pilot study with just 2000 genes found a network with N=820 and n=828, which gives c=0.00123 and K∼1.0. However, this is likely much too sparse. Moreover, although this sample should be representative for larger N, and thus K should be the same for large N, their partial correlation analysis shows a K=2.75. This method seems to give a large number of false negatives for smaller N, and according to our analysis, a large number of false positives for the larger networks. Nevertheless, a value for K where 1⩽K⩽2.75 will still result is a network that is quite sparse.
The number of network interactions for a subset of an organism's genes, which were reported in various studies and databases, is shown. Interactions: the number of interactions n reported for the N genes; genes: the number of genes N that had reported interactions; column c: the connectivity density (c=n/N2); column K: the average number of transcriptional regulators per gene (K=cN); secondary source: the source that reported the values; primary source: where the secondary source derived the values from. References for secondary and primary sources are shown at the bottom of the table.
Sources: (i) Serov VN, Spirov AV, Samsonova MG (1998). Graphical interface to the genetic network database GeNet. Bioinfomatics 14: 546–547; (ii) GeNet (http://www.bionet.nsc.ru/bgrs/thesis/17/index.html); (iii) Rosenfeld N and Alon U (2003). Response delays and the structure of transcription networks. J Mol Biol 329: 645–654; (iv) Davidson EH et al (2002). A genomic regulatory network for development. Science 295: 1669–1678; (v) Costanzo MC et al (2001). YPDTM, PombePDTM and WormPDTM: model organism volumes of the BioKnowledgeTM Library, an integrated resource for protein information. Nucleic Acids Res 29: 75–79; (vi) Lee TI et al (2002). Transcriptional regulatory networks in Saccharomyces cerevisiae. Science 298: 799–804; (vii) Kauffman S, Peterson C, Samuelsson B, Troein C (2003). Random Boolean network models and the yeast transcriptional network. Proc Natl Acad Sci USA 100: 14796–14799; (viii) http://web.wi.mit.edu/young/regulatory_network; (ix) Shen-Orr SS, Milo R, Mangan S, Alon U (2002). Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics 31: 64–68; (x) Ma S, Gong Q, Bohnert HJ (2007). An Arabidopsis gene network based on the graphical Gaussian model. Genome Res 17: 1614–1625.
Figure 1Denser networks dilute the costs of perturbation over more interactions. Vertical axis shows the average cost of a perturbation per interaction (CPPI), where cost measures how much the phenotype deviates from the optimal. Selecting for optimal gene expression patterns, networks were initialized at c0=0.5 and far-from-equilibrium (ĉ=c0±0.4) for sparse (ĉ=0.1; α=0.008, φ=0.07) and dense (ĉ=0.9; α=0.07, φ=0.008) treatments. Plot shows evolutionary response in CPPI as networks are driven to sparser (open circles) and denser (closed circles) connectivity densities (also see Supplementary Figure 3). Data are reported as expectations and 95% confidence intervals for the mean of a single optimal individual sampled from each replicate population.
Figure 2Sparser networks evolve to be less costly. Vertical axis measures the gross cost of perturbation (GCP) on a network as the mean cost of perturbation per interaction (CPPI) multiplied by the number of interactions in the network. Plot shows the evolutionary response in the GCP for the treatments from Figure 2 driven toward more sparse (open circles) or dense (closed circles) connectivity densities (also see Supplementary Figure 3). For a network with N genes, the maximum cost is N2. Sparser networks maintain a lower GCP than dense networks, even though the CPPI increases as networks become sparser. Data are reported as expectations and 95% confidence intervals for the mean of a single optimal individual sampled from each replicate population.
Figure 3Selection systematically favors networks below their equilibrium density. Selecting for optimal gene expression patterns, 100 replicate populations were initialized with network connectivity density already at equilibrium (c0=ĉ) for high (c0=0.6; α=0.105, φ=0.07), intermediate (c0=0.5; α=0.07, φ=0.07), and/or low (c0=0.6; α=0.07, φ=0.105) treatments. Plot shows the average evolved response in connectivity density under stabilizing selection as compared to the expected equilibrium density in the absence of selection (thick dashed line). High (open diamond), intermediate (open square), and low (open triangle) systematically favor sparser-than-equilibrium networks (also see Supplementary Figure 4). Data are reported as expectations and 95% confidence intervals for the mean of a single optimal individual sampled from each replicate population.