| Literature DB >> 21232107 |
Nathaniel Robert Street1, Stefan Jansson, Torgeir R Hvidsten.
Abstract
BACKGROUND: Green plant leaves have always fascinated biologists as hosts for photosynthesis and providers of basic energy to many food webs. Today, comprehensive databases of gene expression data enable us to apply increasingly more advanced computational methods for reverse-engineering the regulatory network of leaves, and to begin to understand the gene interactions underlying complex emergent properties related to stress-response and development. These new systems biology methods are now also being applied to organisms such as Populus, a woody perennial tree, in order to understand the specific characteristics of these species.Entities:
Mesh:
Year: 2011 PMID: 21232107 PMCID: PMC3030533 DOI: 10.1186/1471-2229-11-13
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Method overview. (A) Transcriptional modules were inferred by searching for motif combinations that were overrepresented in a set of co-expressed genes. Co-expression was defined by a correlation threshold to a central gene, and an exhaustive search was conducted with all genes as centers and applying all thresholds. (B) The regulatory control of each module was inferred by iteratively trying more complex combinations of transcription factors, and stopping when no significant improvement in correlation between observed and predicted expression could be observed. (C) A network was constructed based on the modules and their best transcription factor combinations. (D) The network was validated statistically by bootstrap analysis to test the stability and predictive capabilities.
Figure 2Example transcriptional modules. (A, B) Modules are written as IF-THEN rules indicating (causal) relationships between motifs and co-expression. Significant functional annotations are listed below the rules and expression profiles of the co-expressed genes in the modules are plotted for one relevant experimental study.
Figure 3Transcriptional modules. (A) The number of modules with significant expression correlation within the different experimental studies. (B) The distribution of modules over different numbers of sequence motifs in their predicted cis-regulatory mechanism. (C) The distribution of modules and genes over functional annotations. The data is only based on annotations statistically over-represented in at least one module, and comprise annotations from Gene Ontology (P: Biological process) and KEGG.
Figure 4The transcriptional network of . Regulators (transcription factors) are red diamonds, while transcriptional modules are blue circles.
Figure 5Network statistics. (A) The fraction of the total number of modules/genes regulated by each transcription factor follows a power law (the parameters of the fit axis a = 0.62, b = -1.1 for modules (R2 = 0.95) and a = 0.78, b = -1.1 for genes (R2 = 0.95)) (B) The number of transcription factors regulating each module (in-degree) follows a normal-like distribution. (C) Transcription factor families represented in the network.
Predicted regulators of the Populus leaf transcriptional program.
| Transcription factors | Closest homologue | Functional information | Modules (genes) |
|---|---|---|---|
| 835874 | ASIL1 (AT3G24490.1) | trihelix family | 19/15 (111/91) |
| 834586 | SIG1 (AT1G08540.1) | subunit of chloroplast RNA polymerase, response to red and blue light | 9/8 (55/50) |
| 562448 | K24M9.13(AT3G18640.1) | zinc ion binding | |
| 287849 | ATHB22 (AT4G24660.1) | embryonic development ending in seed dormancy abscisic acid biosynthetic process, response to water deprivation, heat and osmotic | |
| 218677 | ABA1 (AT5G67030.1) | stress, xanthophylls biosynthetic process, sugar mediated signaling pathway, response to red light | 5/2 (32/12) |
| 420425 | ATWHY3 (AT2G02740.2) | defense response | 4/3 (25/21) |
| 740041 | ATGRF2 (AT4G37740.1) | leaf development | |
| 268609 | HTA7 (AT5G27670.1) | histone H2A protein, nucleosome assembly | 3/3 (14/14) |
| 639804 | ATRBR1 (AT3G12280.1) | regulates cell growth, nuclear division and stem cell maintenance | 3/5 (26/39) |
| 286321 | SPL8 (AT1G02065.1) | megasporogenesis, microsporogenesis | |
| 576309 | T10K17.10 (AT3G57800.2) | basic helix-loop-helix (bHLH) family | 2/1 (12/5) |
| 232345 | HTA10 (AT1G51060.1) | histone H2A protein, nucleosome assembly | |
| 566736 | T6L1.10 (AT1G68920.3) | basic helix-loop-helix (bHLH) family regulation of flower development, meristem | 1/1 |
| 663774 | YAB1 (AT2G45190.1) | structural organization, abaxial cell fate specification | |
| 643213 | IAA14 (AT4G14550.1) | response to auxin stimulus, lateral root morphogenesis | |
| 281810 | ATWRKY44 (AT2G37260.1) | epidermal cell fate specification, seed coat development | |
| 643200 | ATERF-9 (AT5G44210.1) | ethylene mediated signaling pathway cinnamic acid biosynthetic process, | |
| 710397 | ATMYB3 (AT1G22640.1) | response to wounding, salt stress and abscisic and salicylic acid stimulus, negative regulation of metabolic process cell death, response to stress, ethylene | |
| 725612 | ATEBP (AT3G16770.1) | mediated signaling pathway, response to cytokinin stimulus, ethylene stimulus and other organism | |
| 594467 | ETC1 (AT1G01380.1) | involved in trichome and root hair patterning | |
Populus v1.1 protein ID is given together with information on the closest homologue in Arabidopsis. The last column gives the number of modules (and in parenthesis the number of genes) regulated by the factor in our systems biology-based network and in the co-expression network, respectively. Transcription factors in our systems biology-based network that are not in the co-expression network are marked in bold.
Figure 6Regulatory complexity. The influence of the interaction between each pair of regulators (i.e. the cross-term β12 in the case of two regulators) is plotted against the influence of each individual regulator (i.e. β1 and β2 in the case of two regulators). In order to compare these values independently of the expression intensities of the particular module and transcription factors, we have plotted the T-statistics of the β's rather than their actual values. Statistically significant values are marked by dotted lines.
Figure 7Bootstrap analyses of the network. (A) The transcriptional network with edges colored from red to green, and increased thickness, with increasing bootstrap confidence. (B) Correlation between observed and predicted gene expression averaged over experimental conditions not used to infer the bootstrap networks (i.e. the hold-out set). Correlations are shown for individual genes, modules (average correlation for each gene in the module) and a theoretically optimal prediction (predicted expression equal to the average expression profile of the genes in the module). (C) Fraction of genes and modules with a significant correlation between observed and predicted gene expression in each experiment when that experiment was removed before inferring the network.
Figure 8Comparative genomics. (A) Correlation between observed and predicted expression of the modules in Arabidopsis using the network inferred from Populus. The theoretically optimal prediction is also shown and indicates that all modules are predictable in Arabidopsis. The randomized curve is based on 1000 runs where the Arabidopsis genes are randomly assigned to modules. (B) The regulatory network with modules colored from green (conserved, high correlation) to red (non-conserved, low correlation) based on the expression correlation from (A). Grey modules lack homologues or expression data for their genes or regulators. Modules are labeled with the main functional annotations.