| Literature DB >> 20465807 |
Elizabeth Allen1, Annick Moing, Timothy Md Ebbels, Mickaël Maucourt, A Deri Tomos, Dominique Rolin, Mark A Hooks.
Abstract
BACKGROUND: Holistic profiling and systems biology studies of nutrient availability are providing more and more insight into the mechanisms by which gene expression responds to diverse nutrients and metabolites. Less is known about the mechanisms by which gene expression is affected by endogenous metabolites, which can change dramatically during development. Multivariate statistics and correlation network analysis approaches were applied to non-targeted profiling data to investigate transcriptional and metabolic states and to identify metabolites potentially influencing gene expression during the heterotrophic to autotrophic transition of seedling establishment.Entities:
Mesh:
Year: 2010 PMID: 20465807 PMCID: PMC2890501 DOI: 10.1186/1752-0509-4-62
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Trends in differential gene expression. DE genes were determined between each successive day at a threshold cut-off level of 1.5-fold. Each comparative stage, i.e. day, was measured in triplicate and the mean of the hybridization intensities calculated prior to DE analysis. (A) Total number of DE genes and the split between UR and DR genes. (B) The proportion given as percentages of total DE genes comprised by either chloroplast/plastid protein or TF/signaling protein encoding genes as given in the TAIR gene ontology database. Open and closed bars represent UR and DR genes, respectively.
Figure 2Relationships between metabolites. (A) Clusters of metabolites with similar profiles generated by 2D-SOM. Hierarchal and K-means clustering were used to estimate the optimal number of bins for 2D-SOM analysis. Metabolites in cluster 1: sucrose, rhamnose, citrate, alanine, trigonelline, lactate, glucose, threonine, unkS7.37, unkM1.85; Cluster 2: arginine, formate; Cluster 3: fumarate, proline, glutamate, unkD8.0, unkM5.18, unkD3.12; Cluster 4: malate; Cluster 5: valine, isoleucine, leucine, choline, unkD5.69; Cluster 6: fructose, glutamine, unkM7.9. (B) Spring embedding plots showing relationships based on correlations. The plot shows metabolites as nodes and Pearson correlation coefficients over days as connections. The color of the connecting line describes the strength of the correlation between the nodes; a dark red color indicates a strong positive correlation and a dark blue line represents a weaker positive correlation according to the scale of correlation coefficients on the right of the graph. Only correlations above a Bonferroni-adjusted P-value < 0.0001 are shown. (C) Enlargement of the lactate cluster. (D) Enlargement of the valine cluster. Since values start from an initial random configuration, the directions separating cluster in each spring embedding plot are arbitrary, but they provide an indication of distance separating nodes and edges.
Figure 3Day-by-transcript relationships. (A) PCA scores plot of the time points sampled during germination and seedling establishment based on the average transcript levels. Each number 0 to 8 represents one day (24 h) from imbibed seeds (0) to 8 days (8) of age, respectively. (B) Higher order relationships among days based on mean values of transcript levels from the 3 replicates visualized by spring embedding. The plot shows day 0 (d0) to day 8 (d8) as nodes and the relative degree of transcript correlation as edges. Clustering was based on Pearson correlation coefficients at a threshold cut-off of 0.7. The color bar on the right of the figure provides the relative degree of correlation.
Figure 4Day-by-metabolite relationships. (A) PCA scores plot where each number represents one day (24 h) from imbibed seeds (0) to 8 days of age (8). (B) Spring embedding plot where the symbols d0 to d8 correspond to the samples in A. Each point is a node representing the mean value and each line gives the relative degree of correlation. The threshold Pearson correlation coefficient for the spring embedding was 0.7. The color bar on the right of each figure provides the relative degree of coloration. Both types of analysis were based on the mean values (n = 3) of 3 replicates (2 replicates for day 3).
Figure 5A spring embedding model revealing relationships between metabolites and genes from days 0 to 8. Pearson correlation coefficients were determined between every metabolite and gene over the 9 time points. Metabolites are central nodes from which connected genes radiate outwards. The coloured lines represent edges describing the nature of the correlation; a dark red line represents a strong positive correlation whereas a dark blue line represents a strong negative correlation. A total of 237 correlations were identified between 20 metabolites and 209 genes at the threshold cut-off of (p < 0.0001, r > |0.95|). The plots inset show the profiles of the average expression values for the transcription factors IAA14, ARF10 and ABI3 used to calculate correlation coefficients.
Correlations between metabolites and transcript levels in developing seedlings.
| Metabolite | Number of Relationships* | Direction of Correlations† |
|---|---|---|
| Glucose | 1 | N |
| Sucrose | 129 | P |
| Fructose | 1 | P |
| Rhamnose | 1 | N |
| Malate | 3 | N (1), P (2) |
| Citrate | 7 | N (6), P (1) |
| Fumarate | 3 | N (2), P (1) |
| Formate | 1 | N |
| Lactate | 21 | N |
| Glutamine | 7 | N |
| Alanine | 4 | N (1), P (3) |
| Valine | 3 | N |
| Threonine | 5 | N |
| Trigonelline | 16 | N (15), P (1) |
| Choline | 1 | N |
| unkD8.0 | 6 | N (2), P (4) |
| unkM7.9 | 4 | N |
| unkS7.36 | 3 | N |
| unkM5.18 | 2 | N (1), P (1) |
| unkM1.85 | 19 | N (18), P (1) |
*Number of significant Pearson correlations between the metabolite and genes above p = 9.8 × 10-5 (FDR = 5%) determined over the 9 sampling periods, day 0 to day 8. †The direction of the linear regression, either positive (P) or negative (N) and the number in each direction in parentheses.
Identified regulatory genes correlated with metabolites.
| Locus | Metabolite | Gene Symbol | Gene Description† | |
|---|---|---|---|---|
| AT1G30970 | Sucrose | 0.9672 | SUF4 | Suppressor of Frigida 4 |
| AT2G20180 | Sucrose | 0.9493 | PIF1 | Myc-related bHLH transcription factor |
| AT2G24500 | Sucrose | 0.9628 | FZF | C2H2 zinc finger protein FZF |
| AT2G25930 | unkM7.9 | -0.9752 | ELF3 | Early Flowering 3 |
| AT2G28350 | unkM7.9 | -0.9667 | ARF10 | Auxin Response Factor 10 |
| AT2G31370 | fumarate - | 0.9495 | POSF21 | bZIP transcription factor POSF21 |
| AT2G43010 | unkM1.85 | -0.9529 | PIF4 | Nuclear localized bHLH protein that interacts with active PhyB |
| AT3G15030 | unkS7.37 | -0.9586 | TCP4 | Arabidopsis thaliana TCP |
| AT3G24650 | Sucrose | 0.9721 | ABI3 | Homologous to the maize transcription factor Viviparous-1. |
| AT4G14550 | unkM1.85 | -0.9588 | SLR/IAA14 | IAA14 is a member of the Aux/IAA protein family. Solitary Root locus. |
| AT4G35570 | Sucrose | 0.9803 | HMGB5 | High Mobility Group B 5 |
| AT5G17800 | Sucrose | 0.977 | ATMYB56 | Member of the R2R3 factor gene family |
| AT1G09570 | Sucrose | 0.9777 | PHYA | Phytochrome A |
| AT4G16250 | Sucrose | 0.9581 | PHYD | Phytochrome D |
| AT3G16030 | alanine | 0.9519 | CES101 | Callus Expression of RBCS |
| AT1G61040 | Sucrose | 0.9495 | VIP5 | Vernalization Independence 5 putative heterogeneous nuclear ribonucleoprotein |
| AT2G33410 | Sucrose | 0.9771 | ||
| AT2G37020 | Sucrose | 0.9522 | DNA binding, Chloroplast | |
| AT3G16810 | Sucrose | 0.9735 | APUM24 | Arabidopsis Pumilio 24 |
| AT4G14520 | Sucrose | 0.9625 | Homologous to the DNA-directed RNA polymerase II subunit (At5g59180) | |
| AT4G25500 | Sucrose | 0.9721 | ATRSP35 | arginine/serine-rich splicing factor |
| AT4G36020 | Sucrose | 0.9577 | CSDP1 | Cold shock domain protein |
| AT5G07290 | Sucrose | 0.9538 | AML4 | ARABIDOPSIS MEI2-LIKE 4 |
| AT5G14270 | glutamine | -0.9566 | ATBET9 | Arabidopsis thaliana Bromodomain and Extra terminal Domain protein 9 |
| AT5G38890 | Sucrose | 0.9898 | Exoribonuclease-related | |
| AT5G53180 | fumarate - | 0.9593 | ATPTB2 | Polypyrimidine tract-binding (PTB) Protein |
† Pearson correlation coefficients based on a false discovery rate at a significance threshold < 0.05. The sign gives the direction of correlation.