| Literature DB >> 17430560 |
Nicola Neretti1, Daniel Remondini, Marc Tatar, John M Sedivy, Michela Pierini, Dawn Mazzatti, Jonathan Powell, Claudio Franceschi, Gastrone C Castellani.
Abstract
Time course gene expression experiments are a popular means to infer co-expression. Many methods have been proposed to cluster genes or to build networks based on similarity measures of their expression dynamics. In this paper we apply a correlation based approach to network reconstruction to three datasets of time series gene expression following system perturbation: 1) Conditional, Tamoxifen dependent, activation of the cMyc proto-oncogene in rat fibroblast; 2) Genomic response to nutrition changes in D. melanogaster; 3) Patterns of gene activity as a consequence of ageing occurring over a life-span time series (25y-90y) sampled from T-cells of human donors. We show that the three datasets undergo similar transitions from an "uncorrelated" regime to a positively or negatively correlated one that is symptomatic of a shift from a "ground" or "basal" state to a "polarized" state. In addition, we show that a similar transition is conserved at the pathway level, and that this information can be used for the construction of "meta-networks" where it is possible to assess new relations among functionally distant sets of molecular functions.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17430560 PMCID: PMC1885845 DOI: 10.1186/1471-2105-8-S1-S16
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Histogram of correlation coefficients of the gene expression time series between genes for the cMyc dataset. The red line refers to the perturbed case, whereas the blue to the unperturbed one. (A) The perturbation induces a bimodal distribution: genes tend to be either strongly correlated or anti-correlated, differing significantly from unperturbed case. (B) Correlation coefficient histograms obtained after time reshuffling of the same genes do not show any significant difference.
Network parameters. Comparison of principal network parameters for the N and T datasets
| 0 | 0 | |
| 17 | 99 | |
| Mean, | 4.53 | 23.44 |
| Standard deviation, | 2.61 | 23.97 |
| Skewness | 0.89 | 1.16 |
| Clustering coefficient | 0.43 | 0.45 |
Figure 2Histogram of the correlation coefficients of the gene expression time series between all the genes selected with the change point analysis in the . In the NY-controls (top left) the histogram resembles a Gaussian distribution slightly skewed towards positive correlation values. When considering the expression ratio Y-treatment over NY-control (top right) the distribution becomes bimodal and genes tend to be either strongly correlated or anti-correlated. These results have been validated by reshuffling the time points independently for each gene. In both cohorts this leads to a Gaussian distribution (bottom row).
Figure 3Histogram of correlation coefficients of the gene expression time series between genes for the aging dataset. The picture on the left (A) shows the histogram of the correlation coefficients for the set of 768 probesets selected with one-way ANOVA, P value < 0.01. The picture on the right (B) is the histogram of the correlation coefficients for a set of 768 probesets randomly sampled from the whole dataset of 14688 probesets. A single-gene time reshuffling applied onto each dataset produces a Gaussian distribution (data not shown).
Figure 4Transition from unimodal to bimodal behavior for the cMyc data. The different panels show the histogram of the correlation coefficients between the expression values over time from the probesets in the dataset N (left column) and T (right column) for decreasing cutoff P values (P). The top row includes all the probesets used in the analysis (P> 1); the central row corresponds to an intermediate threshold (P= 0.2); the bottom row corresponds to the lowest threshold (P= 0.05).
List of pathways for D. melanogaster dataset. List of pathways (from the KEGG database) that include at least 20% of genes whose expression pattern has changed significantly upon re-feeding in the D. melanogaster dataset
| path:dme00010 | Glycolysis/Gluconeogenesis |
| path:dme00030 | Pentose phosphate pathway |
| path:dme00051 | Fructose and mannose metabolism |
| path:dme00052 | Galactose metabolism |
| path:dme00053 | Ascorbate and aldarate metabolism |
| path:dme00062 | Fatty acid biosynthesis (path 2) |
| path:dme00071 | Fatty acid metabolism |
| path:dme00120 | Bile acid biosynthesis |
| path:dme00190 | Oxidative phosphorylation |
| path:dme00230 | Purine metabolism |
| path:dme00240 | Pyrimidine metabolism |
| path:dme00251 | Glutamate metabolism |
| path:dme00252 | Alanine and aspartate metabolism |
| path:dme00260 | Glycine, serine and threonine metabolism |
| path:dme00280 | Valine, leucine and isoleucine degradation |
| path:dme00290 | Valine, leucine and isoleucine biosynthesis |
| path:dme00310 | Lysine degradation |
| path:dme00330 | Arginine and proline metabolism |
| path:dme00340 | Histidine metabolism |
| path:dme00350 | Tyrosine metabolism |
| path:dme00361 | Gamma-Hexachlorocyclohexane degradation |
| path:dme00380 | Tryptophan metabolism |
| path:dme00410 | Beta-Alanine metabolism |
| path:dme00440 | Aminophosphonate metabolism |
| path:dme00500 | Starch and sucrose metabolism |
| path:dme00562 | Inositol phosphate metabolism |
| path:dme00564 | Glycerophospholipid metabolism |
| path:dme00600 | Glycosphingolipid metabolism |
| path:dme00620 | Pyruvate metabolism |
| path:dme00624 | 1- and 2- Methylnaphthalene degradation |
| path:dme00625 | Tetrachloroethene degradation |
| path:dme00632 | Benzoate degradation via CoA ligation |
| path:dme00640 | Propanoate metabolism |
| path:dme00650 | Butanoate metabolism |
| path:dme00670 | One carbon pool by folate |
| path:dme00740 | Riboflavin metabolism |
| path:dme00790 | Folate biosynthesis |
| path:dme00903 | Limonene and pinene degradation |
| path:dme00920 | Sulfur metabolism |
| path:dme00930 | Caprolactam degradation |
| path:dme00970 | Aminoacyl- tRNA biosynthesis |
| path:dme03020 | RNA polymerase |
| path:dme03050 | Proteasome |
| path:dme03060 | Protein export |
| path:dme04070 | Phosphatidylinositol signaling system |
List of pathways for cMyc dataset. List of pathways (from KEGG database) that include at least 5% of genes whose expression pattern has changed significantly upon cMyc activation
| path:rno00020 | Citrate cycle (TCA cycle) | path:rno00920 | Sulfur metabolism |
| path:rno00051 | Fructose and mannose metabolism | path:rno01510 | Neurodegenerative Disorders |
| path:rno00052 | Galactose metabolism | path:rno03010 | Ribosome |
| path:rno00062 | Fatty acid biosynthesis (path 2) | path:rno03020 | RNA polymerase |
| path:rno00071 | Fatty acid metabolism | path:rno03022 | Basal transcription factors |
| path:rno00072 | Synthesis and degradation of ketone bodies | path:rno03030 | DNA polymerase |
| path:rno00230 | Purine metabolism | path:rno04010 | MAPK signaling pathway |
| path:rno00240 | Pyrimidine metabolism | path:rno04020 | Calcium signaling pathway |
| path:rno00280 | Valine, leucine and isoleucine degradation | path:rno04070 | Phosphatidylinositol signaling system |
| path:rno00310 | Lysine degradation | path:rno04110 | Cell cycle |
| path:rno00340 | Histidine metabolism | path:rno04210 | Apoptosis |
| path:rno00350 | Tyrosine metabolism | path:rno04310 | Wnt signaling pathway |
| path:rno00380 | Tryptophan metabolism | path:rno04510 | Focal adhesion |
| path:rno00440 | Aminophosphonate metabolism | path:rno04520 | Adherens junction |
| path:rno00480 | Glutathione metabolism | path:rno04530 | Tight junction |
| path:rno00510 | N-Glycan biosynthesis | path:rno04540 | Gap junction |
| path:rno00561 | Glycerolipid metabolism | path:rno04620 | Toll-like receptor signaling pathway |
| path:rno00564 | Glycerophospholipid metabolism | path:rno04630 | Jak-STAT signaling pathway |
| path:rno00590 | Prostaglandin and leukotriene metabolism | path:rno04810 | Regulation of actin cytoskeleton |
| path:rno00620 | Pyruvate metabolism | path:rno04910 | Insulin signaling pathway |
| path:rno00624 | 1- and 2-Methylnaphthalene degradation | path:rno05010 | Alzheimer's disease |
| path:rno00630 | Glyoxylate and dicarboxylate metabolism | path:rno05020 | Parkinson's disease |
| path:rno00650 | Butanoate metabolism | path:rno05030 | Amyotrophic lateral sclerosis (ALS) |
| path:rno00720 | Reductive carboxylate cycle (CO2 fixation) | path:rno05040 | Huntington's disease |
| path:rno00790 | Folate biosynthesis | path:rno05060 | Prion disease |
| path:rno00903 | Limonene and pinene degradation |
Figure 5Histogram of the correlation coefficients between all the genes selected within the purine synthesis pathway (top row) and Tor pathway (bottom row). The expression ratios for the single genes are shown in the rightmost panels. The difference in the correlation distributions between Y-treatment and NY-control is qualitatively the same as the one observed in the entire pool of genes selected with the change point analysis (Figure 2).
Figure 6Histogram of the correlation coefficients between all the genes for selected pathways. (A) MAPK signaling, (B) calcium signaling, (C) focal adhesion, (D) gap junction, (E) insulin/IGF signaling. The expression ratios for the single genes in each pathway are shown in the rightmost panels. The difference in the correlation distributions between c-Myc-ON-treatment (central panel) and c-Myc-OFF-control (left panel) is qualitatively the same as the one observed in the entire pool of genes selected with the ANOVA analysis (Fig. 1).
Figure 7Network between the 5 selected pathways of Fig in the case of C-MYC off. All these pathways show weak co-expression (dotted lines). (A) When cMyc- is on, pathways show positive and negative correlations (B). The red and blue arrows denote positive and negative co-regulation, respectively. The thickness of the arrows is proportional to the magnitude, or absolute value, of the co-expression.