| Literature DB >> 20565716 |
David A Orlando1, Siobhan M Brady, Thomas M A Fink, Philip N Benfey, Sebastian E Ahnert.
Abstract
BACKGROUND: Biological processes occur on a vast range of time scales, and many of them occur concurrently. As a result, system-wide measurements of gene expression have the potential to capture many of these processes simultaneously. The challenge however, is to separate these processes and time scales in the data. In many cases the number of processes and their time scales is unknown. This issue is particularly relevant to developmental biologists, who are interested in processes such as growth, segmentation and differentiation, which can all take place simultaneously, but on different time scales.Entities:
Mesh:
Year: 2010 PMID: 20565716 PMCID: PMC3017766 DOI: 10.1186/1471-2164-11-381
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1We search for time scale separation in microarray expression data by detecting patterns that show a temporal shift between experimental replicates relative to the synchronization by a given biological process. In this example, Gene A is related to the synchronization procedure and shows very similar expression patterns in both replicates. The expression level of Gene B on the other hand is also changing in a reproducible way, but on a different time scale, independent of the synchronization process. By shifting these time series relative to each other and calculating the similarity for each possible shift, we find the superposition that yields the maximal similarity in the overlap window (third column). Because of the varying window size we keep track of the statistical significance (see Methods). The value of the shift s which gives rise to the maximal similarity, as well as the statistical significance of this value, allows us to determine whether a given expression pattern is likely to be evidence of a process operating on a separate biological time scale.
Distribution of significant shifts in the cell-cycle dataset
| Shift | Not Sig | -48 | -40 | -32 | -24 | -16 | -8 | 0 | 8 | 16 | 24 | 32 | 40 | 48 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1768 | 38 | 91 | 182 | 364 | 689 | 1689 | 2970 | 3351 | 2705 | 1489 | 441 | 193 | 59 | |
| Most Sig. | 1768 | 21 | 19 | 14 | 56 | 41 | 214 | 897 | 1992 | 402 | 237 | 32 | 33 | 16 |
The first row gives the possible types of shifts. The second row gives the number of genes that have a significant shift (p-value < 0.001) at the indicated shift (a gene can have more than one significant shift). The third row gives the number of genes with their maximally significant shift at the indicated shift (a gene can only have one maximally significant shift).
Figure 2Distribution of maximally significant shifts. Our method identified 3974 genes that had significant shifts at or below a p-value of 0.001. Each of these genes was assigned to the shift in which it had the most significant (lowest) p-value.
Figure 3Distribution of periodic genes identified by Orlando et al. in (Orlando . Each of the 1274 periodic genes identified by Orlando et al. was assigned to its maximally significant shift. Genes which did not have a significant p-value (< 0.001) were assigned to the "Not Significant" category.
GO term analysis result for cell cycle data
| +0 min | +8 min | +16 min | |||
|---|---|---|---|---|---|
| ribosome biogenesis | 1.9 e−32 | M phase of mitotic cell cycle | 4.3 e−18 | proteolysis involved in cellular pro- tein catabolic process | 6.9 e−04 |
| ribonucleoprotein complex biogene- sis and assembly | 6.5 e−28 | mitosis | 6.8 e−17 | ubiquitin-dependent protein catabolic process | 9.7 e−04 |
| ncRNA processing | 3.7 e−25 | mitotic cell cycle | 1.1 e−16 | modification-dependent protein catabolic process | 9.7 e−04 |
| rRNA processing | 1.7 e−20 | cell cycle | 5.7 e−16 | ||
| rRNA metabolic process | 2.2 e−19 | cell cycle phase | 1.1 e−15 | ||
| ncRNA metabolic process | 2.5 e−17 | M phase | 2.5 e−14 | ||
| maturation of 5.8S rRNA | 1.6 e−14 | cell cycle process | 7.4 e−14 | ||
| maturation of 5.8S rRNA from tri- cistronic rRNA transcript | 1.6 e−14 | cell division | 4.0 e−13 | ||
| RNA processing | 1.3 e−13 | DNA replication | 2.6 e−09 | ||
| ribosomal large subunit biogenesis | 6.3 e−13 | chromosome segregation | 1.5 e−07 | ||
| organelle organization | 7.1 e−11 | sister chromatid segregation | 1.5 e−07 | ||
| maturation of SSU-rRNA from tri- cistronic rRNA transcript | 7.2 e−10 | response to DNA damage stimulus | 1.6 e−07 | ||
| maturation of SSU-rRNA | 7.2 e−10 | mitotic sister chromatid segregation | 1.7 e−07 | ||
| endonucleolytic cleavage in ITS1 to separate SSU-rRNA ... | 3.9 e−09 | glycoprotein metabolic process | 2.5 e−07 | ||
| cleavages during rRNA processing | 5.9 e−09 | carbohydrate metabolic process | 4.5 e−07 | ||
| endonucleolytic cleavages during rRNA processing | 8.8 e−09 | protein amino acid glycosylation | 1.5 e−06 | ||
| endonucleolytic cleavage of tri- cistronic rRNA transcript ... | 8.8 e−09 | biopolymer glycosylation | 1.5 e−06 | ||
| RNA metabolic process | 1.0 e−07 | glycosylation | 1.5 e−06 | ||
| maturation of LSU-rRNA from tri- cistronic rRNA transcript ... | 6.6 e−07 | glycoprotein biosynthetic process | 2.6 e−06 | ||
| maturation of LSU-rRNA | 6.6 e−07 | cellular response to DNA damage stimulus | 5.9 e−06 | ||
| rRNA 5'-end processing | 1.5 e−06 | DNA repair | 1.3 e−05 | ||
| ncRNA 5'-end processing | 1.5 e−06 | microtubule-based process | 2.4 e−05 | ||
| RNA 5'-end processing | 3.3 e−06 | DNA metabolic process | 2.8 e−05 | ||
| cellular component organization | 5.8 e−06 | DNA-dependent DNA replication | 5.9 e−05 | ||
| endonucleolytic cleavage to gener- ate mature 5'-end ... | 6.9 e−06 | response to stimulus | 1.1 e−04 | ||
| ribosomal large subunit assembly and maintenance | 1.0 e−05 | response to stress | 3.2 e−04 | ||
| ribosome assembly | 5.1 e−05 | DNA packaging | 4.0 e−04 | ||
| endonucleolytic cleavage in 5'-ETS of tricistronic ... | 6.2 e−05 | microtubule cytoskeleton organiza- tion | 5.3 e−04 | ||
| ribosomal subunit assembly | 6.6 e−05 | ||||
| nucleobase, nucleoside, nucleotide and nucleic acid ... | 1.8 e−04 | ||||
Every set of genes, defined by the position of their maximally significant shift, was analyzed for over-represented GO terms (Boyle et al. 2004). Each shift (column) and term (row) found to be significant are shown.
Figure 4Distribution of maximally significant shifts in the root dataset. Our method identified 5592 genes that had significant shifts at or below a p-value of 0.01. Each of these genes was assigned to the shift in which it had the most significant (lowest) p-value.
Figure 5Gene ontology (GO) terms associated with shifted profiles. Log10 p-value is indicated. (A) All patterns except for shifts of +5 and -6 show enrichment of an associated biological process. (B) Cell-type enrichment, and enrichment of genes associated with auxin-activated lateral root initiation in the lateral root inducible system (LRIS). Genes activated or repressed within xylem pole pericycle cells (J0121) using the LRIS were also enriched.
Figure 6Clustering of shift profiles identifies spatiotemporally regulated modules of genes for shifts of +2. (A) Relative expression by marker line is visualized. Clusters of spatially co-expressed genes are indicated on the y-axis. (B) Relative expression by longitudinal section in the two roots is visualized. Root sections are indicated on the x-axis, and clusters from A are further visualized on the y-axis.
Figure 7For cases where multiple spatiotemporally regulatory modules were identified, statistically significantly enriched expression within cell types was also tested. The p-value scale obtained using the hypergeometric distribution is indicated on the top of each panel. (A) Shift of +2. Many clusters display distinct cell type enrichment profiles. (B) Shift of +3. At least two significantly cell type-enriched clusters were identified. (C) Shift of -2. Among many clusters, specific cell-type enrichment was identified in only a single cluster.