| Literature DB >> 18331647 |
Alessandra Riva1, Anne-Sophie Carpentier, Frédérique Barloy-Hubler, Angélique Chéron, Alain Hénaut.
Abstract
BACKGROUND: The processes of gene transcription, translation, as well as the reactions taking place between gene products, are subject to stochastic fluctuations. These stochastic events are being increasingly examined as it emerges that they can be crucial in the cell's survival. In a previous study we had examined the transcription patterns of two bacterial species (Escherichia coli and Bacillus subtilis) to elucidate the nucleoid's organization. The basic idea is that genes that share transcription patterns, must share some sort of spatial relationship, even if they are not close to each other on the chromosome. We had found that picking any gene at random, its transcription will be correlated with genes at well-defined short - as well as long-range distances, leaving the explanation of the latter an open question. In this paper we study the transcription correlations when the only transcription taking place is stochastic, in other words, no active or "deterministic" transcription takes place. To this purpose we use transcription data of Sinorhizobium meliloti.Entities:
Mesh:
Year: 2008 PMID: 18331647 PMCID: PMC2270832 DOI: 10.1186/1471-2164-9-125
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1a, b and c – The autocorrelation functions for data set A, when all three replicons are actively transcribed and translated. Figure 1a shows the autocorrelation function for the megaplasmid pSymA (blue curve) when the plasmid is actively transcribed (and translated; data set A). The red curve shows the autocorrelation function when the genes' positions were randomly assigned. The Y-axis has been cropped at an autocorrelation of 0.15 for a clearer visual interpretation; the blue curve starts at an autocorrelation of 0.27 for a gene distance of one. Maxima (positive correlation) and minima (negative or anti-correlation) can be clearly distinguished, with a strong anti-correlation for genes that lie opposite of each other on the chromosome (a gene distance of around 650). Similarly, Figures 1b and 1c show the results for the chromosome and the megaplasmid pSymB, respectively, when both are actively transcribed (and translated; data set A). As can be seen, the autocorrelation functions for the three replicons are similar. Note: The figures are all at the same scale to better illustrate the different sizes of the three replicons. All Y-axes have been cropped at a value of 0.15 for (visual) clarity's sake.
Figure 2a, b and c – The autocorrelation functions for data set B, when pSymA is transcribed stochastically only. Figure 2a shows the autocorrelation function for the megaplasmid pSymA (blue curve) when only stochastic transcription takes place (data set B). The red curve shows the autocorrelation function when the genes' positions were randomly assigned. The Y-axis has been cropped at an autocorrelation of 0.15 for a clearer visual interpretation; the blue curve starts at an autocorrelation of 0.15 for a gene distance of one. The signal becomes quickly confounded with the noise (red curve). There are minima and maxima that stand out, but only a spectral analysis can tell, whether these are significant or not. Again to serve as comparison, Figures 2b and 2c show the autocorrelation functions for the chromosome and the megaplasmid psymB, respectively, for data set B. Both replicons are actively transcribed and translated, unlike psymA, and their autocorrelation functions are comparable to those in data set A (as confirmed by the spectral and statistical analyses, see additional file 1). Note: The figures are all at the same scale to better illustrate the different sizes of the three replicons. All Y-axes have been cropped at a value of 0.15 for (visual) clarity's sake.
The spectral analyses of the autocorrelations
| pSymA (set A) | rnd pSymA (set A) | pSymA (set B) | rnd pSymA (set B) | ||||||||
| No of Periods | Period | Amplitude | No of Periods | Period | Amplitude | No of Periods | Period | Amplitude | No of Periods | Period | Amplitude |
| 1 | 1294.000 | 0.584 | 23 | 56.261 | 0.028 | 12 | 107.833 | 0.040 | 615 | 2.104 | 0.051 |
| 12 | 107.833 | 0.158 | 341 | 3.795 | 0.026 | 6 | 215.667 | 0.022 | 371 | 3.488 | 0.021 |
| 6 | 215.667 | 0.081 | 563 | 2.298 | 0.024 | 70 | 18.486 | 0.020 | 566 | 2.286 | 0.018 |
| 7 | 184.857 | 0.077 | 118 | 10.966 | 0.019 | 47 | 27.532 | 0.018 | 296 | 4.372 | 0.017 |
| 21 | 61.619 | 0.050 | 262 | 4.939 | 0.019 | 43 | 30.093 | 0.017 | 393 | 3.293 | 0.015 |
| 43 | 30.093 | 0.048 | 640 | 2.022 | 0.016 | 107 | 12.093 | 0.016 | 99 | 13.071 | 0.015 |
| 3 | 431.333 | 0.042 | 248 | 5.218 | 0.014 | 238 | 5.437 | 0.016 | 541 | 2.392 | 0.014 |
| 70 | 18.486 | 0.039 | 380 | 3.405 | 0.014 | 134 | 9.657 | 0.016 | 430 | 3.009 | 0.013 |
| 47 | 27.532 | 0.036 | 212 | 6.104 | 0.014 | 9 | 143.778 | 0.015 | 278 | 4.655 | 0.013 |
| 2 | 647.000 | 0.033 | 39 | 33.179 | 0.013 | 50 | 25.880 | 0.015 | 584 | 2.216 | 0.013 |
| 8 | 161.750 | 0.032 | 637 | 2.031 | 0.013 | 424 | 3.052 | 0.014 | 610 | 2.121 | 0.013 |
| 166 | 7.795 | 0.025 | 635 | 2.038 | 0.013 | 48 | 26.958 | 0.014 | 537 | 2.410 | 0.013 |
| 82 | 15.780 | 0.025 | 21 | 61.619 | 0.011 | 10 | 129.400 | 0.013 | 249 | 5.197 | 0.012 |
| 5 | 258.800 | 0.025 | 325 | 3.982 | 0.011 | 144 | 8.986 | 0.013 | 407 | 3.179 | 0.012 |
| 17 | 76.118 | 0.023 | 301 | 4.299 | 0.011 | 2 | 647.000 | 0.013 | 317 | 4.082 | 0.011 |
| 4 | 323.500 | 0.022 | 636 | 2.035 | 0.010 | 220 | 5.882 | 0.012 | 168 | 7.702 | 0.011 |
| 30 | 43.133 | 0.022 | 141 | 9.177 | 0.009 | 524 | 2.469 | 0.011 | 386 | 3.352 | 0.011 |
| 42 | 30.810 | 0.017 | 589 | 2.197 | 0.009 | 53 | 24.415 | 0.011 | 18 | 71.889 | 0.010 |
| 29 | 44.621 | 0.016 | 318 | 4.069 | 0.009 | 138 | 9.377 | 0.011 | 238 | 5.437 | 0.010 |
| 11 | 117.636 | 0.016 | 339 | 3.817 | 0.009 | 171 | 7.567 | 0.011 | 471 | 2.747 | 0.010 |
The table shows the spectral analyses for the averaged auto-correlations of pSymA (pSymA) and of pSymA with the genes' positions randomly permutated (rnd pSymA) for data set A and set B. Shown are the first twenty periods with the highest amplitudes, in descending order and the number of periods contained on the replicon. The two spectra of the "real" plasmid are made up of short and long periods. Both have, for example, periods of 108 and 206 genes. The random permutations, however, have only very short periods. The amplitudes for pSymA in set B are lower compared to the set A, as we are looking at a random phenomenon, which is necessarily weaker than active transcription. "No of periods": how many periods can be fitted along the replicon in question.
Mann-Whitney two-tailed test for psymA
| (psymA set A) – (rnd psymA set A) | < 0.0001 |
| (psymA set B) – (rnd psymA set B) | < 0.0001 |
| (psymA set A) – (psymA set B) | 0.006 |
In order to examine whether the various pairs of spectra (real replicon versus random permutation) differ from each other from a statistical point of view, we performed the Mann-Whitney two-tailed test. We also compared the spectra of the real replicon in the two datasets. For both, data set A and set B, the spectra of pSymA are clearly different from those obtained with the controls, where the gene order has been randomly permutated (p < 0.0001).
The comparison of pSymA in data set A versus data set B shows that though differing from a statistical point of view (p = 0.006), they are closer to each other than to the random permutations. This may be attributed to the presence of the long-range correlations in both sets.