| Literature DB >> 17983469 |
Juan F Poyatos1, Laurence D Hurst.
Abstract
BACKGROUND: Why do some groups of physically linked genes stay linked over long evolutionary periods? Although several factors are associated with the formation of gene clusters in eukaryotic genomes, the particular contribution of each feature to clustering maintenance remains unclear.Entities:
Mesh:
Year: 2007 PMID: 17983469 PMCID: PMC2258174 DOI: 10.1186/gb-2007-8-11-r233
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Rate of synteny conservation in a null model of gene order evolution. The relationship between the proportion of gene pairs retained as neighbors and the number of inversions between two taxa.
Figure 2Differences in intergene distance. Intergene distances of gene pairs currently in synteny and in the ancestor (blue) and those that were not ancestral neighbors but currently are neighbors (red) as a function of the number of inversions (error bars are standard error of the mean).
Figure 3Proportion of gene pairs conserved in a comparator versus intergene distance in S. cerevisiae. Profiles of the rate of gene pairs conserved versus their current spacer in S. cerevisiae (red) or in simulants (blue) when comparing S. cerevisiae with two comparator species for (a) C. glabrata and (b) A. gossypii. For the simulations the number of inversions to run was determined by comparing observed synteny conservation rates against inversion number as shown in Figure 1. For our five focal species we also restricted analysis to cases where both of the orthologues of the S. cerevisiae gene pair are on the same chromosome in the comparator species, as this fits better the simulant model and permits higher orthology certainty. Each data point in the real and simulant data represents the proportion of gene pairs from 50 showing conserved synteny, after the data was rank ordered by intergene distance. After considering the first 50 we then considered ranks 2-51, 3-52, and so on. In addition, we also considered other comparators, and a much more distant comparator, C. albicans (Additional data file 1).
Determinants of gene order conservation
| 3.831 | 3.83 | 3.828 | 3.828 | 3.826 | |
| 3.83 | 3.827 | 3.829 | 3.829 | 3.829 | |
| 0.066 | 0.068 | 0.067 | 0.068 | 0.067 | |
| 0.077 | 0.079 | 0.082 | 0.086 | 0.082 | |
| 528.187 | 477.038 | 474.632 | 471.702 | 461.193 | |
| 390.175 | 417.432 | 370.395 | 376.93 | 378.035 | |
| 0.201 | 0.208 | 0.206 | 0.203 | 0.209 | |
| 0.216 | 0.204 | 0.226 | 0.224 | 0.225 | |
| 1.07 | 1.062 | 1.054 | 1.056 | 1.055 | |
| 1.036 | 1.045 | 1.029 | 1.035 | 1.035 | |
| 0.207 | 0.162 | 0.12 | 0.129 | 0.121 | |
| 0.111 | 0.114 | 0.113 | 0.118 | 0.122 | |
| 5.371 | 5.38 | 5.357 | 5.35 | 5.35 | |
| 5.346 | 5.341 | 5.34 | 5.35 | 5.34 | |
| Rate-co | 86% | 84% | 55% | 55% | 56% |
| Pairs | 1678 | 1677 | 1860 | 1930 | 1852 |
Seven features were computed for adjacent gene pairs in S. cerevisiae (met, metabolic network distance; cex, gene co-expression; igd, intergenic distance; let, density of lethals; rec, recombination rate; cre, gene co-regulation; pro, protein-protein interaction network distance; see Materials and methods). The table shows the mean value of each of these properties for adjacent gene pairs that remained (-co) or did not remain (-nc) as adjacent in the corresponding comparator yeasts. Species ordered according to phylogenetic distance to S. cerevisiae, the closest being C. glabrata. Note that there does not exist yet a consensus phylogeny, for example [22,18]. The last two rows list the proportion of adjacent S. cerevisiae pairs retained as adjacent in the corresponding comparator and total number of homologues pairs (both orthologues might not be in the same chromosome). A smaller number was obtained (9% and 1,850 homologue gene pairs, respectively) when using C. albicans as comparator species [17]. *Yeasts that diverged after the whole genome duplication event.
S. cerevisiae versus C. glabrata logistic regression analyses
| Multiple regression | ||||||
| Simple regression | Stepwise regression | Estimate | z-value | Residual deviation | ||
| Null | 853.06 | (0) 855.07 | 2.535 | 25.310 | 853.07 | - |
| 852.93 | (-) | -0.052 | -0.692 | 852.94 | - | |
| 852.35 | (-) | 0.05 | 0.526 | 852.22 | - | |
| 833.09 | (1) 837.09 | -0.312 | -4.172 | 832.48 | <0.0001 | |
| 853.02 | (-) | 0.044 | 0.453 | 832.33 | - | |
| 850.4 | (-) | -0.084 | -0.935 | 830.95 | - | |
| 845.2 | (2) 835.1 | -0.168 | -2.08 | 827.16 | <0.05 | |
| 852.7 | (-) | -0.092 | -0.995 | 826.22 | - | |
The first column lists the seven predictors contributing to the generalized models and the corresponding null model. The second column shows residual deviance (equivalent to the residual sum of squares in ordinary regression analyses) of a model with a single determinant. The third column describes a stepwise forward regression according to the Akaike criterion with insertion order in parenthesis. The last four columns list the results of a multiple regression model (estimates and z-values) and the corresponding Anova with terms added sequentially from met to pro (residual and χ2 test).
Logistic models of gene order conservation for different comparator species
| Species | Model |
We applied a combination of methods (see main text) to obtain the simplest logistic model capable to describe the observed conservation. Here Pr is the probability that an adjacent pair in S. cerevisiae is found adjacent in the corresponding comparator with logit . Cex, co-expression; cre, co-regulation score; igd, intergenic distance; rec, recombination rate.
Figure 4Determinants of close non-adjacently conserved pairs versus distant adjacently conserved pairs. The difference between the ratio of determinant values of non-adjacently conserved genes in a close species to S. cerevisiae (C. glabrata) and those adjacently conserved in a distant species (A. gossypii) is plotted in red for each predictor (line between points to help visualization). This ratio is defined as the quotient between the corresponding values of the close (distant) pairs and those of the adjacently conserved pairs in the close species, that is, C. glabrata. We also plotted the null behavior obtained by random sampling of the combined group, close and distant, preserving group size, 10,000 times (mean, continuous blue line, ±2 standard deviations, dashed blue lines). Behavior was qualitatively robust for the cex, igd, and let predictors when using S. castelli and K. lactis as close/distant comparator (Additional data file 1).