| Literature DB >> 17538690 |
Laurence Ettwiller1, Reiner A Veitia.
Abstract
Previous studies in the yeast Saccharomyces cerevisiae have shown that genes encoding subunits of macromolecular complexes have similar evolutionary rates (K) and expression levels (E). Besides, it is known that the expression of a gene is a strong predictor of its rate of evolution (i.e., E and K are correlated). Here we show that intracomplex variation of subunit expression correlates with intracomplex variation of their evolutionary rates (using two different measures of dispersion). However, a similar trend was observed for randomized complexes. Therefore, using a mathematical transformation, we created new variables capturing intracomplex variation of both E and K. The values of these new compound variables were smaller for real complexes than for randomized ones. This shows that proteins in complexes tend to have closer expressivities (E) and K's simultaneously than in the randomly grouped genes. We speculate about the possible implications of this finding.Entities:
Year: 2007 PMID: 17538690 PMCID: PMC1838959 DOI: 10.1155/2007/58721
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
The median dispersions of K and E or CAI are smaller for complexes than for random gene groups. The K Cal and K Spo are those obtained from comparisons between orthologs of S cerevisiae and C albicans (Cal) or Sch Pombe (Spo), respectively, P values from a median test.
| Medians | SD(logE) | SD(logCAI) | SD(logK) Cal | SD(logK) Spo | DE | D CAI | DK Cal | DK Spo |
|
| ||||||||
| Observed | 0.53 | 0.18 | 0.51 | 0.46 | 0.30 | 0.11 | 0.30 | 0.27 |
| Random | 1.33 | 0.55 | 0.78 | 0.72 | 0.56 | 0.27 | 0.42 | 0.39 |
|
| 1.0E-19 | 1.0E-18 | 1.0E-06 | 1.0E-06 | 1.0E-20 | 1.0E-20 | 1.0E-07 | 1.0E-07 |
Figure 1The expression levels and the evolutionary rates of the subunits of macromolecular complexes are closer than expected by random. (a) Regression analysis of median SD(logE) and median SD(logK). (b) Regression analysis of median DE and median DK. The K values were those obtained from comparisons between orthologs of S cerevisiae and C albicans. However, very similar results were obtained for K values drawn from comparisons between orthologs of S cerevisiae and S pombe. Black circles represent 94 real complexes and gray triangles represent a sample of 94 groups containing randomly chosen genes treated in the same way as complexes. A regression line, common to real and random complexes, appears in black. Note that the points representing the real complexes concentrate in the lower-left quadrant. To test this statistically, we defined a new composite variable as the distance on the regression line between its intersection with the x-axis and the orthogonal projection of the data points on the regression line (e.g., the distance “X” in Figure 1(a)). (c) Distribution of the values of the compound variables SD(logE)-SD(logK) and DE-DK for real and random (Rnd) complexes. The medians of the composite variables (represented by horizontal lines) for real complexes and random groups were statistically different.
The median values of the different composite variables are smaller for complexes than for random gene groups. The K Cal and K Spo are those obtained from comparisons between orthologs of S cerevisiae and C albicans (Cal) or Sch pombe (Spo), respectively, P values from a median test.
| Medians | SD(logE)-SD(logK) Cal | SD(logE)-SD(logK) Spo | SD(logCAI)-SD(logK) Cal | SD(log) CAI-SD(logK) Sp | DE-DK Cal | DE-DK Spo | D CAI-DK Cal | D CAI-DK Spo |
|
| ||||||||
| Observed | 1.74 | 1.64 | 0.86 | 0.79 | 0.89 | 0.79 | 0.55 | 0.49 |
| Random | 2.55 | 2.43 | 1.33 | 1.25 | 1.16 | 1.06 | 0.75 | 0.68 |
|
| 1.0E-20 | 1.0E-20 | 1.0E-14 | 1.0E-16 | 1.0E-23 | 1.0E-19 | 1.0E-17 | 1.0E-16 |