| Literature DB >> 24128940 |
Katarzyna Tomala1, Ryszard Korona.
Abstract
Protein metabolism is one of the most costly processes in the cell and is therefore expected to be under the effective control of natural selection. We stimulated yeast strains to overexpress each single gene product to approximately 1% of the total protein content. Consistent with previous reports, we found that excessive expression of proteins containing disordered or membrane-protruding regions resulted in an especially high fitness cost. We estimated these costs to be nearly twice as high as for other proteins. There was a ten-fold difference in cost if, instead of entire proteins, only the disordered or membrane-embedded regions were compared with other segments. Although the cost of processing bulk protein was measurable, it could not be explained by several tested protein features, including those linked to translational efficiency or intensity of physical interactions after maturation. It most likely included a number of individually indiscernible effects arising during protein synthesis, maturation, maintenance, (mal)functioning, and disposal. When scaled to the levels normally achieved by proteins in the cell, the fitness cost of dealing with one amino acid in a standard protein appears to be generally very low. Many single amino acid additions or deletions are likely to be neutral even if the effective population size is as large as that of the budding yeast. This should also apply to substitutions. Selection is much more likely to operate if point mutations affect protein structure by, for example, extending or creating stretches that tend to unfold or interact improperly with membranes.Entities:
Keywords: budding yeast; disordered proteins; membrane proteins; molecular evolution rate; protein overexpression
Mesh:
Substances:
Year: 2013 PMID: 24128940 PMCID: PMC3845635 DOI: 10.1093/gbe/evt154
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FThe effects of single gene overexpression on growth. The number of cell divisions in single-strain cultures was estimated four times independently. The estimates were divided by the median values of relevant replications to obtain normalized values. (a) The repeatability of the individual normalized fitness estimates and (b) the frequency distribution of strains’ means. The vertical dashed line marks the slowest growing 91 strains. These were removed from all of the following statistical analyses to make the distribution symmetric and closer to normal. (This exclusion was unlikely to affect our analyses. For example, we correlated fitness with ten properties of proteins for all data and those lacking the 77 data points. For data analyzed in this way, pairs of Pearson’s coefficients were themselves very much correlated: Pearson’s r = 0.988, Spearman’s r = 1).
FGene Ontology categories as predictors of the overexpression cost. The graph shows the highest and most statistically significant deviations of the Yeast Slim category means from the grand mean (not fitness gains or losses when compared with a strain with no overexpression).
FProtein properties and the fitness cost of overexpression. (a) Examples of fitness predictors (only the most significant predictors are shown; the remaining ones are in supplementary fig. S2, Supplementary Material online). Moving averages are shown as red lines for continuous variables. (b) Results of multifactorial analysis. Statistical significance of positive (green) and negative (red) effects is shown.
FThe level of protein overexpression. (a) Frequency distribution of the amount of protein at the normal (empty bars) and overexpressed (filled bars) levels. Normal protein levels were taken from a previous study (Ghaemmaghami et al. 2003) and overexpression estimates were obtained in this study using a competitive ELISA assay. (b) The relationship between protein length and protein overexpression level (see supplementary methods, Supplementary Material online).
Fitness Cost of Protein Expression
| Protein Type | 1% of Total Protein | Special Region Fraction (Mean ± SD) | Cost Per Single aa |
|---|---|---|---|
| Standard | 0.023 ± 0.005 | — | (7.32 ± 1.63) × 10−11 |
| Disordered (added) | 0.017 ± 0.004 | 0.11 ± 0.08 | (6.76 ± 1.47) × 10−10 |
| Trans-membrane (added) | 0.012 ± 0.002 | 0.13 ± 0.10 | (4.78 ± 0.82) × 10−10 |
aProteins were standard (that is, cytosolic and well structured), contained disordered regions, and were located in membranes. The proportion of protein length taken by the disordered or transmembrane regions is shown in the middle column.
bThe fitness cost of producing 1% of superfluous polypeptide (standard), plus the costs added by the presence of disordered or transmembrane regions.
cThe fitness cost of expressing one amino acid in one protein molecule if the amino acid is located in standard or special regions.