| Literature DB >> 33526669 |
Diana Ascencio1,2,3,4,5,6, Guillaume Diss1,2,3,5, Isabelle Gagnon-Arsenault1,2,3,4,5, Alexandre K Dubé1,2,3,4,5, Alexander DeLuna6, Christian R Landry7,2,3,4,5.
Abstract
Gene duplication is ubiquitous and a major driver of phenotypic diversity across the tree of life, but its immediate consequences are not fully understood. Deleterious effects would decrease the probability of retention of duplicates and prevent their contribution to long-term evolution. One possible detrimental effect of duplication is the perturbation of the stoichiometry of protein complexes. Here, we measured the fitness effects of the duplication of 899 essential genes in the budding yeast using high-resolution competition assays. At least 10% of genes caused a fitness disadvantage when duplicated. Intriguingly, the duplication of most protein complex subunits had small to nondetectable effects on fitness, with few exceptions. We selected four complexes with subunits that had an impact on fitness when duplicated and measured the impact of individual gene duplications on their protein-protein interactions. We found that very few duplications affect both fitness and interactions. Furthermore, large complexes such as the 26S proteasome are protected from gene duplication by attenuation of protein abundance. Regulatory mechanisms that maintain the stoichiometric balance of protein complexes may protect from the immediate effects of gene duplication. Our results show that a better understanding of protein regulation and assembly in complexes is required for the refinement of current models of gene duplication.Entities:
Keywords: dosage balance hypothesis; fitness effects; gene duplication; gene expression; protein interaction
Year: 2021 PMID: 33526669 PMCID: PMC7970654 DOI: 10.1073/pnas.2014345118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.More than 10% of yeast essential genes affect fitness when duplicated. (A) Relative fitness was measured using a high-resolution competition assay (23). We cocultured a mCherry-tagged strain carrying an extra copy of an essential gene on a centromeric plasmid with a CFP-tagged reference strain carrying a control plasmid. We followed the ratio of the two populations for up to 28 generations to calculate a slope, which corresponds to the selection coefficient (s). (B) Cumulative distribution of selection coefficients of all the 899 strains tested (Dataset S1). Each dot represents a strain expressing an additional gene copy. The black dots represent the distribution of 192 biological replicates of reference-versus-reference competition. The threshold used for deleterious (in red) or beneficial (in blue) effect is at least 1% (−4.5 > z score > 4.5). (C) Selection coefficients for the validation of the 180 genes with significant effects measured by flow cytometry (Dataset S2). The labels are for genes with the strongest deleterious and beneficial effects. The bars indicate the SD of three biological replicates. The black circles highlight genes with haploinsufficient phenotypes. Spearman’s correlation coefficient is indicated at the Top. (D) A comparison of fitness effects among haploinsufficient and haplosufficient genes (12). P value from a Fisher exact test is shown. The fraction and number of genes are indicated with white numbers. (E) Selection coefficients of genes that code for proteins that are members of complexes and proteins that are not. On Top, we show the P value from a Wilcoxon rank-sum test.
Fig. 2.Most duplications of subunits do not affect PPIs in large complexes. (A) DHFR-PCA–based strategy to measure perturbation of the pairwise physical interactions after a duplication. On the Left, in a DHFR-PCA, the colony size on selective media (MTX) is correlated with the stability and strength of the physical interaction between the two subunits S1 and S2 (shown in green). The perturbation score (ps) is defined as the colony size difference between the strain carrying a duplication (+pCEN-VPS5) and a WT (+pCEN) strain. Heatmap indicating ps values of the complete retromer PPI network due to the duplication of VPS5. (B) Distributions of ps for interactions with and without a competing subunit. Since the duplicated protein is not tagged with a DHFR fragment, it titrates PPI partners away from the tagged copy, decreasing colony size. We show violin plots of the distributions of all the interactions tested for five complexes (Dataset S5). On Top, we show the P value from a Wilcoxon rank-sum test. (C) Colony sizes of all strains carrying the duplication of a subunit compared with their control strain (the empty vector). Colony sizes of diploid strains carrying all tested combinations of preys, baits, and duplications for the proteasome, and the three RNA polymerases (Datasets S4 and S5). The dark gray circles indicate strains above our growth threshold indicative of physical interaction, while the light gray circles are strains below the growth threshold. The black circles indicate interactions with a competing subunit above the threshold. (D) Cumulative frequency of ps of the proteasome and the three RNA polymerases. All competition effects were excluded. Labeled are the prey–bait combinations that are perturbed by the duplication of PRE7. (E) Relationship between the selection coefficient and the average ps (absolute value) of duplicated subunits on PPIs. Only significant (FDR of 5%) and noncompetition combinations were used to calculate the averages. The circles represent duplications of proteasome subunits, while triangles represent subunits of any of the three RNA polymerases. In red, we show Spearman’s correlation coefficient.
Fig. 3.Attenuation of protein abundance after duplication in most proteasome subunits. (A) Measure of attenuation with GFP-tagged proteins. Changes in abundance of each subunit can be detected by comparing fluorescent signals of GFP-tagged subunits before and after duplication. Upon attenuation, the abundance of the tagged copy will be reduced. (B) GFP signal comparison between strains carrying a duplication of the GFP-tagged subunit and their corresponding control. On the Right, a cartoon of the proteasome with its components. All GFP values are corrected for autofluorescence by subtracting the signal of the parental strain not expressing GFP and by cell size (Dataset S6). (C) Attenuation scores of all assayed proteasome subunits. The attenuation score is the difference between GFP fluorescent signals of the control strain (bearing a control plasmid) and the duplicated strain (bearing a centromeric plasmid with an extra copy of the subunit) divided by the GFP signal of the control. In the absence of attenuation, this value is 0. Upon complete attenuation, it is 0.5. (D) Attenuation scores of the proteasome subcomplexes. On the Right, the asterisks indicate significant differences between components calculated by correcting for multiple testing (Tukey’s test; *P ≤ 0.05 and **P ≤ 0.01). (E) Colony sizes of all strains carrying the PRE7 duplication (+pCEN-PRE7) compared with their control strain (the empty vector) indicating changes in PPI in the DHFR-PCA assay. The black dots highlight the subunits that have interactions disturbed after PRE7 duplication. (F) GFP signal of proteasome subunits before and after PRE7 duplication (+pCEN-PRE7). The black dots highlight the subunits that have interactions disturbed after PRE7 duplication (Dataset S7). Replicate measurements are available in .