Changping Li1, Baoxu Ding1, Xintong Ma1, Xuan Yang1, Hongyan Wang2, Yuefan Dong1, Zhibin Zhang1, Jinbin Wang1, Xiaochong Li1, Yanan Yu1, Yiyang Yu1, Bao Liu1, Jonathan F Wendel3, Yidan Li4, Tianya Wang1, Lei Gong1. 1. Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun, Jilin Province, 130024, China. 2. Laboratory of Plant Epigenetics and Evolution, School of Life Science, Liaoning University, Shenyang, 110036, China. 3. Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA, 50011. 4. Institute of Agricultural Biotechnology, Jilin Academy of Agricultural Sciences, Changchun, Jilin Province, 130024, China.
Abstract
Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCo) has long been studied from many perspectives. As a multisubunit (large subunits [LSUs] and small subunits[SSUs]) protein encoded by genes residing in the chloroplast (rbcL) and nuclear (rbcS) genomes, RuBisCo also is a model for cytonuclear coevolution following allopolyploid speciation in plants. Here, we studied the genomic and transcriptional cytonuclear coordination of auxiliary chaperonin and chaperones that facilitate RuBisCo biogenesis across multiple natural and artificially synthesized plant allopolyploids. We found similar genomic and transcriptional cytonuclear responses, including respective paternal-to-maternal conversions and maternal homeologous biased expression, in chaperonin/chaperon-assisted folding and assembly of RuBisCo in different allopolyploids. One observation is about the temporally attenuated genomic and transcriptional cytonuclear evolutionary responses during early folding and later assembly process of RuBisCo biogenesis, which were established by long-term evolution and immediate onset of allopolyploidy, respectively. Our study not only points to the potential widespread and hitherto unrecognized features of cytonuclear evolution but also bears implications for the structural interaction interface between LSU and Cpn60 chaperonin and the functioning stage of the Raf2 chaperone.
Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCo) has long been studied from many perspectives. As a multisubunit (large subunits [LSUs] and small subunits[SSUs]) protein encoded by genes residing in the chloroplast (rbcL) and nuclear (rbcS) genomes, RuBisCo also is a model for cytonuclear coevolution following allopolyploid speciation in plants. Here, we studied the genomic and transcriptional cytonuclear coordination of auxiliary chaperonin and chaperones that facilitate RuBisCo biogenesis across multiple natural and artificially synthesized plant allopolyploids. We found similar genomic and transcriptional cytonuclear responses, including respective paternal-to-maternal conversions and maternal homeologous biased expression, in chaperonin/chaperon-assisted folding and assembly of RuBisCo in different allopolyploids. One observation is about the temporally attenuated genomic and transcriptional cytonuclear evolutionary responses during early folding and later assembly process of RuBisCo biogenesis, which were established by long-term evolution and immediate onset of allopolyploidy, respectively. Our study not only points to the potential widespread and hitherto unrecognized features of cytonuclear evolution but also bears implications for the structural interaction interface between LSU and Cpn60 chaperonin and the functioning stage of the Raf2 chaperone.
As the most abundant photosynthetic enzyme on earth (1, 2), ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCo) has long attracted study, including that of its biogenesis (3), metabolic maintenance (4), engineering for increased catalytic specificity for CO2 versus O2 (5), and recombinant expression (3, 6–9). RuBisCo has also been widely employed in evolutionary biology, mostly for molecular phylogenetic inference using sequences of the large subunit (LSU), encoded by the relatively slow-evolving chloroplast rbcL genes (10–12). In contrast to the plastid-encoded LSU, the small subunit (SSU) is encoded by rbcS, which typically occurs in small multigene families that evolve nonindependently (13–15). Given that the RuBisCo enzyme complex is composed of both nuclear-encoded SSUs and cytoplasmically encoded LSUs, RuBisCo also has been an important model for the study of cytonuclear coevolution following allopolyploid speciation in plants (16–23).Allopolyploid speciation entails the combination of two or more diploid parents with divergent nuclear genomes but in only one progenitor cytoplasm, which carries the mitochondrial and plastid genomes (24). Allopolyploid speciation is prevalent in the current plant kingdom (25–27), including in many well-studied models such as Arabidopsis, Gossypium, Nicotiana, Tragopogon, and Triticum. Of note, allopolyploid speciation is followed by evolutionary diploidization, which involves diverse genomic processes and mechanisms (24). Accompanying allopolyploid speciation, stoichiometric disruption and incompatibility between nuclear and cytoplasmic genomes, caused by the union of two nuclear genomes in only one set of parental organellar genomes (usually maternal), necessitates cytonuclear coevolution or coordination (16–23). Previous studies using RuBisCo have demonstrated nuclear, paternal-to-maternal gene conversion among subgenomic rbcS homologs leading to maternal-like paternal SSUs (carrying maternal conversions) and transcriptionally biased expression of maternal and maternal-like rbcS homologs (16–23). However, it is still underexplored about whether foregoing cytonuclear coordinative responses still hold for other nuclear-encoded proteins targeting to cytoplasmic organelles in the allopolyploid speciation process.RuBisCo maturation involves folding and assembly stages, with a number of intermediate cytonuclear coencoded complexes formed from plastid-encoded LSUs and nuclear-encoded chaperonin/chaperones (Fig. 1). In the initial folding stage, nuclear-encoded chaperonin 60α, chaperonin 60β, cochaperonin 20 ,and cochaperonin 10 (hereafter abbreviated as Cpn60α, Cpn60β, and Cpn20/Cpn10, respectively) proteins are assembled into a complex (in a 1:1:1:1:1 ratio of subunits relative to LSU; Fig. 1), which folds LSU into a configuration appropriate for subsequent polymerization (28). Subsequently, nuclear-encoded RuBisCo accumulation factor 1 (hereafter abbreviated as Raf1) and/or RbcX assemble LSUs into dimers and even higher oligomers (in 1:2 or 2:2 ratio relative to LSU; Fig. 1), for assembly with other chaperones (28). RuBisCo accumulation factor 2 (hereafter abbreviated as Raf2) chaperones are implicated as binding with this complex (in an undetermined ratio relative to LSU) in a subsequent second assembly step, downstream of LSURaf1 and LSURbcX complex formation, or perhaps at a folding stage upstream of the Cpn60αβ/Cpn20 chaperonin complex. It is thought that the bundle sheath defective 2 (hereafter abbreviated as Bsd2) chaperone replaces the previous assembly chaperones to bind with LSU octamers (in a 1:1 ratio relative to LSU; Fig. 1), facilitating the binding of SSU with LSU to form the RuBisCo holoenzyme (28). To the best of our knowledge, none of the aforementioned proteins involved in RuBisCo biogenesis have been explored with respect to cytonuclear coevolutionary responses let alone comparing their cytonuclear coordinative responses in terms of their functional position in RuBisCo folding and assembly steps.
Fig. 1.
Overview of folding and assembly stages/processes of RuBisCo biogenesis. Nuclear-encoded premature chaperonins, chaperones, and RuBisCo SSUs in parenthesis are targeted to chloroplasts by their N-terminal transit peptides and transported through TOC (outer membrane translocon) complexes into chloroplasts. Within chloroplasts, LSUs produced by translation from chloroplast-genome rbcL transcripts are input into subsequent folding and assembly steps (subdivided into assembly-1 and -2 stages). Intermediate complexes are arranged according to their functional position in the respective stage (8). Component combinations and ratios of LSU vs. respective auxiliary chaperonin/chaperon proteins or SSUs in each intermediate complex are labeled beneath each complex. The functioning stage of the Raf2 cofactor has not been ascertained; based on results from the present study, its most likely positioning in assembly is denoted by the red star.
Overview of folding and assembly stages/processes of RuBisCo biogenesis. Nuclear-encoded premature chaperonins, chaperones, and RuBisCo SSUs in parenthesis are targeted to chloroplasts by their N-terminal transit peptides and transported through TOC (outer membrane translocon) complexes into chloroplasts. Within chloroplasts, LSUs produced by translation from chloroplast-genome rbcL transcripts are input into subsequent folding and assembly steps (subdivided into assembly-1 and -2 stages). Intermediate complexes are arranged according to their functional position in the respective stage (8). Component combinations and ratios of LSU vs. respective auxiliary chaperonin/chaperon proteins or SSUs in each intermediate complex are labeled beneath each complex. The functioning stage of the Raf2 cofactor has not been ascertained; based on results from the present study, its most likely positioning in assembly is denoted by the red star.Here, we explore their cytonuclear evolution in multiple angiosperm allopolyploid models, including Arabidopsis, Gossypium, Nicotiana, and Triticum, each of which has a well-characterized phylogeny and living models of progenitor or parental diploids. Our focus was on nuclear-encoded chaperonin and chaperone genes, for which we explored copy number, genomic variation, and transcriptional bias in the context of folding and assembly steps potentially subjected to phenomena that might be reflective of cytonuclear interaction and selection. By quantifying and comparing genomic (intergenomic homeologous gene conversion) and transcriptional (maternal homeologous biased expression) cytonuclear coordination of chaperonins and chaperones across multiple natural and artificially synthesized allopolyploids, we demonstrate similar genomic and transcriptional cytonuclear responses of genes involved in chaperonin/chaperon-assisted folding and assembly of RuBisCo in different allopolyploids. We also show that these responses are temporally attenuated during the folding and assembly processes. Our results have general significance for understanding the cytonuclear evolution of chaperonin/chaperone encoding genes, as well as for elucidating structural and functional features of intermediate protein complexes during RuBisCo biogenesis.
Results
Reduction in Copy Numbers of Chaperonin and Chaperone Genes Downstream Relative to Upstream.
Gene families encoding chaperonin and chaperone proteins during RuBisCo folding and assembly (6) have previously been characterized in some diploid angiosperms (29). Here, we tabulated chaperonin and chaperone gene families (Fig. 1) and placed them in a phylogenic context (Fig. 2). Notably, chaperonin proteins involved in the upstream folding process (including Cpn60α, Cpn60β, and Cpn20/Cpn10) were encoded by multiple copies, with the numbers of gene copies encoding cpn60α, cpn60β, and cpn20/cpn10 being approximately the same (Fig. 2). Given their integration into the same Cpn60αβ/Cpn20/Cpn10 chaperonin complex, such a multiple-copy existence implicates gene balance as a constraint on copy number (30, 31). In contrast, the chaperone factors involved in RuBisCo assembly were encoded by fewer genes than those that function in the folding process (Figs. 1 and 2). Notably, there was no apparent association between gene copy number of each gene family with the number of whole-genome duplications (WGDs) in each lineage. In addition, a unique case involves raf2, which encodes an essential Raf2 protein for RuBisCo biogenesis. Without exception across the angiosperms studied, this protein was encoded by a single-copy gene (Fig. 2).
Fig. 2.
Copy numbers of chaperonin and chaperon genes in representative sequenced diploid angiosperm species. Sequenced diploid angiosperm species are listed according to their phylogenetic relationships (revised from the dendrogram tabulated in Phytozome V12.1). In each cell of the table, copy numbers of chaperonin (in Cpn60α to Cpn20/Cpn10 columns) and chaperone (in Raf1 to Bsd2 columns) genes in respective species are specified and their relative abundance are denoted in gradient colors. UN denotes the undetermined cases, possibly reflecting missed or unannotated copies. Those ancient WGDs (aWGDs) are denoted in the respective branch or node . Similar numbers of multiple-copy cpn60α, cpn60β, and cpn20/cpn10 genes across species are statistically tested by paired Wilcoxon rank sum tests, P > 0.05.
Copy numbers of chaperonin and chaperon genes in representative sequenced diploid angiosperm species. Sequenced diploid angiosperm species are listed according to their phylogenetic relationships (revised from the dendrogram tabulated in Phytozome V12.1). In each cell of the table, copy numbers of chaperonin (in Cpn60α to Cpn20/Cpn10 columns) and chaperone (in Raf1 to Bsd2 columns) genes in respective species are specified and their relative abundance are denoted in gradient colors. UN denotes the undetermined cases, possibly reflecting missed or unannotated copies. Those ancient WGDs (aWGDs) are denoted in the respective branch or node . Similar numbers of multiple-copy cpn60α, cpn60β, and cpn20/cpn10 genes across species are statistically tested by paired Wilcoxon rank sum tests, P > 0.05.
Attenuated Cytonuclear Genomic Responses to Polyploidy.
Previous studies in allopolyploids have demonstrated coordinated cytonuclear evolution between the maternally derived chloroplast rbcL (and its encoded LSU) and the biparentally inherited rbcS genes encoding the SSU (16, 17, 19, 21, 23). At the genomic level, this coordination involves paternal-to-maternal gene conversion of rbcS homeologs, which results in maternal-like, paternal rbcS copies (16, 17, 19, 21, 23). It is established that the folding process involves significant protein structural alterations in LSU oligomers, whereas the assembly process completes the polymerization of individual folded LSUs (8). Given these different effects of folding and assembly cofactors on LSUs during RuBisCo biogenesis, we hypothesized that chaperonins working at the folding stage would display more evidence of genomic cytonuclear coordination (more paternal-to-maternal gene conversions) than chaperones functioning during the later assembly stage.To test this hypothesis, we characterized gene conversions within chaperonin/chaperone-encoding homeologs for genes encoding auxiliary proteins that function during either stage (assembly stage was subdivided into assembly-1 and assembly-2) (28). Nonsynonymous gene conversions were determined by aligning translated open reading frames of diploid chaperonin/chaperone homologs with their respective subgenomic chaperonin/chaperone homeologs in the allopolyploids. Intersubgenomic homeologous single nucleotide polymorphism (SNP) changes leading to amino acid changes in the mature protein regions (regions encoding N-terminal transit peptide regions were excluded) were identified as nonsynonymous converted SNPs (16, 17, 23). To quantify and compare this phenomenon within and among different chaperonin/chaperone genes, nonsynonymous converted SNPs and their corresponding amino acid changes (example as illustrated in ; also ) were normalized by the total number of polymorphic amino acid positions within each full-length sequence alignment. These normalized paternal-to-maternal and maternal-to-paternal amino acid conversions were defined as positive and negative genomic cytonuclear coordination signals (GCCSs), respectively. Positive and negative GCCSs were averaged for each chaperone family acting in both the folding and assembly stages, for each allopolyploid studied.As tabulated () and summarized (Fig. 3), the protein factor having the highest positive average GCCS values varied among genes and species (e.g., cpn20 in Gossypium hirsutum and Nicotiana tabacum and cpn60β in Arabidopsis suecica, Triticum turgidum, and Triticum aestivum). In addition, the average positive GCCSs exhibited an attenuation trend, which involved higher GCCSs of folding-stage chaperonins than those of assembly-stage chaperones in most sampled allopolyploids (Fig. 3 ; Wilcoxon rank sum test, P < 0.05). Notably, there were some exceptional cases with negative GCCSs or maternal-to-paternal conversion for some chaperonin and chaperones in some sampled allopolyploids, including cpn60α in N. tabacum, cpn60β and cpn10 in G. hirsutum, rbcX in N. tabacum, and raf2 in T. aestivum (D-subgenomic portion) (Fig. 3 ). However, since the directionality of the attenuation trend is too common to be by chance, the overall attenuated positive GCCCs during RuBisCo biogenesis were still supported.
Fig. 3.
Averaged GCCSs of chaperonin and chaperone genes in the folding and assembly processes during RuBisCo biogenesis in representative allopolyploids. Chaperonin and chaperone genes are labeled on the x-axis, which are arranged in accordance with their functional position in folding and assembly stage (denoted at the top of each panel); on the y axis, positive and negative averaged GCCSs are shown. The natural allopolyploids A. suecica, G. hirsutum, and N. tabacum (in A), the wild and domesticated T. turgidum subspecies and the paternal A- and D-subgenomes of T. aestivum (in B), and newly synthesized allopolyploids (in C) are compared and summarized. Signals of chaperonin genes at folding stages are linked with solid lines, while those of chaperone genes at the assembly substages 1 and 2 are within-group linked using dashed lines and dotted lines, respectively.
Averaged GCCSs of chaperonin and chaperone genes in the folding and assembly processes during RuBisCo biogenesis in representative allopolyploids. Chaperonin and chaperone genes are labeled on the x-axis, which are arranged in accordance with their functional position in folding and assembly stage (denoted at the top of each panel); on the y axis, positive and negative averaged GCCSs are shown. The natural allopolyploids A. suecica, G. hirsutum, and N. tabacum (in A), the wild and domesticated T. turgidum subspecies and the paternal A- and D-subgenomes of T. aestivum (in B), and newly synthesized allopolyploids (in C) are compared and summarized. Signals of chaperonin genes at folding stages are linked with solid lines, while those of chaperone genes at the assembly substages 1 and 2 are within-group linked using dashed lines and dotted lines, respectively.To further explore the nonsynonymous gene conversions in chaperonin and chaperone genes, we aligned the diploid chaperonin and chaperone homologs with their respective homeologs in all sampled allopolyploids (). For Cpn20, with significantly high positive GCCSs in G. hirsutum and N. tabacum, GCCS signals mainly occur around the protein interface and adjacent regions that interact with LSUs (; 32), where nonsynonymous substitutions between parental diploid homologs were detected. For these same species, other chaperonins/chaperones had limited or no GCCSs (including Cpn10, Raf1, RbcX, and Bsd2), which corresponded to parental diploids where homologs lacked nonsynonymous substitutions in the LSU interface regions (28, 30, 33–36). For Cpn60β, with a significant negative GCCS in G. hirsutum, its GCCSs were detected at a certain position (the 315th aligned amino acid position in ). For Triticum, positive paternal-to-maternal GCCSs were detected in the paternal homeolog of Cpn60β; whereas Cpn20 did not have nonsynonymous substitutions in the interface regions with LSU and did not display any GCCS (). Together, our results suggest that the protein compositional difference of chaperonins and chaperones in their interface regions with LSUs could have provided a selective force in driving the fixation of nonsynonymous gene conversions following allopolyploidy.
Attenuated Cytonuclear Transcriptomic Responses to Allpolyploidy.
Biased maternal homeologous transcriptional expression has previously been observed for some genes in allopolyploids (16–20, 22). Here, we quantified the ratio of maternal vs. paternal homoelogous expression (evaluated in fragments per kilobase of transcript per million mapped reads) for each pair of chaperonin and chaperone homeologs and termed this ratio as transcriptomic cytonuclear coordination signal (TCCS) (Fig. 4). This ratio was statistically tested for departures from equivalence using DESeq2 (). The results showed that 1) although the TCCSs varied among chaperonin/chaperone genes at the folding and assembly stages in the same allopolyploid species, there was at least one folding stage chaperonin gene displaying statistically significant positive transcriptomic maternal bias (significant positive TCCS) in all allopolyploid species, which included the chaperonin cpn60β, cpn20, and/or cpn10 genes (Fig. 4). However, there were occasional significant TCCSs of chaperone at the assembly stage in Triticum allopolyploids (Fig. 4); 2) the genes showing the highest bias were the folding stage chaperonins cpn20 and cpn60β (Fig. 4); 3) in most cases, significant transcriptomic maternal biases were higher during the folding stage and lower to nonexistent during the assembly stage (Fig. 4), when comparing TCCSs of paired folding chaperonin vs. assembly chaperone genes with statistical significant bias (exceptional cases involve Cpn20-1 vs. Raf2 in T. turgidum ssp. dicoccoides, Cpn20-2 vs. Raf1 in T. aestivum A-subgenome portion). Thus, Cpn20 and Cpn60β chaperonins at the folding stage appear to exhibit similar attenuated cytonuclear evolutionary responses to allopolyploidy at both the genomic and transciptomic levels (Figs. 3 and 4).
Fig. 4.
TCCSs of each chaperonin and chaperone gene in folding and assembly processes during RuBisCo biogenesis in representative natural allopolyploids. Chaperonin and chaperone genes functioning in folding and assembly stages (denoted in red and blue at the top of each panel) are arranged along the x-axis; corresponding TCCSs for each homeolog pair, estimated as the expression ratio of maternal vs. paternal chaperonin/chaperone gene homeologs, are summarized in boxplots. Statistically significant TCCS boxes (biased maternal expression is determined in DESeq2 and depicted above the horizontal nonbiased homeologous expression dotted line; ) are denoted by asterisks above boxes. Genes in folding (red) and assembly (blue) stage with significant and nonsignificant TCCSs are summarized in dark and light boxes, respectively. As for species in A to C (corresponding to A. suecica, N. tabacum, and G. hirsutum), since there is no significant TCCS detected in any assembly chaperone, the comparison of TCCSs between folding chaperonin and assembly chaperone are naturally accepted as being significant; as for species in D to G (corresponding to wild and domesticated T. turgidum subspecies and the paternal A- and D-subgenomes of T. aestivum), iterative comparisons of TCCSs are made between paired cofactors with significant bias at folding and/or assembly stage (pairs are linked by solid horizontal line). Respective statistical significant (in asterisk) and nonsignificant (P > 0.05) comparisons are denoted beneath the pairing line.
TCCSs of each chaperonin and chaperone gene in folding and assembly processes during RuBisCo biogenesis in representative natural allopolyploids. Chaperonin and chaperone genes functioning in folding and assembly stages (denoted in red and blue at the top of each panel) are arranged along the x-axis; corresponding TCCSs for each homeolog pair, estimated as the expression ratio of maternal vs. paternal chaperonin/chaperone gene homeologs, are summarized in boxplots. Statistically significant TCCS boxes (biased maternal expression is determined in DESeq2 and depicted above the horizontal nonbiased homeologous expression dotted line; ) are denoted by asterisks above boxes. Genes in folding (red) and assembly (blue) stage with significant and nonsignificant TCCSs are summarized in dark and light boxes, respectively. As for species in A to C (corresponding to A. suecica, N. tabacum, and G. hirsutum), since there is no significant TCCS detected in any assembly chaperone, the comparison of TCCSs between folding chaperonin and assembly chaperone are naturally accepted as being significant; as for species in D to G (corresponding to wild and domesticated T. turgidum subspecies and the paternal A- and D-subgenomes of T. aestivum), iterative comparisons of TCCSs are made between paired cofactors with significant bias at folding and/or assembly stage (pairs are linked by solid horizontal line). Respective statistical significant (in asterisk) and nonsignificant (P > 0.05) comparisons are denoted beneath the pairing line.
Immediate Versus Evolved Genomic and Transcriptional Cytonuclear Responses to Allpolyploidy.
An interesting follow-up question is about the timing of genomic and transcriptomic cytonuclear responses to the shock of genome merger and doubling. Are these established immediately with allopolyploidization or are they evolved responses? To address this question, we characterized genomic and transcriptomic responses in the synthetic allotetraploids A. suecica (738) and G. hirsutum (2(A2D1)) and two synthetic allotetraploid wheat lines (AADD and SLSLAA), using similar analyses as described above (Figs. 3 and 5).
Fig. 5.
TCCSs of each chaperonin and chaperone gene in the folding and assembly processes of RuBisCo biogenesis as revealed in synthetic allopolyploids. All symbols, labels, and annotations are the same as those in Fig. 4. The newly synthesized allopolyploids, including 2(A2D1) cotton, AADD allotetraploid wheat, A. suecica accession 738, and SLSLAA allotetraploid wheat, are illustrated in A–D, respectively.
TCCSs of each chaperonin and chaperone gene in the folding and assembly processes of RuBisCo biogenesis as revealed in synthetic allopolyploids. All symbols, labels, and annotations are the same as those in Fig. 4. The newly synthesized allopolyploids, including 2(A2D1) cotton, AADD allotetraploid wheat, A. suecica accession 738, and SLSLAA allotetraploid wheat, are illustrated in A–D, respectively.As for the genomic responses, we detected both positive and negative GCCSs within synthesized allopolyploids. Specifically, for the synthetic AADD wheat and the 2(A2D1) cotton allotetraploids, the nonsynonymous averaged positive GCCSs of all chaperonins and chaperons were mostly the same. However, within synthesized Arabidopsis and SLSLAA wheat allotetraploids, positive GCCSs at the assembly stage of RuBisCo biogenesis were higher than those at the folding stage (Fig. 3). As expected, in synthetic allopolyploids, there was no attenuation in positive genomic responses as observed in natural allopolyploids (Fig. 3 ). In contrast, there were even notable negative GCCSs for many chaperones at the folding and assembly stages in the synthetic allopolyploids (Fig. 3), suggesting that gene conversions in both directions occurred within the initial generations after allopolyploid synthesis. Our observations not only support the attenuated patterns of cytonuclear responses in natural allopolyploids but also demonstrate they are evolved responses.With respect to transcriptomic responses, one might expect that the combination of cis and trans factors from divergent diploids into a common nucleus might disrupt progenitor expression patterns. Here, being similar as those analyzed natural allopolyploids, we characterized and compared the TCCSs of cofactors at folding and assembly stages in artificial (synthesized) allopolyploids (Fig. 5). Notably, although the chaperonin/chaperones with transcriptional maternal bias in each stage were occasionally different in synthetic and natural allopolyploids (Fig. 5; i.e., Cpn60β vs. Cpn20 in synthetic and natural Gossypium allopolyploids), the TCCSs with statistically significant maternal biases were higher during the folding than the assembly stage in most synthesized allopolyploids, except Cpn20-3 vs. RbcX-1 in synthesized AADD wheat. Such a result is consistent with the aforementioned observations for natural allopolyploids (Figs. 4 and 5). The implication of these results is that transcriptional cytonuclear responses to genome merger and doubling can be immediate, or in the early generations, with the onset of allopolyploidization.
Discussion
The well-characterized protein components of the folding and assembly processes during RuBisCo biogenesis provide an excellent opportunity for exploration of the cytonuclear consequences of genome merger and doubling, where the nuclear genome is doubled yet in a context of plastid and mitochondrial genomes from only one of the progenitor genomes. Here, we elucidate the cytonuclear genomic and transcriptomic responses of chaperonin and chaperone genes across a wide range of different allopolyploid systems, as well as in synthetic allopolyploids and along a temporal gradient with respect to the steps of RuBisCo biogenesis. We show that in four different genera across the angiosperm phylogenetic spectrum, there are similar patterns of apparent cytonuclear evolutionary responses to allopolyploidy. These include differences in paternal vs. maternal gene conversion in the predicted direction based on maternal parentage and alterations in homeolog expression from the two progenitor diploids that also favor the maternal parent. An observation here is the attenuated cytonuclear response, where the maximum effect is observed for proteins acting early during RuBisCo biogenesis, with diminishing or attenuated evolutionary responses for later stages. Our study points to the possibility that this is a more widespread and hitherto unrecognized feature of cytonuclear evolution.
Fate of WGD Chaperonin/Chaperone Gene Duplicates in RuBisCo Biogenesis Can Be Explained by Dosage Balance and Dominant Negative Mutation.
It has been established that all modern diploid flowering plants are paleopolyploids, having experienced multiple rounds of ancient WGD followed by recurrent diploidization (including angiosperms in Fig. 2; 24, 37–40). One of the key mechanisms thought to explain the variable patterns of duplicate gene retention following WGD is the gene dosage balance hypothesis (31, 41, 42), which postulates that after WGD, genes encoding component subunits of specific protein complexes are preferentially maintained in appropriate dosages to maintain balanced interactions within the protein complex (31, 41–43, 44, 45). Although gene balance is unlikely to entirely explain variable patterns of duplicate gene loss and retention (43, 44, 46–48), this framework offers an appropriate lens for mosaic nuclear- and chloroplast-encoded protein complexes as well (44); because the number of chloroplasts increases after polyploidy (49–52), the ratio of chloroplast-encoded genes vs. nuclear-encoded genes in the same mosaic complex are often to remain relative stable, perhaps accounting for at least part of the maintenance of duplicate genes for some gene families (53, 54).Given the increased number of chloroplast genomes following both ancient and recent allopolyploidy (55), we suggest that the maintenance of dosage balance might play an important role in the retention of multiple copies of cpn60α, cpn60β, and cpn20/cpn10 (Fig. 2), counteracting the concurrent diploidization processes (56, 57). Copy numbers for other chaperones in the assembly process (including Raf1, RbcX1, RbcX2, Raf2, and Bsd2; Fig. 2) appear to have been mostly reduced following WGDs, thus requiring explanations apart from dosage balance, including a dominant-negative mutation model (44, 46), and variable constraints based on position in the pathway or network (58, 59).In the present example, the coexistence of multiple- or single-copy RuBisCo biogenesis genes may be related to the well-understood protein stoichiometries (3, 28, 30, 33, 35, 36), in which the ratio of LSU vs. chaperonin/chaperon proteins in each functional unit is known to change from 1:1 during folding to 2:1 during assembly (Fig. 1). Accordingly, the increased dosage of LSUs after WGD is consistent with a 1:1 ratio of LSU:chaperonin at the folding stage given the doubling of nuclear genomes. The 2:1 ratio at the assembly stage could release the selection from increased LSU dosage or cause less constraint for the genes in the latter RuBisCo biogenesis pathway, under which the dominant-negative mutation mainly directs returning to single copy for those chaperone protein-encoding genes after WGD. Such a supposed mechanism still needs further experimental and/or evolutionary confirmations.
Cytonuclear Evolutionary Responses to Allopolyploidy Operate at Different Paces.
Previous work has established two common cytonuclear responses to allopolyploidy, i.e., paternal-to-maternal gene conversion and maternally biased gene expression at the level of the transcriptome. Most of this work has involved RuBisCo, due to the asymmetry in the location of its protein-encoding components (16–23). By extending this analysis to include auxiliary proteins involved in RuBisCo biogenesis, we asked whether 1) the footprints of cytonuclear evolution that often are apparent for rbcS also are evident for chaperonin and chaperone genes that participate in RuBisCo folding and assembly. We further asked whether 2) the pace of apparent cytonuclear evolutionary responses to allopolyploidy varies among these protein participants.With respect to the first question, we show both positive genomic and transcriptional cytonuclear coordination, as reflected by frequent intergenomic paternal-to-maternal conversions and maternally biased expression among diverse allopolyploids, but intriguingly, these responses are unequal during RuBisCo biogenesis, becoming attenuated from early to later during protein assembly (Fig. 3 and 4). This diminution of an apparent cytonuclear evolutionary response may reflect the intimacy of interaction of each chaperonin and chaperone with the LSU. If we assume a Cytonuclear Coordination Olympic Game for the relevant chaperonins and chaperones, Cpn20 or Cpn60β at the folding step win both the genomic and transcriptional marathon race, where marathon implies a long-term evolutionary outcome following allopolyploidy. Noting that the chaperonins and chaperones studied here are also involved in folding of other protein cytonuclear coencoded protein complexes (i.e., NADH complex in the electron transport chain; 57), the possibility arises that the evolutionary patterns we observe reflect cytonuclear pressure from these other complexes in addition to RuBisCo. In other words, additional organellar component subunits could also be judges in the cytonuclear Olympic competitions. It will be of interest to explore this possibility in future studies.As for the second question, by characterizing and comparing genomic and transcriptional responses among natural and artificial synthetic allopolyploids, we discovered that apparent cytonuclear evolutionary responses are attenuated during RuBisCo biogenesis (Fig. 4), even immediately with the onset of allopolyploidy (Fig. 5). In an analogy to the above marathon race, Cpn20 or Cpn60β at the folding step win the transcriptional 100-meter dash (Cpn20 or Cpn60β harboring the highest maternal biased expression), where dash conveys an immediate evolutionary response to allopolyploidy. Although the underlying molecular mechanism for this diminution in genomic or transcription bias for proteins facilitating RuBisCo biogenesis remains uncharacterized, we speculate that this reflects transcriptional feedback effects from the maternally inherited plastid to the nucleus, e.g., via reprogrammed epigenomic modification (60), divergence in promoter regions of parental homeologs (61), or differences in binding affinity of transcription factors to homeologous promoters (62).In contrast to transcriptional cytonuclear coordination, genomic changes during the initial generations of allopolyploidy appear to be more random with few gene conversions and in either direction (Fig. 3). These few but apparently random gene conversions could reflect 1) genomic damage and subsequent homeologous repair (63, 64); 2) the lack of subgenomic recombination suppressive genes in the diploid parents, such as the absence of Ph1 gene in AADD and SLSLAA wheat (65), or the presence of a ph1-like allele in diploid A. thaliana (66) with potential modifications of the meiotic machinery in the A.
arenosa subgenome in synthetic A. suecica (67, 68), allowing pairing and recombination of homeologous chromosomes; or 3) reprogrammed epigenomic landscape caused by allopolyploidy, e.g., in allopolyploid cotton (69), which could promote homeologous pairing and recombination as well. Taken together, our data support the notion that cytonuclear coevolution is a slow process. Future insights into the pace of cytonuclear evolution are likely to derive from comparisons of multiple allopolyploids of varying age, either within genera (for example, Nicotiana; 70) or from additional, broader surveys among angiosperms.
Implications for the Structural Interaction Interface between LSU and Cpn60 Chaperonin and the Functioning Stage of the Raf2 Chaperone.
Understanding the intermediate folding and assembly complex involving LSUs and their interacting auxiliary cofactors is essential for interpreting molecular evolution in the interaction face between component subunits (28, 30, 32, 33, 35, 36). Because the intermediate complexes are not easily expressed or purified for protein structural analyses (6), the interaction interface between Cpn60β and LSU and the functioning stage of the Raf2 cofactor interacting with LSUs in RuBisCo biogenesis have remained elusive. Here, we augment more traditional molecular protein–protein interaction assays with evolutionary considerations. Specifically, we note that Cpn60β has a domain with clusters of significant positive GCCSs, as might be expected if these function at the interface region with LSU during folding (Fig. 1 and ). With respect to the undetermined functioning location of cofactors interacting with LSUs in complex, perhaps less compelling, but suggestive, is the case for Raf2, where our interpretation that its interaction with LSUs and function during assembly is evolutionarily supported by its single-copy composition in all sampled diploid plant species (Fig. 2) and limited genomic cytonuclear coordination (Fig. 3) similar to other assembly chaperones (Fig. 1). Evidence testing these suggestions awaits biophysical or other analyses that are more direct.
Materials and Methods
Plant Materials.
Four representative monocot and eudicot allopolyploid lineages, including both natural and synthesized allopolyploids and diploid parents, were selected for our cytonuclear analyses (), which include Arabidopsis (71), Gossypium (72), Nicotiana (73), and Triticum (74).
Gene Sequence Retrieval, Cloning, and Sequencing.
Chaperonin and chaperone gene family composition and gene sequences in different allopolyploid lineages were characterized using genomic searching. To validate the retrieved sequences, we designed homeolog-specific primers to complete PCR amplification and cloning for each chaperonin and chaperone homeolog coding DNA sequence in each allopolyploid lineage (). Additionally, we also searched for respective homologs in 41 genome assemblies of representative diploid angiosperms deposited in Phytozome v12.1. Details of the technical procedure are described in .
Cytonuclear Coevolution at the Genomic and Transcriptomic Level.
To quantify and compare genomic cytonuclear coordination at different stages in RuBisCo biogenesis, we defined the GCCS for each chaperonin and chaperone gene at the sequence level as the ratio of nonsynonymous, presumptively converted, amino acids vs. the total of nonsynonymous amino acid differences in each pair of parental mature proteins (). Positive paternal-to-maternal and negative maternal-to-paternal intergenomic gene conversions were discriminated (). To characterize possible cytonuclear coordination at the transcriptomic level, for each chaperonin and chaperone homeologous pair, we defined and calculated the TCCS as the subgenomic expression ratio of maternal vs. paternal chaperonin/chaperone gene homeologs in leaf RNA Sequencing (RNAseq) data (expression level estimated by reads per kilobase of transcript, per million mapped reads; ). Details of foregoing coevolutionary analyses are described in .
Authors: Thomas Marcussen; Simen R Sandve; Lise Heier; Manuel Spannagl; Matthias Pfeifer; Kjetill S Jakobsen; Brande B H Wulff; Burkhard Steuernagel; Klaus F X Mayer; Odd-Arne Olsen Journal: Science Date: 2014-07-18 Impact factor: 47.728
Authors: Simon Griffiths; Rebecca Sharp; Tracie N Foote; Isabelle Bertin; Michael Wanous; Steve Reader; Isabelle Colas; Graham Moore Journal: Nature Date: 2006-02-09 Impact factor: 49.962