Literature DB >> 31498865

Duplication history and molecular evolution of the rbcS multigene family in angiosperms.

Kana Yamada1, Iakov I Davydov1,2, Guillaume Besnard3, Nicolas Salamin1.   

Abstract

Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) is considered to be the main enzyme determining the rate of photosynthesis. The small subunit of the protein, encoded by the rbcS gene, has been shown to influence the catalytic efficiency, CO2 specificity, assembly, activity, and stability of RuBisCO. However, the evolution of the rbcS gene remains poorly studied. We inferred the phylogenetic tree of the rbcS gene in angiosperms using the nucleotide sequences and found that it is composed of two lineages that may have existed before the divergence of land plants. Although almost all species sampled carry at least one copy of lineage 1, genes of lineage 2 were lost in most angiosperm species. We found the specific residues that have undergone positive selection during the evolution of the rbcS gene. We detected intensive coevolution between each rbcS gene copy and the rbcL gene encoding the large subunit of RuBisCO. We tested the role played by each rbcS gene copy on the stability of the RuBisCO protein through homology modelling. Our results showed that this evolutionary constraint could limit the level of divergence seen in the rbcS gene, which leads to the similarity among the rbcS gene copies of lineage 1 within species.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Experimental Biology.

Entities:  

Keywords:  zzm321990 rbcSzzm321990 ; Coevolution; RuBisCO; duplication; gene copies; homology modelling; molecular evolution; multigene family; photosynthesis; positive selection

Mesh:

Substances:

Year:  2019        PMID: 31498865      PMCID: PMC6859733          DOI: 10.1093/jxb/erz363

Source DB:  PubMed          Journal:  J Exp Bot        ISSN: 0022-0957            Impact factor:   6.992


Introduction

Gene duplication is one of the main mechanisms creating novel features at the molecular level during evolution (Flagel and Wendel, 2009). The functional role played by duplicated genes has been discussed in detail (Hughes, 1994; Lynch and Force, 2000) and the mechanisms at work in this process are now relatively well understood (Hughes, 1994; Studer ; Innan and Kondrashov, 2010; Roulin ; Rensing, 2014). At the molecular level, it was initially proposed that relaxation of the selective constraints on one of the gene copies following gene duplication allows an accumulation of mutations that can permit the evolution of novel or sub-gene function or lead to a total loss of function (Ohta, 1988; Wagner, 1998; Moore and Purugganan, 2005). However, the advantages brought by gene duplication could not only stem from the effects of mutations but also from the protection against deleterious mutations or the mechanisms of dosage effect (Papp ; Kafri ; Cheeseman ). The creation of new gene copies by duplication is further affected by species divergence and the evolutionary history of the resulting gene family. Members of most gene families are therefore connected by a complex history of duplication and speciation events that have produced paralogous and orthologous gene copies. The identification of the proper sets of orthologous genes is challenging (Altenhoff ). Correct identification of relationships of gene copies is further complicated by the presence of gene conversion that may alter the origin of similarities between homologous regions (Mansai and Innan, 2010; Song ). The members of a multigene family can further be modified by crossing over and/or recombination (Ohta, 1977, 1979, 1983; Nei and Rooney, 2005; Mano and Innan, 2008; Dumont and Eichler, 2013). Each gene copy might further differ not only by the evolutionary process but also by the function, cellular localization of encoded protein, stability, and/or expression levels (Hudsona ; Ku ; Clark ; Petter ; Niimura, 2009). The different gene copies can, therefore, play a core role in organizing the novel or modified functions that are often required during adaptive evolution (Ohta, 1991; Nei ; Niimura, 2009; McGlothlin ). An example of this adaptive evolution is the evolution of photosynthesis. Atmospheric CO2 drastically decreased in the Oligocene (Pearson ; Edwards ; Beerling and Royer, 2011) and some plant species adapted to the depleted CO2 concentration by evolving a mechanism, called C4 photosynthesis, to concentrate CO2 by modifying the biochemical cascade and the cellular structures (Sage, 2004). C4 plants have diverged from C3 plants through the acquisition of novel enzymes. Most of the C4-specific enzymes are encoded by multigene families and the co-option of genes pre-existing in the ancestral C3 plants plays an important role during the transition from C3 to C4 type (Monson, 2003; Christin ; Bianconi ). The first enzyme of the Calvin–Benson cycle and the one that fixes CO2 into sugar is ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) (Hatch and Slack, 1968; Kanai and Edwards, 1999). RuBisCO has slower catalytic efficiency than other photosynthetic enzymes because of its affinities to both O2 and CO2 (Rawsthorne, 1992). The fixation of O2 results in a loss of energy and CO2, so-called photorespiration (Kubien ; Peterhansel ). In C4 plants, the CO2-concentrating mechanism (CCM) enabled RuBisCO to be surrounded by highly concentrated CO2, which led to the fixation of substitutions along the protein sequence that increased the catalytic efficiency of RuBisCo and decreased the affinity of CO2, compared with C3 plants (Badger and Andrews, 1987; von Caemmerer and Quick, 2000; Sage and Coleman, 2001). Evidence for the adaptive evolution of RuBisCO has come from the study of the evolution of the chloroplast rbcL gene encoding RBCL, the large subunit of RuBisCO. Positive selection for rbcL has been detected in independent C4 lineages (Kapralov and Filatov, 2007; Christin ; Piot ). In Flaveria, the signal of positive selection of the rbcL gene is almost 20 times stronger than that detected for rbcS, the gene encoding the small subunit (Kapralov ). The RBCL subunit is considered to determine the catalytic properties of RuBisCO, because it contains the catalytic site of the enzyme (Andersson, 2008). However, RBCS has been reported to have an influence on the catalytic efficiency, CO2 specificity, activity, quantity, assembly, and stability of RuBisCO (Andrews and Ballment, 1983; Furbank ; Spreitzer, 2003; Genkov and Spreitzer, 2009; Genkov ; Bracher ). Studer have suggested that some positively selected codons encoding amino acid residues that are located at the interface between RBCL and RBCS may affect the stability and the catalytic properties of RuBisCO. All these studies suggest that the interaction between RBCS and RBCL, and the rbcS gene itself, may play important roles in the evolution of RuBisCO. A better understanding of the evolutionary history of rbcS is thus essential to obtain a deeper insight into the evolution of RuBisCO. We extracted the nucleotide sequences of the rbcS gene from available full genomes of angiosperms and reconstructed the phylogenetic relationships of the rbcS gene copies. We then tested for the presence of positive selection acting on the rbcS gene across the evolution of angiosperms. Positive selection of rbcS has already been tested within some genera but has never been measured on a wider range of plants. Therefore, we aimed to elucidate the differences between gene copies of rbcS in higher plants and to infer their respective evolutionary histories. Firstly, we hypothesized that each rbcS copy may have a different interaction with rbcL, and we tested this hypothesis by inferring the coevolution between rbcS and rbcL. Secondly, we hypothesized that RBCS encoded by different rbcS gene copies may have a different degree of influence on the stability of RuBisCO. We tested this by modelling a RuBisCO structure with eight RBCS units encoded by a unique rbcS copy. We did the same for each rbcS copy and compared the stability between models. Our study provides new insights into the evolutionary mechanism of the rbcS multigene family and sheds light on its influence on RuBisCO evolution.

Materials and methods

Phylogenetic tree of rbcS among angiosperms

We downloaded the annotated rbcS gene of all angiosperms available in Phytozome v12 (https://phytozome.jgi.doe.gov/pz/portal.html). We aligned the sequences obtained using MAFFT (Katoh and Standley, 2013) and removed unreliable sequences that were poorly aligned using GUIDANCE2 with default settings (http://guidance.tau.ac.il/ver2/;Sela ). We then converted these amino acid alignments back into codon alignment using PAL2NAL (http://www.bork.embl.de/pal2nal/#RunP2N) to obtain the final nucleotide alignment of 171 rbcS gene copies for 43 angiosperm species. The TN93 model of substitution was identified as the best model using Jmodeltest v2.1.4 (Darriba ). We reconstructed the phylogenetic tree with PhyML v3.0 (Guindon and Gascuel, 2003) using the BEST algorithm for tree rearrangement while estimating all parameters of the TN93 model and the branch lengths. Branch support values were estimated based on 1,000 bootstrap replicates.

Gene conversion

We tested for the signatures of recombination and gene conversion in the rbcS gene copies using the Recombination Detection Program v4.56 software (RDP4; Martin ). We used Chimaera, 3seq, GENECONV, MaxChi, and SiScan with their default parameters. The nucleotide alignment created for the phylogenetic reconstruction was used as an input for the gene conversion analyses.

Selection

Positive selection analysis in rbcS was performed using the mixed effects model of evolution (MEME) implemented in HyPhy v2.2.6 (Pond ). We used the MG94 codons substitution base model (Muse and Gaut, 1994) and we corrected for multiple testing using a false discovery rate (FDR; Benjamini and Hochberg, 1995) with a threshold of 0.1. We selected the MEME model because it is more suitable than the branch-site mode (Zhang ) for estimating site-specific probabilities (Lu and Guindon, 2014). Positions under positive selection were plotted on the known protein structure of Spinacia oleracea (Chains B, C, E and H of 1RCX of the Protein Data Bank; Taylor and Andersson, 1997) using the software PyMol v1.3 (The PyMOL Molecular Graphics System; Schrödinger, LLC).

Coevolution between rbcS and rbcL

We downloaded the complete genome of the rbcL sequences from NCBI (NCBI Resource Coordinators, 2018) and used the filtered alignment from Guidance (see above) to ensure that the pattern of substitutions is not simply due to alignment errors but does indeed represent genuine evolutionary signals. We retained only the 30 species for which both rbcS and rbcL sequences were available. The resulting alignments were 721 and 1,461 bp long for rbcS and rbcL, respectively. Coevolution analysis of rbcS and rbcL was performed using the maximum likelihood implementation of model Coev (Dib et al., 2014, 2015). For each pair of sites, we compared the likelihood of a dependent and an independent model of substitution using the Akaike information criterion (AIC). The difference in AIC (dAIC) between the two models varies depending on the tree structure and characteristics of the alignment. To estimate the expected distribution of dAIC under no coevolution given the rbcS gene tree, we obtained a null distribution of dAIC by simulating sequences of 2,000 bp on the rbcS tree under the independent substitution model. We estimated the support for the Coev model for the ca. 4 million pairs of sites based on this simulated dataset and determined the dAIC value representing the 95% percentile of this distribution (for details, see Dib ). We then combined the dAIC values with an s/d ratio (ratio between the parameters s, representing the rate of change away from the coevolution profile, and d, representing the rate of change towards the coevolution profile) lower than 0.1 to identify sites under coevolution (see Dib for details). Since rbcL has a single gene copy per species and rbcS shows a variable copy number between species, we duplicated the rbcL sequences for each species to match the rbcS copy number. The rbcS and rbcL alignments were concatenated into a single matrix and conserved positions with a percentage of identity higher than 95% were removed because they do not provide enough information to estimate coevolution (Dib et al., 2014, 2015). The final concatenated alignment of rbcLrbcS contained 590 nucleotide positions (345 bp of rbcL and 245 bp of rbcS), which led to a total of 84,525 tests of coevolution for pairs of sites. In every pair, one of the sites belonged to rbcL, while the other belonged to rbcS. There were 7,828 pairs of sites that passed the dAIC and s/d ratio thresholds defined above and that were considered as coevolving (Supplementary Fig. S1 at JXB online). Among those, the ones with the strongest signals (dAIC more than 35 and s/d ratio less than 0.1; Supplementary Fig. S1) were selected and the R package qgraph (Epskamp ) was used to visualize them. We also plotted the pairs of sites selected on the known protein structure of S. oleracea (Supplementary Fig. S2; Chain B, C, E and H of 1RCX of Protein Data Bank: Taylor and Andersson, 1997) using PyMol v1.3 (Schrödinger, 2015).

Protein stability of RuBisCO structure

The RuBisCO quaternary structure is a hexadecamer composed of eight subunits of RBCL and eight subunits of RBCS. Since RBCL is encoded by a single gene, the eight RBCL subunits are always the same for a given species. On the other hand, the exact combination of the eight RBCS subunits is unknown. We assumed here that, for a given RuBisCO protein, the eight RBCS subunits are encoded by the same copy of rbcS. This assumption made the modelling of protein stability feasible by limiting the number of combinations and allowed us to study differences between gene copies. We performed homology modelling and estimated the Gibbs free energy, estimated as the difference of thermodynamic stability between the folded and unfolded states of a protein, to compare the stability of the whole RuBisCO structures. When the Gibbs free energy is below 0, the folded state is preferred over the unfolded state and protein models with a smaller value of Gibbs free energy can be considered to be more stable. To model the RuBisCO stability in angiosperms, the RBCS and RBCL amino acid sequences of several species of Brassicaceae and Poaceae were downloaded from UniProt (UniProt Consortium, 2015). We selected these two clades because they are well defined in the rbcS phylogenetic tree and are representative of the evolution of rbcS (see Results). To create RBCS encoded by a single gene copy, we duplicated eight times the rbcS sequence in each pair protein structure file. However, when different gene copies of the same species differed only by synonymous substitutions or when amino acids differed in a region outside the crystallized structure, only one complex was tested for these gene copies since amino acid sequences were identical (e.g. Setaria italica copies 4 and 5). Homology modelling was performed using Modeller v9.17 (Eswar ). The RuBisCO structure of Oryza sativa (1WDD of Protein Data Bank; Matsumura ) was used as a template. The homology modelling was run 100 times for each structural complex of rbcLrbcS and the best model (the one with the lowest DOPE score) was selected for further analyses. These models were then repaired with FoldX v4.0 (Schymkowitz ) using the RepairPDB function. The repair step is mandatory for removing potential bad contacts (i.e. Van der Waals clashes) in the structures, which may cause instability of modelled protein. Also using FoldX v4.0, we predicted the differences of Gibbs free energy between maximum likelihood model and null model (ΔG) of each estimated structure using the ‘Stability’ function, with default parameters. Three-dimensional structures were visualized with PyMol v1.3 (Schrödinger, 2015). Estimated ΔG values for the Brassicaceae and Poaceae were visualized on their respective rbcS gene trees using the function phenogram of the R package phytools (Revell, 2012).

Results

We identified two rbcS lineages (rbcS lineages 1 and 2) that could represent a deep duplication event that occurred before the divergence of eudicots and monocots. One gene lineage includes genes that cluster together with a known expressed gene in photosynthetic organs of rice (OsRbcS2; Morita ); we refer to this gene lineage as rbcS lineage 1 (Fig. 1). The second gene lineage includes gene copies expressed in non-photosynthetic organs such as OsRbcS1 in rice (Morita ); we refer to this lineage as rbcS lineage 2 (Fig. 2).
Fig. 1.

Maximum likelihood tree of rbcS lineage 1 in angiosperms. The phylogenetic tree was reconstructed in PhyML v3.0 (Guindon and Gascuel, 2003) using a TN93 model. Each gene copy of a given species is identified by the species name and distinguished by a number (see Supplementary Table S1 for correspondence of gene copy name used in this figure and gene ID in Phytozome database). Names of plant families are indicated on the right next to the species names except for Citrus clementina and Ricinus communis. Because the gene copies of these two species were spread all over the tree and did not cluster within a family, their family names are not indicated in this tree. Branch support was estimated using 1000 bootstraps replicates. Values above or below the branches represent the bootstrap support for each branch (%). The scale bar is shown below the phylogeny. Stars indicate the duplication events within a family.

Fig. 2.

Maximum likelihood of rbcS lineage 2 in angiosperms. The phylogenetic tree of the rbcS gene copies in angiosperms was reconstructed in PhyML v3.0 (Guindon and Gascuel, 2003) using a TN93 model. Names of plant families are indicated on the right next to the species names when the gene copies of the same family cluster together. Branch support was estimated using 1000 bootstrap replicates. Values above the branches represent the bootstrap support for each branch (%). The scale bar is shown below the phylogeny.

Maximum likelihood tree of rbcS lineage 1 in angiosperms. The phylogenetic tree was reconstructed in PhyML v3.0 (Guindon and Gascuel, 2003) using a TN93 model. Each gene copy of a given species is identified by the species name and distinguished by a number (see Supplementary Table S1 for correspondence of gene copy name used in this figure and gene ID in Phytozome database). Names of plant families are indicated on the right next to the species names except for Citrus clementina and Ricinus communis. Because the gene copies of these two species were spread all over the tree and did not cluster within a family, their family names are not indicated in this tree. Branch support was estimated using 1000 bootstraps replicates. Values above or below the branches represent the bootstrap support for each branch (%). The scale bar is shown below the phylogeny. Stars indicate the duplication events within a family. Maximum likelihood of rbcS lineage 2 in angiosperms. The phylogenetic tree of the rbcS gene copies in angiosperms was reconstructed in PhyML v3.0 (Guindon and Gascuel, 2003) using a TN93 model. Names of plant families are indicated on the right next to the species names when the gene copies of the same family cluster together. Branch support was estimated using 1000 bootstrap replicates. Values above the branches represent the bootstrap support for each branch (%). The scale bar is shown below the phylogeny. The phylogenetic tree of rbcS lineage 1, including 146 sequences available for 42 species, is shown in Fig. 1 (see Supplementary Table S1 for correspondence of gene copy name in Fig. 1 and gene ID in the Phytozome v12 database). Each plant family is well defined with subtending branches well supported (bootstrap support >78%; Fig. 1), except for the three families Caricaceae, Malvaceae, and Rosaceae. The relationships obtained within each family or subfamily are further well supported. Globally, the topology of the gene tree follows the expected species tree of angiosperms (e.g. clear division between monocots and eudicots; see Magallón ) but the relationships between several plants families in eudicots (i.e. Linaceae, Malvaceae, Phrymaceae, Rosaceae, Salicaceae, and Solanaceae) were not supported by high bootstrap values (Fig. 1). The low support obtained could be due to short branch lengths and the peculiar evolutionary history of the rbcS gene (see below). The rbcS gene tree estimated by PhyML shows a particular topology with the gene copies of the same species clustering together with high bootstrap support (Fig. 1). The phylogenetic analyses showed also deeper duplication events in several plant families (e.g. Brassicaceae and Rosaceae; shown by stars in Fig. 1) and there are a few exceptions, such as Citrus clementina and Ricinus communis, which have gene copies widely spread across the tree. The branch lengths leading to the Vitis vinifera_copy1, Malus domestica_copy4, and Citrus clementina_copy2 of rbcS lineage 1 are longer (Fig. 1), suggesting the accumulation of more substitutions in these specific gene copies. We observed a deletion of 192 bp in the sequence of Vitis vinifera_copy1, an insertion of 57 bp and a deletion of 72 bp in Malus domestica_copy4, and deletion of 285 bp in Citrus clementina_copy2. These large insertions/deletions could be signs of the loss (pseudogenization) or change of function of these gene copies, but a deeper investigation of these sequences should be carried out to fully understand these patterns. The phylogenetic tree of rbcS lineage 2, including 25 sequences available for 24 species, is shown in Fig. 2. As observed in the tree of rbcS lineage 1, there is a clear division between monocots and eudicots. Although the number of gene copies is limited, we observed the rbcS gene copies of each family (Euphorbiaceae, Fabaceae, Linaceae, Malvaceae, Rosaceae, Rutaceae, Salicaceae, and Solanaceae) cluster together. In the end, we excluded gene copies of rbcS lineage 2 from further analyses because (i) the copies of lineage 2 are more divergent than those of lineage 1, and the sequences of lineage 2 cannot be aligned reliably, and thus cannot be tested for positive selection and coevolution, and (ii) our focus was on the molecular evolution of the gene copies involved in photosynthesis. Minimum numbers of gene copies per species are shown in Supplementary Table S2. We show this table to demonstrate that the estimated number of rbcS gene copies varies between species, but given uncertainty in the quality of genome assemblies, it indicates a minimum number of rbcS gene copies per species. It is also possible that there are additional gene copies that were not available in the current version of genome assemblies.

Gene conversion and positive selection

We did not detect any significant signal of gene conversion (P>0.05). We tested rbcS sequences for signs of positive selection using the MEME model of HyPhy (Pond ). A strong signal of positive selection was detected in 13 sites (Table 1; Fig. 3). The episodes of positive selection were not associated with specific branches or duplication events.
Table 1.

Codon position of the aminoacid sequences of RBCS under positive selection

Amino acid residues in RuBisCO structure of Spinacia oleracea (1RCX of Protein Data Bank) p-value q-value
231.46E-032.66E-02
341.31E-045.39E-03
393.44E-048.88E-03
415.60E-038.27E-02
421.84E-046.32E-03
604.67E-037.62E-02
682.59E-034.46E-02
966.93E-039.34E-02
1011.07E-051.10E-03
1039.33E-061.10E-03
1077.87E-039.87E-02
1184.89E-041.01E-02
1197.96E-039.87E-02
Fig. 3.

RBCS residues under positive selection. Thirteen positions of rbcS showed strong signals of positive selection (see also Table 1). We plotted corresponding amino acid residues to RuBisCO structure of Spinacia oleracea (1RCX of Protein Data Bank; Taylor and Andersson, 1997). The light pink cartoon ribbons indicate RBCS chain C of 1RCX. The light grey, light blue, and dark grey cartoon ribbons indicate RBCL chains B, E, and H of 1RCX, respectively. The positions of RBCS under positive selection are shown as spheres in different colours. The upper panel shows the overview of the positions under positive selection. The other four panels show the zoom view of each sphere.

Codon position of the aminoacid sequences of RBCS under positive selection RBCS residues under positive selection. Thirteen positions of rbcS showed strong signals of positive selection (see also Table 1). We plotted corresponding amino acid residues to RuBisCO structure of Spinacia oleracea (1RCX of Protein Data Bank; Taylor and Andersson, 1997). The light pink cartoon ribbons indicate RBCS chain C of 1RCX. The light grey, light blue, and dark grey cartoon ribbons indicate RBCL chains B, E, and H of 1RCX, respectively. The positions of RBCS under positive selection are shown as spheres in different colours. The upper panel shows the overview of the positions under positive selection. The other four panels show the zoom view of each sphere. We tested a total of 84,525 pairs of sites to detect coevolution between rbcS and rbcL. Signs of coevolution were, as expected, pervasive between these two genes and 26,338 pairs had a dAIC value between the null and alternative model higher than the threshold of 9.893, which represented the 95% percentile of the distribution of dAIC obtained by simulating the evolution of the independent model along the rbcS gene tree (Supplementary Fig. S1A). Among these 26,338 pairs, we further looked at the strength of the signal by considering the ratio of the parameters s and d, which indicates a strong signal if its value is close to zero (Dib et al., 2014, 2015). The distribution of s/d ratios is shown in Supplementary Fig. S1B and we identified 7,828 profiles with an s/d ratio less than 0.1. The 28 pairs with the strongest signal of coevolution (dAIC more than 35 and s/d ratio less than 0.1) are listed in Table 2. Among these 28 pairs, we found five positions along the rbcS sequence, which encode residues 30, 67, 68, 70, and 104 of RBCS of 1RCX (Table 2; Fig. 4), that were each coevolving with multiple positions of rbcL encoding RBCL. Similarly, six positions of rbcL encoding residues 91, 95, 97, 349, 354, and 456 of RBCL of 1RCX (Table 2; Fig. 4) were found to be coevolving with multiple positions of rbcS encoding RBCS.
Table 2.

Coevolving sites between rbcS and rbcL and corresponding amino acid residues plotted to known RuBisCO structure of spinach

Nucleotide position of rbcS in our alignmentCorresponding amino acid residue of RBCS of 1RCXNucleotide position of rbcL in our alignmentCorresponding amino acid residue of RBCL of 1RCX s/d ratiodAIC
18313624479.11E-0244.5468
32027311975.49E-0236.26688
32830132376.30E-0235.73412
32830292914.40E-0237.72452
328304801536.36E-0235.06962
3283010503435.01E-0235.14996
3694310813542.49E-0241.5562
409578572796.00E-1036.22426
4416761145.42E-1038.19676
44167304954.47E-0249.84318
44167316998.87E-0235.08008
44167317998.87E-0235.08008
444686362058.63E-0235.08624
4446810683497.73E-0235.55796
50070189565.31E-0235.22768
50070225685.79E-1042.60568
500703751185.93E-1039.87838
5899910813545.21E-1038.31622
59610213894561.95E-0238.76078
601103304958.21E-0240.1506
602104292913.47E-0235.81118
604104156454.91E-0238.94026
604104304957.99E-0238.06524
6041046392067.46E-0240.35832
60410414124644.02E-0236.44858
61110713894563.18E-1038.82068
6341138462753.24E-0235.55106
65211810683494.16E-0235.65656
Fig. 4.

Coevolving residues between RBCS and RBCL. Coevolution between paired combinations of rbcS sites and rbcL sites was estimated by a maximum-likelihood implementation of Coev and dependent model using nucleotide sequences. The differences in AIC between pairs of models (dAIC) were calculated. AIC of the null model (9.893) was used as a threshold. We then filtered further with the s/d ratio threshold (0.1) according to previous studies (Dib et al., 2014, 2015). The 28 coevolving profiles with the strongest signals (dAIC >35 and s/d ratio <0.1) were selected (see Table 2). We identified the corresponding amino acid residues and plotted these sites using the qgraph function of R (Epskamp ). The residues of RBCS and RBCL are shown by pink and green filled circles, respectively and the numbers in the circles indicate the residues of known RuBIsCO structure (1RCX of Protein Data Bank; Taylor and Andersson, 1997). A coevolving profile pair is connected with a line. The coevolution profiles including the same RBCS residues are shown as lines of the same colour connected with multiple RBCL residues (e.g. lines in red, blue, yellow, orange, and green). Grey lines show the other coevolving profiles including the residues of RBCS that coevolve with a single RBCL among listed 28 profiles.

Coevolving sites between rbcS and rbcL and corresponding amino acid residues plotted to known RuBisCO structure of spinach Coevolving residues between RBCS and RBCL. Coevolution between paired combinations of rbcS sites and rbcL sites was estimated by a maximum-likelihood implementation of Coev and dependent model using nucleotide sequences. The differences in AIC between pairs of models (dAIC) were calculated. AIC of the null model (9.893) was used as a threshold. We then filtered further with the s/d ratio threshold (0.1) according to previous studies (Dib et al., 2014, 2015). The 28 coevolving profiles with the strongest signals (dAIC >35 and s/d ratio <0.1) were selected (see Table 2). We identified the corresponding amino acid residues and plotted these sites using the qgraph function of R (Epskamp ). The residues of RBCS and RBCL are shown by pink and green filled circles, respectively and the numbers in the circles indicate the residues of known RuBIsCO structure (1RCX of Protein Data Bank; Taylor and Andersson, 1997). A coevolving profile pair is connected with a line. The coevolution profiles including the same RBCS residues are shown as lines of the same colour connected with multiple RBCL residues (e.g. lines in red, blue, yellow, orange, and green). Grey lines show the other coevolving profiles including the residues of RBCS that coevolve with a single RBCL among listed 28 profiles. Among the 28 pairs of coevolving sites, one pair occurred in the region encoding the transit-peptide (nucleotide position 183 of our alignment; Table 2). In RBCS, the 11 residues (residues 27, 30, 43, 68, 70, 99, 102, 103, 104, 113, and 118) among the 14 residues with the strongest signal of coevolution (dAIC>35; Table 2) form the elements of the secondary structure (see Knight ). Among those residues, the three residues 33, 99, and 115 are in the core of the small subunit. In RBCL, the 12 residues among 21 with the strongest signal of coevolution listed in Table 2 also form the elements of the secondary structure (residues 447 and 456 in the C-terminal domain; 37, 56, 97, 99, 118, 275, 279, 343, 349, and 354 in the N-terminal domain; see Knight ) and residues 45, 118, 205, 275, and 279 of RBCL are at the interface between two RBCLs (see the bold residues in Supplementary Fig. S2; see Knight ). Furthermore, residues 107 and 118 of RBCS (reference sequence 1RCX) were detected to be evolving under positive selection and are coevolving with RBCL. Our phylogenetic analyses indicated that at least two plant families (Rosaceae and Brassicaceae; Fig. 1) had old duplication events during their evolutionary history. In contrast, the Poaceae family did not show any signs of old duplication event within the family (Fig. 1). The large sequence divergence between gene copies in Brassicaceae could lead to a variable stability of the heterodimers formed with the single RBCL protein when different rbcS gene copies are involved. We therefore compared the characteristics of each gene copy from both the Brassicaceae and Poaceae by estimating the Gibbs free energy of the RuBisCO structure. In Poaceae, the Gibbs free energy values estimated were similar for gene copies of the same species (Fig. 5; Table 3). There was also a clear distinction between the values for the Pooideae, represented by Brachypodium distachyon, and for representatives of the PACMAD clade (Zea mays and Setaria italica). Oryza sativa was not included in our analysis because (i) the translated amino acid sequences of LOC_Os12g17600 and LOC_Os12g19470 are identical, and (ii) the translated amino acid sequence of the other copy, LOC_Os12g19381, had the insertion of one nucleotide that causes a frame shift. In Brassicaceae, we expected differences of Gibbs free energy values between gene copies because their duplication is relatively old, having taken place during the early steps of diversification of the family. However, the estimated Gibbs free energy values showed a clear clustering by species (Fig. 5; Table 3). This shows that stabilities for the RuBisCO complex within the species are consistent, despite different evolutionary histories of gene copies within the same species.
Fig. 5.

Stability of modelled RuBisCO structure. The phylogenetic trees of rbcS in Brassicaceae and Poaceae are shown in the upper panels. RuBisCO protein structures with RBCS encoded by each rbcS were estimated by homology modeling using Modeller (Eswar ). The structure was repaired by the RepairPDB function of FoldX v4 (Schymkowitz ). The stability of the whole RuBisCO was estimated using the ‘Stability’ function of FoldX4. Then, the result of protein stability was taken as a trait and phylogenetic relationships were given as input trees. We then drew a phenogram using the phytools package (Revell, 2012) in R (lower panels). Sit5 is shown as representative of Sit4/Sit5 because of synonymous substitutions.

Table 3.

Delta Gibbs free energy of modelled RuBisCO structure

Species namesName of each gene copyDifferences of Gibbs free energy between maximum likelihood model and null model
Brassicaceae Arabidopsis lyrata Aly1 −205.782
Aly2 −186.133
Aly3 −216.206
Arabidopsis thaliana Ath1 −203.385
Ath2 −234.677
Ath3 −218.445
Ath4 −227.182
Brassica rapa Bra1 −186.759
Bra2 −209.734
Bra3 −182.142
Bra4 −213.151
Bra5 −193.802
Capsella rubella Cru1 −226.842
Cru2 −204.699
Cru3 −206.047
Cru4 −244.759
Eutrema solsugineum Esa1 23.9451
Esa2 20.741
Esa3 28.6711
Poaceae Brachypodium distachyon Bdi1 −332.978
Bdi2 −310.927
Setaria italica Sit1 −231.832
Sit2 −207.691
Sit3 −221.541
Sit4/Sit5 −204.342
Zea mays Zma1 −231.694
Zma2 −228.502
Delta Gibbs free energy of modelled RuBisCO structure Stability of modelled RuBisCO structure. The phylogenetic trees of rbcS in Brassicaceae and Poaceae are shown in the upper panels. RuBisCO protein structures with RBCS encoded by each rbcS were estimated by homology modeling using Modeller (Eswar ). The structure was repaired by the RepairPDB function of FoldX v4 (Schymkowitz ). The stability of the whole RuBisCO was estimated using the ‘Stability’ function of FoldX4. Then, the result of protein stability was taken as a trait and phylogenetic relationships were given as input trees. We then drew a phenogram using the phytools package (Revell, 2012) in R (lower panels). Sit5 is shown as representative of Sit4/Sit5 because of synonymous substitutions.

Discussion

In this study, we investigated the evolution of the small subunit of the RuBisCO protein in 43 species of angiosperms. We characterized the differences between each rbcS gene copy by testing coevolution between rbcS and rbcL and the influence of each copy on the stability of the enzyme. We reconstructed the phylogenetic relationships of the rbcS gene copies, and this showed a pattern in which gene copies of the same species were more closely related to each other than those of different species. We did not detect a significant signal of gene conversion but found extensive coevolution between the RBCS and RBCL subunits. The presence of coevolution between these two genes that encode tightly linked proteins was expected. Besides that, we detected that the same nucleotide positions of each rbcS copy coevolve with the exact same positions of rbcL. This suggests that the coevolution between rbcS and rbcL did not involve specific rbcS gene copies, but represented rather a pervasive process throughout the evolution of these genes. We finally identified several sites that are evolving under positive selection in rbcS and showed through homology modelling that the incorporation of any of the rbcS sequence for a given species does not affect significantly the stability of the RuBisCO protein.

Number of rbcS gene copies per species

The number of rbcS gene copies per species used in this study is shown in Supplementary Table S2. The number of rbcS gene copies has been estimated with different methods in several studies (Galili ; Ogihara ; Sasanuma, 2001; Thomas-Hall ; Kapralov ; Miller, 2014). For example, the number of rbcS copies in wheat has been estimated to range from 21 gene copies (Southern hybridization analysis; Galili ) to 100 gene copies (slot-blot analysis; Ogihara ). Absolute detection of the number of gene copies is remarkably complex (Cantsilieris ). The estimation of the gene copy number variation became more reliable thanks to the released genomic data. However, the reported number of gene copies changes when using newly released genomes with an improved method of assembly. Also, we should note that the precision of our method is limited by the genome assembly quality, and could be substantially degraded for various reasons, e.g. tandem copies of the gene are missing from the assembly (Panchy ).

Phylogenetic reconstruction of the rbcS gene family

The topology of the rbcS gene tree within each angiosperm family mostly follows the topology of the expected species tree. In most species of angiosperms, gene copies of the same species were more closely related than those of different species. This pattern has already been reported within some species of the same genus such as Solanum and Flaveria (Pichersky and Cashmore, 1986; Kapralov ). Our analysis is, however, the first to show that this pattern is not restricted to specific genera and is present across all angiosperms using nucleotide sequences. We also found family-specific duplication events in Brassicaceae and Rosaceae (Fig. 1). In general, the evolution of multigene families is affected by a number of processes that involve either divergent, concerted, or birth-and-death evolution (Nei and Rooney, 2005). Nei and Rooney (2005) defined divergent evolution as a mechanism by which gene copies of the common ancestral species are retained after speciation in descendant species while diverging through the accumulation of substitutions. However, we observed copy number variation between species and also non-expressed copies of the rbcS gene, which makes divergent evolution unlikely to be the main process behind the rbcS evolution. Gene copies of the same species were more similar than gene copies of different species (Fig. 1). Such similarity between gene copies within species is often the result of frequent gene conversions between gene copies during concerted evolution. Sugita and colleagues have suggested that the high similarity of the rbcS copies of Solanum lycopersicum is likely to be explained by gene conversion (Sugita ). We tested for gene conversion using RDP4 (Martin ) and CHAP2 (Song ). However, we could not detect any significant signal of gene conversion across angiosperms. This result is congruent with the results of Miller (2014) who did not find clear evidence of gene conversion between rbcS gene copies of Solanaceae species. Additionally, we observed that the gene copies of the same species are separated by long branches, such as those found in Linum usitatissimum or Mimulus guttatus (Fig. 1). These genes are unlikely to be affected by concerted evolution because in such cases gene copies would be less genetically distant due to frequent gene conversions and crossing-over. Our results suggest, therefore, that rbcS evolved following a birth-and-death process (Nei and Rooney, 2005). The observed pattern of the rbcS tree may have occurred by frequent recent duplications followed by pseudogenization and/or gene loss.

Retention rate of duplicates and two lineages of rbcS

The topology of the rbcS tree suggests that gene copies that may have originated from the ancient duplication events have been lost (except the event that led to the emergence of rbcS lineages 1 and 2, and ones before the divergence of Brassicaceae and Rosaceae), while gene copies that may have originated from recent events have been retained. The two lineages of rbcS have already been identified using amino acid sequences of angiosperms, gymnosperms, pteridophytes, and bryophytes (Pottier ). Our phylogenetic tree based on the nucleotide sequences showed that the two rbcS lineages (rbcS lineage 1 and rbcS lineage 2) have originated from a duplication event before the divergence of monocots and eudicots. rbcS lineage 1 (shown in Fig. 1) includes gene copies that are expressed in photosynthetic organs (Cheng ; Yoon ). rbcS lineage 2 (shown in Fig. 2) includes gene copies that are expressed in non-photosynthetic organs such as OsRbcS1 (Morita ). All sampled species of angiosperms carry gene copies of rbcS lineage 1 except Citrus sinensis, but only a few carry copies of rbcS lineage 2 (see Supplementary Table S2). The gene copies of lineage 1, given their ubiquitous presence in almost all the species sampled and their expressions associated with photosynthetic organs, are probably the functionally important gene copies involved in photosynthesis. However, the rbcS gene copies of lineage 2 have been kept in a few species of angiosperms since the divergence of monocots and eudicots. Also, higher numbers of rbcS copies of lineage 2 were found in seedless plants (bryophytes and pteridophytes) than in angiosperms, and they were shown to be more similar to the copies of lineage 2 than to those of lineage 1 in angiosperms (Pottier ). Although the exact function of the rbcS copies from lineage 2 is still unclear, a study found that the incorporation of RBCS encoded by the OsRbcS1 gene, which belongs to lineage 2 of rice, into RuBisCO increased the catalytic turnover rate of this enzyme (Morita ). Furthermore, the RuBisCO of Chlamydomonas reinhardtii showed higher carboxylation rate and higher affinity to CO2 if the rbcS gene copy of lineage 2 from Nicotiana tabacum was inserted rather than the rbcS gene copy of lineage 1 (Laterre ). There is clearly the need to study in more detail the exact roles played by gene copies of rbcS lineage 2 in angiosperms, in particular, to understand if these copies are functional in all the species carrying this lineage. If this is the case, we will still have to investigate how gene copies of rbcS lineage 2 may contribute to the improvement of catalytic properties of RuBisCO given they do not exist in all the species. Pottier have suggested that the rbcS lineage 2 may have been the predominant copies when RuBisCO was surrounded by a lower concentration of O2. Investigating the environmental habitat of the species carrying the gene copies of lineage 2 may help to understand the reason why they are carried only by a few species in angiosperms.

Positive selection and coevolution analyses

Another goal was to estimate the selective pressure acting on rbcS and uncover the coevolution between rbcS and rbcL encoding the subunits of the RuBisCO protein by estimating the coevolution between pairs of sites from these two genes. We detected positive selection in 13 positions along the rbcS sequence (Table 1), which indicates that the evolution of the rbcS gene is affected by episodic events of positive selection. The adaptation of the RuBisCO protein, which has been previously attributed mainly to the evolution of rbcL (Kapralov and Filatov, 2007; Christin, ), could thus also be mediated by changes occurring within the gene encoding the small subunit. We further detected extensive signals of coevolution between the two subunits, which reinforces our understanding of the tight interaction between the two subunits. Our result from the coevolution analysis is consistent with the analyses based on the correlation of codon usage bias between two genes (Pei ). We can go a step further with our analyses by identifying the residues that are potentially interacting in the 3D structure formed by RBCS and RBCL. Our study shows that there are extensive signals of coevolution between the residues of the two subunits. Some of the residues (residues 14, 56, and 95) that we identified as coevolving were also previously reported as coevolving intra-RBCL in gymnosperms species (Sen ). These residues of RBCL coevolve with specific residues of RBCS, and then, again, these residues of RBCS coevolve with multiple residues of RBCL (e.g. residue 14 of RBCL is the coevolving site intra-RBCL, but it also coevolves with residue 67 of RBCS; then, residue 67 of RBCS coevolves also with residues 95 and 99 of RBCL; see Fig. 4). The coevolution between some specific residues of RBCS and RBCL may have been driven by the tight interaction between the two subunits. We found that the majority of coevolving RBCL positions are located in the N-terminal domain (10 residues among 12; Supplementary Fig. S2). Knight observed that the residues involved in the interaction of subunits are mostly found in the N-terminal domain. It is suggested that residues 30 and 70 of RBCS coevolving with multiple RBCL residues in the N-terminal domain might be key sites for the interaction of the subunits. Similarly, the residues of RBCS coevolving with residues 45, 118, 205, 275, and 279 of RBCL at the interface between two RBCL subunits could be potentially involved in the assembly of RuBisCO structures. Furthermore, Knight have observed that residues in the C-terminal domain are often involved in the catalysis and binding of substrates. The catalytic sites of RuBisCO are found in the RBCL subunit, and the coevolving residues of RBCS (e.g. residue 102) that mostly interact with residues of RBCL in the C-terminus (e.g. residue 456) are probably not directly involved in the activity of the RuBisCO. However, these RBCS residues may coordinately react to the functional changes on the large subunits and thus be involved in the maintenance of the 3D structure. One of the coevolving residues of RBCL (residue 449 of 1RCX; dAIC value=23.46404 and s/d ratio=7.43E-10; with a significant signal of coevolution but not included in the 28 residues with the strongest signals) is part of a codon that is highly conserved between higher plants and the algae Chlamydomonas (Marín-Navarro and Moreno, 2006). The change of this amino acid from a cysteine to a serine has been shown to drastically increase the degradation of RuBisCO in Chlamydomonas (Marín-Navarro and Moreno, 2006). Residue 16 of RBCS (1RCX), coevolving with residue 449 of RBCL, further coevolves with another residue of RBCL (residue 40 of 1RCX; dAIC=15.4967 and s/d ratio=9.44E-10; with a significant signal of coevolution but not included in the 28 residues with the strongest signals), which was also described as important for the degradation of the RuBisCO (Kokubun ). Our results could indicate that residue 16 in the small subunit may also be involved in the protection against degradation of RuBisCO. In our study, we show positive selection acting on the rbcS gene, and positively selected rbcS sites that are coevolving with rbcL. The residues 107 and 118 are both under positive selection and coevolving with rbcL. These results may suggest that the substitution of an amino acid of RBCL may coordinately lead to the substitution of an amino acid of RBCS, and vice versa. Chakrabarti and Panchenko (2010) have suggested that functionally important sites undergo coevolution. Some of the positively selected sites or coevolving sites are on the interface of RBCS and RBCL. We suppose that the evolutionary processes of RBCS and RBCL are profoundly influenced by each other. These reported positively selected positions of rbcS and coevolving positions of rbcS with rbcL may be important sites for the structure and the function of RBCS and these results may help to elucidate the function of RBCS. Another goal was to understand the differences of stability between different gene copies. The composition of the RBCS subunits within the RuBisCO complex in vivo is not known. Structural stability is an important feature in an enzyme, which tends to evolve in a narrow range of stability. RuBisCO is no exception and it was observed that some amino acid substitutions under positive selection can slightly shift the stability during adaptation, in order to improve the catalytic efficiency while keeping the global fold intact (Studer ). We were thus interested to see if the differences in the multiple copies of rbcS could significantly impact the stability of the RuBisCO complex. Our protein stability modelling suggests that gene copies of rbcS lineage 1 of the same species may have similar functions in spite of their different evolutionary histories. Sasanuma (2001) investigated the fate of newly duplicated rbcS genes in Triticum spp. and found evidence of homogenization and pseudogenized genes, but no evidence of gaining new functions was detected. Therefore, multiple gene copies may exist for robustness (Wagner, 2005; Plata and Vitkup, 2014) to maintain the important function of RuBisCO. Like Sasanuma’s, our results suggest that _rbcS_ is robust to gene dosage effect. As RuBisCO is necessary for plants to survive, the robustness of the rbcS gene can assist plant adaptation to drastic environmental change or prevent lack of RBCS when some of the copies are lost. Further investigation is required if we are to understand rbcS evolution in more detail. The evolutionary history of rbcS is complex to track but we suppose that studying rbcS will allow for a deeper understanding of the multigene family.

Concluding remarks

Investigating the mechanisms that have shaped the evolution of the RuBisCO complex is important for understanding the function of this key enzyme in photosynthesis. This is usually done by looking at the chloroplast gene rbcL, but this approach only provides half of the picture and it is important to consider the evolution of the small subunit encoded by the nuclear gene family rbcS. Although rbcS has a more complex evolutionary history than rbcL, involving the appearance of multiple gene copies, there are strong connections between the two subunits, as detected in the coevolution analysis of rbcS and rbcL. Some coevolving or positively selected positions are at the interface of RBCS and RBCL. A striking example is positions 107 and 118 of RBCS, which are both under positive selection and coevolving with rbcL. These results suggest substantial interactions between the subunits. However, the coevolution is not occurring between a specific rbcS gene copy and rbcL. Further, the differences of the evolutionary history of each of the gene copies do not lead to differences in the stability of the RuBisCO. We thus propose: (i) that rbcS gene copies are created under neutral evolutionary processes, or (ii) that different copies are kept by the selective pressure that allows plants to cope with different environmental conditions or to be expressed differently in each organ. We need to further investigate the mechanism and the rate of gain and loss of rbcS. Transcriptome data of rbcS in different organs and different conditions (e.g. temperature, aridity) may help us to understand if these copies are playing a role in maintaining stoichiometry.

Supplementary data

Supplementary data are available at JXB online. Fig. S1. dAIC and s/d ratio distributions of frequency of coevolving profiles by Coev model. Fig. S2. The residues of RBCS and RBCL under coevolution plotted on known RuBisCO structure of spinach. Table S1. Correspondence of gene copy names in Fig. 1 and Gene ID in Phytozome database. Table S2. Minimum number of rbcS gene copies per species in angiosperms. Click here for additional data file.
  85 in total

1.  The degradation of the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase into the 44-kDa fragment in the lysates of chloroplasts incubated in darkness.

Authors:  Norimoto Kokubun; Hiroyuki Ishida; Amane Makino; Tadahiko Mae
Journal:  Plant Cell Physiol       Date:  2002-11       Impact factor: 4.927

Review 2.  The evolutionary dynamics of plant duplicate genes.

Authors:  Richard C Moore; Michael D Purugganan
Journal:  Curr Opin Plant Biol       Date:  2005-04       Impact factor: 7.834

3.  Phylogenetic and molecular analysis of the ribulose-1,5-bisphosphate carboxylase small subunit gene family in banana.

Authors:  Skye Thomas-Hall; Paul R Campbell; Katrien Carlens; Emi Kawanishi; Rony Swennen; László Sági; Peer M Schenk
Journal:  J Exp Bot       Date:  2007-06-21       Impact factor: 6.992

4.  Performance of standard and stochastic branch-site models for detecting positive selection among coding sequences.

Authors:  Ashley Lu; Stéphane Guindon
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

5.  Evolution by the birth-and-death process in multigene families of the vertebrate immune system.

Authors:  M Nei; X Gu; T Sitnikova
Journal:  Proc Natl Acad Sci U S A       Date:  1997-07-22       Impact factor: 11.205

Review 6.  Molecular methods for genotyping complex copy number polymorphisms.

Authors:  Stuart Cantsilieris; Paul N Baird; Stefan J White
Journal:  Genomics       Date:  2012-10-30       Impact factor: 5.736

7.  A new enzyme for the interconversion of pyruvate and phosphopyruvate and its role in the C4 dicarboxylic acid pathway of photosynthesis.

Authors:  M D Hatch; C R Slack
Journal:  Biochem J       Date:  1968-01       Impact factor: 3.857

8.  The biochemistry of Rubisco in Flaveria.

Authors:  David S Kubien; Spencer M Whitney; Paige V Moore; Linley K Jesson
Journal:  J Exp Bot       Date:  2008-01-27       Impact factor: 6.992

9.  Chromosome assignment of four photosynthesis-related genes and their variability in wheat species.

Authors:  Y Ogihara; H Shimizu; K Hasegawa; H Tsujimoto; T Sasakuma
Journal:  Theor Appl Genet       Date:  1994-06       Impact factor: 5.699

10.  Molecular evolution of rbcL in three gymnosperm families: identifying adaptive and coevolutionary patterns.

Authors:  Lin Sen; Mario A Fares; Bo Liang; Lei Gao; Bo Wang; Ting Wang; Ying-Juan Su
Journal:  Biol Direct       Date:  2011-06-03       Impact factor: 4.540

View more
  4 in total

1.  An Antisense Circular RNA Regulates Expression of RuBisCO Small Subunit Genes in Arabidopsis.

Authors:  He Zhang; Shuai Liu; Xinyu Li; Lijuan Yao; Hongyang Wu; František Baluška; Yinglang Wan
Journal:  Front Plant Sci       Date:  2021-05-24       Impact factor: 5.753

2.  Coevolution between MHC Class I and Antigen-Processing Genes in Salamanders.

Authors:  Gemma Palomar; Katarzyna Dudek; Magdalena Migalska; J W Arntzen; G Francesco Ficetola; Dušan Jelić; Elizabeth Jockusch; Inigo Martínez-Solano; Masatoshi Matsunami; H Bradley Shaffer; Judit Vörös; Bruce Waldman; Ben Wielstra; Wiesław Babik
Journal:  Mol Biol Evol       Date:  2021-10-27       Impact factor: 16.240

3.  Rubiscosome gene expression is balanced across the hexaploid wheat genome.

Authors:  Louis Caruana; Douglas J Orr; Elizabete Carmo-Silva
Journal:  Photosynth Res       Date:  2022-01-27       Impact factor: 3.429

4.  A temporal gradient of cytonuclear coordination of chaperonins and chaperones during RuBisCo biogenesis in allopolyploid plants.

Authors:  Changping Li; Baoxu Ding; Xintong Ma; Xuan Yang; Hongyan Wang; Yuefan Dong; Zhibin Zhang; Jinbin Wang; Xiaochong Li; Yanan Yu; Yiyang Yu; Bao Liu; Jonathan F Wendel; Yidan Li; Tianya Wang; Lei Gong
Journal:  Proc Natl Acad Sci U S A       Date:  2022-08-15       Impact factor: 12.779

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.