Literature DB >> 33342010

Hexamerization and thermostability emerged very early during geranylgeranylglyceryl phosphate synthase evolution.

Cosimo Kropp1, Kristina Straub1, Mona Linde1, Patrick Babinger1.   

Abstract

A large number of archaea live in hyperthermophilic environments. In consequence, their proteins need to adopt to these harsh conditions, including the enzymes that catalyze the synthesis of their membrane ether lipids. The enzyme that catalyzes the formation of the first ether bond in these lipids, geranylgeranylglyceryl phosphate synthase (GGGPS), exists as a hexamer in many hyperthermophilic archaea, and a recent study suggested that hexamerization serves for a fine-tuning of the flexibility - stability trade-off under hyperthermophilic conditions. We have recently reconstructed the sequences of ancestral group II GGGPS enzymes and now present a detailed biochemical characterization of nine of these predecessors, which allowed us to trace back the evolution of hexameric GGGPS and to draw conclusions about the properties of extant GGGPS branches that were not accessible to experiments up to now. Almost all ancestral GGGPS proteins formed hexamers, which demonstrates that hexamerization is even more widespread among the GGGPS family than previously assumed. Furthermore, all experimentally studied ancestral proteins showed high thermostability. Our results indicate that the hexameric oligomerization state and thermostability were present very early during the evolution of group II GGGPS, while the fine tuning of the flexibility - stability trade-off developed very late, independent of the emergence of hexamerization.
© 2021 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.

Entities:  

Keywords:  ancestral sequence reconstruction; enzyme; ether lipids; oligomerization; protein evolution; protein stability; thermostability

Year:  2021        PMID: 33342010      PMCID: PMC7888582          DOI: 10.1002/pro.4016

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


sometimes abbreviated as _N.., reconstructed ancestral variant of group II GGGPS at node N.. ancestral sequence reconstruction differential scanning calorimetry glycerol 1‐phosphate glycerol 3‐phosphate geranylgeranylglycerol geranylgeranylglyceryl phosphate geranylgeranylglyceryl phosphate synthase geranylgeranyl pyrophosphate isopropyl‐β‐D‐1‐thiogalactopyranoside isothermal titration calorimetry last common ancestor differential scanning fluorimetry size exclusion chromatography static light scattering

INTRODUCTION

The membrane lipids of Archaea have a quite different composition than those of Bacteria and Eukaryotes. While the latter consist of fatty acids that are linked to a glycerol 3‐phosphate (G3P) backbone by ester bonds, archaeal membrane lipids are ether lipids. Their backbone is a glycerol 1‐phosphate (G1P), to which polyprenyl moieties are linked. The formation of the first ether bond is catalyzed by the enzyme geranylgeranyglyceryl phosphate synthase (GGGPS), which usually transfers a geranylgeranyl residue (C20; four isoprene units) to the G1P. GGGPS is very interesting in the context of evolution, because it is a key enzyme in the context of this so‐called “lipid divide” between the kingdoms of Archaea on the one hand, and Bacteria and Eukarya on the other. Recent studies have unexpectedly revealed that GGGPS‐like enzymes also occur in some bacteria. There, ether lipids with either four (Bacteroidetes) or seven (Firmicutes) isoprene units are synthesized, , , but their biological function is still unknown. In Firmicutes, they are further processed by acetylation, , while a recent study suggests that bacteria of the Fibrobacteres–Chlorobi–Bacteroidetes (FCB) group might synthesize prototypical archaeal ether lipids. The GGGPS enzyme family can be divided into two groups, both comprising archaeal and bacterial members (Figure 1). While the enzymes of Firmicutes and of a few Euryarchaeota, among them all Halobacteria, belong to group I, most Archaea and the Bacteroidetes possess a group II enzyme. In the recent years, the oligomerization states and structures of many GGGPS representatives have been experimentally determined (as summarized and referenced in Table S1). Up to now, only dimeric variants have been discovered within group I, while most group II enzymes have been described as homohexamers. Recently, we have shown that the (βα)8‐barrel backbone of several GGGPS enzymes (mtGGGPS, tkGGGPS, taGGGPS, slGGGPS, fjGGGPS) is structurally stable at hyperthermophilic conditions, independent of the oligomerization state. However, some GGGPS like mtGGGPS or fjGGGPS show a higher flexibility of the active site, going along which higher activity. Hexamerization stabilizes this portion of the protein, which otherwise would collapse at elevated temperatures.
FIGURE 1

Simplified phylogenetic tree of the GGGPS enzyme family. The oligomerization states of a number of representatives have been experimentally identified (Table S1) and are indicated by 2 or 6 grey rectangles (dimeric / hexameric). The oligomerization states of Crenarchaeota and Thaumarchaeota where elucidated within this work. LCA, last common ancestor; LCA‐I / LCA‐II, LCA of group I / group II. Putative oligomerization states of the LCAs, as discussed within this work, are indicated by white rectangles

Simplified phylogenetic tree of the GGGPS enzyme family. The oligomerization states of a number of representatives have been experimentally identified (Table S1) and are indicated by 2 or 6 grey rectangles (dimeric / hexameric). The oligomerization states of Crenarchaeota and Thaumarchaeota where elucidated within this work. LCA, last common ancestor; LCA‐I / LCA‐II, LCA of group I / group II. Putative oligomerization states of the LCAs, as discussed within this work, are indicated by white rectangles The hexamer is formed as a “trimer of dimers”, where the conformation of the dimeric building blocks is identical to those of the natively dimeric GGGPS enzymes. , The conformation of the hexamer is stabilized by three individual contact interfaces between the protomers: The “dimer module interface” is symmetric and is present in dimeric GGGPS as well. The hexamer is stabilized by an additional symmetric interface between the dimer modules which we called “interconnecting interface”, plus an asymmetric interface that we called “ring interface”, because it stabilizes rings of three protomers each. As a hallmark of hexameric GGGPS, the presence of an aromatic amino acid (“aromatic anchor”) within the ring interface has been identified. It forms a cation‐π interaction with a lysine or arginine from the opposing protomer that is supposed to be essential for hexamerization. However, several group II sequences contain a histidine at the position of the aromatic anchor, particularly a large group of crenarchaeal sequences, which led to the assumption that they might be dimers. The corresponding proteins were not amenable to in vitro studies up to now, so that their true oligomerization state remained unclear. Ancestral sequence reconstruction (ASR) is a powerful method to deduce environmental conditions in the Precambrian era and to study the evolution of proteins in terms of their structural stability as well as their interaction specificities with ligands and other proteins. , , We recently have reconstructed the sequences of ancestral GGGPS enzymes and now provide a comprehensive in vitro characterization of nine of these predecessors to trace back the evolution of hexameric GGGPS. We were able to successfully purify and study crenarchaeal predecessors with a His at the above mentioned “anchor position” and show here that they also form hexamers. Furthermore, we tested the ancestral GGGPS proteins for their thermostability. Our results indicate that hexamerization and thermostability were already present at the very beginning of group II GGGPS evolution, while the above mentioned fine‐tuning of active site flexibility developed late and independent of the emergence of hexamerization. Furthermore, we conclude that the few extant dimeric representatives among group II (Thermoplasmatales, Flavobacteriales) emerged by secondary events.

RESULTS

Ancestral sequence reconstruction of group

Ancestral sequence reconstruction of GGGPS has been performed to evaluate FitSS4ASR, a tool to prepare representative sequence sets for ASR, and is described in detail elsewhere. We initially strived to perform ASR with a sequence set comprising both GGGPS groups I and II. However, this turned out to be not possible, because the length of the edge that interconnects the nodes representing the ancestors of groups I and II was too long (> 4 substitutions per site). We then concentrated on the more interesting group II with its diverse oligomerization states. For practical reasons, all experiments presented here are based on the ASR from the manually curated sequence set as described earlier, comprising 87 extant GGGPS group II variants. A simplified and annotated phylogenetic tree deduced from these sequences is shown in Figure 2; the complete tree with all nodes and leaves is shown in Figure S1. The names and properties of all GGGPS proteins used in this study are summarized in Table S1 and Table S2. We did not evaluate the properties of the last common ancestor (LCA) of all group II enzymes (LCA‐II, N1), because it would require to use an outgroup to reconstruct a confident sequence for it. We tested several group I sequences as outgroup, but this resulted in non‐robust trees. We use the same nomenclature to address the reconstructed proteins as introduced earlier, “AncGGGPS2_N..” stands for “ancestor of group II GGGPS” at node N.. .
FIGURE 2

Simplified phylogenetic tree used for ASR of GGGPS group II predecessors. Nodes representing ancestral sequences that were characterized experimentally are marked with blue dots and are labeled (N..). Leaves representing extant sequences that were characterized experimentally and that contributed to the calculation of the tree are shown with their names (cf. Table S2) and labeled in green. Protein names in brackets stand for representative proteins of the respective phyla which have been characterized previously (cf. Table S1), but whose sequences have not been used to calculate the tree. Oligomerization states of ancestral and extant proteins are indicated by grey rectangles, dimer‐hexamer equilibria are marked by a *. The identity of the amino acid located at the “aromatic anchor” position is given as a grey letter. For branches with no studied extant proteins the most conserved residue at that position is given (index con). In red numbers, the transition temperatures T2 of thermal denaturation [°C] are given. The T2 values result from DSC experiments (no index), or from nanoDSF experiments (index n; Table 1). Branches that denote evolutionary paths with nodes representing GGGPS sequences that have been characterized within this study are shown in thick lines. Branches with nodes without experimental data are indicated by thin lines or are omitted for clarity. The full tree is shown in Figure S1. Close to the leaves, the names of the orders and phyla of the extant representatives are given

Simplified phylogenetic tree used for ASR of GGGPS group II predecessors. Nodes representing ancestral sequences that were characterized experimentally are marked with blue dots and are labeled (N..). Leaves representing extant sequences that were characterized experimentally and that contributed to the calculation of the tree are shown with their names (cf. Table S2) and labeled in green. Protein names in brackets stand for representative proteins of the respective phyla which have been characterized previously (cf. Table S1), but whose sequences have not been used to calculate the tree. Oligomerization states of ancestral and extant proteins are indicated by grey rectangles, dimer‐hexamer equilibria are marked by a *. The identity of the amino acid located at the “aromatic anchor” position is given as a grey letter. For branches with no studied extant proteins the most conserved residue at that position is given (index con). In red numbers, the transition temperatures T2 of thermal denaturation [°C] are given. The T2 values result from DSC experiments (no index), or from nanoDSF experiments (index n; Table 1). Branches that denote evolutionary paths with nodes representing GGGPS sequences that have been characterized within this study are shown in thick lines. Branches with nodes without experimental data are indicated by thin lines or are omitted for clarity. The full tree is shown in Figure S1. Close to the leaves, the names of the orders and phyla of the extant representatives are given
TABLE 1

Temperatures of thermal transitions of different GGGPS proteins as followed by nanoDSF and DSC

nanoDSFDSC
ProteinT1 [°C]T2 [°C]T1 [°C]T2 [°C]
AncGGGPS2_N2>95107.8 ± 0.1
AncGGGPS2_N3>95113.4 ± 0.5
AncGGGPS2_N488.4 ± 0.089.2 ± 0.3
AncGGGPS2_N581.0 ± 0.385.2 ± 0.2
AncGGGPS2_N12>95118.4 ± 0.2
AncGGGPS2_N18>95103.5 ± 0.0
AncGGGPS2_N52>95n.d.n.d.
AncGGGPS2_N72>95n.d.n.d.
AncGGGPS2_N7358.4 ± 0.1 b n.d.n.d.
rsGGGPS88.8 ± 0.291.8 ± 0.6
fsGGGPS38.8 ± 0.0>9539.6 ± 0.3104.9 ± 0.3
mrGGGPS>95>9596.4 ± 0.1126.3 ± 0.5
nvGGGPS78.2 ± 0.180.3 ± 1.4
mtGGGPS>95>95102.5 ± 0.5127.0 ± 0.5
tkGGGPSn.d.n.d.115.0 ± 0.0
taGGGPSn.d.n.d.76.0 ± 1.4
slGGGPSn.d.n.d.83.1 ± 0.9
fjGGGPS53.2 ± 0.0>9554.1 ± 0.1110.0 ± 0.1

The first block shows ancestral proteins, the second block proteins studied in this work, the raw data are shown in Figure S8. The third block shows proteins that have been investigated earlier. The given temperatures are the mean of duplicates, standard deviations are given. Note that mtGGGPS shows two T1 transitions (T1, T1* ). Only the transition leading to inactivation is given. ‐, T1 does not exist; n.d., not determined.

The experimentally obtained data do not allow to decide whether the observed transition is due to a partial (T1) or complete denaturation (T2). See methods section for details.

Probably all crenarchaeal

We have postulated in a previous study that an aromatic residue (“aromatic anchor”; Trp, Phe, Tyr) in the so called “ring interface” is a hallmark of hexameric GGGPS. This residue forms a cation‐π interaction with a lysine or arginine of the opposite protomer, and when the aromatic residue is mutated to an alanine, the hexamer falls apart into dimers. An analysis of group II GGGPS sequences by calculation of sequence similarity networks (SSN) revealed that in most sequence clusters this aromatic residue is highly conserved, except in two of them. One of the two clusters representing bacterial group II GGGPS (represented by the branch “Flavobacteriales” in Figure 2) has charged residues at this position (Figure 3(a)ii), and accordingly, extant representatives of this cluster (fjGGGPS, zpGGGPS; Table S1) have been demonstrated to be dimeric. The other sequence cluster without the aromatic residue is represented by the branches “Desulfurococcales” and “Sulfolobales” in Figure 2. In these sequences, a histidine is highly conserved at this position (Figure 3(a)iii). Extant proteins of these branches have not been accessible by heterologous expression up to now except hexameric apGGGPS, which is quite diverse in sequence and has a Phe at the anchor position. We now expressed synthetic genes of ancestral GGGPS and were successful in purifying a predecessor protein of the “Sulfolobales” branch, AncGGGPS2_N52 (Figure 2, Table S1, Table S2), which also has a His at the anchor position. AncGGGPS2_N52 displays high activity in an enzymatic assay (Figure S2) and shows a well‐defined structure in circular dichroism (CD) spectroscopy (Figure S3A). Analytical size exclusion chromatography (SEC) revealed that AncGGGPS2_N52 forms a hexamer (Figure 3(b)), indicating that His might replace the conventional aromatic residues Trp, Phe, Tyr as the “aromatic anchor”. In fact, His is less frequently observed in cation‐π interactions and usually rather plays the cation role in its protonated form, but it can act as the aromatic motif in such interactions. , , In the Sulfolobales sequences, the position of the cationic interaction partner is occupied by a conserved Arg (Figure 3(a)iii).
FIGURE 3

Analysis of GGGPS variants with histidine as “aromatic anchor”. (a) Relative frequency of amino acid residues at the positions that form a cation‐π interaction in hexameric GGGPS variants. Anchor, “aromatic anchor” residue; cation, cationic counterpart. i) Sequences from all extant proteins used in ASR except Flavobacteriales, Sulfolobales and Desulfurococcales (presumed hexamers; Figure 2, Figure S1); ii) sequences from Flavobacteriales, iii) sequences from Sulfolobales and Desulfurococcales. (b) Analytical size exclusion chromatography of AncGGGPS2_N52. (c) Analytical size exclusion chromatography of the variants mtGGGPS_W141H, AncGGGPS2_N12, AncGGGPS2_N12_W138H and AncGGGPS2_N12_W138A. The proteins (40 μM subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols. The oligomerization state of AncGGGPS2_N12 has already been determined in a previous study

Analysis of GGGPS variants with histidine as “aromatic anchor”. (a) Relative frequency of amino acid residues at the positions that form a cation‐π interaction in hexameric GGGPS variants. Anchor, “aromatic anchor” residue; cation, cationic counterpart. i) Sequences from all extant proteins used in ASR except Flavobacteriales, Sulfolobales and Desulfurococcales (presumed hexamers; Figure 2, Figure S1); ii) sequences from Flavobacteriales, iii) sequences from Sulfolobales and Desulfurococcales. (b) Analytical size exclusion chromatography of AncGGGPS2_N52. (c) Analytical size exclusion chromatography of the variants mtGGGPS_W141H, AncGGGPS2_N12, AncGGGPS2_N12_W138H and AncGGGPS2_N12_W138A. The proteins (40 μM subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols. The oligomerization state of AncGGGPS2_N12 has already been determined in a previous study We have previously shown for mtGGGPS that a mutation of the “aromatic anchor” to alanine (mtGGGPS_W141A; serving as reference in Figure 3) breaks the hexamer down into dimers. To demonstrate that histidine does in fact play a similar role for hexamerization like the common aromatic residues do, we first introduced the corresponding His‐to‐Ala exchange into AncGGGPS2_N52, but the resulting protein strongly tended to aggregation and could not be used for further experiments. As an alternative experiment, we changed the Trp in hexameric mtGGGPS_wt to His. The resulting variant mtGGGPS_W141H still eluted as a hexamer in analytical SEC (Figure 3(c)), which supports that the His can still form a cation‐π interaction with the opposing cation (a Lys in mtGGGPS). Additionally, we randomly chose the hexameric ancestral protein AncGGGPS2_N12 (described later in detail) and introduced the W138H exchange, which again did not affect the oligomerization state (Figure 3(c)). In contrast, the hexamer was disrupted in the variant AncGGGPS2_N12_W138A. We conclude that histidine with its heterocyclic aromatic ring also functions as an “aromatic anchor” to stabilize the hexamer, and suspect that all extant sulfolobic GGGPS proteins are hexamers as well. In conclusion, almost all group II GGGPS enzymes must be considered to be hexameric, except the bacterial proteins from Flavobacteriales and a very small number of archaeal proteins, mainly from Thermoplasmatales, , , as summarized in Table S1.

Hexamerization emerged very early during group

To study the time point of emergence of hexamerization during group II GGGPS evolution, our rationale was to follow the evolutionary paths from the well‐characterized extant proteins fjGGGPS (dimeric; Flavobacteriales) and mtGGGPS (hexameric; Methanobacteriales) to LCA‐II (Figure 2). We expressed synthetic genes encoding seven ancestral GGGPS proteins in E. coli, purified the proteins and analyzed their oligomerization states by SEC (Figure 4). The results are summarized in Table S1 and Figure 2 (grey symbols). Additionally, activity assays (Figure S2) were performed, and the structural integrity of the proteins was analyzed by CD spectroscopy (Figure S3B). AncGGGPS2_N5, _N12, _N18 and _N72 turned out to be active proteins and hexameric (Figure 4(a)). AncGGGPS2_N2 and _N3 are also active proteins, but eluted at a volume between the dimeric and hexameric controls at the initially used protein concentration of 40 μM (Figure 4(b)). SEC experiments with a series of different protein concentrations revealed a dimer‐hexamer equilibrium for these variants with tendency to the hexamer under the given experimental conditions (Figure S4). Qualitatively, these variants can still be considered as hexamers, and they possess all characteristics of the hexamer‐specific contact interfaces like the “aromatic anchor” and the cation. A recent study on the robustness of reconstructed ancestral protein functions has revealed that quantitative markers of function ‐ like dissociation constants ‐ may vary, while qualitative conclusions on reconstructed protein functions are robust. Only AncGGGPS2_N4 elutes as a dimer in SEC (Figure 4(b)), although it also contains the “aromatic anchor” and the cation. Furthermore, this protein is inactive (Figure S2). As discussed below, this suggests that the dimeric nature of AncGGGPS2_N4 might be an artifact due to sequence reconstruction. From the evolutionary point of view, there is no rationale why _N4 should be a dimer, while _N3, _N12 and _N5 are hexamers. In summary, these results indicate that hexamerization must have occurred very early in group II GGGPS evolution, but because we could not reconstruct a robust sequence for the group II LCA (LCA‐II; as discussed above), we cannot clearly define whether LCA‐II already has been hexameric or not (Figure 1).
FIGURE 4

Analytical size exclusion chromatography of ancestral GGGPS proteins. (a) Hexameric proteins. (b) Proteins that are dimeric or form dimer‐hexamer equilibria at the given concentration. The identities of the proteins are given in the legend. The proteins (40 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols. The oligomerization states of AncGGGPS2_N4, _N5 and _N12 have already been determined in a previous study

Analytical size exclusion chromatography of ancestral GGGPS proteins. (a) Hexameric proteins. (b) Proteins that are dimeric or form dimer‐hexamer equilibria at the given concentration. The identities of the proteins are given in the legend. The proteins (40 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols. The oligomerization states of AncGGGPS2_N4, _N5 and _N12 have already been determined in a previous study To shed light on the impairments of AncGGGPS2_N4, we studied this protein in more detail. We have shown previously that a single mutation (W141A) that disrupts the mtGGGPS hexamer, also drastically worsens the catalytic efficiency (kcat/KM (G1P) 380x lower, KM (G1P) 50x higher), although the mutated position is not directly associated with the active site. Natively dimeric GGGPS like fjGGGPS are highly active, however. , A similar correlation between catalytic activity and oligomerization state of modified N4 variants would support the hypothesis that _N4 has a misbalanced hexamerization interface due to artefacts in sequence reconstruction. Along these lines, we transplanted the ring interface of hexameric AncGGGPS_N12 into AncGGGPS2_N4 (Figure S5). The resulting variant AncGGGPS_N4_IF_n12 eluted as a hexamer in analytical SEC (Figure 5) and showed GGGPS activity (Figure S2). As a reverse experiment, we transplanted the ring interface from AncGGGPS2_N4 to AncGGGPS_N12. The resulting variant AncGGGPS_N12_IF_n4 eluted as a dimer in analytical SEC (Figure 5) and showed no GGGPS activity (Figure S2). Furthermore, isothermal titration calorimetry (ITC) experiments indicated that AncGGGPS2_N4 couldn't bind the substrate G1P any more (Figure S6), which is tantamount to the severely increased KM (G1P) in mtGGGPS_W141A. In summary, these results strongly support that the dimeric nature of AncGGGPS2_N4 is an artefact from ASR due to a misbalanced ring interface. Furthermore, additional experiments indicate that the interplay of ring interface and interconnecting interface might be impaired as well in AncGGGPS2_N4 (data not shown). We have previously shown that both interfaces cooperate to keep the hexamer stable. Ancestral sequence reconstruction, however, calculates each amino acid position independently from other positions. Therefore, mutual interactions of individual residues in an interface, and interactions of the different interfaces in context of oligomerization are difficult to reconstruct, which may occasionally lead to such artifacts. Such effects might also be the reason for the observed dimer‐hexamer equilibrium in AncGGGPS2_N2 and _N3. In general, these considerations support the robustness of the reconstruction of the hexameric state of the ancestral GGGPS proteins. It is obvious that it is much more probable to disturb the interface balance and therefore break the hexamer, than to convert a dimer to a hexamer by undesired reconstruction artifacts.
FIGURE 5

Analytical size exclusion chromatography of AncGGGPS2_N4_IF_n12 and AncGGGPS2_N12_IF_n4. The proteins (40 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols

Analytical size exclusion chromatography of AncGGGPS2_N4_IF_n12 and AncGGGPS2_N12_IF_n4. The proteins (40 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols Additionally, we were able to purify and study the first extant protein from Thaumarchaeota, nvGGGPS, which shows a well‐defined structure in CD spectroscopy, is enzymatically active, and is also a hexamer (Figure 1, Table S1, Figure S2, Figure S3D, Figure S7).

Extant dimeric group

To our knowledge, group II GGGPS enzymes from only two phyla have been reported to be dimeric based on experimental data (Table S1). These are the bacterial Flavobacteriales (fjGGGPS, zpGGGPS ; fsGGGPS, this work) and the archaeal Thermoplasmatales (taGGGPS , , ; tvGGGPS ). To study the emergence of these dimeric group II GGGPS, we expressed synthetic genes for predecessors of the Flavobacteriales and analyzed the purified proteins. AncGGGPS2_N72 is hexameric, as described above. In contrast, AncGGGPS2_N73, the LCA of the Flavobacteriales GGGPS, eluted as a dimer in analytical SEC (Figure 6). The protein showed a well‐defined structure in CD spectroscopy (Figure S3C), but only poor activity (Figure S2). This may indicate an impairment of this protein as discussed above for AncGGGPS2_N4, but AncGGGPS2_N73 has an Arg at the “aromatic anchor” position and a Lys at the position of the cationic partner, like most extant Flavobacteriales enzymes. This supports that the experimentally obtained dimeric oligomerization state of AncGGGPS2_N73 is not an artifact.
FIGURE 6

Analytical size exclusion chromatography of AncGGGPS2_N73. The proteins (22 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. AncGGGPS2_N73 tends to form aggregates and has a low extinction coefficient, resulting in a quite low peak. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols

Analytical size exclusion chromatography of AncGGGPS2_N73. The proteins (22 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. AncGGGPS2_N73 tends to form aggregates and has a low extinction coefficient, resulting in a quite low peak. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols Similarly, the sequences of Thermoplasmatales have an Arg at the “aromatic anchor” position. GGGPS sequences from Thermoplasmatales are missing in our ASR, because their somewhat diverse sequences were filtered out within the process of identification of wandering sequences to obtain robust ASR results. An alternative ASR of GGGPS sequences including Thermoplasmatales has been published recently. Based on computational data only, the authors postulate a convergent evolution for the hexamerization of group II GGGPS in the different phyla, which goes along with dimeric predecessors at the deep nodes in the phylogenetic tree. In contrast, our experimental results strongly support that hexamerization occurred very early in evolution, close to the group II LCA. Our data rather implicate that the occurrence of dimeric GGGPS among the group II proteins is a secondary and convergent effect, because these proteins are direct descendants of hexameric predecessors (Figure 2). A horizontal gene transfer of a dimeric GGGPS from group I is very unlikely, because the Flavobacteriales and Thermoplasmatales GGGPS sequences are much more diverse from group I sequences than from any group II sequence. The biological reason for this change to a dimeric oligomerization state remains unclear, however, and needs further elucidation.

Thermostability emerged very early during group

Transition temperatures from thermal denaturation experiments (DSC, nanoDSF) of many extant GGGPS enzymes have been obtained in this and in a previous study (Figure 2, Table 1). Notably, the temperatures of complete denaturation (T2) are all in the hyperthermophilic (up to 127°C) or at least thermophilic range, although there are proteins from mesophilic species among them (fjGGGPS, fsGGGPS, rsGGGPS, slGGGPS, nvGGGPS). We have previously found that some GGGPS proteins show an additional thermal transition in DSC or nanoDSF experiments (T1). This transition is hardly visible in CD spectroscopic analysis, indicating that it is not associated with a significant change of secondary structure, but goes along with enzymatic inactivation. For some GGGPS proteins, these T1 temperatures are in the upper mesophilic range only. Our interpretation of these results was that the protein scaffold of all GGGPS proteins is very thermostable and rigid, whereas the active site region is destabilized in some GGGPS variants to allow a higher flexibility and therefore higher activity at lower temperatures. A localized flexibilization of the active site within a relatively rigid protein scaffold is not unique to GGGPS, but has been frequently observed in enzymes from psychrophilic species to allow high turnover rates at low temperatures. , Temperatures of thermal transitions of different GGGPS proteins as followed by nanoDSF and DSC The first block shows ancestral proteins, the second block proteins studied in this work, the raw data are shown in Figure S8. The third block shows proteins that have been investigated earlier. The given temperatures are the mean of duplicates, standard deviations are given. Note that mtGGGPS shows two T1 transitions (T1, T1* ). Only the transition leading to inactivation is given. ‐, T1 does not exist; n.d., not determined. The experimentally obtained data do not allow to decide whether the observed transition is due to a partial (T1) or complete denaturation (T2). See methods section for details. To investigate the emergence of thermostability among group II GGGPS, we have now determined transition temperatures of thermal denaturation of a large number of reconstructed ancestral and extant GGGPS (Figure 2, Table 1). Remarkably, the T2 temperatures of complete denaturation for all ancestral GGGPS proteins are in the hyperthermophilic range. An exception is AncGGGPS2_N73, which tends to aggregation and where T2 could not be determined confidently. The representatives at the deepest nodes, AncGGGPS2_N2 and AncGGPPS3_N3 have T2 temperatures of 108°C and 113°C, respectively. This clearly supports a hyperthermophilic nature of the group II GGGPS LCA. The flexibilization of the active site, going along with the emergence of the T1 transition, obviously has emerged in some clades only. Besides the hyperthermophilic mtGGGPS and mrGGGPS, only the representatives from mesophilic Bacteroidetes (fsGGGPS, fjGGGPS) showed a T1 transition. Our results indicate that the flexibilization emerged quite late in evolution, because none of the analyzed predecessors showed a T1 transition. To exclude that the reconstructed ancestral GGGPS proteins may not be active at elevated temperatures due to reconstruction artefacts, we analyzed the activity of the most stable ancestral GGGPS in our study, AncGGGPS2_N12 (T2 = 118°C; no T1). One of its direct descendants is the well‐studied mtGGGPS (T1 = 102.5°C, T2 = 127°C), which showed an apparent half life time of 16.5 min at 100°C in heat inactivation studies. We incubated both mtGGGPS and AncGGGPS2_N12 for 60 min at 105°C under pressure in the DSC instrument and tested their activity before and after this incubation step (Figure 7). mtGGGPS completely lost its activity under these conditions, while the apparent activity of AncGGGPS2_N12 was unchanged. This means that AncGGGPS2_N12 is even more thermostable than mtGGGPS and supports that all ancestral group II GGGPS enzymes may have been active at extreme temperatures.
FIGURE 7

Heat inactivation of mtGGGPS and AncGGGPS2_N12. Proteins were incubated for 60 min at 105°C. Heat‐treated and untreated samples (1 μM subunit concentration) were incubated with the substrates 14C‐G1P and GGPP for 1.5 h at 40°C. The generated products were extracted, dephosphorylated to allow for subsequent separation by thin layer chromatography, and visualized by autoradiography. As negative control (−), no enzyme was added. The origin of the chromatography and the reaction product geranylgeranylglycerol (GGG) are marked

Heat inactivation of mtGGGPS and AncGGGPS2_N12. Proteins were incubated for 60 min at 105°C. Heat‐treated and untreated samples (1 μM subunit concentration) were incubated with the substrates 14C‐G1P and GGPP for 1.5 h at 40°C. The generated products were extracted, dephosphorylated to allow for subsequent separation by thin layer chromatography, and visualized by autoradiography. As negative control (−), no enzyme was added. The origin of the chromatography and the reaction product geranylgeranylglycerol (GGG) are marked

DISCUSSION

Recent studies have shown that many extant representatives of group II GGGPS proteins have a hexameric oligomerization state. A key element to stabilize the hexamer is a cation‐π interaction in the so‐called ring interface, usually formed by a Trp, Tyr or Phe together with a Lys or Arg at the opposing protein surface. , A large number of Crenarchaeota, however, have a conserved His at the position of the aromatic moiety, and their GGGPS proteins were not accessible to experiments up to now. Using ancestral sequence reconstruction, we have now characterized a large number of GGGPS predecessors with respect to their oligomerization state, thermostability and activity. Our results demonstrate that the crenarchaeal proteins form hexamers, too. This means that almost all group II GGGPS, and consequently the GGGPS of almost all archaea, have an hexameric oligomerization state. Furthermore, almost all GGGPS predecessors form hexamers, among them those at the deepest nodes of the ASR tree. This makes it very likely that hexameric GGGPS developed close to the emergence of the group II LCA (LCA‐II; Figure 1), which itself could not be reconstructed. In contrast to the group II GGGPS proteins, all studied representatives of group I proteins are reported to be dimers. An exception is hsGGGPS, which eluted as a monomer in SEC, but unpublished results support that this might be an artefact due to the high salt requirements of this halobacterial protein that could not be satisfied during SEC analysis. Because of the divergence of group I and group II GGGPS sequences, we were not able to calculate a combined ASR tree of groups I and II and their common ancestor sequence. However, the exclusive presence of dimers among group I makes it very likely that the LCA of group I (LCA‐I; Figure 1) also was a dimer. Under these assumptions (dimeric LCA‐I, hexameric LCA‐II), the LCA of all GGGPS proteins might have been either dimeric or hexameric. The analysis of large datasets of oligomeric structures has revealed as a common rule that the larger an oligomerization interface is, the earlier it has emerged during the evolution of the complex. , Furthermore, symmetric interfaces are more likely to emerge during evolution, because they are easier to generate. In GGGPS, the symmetric dimer module interface is by far the largest (1,163 Å2 in mtGGGPS), while the key interface for hexamerization, the ring interface, is much smaller (256 Å2 in mtGGGPS) and asymmetric. These facts and the principle of parsimony speak for a dimeric nature of the LCA, and the need for thermostabilization may have driven the evolution towards hexamerization in group II GGGPS by a cyclic trimerization of the dimeric building block. We assume that the few dimeric group II GGGPS like in Flavobacteriales and Thermoplasmatales have emerged convergently as secondary events, because they descend from hexameric predecessors. Ancestral sequence reconstruction also allowed us to shed light on the evolution of thermostability among group II GGGPS. All reconstructed GGGPS predecessors are highly thermostable. As extreme example, AncGGGPS2_N12 only denatures at temperatures above 118°C and is still fully active after 60 min incubation at 105°C. We have recently presented data which suggest that hexamerization of GGGPS supports thermostability in a rather indirect manner. Even as a monomeric module, the GGGPS protein scaffold is highly thermostable in terms of the temperature of complete denaturation. However, such rigid enzymes tend to be less catalytically active. Similarly to what has been observed for cold‐adapted enzymes to allow high turnover rates at low temperatures, , we have previously found a localized flexibilization of the active site of some GGGPS representatives, which goes along with two transitions in thermal denaturation experiments. The first transition is associated with loss of activity and collapse of the active site, while the second transition marks the complete denaturation of the protein. We could demonstrate that hexamerization elevates the temperature of the first transition, and we assumed that oligomerization serves for a fine tuning of the balance between flexibility and stability in hexameric GGGPS. Ancestral sequence reconstruction now revealed that the flexibilization of the active site, apparent as the occurrence of two transitions in thermal denaturation experiments, obviously emerged quite late in evolution, because only extant GGGPS enzymes showed this phenomenon. This means that the primary stimulus for the emergence of hexamerization was not the need to stabilize the active site, but must have been different, such as a general need for thermostability. The flexibilization of the active site has obviously emerged independently of hexamerization in convergent processes, because it is also present in dimeric fjGGGPS. Its stabilization by hexamerization must rather be regarded as a secondary benefit that made use of the already existing higher oligomeric structure. Consequently, ancestral GGGPS enzymes must have used another still unknown, probably less effective way to balance catalytic activity and stability, like some extant dimeric GGGPS from hyperthermophilic species still do.

MATERIALS AND METHODS

Ancestral sequence reconstruction of

Ancestral sequence reconstruction (ASR) has been described in detail elsewhere. Briefly, the 87 GGGPS group II sequences that have been manually selected by using several filter algorithms in order to obtain high posterior probabilities on the edges of the tree, branch lengths <1.0 mutations per site and a species diversity as high as possible. Filter algorithms as they are the basis of FitS4ASR were used in a manual manner which identify wandering sequences by both RogueNaRok and the described heatmap approach. The resulting sequences are from Crenarchaeota (23 sequences), Euryarchaeota (40 s.), Thaumarchaeota (8 s.), and Bacteroidetes (16 s.). The tree needed for reconstruction was computed by using PhyloBayes, utilizing a site‐heterogeneous CAT model. Four independent MCMC samplings of length 50,000 were launched to ensure convergence. After discarding the first 6,000 trees as burn‐in, the remaining trees of two MCMC chains ‐ chosen by the quality criteria maximum discrepancy (<0.1; 0.01699) and minimum effective size (> 100; 2,769) (score for a good run; received score) ‐ were concatenated to deduce a consensus tree. By means of FastML, the ancestral sequences were reconstructed with the substitution model JTT and a gamma distribution. The parameter “probability cutoff”, which helps to prefer ancestral indel over character, was set to 0.8 in order to adjust the length of the reconstructed sequences to the length of the recent sequences, compensating a bias towards longer than true ancestors. The most probable sequence was determined for each internal node of the tree. A FASTA file with all extant and ancestral sequences as well as a file containing the posterior probabilities for all sites in the ancestral sequences are provided as supplementary material.

Cloning and site‐directed mutagenesis

As summarized in Table S2, twelve ancestral GGGPS variants and four extant GGGPS proteins were produced and purified. Gene sequences for the reconstructed GGGPS variants (AncGGGPS2_N2, AncGGGPS2_N3, AncGGGPS2_N4, AncGGGPS2_N5, AncGGGPS2_N12, AncGGGPS2_N18, AncGGGPS2_N52, AncGGGPS2_N72, AncGGGPS2_N73, AncGGGPS2_N4_IF_n12, AncGGGPS2_N12_IF_n4) and extant GGGPS proteins of Roseivirga seohaensis (rsGGGPS), Flavobacterium saliperosum (fsGGGPS), Methanobrevibacter ruminatium (mrGGGPS) and Nitrososphaera viennensis (nvGGGPS) were optimized in their codon usage for expression in E. coli and ordered as GeneArt™ Strings™ DNA fragments from Thermo Fisher Scientific. BsaI restriction sites were incorporated at the 5′ and 3′‐ends of the DNA fragments to allow cloning into a modified pET21a expression vector, providing a C‐terminal hexahistidine (His)6 tag. AncGGGPS2_N73 was cloned into a modified pMAL‐c5T expression vector, providing a C‐terminal maltose binding protein (MBP) fusion with a thrombin cleavage site. Other variants of AncGGGPS2_N12 and mtGGGPS were generated by QuickChange mutagenesis with oligonucleotides as given in Table S3. All constructs were verified by sequencing. The wild‐type Methanothermobacter thermautotrophicus GGGPS (mtGGGPS) and the mtGGGPS_W141A variant have been cloned previously into pET21a. The sequence numbering of mtGGGPS used in this study refers to EMBL ENA entry AAB85058 and pdb‐id 4 mm1, which have an N‐terminal three amino acid extension compared to UniProt entry O26652.

Production and purification of recombinant proteins

Heterologous gene expression was performed in the E. coli strain BL21‐Gold(DE3) (Agilent Technologies). The transformed cells were grown at 37°C in LB containing ampicillin (150 μg/ml−1). When OD600 reached 0.6–0.8, expression was induced by adding 1 mM isopropyl‐β‐D‐1‐thiogalactopyranoside (IPTG). Growth was continued overnight at 20°C, the cells were harvested by centrifugation, resuspended in 50 mM potassium phosphate, pH 7.5, 300 mM KCl, 10 mM imidazole and disrupted by sonication. The His‐tagged proteins were purified from the clarified cell extract by metal chelate affinity chromatography. An ÄKTApurifier system with a HisTrap FF crude column (5 mL, GE Healthcare) was used, and a linear gradient of imidazole (10–500 mM) in 50 mM potassium phosphate, pH 7.5, 300 mM KCl was applied to elute the protein. Interfering imidazole and salt were removed from the purified proteins by dialysis against 50 mM potassium phosphate, pH 7.5 at 4°C. The dialysis buffer for rsGGGPS additionally contained 300 mM KCl. For the MBP fused construct, the dialysis buffer was supplemented with thrombin (2 U/ml−1 final concentration) for cleavage. MBP and thrombin were removed by subsequent preparative size exclusion chromatography (SEC) on a Highload™ 26/600 Superdex™ S75 pg column (GE Healthcare). The column was equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl and was run at a flow rate of 1.5 ml/min. Protein concentrations were determined by absorbance spectroscopy in the UV range; if the extinction coefficient of the variant was too low, the Bradford assay was used. The molar extinction coefficients ε280 and the molecular weight were calculated from the amino acid sequence by means of ProtParam. Purified protein was dropped into liquid nitrogen and stored at −80°C. If high protein concentrations were needed, the samples were concentrated by ultrafiltration (Amicon Ultra‐15, mwco 10 kDa, Merck Millipore). The purification yields of all proteins are listed in Table S2, purity is shown in Figure S9.

Characterization of oligomerization state of recombinant proteins

All variants were characterized by SEC experiments. Standard SEC experiments were performed on a Superdex S75 column (GE Healthcare), which was operated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl at a flow rate of 0.5 ml/min. In 100 μL of protein with a subunit concentration of 22–40 μM was applied. All SEC runs were performed at room temperature in an air‐conditioned room (approx. 22°C).

Spectra of 6 μM protein (subunit concentration) were recorded at a scan rate of 200 nm/min−1 in degassed 50 mM potassium phosphate, pH 7.5 with a response time of 0.5 sec from 190 to 260 nm in a JASCO J‐815 CD spectrometer (d = 0.1 cm) at 25°C.

Differential scanning calorimetry

The 40 μM protein (subunit concentration) was heated in degassed 50 mM potassium phosphate, pH 7.5 from 30°C to 130°C at a ramp rate of 1 K/min−1 in a VP‐DSC differential scanning microcalorimeter (MicroCal, Malvern Instruments) with fixed reference and sample cell (0.511 mL each). The change in heat capacity with raising temperature was recorded. The proper equilibration of the calorimeter was ascertained by performing several buffer‐buffer baselines. Overpressure was applied to prevent boiling above 100°C. Where possible, DSC transitions were baseline corrected and fitted with a non‐two state model using the software supplied by the manufacturer. The apparent midpoint temperature (TMapp) of the irreversible unfolding transition was determined as an operational measure of protein stability. Experiments were done in duplicates.

Differential scanning fluorimetry (

The 40 μM protein (subunit concentration) was heated in degassed 50 mM potassium phosphate, pH 7.5 from 20°C to 95°C at a ramp rate of 1 K/min−1 in a Prometheus NT.48 instrument (NanoTemper Technologies GmbH; access provided by 2bind GmbH). The excitation power at 280 nm was 20%–40%. Emission spectra were measured at 330 and 350 nm. The change in the ratio of the fluorescence signal at 350 nm to 330 nm with raising temperature was followed. Fluorescence transitions were fitted by the program supplied by the manufacturer and the apparent midpoint temperature (TMapp) of the irreversible unfolding transition was determined as an operational measure of protein stability. The experiments were done in duplicates.

Classification of thermal transitions as

The existence of two individual thermal transitions, where the first (at temperature T1) is associated with a loss of enzymatic activity, but with a partial denaturation of the protein only, while the second (at a higher temperature T2) goes along with a complete denaturation, has been unambiguously demonstrated by a combination of DSC, nanoDSF and CD‐spectroscopic experiments as well as enzymatic activity tests for the two proteins mtGGGPS and fjGGGPS. In DSC experiments, the proteins are heated up to 130°C. If only a single transition is visible, it is therefore very likely that this transition goes along with a complete denaturation of the protein and can be addressed as a T2 transition. This has been demonstrated experimentally by experiments with taGGGPS and slGGGPS. In nanoDSF experiments, however, a complete denaturation cannot be achieved for highly thermostable proteins, because this would exceed the upper temperature limit of the instrument (95°C). This makes it difficult to conclude whether a transition observed in nanoDSF is due to a partial (T1) or a complete (T2) denaturation, if only a single transition is visible. Although a decrease of the F350/F330 ratio points to a T1 transition in most cases and an increase of the ratio to a T2 transition ( and this study), we performed DSC experiments wherever possible to clarify the situation. For the three proteins AncGGGPS2_N52, _N72 and _N73, DSC experiments were not possible due to technical reasons. In case of AncGGGPS2_N52 and _N72, no transition could be observed in nanoDSF. We assume that the T2 transition for these proteins is above 95°C. In case of AncGGGPS2_N73, a transition was observed in nanoDSF at 58.4°C with a decrease in F350/F330 ratio, which would speak for a T1 transition, but the protein strongly tended to aggregation and was difficult to handle. Therefore, it is impossible to decide whether this transition is due to a partial or complete denaturation.

Radiometric

14C‐G1P was synthesized as described by Guldan et al. To test the activity of purified GGGPS enzymes, 1 μM of protein (subunit concentration) was incubated with 20 μM GGPP and 20 μM 14C‐G1P (302 nCi) in 10 mM MgCl2, 0.2% Tween80, 50 mM Tris–HCl, pH 8.0, in a total volume of 100 μL for 1.5 h at 40°C. The products were dephosphorylated by adding 1 U calf intestinal alkaline phosphatase (New England Biolabs) for 30 min at 40°C and extracted according to the method of Bligh and Dyer as modified by Kates. The solvent was evaporated to dryness in a rotary evaporator and the products were analyzed by TLC on Silica 60 plates, developed in ethyl acetate/hexane 1:1 (v/v), and visualized with a phosphorimager system (PerkinElmer Life Sciences).

Heat inactivation

For observing residual activity after irreversible thermal inactivation, the isoscan function of the VP‐DSC differential scanning microcalorimeter (MicroCal, Malvern Instruments) was used with a subsequent radiometric GGGPS activity assay. All enzymes were adjusted to a subunit concentration of 40 μM in degassed 50 mM potassium phosphate, pH 7.5. The enzymes were incubated at 105°C for 60 min in the DSC instrument to allow exact temperature control of the sample. The samples were chilled on ice and centrifuged at 16.100 g for 5 min at 4°C. The residual activity of the supernatant was assayed in the presence of geranylgeranyl pyrophosphate (GGPP) and G1P in a radiometric GGGPS activity assay, as described above.

Isothermal titration calorimetry

To follow G1P binding to GGGPS variants, a MicroCal PEAQ‐ITC microcalorimeter (Malvern Instruments) with a cell volume of 280 μL was used. Degassed protein (100 μM subunit concentration) and G1P (1 mM) solutions were prepared from the identical buffer batch (50 mM Tris, pH 8.0, 10 mM MgCl2). The G1P solution was titrated to the protein in 2 μL aliquots for a total of 18 injections at 2.5 min intervals at 25°C. During the course of the titration, the reaction mixture was continuously stirred. As controls, titrations of buffer with buffer, protein solution with buffer and buffer with ligand solution were performed. Each titration experiment was baseline corrected, and the change of heat per injection was calculated by integrating the area under each peak using the Origin software provided by MicroCal. The experimentally observed signals for the ligand binding experiment were corrected for the signals of the control experiments, a ΔH versus molar ratio plot was generated and a fit curve was plotted with the “single set of identical sites” option provided by the software.

Supplementary material

As supplementary material, we provide a file (Supinfo.pdf) containing three tables that summarize the features of the studied proteins and the used oligonucleotides, as well as nine figures showing a detailed phylogenetic tree as used for ASR and detailed characterizations of the studied proteins like activity assays, CD spectra, additional SEC analyses, ITC, nanoDSF and DSC data as well as SDS‐PAGE analysis displaying their purity. A FASTA file with all extant and ancestral sequences (AppendixS2.doc) as well as a file containing the posterior probabilities for all sites in the ancestral sequences (AppendixS1.xlsx) are provided separately.

AUTHOR CONTRIBUTIONS

Cosimo Kropp: Conceptualization; investigation; methodology; writing‐original draft. Kristina Straub: Methodology; software. Mona Linde: Investigation; methodology. Patrick Babinger: Conceptualization; supervision; writing‐review and editing. Table S1 Experimentally determined oligomerization states of GGGPS enzymes Table S2. GGGPS variants characterized in this study with their purification yields after expression in E. coli. Table S3. Primer sequences used for site‐directed mutagenesis. Figure S1. Phylogenetic tree deduced from the manually curated sequence set used for ASR of GGGPS group II predecessors. PhyloBayes (27) was used to calculate the tree. The length of the horizontal bar corresponds to 0.2 substitutions per site. Nodes “N..” correspond to evolutionary intermediates. The tree has been calculated in a previous study (12). The full species and strain names as well as UniprotKB accession numbers are given for each extant sequence. A FASTA file with all extant and ancestral sequences as well as a file containing the posterior probabilities for all sites in the ancestral sequences are provided as supplementary material. Figure S2. Activity assay of GGGPS variants used in this work. The substrate 14C‐G1P was incubated with GGPP and 1 μM of the GGGPS variant (subunit concentration) for 1.5 h at 40°C. The generated products were extracted, dephosphorylated to allow for subsequent separation by thin layer chromatography, and visualized by autoradiography. As negative control (−), no enzyme was added to the assay. The origin of the chromatography and the reaction product GGG are marked. The samples were run on different TLC plates each, as indicated by the vertical white gap. Figure S3. Structural integrity of GGGPS variants characterized in this study. Far‐UV CD spectra of ancestral and extant GGGPS variants (6 μM, subunit concentration) were recorded in 50 mM potassium phosphate, pH 7.5 from 190 nm to 260 nm (d = 1 mm) at room temperature. The CD spectra are sorted in the order they are mentioned in the main text (A‐D). The identities of the proteins are given in the legends. Figure S4. Analytical size exclusion chromatography with varying concentrations of AncGGGPS2_N2 and AncGGGPS2_N3. (A) AncGGGPS2_N2 and (B) AncGGGPS2_N3 were applied at four different subunit concentrations (as given in the legend) to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min, followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols. Figure S5. Sequence alignment of AncGGGPS2_N4 and AncGGGPS2_N12. Only the region around the ring interface is shown. The transplanted region is highlighted in blue, the non‐swapped rest of the ring interface highlighted in beige. The positions of the “aromatic anchor” (a) and the cation (c) are marked. Figure S6. Analysis of G1P binding to AncGGGPS2_N4 variants by isothermal titration calorimetry. (A/C) AncGGGPS2_N4. (B/D) AncGGGPS2_N4_IF_n12. (A) and (B) show the raw data of the titration experiments, in (C) and (D) the derived ΔH for each titration step is plotted against the molar ratio G1P:protein. The observed change in ΔH for AncGGGPS2_N4_IF_n12 is too weak to reliably deduce binding parameters. Figure S7. Analytical size exclusion chromatography of rsGGGPS, fsGGGPS, mrGGGPS and nvGGGPS. The proteins (40 μM, subunit concentration) were applied to a S75 analytical column equilibrated with 50 mM potassium phosphate, pH 7.5, 300 mM KCl. Elution was performed at a flow rate of 0.5 mL/min and followed by measuring the absorbance at 280 nm, which was plotted against the elution volume. mtGGGPS_wt and mtGGGPS_W141A (dashed lines) served as references for the hexameric and dimeric oligomerization states. The oligomerization state is indicated by gray symbols. Figure S8. Thermal stability of GGGPS proteins, followed by nanoDSF and DSC. (A) nanoDSF analysis of ancestral GGGPS proteins with an apparent T2 > 95°C. (B) nanoDSF analysis of ancestral proteins with an observable thermal transition. (C) nanoDSF analysis of extant GGGPS proteins. The change in the ratio of the fluorescence emission at 350 and 330 nm of 40 μM protein (subunit concentration) in 50 mM potassium phosphate, pH 7.5 (supplemented with 300 mM KCl for AncGGGPS2_N73 and rsGGGPS) was monitored from 20 to 95°C at a scan rate of 1 K min−1. Figure S9. Purity of GGGPS variants characterized in this study. SDS‐PAGE analysis (12.5% polyacrylamide) of the purified GGGPS variants (2.5 μg protein each). Size marker (M), molecular weight in kDa. Ancestral GGGPS variants: (1) AncGGGPS2_N2, (2) AncGGGPS2_N3, (3) AncGGGPS2_N4, (4) AncGGGPS2_N5, (5) AncGGGPS2_N12, (6) AncGGGPS2_N18, (7) AncGGGPS2_N52, (8) AncGGGPS2_N72, (9) AncGGGPS2_N73, (10) AncGGGPS2_N4_IF_n12, (11) AncGGGPS2_N12_IF_n4, (12) AncGGGPS2_N12_W138H, (13) AncGGGPS2_N12_W138A. Extant GGGPS variants: (14) rsGGGPS, (15) fsGGGPS, (16) mrGGGPS, (17) nvGGGPS, (18) mtGGGPS, (19) mtGGGPS_W141H, (20) mtGGGPS_W141A. The samples were run on three separate SDS gels. Click here for additional data file. Appendix S1. Supporting Information. Click here for additional data file. Appendix S2. Supporting Information. Click here for additional data file.
  36 in total

1.  Activity-stability relationships in extremophilic enzymes.

Authors:  Salvino D'Amico; Jean-Claude Marx; Charles Gerday; Georges Feller
Journal:  J Biol Chem       Date:  2003-01-02       Impact factor: 5.157

2.  The crystal structure of (S)-3-O-geranylgeranylglyceryl phosphate synthase reveals an ancient fold for an ancient enzyme.

Authors:  Jian Payandeh; Masahiro Fujihashi; Wanda Gillon; Emil F Pai
Journal:  J Biol Chem       Date:  2005-12-23       Impact factor: 5.157

3.  Self-assembly and evolution of homomeric protein complexes.

Authors:  Gabriel Villar; Alex W Wilber; Alex J Williamson; Parvinder Thiara; Jonathan P K Doye; Ard A Louis; Mara N Jochum; Anna C F Lewis; Emmanuel D Levy
Journal:  Phys Rev Lett       Date:  2009-03-18       Impact factor: 9.161

4.  PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating.

Authors:  Nicolas Lartillot; Thomas Lepage; Samuel Blanquart
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

5.  Structural studies of geranylgeranylglyceryl phosphate synthase, a prenyltransferase found in thermophilic Euryarchaeota.

Authors:  P N Blank; A A Barnett; T A Ronnebaum; K E Alderfer; B N Gillott; D W Christianson; J A Himmelberger
Journal:  Acta Crystallogr D Struct Biol       Date:  2020-05-29       Impact factor: 7.652

Review 6.  Reconstructing Ancient Proteins to Understand the Causes of Structure and Function.

Authors:  Georg K A Hochberg; Joseph W Thornton
Journal:  Annu Rev Biophys       Date:  2017-03-15       Impact factor: 12.981

7.  Identification and Characterization of Heptaprenylglyceryl Phosphate Processing Enzymes in Bacillus subtilis.

Authors:  Mona Linde; David Peterhoff; Reinhard Sterner; Patrick Babinger
Journal:  J Biol Chem       Date:  2016-05-14       Impact factor: 5.157

8.  Purification and characterization of geranylgeranylglyceryl phosphate synthase from a thermoacidophilic archaeon, Thermoplasma acidophilum.

Authors:  Naoki Nemoto; Tairo Oshima; Akihiko Yamagishi
Journal:  J Biochem       Date:  2003-05       Impact factor: 3.387

9.  Computer simulations explain the anomalous temperature optimum in a cold-adapted enzyme.

Authors:  Jaka Sočan; Miha Purg; Johan Åqvist
Journal:  Nat Commun       Date:  2020-05-26       Impact factor: 14.919

10.  Origin of complexity in haemoglobin evolution.

Authors:  Shane A Chandler; Yang Liu; Anthony V Signore; Arvind S Pillai; Carlos R Cortez-Romero; Justin L P Benesch; Arthur Laganowsky; Jay F Storz; Georg K A Hochberg; Joseph W Thornton
Journal:  Nature       Date:  2020-05-20       Impact factor: 49.962

View more
  3 in total

Review 1.  The catalytic and structural basis of archaeal glycerophospholipid biosynthesis.

Authors:  Niels A W de Kok; Arnold J M Driessen
Journal:  Extremophiles       Date:  2022-08-17       Impact factor: 3.035

2.  Controlling Enzymatic Activity by Modulating the Oligomerization State via Chemical Rescue and Optical Control.

Authors:  Cosimo Kropp; Astrid Bruckmann; Patrick Babinger
Journal:  Chembiochem       Date:  2021-10-22       Impact factor: 3.461

3.  Hexamerization and thermostability emerged very early during geranylgeranylglyceryl phosphate synthase evolution.

Authors:  Cosimo Kropp; Kristina Straub; Mona Linde; Patrick Babinger
Journal:  Protein Sci       Date:  2021-01-11       Impact factor: 6.725

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.