Literature DB >> 34977887

Solution Structure and Conformational Flexibility of a Polyketide Synthase Module.

Maja Klaus¹, Emanuele Rossini², Andreas Linden^3,4, Karthik S Paithankar¹, Matthias Zeug¹, Zoya Ignatova⁵, Henning Urlaub^3,4, Chaitan Khosla⁶, Jürgen Köfinger², Gerhard Hummer^2,7, Martin Grininger¹.

Abstract

Polyketide synthases (PKSs) are versatile C-C bond-forming enzymes that are broadly distributed in bacteria and fungi. The polyketide compound family includes many clinically useful drugs such as the antibiotic erythromycin, the antineoplastic epothilone, and the cholesterol-lowering lovastatin. Harnessing PKSs for custom compound synthesis remains an open challenge, largely because of the lack of knowledge about key structural properties. Particularly, the domains-well characterized on their own-are poorly understood in their arrangement, conformational dynamics, and interplay in the intricate quaternary structure of modular PKSs. Here, we characterize module 2 from the 6-deoxyerythronolide B synthase by small-angle X-ray scattering and cross-linking mass spectrometry with coarse-grained structural modeling. The results of this hybrid approach shed light on the solution structure of a cis-AT type PKS module as well as its inherent conformational dynamics. Supported by a directed evolution approach, we also find that acyl carrier protein (ACP)-mediated substrate shuttling appears to be steered by a nonspecific electrostatic interaction network.

Entities: Chemical

Year: 2021 PMID： 34977887 PMCID： PMC8717363 DOI： 10.1021/jacsau.1c00043

Source DB: PubMed Journal: JACS Au ISSN： 2691-3704

Introduction

Modular polyketide synthases (PKSs) are a class of megasynthases that produce compounds of high pharmaceutical importance.[1] In modular PKSs, modules are linked covalently or noncovalently,[2−4] and are hypothesized to self-organize into linear scaffolds that feature product synthesis in a vectorial manner (Figure A).[5,6] In cis-AT PKSs, the catalytic core of each PKS module comprises the domains needed for the condensation of small acyl precursors, the ketosynthase (KS), the acyl transferase (AT), and the acyl carrier protein (ACP). In addition, the domains ketoreductase (KR), dehydratase (DH), and enoylreductase (ER) can be present to modify the condensation product at the β-position. In most cases, a thioesterase (TE) domain at the C-terminus of the modular PKSs is responsible for product release.[7,8]

Figure 1

Schematic architecture of DEBS, structural models, and constructs used in this study. (A) Overview of DEBS. The three polypeptides (DEBS1–3), the encoded modules (M1–M6), the loading module (LM), the TE, and the final product are shown. Black tabs depict docking domains. The KR domain of M3 is redox-inactive and termed kr.[33] (B) Models for the partially reducing PKS modules; DEBS M3 is based on SAXS analysis that implemented high-resolution structural information on KS-AT models and FAS,[9] and PIKS M5 is based on cryoEM analysis.[14] DEBS M3 model: Domains KS and AT form a rigid dimeric unit of an overall extended shape, and are separated by a noncatalytic linker-domain (LD). AT is followed by a structured post-AT linker[10,11,34] that interacts with both LD and KS, and connects AT to KR. PIK M5 model: In the arch-shaped KS-AT dimer, the AT is oriented downward. It interacts with the respective upward-facing LD and the KS of the other protomer of the dimer. The post-AT linker is not resolved within the PIKS M5 structure. Schemes of the domain structures including domain boundaries are attached. Note that for DEBS M3, domain borders are also given in construct A numbering with labeling in italics, which has been used in this study (Table S1). Post-AT linker sequence as defined in refs (21 and 22). (C) DEBS M2-derived constructs MBP-M2-DD2 (construct A) and KR1-ACP1-M2-TE (construct B). Over the last years, structural evidence has been obtained for two distinct structural arrangements of PKS modules that differ most substantially in the relative arrangement of the condensation domains KS and AT. Small-angle X-ray scattering (SAXS) data obtained on the 6-deoxyerythronolide B synthase module 3 (DEBS M3) revealed an extended conformation of the KS-AT dimer[9] in support of previous X-ray crystallographic data collected on the didomain subunit.[10−12] In contrast, a cryo-electron microscopy (cryoEM) study on the related pikromycin synthase module 5 (PIKS M5), reporting the first structure of a complete PKS module received at moderate resolution, suggested an arch-shaped KS-AT dimer conformation (Figure B).[13,14] Whereas the DEBS M3 structural model agrees with the structure of the evolutionarily related mammalian fatty acid synthase (FAS),[15] with the cryoEM structure of the lovastatin synthase iterative module LovB,[16] and with 3D-models generated from partial X-ray structures (mycocerosic acid synthase (MAS)-like PKS[17] and spinosyn synthase module 2 (SPNS M2)[18]), the structural model derived from the EM reconstruction of PKS M5 differs substantially from the others. The two structural models lead to substantial divergence in the molecular mechanism, notably concerning both the intramodular elongation and intermodular translocation steps.[19,20] The model derived from PIKS M5 structural data suggested loading of the KS active site by the upstream ACP (translocation step) via a side entrance, whereas, for elongation, the extender substrate reaches the binding site via a newly observed bottom entrance. Understanding the structural and mechanistic key features of PKSs is essential to inform their engineering for producing novel compounds with potentially new bioactivities.[21,22] To address the question of the structural and conformational properties of a PKS module, we have investigated DEBS module 2 (DEBS M2) by integrating SAXS and cross-linking mass spectrometry (XL-MS) with coarse-grained (CG) structural modeling and simulation. We considered the combination of noninvasive solution structure methods with robust structure calculation by Bayesian inference as particularly suited for addressing the question of the overall structural appearance of a PKS module. The hybrid approach enables the detailed study of the ensemble of conformations of a PKS module in solution, providing insight beyond the currently available structural information. DEBS is the ideal modular PKS for this approach, because it is a prototypical PKS and it has served as a model system before to examine the mechanism[23] and engineering potential[24−27] as well as the structure of the enzymatic domains or larger assemblies, e.g., KR1, DH4, KS3-AT3, KS5-AT5, ACP2 (numbers indicate module), docking domains (DDs) between DEBS2 and DEBS3, and TE.[2,7,9−11,28−30] Further, DEBS is phylogenetically situated in the same clade with other actinobacterial modular PKSs, so that insight gained likely applies to PKS modules in general.[31] We applied our approach to two DEBS M2 constructs (Figure C). Construct A contains the native M2 (KS-AT-KR-ACP-DD) with an N-terminally fused maltose binding protein (MBP). It represents a stand-alone PKS module with a domain organization comparable to the previously studied DEBS M3 and PIKS M5 (essentially just differing in the N-terminal MBP fusion). In construct B, M2 is decorated with the natural upstream domains KR1 and ACP1 (“1” for M1), constituting the processing part of DEBS M1, and a C-terminal TE domain (Figure C). The two constructs were chosen to provide complementary information on domain–domain interfaces in PKS modules (Table S1). Although our data show that extended (“DEBS M3 model”) and arch-shaped (“PIKS M5 model”) conformations of module DEBS M2 are possible in solution, they likewise reveal the propensity of DEBS M2 to adopt the extended conformation in solution. In this conformation, domains KR and ACP are positionally highly variable, representative of an overall conformationally dynamic processing part of a PKS module in accordance with previous data.[32] In a focus on the domain–domain interplay in modular PKSs, we complemented our hybrid approach with a directed evolution experiment based on phage display. Enhanced CG simulations, XL-MS analysis, and phage display suggest that interactions of ACP with catalytic domains are steered by networks of weak electrostatic interactions. Our findings shed light on the structure and dynamics of a PKS module, and offer new insight into the interplay between domains in PKSs. Collectively, these results provide guidance in the engineering of PKSs for the custom synthesis of novel polyketides.

Experimental Section

Plasmids, Bacterial Cell Culture, and Protein Purification

Plasmids harboring genes encoding individual PKS modules were generated via In-Fusion Cloning (Takara) and restriction and ligation-based techniques. Proteins were expressed and purified using similar protocols. For holo-proteins (where the ACP domain is post-translationally modified with a phosphopantetheine arm), E. coli BL21 cells were cotransformed with a plasmid encoding for the phosphopantetheine transferase Sfp from B. subtilis (pAR357[35]). Proteins either contained a C-terminal His6-tag or an N-terminal MBP-tag for purification. Twin-Strep-tagged proteins (KS3-AT3 for phage display) were purified using 5 mL of StrepTactin in a column.[36] The column was washed with 5 CV lysis buffer to elution with 3 column volumes of lysis buffer containing 2.5 mM destiobiotin. Eluates from His6-tag, Strep-tag, and MBP-tag purifications were further purified by anion exchange chromatography and size exclusion chromatography (SEC). Samples were stored as aliquots at −80 °C until further use. For more information, see the Supporting Information.

Enzymatic Assays and Triketide Analysis

PKS enzymatic assay of KR1-ACP1-M2-TE was performed according to published procedures.[34] The resulting product was extracted twice with 450 μL dried in vacuo and submitted for LC-MS analysis. For more information, see the Supporting Information.

SAXS Analysis and CG Structural Modeling

SEC-SAXS analysis was performed on the Bio-SAXS beamline BM29 at the European Synchrotron Radiation Facility (ESRF).[37] We computed the SAXS intensities for each of the individual CG conformations using the package FoXS.[38,39] We used the Bayesian Inference of Ensembles (BioEn) method to determine the statistical weights of Aext, Aarch, and Bext ensembles of conformations underlining the measured SAXS intensities.[40−42] BioEn computations were performed considering the SAXS intensities in the range of 0.02 < q < 0.3 Å–1. From the scattering curves, the radius of gyration Rg was calculated using the Guinier approximation.[43] We reduced the sampled ensembles to the conformations that together fulfill more than 99% of the statistical cumulative weight for evaluation of the measured cross-links. Systematic deviations between the measured and simulated scattering curves at low q range suggested moderate aggregation of the systems in solution. To quantify and account for the fraction of aggregation, we combined each simulated curve with the intensities of a globular model at a statistical weight that matched the experiment. We built initial structural arrangements for Aext and Bext as informed by previous SAXS investigations of DEBS.[9] The initial structural arrangement of Aarch was constructed by docking the modeled protein domains within the cryoEM density of the related PikAIII complex (EMD-5664).[14] We generated a canonical ensemble of models Aext, Aarch, and Bext using Monte Carlo (MC) simulations based on coarse-grained (CG) potential energies.[44] Folded protein domains were treated as rigid bodies described by one interaction site per residue located at their Cα atoms. The protein domains were linked by flexible linkers represented as Gaussian chain polymers and placed in a cubic box of 50 nm size. The solvent dielectric constant was set to ε = 80. The post-AT regions of Aarch were modeled as flexible linkers. We fixed the position and orientation of KS-AT2 and used step sizes of 3 Å and 3.14 rad for the MC translational and rotational movements of all other domains. For all models, conformations were sampled in a temperature replica-exchange simulation scheme covering the range of 298 to 598 K in steps of 20 K. For Aarch, we also performed replica-exchange simulations collecting replicas between 298 and 398 K every 20 K to preserve the arch-shaped conformation that were otherwise disrupted at higher temperatures. For each model, after an initial equilibration, a total of 9.0 × 105 MC conformations were produced to exhaustively sample relevant conformations. To augment model flexibility and the sampled conformational space of the models, we considered only the conformations produced at highest temperature for further analysis. Aarch conformations sampled at 598 K are denoted as Aarch598. For more information, see the Supporting Information.

XL-MS Analysis

Amounts of 150–175 μg of constructs A and B at a final protein concentration of 1 μg/μL or less were incubated with 2 mM SDA (100 mM stock in DMSO) for 30 min at room temperature. The cross-linking reactions were quenched with 50 mM Tris-HCl. Proteins were digested by trypsin, and peptides were acidified with trifluoroacetic acid (TFA) to a final concentration of 0.5% (v/v), desalted on MicroSpin Columns (Harvard Apparatus) following manufacturer’s instructions, and vacuum-dried. To enrich cross-linked peptide species by peptide size exclusion chromatography, we subjected fractions that eluted first and contained the cross-linked peptide pairs to LC-MS/MS analysis. Cross-linked peptides were measured in technical duplicates on an Orbitrap Fusion Tribrid mass spectrometer or on a Q Exactive HF-X coupled to a Dionex UltiMate 3000 UHPLC system (Thermo Fisher Scientific) equipped with an in house-packed C18 column (ReproSil-Pur 120 C18-AQ, 1.9 μm pore size, 75 μm inner diameter, 30 cm length, Dr. Maisch GmbH). ProteomeDiscoverer 1.4 (Thermo Fisher Scientific) was used for converting raw files into.mgf format (signal-to-noise ratio 1.5, 1000–10000 Da precursor mass). The generated.mgf files were subjected to pLink v. 1.23 (pFind group)[45] to identify cross-linked peptides. All spectra were evaluated manually. For more information, see the Supporting Information.

Structural Mapping of the XL-MS Data

We mapped the measured cross-links of construct A over the 1000 Aext and Aarch conformations ranked by weight that fulfill 99.9% of the BioEn scattering curves. The same analysis was performed to map the measured XL-MS data of construct B over the 5000 conformations of model Bext that fulfill 99% of the BioEn scattering curve. For all selected conformations, we calculated the Cα pair-distances across the reported cross-linked residues. For more information, see the Supporting Information.

Phage Display Methodology

Phage display experiments were carried out according to the protocol described by Tonikian et al.[46] A library of ACP1(2) mutants fused to the N-terminus of the minor coat protein P3 of the M13 bacteriophage was generated as described before.[46] A primer for randomization of five amino acids in the chain translocation epitope of ACP1 was designed using NNK at the target positions (P-MK162:5′ CTG GCG TCG CTG CCC GCG NNK GAG CGC NNK NNK GCG CTG TTC NNK CTC GTG CGC NNK CAC GCG GCC GCC GTC CTC 3′ (N: A/C/G/T K: G/T). Panning was performed in a StrepTactinXT-coated microtiter plate with wells coated with (3)KS3-AT3--Strep. The blocked phage library was incubated for 1 h on the coated and blocked wells prior to washing. Wash steps were increased each round over the course of the panning protocol (10×, 12×, 14×, 16×, 18×). Elution of bound phage was done with 100 μL of 100 mM hydrochloric acid for 20 min. The elution was propagated and the titer was determined. For propagation, 2 mL of log phase E. coli SS320 (OD600 0.8) was added to 200 μL of eluted phage and infection was allowed for 30 min at 37 °C, 200 rpm, whereupon 20 μL of M13K07 helper phage (1.0 × 1012 PFU/mL) was added. Phages were harvested and pellets containing the phage were resuspended in 1 mL of blocking buffer. To test the eluted phage for increased and specific binding, a specificity ELISA was performed. To further confirm the relative binding intensity of the newly enriched mutants, we used a titration ELISA using purified phage. The advantage of this assay is that defined phage amounts can be used and the signal can be normalized across different phage preparations. For more information, see the Supporting Information.

Results

SAXS Analysis and XL-MS Data Collection

Constructs A and B were purified as both apo- and phosphopantetheinylated holo-proteins using established protocols,[47] yet codon harmonization of the M2 gene was used to increase expression yields for both constructs (Figure S1, Table S2). We note that the body of data has been collected on the apoprotein (CG structural modeling, SAXS, XL-MS, and phage display) unless otherwise stated. The proteins purified to homogeneity were subjected to tandem size exclusion chromatography small-angle X-ray scattering (SEC-SAXS) and eluted as dimeric species (Figure S2A). Scattering data from the region of stable Rg values were analyzed, assuming that higher oligomer and aggregates are essentially eliminated (Figures S2B–D, Figures S3A–D, and SI Note 1). The scattering derived parameters, Rg values of 77.2 ± 0.2 Å for construct A and 76.6 ± 0.2 Å for construct B (Guinier approximation), respectively (Figure S3B and Table S3), were significantly larger than those previously obtained for DEBS M3 (M3-TE: 61.3 ± 0.5 Å).[9] Although the Rg value of constructs A and B are essentially similar, molecular weight (MW) estimates by the volume of correlation (Vc) well reflect calculated MWs; i.e., 408 (exp.) vs 403 kDa (calc.) for construct A, and 484 vs 488 kDa for construct B (SI Note 1, Figure S3B and Tables S3 and S4).[48] In addition to SAXS analysis, XL-MS was employed to gain distance information for specific residues in solution. We used the heterobifunctional cross-linker succinimidyl 4,4-azipentanoate to map interactions of lysine residues with other residues within and across domain boundaries (Cα-Cα-distance ∼20 Å (ref[49]) (Tables S5–S7). Focusing on construct A, we observed a number of XLs between the flexible ACP domain and catalytic domains (from ACPK1775 and ACPK1784), yet also across interfaces of less flexible domains and units, e.g., KS:AT and KS:KR (from KSK727), AT:LD and AT:KR (from ATK1071), and the post-AT linker (from K1273) and domains LD, AT, and KR (Table S6). Overall constructs A and B showed a strong overlap in mapped interactions (Table S6 and S7). The biological relevance of the M2-TE region of construct B was established by measuring triketide lactone product formation using a synthetic substrate (Figure S4).

Coarse-Grained Modeling of Two PKS Module Conformations

In integrating SAXS and XL-MS into a complex computational model building approach, we decided to employ structural models of the extended and the arch-shaped KS-AT dimer (KS-AT2) conformation for CG modeling of construct A (Aext and Aarch, Figure A, B, Figure S5, and SI Material and Methods). Model Aext was based on available X-ray crystallographic data[10,11] and exhibits an extended KS-AT2 conformation with rigid post-AT linkers folding back to the KS domain. Model Aarch was informed by the PIKS M5 cryoEM density[14] and carries an arch-shaped KS-AT2 with flexible post-AT loops (Figure S6). We generated an ensemble of structural models in CG simulations[44] and performed Bayesian ensemble refinement.[40−42] An entropic penalty ensured that the weights of the individual conformations were minimally adjusted to match the measured SAXS intensities (Figure S7).[41,42] We note that for CG structural modeling, SEC-SAXS frames over the entire dimeric elution peak were averaged for constructs A and B, respectively (Figure S2A). The Bayesian inference highlighted a small degree of protein aggregation in solution, also visible for both constructs in the SEC profiles (Figure S2A), for which we accounted in silico by aggregate correction (SI Note 2). Computed Rg values from the BioEn ensembles of Aext and Aarch were equal to 67.6 and 64.0 Å based on the Guinier approximation (Figures S8–10, SI Notes 2 and 3), which is in good agreement with the previous data collected on DEBS M3-TE.[9] The in silico intensities computed for Aext are in excellent agreement with the experimental SAXS data (Figure A). By contrast, the calculated intensities for Aarch show a clear divergence from measured data (Figure B and Figure S11). All complex arrangements were sorted based on their statistical weight, and we analyzed the positional variability of the domains of those that cumulatively explain 90% of the scattering intensities (Figure A, B). In Aext, the KR and thereby the attached ACP domain are highly mobile (Figure A and Figure S12). In Aarch, the flexibility of the post-AT linker enables occasional migration of KR toward the condensing domains; however, these conformations contribute only at a marginal statistical weight (Figure B, SI Note 4). In equilibrium with an extended conformation, an arch-shaped conformation thus appears to be feasible both topologically and energetically, in an arrangement resembling the cryoEM structure of PikAIII (SI Note 2).[13,14] However, SAXS data identify the extended conformation as the prevalent structural conformation of DEBS M2 in solution.

Figure 2

Structural models of DEBS M2 in solution. (A) Extended and (B) arch-shaped conformation. (A and B, left) Model conformations with high statistical weight in the ensembles of Aext and Aarch, respectively. Conformations at the highest weight are shown as cartoons. The conformations at lower cumulative weights, reflecting less populated conformations in solution, are depicted in surface representation (transparency correlated to statistical weights). Domain coloring: MBP, white; KS, blue; LD, gray; AT, green; KR, orange; ACP, magenta; DD, gray. (A and B, top right) Cumulative weights (c. w.) of the weight-sorted conformations fulfilling 90% (blue line) and 100% (black line) of the fittings. (A and B, bottom right) Experimental (black) and computed (red) scattering intensities as I(q) versus q.

XL-MS Distance Information within PKS Modules

In a next step, we mapped the measured XLs of construct A over the extended and arch-shaped configurations to locate candidate regions of domain–domain interactions in M2 (SI Note 5). Overall, the Aext model is in very good agreement with the measured cross-linking data (Tables S6 and S7, SI Note 6). High numbers of cross-linked peptide spectral matches (CSMs), supporting a specific interaction, are observed across the post-AT linker and KR, particularly between K1273 (post-AT linker) and E1474, Y1475, and A1476 (KR, Figure A). These residues emerge as a hot-spot of interaction that constrain an otherwise undirected random motion of KR around the post-AT linker and KS, supporting the high flexibility of M2 in solution (Figure B and Movie S1). The attachment of the post-AT linker to the KS-AT2 core and the treatment of the KS-AT2 core as rigid body generally prevented interactions between KR and the peripheral regions of AT. As such, one observed cross-link in this region (XL7, AT2K1071-KRQ1623), which cannot be explained by conformational variability of the Aext model, indicates that dissociation of the post-AT linker from the KS-AT2 core, providing a higher degree of flexibility to KR, or conformational variability within the KS-AT2 core may be possible at low frequency. Alternatively, XL7 may result from formation of higher oligomers, as observed in SEC (Figure S2A), in which the peripheral stretch of the AT could interact with the mobile KR of another module (Figure S13, Table S6). Another set of XLs was observed between MBP and KS-AT2, in which the MBP domain interacts ubiquitously, yet at low frequencies with the KS-AT dimer (Figure S14, Table S6). This can be rationalized by the high mobility of the MBP domain, which is connected by long linkers to the condensing domain (Table S9). In contrast, Aarch satisfies only a small set of the XLs measured mainly across KR and ACP (Table S6). Overall, XL-MS data thus support an extended conformation of M2 in solution.

Figure 3

XL-MS distance information within PKS modules. (A) High-frequency XLs between KR (E1474, Y1475, and A1476) and the post-AT linker (K1273). (B) Schematic illustration of KR flexibility from multiple generated conformations. Post-AT linker in light green. (C–E) Candidate regions of domain–domain interaction of ACP with KS and KR based on the Aext configurations at high statistical weights that satisfy the measured XLs with residue-pair distance lower than 21 Å. (C) Interactions between ACPK1775 (yellow sphere) and D538, G539, and Y540 of KS (red spheres). The active serine of ACP (S1790, cyan sphere) is oriented far from the entrance of the KS binding pocket with the catalytic KSC571 highlighted in green. We note that only a subset of observed ACP-KS interactions place the ACP active serine within proximity to the KS active site (Figure S15). (D) Interactions between ACPK1784 (yellow sphere) and G539, and Y540 of KS (red spheres). The active serine of ACP (S1790, cyan sphere) is oriented in proximity of the entrance of the KS binding pocket with the catalytic KSC571 highlighted in green. (E) Interaction of ACPK1775 (yellow sphere) and V1295, S1303, and E1304 of KR (red spheres).

Positional Variability of ACP

The attachment of ACP via a 14-amino acid linker to the mobile KR implies a high conformational variability. By computational modeling and XL-MS, we isolated candidate regions of domain–domain interaction of ACP with KS-AT2 and KR (Figure C–E and Figure S15). We observed high frequencies of interactions across K1775 and K1784 of ACP and D538, G539, and Y540 of KS and across K1775 of ACP and V1295, S1303, and E1304 of KR. The candidate regions are populated by charged residues: D1172, D1773, R1778, E1785 and D1789 of ACP; D535, R553, Y556, R612, R711, and D714 of KS; and D1296, E1299, R1302, E1449, D1450, R1716, R1718, D1722, and R1723 of KR (Table S6). These data indicate that domain–domain interactions are based on networks of electrostatic interactions, similarly as found before in type I and type II FASs.[50−52] In our simulations, ACP extensively samples the entrance of the KS binding pocket (Figure S15), and we isolated a candidate orientation with ACPS1790 pointing toward the active site and the catalytic C571 of KS (Figure D). Note that, because of its small size, the ACP domain is represented in computational models with low weight, thereby impeding a more detailed description of interfaces. However, XL-MS confirms modeling data and highlights interaction hot spots of ACP distributed over KS-AT2 and KR that are steered by networks of electrostatic interactions (Table S6).

Structural Properties of Construct B

Analogously to construct A, the SAXS intensities measured for construct B in solution are explained by a structural model (Bext) that carries the extended KS-AT2 conformation (Figure S16, SI Note 7). In Bext, KR1 (KR of M1) is highly mobile and occupies a large conformational space, whereas ACP1 displays low mobility and samples specific regions of KS-AT2 (Figure S17, SI Note 8). In addition, the lowered flexibility of KR (of M2) in Bext confirms the structural constraints imposed by the TE (Figure S18, SI Note 8). We note that XL-MS was performed with phosphopantetheinylated construct B. Overall fewer CSMs were observed compared to (apo-)construct A (Table S7), which was also found in a single experiment with phosphopantetheinylated construct A (yielding fewer CSMs compared to (apo-)construct A; Table S6). This observation indicates that the phosphopantetheine moiety constrains ACP docking during substrate shuttling, as suggested previously in structural[13,53,54] and functional studies[55−57] on type I PKSs and the related FASs (SI Note 9). In construct B, no CSMs were observed between ACP1 and KS, which may be due to the suppression of this interaction by the competitive ACP1-KR1 interactions or by the high dynamics of ACP1 due to the unconstrained KR1 (no N-terminal KS1-AT1 dimer in construct B). In conclusion, Bext is in good agreement with the measured CSMs (Table S5, SI Note 10). Interestingly, the interaction pattern of Aext is also captured in Bext, highlighting the transferability of the XL-MS data to a different construct of DEBS (Table S6, SI Note 11).

Characterization of an Interface via Phage Display

Having observed that the ACP-covering cross-links do not converge on specific interfaces on either catalytic domain of the KS-AT2 core (Tables S6 and S7), we sought to further our understanding of catalytic domain recognition by ACP in an additional experiment based on directed evolution through phage display. We decided to work with the intermodular translocation interface between ACP and KS, i.e., ACP interacting with KS of the downstream module for translocating the growing acyl chain, because this interface is the only one that has been described so far in modular PKSs in more detail.[58] Specifically, we set up a phage display approach with the aim of increasing the affinity of the non-native interface between ACP of module 1 (ACP1) and KS of module 3 (KS3). The sequence identity between KS of M2 (KS2) and KS3 is 58% on the protein level (EMBOSS pairwise assembly, www.ebi.ac.uk).[59] For phage display, ACP1 was expressed on phage, and its helix 1 (H1), previously determined to play a crucial role during intermodular chain translocation,[58] was mutagenized (Figure S19). We note that phage display was performed with apo-ACP phage library due to the difficulty in performing and monitoring phosphopantetheinylation of the ACP displayed at the phage surface. In our phage display experiment, ACP1H1 mutants (mutants of ACP1 at helix 1) with higher affinity toward KS3-AT3 were enriched. The didomain construct KS3-AT3 was used instead of standalone KS3, because it can be produced as a stable protein by recombinant production.[10] For ACP1, three interfaces are available in principle for docking, i.e., two interfaces on KS3 (one for intermodular translocation and one for intramodular condensation), and one on AT3 for transacylation. As H1 plays a major role in the intermodular translocation reaction,[24,58] while only marginally contributing to domain–domain interactions during condensation[58] and transacylation,[60−62] a putative consensus sequence on H1 should evolve from selection for the translocation interface. From the initial library of 3.1 × 106 possible ACP1H1 mutants, 300 enriched mutants were selected for ELISA screening, from which we finally identified six mutants (ACP1-Lib1-Mut“X”) exhibiting an increased binding compared to wild-type ACP1, albeit only marginally, and BSA as a control (Figure A). The BSA control was considered as a mean to eliminate ACP1H1 mutants that did presumably not evolve specificity to KS3-AT3, because of also nonspecifically interacting with BSA. We did not observe a consensus in mutations between enriched and ELISA-selected ACP1H1 mutants, denoted ACP1-Lib1-Mut“X” (Figure B). The integrity of the selected mutants was eventually confirmed by the activity of bimodular LM+M1+M3-TE assembly lines in which the full-length M1 construct carried mutated ACPs (SI Note 12) (Figure C)[26,27,47]

Figure 4

Directed evolution of a chimeric ACP:KS interface. Analysis of enriched mutants from ACP1-Library1. (A) ELISA of ACPs presented on the phage surface. Results for selected mutants (Lib1-MutX) and wild-type ACPs are shown. Signal was obtained in KS3-AT3-coated wells (black bars) and the degree of unspecific binding was assessed by comparing it to the signal in BSA coated wells (gray bars). For each ACP, four individually grown phage cultures were tested (except three cultures for ACP1 and ACP2), and the individual data points and data mean are indicated. Data range of ACP2 is indicated by a gray bar for reference. (B) Sequence alignment of obtained mutants compared to wild-type ACP1 and ACP2. Randomized positions are indicated with a yellow asterisk. (C) Turnover rates of wild-type and chimeric bimodular PKSs harboring the mutations enriched in the ACP1-Lib1 biopanning experiments. All bimodular PKS consisted of LM, M1, and Module“X”-TE. Either wild-type DEBS M1 was used as the first module (M1) or one of the six mutants obtained through the first directed evolution experiments (M1-Lib1-Mut“X”). Initial rate data were obtained at individual PKS protein concentrations of 4 μM and nonlimiting concentrations of propionyl-CoA, methylmalonyl-CoA, and NADPH. Measurements were performed in triplicate from one batch per construct M1-Lib1-Mut“X”, and mean and standard deviation are given. For the two most active mutations, Lib1-Mut3 and Lib1-Mut5, new positions at H1 were chosen for further randomization. Both libraries were treated with the same protocol and analyzed with the same methods as the original library, except for an additional ELISA titration experiment that was conducted to further evaluate binding of ACP1H1 mutants compared to wild-type. The new libraries did neither produce a consensus sequence nor result in high affinity or high turnover bimodular PKSs (Figure S20, Table S8, and SI Note 13). Collectively, these data argue against the occurrence of a specific ACP:KS translocation interface, and, thus, agree with the positional variability of ACP in docking at a catalytic domain suggested from cross-linking and modeling data. We note that several studies propose that the spatial distribution of ACP is sensitive to the acyl-phosphopantetheine moiety,[13,53,54] and future studies with phosphopantetheinylated ACP and the native substrate will be needed to further understand the specificity in substrate shuttling as well as the impact of acyl-phosphopantetheinylation.

Discussion

Structural analysis of the evolutionarily related iterative PKSs and FASs suggests a high conformational variability of modular PKSs,[17,53,63] which makes the high-resolution structure determination of PKS modules challenging. Structural characterization of flexible proteins is possible by the synergistic application of SAXS, XL-MS, and integrative modeling that enables the quantitative assessment and modeling of structural heterogeneity and conformational variability. In applying the combined approach, we were able to substantiate the relevance of the two strikingly different arch-shaped and extended conformations that have both been proposed as prevalent structures of a PKS module. Overall, our experimental data and modeling results on DEBS M2 show that both conformations are possible in principle; however, the extended structure appears to be dominant in solution. The extended structure of modular PKSs in solution is itself dynamic, and we present direct insight into the conformational variability of a PKS module. Whereas the KS-AT dimeric unit is relatively rigid, as reported by previous structural studies on type I PKS[10−12,16,32] and treated as rigid core in this study, computational modeling constrained by SAXS data suggests a high conformational variability of KR. Combined with XL-MS data highlighting the interaction of KR with the post-AT linker, a picture of KR moving undirected and randomly just constrained by the post-AT linker as an anchoring point emerges. A PKS module can also feature dimerizing units in the processing part; i.e., the dimeric domain DH, and a dimerization element present in about half of the modules with only KR as modifying domain.[64] Both units will constrain conformational dynamics, and KR will no longer be a separately moving domain (Movie S1), but part of a larger assembly with decreased mobility. In conclusion, this study argues that in PKS modules, a compact KS-AT dimeric unit carries a processing part with conformational dynamics depending on its module organization.[17] The substrate-shuttling ACP domain is of key importance for any PKS module. In our study, performed on the nonphosphopantetheinylated domain, we observe a lack of high-frequency XLs from ACP to KS, AT and KR, which indicates that ACP does not form highly specific interfaces with those domains. To further our understanding of the specificity of ACP-domain interactions, we employed a directed evolution approach for evolving the ACP:KS translocation interface toward higher affinities. Directed evolution approaches are widely used to narrow down specific interactions as a result of an increase in affinity. Our results show that despite intensive screening only few higher affinity hits emerged from ELISA analysis, among which no consensus sequence could be found. We interpret this observation as arguing against the existence of specific ACP:KS translocation interfaces, in line with our solution structure data. Overall, our data suggest that substrate shuttling by ACP of type I PKS is facilitated by weak interactions, similarly as described earlier for type I and type II FAS.[51,52] We conclude that ACP moves freely, and rather randomly collides with domains of the type I assembly, where electrostatic networks influence the approach of ACP to the individual domains. As such our findings essentially support a previously suggested model in which ACP is unrestrained, but restricted in its trajectory by the other domains of the multienzyme assembly.[20] We note that the body of data for this study was collected with nonphosphopantetheinylated proteins, mainly due to technical reasons. For example, domains were represented as rigid bodies in CG structural modeling, and the phosphopantetheine moiety was not represented in the model. Since ACPs of type I systems do not accommodate phosphopantetheine and acyl-chains, leading just to local changes around the active serine in apo-, holo-, and acylated-ACP, but leaving the ACP fold overall unchanged,[65−67] our data are representative of the domain–domain portion of the interactions. However, several structural studies revealed that the spatial distribution of ACP is sensitive to the acyl moiety in type I PKSs and the related FASs.[13,53,54] Further, recent enzyme kinetic studies on ATs from type I PKSs and FASs reported the variation in KM with the identity of the acyl unit, suggesting the impact of acyl-phosphopantetheine moiety on ACP docking.[55−57] Accordingly, studies are required on phosphopantetheinylated and acylated ACPs to understand substrate shuttling. In particular, the impact of the phosphopantetheine and the identity of the acyl-moiety on this process is just poorly characterized for type I PKSs and FASs until to date.[65−67] Besides giving insight into the structural and conformational properties of PKSs, our study provides guidelines for PKS engineering. Taking into account recently published data,[34,68] the extended conformation is likely self-sufficient for synthetic progress in PKS assembly lines. We therefore suggest to work with the extended model in PKS engineering approaches, as well as to base active site remodeling on the biomolecular processes connected to this structural arrangement. For example, our data support the KS functional mode to be “conventional”, i.e., using the generic KS binding tunnel identical to type I FASs and iterative PKSs, and approaches on active site design can be informed by data available also on those systems. Our findings can further explain why engineering approaches involving exchanges of the KR domain or the entire processing part have been successful in the past,[69−72] as well as encourage consideration of such approaches in PKS engineering projects. XL-MS data indicate just few cross-links between KR and other domains (to the AT domain, Tables S6 and S7), so that KR exchanges may be well accessible in modules with only KR as modifying domain. We assume that also exchanges of the entire processing part (in larger PKS modules) are amenable, because condensing and processing parts appear separated in structural and conformational properties; as also implied by several successful engineering approaches (e.g., refs (72 and 73)). Data for ACP should be interpreted in a similar light. As our direct insight in ACP-mediated domain–domain interactions argues against well-defined docking events (between apo-ACP and catalytic domains), productive substrate shuttling appears possible in chimeric PKS with non-native ACP as long as conflicting electrostatic interactions are avoided.

71 in total

1. An unusually large multifunctional polypeptide in the erythromycin-producing polyketide synthase of Saccharopolyspora erythraea.

Authors: J Cortes; S F Haydock; G A Roberts; D J Bevitt; P F Leadlay
Journal: Nature Date: 1990-11-08 Impact factor: 49.962

2. Identification of cross-linked peptides from complex samples.

Authors: Bing Yang; Yan-Jie Wu; Ming Zhu; Sheng-Bo Fan; Jinzhong Lin; Kun Zhang; Shuang Li; Hao Chi; Yu-Xin Li; Hai-Feng Chen; Shu-Kun Luo; Yue-He Ding; Le-Heng Wang; Zhiqi Hao; Li-Yun Xiu; She Chen; Keqiong Ye; Si-Min He; Meng-Qiu Dong
Journal: Nat Methods Date: 2012-07-08 Impact factor: 28.547

3. Structure-based analysis of the molecular interactions between acyltransferase and acyl carrier protein in vicenistatin biosynthesis.

Authors: Akimasa Miyanaga; Shohei Iwasawa; Yuji Shinohara; Fumitaka Kudo; Tadashi Eguchi
Journal: Proc Natl Acad Sci U S A Date: 2016-02-01 Impact factor: 11.205

Review 4. Engineering strategies for rational polyketide synthase design.

Authors: Maja Klaus; Martin Grininger
Journal: Nat Prod Rep Date: 2018-10-17 Impact factor: 13.423

Review 5. The architectures of iterative type I PKS and FAS.

Authors: Dominik A Herbst; Craig A Townsend; Timm Maier
Journal: Nat Prod Rep Date: 2018-10-17 Impact factor: 13.423

6. Interfacial plasticity facilitates high reaction rate of E. coli FAS malonyl-CoA:ACP transacylase, FabD.

Authors: Laetitia E Misson; Jeffrey T Mindrebo; Tony D Davis; Ashay Patel; J Andrew McCammon; Joseph P Noel; Michael D Burkart
Journal: Proc Natl Acad Sci U S A Date: 2020-09-14 Impact factor: 11.205

7. Antibody Probes of Module 1 of the 6-Deoxyerythronolide B Synthase Reveal an Extended Conformation During Ketoreduction.

Authors: Dillon P Cogan; Xiuyuan Li; Natalia Sevillano; Irimpan I Mathews; Tsutomu Matsui; Charles S Craik; Chaitan Khosla
Journal: J Am Chem Soc Date: 2020-08-18 Impact factor: 15.419

8. Efficient Ensemble Refinement by Reweighting.

Authors: Jürgen Köfinger; Lukas S Stelzl; Klaus Reuter; César Allande; Katrin Reichel; Gerhard Hummer
Journal: J Chem Theory Comput Date: 2019-04-17 Impact factor: 6.006

9. Conformational flexibility of metazoan fatty acid synthase enables catalysis.

Authors: Edward J Brignole; Stuart Smith; Francisco J Asturias
Journal: Nat Struct Mol Biol Date: 2009-01-18 Impact factor: 15.369

10. Structural rearrangements of a polyketide synthase module during its catalytic cycle.

Authors: Jonathan R Whicher; Somnath Dutta; Douglas A Hansen; Wendi A Hale; Joseph A Chemler; Annie M Dosey; Alison R H Narayan; Kristina Håkansson; David H Sherman; Janet L Smith; Georgios Skiniotis
Journal: Nature Date: 2014-06-18 Impact factor: 49.962

1 in total

1. Solution structure of the type I polyketide synthase Pks13 from Mycobacterium tuberculosis.

Authors: Cécile Bon; Stéphanie Cabantous; Sylviane Julien; Valérie Guillet; Christian Chalut; Julie Rima; Yoann Brison; Wladimir Malaga; Angelique Sanchez-Dafun; Sabine Gavalda; Annaïk Quémard; Julien Marcoux; Geoffrey S Waldo; Christophe Guilhot; Lionel Mourey
Journal: BMC Biol Date: 2022-06-21 Impact factor: 7.364

1 in total