A Nuclear Magnetic Resonance Fingerprint Matching Approach for the Identification and Structural Re-Evaluation of Pseudomonas Lipopeptides.

Vic De Roo¹, Yentl Verleysen¹,², Benjámin Kovács¹, Matthias De Vleeschouwer¹,², Penthip Muangkaew², Léa Girard³, Monica Höfte⁴, René De Mot³, Annemieke Madder², Niels Geudens¹, José C Martins¹.

Abstract

Cyclic lipopeptides (CLiPs) are secondary metabolites secreted by a range of bacterial phyla. CLiPs from Pseudomonas, in particular, display diverse structural variations in terms of the number of amino acid residues, macrocycle size, amino acid identity, and stereochemistry (e.g., d- versus l-amino acids). Reports detailing the discovery of novel or already characterized CLiPs from new sources appear regularly in the literature. Increasingly, however, the lack of detailed characterization threatens to cause considerable confusion, especially if configurational heterogeneity is present for one or more amino acids. Using Pseudomonas CLiPs from the Bananamide, Orfamide, and Xantholysin groups as test cases, we demonstrate and validate that the combined 1H and 13C Nuclear Magnetic Resonance (NMR) chemical shifts of CLiPs constitute a spectral fingerprint that is sufficiently sensitive to differentiate between possible diastereomers of a particular sequence even when they only differ in a single d/l configuration. Rapid screening, involving simple matching of the NMR fingerprint of a newly isolated CLiP with that of a reference CLiP of known stereochemistry, can then be applied to resolve dead-ends in configurational characterization and avoid the much more cumbersome chemical characterization protocols. Even when the stereochemistry of a particular reference CLiP remains to be established, its spectral fingerprint makes it possible to quickly verify whether a newly isolated CLiP is novel or already present in the reference collection. We show that NMR fingerprinting leads to a simple approach for early dereplication, which should become more effective as more fingerprints are collected.
To benefit research involving CLiPs, we have made a publicly available data repository accompanied by a 'knowledge base' at https://www.rhizoclip.be, where we present an overview of published NMR fingerprint data of characterized CLiPs, together with literature data on the originally determined structures. IMPORTANCE Pseudomonas CLiPs are ubiquitous specialized metabolites, impacting the producer's lifestyle and interactions with the (a)biotic environment. Consequently, they generate interest for agricultural and clinical applications. Establishing structure-activity relationships as a premise to their development is hindered because full structural characterization including stereochemical information requires labor-intensive analyses, without guarantee for success. Moreover, increasing use of superficial comparison with previously characterized CLiPs introduces or propagates erroneous attributions, clouding further scientific progress. We provide a generally applicable characterization methodology based on matching NMR spectral fingerprints of newly isolated CLiPs to natural and synthetic reference compounds with (un)known stereochemistry. In addition, NMR fingerprinting is shown to provide a suitable basis for structural dereplication. A publicly available reference compound repository promises to facilitate participation of the lipopeptide research community in structural assessment and dereplication of newly isolated CLiPs, which should also support further developments in genome mining for novel CLiPs.

Keywords: NMR spectroscopy; Pseudomonas; cyclic lipodepsipeptides; dereplication; stereochemistry

Year: 2022 PMID: 35876524 PMCID: PMC9431178 DOI: 10.1128/spectrum.01261-22

Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497

INTRODUCTION

Pseudomonas represents a diverse and ubiquitous bacterial genus that evolves in a wide range of ecological niches (1, 2). To support their lifestyle and interactions with other organisms, these bacteria produce a variety of complex secondary metabolites including cyclic lipodepsipeptides or ‘CLiPs’ generated by non-ribosomal peptide synthetases (NRPSs) (3). Biological effects associated with these biosurfactant peptides include antimicrobial activity, control of bacterial motility and biofilm formation, and phytotoxicity, among others (4, 5). They are also involved in the complex interplay between bacteria and plants at the level of the plant rhizosphere. This generates plant growth promoting activity and protection against pathogens by direct antagonism or induced resistance (6, 7). Other reports have detailed insecticidal activity (8, 9). As a result, this diverse array of functions of Pseudomonas CLiPs has sparked interest in their potential development in plant biocontrol applications and as biopesticides (10, 11). In turn, this also stimulates screening efforts for their development as novel antimicrobials (4, 5). Indeed, the current dearth of novel compounds to fight multidrug-resistant bacteria or fungal infections, combined with an increased focus on the development of narrow-spectrum or even species-specific antibiotics, has renewed efforts toward mining bacteria for novel pharmaceutical leads (12). These have included CLiPs (13, 14), not least motivated by the successful introduction of daptomycin into clinical settings. In addition, several reports have highlighted anti-carcinogenic activities of Pseudomonas CLiPs, further illustrating their biomedical application potential (15–17). Therefore, both from a fundamental and application perspective, there is a pressing need to uncover the mode of action resulting in these biological functions, starting with the way these are related to the underlying chemical structures (5, 18, 19).
Given that structure elucidation of CLiPs remains far from trivial, the development of reliable and straightforward approaches remains key. Moreover, such approaches should allow comparing different CLiPs to unequivocally establish their structural identity. With well over 100 distinct chemical structures reported to date, Pseudomonas CLiPs invariably consist of an oligopeptide sequence of 8 to 25 residues, which is partly cyclized into a depsipeptide through ester bond formation of the C-terminus with a preceding side chain hydroxyl group (4, 5). An acyl chain of varying length and constitution caps the N-terminus of the oligopeptide sequence. The incorporation of non-proteinogenic amino acids, with a majority of residues displaying d-configuration through the action of epimerization domains in the non-ribosomal assembly line, generates additional structural diversity and further magnifies the structure elucidation challenge (20). The availability of rapid and affordable genome sequencing has stimulated the development of bioinformatic tools that allow to predict the CLiP structure produced by a particular non-ribosomal peptide synthetase (NRPS) by mapping its biosynthetic gene cluster (BGC) (21). The primary sequence is derived from analysis of the adenylation A-domains of the NRPS. Although separate epimerization domains are present in the siderophore synthetases of Pseudomonas, such activity is not found in their CLiP biosynthetic enzymes. Instead, condensation domains with intrinsic epimerization capacity (C/E) mimic the activity of a DCL domain by converting the configuration of the C-terminal residue in the intermediate from l to d before extending the peptide (22). Tentative attribution of d/l configuration can therefore be inferred from analysis of the condensation domains involved. 
For Pseudomonas CLiPs in particular, the dual condensation/epimerization C/E domains responsible for configurational inversion of the preceding l-amino acid in the growing peptide chain can be identified using such tools, however it turns out that the epimerization functionality can be inactive for reasons that remain unclear (23, 24). Consequently, as further demonstrated in this work, these predictions do not yet achieve the level of confidence necessary to eliminate the need for the more elaborate and labor-intensive chemical analysis methods. Furthermore, this genome-based approach does not provide information about the residues involved in the depsi bond, nor can the nature of the acyl chain be predicted. Thus, while analysis of the biosynthetic gene cluster coding for the NRPS represents a complementary asset during the chemical structure elucidation process (Fig. S1), it cannot replace it. Combined application of mass spectrometric (MS) and NMR methods allows, in principle, to establish the composition of the oligopeptide sequence and the nature of the acyl chain as well as the cyclisation site but does not provide access to stereochemical information. For this, chemical derivatization methods combined with chromatographic separation, such as Marfey’s analysis, are typically employed to derive the number and configuration of individual amino acids (Fig. S2) (25–28). However, the total hydrolysis of the lipopeptide into its constituent amino acids required in the process also causes the loss of information regarding their original position in the sequence. Consequently, positional ambiguity will arise when a particular amino acid with multiple occurrences in the sequence is configurationally heterogenous i.e., both the d- and l-configuration occur (26, 29). As a result, the elucidation of stereochemistry remains incomplete, such as is the case for the Pseudomonas CLiPs MDN-0066 (17), PPZPM-1a (30) and lokisin (31), among others. 
While workarounds exist, they are tedious and not error-proof (24, 26). In other cases, configurational analysis is omitted and configurational attribution is claimed by relying on either the conservation of the known pattern of d/l-configurations along a sequence (32, 33), or solely on genomic prediction (34–36). However, the latter ignores the problem of inactive epimerization domains (see above) while the former disregards the natural occurrence of diastereoisomeric lipopeptides having identical constitution but differing in configuration at one (or more) position(s). This is illustrated by, for example, the Pseudomonas CLiP pairs viscosin/WLIP, viscosinamide/pseudodesmin A, syringostatin/syringomycin and fengycin/plipastatin. Thus, new approaches that allow to move the structure elucidation beyond configurational dead-ends remain in high demand. Simultaneously, the increased screening of Pseudomonas strains for CLiP production also calls for the introduction of dereplication approaches that confidently distinguish a known versus a novel CLiP structure early on, this to avoid the costly re-elucidation of previously described compounds (37). In line with this, multiple reports show a growing tendency to characterize known CLiPs from novel sources solely through comparison of their high resolution mass to those of previously characterized ones (32, 33, 38–43). However, these approaches overlook the possibility of isobaric lipopeptides, which occur more commonly than is perhaps anticipated. For instance, several members of the Viscosin group of CLiPs, such as viscosin and massetolide F, have a distinct molecular topology yet the same molecular formula, a situation that also occurs for milkisin and tensin, both members of the Amphisin group. Even lipopeptides from distinct CLiP groups can be isobaric, such as strikingly demonstrated by gacamide/cocoyamide and putisolvin II (44–46). 
Indeed, these share the same C66H115N13O19 molecular formula yet differ in the total number of amino acids (12 versus 11), the number of amino acids involved in the respective macrocycles (5 versus 4) and the constitution of the acyl chain (3R-OH C10:0 versus C6:0). Previously, we noted that the combined 1H and 13C NMR chemical shift fingerprint obtained from 1H-{13C} HSQC spectra is sufficiently unique to differentiate diastereomers of a particular CLiP even when they differ in only a single d/l configuration (47). More so, it is sensitive to the configuration of the 3-hydroxy moiety of the fatty acid (HDA) tail as well (48). Indeed, inverting its configuration is also accompanied by changes in the 1H and 13C chemical shifts of the CHβ and CHα resonances. By integrating NMR spectral fingerprinting and total synthesis into the existing bioinformatic and chemical analyses workflows commonly used for structure elucidation, we show here, for three representative cases, that configurational analysis dead-ends can be removed. First, by matching the 1H-{13C} HSQC spectral fingerprint of newly isolated Pseudomonas CLiPs with that of CLiPs of known stereochemistry obtained from total synthesis, we demonstrated the strength of our approach on the CLiP produced by Pseudomonas azadiae SWRI103T. Next, the general applicability is shown by settling the stereochemistry of orfamide B from Pseudomonas aestus CMR5c and xantholysin A from Pseudomonas mosselii BW11M1, representative members from two other distinct Pseudomonas CLiP groups. In the process, we revise the stereochemistry of several orfamide homologues, including orfamide A from Pseudomonas protegens Pf-5, the original prototype CLiP defining the Orfamide group (49). 
Next, we proceed to illustrate how NMR fingerprinting constitutes a potent tool for CLiP dereplication purposes by resolving configurational dead-ends reported in literature for MDN-0066 (17), orfamides, and xantholysin produced by various bacterial strains, without the need for additional synthesis. Based on this, we advocate the adoption of NMR based fingerprinting by the CLiP research community and initiate establishing a data repository and knowledge database, which will be further developed toward this purpose.

RESULTS

Structure elucidation of a newly isolated Bananamide group member from P. azadiae SWRI103.

(i) Isolation, bioinformatics, and initial chemical structure characterization. P. azadiae SWRI103 was originally collected from the rhizosphere of wheat in Iran, as part of a campaign to assess the plant growth stimulating potential of fluorescent pseudomonads (50). Initial PCR screening targeting initiation and termination domains indicated the presence of a lipopeptide-specific NRPS system (51). More recently, whole genome sequencing revealed a biosynthetic gene cluster (BGC) (52) with similarities to that of several Bananamide (8:6) producers, such as Pseudomonas bananamidigenes BW11P2 (37), Pseudomonas botevensis COW3 (53) and Pseudomonas prosekii LMG26867T (54). Here, the (8:6) refers to the (l:m) notation we introduce to provide a simple but effective classification of CLiPs belonging to the same group from a chemical structure perspective, as explained in “Materials and Methods”. As the stereochemistry of all these bananamides was unknown, we used strain SWRI103 to attempt full structure elucidation, which could only be achieved by the development of our expanded workflow, with all steps in the process described in detail hereafter (Fig. S1). Firstly, analysis of the retrieved BGC predicted a Leu – Asp – Thr – Leu – Leu – Ser – Leu – Ile octapeptide sequence from the associated NRPS (Table 1). The total peptide sequence length and amphipathic profile matches that of previously reported and characterized bananamides (37, 53) but differs in amino acid composition. Therefore, it may represent a novel member of the Bananamide group (8:6). Next, bioinformatic analysis of the condensation/epimerization (C/E) domains was applied for the configurational analysis of the amino acids. The initial Cstart domain is responsible for the incorporation of a fatty acid (FA) moiety at the N-terminus of the peptide, the exact nature of which cannot be predicted. 
This domain is followed by six dual activity C/E domains, responsible for the condensation of the newly recruited l-amino acid to the growing peptide chain along with l to d epimerization of the preceding residue in the sequence (55, 56). This is followed by one LCL-type domain, which lacks the epimerization functionality, thus retaining the l-configuration of the preceding residue while incorporating an l-amino acid as final residue before cyclisation by a tandem of transesterification (TE) domains. Assuming that all dual C/E domains have functional epimerization activity, the combined A- and C-domain analysis predicts a FA – d-Leu – d-Asp – d-aThr – d-Leu – d-Leu – d-Ser – l-Leu – l-Ile sequence as the most likely lipopeptide biosynthesized by P. azadiae SWRI103 (Table 1).
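The prediction rule described above can be sketched in a few lines of Python. This is only an illustration of the stated logic (a C/E domain in a given module epimerizes the residue of the preceding module, an LCL domain does not, and the final residue has no downstream domain); the function name and the string labels for the domain types are assumptions made for this sketch, and it deliberately assumes that every C/E domain is active, which the text cautions is not always the case.

```python
def predict_configuration(c_domains):
    """Predict D/L per residue from the C-domain type of each NRPS module.

    c_domains[i] is the condensation domain of module i + 1.
    The domain of module n decides the configuration of residue n - 1:
    "C/E" epimerizes it to D, anything else (e.g. "LCL") leaves it as L.
    The last residue has no downstream C domain and therefore stays L.
    Assumption: every C/E domain is functional for epimerization.
    """
    config = ["D" if downstream == "C/E" else "L" for downstream in c_domains[1:]]
    config.append("L")  # final residue: no downstream condensation domain
    return config


# Domain architectures as listed in Table 1 (bananamide) and Table 2 (orfamide B)
bananamide_domains = ["Cstart"] + ["C/E"] * 6 + ["LCL"]
orfamide_domains = ["Cstart"] + ["C/E"] * 6 + ["LCL", "LCL", "C/E"]

print(" ".join(predict_configuration(bananamide_domains)))
print(" ".join(predict_configuration(orfamide_domains)))
```

Run on the two domain architectures, the sketch reproduces the "Prediction" rows of Tables 1 and 2. It says nothing about the acyl chain or the ester bond position, which cannot be derived from the C-domain types.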

TABLE 1

Sequence, configurational analysis, and assignment of a novel Bananamide from P. azadiae SWRI103

Bananamide from SWRI103	AA1	AA2	AA3	AA4	AA5	AA6	AA7	AA8
Bioinformatic analysis workflow
A-domain	Leu	Asp	Thr	Leu	Leu	Ser	Leu	Ile
C-domain	C_start	C/E	C/E	C/E	C/E	C/E	C/E	^LC_L
Prediction	D	D	D	D	D	D	L	L
Chemical analysis workflow
NMR analysis	Leu	Glu	Thr	Leu	Leu	Ser	Leu	Ile
Marfey's analysis	L/D	D	D	L/D	L/D	D	L/D	L
Combined analysis and synthesized (8:6) Lx library
	Leu	Glu	Thr	Leu	Leu	Ser	Leu	Ile
(8:6) L1 (1)^a	L	D	D-a	D	D	D	L	L
(8:6) L4 (2)^a	D	D	D-a	L	D	D	L	L
(8:6) L5 (3)^a	D	D	D-a	D	L	D	L	L

The nomenclature of the synthetic compounds is based on the (l:m) notation of the Bananamide group (8:6), followed by the position of the elusive l-Leu in the sequence (position 1, 4 or 5). Underlined amino acids are part of the macrocycle. The bold residues are those that differ from the predicted sequence.

Secondly, and independently from the bioinformatic analysis workflow, we engaged in the chemical analysis workflow for the putative novel bananamide. Following incubation of P. azadiae SWRI103 in M9 minimal salt medium, a single CLiP-containing fraction could be extracted and purified. From high resolution MS analysis, a molecular mass of 1053.67 Da could be established, i.e., within the expected range for an (8:6) CLiP. Next, NMR was used to elucidate the planar structure. That is, combined COSY/TOCSY analysis showed the presence of a single glutamic acid, serine, threonine, isoleucine, and four leucine residues in the peptide sequence while NOESY and 1H-{13C} HMBC experiments independently allowed these to be placed in the same order as predicted from the A-domain analysis, validating the latter (Table 1). Note, however, that while an aspartic acid residue was predicted at position 2 from the bioinformatic analysis, the presence of a glutamic acid was observed here using NMR spectroscopy. This is not surprising, as it was previously shown that Asp/Glu selectivity from genomic predictions is not always clear-cut (37). Additionally, the 1H-{13C} HMBC spectrum showed a clear 3JCH correlation between Thr3 Hβ and Ile8 C’, thereby unambiguously establishing that the ester bond occurs between the hydroxyl side chain of threonine and the isoleucine carboxyl end, thus revealing the six-residue macrocycle expected for a member of the Bananamide group.
Taking into account that C52H92N8O14 was derived as molecular formula from the HR-MS data, a 3-hydroxydecanoic moiety was inferred and confirmed from the NMR data while its linkage to the N-terminus resulted from characteristic 3JCH contacts with Leu1 in the 1H-{13C} HMBC spectrum. As neither MS nor NMR analysis allows establishing the configuration of the individual amino acid residues, we proceeded to Marfey’s method for the analysis of amino acid configuration. Following total hydrolysis of the CLiP, Marfey’s analysis allowed the configuration of all uniquely occurring residues to be determined unambiguously. For the 4 leucines, however, a 2:2 d:l ratio was found, indicating configurational heterogeneity and, as a result, the exact distribution of d- and l-Leu residues within the oligopeptide sequence remained hidden. Therefore, at the end of the chemical analysis workflow, a total of 6 distinct sequences, which all satisfy the 2:2 ratio but differ in the distribution of d- and l-leucines, should be considered. As a result, the definitive elucidation of the stereochemistry and, therefore, the chemical structure reaches a dead-end. When information regarding a CLiP sequence is available from both bioinformatic and chemical analysis workflows, the configurational ambiguity is sometimes resolved by proposing that one of the sequences issued from chemical analysis matches the bioinformatic prediction (Fig. S1). However, such proposals should always be treated with caution since the epimerization functionality of C/E domains can be inactive in a currently unpredictable manner. Moreover, as aptly illustrated by all cases reported here, the predicted configuration from C-domain analysis may not even match those originating from the chemical analysis workflow. Here, for instance, the bioinformatic analysis proposes a 3:1 d:l ratio for the 4 leucines rather than the 2:2 found experimentally.
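As a small aid for checking a molecular formula such as the C52H92N8O14 quoted above against an HR-MS reading, the following Python sketch sums monoisotopic element masses. The helper name, the restriction to C, H, N and O, and the choice to also report a protonated species are assumptions of this sketch; the source does not state which ion or adduct the reported mass refers to, so both values are shown for comparison only.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes; only C, H, N, O are covered here
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.00307401, "O": 15.99491462}
PROTON = 1.00727647  # mass of a proton, used for a hypothetical [M+H]+ ion


def monoisotopic_mass(formula):
    """Sum monoisotopic masses for a simple formula string such as 'C52H92N8O14'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC[element] * (int(count) if count else 1)
    return total


neutral = monoisotopic_mass("C52H92N8O14")
print(f"neutral M: {neutral:.4f}")
print(f"[M+H]+   : {neutral + PROTON:.4f}")
```

Because isobaric lipopeptides share a formula, a matching mass from such a calculation supports a composition but cannot distinguish sequence, topology, or stereochemistry.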
Nevertheless, the bioinformatic analysis can still provide guidance to narrow down the number of possible diastereomeric sequences by noting that the LCL domains, which lack the epimerization functionality, will maintain the l-configuration of the amino acid introduced at the preceding position (see above). Since such LCL domain is present in the final NRPS module of P. azadiae SWRI103, the penultimate Leu7 residue will invariably retain its l-configuration upon recruitment. This narrows down the number of possible sequences from 6 to 3: (8:6)-L1 (1), (8:6)-L4 (2) and (8:6)-L5 (3) (Table 1), depending on which leucine position features the remaining l-configuration. In all cases, the 3-hydroxy fatty acid tail is assumed to be (R)-configured, as is generally observed for Pseudomonas CLiPs. Notwithstanding the combination of bioinformatic and chemical analysis workflows, the stereochemistry remains ill-defined. (ii) Extending the chemical analysis workflow by a synthesis add-on. As recently reviewed by Götze and Stallforth, several approaches exist to tackle the issue of configurational ambiguity (24). In the absence of suitable crystals for X-ray diffraction-based structure elucidation, the most common one resorts to mild acid catalyzed hydrolysis conditions or enzymatic degradation of linearized lipopeptides to yield oligopeptide fragments. These then need to be isolated, characterized by NMR or MS, and subjected to Marfey’s method to identify the correct position of the corresponding d- or l-amino acids, introducing an extensive workload without guarantee that the matter will be settled (26). An alternative solution using MS makes clever use of deuterated amino acids and the fact that d-amino acids are, with very few exceptions, generated during biosynthesis through epimerization from l-amino acids (23, 57). In all cases, a separate analysis should be performed to elucidate the configuration of the 3-hydroxy fatty acid moiety (58). 
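The counting argument for the bananamide leucines (6 candidate sequences from the 2:2 Marfey's ratio, reduced to 3 once Leu7 is fixed as l) can be reproduced with a short enumeration. The function and variable names below are illustrative assumptions; only the leucine positions (1, 4, 5, 7), the 2:2 ratio, and the fixed l-Leu7 come from the text.

```python
from itertools import combinations


def candidate_assignments(positions, n_l, fixed_l=()):
    """List every way to place n_l L-residues over the given positions.

    positions: sequence positions of the configurationally heterogeneous residue
    n_l:       number of L-configured copies found by Marfey's analysis
    fixed_l:   positions known to be L (e.g. those followed by an LCL domain)
    """
    results = []
    for l_set in combinations(positions, n_l):
        if set(fixed_l) <= set(l_set):
            results.append({p: ("L" if p in l_set else "D") for p in positions})
    return results


leu_positions = (1, 4, 5, 7)
all_candidates = candidate_assignments(leu_positions, n_l=2)
narrowed = candidate_assignments(leu_positions, n_l=2, fixed_l=(7,))

print(len(all_candidates), "candidates from Marfey's ratio alone")
for cand in narrowed:
    print(cand)
```

The three assignments that survive the Leu7 constraint correspond to the (8:6)-L1, (8:6)-L4 and (8:6)-L5 sequences of Table 1.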
Here, we rather build on our previously developed total chemical synthesis route for Viscosin (9:7) group CLiPs (48, 59) to synthesize the 3 remaining (8:6)-Lx (x = 1, 4, 5) sequences (Table 1). The total chemical synthesis strategy mostly relies on solid phase peptide synthesis, affording considerable automation and rapid access to multiple homologous sequences through parallel synthesis (48, 59, 60). With one residue less in the macrocycle and a d-Glu2 rather than d-Gln2 as main differences, the synthesis of the (8:6)-Lx sequences could proceed with minimal change to the original strategy. A more extensive discussion of the applied synthesis route and its key features can be found in the Supplementary Materials section. Next, the exact stereochemistry of the natural compound is revealed by matching the NMR spectral fingerprint to those of the synthetic compounds. The NMR matching approach relies on previous observations where, using the 1H-{13C} HSQC experiment, we established that the inversion in the configuration of a single Cα atom introduces prominent changes in the 1H and 13C chemical shifts of the corresponding (C–H)α group, even when the three-dimensional structure of the CLiP was retained (47). The same trend was found for the configuration of the 3-hydroxy fatty acid moiety (48). Thus, the (C–H)α fingerprint region of the HSQC spectra of the natural compound and its synthetic (8:6)-Lx analogue are expected to be identical. In contrast, the 2 other sequences will display clearly distinct HSQC spectra in general and (C–H)α fingerprint regions in particular as they feature 2 d:l inversions compared to the natural compound (Table 1). By individually overlaying the 1H-{13C} HSQC spectra of each synthesized (8:6)-Lx variant with that of the natural compound from P. azadiae SWRI103, a straightforward visual assessment concerning similarities and differences in the 1H and 13C chemical shifts can be made (Fig. 1). 
Because they are the most sensitive reporters of backbone stereochemistry, we focus here on the cross-peaks in the (C-H)α fingerprint regions (Fig. 1). Accordingly, the (8:6)-L4 (2) variant shows an excellent match with the (C–H)α fingerprint of the natural compound since all (C-H)α cross peak pairs belonging to the same residue in the sequence show excellent overlap (Fig. 1A). In contrast, a considerable mismatch exists with (8:6)-L1 (1) as visualized by five clearly non-overlapping cross-peaks (Fig. 1B). This mismatch becomes even more pronounced for (8:6)-L5 (3), where essentially none of the cross-peaks overlap (Fig. 1C). Based on this, we can establish the stereochemistry of the P. azadiae SWRI103 bananamide as identical to that of the (8:6)-L4 (2) sequence (i.e., 3R-OH C10:0 – d-Leu – d-Glu – ), revealing that the C/E domain in the fifth module of the NRPS system is non-functional for epimerization. A more quantitative evaluation of spectral similarity is provided in the supplementary material section.
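One simple way to put a number on such an overlay is a combined 1H/13C root-mean-square deviation over the assigned (C–H)α cross-peaks. The Python sketch below is an assumption-laden illustration and not the evaluation used in the supplementary material: the scoring function, the 0.1 scaling factor for 13C differences, and all chemical shift values are hypothetical placeholders chosen only to show the mechanics.

```python
import math


def fingerprint_rmsd(reference, query, carbon_weight=0.1):
    """Combined 1H/13C RMSD (ppm) over residues assigned in both fingerprints.

    Each fingerprint maps a residue label to its (1H, 13C) alpha shifts in ppm.
    carbon_weight scales 13C differences to the narrower 1H range (assumed value).
    """
    shared = sorted(set(reference) & set(query))
    if not shared:
        raise ValueError("fingerprints share no assigned residues")
    squared = 0.0
    for residue in shared:
        d_h = reference[residue][0] - query[residue][0]
        d_c = (reference[residue][1] - query[residue][1]) * carbon_weight
        squared += d_h * d_h + d_c * d_c
    return math.sqrt(squared / len(shared))


# Hypothetical shift values, NOT measured data
natural = {"Leu1": (4.10, 54.0), "Glu2": (4.00, 56.5), "Thr3": (4.40, 60.0)}
variant_a = {"Leu1": (4.11, 54.1), "Glu2": (4.00, 56.4), "Thr3": (4.39, 60.0)}
variant_b = {"Leu1": (4.35, 52.8), "Glu2": (4.20, 55.1), "Thr3": (4.62, 58.9)}

for name, variant in (("variant_a", variant_a), ("variant_b", variant_b)):
    print(name, round(fingerprint_rmsd(natural, variant), 3))
```

A lower score indicates closer overlap; any threshold separating "match" from "mismatch" would need to be calibrated on spectra recorded under identical solvent and temperature conditions.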

FIG 1

Comparison of the 1H-{13C} HSQC (CH)α fingerprints of the various synthetic (8:6)-Lx variants (blue) with that of the natural bananamide compound produced by P. azadiae SWRI103 (black) recorded in acetonitrile-d3 at 700 MHz, 303K. (A) Matching (8:6)-L4 variant, (B) (8:6)-L1 variant, and (C) (8:6)-L5 variant. (D) shows the spectrum of the natural compound but now recorded in DMSO-d6 at 298K (black) overlaid with a schematic representation of the spectrum of MDN-0066 which was recorded under identical conditions, generated from the tabulated chemical shift data in the original report (17).

FIG 2

Comparison of the 1H-{13C} HSQC (CH)α fingerprint of synthetic (10:8)-Lx variants (blue) with that of the natural compound extracted from P. aestus CMR5c, all recorded in DMF-d7 at 500 MHz and 298K. (A) Overlay with the synthetic (10:8)-L1 variant (blue) and (B) the synthetic (10:8)-L5 variant (blue). A more quantitative evaluation of spectral similarity is provided in the supplementary material section.

In addition, to avoid any confusion regarding the absolute configuration of the 3-hydroxy decanoic acid, an analogue identical to L4 (2), but with a 3-(S)-hydroxydecanoic acid as fatty tail was synthesized as well (compound 2b). The overlay of this peptide with the natural compound revealed that inverting the configuration at this position clearly affects the fingerprint region (supplementary information, S19). Indeed, the HSQC cross peak of the CHβ of the fatty acid is shifted to the largest extent while additional shifts were also noticed at the level of the CHα cross-peaks of Leu1, Glu2, and Thr3. This not only confirms the possibility to discriminate the configuration at this position, but also the use of NMR based fingerprinting for configurational assignment. (iii) Matching our reference compound with literature data. The effort to elucidate the stereochemistry of the P. azadiae SWRI103 (8:6) bananamide also made it possible to settle that of MDN-0066, an (8:6) CLiP produced by Pseudomonas granadensis F-278,770T, which showed distinct bioactivity in a renal carcinoma cell model (17). Using a chemical analysis methodology similar to the one described above, Cautain et al. (17) elucidated the peptide sequence by relying on MS analysis and NMR spectral assignment. However, the detailed stereochemical elucidation of MDN-0066 was left incomplete as Marfey’s analysis revealed configurational heterogeneity given the presence of 2 d-Leu and 2 l-Leu, a result similar to that of the P. azadiae SWRI103 bananamide. Using the tabulated 1H and 13C NMR chemical shifts reported for MDN-0066 in DMSO-d6, the spectral fingerprint matching could be performed against the data of the SWRI103 bananamide recorded under identical conditions. The result (Fig. 1D) shows a near identical match with (8:6)-L4 (2). This proves that MDN-0066 is identical to bananamide SWRI103, thereby disambiguating the stereochemistry of MDN-0066 and illustrating the potential of our approach for dereplication purposes. In order to assess the general applicability of our approach to Pseudomonas CLiPs, we then turned to 2 additional case studies, involving the stereochemical elucidation of orfamide B and xantholysin A.

Configuration elucidation of (10:8) orfamide B from P. aestus CMR5c.

Orfamides are important for bacterial motility, cause lysis of oomycete zoospores, and play a role in biocontrol activity against fungal pathogens and insects (6, 49, 61–63). The namesake of the Orfamide (10:8) group, orfamide A, was extracted from Pseudomonas protegens Pf-5 and fully characterized, including its complete stereochemistry, by mass spectrometry, NMR spectroscopy and chiral gas chromatography (49). Orfamides B and C were extracted as minor compounds, and their planar structures were characterized as well. Since its original discovery, orfamide B has often been found as the major compound produced by newly isolated bacterial strains (35). Additionally, the Orfamide group has expanded with further orfamide homologues (30, 35, 64, 65). However, in many cases, the stereochemistry of orfamide B or that of other orfamide homologues from newly isolated bacterial sources remained unconfirmed as it was derived from sequence similarity with the original orfamide A as extracted from P. protegens Pf-5. To unlock conformational analysis and structure-activity evaluations for the Orfamide group, we therefore proceeded to an explicit stereochemical verification. (i) Isolation, bioinformatics, and initial chemical structure characterization of orfamide B. P. aestus CMR5c was originally isolated from the rhizosphere of red cocoyam in Cameroon in a screen for biocontrol agents against the cocoyam root rot disease caused by Pythium myriotylum (11). We already reported the initial genome mining and bioinformatic analysis of the P. aestus CMR5c BGC, which revealed the presence of 3 genes with ~80% similarity to the ofaA, ofaB, and ofaC NRPS genes from P. protegens Pf-5 (35). MS and NMR analysis further confirmed the predicted primary sequence and evidenced the incorporation of a 3-hydroxy-tetradecanoic acid moiety at the N-terminus. Thus, the major CLiP produced by P. aestus CMR5c has a planar structure identical to the originally published orfamide B.
Based on this similarity in primary sequence and the genomic similarity between the respective BGCs, orfamide B from P. aestus CMR5c was proposed to also possess only l-leucines (35), as was previously also postulated for the original orfamide A (49). C-domain analysis showed that the initial Cstart type domain involved in acylating the first amino acid is followed by 6 dual activity C/E domains (modules 2–7), 2 non-epimerizing LCL type domains (modules 8 and 9), and a final C/E domain in the last module (Table 2). Taking into account the distribution of LCL and C/E domains, the bioinformatic analysis predicts d-Leu residues at positions 1 and 5 (Table 2), contradicting the all-l-Leu configuration proposed earlier. Marfey’s analysis confirmed the configuration predictions for singly occurring amino acids and the 1:1 d:l ratio for both valines. For the leucines, however, a 1:3 d:l ratio was found, invalidating the 2:2 ratio predicted from bioinformatic analysis as well as the all-l configuration proposed by Ma et al. (35).

TABLE 2

Sequence, configurational analysis, and assignment of orfamide B from P. aestus CMR5c

CMR5c	AA¹	AA²	AA³	AA⁴	AA⁵	AA⁶	AA⁷	AA⁸	AA⁹	AA¹⁰
Bioinformatic analysis workflow
A-domain	Leu	Glu	aThr	Val/Ile	Leu	Ser	Leu	Leu	Ser	Val
C-domain	Cstart	C/E	C/E	C/E	C/E	C/E	C/E	^LC_L	^LC_L	C/E
Prediction	D	D	D	D	D	D	L	L	D	L
Proposal Ma et al. 2016 (35)	L	D	D	D	L	D	L	L	D	L
Chemical analysis workflow
NMR analysis	Leu	Glu	Thr	Val	Leu	Ser	Leu	Leu	Ser	Val
Marfey’s analysis	L/D	D	D-a	L/D	L/D	D	L/D	L/D	D	L/D
Synthesized Lx(10:8) library
	Leu	Glu	Thr	Val	Leu	Ser	Leu	Leu	Ser	Val
(10:8)-L1 (4)^a	L	D	D-a	D	D	D	L	L	D	L
(10:8)-L5 (5)^a	D	D	D-a	D	L	D	L	L	D	L
(10:8)-L1L5 (6)	L	D	D-a	D	L	D	L	L	D	L

The nomenclature of the synthetic compounds is based on the (l:m) notation of the Orfamide group (10:8), followed by the position of the elusive l-Leu in the sequence (position 1, 5 or 1&5). The underlined residues in the tables indicate the position of the macrocycle. The bold residues are those that differ from the predicted sequence.
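The rule behind the "Prediction" row of Table 2 can be illustrated with a minimal Python sketch. It assumes, as the text does, that a residue is predicted to be d-configured when the condensation domain of the next module is of the dual C/E type, and l-configured when that domain is LCL or when the residue is the last one. The function names and the plain-text domain labels ("Cstart", "C/E", "LCL") are illustrative choices, not part of any existing tool.

```python
# Illustrative sketch only: names and labels are assumptions made for this example.

def predict_configuration(c_domains):
    """Predict D/L per residue; c_domains[i] is the C-domain type of module i+1."""
    n = len(c_domains)
    config = []
    for residue in range(n):
        if residue == n - 1:
            # The final residue has no downstream C domain, so it is not epimerized.
            config.append("L")
        else:
            downstream = c_domains[residue + 1]
            config.append("D" if downstream == "C/E" else "L")
    return config


def inactive_modules(predicted, observed):
    """Modules (1-based) whose C/E domain would have to be epimerization-inactive."""
    return [
        i + 2  # residue i+1 is epimerized by the C/E domain of module i+2
        for i, (p, o) in enumerate(zip(predicted, observed))
        if p == "D" and o == "L"
    ]


# C-domain row of Table 2 (orfamide B, P. aestus CMR5c).
ORFAMIDE_B_C_DOMAINS = ["Cstart", "C/E", "C/E", "C/E", "C/E", "C/E", "C/E",
                        "LCL", "LCL", "C/E"]

predicted = predict_configuration(ORFAMIDE_B_C_DOMAINS)
print("".join(predicted))  # DDDDDDLLDL, the Prediction row of Table 2

# Configuration row of the synthetic (10:8)-L1 (4) sequence in Table 2.
L1_SEQUENCE = list("LDDDDDLLDL")
print(inactive_modules(predicted, L1_SEQUENCE))  # [2]
```

Comparing the prediction with any row of the synthesized library in this way flags the module whose C/E domain would have to lack epimerization activity for that row to be the natural product.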

(ii) Stereochemical elucidation using chemical synthesis and NMR fingerprint analysis. Considering the experimental evidence from the chemical analysis workflow, a total of 8 sequences should effectively be considered, since the configurational d:l heterogeneity of the Leu residues (4 possible positions for the single d-Leu) and that of the Val residues (2 possible positions for the single d-Val) are combinatorially independent. Conveniently, the bioinformatic analysis allows this to be trimmed down to only 2 sequences, strongly reducing the synthetic effort required. Indeed, the ambiguity regarding the valines can be settled by noting that, as the final residue, Val10 is not subjected to any epimerization activity. This pins the valine configurations down as d-Val4 and l-Val10. Next, the presence of LCL domains in modules 8 and 9 allows the l-configuration to be unequivocally attributed to Leu7 and Leu8. The remaining d- and l-configured leucines are to be distributed over positions 1 and 5. While this constitutes an apparent dead-end, configurational assignment could be finalized using fingerprint matching of the natural compound against the (10:8)-L1 (4) and (10:8)-L5 (5) sequences obtained by synthesis (Table 2), using the same strategy as discussed above for the bananamide analogues. This included the incorporation of a 3R-hydroxydecanoic moiety (3-OH C10:0) at the N-terminus rather than a 3R-hydroxytetradecanoic one (3-OH C14:0), mainly because of synthetic availability of the precursor. 
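The count of 8 candidate sequences and its reduction to 2 can be reproduced with a short enumeration. The sketch below is illustrative only: it takes the Marfey's ratios reported for orfamide B (one d-Leu among the four leucines, one d-Val among the two valines) and the three bioinformatic constraints named in the text (l-Leu7, l-Leu8, l-Val10). The helper name `assignments` is hypothetical.

```python
from itertools import combinations, product

def assignments(positions, n_d):
    """Yield every way to place n_d D-residues over the given sequence positions."""
    for d_positions in combinations(positions, n_d):
        yield {p: ("D" if p in d_positions else "L") for p in positions}

LEU_POSITIONS = (1, 5, 7, 8)   # Marfey's analysis: d:l = 1:3
VAL_POSITIONS = (4, 10)        # Marfey's analysis: d:l = 1:1

# Leu and Val heterogeneity are independent, so the candidates multiply: 4 x 2 = 8.
candidates = [
    {**leu, **val}
    for leu, val in product(assignments(LEU_POSITIONS, 1),
                            assignments(VAL_POSITIONS, 1))
]
print(len(candidates))  # 8

# Constraints from the NRPS analysis: Val10 is the final residue, and the LCL
# domains of modules 8 and 9 leave Leu7 and Leu8 in the l-configuration.
FIXED_L = (7, 8, 10)
shortlist = [c for c in candidates if all(c[p] == "L" for p in FIXED_L)]
print(len(shortlist))  # 2

# Position of the single d-Leu in each remaining candidate.
d_leu_positions = sorted(p for c in shortlist for p in LEU_POSITIONS if c[p] == "D")
print(d_leu_positions)  # [1, 5]
```

The two surviving candidates differ only in whether the d-Leu sits at position 1 or 5, which are the (10:8)-L5 (5) and (10:8)-L1 (4) sequences of Table 2, respectively.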
Previous investigation of C10, C12 and C14 pseudodesmin analogues evidenced that lengthening of the acyl chain did not affect the 1H and 13C chemical shifts of the peptide moiety in any way (60). Fig. 2 shows the overlay of the (C–H)α fingerprint region of each synthesized (10:8)-Lx sequence with that of natural orfamide B from P. aestus CMR5c. A straightforward visual assessment clearly indicates that the (10:8)-L1 (4) variant displays a near-identical fingerprint match, while none of the (C–H)α cross-peaks of (10:8)-L5 (5) overlap with those of the CMR5c orfamide. The latter probably indicates a major conformational effect caused by the l to d inversion at Leu5. To remove any doubts and independently exclude the all-l-Leu configuration originally proposed from sequence similarity with orfamide B from P. protegens Pf-5, we also committed to synthesizing the corresponding (10:8)-L1L5 (6) analogue, which again showed significant mismatch with the natural CMR5c orfamide (Fig. S38). All data together establish that the stereochemistry of orfamide B from P. aestus CMR5c corresponds to that of the (10:8)-L1 (4) sequence (3R-OH C14:0 – l-Leu – d-Glu – ), indicating the C/E domain in the second module is non-functional for epimerization. (iii) Stereochemical reassessment and dereplication of the Orfamide group using literature NMR data. The results presented above appear to indicate that orfamide B from P. aestus CMR5c is a d-Leu5 diastereoisomer of orfamide B from P. protegens Pf-5 (49), since the latter is reported to possess an l-Leu5. So far, reports of CLiPs from the same (l:m) group that feature a configurational difference are few, possibly due to the lack of extensive configurational assignment among Pseudomonas CLiPs. The best-known example occurs within the Viscosin (9:7) group, where CLiPs can be divided into an l-subgroup (14 sequences) and a d-subgroup (5 sequences) depending on the configuration of the leucine, also at position 5 (66). 
Given that orfamides have been reported from multiple bacterial sources since their initial discovery, the question arose as to whether such a division is also present for the Orfamide (10:8) group. Settling this matter would allow assessing whether stereochemical variation within a group is a more general feature present in the NRPS of Pseudomonas CLiPs. Conveniently, the major orfamides reported from other bacterial sources in literature feature planar structures identical to either orfamide A from P. protegens Pf-5 or orfamide B from P. aestus CMR5c, thus allowing the use of NMR fingerprinting for stereochemical evaluation. To be able to screen directly against a reference spectrum of orfamide A under standardized conditions, P. protegens Pf-5 was cultured to obtain orfamide A, together with a series of minor compounds including orfamide B and 4 previously unreported ones, which we named orfamides J – M (see supplementary information). For the orfamide B producer Pseudomonas sessilinigenes CMR12a (67), the 1H-{13C} HSQC spectral fingerprint proved identical to the spectrum of the orfamide B reference from P. aestus CMR5c. This indicated that these molecules have identical stereochemistry and feature a d-Leu5. Next, we turn to Pseudomonas sp. PH1b, where genome exploration revealed an orfamide-type BGC (68). Characterization of orfamides from this isolate presents an interesting case as it produces a major and a minor orfamide with sequences identical to orfamide A and orfamide B, respectively, thus only differing by an Ile4/Val4 substitution resulting from A-domain substrate flexibility in module 4 of the NRPS. The (C–H)α spectral fingerprint of its major orfamide matched with that of orfamide A produced by P. protegens Pf-5, indicating an l-configuration for Leu5. Surprisingly, however, the spectrum of the minor orfamide of Pseudomonas sp. PH1b matched with that of orfamide B from P. aestus CMR5c, therefore indicating a d-Leu5 configuration. 
This result was highly unexpected as it implies that in Pseudomonas sp. PH1b, the same NRPS assembly line would yield different configurations for Leu5. More specifically, it would require the occurrence of epimerization by the C/E domain of module 6 to correlate with the sequence composition of the growing peptide chain as recruited by module 4. Since the all-l Leu configuration of the original orfamide A from P. protegens Pf-5 was derived from extensive chemical analysis but not confirmed through total synthesis, it appeared more likely that an error was made in establishing the configuration at this position. To settle this matter, we reinvestigated the stereochemistry of the original orfamide A, as extracted from P. protegens Pf-5. In contradiction with the report by Gross et al., where chiral GC revealed a 0:4 d:l ratio for the leucines, our Marfey’s analysis yielded a 1:3 d:l ratio, definitively settling the case in favor of a d-Leu5. As a result, in this case, configurational variability within one NRPS assembly line can be invalidated, and a revision is required for the configuration of Pf-5 based orfamides as shown in Table 2. Gratifyingly, this outcome was recently also arrived at using a completely independent approach involving total synthesis of orfamide A by Bando et al. (69), whereby the original stereochemistry led to loss of green algal deflagellation activity, while the corrected one was as active as the natural compound (70). Finally, we could establish that the orfamide produced by Pseudomonas sp. F6 (8) is identical to the revised orfamide A, by only making use of the tabulated chemical shift values of the published compound, and matching this to our reference spectrum of orfamide A (Fig. S43).
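The comparison of tabulated chemical shifts against a reference fingerprint, as used above for the Pseudomonas sp. F6 orfamide, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the procedure used in this work: the tolerances (0.05 ppm for 1H, 0.5 ppm for 13C) are arbitrary placeholders, the residue labels and all chemical shift values are invented for the example, and `Peak` and `compare_fingerprints` are hypothetical names.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Peak:
    h: float  # 1H chemical shift of the (C-H)alpha cross-peak, in ppm
    c: float  # 13C chemical shift of the (C-H)alpha cross-peak, in ppm

def compare_fingerprints(sample, reference, tol_h=0.05, tol_c=0.5):
    """Compare assigned (C-H)alpha peaks residue by residue.

    Returns the per-residue absolute deviations and the residues whose
    deviation exceeds either tolerance (both tolerances are assumptions).
    """
    if sample.keys() != reference.keys():
        raise ValueError("fingerprints must cover the same residues")
    deviations = {}
    for residue in sample:
        dh = abs(sample[residue].h - reference[residue].h)
        dc = abs(sample[residue].c - reference[residue].c)
        deviations[residue] = (round(dh, 3), round(dc, 3))
    mismatched = [r for r, (dh, dc) in deviations.items() if dh > tol_h or dc > tol_c]
    return deviations, mismatched

# Invented shift values, for illustration only.
reference = {"Leu1": Peak(4.30, 54.1), "Glu2": Peak(4.15, 56.0), "Leu5": Peak(4.45, 53.2)}
candidate_a = {"Leu1": Peak(4.31, 54.2), "Glu2": Peak(4.15, 56.1), "Leu5": Peak(4.44, 53.2)}
candidate_b = {"Leu1": Peak(4.52, 52.8), "Glu2": Peak(4.28, 55.1), "Leu5": Peak(4.10, 55.9)}

_, mismatch_a = compare_fingerprints(candidate_a, reference)
_, mismatch_b = compare_fingerprints(candidate_b, reference)
print(mismatch_a)  # []
print(mismatch_b)  # ['Leu1', 'Glu2', 'Leu5']
```

An empty mismatch list corresponds to the near-identical overlay described for a matching diastereomer, whereas widespread mismatches correspond to the case where no cross-peaks overlap. Meaningful tolerances would have to be derived from spectra recorded under the same standardized conditions.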

Settling the elusive stereochemistry of xantholysin A.

An additional example illustrating the strength of our combined synthesis and spectral matching approach concerns xantholysin A. First reported as the main CLiP produced by Pseudomonas mosselii BW11M1, together with minor congeners B to D, it is the prototype that defines the Xantholysin (14:8) group (71). Since no configurational analysis was performed at the time, NMR and MS analysis only provided the planar structure. Subsequent reports proposed the production of xantholysins by other Pseudomonas strains, uncovering a diverse portfolio of biological functions in the process. Pascual et al. (16) reported the production of xantholysins A to D by Pseudomonas soli F-279,208T solely based on HR-MS of the isolates and characterized their cytotoxic activity against the RCC4 kidney carcinoma cell line. Lim et al. (9) reported HR-MS and 1H NMR data identical to those of xantholysin A and B for two lipopeptides from Pseudomonas sp. DJ15 and demonstrated their insecticidal activity against the green peach aphid, a major peach tree pest also serving as plant virus vector. In the context of exploring the tolerance of cocoyam plants against Pythium myriotylum in the field, Oni et al. (44) showed that Pseudomonas sp. COR51 produces a lipopeptide with a planar structure identical to that of xantholysin A. Xantholysin A was also extracted and characterized from Pseudomonas xantholysinigenes in the framework of the development of a diagnostic bioinformatics tool that allows the assignment of a CLiP to a particular lipopeptide group based on the phylogeny of the MacB transporter of its producing bacterium (54). In other work, a large-scale bacterial screening for antibiotic activity proposed the identification of xantholysins A-D in Pseudomonas sp. 250J, based on HR-MS data and high similarity in the NRPS genomic make-up to that of P. mosselii BW11M1 (72). 
Later on, the combination of Marfey’s analysis with mapping of the C/E and LCL domains in its NRPS led to a proposed stereochemistry for xantholysin A, although the distribution of the remaining d-Leu and l-Leu over positions 9 and 11 remained tentative (73). Shimura et al. (74) reported the total synthesis of MA026, a xantholysin-like CLiP from Pseudomonas sp. RtlB026 with potent anti-hepatitis C activity. Discrepancies in physicochemical properties of synthetic and natural MA026 led Uchiyama et al. (75) to revise its structure by exchanging the residues at positions 10 and 11, as previously suggested by Li et al. (71), thus leading to a planar structure identical to xantholysin A. With more than 90% sequence similarity between their respective BGCs, it was proposed that the stereochemistry of xantholysin A would be identical to that of MA026. However, Uchiyama et al. (75) also noted that the stereochemical assignment for xantholysin A from Pseudomonas sp. 250J did not match that of MA026, the latter containing l-Gln6 and d-Gln13, while the reverse configuration was proposed for the former. To resolve this ongoing characterization issue, we first considered applying our combined approach to the original xantholysin A from P. mosselii BW11M1. However, the configurational heterogeneity of the leucine (d:l 2:3) and glutamine/glutamic acid (Glx) residues (d:l 4:1) independently acts to generate 50 possible sequences, which can be trimmed down to 15 sequences when taking into account the genomic data. No further prioritization is possible after combining the chemical and genomic data. Comparison of the (C–H)α fingerprint of MA026 (with known stereochemistry) to that of the originally isolated xantholysin A should in principle make it possible to either quickly establish configurational identity or eliminate one sequence. However, the NMR data for MA026 were listed without explicit assignment. 
To circumvent this, we adapted our own synthesis scheme to produce MA026 and subsequently established that its (C–H)α fingerprint is indeed identical to that of the original xantholysin A (Fig. 3), thus avoiding the need for further synthesis. With MA026 used as the reference compound, simple comparison of its (C–H)α fingerprint with that of putative xantholysin A lipopeptides also allowed the assigned stereochemistry to be extended to the major xantholysin extracted from Pseudomonas sp. COR51 (44), P. xantholysinigenes (54), and Pseudomonas sp. 250J (73), for which the NMR data were available or provided for comparison by the original authors (Fig. S57, S58, and S59). In conclusion, the structure of xantholysin A was determined to be 3R-OH C10:0 – l-Leu – d-Glu – d-Gln – d-Val – d-Leu – l-Gln – . This reveals that two modules (2 in XtlA and 7 in XtlB) lack the epimerization capability expected for the respective C/E-classified domains.

FIG 3

Comparison of the 1H-{13C} HSQC (C-H)α fingerprints of natural xantholysin A (14:8) produced by P. mosselii BW11M1 (black) with that of MA026 obtained through synthesis (blue). Both peptides were measured under identical conditions (DMF-d7, 328K, 700 MHz).

DISCUSSION

The discovery and development of new antimicrobial agents and therapies is an important avenue to tackle the rising global health threat caused by antimicrobial resistance (76, 77). Because of their broad range of antimicrobial activities and wide diversity of structures, Pseudomonas CLiPs have drawn attention as an interesting class of compounds for further exploration in this respect (35, 60, 78, 79). To engage in structure-activity-relationship (SAR) studies, complete elucidation of their structure, including stereochemical make-up, is of vital importance to identify meaningful structure-activity associations (34, 59) and uncover their mode of action. Unfortunately, incomplete structure characterization increasingly accompanies the rising number of these CLiPs isolated from novel bacterial sources. Mass spectrometric (MS) data, genomic predictions, or a combination thereof yield insufficient structural data by themselves to arrive at a single, unambiguous structure. In particular, the comparative analysis with pre-existing data to establish structural novelty or identity with existing CLiPs is not approached critically enough to avoid erroneous attributions. Consequently, discovery efforts may generate a narrowed view on lipopeptide structural diversity due to erroneous assignment of de facto new structures to previously described ones. In turn, this clouds the comparative analysis and interpretation of antimicrobial activities of newly extracted compounds with already existing ones and threatens to lead subsequent development efforts astray. As natural product mining efforts aimed at Pseudomonas strains amplify, the chance of rediscovery of existing CLiPs increases as well. As a result, complete characterization including the labor-intensive determination of stereochemistry, while essential, may not provide a ‘return-on-investment’, as much effort might be spent on an already characterized compound (Fig. 4A).

FIG 4

The stereochemical analysis of CLiPs requires a series of analysis steps. (A) Chemical analysis steps typically required before the implementation of the NMR fingerprint matching methodology described here. (B) Using NMR fingerprint matching, a single analysis step is required for identity confirmation as well as stereochemical validation. (C) Different levels of stereochemical validation of CLiPs.

As part of a concerted research effort aiming to map the structural and conformational diversity of Pseudomonas CLiPs, we extended the existing characterization workflows by a combined synthesis and NMR fingerprint matching approach. Simultaneously, it allowed us to demonstrate and validate the innate dereplication potential of the latter by a simple screening of the NMR fingerprint of newly isolated CLiPs. First, we established that the CLiP extracted from the newly isolated producer P. azadiae SWRI103 belongs to the Bananamide (8:6) group and subsequently resolved a configurational dead-end to reveal its stereochemistry. By utilizing our NMR spectral matching methodology, it was subsequently found to be identical to MDN-0066 produced by the P. granadensis type-strain, thereby settling the stereochemistry of this compound as well (17). Using the same approach, we determined that a structure revision is required for several, if not all, Orfamide (10:8) group members. While initially only poised to establish the stereochemical make-up of orfamide B as extracted from P. aestus CMR5c on a firmer basis, we showed that this compound possesses a d-Leu5, instead of an l-Leu5 as previously presumed and published (35, 49). By comparing the 1H-{13C} HSQC spectral fingerprints of a collection of CLiPs produced by P. protegens Pf-5, P. sessilinigenes CMR12a, Pseudomonas sp. F6, and Pseudomonas sp. PH1b that share the orfamide sequence but have unreported stereochemistry, we confirmed the presence of a d-Leu5 in all cases, proving that one and the same compound is produced by diverse strains from a variety of environments. Given that an l-Leu5 was now confined to the originally reported orfamide A only, NMR fingerprint matching allowed us to convincingly assert that here also, a d-Leu5 should be considered. Moreover, it was determined that all orfamide members possess a 3R-hydroxy-fatty acid tail, in contrast to earlier literature reports asserting the presence of a 3S-hydroxy-fatty acid tail (49, 58). Finally, we determined the stereochemistry of xantholysin A as produced by a variety of sources. Notably, with 15 possible sequences resulting from the combination of information obtained from both genomic and chemical analysis protocols, a considerable synthetic effort would have been required for the elucidation of this lipopeptide’s stereochemistry. Instead, we utilized available literature data to prioritize possible sequences, resulting in only a single preferred sequence to be initially synthesized, thereby solving the stereochemistry of xantholysin A (as produced by various bacterial strains; using their reported literature data) and MA026. From the above, it is clear that the NMR fingerprint approach introduced in this work has the potential to support researchers in dereplication, i.e., has this particular lipopeptide been isolated before? By matching NMR spectra of a CLiP from a newly isolated bacterial source with those of existing (reference) CLiPs, one can determine whether they are identical or not. Importantly, this is possible irrespective of the stereochemical characterization of the reference CLiPs and does not require any synthesis or extensive characterization effort. 
In fact, it can be performed at a relatively early stage following isolation and purification, as it only requires a 1H-13C fingerprint, which is preferably but not necessarily assigned. Indeed, in case of a perfect match, identity is ensured, while small but notable differences may motivate further characterization using the workflow as described here (Fig. S1). It is important to note that total synthesis is not a prerequisite for dereplication of a newly isolated CLiP. In other words, the (C–H)α NMR fingerprint matching only requires the comparison of NMR spectra, or of tabulated 1H and 13C chemical shift values, to assess structural similarity to a reference CLiP, which may arise from natural sources or be a synthetic compound issued from structure elucidation efforts such as described in this work. Provided the required data is collected and made accessible, a fast screening method becomes available early on in the discovery process, potentially eliminating additional labor-intensive analyses, especially when it concerns CLiPs with a configurational heterogeneity of the amino acid content. Only when a genuinely new CLiP structure is found and/or no reference compound is available is additional analysis required. In such cases, we have shown that genomic and chemical methods, in combination with solid-phase peptide synthesis, can provide unambiguous characterization of CLiP structures. Application of NMR-based dereplication as introduced in principle here should permit an improved mining of NRPSs by providing reference data for bioinformatics software suites, thereby improving genomic prediction of CLiP structures. 
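Because the fingerprint is "preferably but not necessarily assigned", a dereplication screen can in principle work on unassigned peak lists. The following Python sketch illustrates one simple way to do so and is not the procedure used in this work: it pairs each sample peak with the closest unused reference peak (a greedy nearest-neighbour pairing, chosen here for brevity) and reports the reference compounds for which every pair falls within a tolerance. The tolerances, the library layout, the compound names, and all chemical shift values are assumptions made for the example.

```python
import math

def scaled_distance(p, q, tol_h, tol_c):
    """Distance between two (1H, 13C) peaks in units of the tolerances."""
    return math.hypot((p[0] - q[0]) / tol_h, (p[1] - q[1]) / tol_c)

def fingerprints_match(sample, reference, tol_h=0.05, tol_c=0.5):
    """Greedy one-to-one pairing of unassigned peaks; True if all pairs are close."""
    if len(sample) != len(reference):
        return False
    unused = list(reference)
    for peak in sample:
        best = min(unused, key=lambda q: scaled_distance(peak, q, tol_h, tol_c))
        if scaled_distance(peak, best, tol_h, tol_c) > 1.0:
            return False
        unused.remove(best)  # each reference peak may be used only once
    return True

def dereplicate(sample, library, tol_h=0.05, tol_c=0.5):
    """Return the names of all reference fingerprints that match the sample."""
    return [name for name, ref in library.items()
            if fingerprints_match(sample, ref, tol_h, tol_c)]

# Invented reference library: name -> list of (1H ppm, 13C ppm) peaks.
library = {
    "reference_A": [(4.30, 54.1), (4.15, 56.0), (4.45, 53.2)],
    "reference_B": [(4.52, 52.8), (4.28, 55.1), (4.10, 55.9)],
}

known_isolate = [(4.44, 53.3), (4.31, 54.0), (4.16, 56.1)]
unknown_isolate = [(4.00, 50.0), (4.60, 58.0), (4.20, 60.0)]

print(dereplicate(known_isolate, library))    # ['reference_A']
print(dereplicate(unknown_isolate, library))  # []
```

An empty result only suggests novelty to the extent that the reference collection is comprehensive, and a greedy pairing can fail for crowded spectra where an optimal assignment would succeed, so a real implementation would likely need a more careful matching step.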
As previously observed for Pseudomonas lipopeptides from the Viscosin, Poaeamide, Tolaasin, Factin, Peptin, and Mycin families (29, 80) and further evidenced in this report for CLiPs from 3 additional groups (Bananamide, Orfamide, Xantholysin), the bioinformatic prediction of epimerization functionality in the non-ribosomal biosynthesis of CLiPs remains inaccurate. In each of these groups, at least 1 module appears catalytically inactive for epimerization. Only in the Amphisin (11:9) and Gacamide/Cocoyamide (11:5) groups do the predictions of dual activity condensation domains match with the elucidated CLiP stereochemistry. It can be anticipated that ensuring availability of unambiguous stereochemical information and scrutinizing corresponding C/E domain sequences for motifs linked to epimerization (in)activity (22), ideally complemented with insight from currently unavailable 3D structural C/E domain data, should allow an improved genomic prediction of the affected amino acid configurations.

Conclusion.

Incomplete (or erroneous) structural characterization currently generates ‘scientific noise’ in the field of lipopeptide research, hampering a concerted research action for the generation of structure-activity relationships (18). The (C–H)α NMR fingerprint matching approach as described here can counter this by offering a rapid, unambiguous characterization of CLiP structures. By standardizing the relevant experimental conditions as much as possible and by making reference data publicly available, a rapid elucidation of the complete structure of newly extracted lipopeptides, including the stereochemical make-up of both the oligopeptide and the fatty acid moieties, comes within reach (Fig. 4). In addition, our user-friendly NMR matching methodology allowed us to quickly supplement incomplete literature and characterization data with NMR fingerprint data, thereby elucidating the complete structure (including stereochemical make-up) of previously extracted CLiPs (Fig. S2). While limited to Pseudomonas CLiPs, our results support (C–H)α NMR fingerprint matching as a tool for the dereplication of CLiPs beyond this genus, such as those produced by other bacterial genera, including Bacillus, Burkholderia, and Streptomyces spp. Indeed, without resorting to synthesis or elaborate chemical analysis, the identity of novel CLiPs from natural sources with previously, even only partially, characterized ones can be established through matching of their spectra. Obviously, access to the desired NMR data for a sufficiently large collection of CLiPs, preferably with annotated fingerprints, is essential for this approach to become of use as a dereplication tool for the research community. Only then can the lack of a match be interpreted as indicative of compound novelty with high confidence. 
While awaiting the development of a (dedicated) NMR data sharing and dereplication platform – most likely by integration of the reference spectra into existing (lipopeptide) databases, such as Norine (81, 82) – a publicly available NMR data repository accompanied by a ‘knowledge base’ has been made available at https://www.rhizoclip.be. It contains an overview of downloadable (C–H)α NMR fingerprint data of characterized CLiPs (both original spectra and tabulated chemical shift values), together with literature data on the originally determined structures. The latter includes each CLiP’s original description, molecular mass, three-dimensional structures (if available), and a summary of published antimicrobial activities. Moreover, a detailed protocol is available for researchers who wish to record NMR data of their newly extracted lipopeptides and compare them to the publicly available reference data. Finally, we invite all researchers to submit NMR data of new CLiPs, such that they can be included in this knowledge base, or to contact us to explore possibilities to provide such data for their compounds.

MATERIALS AND METHODS

Detailed materials and experimental methods can be found in the SI appendix together with additional tables and figures.

Data availability.

All study data are included in the article and SI Appendix. NMR data for (C–H)α spectral fingerprint matching can be found at https://www.rhizoclip.be.
