| Literature DB >> 26412281 |
Chad W Johnston1,2, Michael A Skinnider1,2, Morgan A Wyatt1,2, Xiang Li1,2, Michael R M Ranieri1,2, Lian Yang3, David L Zechel4, Bin Ma3, Nathan A Magarvey1,2.
Abstract
Bacterial natural products are a diverse and valuable group of small molecules, and genome sequencing indicates that the vast majority remain undiscovered. The prediction of natural product structures from biosynthetic assembly lines can facilitate their discovery, but highly automated, accurate, and integrated systems are required to mine the broad spectrum of sequenced bacterial genomes. Here we present a genome-guided natural products discovery tool to automatically predict, combinatorialize and identify polyketides and nonribosomal peptides from biosynthetic assembly lines using LC-MS/MS data of crude extracts in a high-throughput manner. We detail the directed identification and isolation of six genetically predicted polyketides and nonribosomal peptides using our Genome-to-Natural Products platform. This highly automated, user-friendly programme provides a means of realizing the potential of genetically encoded natural products.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26412281 PMCID: PMC4598715 DOI: 10.1038/ncomms9421
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1The GNP.
(a) The automated GNP pipeline processes submitted sequences to identify NRPS and PKS gene clusters and yield predicted structures. These predictions can be combinatorialized and elaborated to account for biosynthetic promiscuity, creating libraries of hypothetical structures that are used to search LC–MS/MS data and automatically reveal the true genetically encoded natural product. (b) A novel nonribosomal peptide biosynthetic gene cluster identified within S. calvus, alongside the combinatorialized GNP-generated structure prediction. (c) LC–MS/MS chromatogram of an S. calvus culture extract with GNP result indicating a localized genetically predicted structure. (d) Chemical structures of the top-scoring GNP-generated structure prediction (left) and the corresponding natural product, WS9326C (right), depicted with corresponding retention times (RTs) and mass to charge ratios (m/z).
Figure 2Identifying novel hybrid nonribosomal depsipeptides from unexplored Variovorax strains.
(a) GNP analysis of a novel NRPS–PKS gene cluster in V. paradoxus S110 provided a prediction that was combinatorialized and used to query corresponding LC–MS/MS data with GNP, revealing the true genetically encoded natural products. Isolation of the indicated compounds yielded novel metabolites vacidobactin A and B, depicted with corresponding retention times (RTs) and mass to charge ratios (m/z). (b) GNP analysis of a novel NRPS–PKS gene cluster in V. paradoxus P4B provided a structure prediction that was combinatorialized and used to query LC–MS/MS data with GNP. Isolation of the indicated compounds yielded novel metabolites variobactin A and B, depicted with corresponding RT and mass to charge ratios (m/z).
Figure 3Automated prediction, detection and structure elucidation of a glycosylated polyketide from N. potens.
(a) A biosynthetic gene cluster containing machinery for deoxysugar and polyketide biosynthesis was identified in the genome of N. potens (DSM 45234). The GNP-generated polyketide and sugar predictions were combinatorialized to yield a library of 42 hypothetical products. (b) GNP analysis of LC–MS/MS data from a N. potens extract revealed four related candidate peaks. The most abundant of the detected hits—predicted polyketide 10 (c)—was isolated for structure elucidation by NMR. (c) Structure of the isolated glycosylated polyketide potensimicin. (d) NOESY NMR spectroscopy of the potensimicin deoxysugar demonstrates it is mycaminose.
Figure 4GNP-facilitated detection and structure elucidation of the cryptic nonribosomal peptide thanamycin.
(a) Biosynthetic gene cluster for thanamycin identified within P. fluorescens DSM11579. (b) Combinatorialization of the GNP-generated structure prediction. (c) GNP prediction-guided discovery chart indicates a series of related thanamycin-like ions from a P. fluorescens extract, including the main ion (1,291.6 m/z), which was found to possess the most structural similarity to predicted structure Thana116 from 120 hypothetical variants, shown as a chemoinformatic tree clustered by chemical similarity. (d) Structure of thanamycin with corresponding retention time (RT) and observed mass to charge ratio (m/z).