Literature DB >> 35035787

The natural product biosynthesis potential of the microbiomes of Earth - Bioprospecting for novel anti-microbial agents in the meta-omics era.

Aileen Ute Geers1, Yannick Buijs1, Mikael Lenz Strube1, Lone Gram1, Mikkel Bentzon-Tilia1.   

Abstract

As we stand on the brink of the post-antibiotic era, we are in dire need of novel antimicrobial compounds. Microorganisms produce a wealth of so-called secondary metabolites and have been our most prolific source of antibiotics so far. However, rediscovery of known antibiotics from well-studied cultured microorganisms, and the fact that the majority of microorganisms in the environment are out of reach by means of conventional cultivation techniques, have led to the exploration of the biosynthetic potential in natural microbial communities by novel approaches. In this mini review we discuss how sequence-based analyses have exposed an unprecedented wealth of potential for secondary metabolite production in soil, marine, and host-associated microbiomes, with a focus on the biosynthesis of non-ribosomal peptides and polyketides. Furthermore, we discuss how the complexity of natural microbiomes and the lack of standardized methodology has complicated comparisons across biomes. Yet, as even the most commonly sampled microbiomes hold promise of providing novel classes of natural products, we lastly discuss the development of approaches applied in the translation of the immense biosynthetic diversity of natural microbiomes to the procurement of novel antibiotics.
© 2021 The Author(s).

Entities:  

Keywords:  Antibiotics; Microbiomes; Natural products; Nonribosomal peptides; Polyketides; Secondary metabolites

Year:  2021        PMID: 35035787      PMCID: PMC8733032          DOI: 10.1016/j.csbj.2021.12.024

Source DB:  PubMed          Journal:  Comput Struct Biotechnol J        ISSN: 2001-0370            Impact factor:   7.271


Introduction

The increasing occurrence and spread of antibiotic resistant bacteria pose major threats to public health, and mitigation of the current development represents one of the most significant challenges to modern medicine. In 2019, approximately 2.6 million laboratory-confirmed urinary tract infections caused by Escherichia coli or Klebsiella pneumonia were reported to the Global Antimicrobial Resistance and Use Surveillance System (GLASS), of which 54.4% and 43.1%, respectively, were resistant to the first-line antibiotic co-trimoxazole. Notably, in the 500,000 reported bloodstream infections the occurrence of methicillin-resistant Staphylococcus aureus was 15% in high income countries and 33% in low and middle income countries [1]. While programs aiming at limiting the spread and reducing the emergence of antibiotic resistance are widely applied and under continuous development [2], it is of paramount importance that we can provide a continuous supply of new and effective antibiotics. Decades ago, in the ‘Golden era of antibiotic discovery’, new compound classes were regularly discovered and nearly all antibiotics used today are derived from compounds described in the 1940s to 1960s [3], [4]. The principle approach in the bioprospecting for antibiotics during this period was based on agar overlay inhibition assays developed by Selman Waksman and colleagues [5]. With a primary focus on soil-derived actinobacteria, these initial systematic screening approaches, in combination with bioactivity guided fractionation, were instrumental for the identification of anti-microbial agents. In addition, they highlighted the fact that microorganisms from the soil microbial communities were proficient producers of antimicrobial compounds [6]. However, as re-discovery rates increased, bioprospecting hit the law of diminishing returns [7], and was in large part abandoned due to the consensus belief that the potential of cultured microbes had been exhausted. The search for novel antibiotics turned to chemical synthesis and class modification, where success rates were initially higher [8]. However, the chemical library approach has not been able to provide truly novel compounds in the long run and as the search continues, we have turned to nature yet again in the pursuit for structural novelty and truly novel classes of antibiotics. While the screening of culturable bacteria and filamentous fungi has proven extremely useful in bioprospecting for novel antibiotics historically, cultivation-based discovery of novel secondary metabolites has two major limitations: first, the majority of microorganisms inhabiting natural niches are recalcitrant to cultivation as monocultures under standard manmade laboratory conditions, and although cultivation techniques have greatly improved in recent years, e.g. with the development of the iChip [9] and highly parallelized droplet cultivation [10], the majority of bacteria have ever only been observed through their DNA sequences [11], [12]. Second, even if microorganisms can be successfully cultured in the laboratory, they often only express a fraction of their biosynthesis potential under the conditions provided in standard screening programs [13], [14]. The last two decades have seen a remarkable development in high-throughput sequencing, allowing us to query the unculturable majority of microorganisms. By high-throughput amplicon sequencing approaches targeting the 16S rRNA gene, astounding amounts of information has become available describing microbiome composition and diversity in natural environments. Furthermore, sequencing targeting specific functional genes has shed light on the distribution and diversity of various functional groups of microorganisms in nature. As the sequencing output continues to increase, such targeted approaches are gradually replaced by broader, untargeted metagenomic sequencing of bulk environmental DNA (eDNA), and the subsequent generation of metagenome assembled genomes (MAGs) has allowed us to investigate the metabolism and ecology of unculturable microorganisms even further. Collectively, through the explosion in sequencing data and the in silico bioinformatic tools developed alongside, it has become evident that there is a large biosynthetic capacity in the unculturable majority of microorganisms, as well as a yet un-explored potential in the already cultured. However, the challenge of exploiting this genetic potential remains, and a more holistic and systematic understanding of the drivers of natural product diversity in different environments is needed. Biosynthetic domain-targeted amplicon sequencing and untargeted genome-resolved metagenomic studies have started to map this incredible diversity in environments ranging from arid soils to the ocean floors.

Evaluating the natural product biosynthesis potential of the unculturable majority

The collective microbiome of Earth encompasses a staggering 1030 bacterial cells [11], representing extensive microbial richness, which potentially represents as many as 1012 distinct species [15], [16]. These microorganisms drive the major element cycles on the planet, and are also master chemists capable of producing in excess of 109 distinct small (<1 kDa) bioactive molecules [17]. The natural roles of many bioactive small molecules are not elucidated [18], yet the untapped potential of these molecules as novel anti-microbial compounds is beyond a doubt enormous. Assessing the potential of environmental microbial community members to produce bioactive compounds, and hence potential novel anti-microbial therapeutics, is by no means trivial. Cultivation recalcitrance and the enigmatic notion of “silent” or cryptic biosynthetic gene clusters (BGCs), complicates the holistic study of bioactive secondary metabolites produced by microbiomes in situ. Fortunately, recent advances in ‘-omics’ methodology have enabled reliable, high-throughput and cultivation-independent approaches to study the secondary metabolism of microorganisms in their environment. Some of the largest classes of bioactive secondary metabolites are the polyketides (PKs) and the nonribosomal peptides (NRPs). These compound classes have a wide range of biological activities and pharmacological properties; for example, the NRP actinomycin is a clinical anti-cancer drug and the PK erythromycin is a widely used antibiotic. The modular and iterative nature of the enzymatic biosynthesis of PKs and NRPs involves modules of highly conserved domains responsible for the incorporation of coenzyme A (CoA)- or amino acid-based building blocks in growing chains of compounds. The minimal module for a nonribosomal peptide synthetase (NRPS) consists of an adenylation domain (AD), selecting the respective amino acid, a condensation (C) domain, catalyzing the peptide bond formation, and finally a peptidyl carrier domain, which is carrying the growing chain. A polyketide synthase (PKS) has a very similar structure, in that a minimal module contains an acyltransferase (AT), a keto-synthase (KS) and an acyl carrier protein, however often additional domains such as a keto-reductase can be found. This high degree of conservation has facilitated the design of degenerate primers for high-throughput amplicon sequencing and subsequent clustering into operational biosynthetic units (OBUs), allowing targeted analyses of the biosynthesis potential for these two classes of compounds (Fig. 1) [19], [20]. The primers most widely used to target conserved domains in BGCs encoding PKSs, were originally designed from conserved regions in 20 known modular PKSs and were used to study the geldanamycin BGC in Streptomyces hygroscopicus NRRL 3602 [21]. Similarly, the most widely used NRPS primers were originally designed to be specific for actinomycetes based on six characterized NRPS gene clusters [22]. Additional primers have subsequently been made increasingly degenerate to target a broader range of PKS BGCs from taxonomically different microorganisms [23]. Initially, heterologous expression of such soil-derived PCR amplicons provided early evidence that the hidden potential of the unculturable majority could be tapped, and that it likely represented a trove of structural novelty [24]. Subsequently, within the past decade, targeted short- and long-read amplicon sequencing, as well as untargeted shotgun metagenomics have generated a multitude of sequence data describing the biosynthesis potential of some of nature’s prevalent microbiomes.
Fig. 1

Working principle of biosynthetic gene clusters and their targeted analysis by amplicon sequencing. The genetic organization of an example PKS gene cluster is shown, along with the resulting enzyme complex and the final synthetized product. By targeted PCR amplification on environmental DNA, KS domains can be retrieved, sequenced and clustered into OBUs.

Working principle of biosynthetic gene clusters and their targeted analysis by amplicon sequencing. The genetic organization of an example PKS gene cluster is shown, along with the resulting enzyme complex and the final synthetized product. By targeted PCR amplification on environmental DNA, KS domains can be retrieved, sequenced and clustered into OBUs.

The biosynthesis potential of soil microbiomes

In addition to being readily accessible, soil is characterized by several features of significant importance for microbial secondary metabolite production. Often soil holds a significant amount of microbial biomass comprising approximately 109 bacteria, and 106 fungi per gram [25]. Especially, filamentous soil-dwelling actinomycetes exhibit a very high biosynthesis potential and account for the production of more than two thirds of known antibiotics [26]. Soil microbiomes have therefore rightly served as a starting point surveying the genetic potential for secondary metabolite production. More than 2000 soil samples have been queried using the PKS/NRPS amplicon approach described above [19], [27], [28], [29], [30], [31], [32], [33], and, perhaps not surprisingly, the majority of sequences, i.e. 80–99%, cannot be recovered in available databases. DNA extracted from around 0.25 g of soil is estimated to represent a richness of KS and AD domains in the range of 103-104 OBUs [19], [29], [32]. Thus, natural soil microbiomes hold an immense degree of biosynthetic diversity and consequently, structurally novel chemistry. While it is difficult to ascertain the spatial distribution of microorganisms in soil [34], the microscale heterogeneity is likely in part the reason why exploratory studies have reported a large inter-sample variation, even within the same collection site. Despite this variation, geographical distance, or dispersal limitation, and biome, or ‘habitat’, seem to be the most important drivers of biosynthetic diversity in soil [19], [29], [30], [32], [35]. Hence, widely interspersed terrestrial sampling sites should increase chances of unearthing the widest possible selection of metabolites in our pursuit of novel anti-microbial agents. However, in contrast to this line of thinking, the genetic capacity to synthesize specific biomedically relevant natural product families is widespread. For instance, the conserved domains originating from genes encoding the antibiotics erythromycin and teicoplanin, have been recovered from a single soil sample collected in the US, despite the fact that the compounds were originally isolated from microorganisms collected in the Philippines and India, respectively [29]. While the underlying physicochemical parameters in large parts shape the biomes and the small spatial scale heterogeneity, no single environmental driver of secondary metabolite richness or diversity has to date been identified using PKS/NRPS-targeted short-read amplicon sequencing. Weak correlations between the richness of NRPS and PKS sequences and the concentrations of potassium and calcium, and soil pH, have however been observed [19]. Other studies utilizing long-read sequencing methodology, have reported that biome may in some cases be a stronger driver than geographical distance. For instance, PKS fragments cloned from microbial community DNA extracted from soil and street sediment samples produced putative cosmopolitan KS and AT domain sequences, which consistently occurred in particular habitats separated by hundreds of kilometers [36]. In polar desert soil NRPS and PKS cluster richness, as determined using a third generation long-read sequencing approach, was negatively correlated with environmental parameters such as moisture, carbon, and nitrogen content. Hence, NRPS and PKS biochemistry may be an important physiological trait for survival in arid, nutrient-starved soils [37]. Whether the physicochemical drivers are habitat specific, remains to be determined. While such targeted PCR-based approaches have inherent limitations, genome-resolved metagenomic approaches have corroborated the notion that the untapped secondary metabolite biosynthesis potential of soil microbiomes is immense. Crits-Christoph et al. reported that of 240 NRPS and PKS BGCs recovered from 376 MAGs from 120 grassland soil microbiomes, 220 did not share more than 50% of the genes of previously described clusters, and little sequence identity was observed at the amino acid level when compared to the ‘Minimum Information About a Biosynthetic Gene’ (MIBiG) [38] repository [39]. Furthermore, more than 900 additional BGCs were identified using the ‘Antibiotics and Secondary Metabolite Analysis Shell (AntiSMASH) [40], the majority of which encoded putative bioactive peptides, terpenes, and metabolites of unknown function. Similarly, genome-resolved metagenomics has revealed and compared thousands of diverse and novel BGCs from various grassland and forest soil microbiomes, underlining that habitat and soil depth are likely major drivers of overall BGC richness [41].

The biosynthesis potential of marine microbiomes

More than 70% of Earth’s surface is covered by oceans and marine environments represent the largest biosphere on the planet. Here, a plethora of niches distinct from any terrestrial environment exists. As a consequence, equally diverse microbiomes have evolved, and this diversity manifests itself in the compounds produced [42], [43] and comparisons of chemo-informatic data have shown that over 70% of molecular scaffolds in the Dictionary of Marine Natural Products (DMNP, 2007.6) are unique to marine organisms [44]. Nonetheless, marine microbiomes have not been scrutinized to the same extent as their terrestrial counterparts in the pursuit of novel secondary metabolites. As for soil microbiomes, most bioprospecting efforts in the marine environment have focused on lineages with culturable representatives, such as proteobacteria [43] and actinobacteria [45], resulting in the discovery of multiple bioactive compounds, of which many are antibiotics [46], [47], [48], [49], [50]. A few targeted metagenomic approaches have been used to assess the biosynthesis potential of the unculturable fraction of marine microbiomes, and especially marine sediments seem to hold a significant biosynthesis potential [20], [51], [52]. Coastal sediment microbiomes harbor twice as many KS and AD OBUs (97% sequence similarity cut-off) as compared to the coastal seawater above it, and in addition, 97.4–99.7% of OBUs from both seawater and sediments cannot be recovered from available databases [20]. Furthermore, seawater and sediment from the same location represent significantly different microbiomes in terms of their genetic capacity to synthesize PKs and NRPs indicating that habitat is a key driver of the diversity of the biosynthesis potential of marine microbiomes as well. Untargeted genome-resolved metagenomic analyses of a combined collection of ca. 35,000 MAGs, single amplified genomes (SAGs), and cultivated reference genomes from more than 1000 samples collected across the oceans, have in addition shown a significant differential biogeographic structuring of the oceanic biosynthetic potential [53]. In the oceans, tropical and epipelagic waters are enriched in terpenes as opposed to the less noticeable polar and deep waters, which conversely seem to harbor a more pronounced genetic potential for the biosynthesis of NRPSs and PKSs. Yet, as for soil microbiomes, the underlying drivers of the differentiation between habitats remain to be resolved.

The biosynthesis potential of host-associated microbiomes

While the evolutionary and ecological forces selecting for an elaborate repertoire of bioactive metabolites are varied [39], the microbiomes associated with eukaryotic hosts are likely hotspots for biosynthetically talented bacteria. In terrestrial systems, the plant-associated microbiomes of the phyllo- and rhizospheres represent rich repositories for novel BGCs [54], [55]. However, even within identical plant cultivars, the biosynthetic gene composition is significantly different between individuals, and is additionally affected by plant health, growth stage, and geographical distance [56], [57]. This suggests that plants engage in host-microbe interactions with microorganisms producing distinct collections of diverse bioactive compounds. Similarly, in marine systems, microbiomes associated with invertebrates, e.g. ascidians and sponges, have been widely studied due to their production of exceptionally diverse secondary metabolites [58], [59], [60], [61], [62], [63]. An initial cultivation-independent approach seeking to evaluate the natural product biosynthesis potential of marine sponges showed that of 150 distinct KS OBUs, 127 formed an independent clade that was present in the majority of a diverse collection of marine sponges, suggesting that sponge microbiomes produce ubiquitous yet unexplored polyketides [64]. In contrast to this finding, the exceptionally prolific secondary metabolite-producing microbiomes of the lithistid sponge Theonella swinhoei produce largely non-overlapping metabolite profiles, resulting in distinct host chemotypes [58], [65]. Global genome-resolved metagenomic data has revealed that microbiomes associated with more complex eukaryotic host animals including arthropods, insects, and humans all harbor a broad assortment of BGCs encoding the most prevalent classes of secondary metabolites, i.e. PKs, NRPs, ribosomally synthesized and post-translationally modified peptides (RiPPs), and terpenoids [12], [66], yet in-depth comparisons of the richness and diversity of BGCs across host-associated microbiomes are currently lacking.

Comparisons across biomes and methods

Individual studies evaluating the natural product biosynthesis potential of microbiomes from divergent biomes are limited [12], [20], [67], and our extrapolation and presentation of the current state of the field in the subsections above compile data retrieved from multiple distinct environments using different methodologies, each with inherent limitations. Studies relying on amplicon sequencing targeting conserved domains within PKS and NRPS BGCs, which is the most widely applied high-throughput approach, suffer from the fact that the amplicons only cover parts of the domains of interest. Since PKSs and NRPSs consist of multiple domains, each amplicon in itself does not easily allow for the inference of a specific natural product, nor the taxonomy of the organism it originated from. Furthermore, there are differences in the choice of target domains, sequencing technology, and primer pairs across studies. An in silico PCR analysis using the most common primer pairs on: 1) an existing collection of MAGs from a soil microbiome [39], [41], and 2) all KS and AD domains from the AntiSMASH database [68], shows that some primers fail to amplify the majority, if not all target sequences in these two extensive datasets (Table 1). For KS domains, the highest number of observed hits are generated using the MDPQQRf/HGTGTr primer pair designed by Tae et al. (2006) [61]. However, the in silico generated amplicons vary in length, indicating a potential risk for unspecific amplification. The degKS2F/degKS2R primer pair initially designed by Schirmer et al. (2005) [23], [29], which has been used to generate the majority of KS sequencing reads from soil and marine systems [20], [29], [30], [32], is in contrast quite consistent in amplicon length and generates the second highest number of hits (Table 1). Similarly, the Schirmer et al. (2005) NRPS primers targeting AD domains, exhibit superior performance compared to other primer pairs in silico. However, due to the longer amplicons generated with this set of primers, it has been substituted with the A3F/A7R AD-targeting primers [22] in most high-throughput amplicon sequencing studies as these are based on short-read sequencing technologies [20], [29], [30], [32]. Interestingly, the more recently developed primer pairs designed to target divergent NRPS and PKS clusters from soil microbiomes across latitudes and climate zones [69], [70], perform poorly in silico, both against soil derived MAGs known to carry a broad repertoire of PKS and NRPS BGCs [39], [41], and the collective sequence data in the antiSMASH database. In extension of the difficulty of developing suitable primers, amplicon sequencing approaches are confined to the analysis of a select subset of compound classes, i.e. only NRPS and PKS clusters currently. Furthermore, comparisons with un-targeted metagenomic approaches, which in some cases can generate complete or near-complete BGCs [71], are hampered by the relatively limited information stored in the small sequences generated by amplicon sequencing. This issue can to some extent be resolved using bioinformatic tools such as dom2BGC [72], or the ‘Environmental Surveyor of Natural Product Diversity’ (eSNaPD), which compares the sequenced amplicons to a curated reference dataset of 450 unique gene clusters, resulting in around 10,000 signature domains [73]. However, only a small percentage of detected OBUs can be assigned to corresponding domains in the database and the results are highly dependent on the somewhat arbitrary set similarity cut-off. While amplicon sequencing currently remains the best tool for deep-sequencing of specific biosynthetic domains, un-targeted metagenomic approaches are gradually replacing this approach as the field progresses, hopefully fostering a more standardized approach across studies and microbiomes.
Table 1

List of degenerate primers used in biosynthetic amplicon studies targeting various conserved domains in NRPS and PKS BGCs, and the number of in silico amplicons obtained with the listed primers. The publicly available program ‘in silico PCR’ (https://github.com/egonozer/in_silico_pcr) was used to generate amplicons form a set of 1334 metagenome-assembled genomes of a soil microbiome [41] and all KS and AD domains on the AntiSMASH database (stand: august 2019) allowing for one mismatch and one insertion per primer sequence.

NameTarget domainTheoretical amplicon lengthSequenceReferenceHits in MAGsHits in AntiSMASH database% detected of AntiSMASH domainsIn silico amplicon length
degNRPS-1FAD900–1100AARDSNGGNGSNGSNTAYBNCCSchirmer 2005 [23]89115,255371010 ± 43
degNRPS-4RCKRWANCCNCKNANYTTNAYYTG
MTF2AD1000GCNGGYGGYGCNTAYGTNCCNeilan et al. 1999 [99]3427039171004 ± 51
MTRCCNCGDATYTTNACYTG
A3FAD700GCSTACSYSATSTACACSTCSGGAyuso sacido and Genilloud 2005 [22]178806020709 ± 25
A7RSASGTCVCCSGTSCGGTAS
FAD480CGCGCGCATGTACTGGACNGGNGAYYTAmos 2015 [69]000NA
RGGAGTGGCCGCCCARNYBRAARAA
DnDmFC480ATGCATCACATTRTNYYNGAWoodhouse 2013 [60]16NANA481 ± 1.5
DCCRGTGTTNACRAARAANCCDAT
MDPQQRfKS670RTRGAYCCNCAGCANCGTae 2006 [61]1517783340674 ± 179
HGTGTrVGTNCCNGTGCCRTG
degKS2FKS700GCNATGGAYCCNCARCARMGNVTSchirmer 2005 [23]93433022680 ± 10
degKS2RGTNCCNGTNCCRTGNSCYTCNAC
KSDPQQFKS700MGNGARGCNNWNSMNATGGAYCCNCARCANMGPiel 2002 [100]69351318705 ± 7
KSHGTGRGGRTCNCCNARNSWNGTNCCNGTNCCRTG
KSLFKS700CCSCAGSAGCGCSTSYTSCTSGACourtois 2003 [101]32241912671 ± 11
KSLRGTSCCSGTSCCGTGSGYSTCSA
DKFKS700GTGCCGGTNCCRTGNGYYTCMoffitt 2003 [102]1911256683 ± 51
DKRGCGATGGAYCCNCARMG
MAK1KS320GACACSGCSTGYTCBTCGTCGSavic and Vailjevic 2006 [103]06563317 ± 0
MAK3CCGTTSGACGCRCCGTCCTGGTTSCA
P1KSTSGAYCCSCAGCARCGZhao 2012 [56]78250.1591 ± 427
P2GTSGAYACNGCSTGYTC
K1FKS - methyl-malonyl-CoA transferase1200–1400TSAAGTCSAACATCGGBCAAyuso-Sacido and Genilloud 2005 [22]9NANA1304 ± 47
M6RCGCAGGTTSCSGTACCAGTA
PKS_firmi_FACP340GCNGGNCAYWSNYTNGGNGARTAYAAleti 2017 [57]10NANA340 ± 4
PKS_firmi_RCATRWANCKNSWRTGRAANGCNCC
KSα-FKS alpha (type II pks)613TSGCSTGCTTCGAYGCSATCMetsä-Ketelä 1999 [104]15280.1613 ± 0.3
KSα-RTGGAANCCGCCGAABCCGCT
FKS alpha (type II pks)350GGCAACGCCTACCACATGCANGGNYTAmos 2015 [69]000NA
RGGTCCGCGGGACGTARTCNARRTC
List of degenerate primers used in biosynthetic amplicon studies targeting various conserved domains in NRPS and PKS BGCs, and the number of in silico amplicons obtained with the listed primers. The publicly available program ‘in silico PCR’ (https://github.com/egonozer/in_silico_pcr) was used to generate amplicons form a set of 1334 metagenome-assembled genomes of a soil microbiome [41] and all KS and AD domains on the AntiSMASH database (stand: august 2019) allowing for one mismatch and one insertion per primer sequence.

Tapping into microbial biosynthetic diversity

Considering the diversity of microbial biosynthesis genes in nature, even the most commonly sampled microbiomes hold promise of providing novel classes of natural products. As classical bioprospecting has only been able to capture a miniscule amount of the true biosynthetic potential, alternative measures are needed to procure truly novel classes of antibiotics.

Tying taxonomy to biosynthesis and focusing cultivation

In recent years, several new cultivation and isolation techniques have been developed, some implemented with the aim of discovering novel bioactive compounds. By applying the principle of in situ cultivation, the novel antibiotic teixobactin was identified through the screening of extracts from over 10,000 iChip-derived soil isolates [74]. The promising therapeutic candidate is the first antibiotic to bind lipid II, validating the potential for obtaining structural novelty from yet-to-be cultured microorganisms. Similarly, droplet cultivation techniques eliminate competition for space and nutrients by microencapsulation, allowing otherwise cultivation recalcitrant microorganisms to grow under controlled conditions [10], [75], [76]. While increasing the culturable fraction of environmental microorganisms, such approaches are untargeted and necessitate extensive downstream screening of isolates. Focusing on specific uncultured taxa exhibiting elaborate biosynthetic repertoires could increase the efficiency of such cultivation-based approaches substantially. Combining droplet cultivation with fluorescence in situ hybridization in living cells (liveFISH [77]) introduces the possibility for high-throughput and targeted isolation of single microbial cells belonging to promising taxonomic groups. A prerequisite is, however, that the interconnection between taxonomy and biosynthesis potential can be determined. The generation of complete, or near-complete, MAGs from metagenomes clearly allows for this interconnection and coupling taxonomic affiliations. Using the ribosomal S3 protein-encoding gene in combination with antiSMASH outputs for individual MAGs, have demonstrated that the biosynthesis potential varies greatly across phyla in soil microbiomes, and that actinobacteria, but also the less well-studied chloroflexi phylum, likely harbors an extensive secondary metabolite repertoire [41]. In addition, candidate phyla without cultured representatives, e.g. rokubacteria, eisenbacteria, and dormibacteraeota, may be promising candidates for targeted cultivation efforts. The size and repetitive nature of many BGCs, and the extensive diversity of low-abundant microorganisms, does however represent significant challenges for MAG-based approaches. In contrast, the application of linear correlation analyses on targeted amplicon sequencing data allows for the connection between taxonomy (16S operational taxonomic units; OTUs) and biosynthesis potential (OBUs) of low-abundant taxa [20]. Such analyses have demonstrated that most marine bacterial species have no, or very few associated OBUs, while taxa not previously associated with secondary metabolism may in fact be promising targets for cultivation efforts. Interestingly, in both soil and marine microbiomes, targeted approaches have demonstrated that the majority of the uncharacterized biosynthesis potential is associated with low-abundant taxa [20], [27]. Yet again other approaches have been able to link secondary metabolite biosynthesis potential to taxonomy in microbiomes. Coupling mass-spectrometry data to 16S OTUs have established the connection between staurosporine and the Salinispora genus in ocean sediments [51], and single-cell sequencing approaches have facilitated the assignment of individual PKS and NRPS genes to sponge-associated members of the Poribacteria, the Chloroflexi, and the candidate genus Entotheonella [58], [78]. Lastly, opposed to indirect correlation of 16S amplicon data with functional gene amplicon (or chemistry data), emulsion paired isolation and concatenated PCR (epicPCR) fuses phylogenetic and functional amplicons directly in vitro [79]. This technique, which makes use of the capture of single cells in emulsified droplets, has been used to taxonomically identify sulfate reducing bacteria and bacteria carrying antibiotic resistance genes [80], [81], and may in a similar manner be a promising tool in the determination of which taxa to bring into culture.

Culture-independent acquisition of novel secondary metabolites

Culture-independent discovery of secondary metabolites can be facilitated by the use of metagenomic clone libraries in combination with heterologous expression [82]. In such approaches, eDNA is captured in e.g. a cosmid library and maintained in a host organisms [83]. The library can subsequently be screened either for functionality such as coloration, bioactivity or compound detection by high performance liquid chromatography - mass spectrometry (HPLC-MS) [84]. Two examples of novel antibiotics discovered by such approaches include the small molecule turbomycin (1) (Fig. 2), which was found by investigating melanin-like colored colonies of a large metagenomic soil eDNA clone library [85], and the compound palmitoylputrescine (2) (Fig. 2), which was found by bioactivity screening of a bromeliad tank water eDNA library [86].
Fig. 2

Flow chart for the culture independent discovery of natural products from environmental microbiomes. Starting with the environmental sample and the extracted microbial DNA, KS and AD domains can be profiled to gain insights into the genetic biosynthesis potential. A combination of different pathways can lead then to the isolation and characterization of novel natural products. Dashed arrows indicate methods not experimentally established yet. At the bottom four examples of bioactive natural products are shown and color coded according to the methodology of isolation.

Flow chart for the culture independent discovery of natural products from environmental microbiomes. Starting with the environmental sample and the extracted microbial DNA, KS and AD domains can be profiled to gain insights into the genetic biosynthesis potential. A combination of different pathways can lead then to the isolation and characterization of novel natural products. Dashed arrows indicate methods not experimentally established yet. At the bottom four examples of bioactive natural products are shown and color coded according to the methodology of isolation. Alternatively metagenomic clone libraries can be screened for the presence of conserved DNA sequences such as AD and KS domains by PCR [87]. The latter strategy recently led to the discovery of a new class of calcium-dependent antibiotics, the malacidins (3) (Fig. 2) [28]. These novel secondary metabolites were identified in AD amplicon sequenced soil samples based on homology and phylogenetic analysis of conserved AD domains of calcium-dependent NRP antibiotics. Assisted by the eSNaPD tool [73], a soil community especially rich in AD domains encoding for potentially novel calcium-dependent NRPs was selected. Via cloning of eDNA into a cosmid library, a novel BGC was identified, assembled, cloned and expressed in the heterologous host Streptomyces albus, eventually resulting in the discovery of the calcium-dependent malacidin antibiotics [28]. A similar workflow has been employed in the search for analogs of the anti-cancer compound mithramycin in soil [88]. Metagenomic library screening with degenerate primers targeting KSα genes resulted in the identification of a cluster with structural similarity, but sufficient sequence divergence to represent a novel analog. The BGC was cloned into S. albus for heterologous expression, and after bioassay-guided fractionation and pathway engineering, the novel compound metathramycin (4) (Fig. 2) was structurally elucidated and characterized as a potent anti-cancer compound. Hence, such examples demonstrate the possibility of procuring novel secondary metabolites from eDNA and, given the large untapped biosynthetic diversity, promise to be only the tip of the iceberg. However, multiple technical challenges currently limit the effectiveness of large-scale culture independent drug discovery studies. Heterologous expression in a suitable host is not always straightforward as problems such as codon incompatibility [89], a lack of necessary regulatory elements [90] and toxicity of the encoded compound towards the host can arise [91]. Several of these challenges are being addressed both in heterologous expression and in induction efforts of silent BGCs from, and in, culturable bacteria. Development of methods such as transformation-associated recombination cloning of BGCs [92], promoter-swapping [93], and experiences from the induction by e.g. co-cultivation [94], [95] could aid in overcoming challenges associated with heterologous expression of metagenome-retrieved BGCs. Another challenge is the size limitation of DNA inserts in cosmid libraries, which means that BGCs larger than 40 kb will be distributed over two or more clones. This makes it more time consuming to screen, find and assemble BGCs [28]. Lastly, BGCs only constitute a small proportion of bacterial genomic DNA [82], [96], which necessitates sufficiently deep clone libraries in combination with extensive screening efforts to find them. Therefore, methods aiming at enriching the microbial community for biosynthetic talented bacteria before generating a cosmid or sequencing library may be a promising way to improve the discovery of novel BGCs and their corresponding secondary metabolites. This strategy was applied to extract an enriched fraction of the tunicate microbiome actively expressing bioactive compounds at the time of sampling. Through fluorescent labelling of the carrier proteins involved in NRP and PK biosynthesis and subsequent fluorescence activated cell sorting (FACS), it was possible to enrich the BGC content from 0.77 to 1.17 BGC per Mbp [97]. To what extent the need for clone libraries remains in the future is unknown, however promising full length BGCs identified by shotgun sequencing directly could potentially be extracted from eDNA using targeted PCR and subsequent cloning into a suitable host. This approach has been successful for silent BGCs of cultured representatives; for example, a new phenazine compound from Serratia fonticola has been successfully expressed in E. coli [98]. Lastly, single cell sequencing may prove a promising avenue for the identification of candidate secondary metabolites in the future. To date it has mainly been used to either identify the producer of a known compound, or the BGC responsible for its biosynthesis. One example of this approach is the identification of host-associated onnamide A producers [58], where members of a complex sponge-associated microbiome were isolated and the production of the compound could be tied to members of the Entotheonella genus.

Summary and outlook

Microorganisms have been our most generous source of natural products, and they have provided us with the majority of anti-microbial agents applied in our continuous battle against pathogenic bacteria. Regrettably, as antibiotic resistance is on the rise, the supply of novel antibiotics from established sources is running dry, and we must again turn to nature to mitigate this development. The microbiomes of Earth harbor immense taxonomic diversity and the development within the meta-omics field has enabled the exposure of their genetic potential to produce novel secondary metabolites. Through the past decade, targeted and untargeted metagenomic analyses of soil, marine, and host-associated microbiomes have identified these as rich reservoirs of novel bioactive compounds, including PKs and NRPs. However, a lack of standardization in methodology has complicated comparisons across biomes and current approaches have not yet enabled us to identify key drivers of biosynthetic diversity. Nor are we currently able to predict, which microbiomes are the most promising to mine for truly novel anti-microbial agents. To exploit the biosynthesis potential of natural microbiomes, both cultivation-dependent and cultivation-independent avenues need to be explored. Through the development of targeted cultivation methodology, we may be able to isolate representatives of the rare taxa, which seem to be proficient producers of uncharacterized bioactive compounds in nature. However, it is improbable that we will ever be able to cultivate the full phylogenetic range of microbes found in the environment. Instead, cultivation-independent function based and sequence based metagenomics have been successful in the identification and isolation of novel bioactive compounds. With improved sequencing technologies, more sensitive and precise bioinformatic pipelines, and a comprehensive understanding of the biosynthesis, we will likely be able to extract BGC sequences directly from environmental metagenomes and express them heterologously to find true chemical novelty. Additionally, single cell isolation and sequencing hold a great promise of identifying rich producers, for more focused efforts in cultivation or sequencing. Eventually, we will need to make use of the full potential of environmental microbiomes, using both cultivation and cultivation-independent approaches to procure the novel anti-microbial agents needed to avert the antibiotic resistance crisis.

Funding

This study was supported by The Independent Research Fund Denmark (grant DFF – 8048-00035B) and from the Danish National Research Foundation (grant DNRF137).

CRediT authorship contribution statement

Aileen Ute Geers: Conceptualization, Writing – original draft, Visualization. Yannick Buijs: Writing – original draft, Visualization. Mikael Lenz Strube: Writing – review & editing. Lone Gram: Writing – review & editing. Mikkel Bentzon-Tilia: Conceptualization, Writing – original draft, Writing – review & editing, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  98 in total

1.  Nonribosomal peptide synthesis and toxigenicity of cyanobacteria.

Authors:  B A Neilan; E Dittmann; L Rouhiainen; R A Bass; V Schaub; K Sivonen; T Börner
Journal:  J Bacteriol       Date:  1999-07       Impact factor: 3.490

Review 2.  Challenges of antibacterial discovery.

Authors:  Lynn L Silver
Journal:  Clin Microbiol Rev       Date:  2011-01       Impact factor: 26.132

3.  Mining for Nonribosomal Peptide Synthetase and Polyketide Synthase Genes Revealed a High Level of Diversity in the Sphagnum Bog Metagenome.

Authors:  Christina A Müller; Lisa Oberauner-Wappis; Armin Peyman; Gregory C A Amos; Elizabeth M H Wellington; Gabriele Berg
Journal:  Appl Environ Microbiol       Date:  2015-05-22       Impact factor: 4.792

4.  A new antibiotic kills pathogens without detectable resistance.

Authors:  Losee L Ling; Tanja Schneider; Aaron J Peoples; Amy L Spoering; Ina Engels; Brian P Conlon; Anna Mueller; Till F Schäberle; Dallas E Hughes; Slava Epstein; Michael Jones; Linos Lazarides; Victoria A Steadman; Douglas R Cohen; Cintia R Felix; K Ashley Fetterman; William P Millett; Anthony G Nitti; Ashley M Zullo; Chao Chen; Kim Lewis
Journal:  Nature       Date:  2015-01-07       Impact factor: 49.962

5.  An environmental bacterial taxon with a large and distinct metabolic repertoire.

Authors:  Micheal C Wilson; Tetsushi Mori; Christian Rückert; Agustinus R Uria; Maximilian J Helf; Kentaro Takada; Christine Gernert; Ursula A E Steffens; Nina Heycke; Susanne Schmitt; Christian Rinke; Eric J N Helfrich; Alexander O Brachmann; Cristian Gurgui; Toshiyuki Wakimoto; Matthias Kracht; Max Crüsemann; Ute Hentschel; Ikuro Abe; Shigeki Matsunaga; Jörn Kalinowski; Haruko Takeyama; Jörn Piel
Journal:  Nature       Date:  2014-01-29       Impact factor: 49.962

6.  Direct cloning and refactoring of a silent lipopeptide biosynthetic gene cluster yields the antibiotic taromycin A.

Authors:  Kazuya Yamanaka; Kirk A Reynolds; Roland D Kersten; Katherine S Ryan; David J Gonzalez; Victor Nizet; Pieter C Dorrestein; Bradley S Moore
Journal:  Proc Natl Acad Sci U S A       Date:  2014-01-21       Impact factor: 11.205

7.  Detection of Natural Products and Their Producers in Ocean Sediments.

Authors:  Robert N Tuttle; Alyssa M Demko; Nastassia V Patin; Clifford A Kapono; Mohamed S Donia; Pieter Dorrestein; Paul R Jensen
Journal:  Appl Environ Microbiol       Date:  2019-04-04       Impact factor: 4.792

8.  Functional analysis of environmental DNA-derived type II polyketide synthases reveals structurally diverse secondary metabolites.

Authors:  Zhiyang Feng; Dimitris Kallifidas; Sean F Brady
Journal:  Proc Natl Acad Sci U S A       Date:  2011-07-18       Impact factor: 11.205

9.  A study of iterative type II polyketide synthases, using bacterial genes cloned from soil DNA: a means to access and use genes from uncultured microorganisms.

Authors:  K T Seow; G Meurer; M Gerlitz; E Wendt-Pienkowski; C R Hutchinson; J Davies
Journal:  J Bacteriol       Date:  1997-12       Impact factor: 3.490

10.  Broad-Host-Range Expression Reveals Native and Host Regulatory Elements That Influence Heterologous Antibiotic Production in Gram-Negative Bacteria.

Authors:  Jia Jia Zhang; Xiaoyu Tang; Michelle Zhang; Darlene Nguyen; Bradley S Moore
Journal:  mBio       Date:  2017-09-05       Impact factor: 7.867

View more
  1 in total

1.  Deep-Sea Sediments from the Southern Gulf of Mexico Harbor a Wide Diversity of PKS I Genes.

Authors:  Maikel Fernández-López; Ayixon Sánchez-Reyes; Clara Barcelos; Karla Sidón-Ceseña; Ricardo B Leite; Asunción Lago-Lestón
Journal:  Antibiotics (Basel)       Date:  2022-07-04
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.