| Literature DB >> 35435454 |
Timo Negri1, Shrikant Mantri1,2,3, Angel Angelov4, Silke Peter4, Günther Muth1, Alessandra S Eustáquio5, Nadine Ziemert6,7,8.
Abstract
Culture-independent metagenomic approaches offer a promising solution to the discovery of therapeutically relevant compounds such as antibiotics by enabling access to the hidden biosynthetic potential of microorganisms. These strategies, however, often entail laborious, multi-step, and time-consuming procedures to recover the biosynthetic gene clusters (BGCs) from soil metagenomes for subsequent heterologous expression. Here, we developed an efficient method we called single Nanopore read cluster mining (SNRCM), which enables the fast recovery of complete BGCs from a soil metagenome using long- and short-read sequencing. A metagenomic fosmid library of 83,700 clones was generated and sequenced using Nanopore as well as Illumina technologies. Hybrid assembled contigs of the sequenced fosmid library were subsequently analyzed to identify BGCs encoding secondary metabolites. Using SNRCM, we aligned the identified BGCs directly to Nanopore long-reads and were able to detect complete BGCs on single fosmids. This enabled us to select for and recover BGCs of interest for subsequent heterologous expression attempts. Additionally, the sequencing data of the fosmid library and its corresponding metagenomic DNA enabled us to assemble and recover a large nonribosomal peptide synthetase (NRPS) BGC from three different fosmids of our library and to directly amplify and recover a complete lasso peptide BGC from the high-quality metagenomic DNA. Overall, the strategies presented here provide a useful tool for accelerating and facilitating the identification and production of potentially interesting bioactive compounds from soil metagenomes. KEY POINTS: • An efficient approach for the recovery of BGCs from soil metagenomes was developed to facilitate natural product discovery. • A fosmid library was constructed from soil metagenomic HMW DNA and sequenced via Illumina and Nanopore. • Nanopore long-reads enabled the direct identification and recovery of complete BGCs on single fosmids.Entities:
Keywords: Fosmid library; Metagenomic DNA; Nanopore sequencing; Natural products; Secondary metabolites; Single Nanopore read cluster mining (SNRCM)
Mesh:
Substances:
Year: 2022 PMID: 35435454 PMCID: PMC9064862 DOI: 10.1007/s00253-022-11917-y
Source DB: PubMed Journal: Appl Microbiol Biotechnol ISSN: 0175-7598 Impact factor: 5.560
Fig. 1Gel electrophoresis of metagenomic HMW DNA on a 1% agarose gel. Lane M, Lambda DNA/HindIII Marker; lane 1, metagenomic HMW DNA isolated from a soil sample of the Schönbuch forest
Fig. 2Workflow for capturing metagenomic BGCs ready for heterologous expression. a Isolation of high quality HMW DNA from soil and subsequent construction of a metagenomic fosmid library consisting of approx. 83,700 clones stored in pools of 2000 clones. b Pooling of the library and isolation of fosmids for subsequent Illumina/Nanopore sequencing. Hybrid assembly of short- and long-reads. c Size filtering of contigs greater than 40 kb and subsequent submission to antiSMASH for BGC detection. Alignment of complete BGCs with Nanopore reads for identification of fosmids harboring a complete BGC. Recovery of positive clones by serial dilution PCR
Identification of putative lasso peptide genes of BGCs 40.1 and 482.1 using Blastx results, gene sizes and antiSMASH annotations. Blastx results, gene sizes, and antiSMASH annotations with high similarities to known B1 genes highlighted in bold
| Gene | Blastx results | Gene size (bp) | antiSMASH annotation | Putative lasso peptide gene |
|---|---|---|---|---|
| Metagenomic lasso peptide BGC 40.1 | ||||
| 8 | Hypothetical protein [ | 975 | - | - |
| 9 | Hypothetical protein [ | 1905 | Asn_synthase | C gene |
| 10 | Hypothetical protein [ | 432 | - | - |
| 11 | Lasso peptide biosynthesis B2 protein [ | 423 | PF13471 | B2 gene |
| 12 | B1 gene | |||
| 13 | Erythromycin biosynthesis sensory transduction protein eryC1 [ | 1116 | DegT_DnrJ_EryC1 | - |
| Metagenomic lasso peptide BGC 482.1 | ||||
| 10.1 | Hypothetical protein DMG37_22385 [ | B1 gene | ||
| 10.2 | ||||
| 11 | Aminoglycoside phosphotransferase family protein [ | 1404 | - | Kinase |
| 12 | Lasso peptide biosynthesis B2 protein [ | 375 | PF13471 | B2 gene |
| 13 | Hypothetical protein [ | 1929 | Asn_synthase | C gene |
| 14 | Hypothetical protein [ | 141 | - | A1 gene |
| 15 | Hypothetical protein [ | 141 | - | A2 gene |
| 16 | Hypothetical protein DMG78_32005 [ | 2118 | - | - |
Fig. 3Sequencing confirmation for lasso peptide BGCs 40.1 and 482.1. Sequencing of lasso peptide BGC 40.1 (a) and 482.1 (b) using specific sequencing primers that generate sequences with overlap. Numbers of the alignment refer to the start and end point of each generated sequence that matches the reference sequence. Gene annotation via bioinformatics analysis as described. Translated nucleotide sequences of genes 14 and 15 of lasso peptide BGC 482.1 (c). Most suitable amino acid candidates for ring formation in bold and red. Amino acids building the putative ring with grey shade. Amino acids similar in size to a commonly found threonine at position -2 relative to the core peptide underlined
Identification of putative lasso peptide genes of BGC 44.1 using Blastx results, gene sizes, and antiSMASH annotations. Blastx results, gene sizes, and antiSMASH annotations with high similarities to known B1 genes highlighted in bold
| Metagenomic lasso peptide BGC 44.1 | ||||
|---|---|---|---|---|
| Gene | Blastx results | Gene size (bp) | antiSMASH annotation | Putative lasso peptide gene |
| 8 | ABC transporter permease [ | 822 | - | D1 gene |
| 9 | ABC transporter ATP-binding protein [ | 810 | ABC transporter ATP-binding protein | D2 gene |
| 10 | B1 gene | |||
| 11 | Lasso peptide biosynthesis B2 protein [ | 471 | PF13471 | B2 gene |
| 12 | Asparagine synthetase B [ | 1863 | Asn_synthase | C gene |
| 13 | Hypothetical protein DMG36_15005 [ | 150 | Predicted lasso peptide | A gene |
Fig. 4Sequencing confirmation for lasso peptide BGC 44.1 cloned into an expression vector. Sequencing of lasso peptide BGC 44.1 using specific sequencing primers that generate sequences with overlap. Numbers of the alignment refer to the start and end point of each generated sequence that matches the reference sequence. Vector derived PBAD promoter determined as starting point of the reference sequence. Gene annotation via bioinformatics analysis as described
Fig. 5Workflow for the assembly of a large NRPS BGC from cluster parts on different fosmids via TAR cloning. a Screening of the fosmid library for clones carrying fosmids with overlapping BGC parts and subsequent isolation of the respective clones. Fosmid isolation and assembly of the cluster parts into a complete cluster via TAR cloning in yeast. b SmiI digest of the assembled NRPS BGC generates two fragments for subsequent Nanopore sequencing. Size filtering of the generated reads and alignment with SmiI digested reference sequence