| Literature DB >> 34795375 |
Marc W Van Goethem1, Andrew R Osborn2, Benjamin P Bowen1, Peter F Andeer1, Tami L Swenson1,3, Alicia Clum2, Robert Riley2, Guifen He2, Maxim Koriabine2, Laura Sandor2, Mi Yan2, Chris G Daum2, Yuko Yoshinaga2, Thulani P Makhalanyane4, Ferran Garcia-Pichel5,6, Axel Visel2, Len A Pennacchio2, Ronan C O'Malley1,2, Trent R Northen7,8.
Abstract
Microbial biosynthetic gene clusters (BGCs) encoding secondary metabolites are thought to impact a plethora of biologically mediated environmental processes, yet their discovery and functional characterization in natural microbiomes remains challenging. Here we describe deep long-read sequencing and assembly of metagenomes from biological soil crusts, a group of soil communities that are rich in BGCs. Taking advantage of the unusually long assemblies produced by this approach, we recovered nearly 3,000 BGCs for analysis, including 712 full-length BGCs. Functional exploration through metatranscriptome analysis of a 3-day wetting experiment uncovered phylum-specific BGC expression upon activation from dormancy, elucidating distinct roles and complex phylogenetic and temporal dynamics in wetting processes. For example, a pronounced increase in BGC transcription occurs at night primarily in cyanobacteria, implicating BGCs in nutrient scavenging roles and niche competition. Taken together, our results demonstrate that long-read metagenomic sequencing combined with metatranscriptomic analysis provides a direct view into the functional dynamics of BGCs in environmental processes and suggests a central role of secondary metabolites in maintaining phylogenetically conserved niches within biocrusts.Entities:
Mesh:
Year: 2021 PMID: 34795375 PMCID: PMC8602731 DOI: 10.1038/s42003-021-02809-4
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Fig. 1Secondary metabolism of biocrust.
a Photos of Biocrusts taken near Canyonlands National Park (Moab, UT) with the biological soil crust inlay showing the characteristic green coloration. Photo credit Trent R. Northen. b Taxonomic composition of the metagenomes based on exact sequence variants (ESVs) of 16 S rRNA genes across sequencing platforms. Relative abundances were calculated after assigning taxonomy against the SILVA reference database. c Left panel shows the number of Biosynthetic Gene Clusters (BGCs) recovered from each assembly, arranged from shortest to longest. Right panel shows the cumulative length of BGCs recovered from each metagenome in Megabases (Mb). d Taxonomic distribution of BGCs in megabase pairs (Mb) at the phylum or class level. e BGCs longer than 5 kb from each major class of secondary metabolism, colored by putative phylum-level assignments.
The number of biosynthetic gene clusters recovered from each assembly or co-assembly are shown with details regarding full-length BGCs, BGCs sequenced previously and which assemblies contributed to the co-assemblies.
| Co-assembly/Assembly | Number of BGCs | Full-length BGCs | No. sequenced Previously | Sequencing Platforms |
|---|---|---|---|---|
| Flye_co-assembly | 300 | 67 | 28 | PacBio Sequel ( |
| Combined_Sequel_Illumina | 459 | 31 | 42 | PacBio Sequel ( |
| Ultimate_Sequel+SequelII | 1191 | 419 | 21 | PacBio Sequel ( |
| pbio-1768.15750 | 16 | 0 | 3 | PacBio RS II |
| pbio-1772.15782 | 5 | 0 | 1 | PacBio RS II |
| pbio-1768.15751 | 0 | 0 | 0 | PacBio RS II |
| m54017_180413_173154 | 24 | 3 | 8 | PacBio Sequel |
| m54017_180414_134655 | 30 | 2 | 7 | PacBio Sequel |
| m54017_180414_220614 | 0 | 0 | 0 | PacBio Sequel |
| m54017_180417_205359 | 44 | 7 | 5 | PacBio Sequel |
| pbio-2210.20021 | 548 | 174 | 9 | PacBio Sequel II |
| 11774.4.218925 | 139 | 3 | 19 | Illumina HiSeq 2500 |
| 12041.5.235284 | 232 | 6 | 32 | Illumina HiSeq 2500 |
Fig. 2Transcription of secondary metabolites.
a Taxonomic composition of the metagenomes based on exact sequence variants (ESVs) of 16S rRNA transcripts during a soil wetting experiment. Relative abundances were calculated after assigning taxonomy against the SILVA reference database. b Core bacterial gene transcription (n = 46 genes including DNA-binding proteins, Large and Small subunit ribosomal proteins) shown in blue compared to secondary metabolite gene transcription (orange). Genes transcribed at all 10 timepoints (rightmost point) are thought to experience constitutive expression. The y-axis indicates the proportion of genes transcribed at each timepoint category. c Two putative rearranged siderophore-producing gene clusters found in the co-assembled metagenomes that showed homology are aligned. The heatmaps indicate the transcription of each gene in the BGC based on statistical tests against the control sample; 0 h at the point of wetting. Heatmap columns are scaled to the size of the mapped gene, and row order (n = 13 rows) indicates progression across the experiment from 0 h (bottom row) to 3 days after wetting (top row). The left-hand column shows day (yellow) and night (purple) categories. The predicted chemical structure of NODE_86 is shown above while the structure of NODE_81 is below the contig. Transcriptional profiles of gene clusters with differentially expressed genes at night are shown with stars.
Fig. 3Phylum specific transcription of secondary metabolites.
a t-SNE visualization of every individual biosynthetic gene identified. The color of the points indicate the phylum assignment whilst shapes indicate the BGC class. b Co-occurrence network based on Pearson correlations (r > 0.8) among entire BGCs (n = 2988) based on average z-scores at each time point. Each node is a BGC within a contig that are colored by phylum and shaped by BGC type. Closely clustered nodes share similar transcriptional profiles. c, Line plot showing 16S rRNA transcript copy number over time shown by black, dotted lines. Average BGC transcription over time shown by the colored, solid lines. Cyanobacteria (green) show a unique night-time upregulation of secondary metabolism. Purple background indicates night-time transcription.