| Literature DB >> 35195510 |
Zaki Saati-Santamaría1,2, Nelly Selem-Mojica3, Ezequiel Peral-Aranega1,2, Raúl Rivas1,2,4, Paula García-Fraile1,2,4.
Abstract
Microbes host a huge variety of biosynthetic gene clusters that produce an immeasurable array of secondary metabolites with many different biological activities such as antimicrobial, anticarcinogenic and antiviral. Despite the complex task of isolating and characterizing novel natural products, microbial genomic strategies can be useful for carrying out these types of studies. However, although genomic-based research on secondary metabolism is on the increase, there is still a lack of reports focusing specifically on the genus Pseudomonas. In this work, we aimed (i) to unveil the main biosynthetic systems related to secondary metabolism in Pseudomonas type strains, (ii) to study the evolutionary processes that drive the diversification of their coding regions and (iii) to select Pseudomonas strains showing promising results in the search for useful natural products. We performed a comparative genomic study on 194 Pseudomonas species, paying special attention to the evolution and distribution of different classes of biosynthetic gene clusters and the coding features of antimicrobial peptides. Using EvoMining, a bioinformatic approach for studying evolutionary processes related to secondary metabolism, we sought to decipher the protein expansion of enzymes related to the lipid metabolism, which may have evolved toward the biosynthesis of novel secondary metabolites in Pseudomonas. The types of metabolites encoded in Pseudomonas type strains were predominantly non-ribosomal peptide synthetases, bacteriocins, N-acetylglutaminylglutamine amides and ß-lactones. Also, the evolution of genes related to secondary metabolites was found to coincide with Pseudomonas species diversification. Interestingly, only a few Pseudomonas species encode polyketide synthases, which are related to the lipid metabolism broadly distributed among bacteria. Thus, our EvoMining-based search may help to discover new types of secondary metabolite gene clusters in which lipid-related enzymes are involved. This work provides information about uncharacterized metabolites produced by Pseudomonas type strains, whose gene clusters have evolved in a species-specific way. Our results provide novel insight into the secondary metabolism of Pseudomonas and will serve as a basis for the prioritization of the isolated strains. This article contains data hosted by Microreact.Entities:
Keywords: BGCs; BiG-SCAPE / CORASON; EvoMining; comparative genomics; pan-genome; secondary metabolites
Mesh:
Substances:
Year: 2022 PMID: 35195510 PMCID: PMC8942027 DOI: 10.1099/mgen.0.000758
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Representation of genome characteristics, BGCs and phylogeny of type strains included in this work. (a) Phylogenetic tree of type strains based on 92 concatenated housekeeping genes. Metadata were drawn with the iTOL programme. The black inner circle represents genome completeness (%); the red circle represents G+C (%) content; the blue circle represents genome length; the outer circle represents BGC types predicted by antiSMASH (colour code reflects each different BGC type). Bar, 0.1 substitution per position. (b) Pie chart summarizing the sum of diverse BGC types within all the genomes of the analyses. The colour code is maintained with the legend at the left. (c) Histogram of the number of type strains (y axis) that have different numbers of BGCs (x axis).
Fig. 2.Sequence similarity network (SSN) produced by BiG-SCAPE and visualized and annotated with Cytoscape. Nodes represents BGCs. BGC types are coloured according to the colour legend. MiBIG clusters are labelled according to the metabolic product of the BGC.
Fig. 3.(a) Single network of the BGC SSN representing an example of a clan of GCFs comprising different BGCs of aryl polyenes (mustard nodes) and aryl polyene hybrid clusters (grey nodes). (b) CORASON phylogenetic tree of a GCF of aryl polyenes included in this clan. Each diagram represents a different BGC. The red labelled BGC denotes the representative BGC in the GCF. (c) Tanglegram of the GCF (left; obtained with the BiGSCAPE/Corason workflow) and phylogeny of strains with representatives within the GCF (right; obtained with UBCG tool). Bootstrap values of the phylogeny tree represents the number of UBCG phylogenetic trees that support each differentiation.
Fig. 4.Examples of trees in which the role in the secondary metabolism of proteins related with the lipid metabolism was investigated. Trees were obtained with the EvoMining workflow and visualized with Microreact. (a) Tree no. 13 built with orthologues of a protein with a carboxyl transferase domain. (b) Tree no. eight built with orthologues of an enzyme annotated as an Acyl-CoA dehydrogenase.