| Literature DB >> 33815349 |
Zhong-Zhi Sun1, Bo-Wen Ji1, Ning Zheng1, Meng Wang1, Ye Cao1, Lu Wan1, Yi-Song Li1, Jin-Cheng Rong1, Hai-Lun He2, Xiu-Lan Chen1, Yu-Zhong Zhang3,4, Bin-Bin Xie1.
Abstract
Deconstruction is an essential step of conversion of polysaccharides, and polysaccharide-degrading enzymes play a key role in this process. Although there is recent progress in the identification of these enzymes, the diversity and phylogenetic distribution of these enzymes in marine microorganisms remain largely unknown, hindering our understanding of the ecological roles of marine microorganisms in the ocean carbon cycle. Here, we studied the phylogenetic distribution of nine types of polysaccharide-degrading enzymes in marine bacterial genomes. First, we manually compiled a reference sequence database containing 961 experimentally verified enzymes. With this reference database, we annotated 9,335 enzyme sequences from 2,182 high-quality marine bacterial genomes, revealing extended distribution for six enzymes at the phylum level and for all nine enzymes at lower taxonomic levels. Next, phylogenetic analyses revealed intra-clade diversity in the encoding potentials and phylogenetic conservation of a few enzymes at the genus level. Lastly, our analyses revealed correlations between enzymes, with alginate lyases demonstrating the most extensive correlations with others. Intriguingly, chitinases showed negative correlations with cellulases, alginate lyases, and agarases in a few genera. This result suggested that intra-genus lifestyle differentiation occurred many times in marine bacteria and that the utilization of polysaccharides may act as an important driver in the recent ecological differentiation of a few lineages. This study expanded our knowledge of the phylogenetic distribution of polysaccharide enzymes and provided insights into the ecological differentiation of marine bacteria.Entities:
Keywords: carbohydrate active enzymes; ecological differentiation; genomics; marine bacteria; phylogeny; polysaccharide-degrading enzymes
Year: 2021 PMID: 33815349 PMCID: PMC8012555 DOI: 10.3389/fmicb.2021.658620
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1The environmental and taxonomic sources of sequences in the reference data set (A) and the top matched reference sequences of annotated PDEs (B). The number of enzyme sequences was shown in parentheses.
FIGURE 2Distribution of PDEs at different taxonomic levels (phylum, class, order, family, and genus). Note that genomic annotation greatly expanded the distribution of PDEs. White, PDE reference sequences; gray, annotated PDEs; black, taxa only found in annotated PDEs.
FIGURE 3Phylogenetic distribution of PDEs among marine bacterial genomes. The tree was reconstructed based on 31 core genes using IQ-TREE. The figure was prepared with the help of the iTOL server. Enzymes were shown in outer circles as heatmap based on annotated gene number in each genome. The 12 genera subject to distribution analyses at the genus level were indicated with curves.
FIGURE 4PDE encoding potentials by different taxa at the phylum (A) and genus (B) levels. The number of genomes available in each group was shown in parentheses.
Phylogenetic conservation of enzyme-positive genotypes.
| Agarase | 114 | 36.80% | 1.93 | 0.0235380 | 138 | 30.43% | 2.28 | 0.0319636 |
| Alginate lyase | 524 | 25.80% | 2.97 | 0.0211616 | 640 | 21.72% | 3.40 | 0.0312082 |
| Carrageenase | 56 | 50.00% | 1.68 | 0.0197299 | 64 | 46.88% | 1.60 | 0.0262650 |
| Cellulase | 632 | 33.50% | 2.31 | 0.0226166 | 780 | 25.00% | 2.85 | 0.0369611 |
| Chitinase | 644 | 13.50% | 4.61 | 0.0280873 | 839 | 10.13% | 5.94 | 0.0444206 |
| Chitosanase | 60 | 41.70% | 1.76 | 0.0144015 | 70 | 31.43% | 2.48 | 0.0383783 |
| Fucoidanases | 5 | 100.00% | 1 | 0.0507775 | 6 | 100.00% | 1.00 | 0.0408922 |
| Hyaluronate lyase | 60 | 30.00% | 2.38 | 0.0196562 | 86 | 40.70% | 1.95 | 0.0177127 |
| Ulvan lyase | 26 | 73.10% | 1.18 | 0.0193213 | 30 | 63.33% | 1.25 | 0.0203285 |
FIGURE 5Correlations between enzyme pairs. (A) Pearson correlation coefficients and Jaccard distances between enzyme pairs in all genomes (circle) and in 12 genera (triangle). Symbols were filled when p-value <0.05 for the Pearson correlation coefficients and open otherwise. (B) Correlations between different enzymes found in 12 genera. Blue line, positive correlation; red line, negative correlation. The number of occurrences was indicated with line width (thin, 1; middle, 2; bold, 3).
FIGURE 6Effects of coverage and identity threshold choice on the results. (A) Number of predicted PDEs under different thresholds. (B) Venn diagram of correlated enzyme pairs predicted with different thresholds (main set, identity 30%; additional set 1, identity 25%; additional set 2, identity 50%; coverage 80% for all sets). Enzyme pairs with the Pearson correlation coefficient |r| > 0.4 and p < 0.05 were regarded as correlated. The number of correlated enzyme pairs for each data set was indicated in parentheses beside the set identity.