Literature DB >> 36147676

Comparative genome analysis of four Leuconostoc strains with a focus on carbohydrate-active enzymes and oligosaccharide utilization pathways.

Anshul Sharma1, Neha Sharma2, Deepshikha Gupta3,4, Hae-Jeung Lee1,5, Young-Seo Park2.   

Abstract

Leuconostoc is mostly found in food, plants, and dairy products. Due to their innate genomic features, such as the presence of carbohydrate-active enzymes, bacteriocins, and plasmids, Leuconostoc spp. have great biotechnological potential. In this study, four strains were isolated and identified as Leuconostoc mesenteroides SG315 (LA), L. citreum SG255 (LB), L. lactis CCK940 (LC), and L. lactis SBC001 (LD). Comparative analysis was performed using their draft genome sequences. Differences among the four strains were analyzed using the average nucleotide identity, dot plot, and multiple alignments of conserved genomic sequences. Functional profiling revealed 2134, 1917, 1751, and 1816 open reading frames; 2023, 1823, 1655, and 1699 protein-coding genes; 60, 57, 83, and 82 RNA-coding genes; and GC content of 37.5 %, 38.8 %, 43.3 %, and 43.2 %, in LA, LB, LC, and LD, respectively. The total number of genes encoding carbohydrate-active enzymes was 76 (LA), 73 (LB), 57 (LC), and 67 (LD). These results indicate that the four strains shared a large number of genes, but their gene content is different. Furthermore, most genes with unknown functions were observed in the prophage regions of the genome. This study also elucidated the oligosaccharide utilization and folate biosynthesis pathways in Leuconostoc spp. Taken together, our findings provide useful information on the genomic diversity of CAZymes in the four Leuconostoc strains and suggest that these species could be used for potent exploitation.
© 2022 The Author(s).

Entities:  

Keywords:  Carbohydrate-active enzymes; Comparative genomics; Leuconostoc citreum; Leuconostoc lactis; Leuconostoc mesenteroides; Oligosaccharide

Year:  2022        PMID: 36147676      PMCID: PMC9465122          DOI: 10.1016/j.csbj.2022.08.032

Source DB:  PubMed          Journal:  Comput Struct Biotechnol J        ISSN: 2001-0370            Impact factor:   6.155


Introduction

Bacteria and their genomes have evolved and adapted over time due to mutations and gene rearrangements. This evolutionary pattern can be observed as nucleotide insertions or deletions in specific parts of the genomes. In addition to genes encoding essential functions, many organisms have accessory genes that allow bacteria to adapt to their surroundings [1]. Several Leuconostoc species, which are lactic acid bacteria (LAB), are commonly found in plants, silage, fermented foods (e.g., sour cream, quark, kefir, kimchi, and sauerkraut), and dairy products (e.g., fresh and raw milk, cream, butter, and cheese) [2], [3], [4]. Leuconostoc is a member of the Leuconostocaceae family, one of the most commonly used LAB groups [3] and is closely related to Lactobacillaceae [5]. Cienkowski isolated Leuconostoc from a slime 46 epidemic in a sugar manufacturing unit in 1878 [6]. Leuconostoc species are Gram-positive, facultative anaerobes, non-motile, catalase-negative, asporogenous, psychrotrophic, or psychrotolerant catenation-shaped bacteria with an average GC content of 37.5 % and an optimum growth at 25–30 °C [7]. According to physiological studies, Leuconostoc species are closely related to heterofermentative lactobacilli [3]. The genus has been designated as “generally recognized as safe” (GRAS), indicating that it is reasonably safe for use in culinary products [3]. Recently, Leuconostoc spp. has recently been identified as one of the most promising probiotic candidates due to its ability to produce bioactive antimicrobial peptides and vitamins, as well as improve human health by modulating immune responses [8], [9], [10]. A texturing Leuconostoc strain was found to produce hetero-polysaccharides rather than homopolysaccharides in milk via a Wzy-dependent pathway for the first time [11]. Various Leuconostoc spp. have produced polysaccharides (fructans and glucans) and oligosaccharides [12], [13], [14], [15]. We previously reported oligosaccharide production and immunostimulatory activity in RAW264.7 of L. lactis CCK940 [16], the draft genome sequence of L. citreum SG255, an oligosaccharide-synthesizing strain [17], and the oligosaccharide production using glucansucrase activity of L. lactis CCK940 [18]. Our recent study elucidated the production, optimization, and structural characterization of L. lactis SBC001 gluco-oligosaccharides, as well as their prebiotic and anti-inflammatory effects [15]. Different Leuconostoc species or strains have a wide range of capabilities, implying that their genomes are highly diverse in content and type. Although numerous studies have been conducted to investigate the diversity and evolution of Leuconostoc [19], [20], [21], many fundamental studies on genomic diversity, species dynamics, and functional properties are still required to fully understand this genus’ potential. Comparative genome analysis of Leuconostoc species allows for comparisons and presentation of data at the species or strain level, as well as functional genetic potential at the molecular level [7]. Recently, a comparative genomic analysis of L. mesenteroides MTCC 10,508 and L. carnosum has been reported [19], [20]. Carbohydrates are the primary energy sources for LAB fermentation and acidification. Before being transported inside the bacterial cell by specific transport systems, all sugars must be converted into monomeric forms by specific enzymes [22], [23]. Oligosaccharides are the main carbohydrates found in fruits, cereals, milk, and the upper intestines of animals. Although the pathways for mono-and disaccharide metabolism are well understood, there is little information on the metabolism of higher oligosaccharides, which are found in many ecosystems. The bioinformatic analysis of Leuconostoc genes involved in carbohydrate metabolism has begun. However, there is still a lack of knowledge about this topic. Draft genome sequencing and characterization of L. mesenteroides SG315 (LA), L. citreum SG255 (LB), and L. lactis CCK940 (LC), and L. lactis SBC001(LD) were performed to advance our understanding of the genome diversity. A comparative genomic analysis was performed to reveal the evolutionary relationships among these species. The number and diversity of carbohydrate-active enzymes (CAZymes) in Leuconostoc spp., as well as their association with oligosaccharide utilization pathways, were investigated. This is the first time genomes of four different strains of Leuconostoc spp. have been compared.

Materials and methods

Bacterial species and culture conditions

In this study, four Leuconostoc strains were used: LA, LB, LC, and LD. In our laboratory, we isolated LA, LB, LC, and LD from baechu-kimchi, young radish kimchi, Chinese cabbage kimchi, and chive, respectively. The strains were identified using standard morphological, biochemical, and physiological tests, as well as 16 s rRNA gene sequencing with the National Center for Biotechnology Information’s nucleotide BLAST tool. Under microaerophilic conditions, a single colony of each strain was inoculated in modified De Man, Rogosa, and Sharpe (MRS) broth at 37 °C for 24 h. Stock cultures were stored at − 80 °C in modified MRS broth mixed with 20 % glycerol.

Genome assembly and annotation

For genome sequencing of the selected strains, library construction was performed using the Illumina TruSeq DNA library prep kit and genome sequencing was performed by Illumina Miseq 300 bp paired-end. For quality control, overrepresented sequences were carefully checked, and low quality reads and adaptors were removed. For the sequencing analysis, a de novo assembly of high quality reads was selected for the analysis. Overall, the whole genome sequencing of the four strains was performed by Sanigen Co., ltd. company in the Republic of Korea. The following accession numbers have already been given to the genome sequences of all four strains in NCBI GenBank: SG315 NZ_JACGML000000000 (LA), SG255 NZ_JACGMK000000000 (LB), CCK940 NZ_NQLF 00,000,000 (LC), and SBC001 NZ_JACGAK000000000 (LD). The ANI Calculator was used to perform average nucleotide identity (ANI) analysis, as described in a previous study [24]. For pathway analysis, the complete genomes were mapped using the Kyoto Encyclopedia of Genes and Genomes (KEGG, https://www.kegg.jp/) and Clusters of Orthologous Groups (COG, https://www.ncbi.nlm.nih.gov/COG) databases with default parameters. PHASTER (http://phast.wishartlab.com/index.html) was used to detect putative prophage insert regions, and MinCED 3 (https://sourceforge.net/projects/minced/) was used to identify clustered regularly interspaced short palindromic repeats (CRISPR). The genomes were compared using Kbase to generate dot plot matrices [25]. The dot plots are visual representations of conserved regions of genomes discovered through pairwise alignment. MAUVE [26] was used to align the genomes with 940 LC serving as the reference. The dot plot matrices and MAUVE alignment predict the homologous region rearrangements such as inversion or translocation events, as well as insertions or deletions in the genome.

Phylogenetic analysis

A phylogenetic tree was constructed based on whole genome data from LA, LB, LC, and LD strains. A whole-genome level comparison was performed amongst 119 Leuconostoc strains, covering 65 L. mesenteroides genomes, 28 L. lactis genomes, and 26 L. citreum genomes. The phylogenetic relationship was obtained using PATRIC with the mafft alignment program [27] (https://mafft.cbrc.jp/alignment/software/) and the following parameters: single copy gene requested, 100; max deletions allowed, 10; max duplications allowed, 10. This resulted in the alignment of 97,704 nucleotides. The tree was constructed and viewed graphically using FigTree (https://tree.bio.ed.ac.uk/software/figtree/).

Genome Island prediction using the IslandViewer web tool

The presence of genomic islands (GIs) was predicted using Island Viewer 4 [28], which is an open-access web tool that uses three prediction algorithms: SIGI-HMM, Island Path-DIMOB, and IslandPick, to calculate the codon usage and dinucleotide bias within a genome and generate a dataset of GIs and non-GIs from phylogenetically related organisms.

Identification of CAZymes

CAZymes (https://www.cazy.org/) in the Leuconostoc species obtained using genome similarity analysis were identified and annotated using dbCAN2 [29]. The dbCAN2 scans the genome with several algorithms, including the hidden Markov model (HMM) profile, which uses HMMdb v7 [30], DIAMOND [31], and Hotpep [32] to improve prediction. Genes annotated by at least two methods were chosen for further analysis. CAZymes are essential in sugar metabolism, particularly in the biosynthesis, binding, and catabolism of carbohydrates. Glycosyltransferase (GT), glycoside hydrolase (GH), polysaccharide lyase (PL), carbohydrate esterase (CE), and auxiliary activity (AA) are the different classes of CAZymes [33].

Results and discussion

General genomic features

The LA genome is made up of a single circular chromosome of 2.1 Mb in length and has a GC content of 37.5 % (Fig. 1A). Meanwhile, the genomes of LB (Fig. 1B), LC (Fig. 1C), and LD (Fig. 1D) each have a single circular chromosome of 1.9 Mb, 1.7 Mb, and 1.8 Mb in length, with a GC content of 38.8 %, 43.3 %, and 43.2 %, respectively. The number of open reading frames (ORFs), protein-coding genes, RNA genes, and pseudogenes annotated in the four Leuconostoc strains is shown in Table 1. Furthermore, in LC and LD, a single CRISPR array was discovered (Table S1). According to Kim et al. [34], the L. citreum EFEL2700 isolate from kimchi has a 1.9 Mb long genome sequence, 1,853 total genes, and 39.1 % GC content (Table 1). L. citreum DmW_111, isolated from wild Drosophila melanogaster, had a 1.8 Mb genome length, with a GC content of 38.9 % and a total gene count of 1853 [35]. According to unpublished data, the genomes of L. lactis 1.2.28 (GCA_018993775.1), L. mesenteroides SRCM 103,356 (GCA_004102585.1) had 1.7 Mb and 2.0 Mb genomes, respectively, with 43.4 % and 37.7 % GC content and 1724 and 1991 total genes (Table 1). Additionally, L. mesenteroides 406 contains a 2.0 Mb chromosome with a GC content of 37.7 % and a total number of genes of 2062 [36], which corroborate our findings.
Fig. 1

Circular genomes of (A) LA, (B) LB, (C) LC, and (D) LD. Circles illustrate the following characteristics from the outside to the center: contigs/chromosome (blue), coding sequences on forward and reverse strands.” Circles 4–8 represent non-CDS, antimicrobial resistance genes (AMR), virulence factors (VF), transporters, and drug targets, respectively. The next ring represented in pink/black denotes the GC content. The innermost circle shows GC-skew. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Table 1

Genome information of the four Leuconostoc species.

AttributeLeuconostoc mesenteroides SG315Leuconostoc citreum SG255Leuconostoc lactis CCK940Leuconostoc lactis SBC001
Genome size (bp)2,099,7411,869,0571,741,5111,835,155
DNA G + C (bp)786,451724,875754,610791,832
G + C content (%)37.4638.7843.3343.15
DNA scaffolds232823
N50 (bp)250,908152,7141,726,6901,758,110
Total genes2134191717511816
Protein coding genes2023182316551699
RNA genes60578382
Pseudo genes51371335
CRISPR arrays0011

LA: Leuconostoc mesenteroides SG315; LB: L. citreum SG255; LC: L. lactis CCK940; and LD: L. lactis SBC001; X= L. citreum EFEL2700; Y= L. lactis 1.2.28; W: L. citreum DmW_111; Z= L. mesenteroides SRCM103356; S= L. mesenteroides 406; UP: unpublished.

Circular genomes of (A) LA, (B) LB, (C) LC, and (D) LD. Circles illustrate the following characteristics from the outside to the center: contigs/chromosome (blue), coding sequences on forward and reverse strands.” Circles 4–8 represent non-CDS, antimicrobial resistance genes (AMR), virulence factors (VF), transporters, and drug targets, respectively. The next ring represented in pink/black denotes the GC content. The innermost circle shows GC-skew. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) Genome information of the four Leuconostoc species. LA: Leuconostoc mesenteroides SG315; LB: L. citreum SG255; LC: L. lactis CCK940; and LD: L. lactis SBC001; X= L. citreum EFEL2700; Y= L. lactis 1.2.28; W: L. citreum DmW_111; Z= L. mesenteroides SRCM103356; S= L. mesenteroides 406; UP: unpublished. According to the RAST annotation, LA, LB, LC, and LD harbored 208, 203, 209, and 206 subsystems, respectively, which were categorized further into 27 subsystem features. The relevant circular views for LA, LB, LC, and LD are shown in Fig. S1A, B, C, and D, respectively. Orthologous genes are groups of homologous genes that descend from a common ancestor. These genes frequently retain their roles as those of their ancestral genes even after evolution [37]. Approximately 20 COG functions were identified in the three Leuconostoc spp., in varying percentages of genes associated with each strain (Table S2). Translation, ribosomal structure, and biogenesis (COG category J) were the most abundant COG function genes (LA, 8.82 %; LB, 9.56 %; LC, 10.51 %; LD, 10.93 %). The genes involved in amino acid transport and metabolism (COG category E) for LA, LB, LC, and LD were found to be 8.63 %, 6.50 %, 7.93 %, and 5.87 %, respectively. Genes involved in carbohydrate transport and metabolism (COG category G) (6.85 %, 6.55 %, 6.34 %, and 7.25 %) and transcription (COG category K) (5.58 %, 5.81 %, 5.34 %, and 7.02 %) were also annotated in LA, LB, LC, and LD, respectively. The functional enrichment of these COGs suggests that they are important for survival and adaptation to harsh environmental conditions, as well as in the utilization of complex carbohydrates for growth. Fig. S2 shows the comparative proteome analysis of the four Leuconostoc strains and illustrates that the majority of proteins share a sequence similarity score of less than 80 %, while a few proteins are 95–98 % identical. A phylogenetic tree was constructed with different genome sequences available in the public domain (Fig. 2). The four strains described in this paper are highlighted in red. The findings demonstrated the categorization of strains into species and revealed differences in strain genetic similarity within each species.
Fig. 2

Phylogenetic comparison of four Leuconostoc strains with genomes available in the public domain. Strains used in this study are highlighted.

Phylogenetic comparison of four Leuconostoc strains with genomes available in the public domain. Strains used in this study are highlighted.

ANI analysis

The ANI value describes the sequence similarity between the conserved regions of two genomes and measures their genetic relatedness [38]. ANI measurements are considered more informative than 16S rRNA gene identification because the former is based on a larger number of genes [39]. Organisms with ANI values of ≥ 95 % are all members of the same species [40]. In this study, the ANI values of he four strains were estimated to investigate their interspecies genetic relatedness. The ANI values ranged from 72.86 % to 98.14 % (Fig. 3). The highest ANI value (98.14 %) was observed between LC and LD which is expected because both strains belong to the same species (Fig. 3). These findings are consistent with previous research, which found that the ANI value between the target L. lactis strain EFEL005 and the type strain L. lactis KCTC 3528 T was greater than 96 % [36]. The species differences could be attributed to host-specific strain diversity and niche adaption.
Fig. 3

Heatmap showing the average nucleotide identity (ANI) among four species of Leuconostoc. Green color represents similarity above 97%. As the similarity % decreases, the colour intensity changes from yellow to red. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Heatmap showing the average nucleotide identity (ANI) among four species of Leuconostoc. Green color represents similarity above 97%. As the similarity % decreases, the colour intensity changes from yellow to red. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

GIs

GIs are genomic regions that reveal evidence of horizontal DNA transfer, particularly in bacteria, and appear to benefit bacterial cells [41], [42]. GIs encode various symbiotic or pathogenic functions that can help an organism adapt to its host or environment [43]. Two steps are involved in GI integration and formation [44]. GIs were predicted in all four Leuconostoc strains using the closest complete (or closed) sequence as a reference (L. mesenteroides J18 for LA, L. citreum KM20 for LB, L. lactis WiKim40 for LC, and LD) (Fig. 4A–D). A total of 5 GIs (LA), 8 GIs (LB), 10 (LC), and 10 (LD) were reported during the genome analysis (Table S3). In LA, GIs ranging from 9.13 kb with nine coding sequences (CDS) (GI-2) to 53.95 kb with 65 CDS (GI-4) were predicted (Table S3). In LB, the GI-2, with a length of 50.40 kb and 72 CDS, was the longest, while the GI-1, with a length of 4.89 kb and only 6 CDS, was discovered to be the shortest (Table S3). LC showed GIs that vary in size, with the shortest GI-3 of 5.41 kb with 5 CDS and the largest GI-6 of 79.64 kb with 96 CDS. For LD, the largest GI was 133.48 kb (GI-3) with 131 CDS, and the shortest GI was 6.95 kb (GI-2) with only six CDS (Table S3).
Fig. 4

Genomic islands (GIs) of (A) LA, (B) LB, (C) LC, and (D) LD predicted using SIGI-HMM (orange) and IslandPath-DIMOB (blue). Red shows the integrated genomic island search results. a) Circular and b) horizontal views of the GIs of the four Leuconostoc species. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Genomic islands (GIs) of (A) LA, (B) LB, (C) LC, and (D) LD predicted using SIGI-HMM (orange) and IslandPath-DIMOB (blue). Red shows the integrated genomic island search results. a) Circular and b) horizontal views of the GIs of the four Leuconostoc species. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Prophage regions in the genomes

In this study, we aimed to explore prophages in the genomes of four strains. Prophages integrated into bacterial genomes confer resistance to superinfection [45]. The PHASTER software was used to identify prophage elements in the genomes of LA, LB, LC, and LD. As shown in Fig. 5 (A–D, upper panel), the number of prophage regions were found to be 3, 2, 4, and 2 for LA, LB, LC, and LD, respectively. The presence of prophage in the genomes suggests genomic instability, which is a reservoir for adaptation. As shown in the lower panel of Fig. 5A, in the three-prophage sections of LA, a total of 70 protein-coding genes were identified, 30 of which were phage proteins, and the remaining 40 were hypothetical in nature.
Fig. 5

Prophage regions detected in (A) LA, (B) LB (C) LC and (D) LD using the PHASTER server. The total number of prophage regions discovered in each strain is displayed in the upper panel, and the number of genes discovered in each prophage region is shown in the lower panel.

Prophage regions detected in (A) LA, (B) LB (C) LC and (D) LD using the PHASTER server. The total number of prophage regions discovered in each strain is displayed in the upper panel, and the number of genes discovered in each prophage region is shown in the lower panel. Similarly, the lower panel of Fig. 5B represents 73 proteins found in two prophage regions identified in LB. Of these, 24 were identified as phage proteins, while the remaining 49 were classified as hypothetical proteins. In Fig. 5C, the lower panel shows 17 proteins out of 47 in LC constituted the prophage area, with the remaining 30 classified as hypothetical proteins. Nine phage proteins were identified among the 27 protein-coding genes in LD, with the remaining proteins were hypothetical proteins. The presence of a large number of hypothetical genes in Leuconostoc spp. indicates the presence of unrecognized phages [45].

Dot-plot and MAUVE analysis

Genomic reshuffling has a substantial impact on bacterial evolution [46]. The pairwise alignment dot-plot pattern showed a higher low-level similarity among Leuconostoc spp. (Fig. 6). The x- and y-axes show the sequence size scales. There were no continuous lines among the selected species.
Fig. 6

Dot plot matrices between (A) LA vs LB, (B) LA vs LC, (C) LA vs LD, (D) LB vs LC, (E) LB vs LD, and (F) LC vs LD. Dot plot matrices were generated using Kbase with default parameters.

Dot plot matrices between (A) LA vs LB, (B) LA vs LC, (C) LA vs LD, (D) LB vs LC, (E) LB vs LD, and (F) LC vs LD. Dot plot matrices were generated using Kbase with default parameters. The genome sequences of the Leuconostoc spp. were aligned using MAUVE [26] to identify the multiple maximal matches and local collinear blocks (LCB). The genome sequence of LC was used as the reference strain. The majority of LCBs were shared by all strains, but the alignments differed, indicating a significant amount of the genetic information in these strains was conserved (Fig. 7). The diagram depicts homologous regions that have been scrambled or inverted as a result of DNA translocation, rearrangement, or recombination. The gaps in the alignment represent missing genome regions Furthermore, chromosomal rearrangements appeared to be common, as the frequency of LCBs with alterations in their relative genomic position was significant, despite their short lengths. A collinear set of homologous (matching) regions between the four Leuconostoc genomes is represented by boxes of the same color. After careful observation, the synchronous rearrangement was observed in LC an LD. This result could be due to their close taxonomic relationship. Notably, the degree o f homology varied between blocks.
Fig. 7

MAUVE alignment of (A) LC with (B) LD, (C) LB, and (D) LA.

MAUVE alignment of (A) LC with (B) LD, (C) LB, and (D) LA.

CAZymes identified in the four Leuconostoc spp.

The total number of genes encoding CAZymes in LA, LB, LC, and LD was 76, 73, 57, and 67, respectively (Table S4). Only those genes that were predicted by at least two of the three algorithms were considered for the downstream analysis. A closer look at the CAZyme families revealed clear differences in the number and distribution of these genes among these four Leuconostoc strains. CAZymes from LA, LB, LC, and LD contained 32, 25, 18, and 22 genes encoding glycoside hydrolases (GHs), and 19, 20, 15, and 17 genes encoding glycosyltransferases (GTs), respectively (Fig. 8). Except for LD, which had only one gene for carbohydrate esterases (CEs), all strains had two CE genes With the exception of LA, each strain possessed a single gene encoding polysaccharide lyases (PL) (Table 2) (Fig. 8). These findings are consistent with those of a recent study on 182 Leuconostoc spp. genomes, which found that the most prevalent identified gene clusters were the GH > GT > CE > CMB > and PL families [47]. Notably, all selected strains encoded more genes for GHs and GTs, which can participate in the metabolism and transport of functional and active substances. The genes were mainly distributed among the 16 GH and 7 GT, and 8 GH and 7 GT families, respectively, in LA and LB. In LC and LD, genes were distributed among 10 GH and 6 GT, and 10 GH and 7 GT gene families, respectively. Notably, in GHs and GTs, GH13 and GT2 were the most dominant in all strains. The enzymes GHs and PLs cleave glycosidic bonds between carbohydrates or carbohydrates and non-carbohydrate moieties. GHs, in particular, hydrolyze glycosidic bonds, whereas PLs use an elimination mechanism to cleave complex carbohydrates [48], [49], [50].
Fig. 8

Number of CAZymes in the four Leuconostoc species. Color codes represent the different bacterial species: red, LA; blue, LB; pink, LC; green, LD; GH, glycoside hydrolase; GT, glycosyltransferase; CBM, carbohydrates-binding module; CE, carbohydrate esterase; PL, polysaccharide lyase. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Table 2

Genes related to carbohydrate-active enzymes in the four Leuconostoc species.

GHGTCECBMPL
SG315321922NI
GH32(14-326), GH13_18(36-375), GH73(288-428), GH13_29(25-371), GH25(58-218), GH73(66-204), GH32(49-359), GH13_31(28-377), GH42(13-379), GH5_44(48-352), GH43_11(10-321), GH8(30-365), GH25(32-220), GH23, GH32(51-353), GH68(175-614), GH68(179-618), GH25(47-227), GH3(35-252), GH65(317-693), GH36(14-719), GH70(332-1127), GH70 (2011-2804), GH65(317-694), GH13_31(29-380), GH13_31(30-381), GH1(9-482), GH13_31(29-382), GH1(1-474), GH1(3-473), GH2(33-918), GH70(292-1108), GH70(370-1167)GT4 (197-346), GT4 (160-302), GT51(104-284), GT32 (19-93), GT28 (189-349), GT2_Glycos_transf_2(108-278), GT2_Glycos_transf_2(6-172), GT2_Glycos_transf_2(6-131), GT2_Glycos_transf_2 (15-147), GT111 (4-216), GT2_Glycos_transf_2 (9-168), GT4 (335-485), GT4 (314-463), GT4 (303-450), GT2_Glycos_transf_2 (58-235), GT8 (3-252), GT51 (70-250), GT4 (177-325), GT2_Glycos_transf_2(13-177)CE9(16-381), CE1(10-253)CBM50, CBM50
SG2552520211
GH73(695-846), GH65(317-693), GH13_31(29-380),GH13_31(30-381), GH73(63-201), GH1(8-490), GH70(349-1145), GH1(2-480), GH1(3-478), GH25(61-221), GH65(308-722), GH13_31(29-380), GH70(2-465), GH70(312-626), GH70(851-1647), GH32(14-326), GH25(32-220), GH23, GH1(3-452), GH13_18(1-191),GH32(22-328), GH73(295-434), GH13_18(37-373),GH70(453-1311), GH13_31(29-382)GT28(188-352), GT51(102-282), GT2_Glycos_transf_2(5-142), GT2_Glycos_transf_2(8-176),GT2_Glycos_transf_2(5-132), GT51(71-249), GT2_Glycos_transf_2(9-173), GT2_Glycos_transf_2(30-166), GT4(214-348), GT2_Glycos_transf_2(14-182), GT2_Glycos_transf_2(12-145), GT111(7-222), GT0, GT2_Glyco_trans_2_3(54-275),GT4(159-301), GT4(197-345), GT8(9-260), GT4(176-325), GT4(303-445), GT2_Glycos_transf_2(57-232)CE9(5-374),CE1(10-257)CBM50PL1_6(145-354)
CCK9401815211
GH32(14-326), GH13_18(37-373), GH2(37-916), GH36(14-718), GH13_29(25-371), GH25(63-223), GH25(34-221), GH73(297-437), GH73(65-203), GH43_11(3-304), GH65(317-693), GH13_31(32-365), GH13_31(28-362), GH73(465-601),GH1(6-485), GH13_31(30-381), GH25(536-722), GH70(345-1141)GT2_Glycos_transf_2(8-119), GT111(7-223),GT2_Glycos_transf_2(58-234)GT2_Glycos_transf_2(57-232),GT4(302-440), GT4(175-324), GT8(11-261),GT4(195-343), GT4(160-299), GT28(188-352)GT51(104-284), GT2_Glycos_transf_2(8-175)GT2_Glycos_transf_2(5-174)GT2_Glycos_transf_2(9-135), GT51(71-250)CE9(10-356), CE1(10-257)CBM34(6-133) + GH13_20(183-487)PL1_6(146-354)
SBC0012217211
CBM34(6-133) + GH13_20(183-487), GH13_31(28-362), GH13_31(32-365), GH65(317-693), GH73(65-203), GH73(297-437), GH70(343-1139), GH43_11(3-304), GH25(63-223), GH13_29(25-371), GH36(14-718), GH2(37-916), GH13_18(37-373), GH32(14-326), GH25(536-722), GH36(14-719), GH13_31(30-381), GH1(6-485), GH13_31(30-378), GH32(37-339), GH36(12-700), GH36(12-711)GT4(39-188), GT51(104-284), GT28(188-352), GT4(160-299), GT4(195-343), GT8(11-261), GT2_Glyco_tranf_2_3(55-290), GT4, GT2_Glycos_transf_2(7-173), GT2_Glycos_transf_2(17-138), GT111(7-223), GT2_Glycos_transf_2(8-119), GT51(71-250), GT4(302-440), GT2_Glycos_transf_2(9-135), GT2_Glycos_transf_2(5-174), GT2_Glycos_transf_2(8-175)CE1(10-257)CE9(10-356)CBM50PL1_6(146-354)

GH, glycoside hydrolases; GT, glycosyltransferases; CE, carbohydrate esterases; CBM, carbohydrate-binding modules; PL, polysaccharide lyases; NI, not identified; SBC001, Leuconostoc lactis SBC001; CCK940, Leuconostoc lactis CCK940, SG255: Leuconostoc citrem SG255, SG315: Leuconostoc mesenteroides SG315.

Number of CAZymes in the four Leuconostoc species. Color codes represent the different bacterial species: red, LA; blue, LB; pink, LC; green, LD; GH, glycoside hydrolase; GT, glycosyltransferase; CBM, carbohydrates-binding module; CE, carbohydrate esterase; PL, polysaccharide lyase. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) Genes related to carbohydrate-active enzymes in the four Leuconostoc species. GH, glycoside hydrolases; GT, glycosyltransferases; CE, carbohydrate esterases; CBM, carbohydrate-binding modules; PL, polysaccharide lyases; NI, not identified; SBC001, Leuconostoc lactis SBC001; CCK940, Leuconostoc lactis CCK940, SG255: Leuconostoc citrem SG255, SG315: Leuconostoc mesenteroides SG315. The HMMER, DIAMOND, and Hotpep algorithms predicted 32, 25, 18, and 22 genes and 19, 20, 15, and 17 genes encoding GHs and GTs in LA, LB, LC, and LD, respectively (Table S4). Some GHs in this study are modular enzymes with catalytic and non-catalytic domains such as carbohydrate-binding modules (CBM) [51]. Furthermore, all strains encoded GH1, GH13 (GH 13–18 in CCK940), GH25, GH32, GH65, GH70, and GH74, whereas GT2, GT4, GT8, GT28, GT51, and GT111 were found in all strains. Major oligosaccharide-degrading enzyme families have been identified as GH32, GH13, and GH2 [52]. In general, GH13 belongs to the α-amylase group, has 4–7 conserved sequence regions and catalytic machinery, and adopts the (α/β) eight-barrel fold of the catalytic domain [53]. A previous study found the GH13 family, which includes amylases, beta-xylosidases, beta-glucosidases, and beta-galactosidase, in the genome of Leuconostoc sp. MTCC 10,508 [19]. Meanwhile, the GH70 family is classified as transglycosylase, which catalyzes the inter- or intramolecular substitution of glycoside molecules at the anomeric location, resulting in new glycoside molecules [54]. Peptidoglycan hydrolase (GH73 family) cleaves glycosidic (β-1,4) linkages between N-acetylglucosaminyl and N-acetylmuramyl residues in bacterial peptidoglycan [43]. GTs participate in glycosidic bond formation by transferring the sugar moiety from the activated sugar donor to the acceptor molecule [49]. GT2 and GT4 (polyspecific families) account for roughly half of the total number of GTs required for cellulose and sucrose synthesis, respectively, among the GT families listed in the CAZy database [44]. Furthermore, the GT2 and GT4 families are responsible for the majority of glycosyltransferases in Leuconostoc spp. [47]. In this study, the GT2 family was prominent among the predicted GTs in the Leuconostoc spp. genomes, with seven, nine, six, and seven genesin the LA, LB, LC, and LD, respectively. GT2 functions include galactosyltransferase, chitin synthase, cellulose synthase, glucosyltransferase, rhamnosyltransferase, and mannosyltransferase [55].

Oligosaccharide utilization pathway in Leuconostoc spp.

Based on the predicted KEGG pathways and BLASTp analysis of genes related to carbohydrate metabolism, the metabolic pathways of the selected strains encoded GH families involved in oligosaccharide utilization routes (Fig. 9). The four strains contained The GH1, GH2, GH13_30, GH13_31, GH13_18, and GH65 families, which were primarily associated with the metabolism of gentiobiose-6-P and cellobiose-6-P (GH1), glucooligosaccharide (lactose) (GH2), maltotriose (GH13_30), isomaltose panose (GH13_31), glucose + fructose (GH13_18), and maltose (GH65). Furthermore, GH42 (glucooligosaccharide), GH36 (melibiose, raffinose, and stachyose), GH43 (xylobiose and xylooligosaccharides), and GH42 (lactitol) were found in LA, LC, and LD but not in LB. Notably, GH30, which may be associated with the metabolism of gentiobiose, was not found in any of the strains studied (Fig. 9). Except for strain LB, all strains contained GH36 (α-galactosidase (EC 3.2.1.22) and β-fructofuranosidase (EC 3.2.1.26), which are mainly used for the utilization of raffinose and other sugars commonly found in plants. A recent study found that enzymes involved in raffinose metabolism were not present in the genomes of 56 LAB, including L. mesenteroides and L. citreum [56]. d-raffinose is a trisaccharide composed of galactose, glucose, and fructose, that is hydrolyzed by galactosidase (EC 3.2.1.22) to d-galactose and sucrose. This enzyme is absent in the human digestive tract [56]. According to Gemma Buron-Moles et al. [56] Leuconostoc spp. do not metabolize glycogen. In this study, comparable results were observed for all Leuconostoc spp. (data not shown). Similarly, with the exception of LB, all strains contained the GH42 and GH43 family (Fig. 9), which includes β-galactosidase, α-l-arabinosidase, and xylanases. Remarkably, both GH42 and GH2 enzymes are annotated as β-galactosidase. GH2 β-galactosidases hydrolyze glucooligosaccharides (GOS), including oligosaccharides from lactose [57]. In L. lactis, this enzyme is encoded by two overlapping genes: lacL (large subunit, 1,878 bp) and lacM (small subunit, 963 bp) [58]. In another comparative study using dairy-isolated species, L. mesenteroides, L. cremoris, and L. pseudomesenteroides, but not L. lactis, harbored lacL and lacM [59]. Notably, GH42 β-galactosidases (lacA) have been reported to have a low activity against lactose [60]. These findings suggest that Leuconostoc spp. can utilize specific oligosaccharides. GH13_30 is also found in all Leuconostoc strains and is classified as α-glucosidase (EC 3.2.1.20), a carbohydrate-hydrolyzing enzyme that hydrolyzes maltotriose by breaking α-1,4-glycosidic bonds [61]. GH13_31 and GH65, annotated as 1,6-α-glucosidase (EC 3.2.1.10, also known as isomaltose and alpha-methyl glucosidases) and maltose phosphorylase (EC 2.4.1.8), respectively, are present in all Leuconostoc spp. Maltose phosphorylase catalyzes the conversion of maltose to glucose and glucose-1-phosphate through phosphorolysis [62]. Previous research found genes encoding maltose phosphorylase in L. mesenteroides, L. lactis, and L. pseudomesenteroides, but not L. cremoris [59]. These results indicate that the maltooligosaccharide utilization locus was present in all selected Leuconostoc spp. Furthermore, GH1, also known as 6-phospho-β-glucosidase (EC 3.2.1.86), was found in all selected strains of this study. This enzyme catalyzes the hydrolytic cleavage of cellobiose 6-phosphate (β-1,4-linked) to yield glucose 6-phosphate and glucose [63]. A recent study described the important role of 6-phospho-β-glucosidases in L. pesudomesenteroides DSM 20,193 in plant-based fermentation [64]. GH42 and β-galactosidase (EC 3.2.1.23) may be involved in the production of galactose and glucitol. GH43 was identified as β-xylosidase (EC 3.2.1.37). Except for LB, all strains contained β-1,3-xylosidase (EC 3.2.1.-), which may use xylooligosaccharides (XOS) [65]. XOSs are a group of oligosaccharides that include xylotetrose, -triose, and -biose, which comprise xylose residues linked by β-1,4 linkages and are found naturally in vegetables fruits, bamboo shoots, milk, and honey [66]. With the exception of strain LB, all strains consumed galactose via the Leloir pathway, indicating the lack of lactose phosphotransferase system activity [67]. The xylosidase enzyme of L. lactis SHO-47 has been proposed to be cytoplasmically localized [68].
Fig. 9

Predicted oligosaccharide utilization pathways of various carbohydrates in Leuconostoc spp. GH, glycoside hydrolases; GOS, glucooligosaccharide; XOS, xylooligosaccharides; Gal, galactose; Glc, glucose; Fru, fructose; Glc-1-P, glucose-1-phosphate; Glc-6-P, glucose-6-phosphate; Xul-5-P, xylulose-5-phosphate.

Predicted oligosaccharide utilization pathways of various carbohydrates in Leuconostoc spp. GH, glycoside hydrolases; GOS, glucooligosaccharide; XOS, xylooligosaccharides; Gal, galactose; Glc, glucose; Fru, fructose; Glc-1-P, glucose-1-phosphate; Glc-6-P, glucose-6-phosphate; Xul-5-P, xylulose-5-phosphate.

Folic acid biosynthesis

In recent years, there has been increased interest in folate, a water-soluble vitamin, in relation to nutrition [69]. Plants, vegetables, fruits, and meats are the best sources of folate. While many studies have shown that most LAB strains are deficient in folate, certain strains can generate natural folate [70]. These microorganisms can be used as folic acid substitutes. In this study, only LC and LD were observed to synthesize folic acid via two metabolic pathways commonly found in LAB (Fig. 10). The first pathway converts guanosine triphosphate (GTP) to 6-hydroxymethyl-7,8-dihydropterin pyrophosphate (DHPPP), while the second converts chorismite to para-aminobenzoic acid (PABA) [71]. The study also observed the presence of enzymes related to the GTP pathway in LC and LD genomes: folA (DHF reductase, EC 1.5.1.3), folB (EC 4.1.2.25), folC (EC 6.3.2.12 dihydrofolate synthase, and folylpolyglutamate synthase), folE (GTP cyclohydrolase I, EC 3.5.4.16), folK (2-amino-4-hydroxy-6-hydroxymethyl-dihydropteridine diphosphokinase, EC 2.7.6.3), and folP (dihydropteroate synthase, EC 2.5.1.15). Furthermore, as illustrated in Fig. 10, folC catalyzes the synthesis of DHF and polyglutamylation in Leuconostoc spp. [72]. Some LAB lack folQ (EC 3.1.3.1/3.6.1.-) which encodes the DHNTPase enzyme responsible for folate biosynthesis; however, they have alternative enzymes that are similar to DHNTPase [73]. The genomes of LC and LD did not contain folQ. In the Shikimate pathway (alternate route), chorismate is converted into 4-amino-4-deoxychorismate by pabA (aminodeoxychorismate synthase, EC 2.6.1.85). Following that, 4-amino-4-deoxychorismate is converted to pyruvate, which is cleaved by pabB (4-amino-4-deoxychorismate lyase, EC 4.1.3.38), yielding para-aminobenzoic acid (PABA), which is then used in folate biosynthesis. However, in LA, this pathway was found to be incomplete, and the xylulokinase-producing gene (aroK EC 2.7.1.17) was absent in the LB strain.
Fig. 10

Overview of the folate biosynthesis pathway in the four Leuconostoc species. Different color codes are used to represent the presence of genes in different Leuconostoc species. Grey boxes represent the absence of a gene. GTP: guanosine triphosphate, PABA: para-aminobenzoic acid, DHPPP: hydroxymethyl-7,8-dihydropterin pyrophosphate, DHF: dihydrofolate, THF: tetrahydrofolate.

Overview of the folate biosynthesis pathway in the four Leuconostoc species. Different color codes are used to represent the presence of genes in different Leuconostoc species. Grey boxes represent the absence of a gene. GTP: guanosine triphosphate, PABA: para-aminobenzoic acid, DHPPP: hydroxymethyl-7,8-dihydropterin pyrophosphate, DHF: dihydrofolate, THF: tetrahydrofolate.

Conclusion

This study analyzed the draft genome sequences and functional attributes of four Leuconostoc strains. Based on ANI, the two different strains of L. lactis have the highest similarity among the four species, and low strain similarity could be due to gene arrangements. Although all of the strains shared a common gene pool, there were differences in their gene contents. Furthermore, many genes with unknown functions have been observed in the prophage region, which can be investigated further in future studies. Functional profiling revealed the presence of various CAZymes among the four strains, correlating with previous research. The different enzymes involved in oligosaccharide metabolism are also highlighted in this study Meanwhile, only two strains of L. lactis showed the complete folic acid synthesis pathway In conclusion, these findings will aid in the understanding of genetic similarity among Leuconostoc strains and can help in the development of new strategies for industrial applications through the production of genetically-modified strains.

CRediT authorship contribution statement

Anshul Sharma: Methodology, Writing – original draft, Validation, Data curation. Neha Sharma: Methodology, Writing – original draft, Validation, Data curation. Deepshikha Gupta: Methodology, Software, Visualization, Formal analysis, Investigation, Writing – review & editing. Hae-Jeung Lee: Conceptualization, Data curation. Young-Seo Park: Conceptualization, Writing – review & editing, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  61 in total

1.  A hierarchical classification of polysaccharide lyases for glycogenomics.

Authors:  Vincent Lombard; Thomas Bernard; Corinne Rancurel; Harry Brumer; Pedro M Coutinho; Bernard Henrissat
Journal:  Biochem J       Date:  2010-12-15       Impact factor: 3.857

2.  Determination of glucansucrase encoding gene in Leuconostoc mesenteroides.

Authors:  Renpeng Du; Xiaoxiao Qiao; Yu Wang; Bo Zhao; Ye Han; Zhijiang Zhou
Journal:  Int J Biol Macromol       Date:  2019-07-04       Impact factor: 6.953

3.  Characterization of lactose utilization and β-galactosidase in Lactobacillus brevis KB290, the hetero-fermentative lactic acid bacterium.

Authors:  Hiroyuki Honda; Nobuhiro Yajima; Tadao Saito
Journal:  Curr Microbiol       Date:  2012-08-31       Impact factor: 2.188

4.  A taxonomic note on the genus Lactobacillus: Description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae.

Authors:  Jinshui Zheng; Stijn Wittouck; Elisa Salvetti; Charles M A P Franz; Hugh M B Harris; Paola Mattarelli; Paul W O'Toole; Bruno Pot; Peter Vandamme; Jens Walter; Koichi Watanabe; Sander Wuyts; Giovanna E Felis; Michael G Gänzle; Sarah Lebeer
Journal:  Int J Syst Evol Microbiol       Date:  2020-04-15       Impact factor: 2.747

5.  Structural and Biochemical Insights into the Peptidoglycan Hydrolase Domain of FlgJ from Salmonella typhimurium.

Authors:  Patryk Zaloba; Ben A Bailey-Elkin; Miriam Derksen; Brian L Mark
Journal:  PLoS One       Date:  2016-02-12       Impact factor: 3.240

6.  Homology to peptide pattern for annotation of carbohydrate-active enzymes and prediction of function.

Authors:  P K Busk; B Pilgaard; M J Lezyk; A S Meyer; L Lange
Journal:  BMC Bioinformatics       Date:  2017-04-12       Impact factor: 3.169

7.  HMMER web server: 2018 update.

Authors:  Simon C Potter; Aurélien Luciani; Sean R Eddy; Youngmi Park; Rodrigo Lopez; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

Review 8.  Fermented Foods as a Dietary Source of Live Organisms.

Authors:  Shannon Rezac; Car Reen Kok; Melanie Heermann; Robert Hutkins
Journal:  Front Microbiol       Date:  2018-08-24       Impact factor: 5.640

9.  Role of Lactic Acid Bacteria Phospho-β-Glucosidases during the Fermentation of Cereal by-Products.

Authors:  Marta Acin-Albiac; Pasquale Filannino; Kashika Arora; Alessio Da Ros; Marco Gobbetti; Raffaella Di Cagno
Journal:  Foods       Date:  2021-01-05

10.  In silico analysis of the α-amylase family GH57: eventual subfamilies reflecting enzyme specificities.

Authors:  Mária Martinovičová; Štefan Janeček
Journal:  3 Biotech       Date:  2018-07-09       Impact factor: 2.406

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.