| Literature DB >> 36147676 |
Anshul Sharma1, Neha Sharma2, Deepshikha Gupta3,4, Hae-Jeung Lee1,5, Young-Seo Park2.
Abstract
Leuconostoc is mostly found in food, plants, and dairy products. Due to their innate genomic features, such as the presence of carbohydrate-active enzymes, bacteriocins, and plasmids, Leuconostoc spp. have great biotechnological potential. In this study, four strains were isolated and identified as Leuconostoc mesenteroides SG315 (LA), L. citreum SG255 (LB), L. lactis CCK940 (LC), and L. lactis SBC001 (LD). Comparative analysis was performed using their draft genome sequences. Differences among the four strains were analyzed using the average nucleotide identity, dot plot, and multiple alignments of conserved genomic sequences. Functional profiling revealed 2134, 1917, 1751, and 1816 open reading frames; 2023, 1823, 1655, and 1699 protein-coding genes; 60, 57, 83, and 82 RNA-coding genes; and GC content of 37.5 %, 38.8 %, 43.3 %, and 43.2 %, in LA, LB, LC, and LD, respectively. The total number of genes encoding carbohydrate-active enzymes was 76 (LA), 73 (LB), 57 (LC), and 67 (LD). These results indicate that the four strains shared a large number of genes, but their gene content is different. Furthermore, most genes with unknown functions were observed in the prophage regions of the genome. This study also elucidated the oligosaccharide utilization and folate biosynthesis pathways in Leuconostoc spp. Taken together, our findings provide useful information on the genomic diversity of CAZymes in the four Leuconostoc strains and suggest that these species could be used for potent exploitation.Entities:
Keywords: Carbohydrate-active enzymes; Comparative genomics; Leuconostoc citreum; Leuconostoc lactis; Leuconostoc mesenteroides; Oligosaccharide
Year: 2022 PMID: 36147676 PMCID: PMC9465122 DOI: 10.1016/j.csbj.2022.08.032
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1Circular genomes of (A) LA, (B) LB, (C) LC, and (D) LD. Circles illustrate the following characteristics from the outside to the center: contigs/chromosome (blue), coding sequences on forward and reverse strands.” Circles 4–8 represent non-CDS, antimicrobial resistance genes (AMR), virulence factors (VF), transporters, and drug targets, respectively. The next ring represented in pink/black denotes the GC content. The innermost circle shows GC-skew. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Genome information of the four Leuconostoc species.
| Genome size (bp) | 2,099,741 | 1,869,057 | 1,741,511 | 1,835,155 |
| DNA G + C (bp) | 786,451 | 724,875 | 754,610 | 791,832 |
| G + C content (%) | 37.46 | 38.78 | 43.33 | 43.15 |
| DNA scaffolds | 23 | 28 | 2 | 3 |
| N50 (bp) | 250,908 | 152,714 | 1,726,690 | 1,758,110 |
| Total genes | 2134 | 1917 | 1751 | 1816 |
| Protein coding genes | 2023 | 1823 | 1655 | 1699 |
| RNA genes | 60 | 57 | 83 | 82 |
| Pseudo genes | 51 | 37 | 13 | 35 |
| CRISPR arrays | 0 | 0 | 1 | 1 |
LA: Leuconostoc mesenteroides SG315; LB: L. citreum SG255; LC: L. lactis CCK940; and LD: L. lactis SBC001; X= L. citreum EFEL2700; Y= L. lactis 1.2.28; W: L. citreum DmW_111; Z= L. mesenteroides SRCM103356; S= L. mesenteroides 406; UP: unpublished.
Fig. 2Phylogenetic comparison of four Leuconostoc strains with genomes available in the public domain. Strains used in this study are highlighted.
Fig. 3Heatmap showing the average nucleotide identity (ANI) among four species of Leuconostoc. Green color represents similarity above 97%. As the similarity % decreases, the colour intensity changes from yellow to red. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Genomic islands (GIs) of (A) LA, (B) LB, (C) LC, and (D) LD predicted using SIGI-HMM (orange) and IslandPath-DIMOB (blue). Red shows the integrated genomic island search results. a) Circular and b) horizontal views of the GIs of the four Leuconostoc species. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 5Prophage regions detected in (A) LA, (B) LB (C) LC and (D) LD using the PHASTER server. The total number of prophage regions discovered in each strain is displayed in the upper panel, and the number of genes discovered in each prophage region is shown in the lower panel.
Fig. 6Dot plot matrices between (A) LA vs LB, (B) LA vs LC, (C) LA vs LD, (D) LB vs LC, (E) LB vs LD, and (F) LC vs LD. Dot plot matrices were generated using Kbase with default parameters.
Fig. 7MAUVE alignment of (A) LC with (B) LD, (C) LB, and (D) LA.
Fig. 8Number of CAZymes in the four Leuconostoc species. Color codes represent the different bacterial species: red, LA; blue, LB; pink, LC; green, LD; GH, glycoside hydrolase; GT, glycosyltransferase; CBM, carbohydrates-binding module; CE, carbohydrate esterase; PL, polysaccharide lyase. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Genes related to carbohydrate-active enzymes in the four Leuconostoc species.
| GH32(14-326), GH13_18(36-375), GH73(288-428), GH13_29(25-371), GH25(58-218), GH73(66-204), GH32(49-359), GH13_31(28-377), GH42(13-379), GH5_44(48-352), GH43_11(10-321), GH8(30-365), GH25(32-220), GH23, GH32(51-353), GH68(175-614), GH68(179-618), GH25(47-227), GH3(35-252), GH65(317-693), GH36(14-719), GH70(332-1127), GH70 (2011-2804), GH65(317-694), GH13_31(29-380), GH13_31(30-381), GH1(9-482), GH13_31(29-382), GH1(1-474), GH1(3-473), GH2(33-918), GH70(292-1108), GH70(370-1167) | GT4 (197-346), GT4 (160-302), GT51(104-284), GT32 (19-93), GT28 (189-349), GT2_Glycos_transf_2(108-278), GT2_Glycos_transf_2(6-172), GT2_Glycos_transf_2(6-131), GT2_Glycos_transf_2 (15-147), GT111 (4-216), GT2_Glycos_transf_2 (9-168), GT4 (335-485), GT4 (314-463), GT4 (303-450), GT2_Glycos_transf_2 (58-235), GT8 (3-252), GT51 (70-250), GT4 (177-325), GT2_Glycos_transf_2(13-177) | CE9 | CBM50, CBM50 | ||
| GH73(695-846), GH65(317-693), GH13_31(29-380), | GT28(188-352), GT51(102-282), GT2_Glycos_transf_2(5-142), GT2_Glycos_transf_2(8-176), | CE9(5-374), | CBM50 | PL1_6(145-354) | |
| GH32(14-326), GH13_18(37-373), GH2(37-916), GH36(14-718), GH13_29(25-371), GH25(63-223), GH25(34-221), GH73(297-437), GH73(65-203), GH43_11(3-304), GH65(317-693), GH13_31(32-365), GH13_31(28-362), GH73(465-601), | GT2_Glycos_transf_2(8-119), GT111(7-223), | CE9(10-356), CE1(10-257) | CBM34(6-133) + GH13_20(183-487) | PL1_6(146-354) | |
| CBM34(6-133) + GH13_20(183-487), GH13_31(28-362), GH13_31(32-365), GH65(317-693), GH73(65-203), GH73(297-437), GH70(343-1139), GH43_11(3-304), GH25(63-223), GH13_29(25-371), GH36(14-718), GH2(37-916), GH13_18(37-373), GH32(14-326), GH25(536-722), GH36(14-719), GH13_31(30-381), GH1(6-485), GH13_31(30-378), GH32(37-339), GH36(12-700), GH36(12-711) | GT4(39-188), GT51(104-284), GT28(188-352), GT4(160-299), GT4(195-343), GT8(11-261), GT2_Glyco_tranf_2_3(55-290), GT4, GT2_Glycos_transf_2(7-173), GT2_Glycos_transf_2(17-138), GT111(7-223), GT2_Glycos_transf_2(8-119), GT51(71-250), GT4(302-440), GT2_Glycos_transf_2(9-135), GT2_Glycos_transf_2(5-174), GT2_Glycos_transf_2(8-175) | CE1(10-257)CE9 | CBM50 | PL1_6(146-354) |
GH, glycoside hydrolases; GT, glycosyltransferases; CE, carbohydrate esterases; CBM, carbohydrate-binding modules; PL, polysaccharide lyases; NI, not identified; SBC001, Leuconostoc lactis SBC001; CCK940, Leuconostoc lactis CCK940, SG255: Leuconostoc citrem SG255, SG315: Leuconostoc mesenteroides SG315.
Fig. 9Predicted oligosaccharide utilization pathways of various carbohydrates in Leuconostoc spp. GH, glycoside hydrolases; GOS, glucooligosaccharide; XOS, xylooligosaccharides; Gal, galactose; Glc, glucose; Fru, fructose; Glc-1-P, glucose-1-phosphate; Glc-6-P, glucose-6-phosphate; Xul-5-P, xylulose-5-phosphate.
Fig. 10Overview of the folate biosynthesis pathway in the four Leuconostoc species. Different color codes are used to represent the presence of genes in different Leuconostoc species. Grey boxes represent the absence of a gene. GTP: guanosine triphosphate, PABA: para-aminobenzoic acid, DHPPP: hydroxymethyl-7,8-dihydropterin pyrophosphate, DHF: dihydrofolate, THF: tetrahydrofolate.