Literature DB >> 27274361

Complete genome sequence of Leuconostoc gelidum subsp. gasicomitatum KG16-1, isolated from vacuum-packaged vegetable sausages.

Margarita Andreevskaya1, Jenni Hultman2, Per Johansson2, Pia Laine1, Lars Paulin1, Petri Auvinen1, Johanna Björkroth2.   

Abstract

Leuconostoc gelidum subsp. gasicomitatum is a predominant lactic acid bacterium (LAB) in spoilage microbial communities of different kinds of modified-atmosphere packaged (MAP) food products. So far, only one genome sequence of a poultry-originating type strain of this bacterium (LMG 18811(T)) has been available. In the current study, we present the completely sequenced and functionally annotated genome of strain KG16-1 isolated from a vegetable-based product. In addition, six other vegetable-associated strains were sequenced to study possible "niche" specificity suggested by recent multilocus sequence typing. The genome of strain KG16-1 consisted of one circular chromosome and three plasmids, which together contained 2,035 CDSs. The chromosome carried at least three prophage regions and one of the plasmids encoded a galactan degradation cluster, which might provide a survival advantage in plant-related environments. The genome comparison with LMG 18811(T) and six other vegetable strains suggests no major differences between the meat- and vegetable-associated strains that would explain their "niche" specificity. Finally, the comparison with the genomes of other leuconostocs highlights the distribution of functionally interesting genes across the L. gelidum strains and the genus Leuconostoc.

Entities:  

Keywords:  Comparative genomic analysis; Food spoilage; Functional genome annotation; Leuconostoc gelidum subsp. gasicomitatum; Modified-atmosphere packaging; Vegetable products

Year:  2016        PMID: 27274361      PMCID: PMC4895993          DOI: 10.1186/s40793-016-0164-8

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

is a psychrotrophic LAB commonly associated with cold-stored nutrient-rich foods of meat and vegetable origins [1-7]. According to a recently published reclassification study, this species comprises three subspecies: , and [8]. was first isolated from spoiled MAP tomato-marinated broiler meat [2]. Later, it was found in spoiled MAP beef and pork [3, 9], and acetic-acid preserved herring [10]. Recently, it was detected in connection with the spoilage of boiled eggs preserved in brine [11], vacuum-packaged vegetable sausages [4] and minimally processed vegetable salads [5]. Depending on the product type, the spoilage is characterized by the formation of gas, slime, sour and buttery off-odors, and discoloration. Together with , it belongs to the predominant microbiota at the end of shelf-life in different kinds of packaged cold-stored food products [5, 12]. Based on the previous studies of the genetic diversity and population structure of , some strains isolated from vegetable-based products were almost not recovered from meat-derived foods [9, 13]. This might suggest “niche-specificity” of the different strains as a consequence of their genetic differences. Alternatively, the absence of strain dissemination between vegetable- and meat-processing chains was proposed as a possible factor accounting for the phenomenon observed [9]. So far, only the complete genome of type strain LMG 18811 isolated from spoiled MAP broiler meat, has been available [14]. In this study we present the complete and annotated genome sequence for the vegetable spoilage-associated strain KG16-1 [4]. In addition, six more vegetable strains of this organism were sequenced and partially assembled. This allowed us to compare the gene repertoires of eight strains and all leuconostocs sequenced to date to identify: i) the genetic determinants putatively accounting for the differences in lifestyle of meat and vegetable strains of the described organism, and ii) genes that are specific for the (subsp. gasicomitatum) species.

Organism information

Classification and features

KG16-1 is a Gram-positive, non-motile, non-sporulating, facultatively anaerobic psychrotrophic LAB, belonging to the phylum as part of the family (Table 1). It was first isolated from spoiled vacuum-packaged vegetable sausages in 2006 in Helsinki, Finland (the strain was initially designated as 16-1 and then renamed into KG16-1) [4]. The sausages consisted mainly of carrot (56 %) with the addition of potato, rapeseed oil, cheese, cream, egg yolk powder and sucrose, and were stored below 8 °C for more than 20 days after cooking and packaging. The spoilage was characterized by the formation of gas and slime. Inoculation experiments showed [4] that strain KG16-1 alone is able to cause spoilage of vacuum-packaged vegetable sausages. The phylogenetic analysis based on the concatenated nucleotide sequences of atpA, pheS and rpoA genes (coding for ATP synthase subunit alpha, phenylalanine--tRNA ligase alpha subunit and DNA-directed RNA polymerase subunit alpha, respectively, and showed to successfully discriminate between species of the genus [15]) clearly shows that strain KG16-1 belongs to the species and is distinct from the subspecies gelidum and aenigmaticum (Fig. 1). According to the API 50CH carbohydrate utilization test (bioMérieux, Marcy l’Etoile, France), this bacterium is capable of fermenting a variety of carbohydrates, including hexoses, pentoses and disaccharides (Table 1). The utilization of D-galactose, N-acetylglucosamine and gentibiose is weak. Peculiarly, unlike the majority of strains, strain KG16-1 does not ferment xylose. The cells are oval cocci, 0.5 to 1 μm in diameter (Fig. 2). Colonies formed on de Man-Rogosa-Sharpe (MRS) medium are small and greyish-white.
Table 1

Classification and general features of Leuconostoc gelidum subsp. gasicomitatum strain KG16-1 according to MIGS recommendations [48]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [49]
Phylum Firmicutes TAS [50, 51]
Class Bacilli TAS [52]
Order Lactobacillales TAS [53]
Family Leuconostocaceae TAS [54]
Genus Leuconostoc TAS [5557]
Species Leuconostoc gelidum TAS [1, 8]
Subspecies Leuconostoc gelidum subsp. gasicomitatum TAS [2, 8]
Strain KG16-1
Gram stainPositiveTAS [2]
Cell shapeCoccusTAS [2]
MotilityNon-motileTAS [2]
SporulationNot reportedNAS
Temperature rangePsychrotrophTAS [2]
Optimum temperature25 °CTAS [2]
pH range; Optimum5 – 8; 6.5TAS [8, 58]
Carbon sourceD-glucose, D-fructose, D-mannose, L-arabinose, D-ribose, methyl D-glucoside, D-cellobiose, D-maltose, D-lactose, D-melibiose, D-raffinose, D-saccharose, D-trehalose, D-turanoseIDA
MIGS-6HabitatVacuum-packaged vegetable sausagesTAS [4]
MIGS-6.3Salinity4 % NaCl (w/v)TAS [8]
MIGS-22Oxygen requirementFacultative anaerobicTAS [59]
MIGS-15Biotic relationshipFree-livingNAS
MIGS-14PathogenicityNon-pathogenNAS
MIGS-4Geographic locationHelsinki, FinlandTAS [4]
MIGS-5Sample collection2006TAS [4]
MIGS-4.1Latitude60.19NAS
MIGS-4.2Longitude24.94NAS
MIGS-4.4AltitudeUnknownNAS

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [60]

Fig. 1

Phylogenetic tree showing the relationship of L. gelidum subsp. gasicomitatum KG16-1 to other Leuconostoc species. The tree was built using T-REX web server [61] based on MUSCLE [62] aligned concatenated nucleotide sequences of atpA, pheS and rpoA genes (Genbank identifiers are listed in Table S1 of Additional file 1). Poorly aligned positions were removed using Gblocks [63] with default parameters and the tree was inferred using RAxML program [64] with GTRCAT substitution model and Bacillus subtilis as an outgroup. Bootstrap analysis was performed with 500 replicates. Type strains are identified with superscripted “T” and the availability of a sequenced genome is indicated with “*”

Fig. 2

Photomicrograph of L. gelidum subsp. gasicomitatum KG16-1. The cells were grown for 48 h on the plate with MRS medium and Gram stained. The image was taken using an optical microscope with magnification 1000x

Classification and general features of Leuconostoc gelidum subsp. gasicomitatum strain KG16-1 according to MIGS recommendations [48] aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [60] Phylogenetic tree showing the relationship of L. gelidum subsp. gasicomitatum KG16-1 to other Leuconostoc species. The tree was built using T-REX web server [61] based on MUSCLE [62] aligned concatenated nucleotide sequences of atpA, pheS and rpoA genes (Genbank identifiers are listed in Table S1 of Additional file 1). Poorly aligned positions were removed using Gblocks [63] with default parameters and the tree was inferred using RAxML program [64] with GTRCAT substitution model and Bacillus subtilis as an outgroup. Bootstrap analysis was performed with 500 replicates. Type strains are identified with superscripted “T” and the availability of a sequenced genome is indicated with “*” Photomicrograph of L. gelidum subsp. gasicomitatum KG16-1. The cells were grown for 48 h on the plate with MRS medium and Gram stained. The image was taken using an optical microscope with magnification 1000x

Genome sequencing information

Genome project history

Strain KG16-1 was chosen for sequencing as a representative of a vegetable-based product spoilage-associated strain of . The manufacturer of the product was a small-size operator using vegetables as the main ingredients. No meat was handled at the plant and the other ingredients were mainly heat treated. Since this was the first wider problem reported affecting non-meat foods, and more than one product of the manufacturer was affected, a genome project was initiated. The project was carried out jointly by the Institute of Biotechnology and Department of Food Hygiene and Environmental Health, University of Helsinki, Finland. The complete genome was sequenced, fully assembled and annotated. The summary of the project information, including database identifiers, is shown in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityComplete
MIGS-28Libraries used454 fragment library (500-600 bp)
MIGS 29Sequencing platforms454 Sequencer with GS Flx chemistry
MIGS 31.2Fold coverage19 ×
MIGS 30AssemblersNewbler 2.0.00.20, Gap4 from Staden package
MIGS 32Gene calling methodProdigal, Glimmer3, tRNAscan-SE, RNAmmer, ARAGORN
Locus TagLEKG
Genbank IDLN890331- LN890334
GenBank Date of ReleaseJanuary 20, 2016
BIOPROJECTPRJEB11303
MIGS 13Source Material IdentifierKG16-1
Project relevanceFood spoilage
Project information

Growth conditions and genomic DNA preparation

For DNA isolation, strain KG16-1 was grown anaerobically (Oxoid, Basingstoke, United Kingdom) in MRS broth at 25 °C overnight. DNA was extracted using a modified method [16] of Pitcher et al. [17], and the genomic DNA was mechanically sheared with a needle. The ratio of absorbance at 260 nm and 280 nm (NanoDrop spectrophotometer, Thermo Scientific, USA) as a measure of DNA purity was assessed to be ~1.8.

Genome sequencing and assembly

Genomic DNA was sequenced using 454 Sequencer with GS Flx chemistry. The 152,753 reads obtained had an average length of 224 bp and were assembled using Newbler 2.0.00.20. In total, 149,580 reads (97.9 %) were assembled, resulting in 106 large (>500 bp) and 12 smaller (>100 bp, but < 500 bp) contigs with N50 contig size being 32,090 bp. The average sequencing coverage was 19×. The Gap4 program from the Staden package [18] was first used to design primers to the near ends (~ 150-200 bp) of the contigs and then to organize the contigs in the correct order based on the PCR products. Gaps were closed by Sanger sequencing with BigDye v3.1 chemistry and primer walking of PCR products. List of primers used can be obtained upon request. The insertion of A nucleotide (genomic position 416,796) within the mucus-binding protein-encoding gene LEKG_0412 was confirmed by linker adaptor-mediated PCR. For this, genomic DNA was digested by EcoRI restriction enzyme and then ligated to synthetic adaptors that provided binding sites for primers used during PCR amplification (5′ GCATTCACACTTAAGTTTCGTGA ′3 and 5′ TGTCGACGTTGTAAAACGACGGCCAGT 3′) and Sanger sequencing (5′ ATTAACCCTCACTAAAGGGA 3′).

Genome annotation

Protein-coding genes were identified by Glimmer3 [19] and Prodigal [20] and their functions were predicted by RAST [21] and PANNZER [22]. The outputs of two programs for the prediction of gene coordinates as well as for gene function prediction were compared and the discrepancies were manually resolved based on the presence of potential ribosomal binding sites, similarity searches against public databases and literature data. To further improve the gene prediction, the GenePRIMP pipeline [23], which detects erroneously predicted translational start sites, broken and missing genes, was applied. To identify putative pseudogenes and possible sequencing errors, frameshift prediction was performed using GeneTack program [24] and similarity searches with proteomes of closely related species using Exonerate [25]. Bacteriocins and prophage regions were predicted with the help of BAGEL2 [26] and PHAST [27] programs, respectively. CRISPRFinder [28] was used to detect CRISPRs. rRNA, tRNA and tmRNA, were predicted with RNAmmer [29], tRNAscan-SE [30] and ARAGORN [31], respectively. Identification of transmembrane helices and signal peptides was performed with TMHMM server v. 2.0 [32] and SignalP 4.1 server [33]. Finally, the assignment of COG functional categories and prediction of Pfam domains were performed by an RPS-BLAST search (e-value threshold of 0.01 was used and the one best hit was taken into account for each gene) against the COG and Pfam databases, respectively. The genome was also checked for the presence of all core COG functions [34].

Genome properties

The complete genome of KG16-1 consisted of one circular chromosome (1,965,841 bp) and three circular plasmids (35,714 bp, 29,494 bp and 19,683 bp) (Table 3). The average GC content of the chromosome was 36.9 %. The numbers of predicted protein-coding genes were 1,944 (including 12 pseudogenes) on the chromosome and 38 (including two pseudogenes), 32 and 21 on the three plasmids, respectively (Table 4). In addition, the chromosome contained four rRNA operons (each having 5S, 16S and 23S rRNAs), 67 tRNA genes and one tmRNA gene. The putative function was assigned to 84 % of genes and COGs were assigned to 1,601 protein-coding genes (78.7 % of the total). The distribution of the protein-coding genes among COG functional categories is summarized in Table 5 and depicted in Fig. 3. All 61 core COGs of the minimal set of essential functions for a prokaryotic organism [34] were present in the genome. The genome harbored three complete prophages (LEKG_0833-0885; LEKG_1279-1342; LEKG_1560-1576) (Fig. 3). However, due to the presence of bacteriophage attachment sites inside the second prophage region, it might consist of two different prophages.
Table 3

Summary of genome: one chromosome and three plasmids

LabelSize (Mb)TopologyINSDC identifier
Chromosome1.97CircularLN890331
Plasmid 10.04CircularLN890332
Plasmid 20.03CircularLN890333
Plasmid 30.02CircularLN890334
Table 4

Genome statistics

AttributeValue% of Total
Genome size (bp)2,050,732100.0
DNA coding (bp)1,829,56089.2
DNA G + C (bp)755,74536.9
DNA scaffolds4
Total genes2,115100.0
Protein coding genes2,03596.2
RNA genes803.8
Pseudo genes140.7
Genes in internal clustersNANA
Genes with function prediction1,77784.0
Genes assigned to COGs160175.7
Genes with Pfam domains168879.8
Genes with signal peptides663.1
Genes with transmembrane helices54325.7
CRISPR repeats00
Table 5

Number of genes associated with general COG functional categories

CodeValue% ageDescription
J1909.3Translation, ribosomal structure and biogenesis
A00.0RNA processing and modification
K1447.1Transcription
L1025.0Replication, recombination and repair
B00.0Chromatin structure and dynamics
D311.5Cell cycle control, cell division, chromosome partitioning
V522.6Defense mechanisms
T562.8Signal transduction mechanisms
M1055.2Cell wall/membrane biogenesis
N140.7Cell motility
U211.0Intracellular trafficking and secretion
O572.8Posttranslational modification, protein turnover, chaperones
C602.9Energy production and conversion
G1778.7Carbohydrate transport and metabolism
E1326.5Amino acid transport and metabolism
F904.4Nucleotide transport and metabolism
H793.9Coenzyme transport and metabolism
I663.2Lipid transport and metabolism
P793.9Inorganic ion transport and metabolism
Q251.2Secondary metabolites biosynthesis, transport and catabolism
R1336.5General function prediction only
S1085.3Function unknown
-43421.3Not in COGs

The total is based on the total number of protein coding genes in the genome

Fig. 3

Genome map of L. gelidum subsp. gasicomitatum KG16-1. Genes are colored according to their COG functional annotations. Moving inwards, the tracks on the chromosome map represent the following features: genes on the forward strand, genes on the reverse strand, rRNAgenes (red), tRNAgenes (blue) prophages (green), GC plot (cyan), and GC skew (purple). Likewise, the tracks on the plasmid maps represent the genes on the forward and reverse strands, GC plot, and GC skew

Summary of genome: one chromosome and three plasmids Genome statistics Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome Genome map of L. gelidum subsp. gasicomitatum KG16-1. Genes are colored according to their COG functional annotations. Moving inwards, the tracks on the chromosome map represent the following features: genes on the forward strand, genes on the reverse strand, rRNAgenes (red), tRNAgenes (blue) prophages (green), GC plot (cyan), and GC skew (purple). Likewise, the tracks on the plasmid maps represent the genes on the forward and reverse strands, GC plot, and GC skew

Insights from the genome sequence

Genome functional characteristics

Based on the predicted functions in the genome, strain KG16-1 had a very similar set of metabolic pathways to those present in strain LMG 18811 and described previously [14]. Briefly, the only central carbohydrate catabolic pathway present in the genome was the phosphoketolase pathway, which classifies this bacterium as obligate heterofermentative LAB. The genome also contained genes for three alternative pyruvate utilization pathways, respiratory electron transport chain and menaquinone biosynthesis. Despite the negative reaction for xylose utilization, the required xylose catabolic genes (xylA and xylB) and putative xylose transporter (xylP, LEKG_1853) were present in the genome and did not contain frameshifts or premature stop codons. However, an amino acid sequence comparison of xylA and xylB between xylose-fermenting (according to API 50CH test) strains (LMG 18811, C120c, KSL4-2 and PL111) and KG16-1 showed that in KG16-1, these genes carried two (V168A, S260G) and four (S108L, L128H, D199N, G319E) amino acid substitutions, respectively, which were not present in other xylose-utilizing strains. In addition, KG16-1 was lacking the ortholog of another xylose transporter xylT (e.g. gene LEGAS_1062 in LMG 18811), which was present in other strains. D-Ala--D-Ala ligase gene ddl (LEKG_0342) contained Phe261 in its active site, which indicates a resistance to vancomycin [35]. The same was found for all other leuconostocs sequenced to date, when sequences of ddl genes were aligned. Unlike the genome of strain LMG 18811, strain KG16-1 contained three plasmids. The first plasmid contained the putative type I galactan catabolic gene cluster (LEKG_1953-1960), similar to the cluster present in ATCC 8293 [36], and heavy metal resistance genes. Type I galactan is a structural polysaccharide comprising pectin, which is one of the major components of plant cell walls. Hence, the ability to degrade this polysaccharide would be beneficial for the growth in a plant environment. The second plasmid harbored a Type II restriction-modification (RM) system and conjugal transfer genes, while the third plasmid carried an RM system (the type is unclear) and heavy-metal resistant genes. In addition to the plasmids, the chromosome harbored at least two RM enzymes of type II (LEKG_0442 and LEKG_0445).

Genomic comparison between strains and other species

The comparison of gene contents (more precisely, their protein translations) was performed for KG16-1 and 40 other leuconostocs (assembly accession numbers are listed in Table S2 of Additional file 1). They comprised 34 sequenced and annotated genomes available at the NCBI database by September 2015, including the complete genome of LMG 18811. In addition, we sequenced six strains of (C120c, C122c, KSL4-2, PB1a, PB1e, PL111) associated with spoilage of other vegetable-based food products (Table 6). The sequencing was done using Illumina HiSeq2500 platform (paired-end library with the read length of 101 + 101 bp), the draft genomes were assembled with Velvet 1.2.08 [37] and annotated by RAST [21]. The important details on the draft genomes, such as genome size, fold coverage, number of contigs and predicted CDSs, are given in Table S3 of Additional file 1.
Table 6

The presence/absence of selected genes in L. gelidum strains

Gene nameLMG 18811T KG16-1C120cC122cKSL4-2PB1aPB1ePL111LMG 22919a LMG 18297T JB7Other Leuconostocs
Lacticin biosynthesis genes-++b ---------
Polyketide biosynthesis cluster+-++++-++b,c ---
Pyruvate oxidase+-+++++++++Present in 15 other leuconostocs
Biofilm formation genes+-+++++++b,c ++ L. mesenteroides KFRI-MG, L. mesenteroides subsp. mesenteroides J18, L. pseudomesenteroides LMG 11482T
Collagen-binding protein+-+++++++b,c ++Present in all four L. pseudomesenteroides strains
Xylose-proton symporter+-+++++++b,c ++Present in 15 other leuconostocs
Mucus-binding protein++b +++++++b,c ---
Accessory Sec system+++++++++b,c -+ L. fallax LMG 18975a,b, L. pseudomesenteroides LMG 11482 T, L. kimchii C2 (partially)
Fused glutamate racemase/NTP pyrophosphatase+++++++++b,c ++Encoded by two separate genes
Source of isolationBroiler meat [2]Vegetable sausages [4]Vegetable salad [9]Vegetable salad [13]Fish/carrots [10]Carrot [10]Carrot [13]Carrot [10]Kimchi [65]Kimchi [7]Kimchi [6]

Strains LMG 18811T, KG16-1, C120c, C122c, KSL4-2, PB1a, PB1e, PL111 and LMG 22919 belong to L. gelidum subsp. gasicomitatum; LMG 18297 T and JB7 – to L. gelidum subsp. gelidum. Strains with complete genomes are LMG 18811T, KG16-1 and JB7; others have draft genomes. Gene names and locus_tags (either from KG16-1 or LMG 18811 T): lacticin biosynthesis genes lctA (LEKG_0458) and lctMT (LEKG_0461-0462); polyketide biosynthesis cluster (LEGAS_1827-1830); pyruvate oxidase poxB (LEGAS_1053); biofilm formation genes icaB and icaA (LEGAS_1065, LEGAS_1067); collagen-binding protein cna (LEGAS_1063); xylose-proton symporter xylT (LEGAS_1062); mucus-binding protein (LEGAS_0414); accessory Sec system genes secY2, asp1, asp2, asp3, secA2, nss, gtfA, gtfB, asp4 (LEKG_0540-0548); fused glutamate racemase/NTP pyrophosphatase (LEKG_0672)

aPhylogenetic analysis (Figure S1 of Additional file 1) indicates that the genome, assigned to L. inhae LMG 22919, actually belongs to L. gelidum subsp. gasicomitatum species

bGene is missing from the genome annotation

cContains frameshift(s)

The presence/absence of selected genes in L. gelidum strains Strains LMG 18811T, KG16-1, C120c, C122c, KSL4-2, PB1a, PB1e, PL111 and LMG 22919 belong to L. gelidum subsp. gasicomitatum; LMG 18297 T and JB7 – to L. gelidum subsp. gelidum. Strains with complete genomes are LMG 18811T, KG16-1 and JB7; others have draft genomes. Gene names and locus_tags (either from KG16-1 or LMG 18811 T): lacticin biosynthesis genes lctA (LEKG_0458) and lctMT (LEKG_0461-0462); polyketide biosynthesis cluster (LEGAS_1827-1830); pyruvate oxidase poxB (LEGAS_1053); biofilm formation genes icaB and icaA (LEGAS_1065, LEGAS_1067); collagen-binding protein cna (LEGAS_1063); xylose-proton symporter xylT (LEGAS_1062); mucus-binding protein (LEGAS_0414); accessory Sec system genes secY2, asp1, asp2, asp3, secA2, nss, gtfA, gtfB, asp4 (LEKG_0540-0548); fused glutamate racemase/NTP pyrophosphatase (LEKG_0672) aPhylogenetic analysis (Figure S1 of Additional file 1) indicates that the genome, assigned to L. inhae LMG 22919, actually belongs to L. gelidum subsp. gasicomitatum species bGene is missing from the genome annotation cContains frameshift(s) Overall, the genome set for comparative analysis contained eight genomes (two complete and six draft). Ortholog prediction and subsequent analysis, including pangenome matrix (Additional file 2) construction and identification of group-specific genes, were performed using GET_HOMOLOGUES software package [38] with OrthoMCL clustering algorithm [39] and default parameters (minimum coverage in BLAST pairwise alignments 75 %, maximum e-value 0.00001), except for the minimum sequence identity, which was set to 30 %. As a result 6,248 orthologous groups were predicted (including singletons) with 406 clusters present in all genomes (so-called core genome), and 983 present in at least 95 % (38) of the genomes (Additional file 2). The last number represents the soft core of the genus, which allowed to account for the missing annotations in draft genomes [40]. The soft core included 52 clusters with unknown function. Based on the pangenome matrix of the presence/absence of the genes in the genomes, a pangenome tree was constructed (Fig. 4). Generally, genomes belonging to the same species clustered together, except for the genomes of 1301_LGAS and 1300_LCIT obtained during the same study of clinical isolates from the hospital intensive care unit [41], where species names were assigned to the sequenced genomes based on the best matching genome from NCBI database. The first genome clustered together with strains, while the second genome fell into the same branch with strains. The phylogenetic analysis (Figure S1 of Additional file 1) placed these genomes into the same branches as in the pangenome tree, thereby confirming that the first genome, formerly assigned to (1301_LGAS), actually belongs to , while the second genome, assigned to (1300_LCIT), is a member of the group. In addition, the phylogenetic analysis (Figure S1 of Additional file 1) showed that the genome assigned to LMG 22919 (= KCTC 3774) appears to be of strain. It is notable that none of the previously published gene sequences for LMG 22919 (= KCTC 3774) (16S rRNA [AF439560]; recN [GenBank:AM698028]; recA [GenBank:JF261010]; atpA [GenBank:AM711190]; pheS [GenBank:AM711167]; rpoA [GenBank:AM711310]) mapped 100 % to the genome assigned to the same strain [GenBank:GCF_000166735.2]. Instead, the nucleotide sequences of these genes extracted from this genome were identical to those of 18811T [GenBank: GCA_000196855.1]. However, the genome assigned to LMG 22919 lacked five genomic regions (including two prophages) that are present in 18811T genome, but absent in all or some other strains (Fig. 5a). This might indicate that the considered genome [GenBank:GCF_000166735.2] belongs to the strain, which is very close to strain 18811T, but not identical. The clustering of the genome assigned to LMG 22919 together with on the pangenome tree (Fig. 4) was, assumingly, the result of numerous frameshifts that are present in this genome (most probably due to the sequencing errors), as well as the high genome fragmentation (893 contigs), which both caused many genes to be missing or truncated in the genome annotation. Although phylogenetically was closer to and (Fig. 1), in terms of gene content it seemed to be more similar to (Fig. 4). Interestingly, there was no clear separation between the meat strain and vegetable strains of based on gene content. On the contrary, five vegetable strains clustered together with the meat strain, while the three remaining vegetable strains formed a separate branch.
Fig. 4

Pangenome tree of the Leuconostoc genus, constructed based on information on presence/absence of orthologs. The tree was inferred using PARS program from the PHYLIP package [66] and visualized by the Tree viewer from T-REX web server [61]. *Phylogenetic analysis indicates that the genome assigned to L. inhae actually belongs to L. gelidum subsp. gasicomitatum. The clustering of this genome together with the L. fallax LMG 18975T genome was most likely caused by the numerous sequencing errors in this genome, as well as high genome fragmentation

Fig. 5

Genome comparisons between L. gelidum strains and other leuconostocs. The comparison was done based on the BLASTN alignment and visualized using BRIG [67]. The genomes of L. gelidum subsp. gasicomitatum LMG 18811T (a) and KG16-1 (b) were used as references and are represented by the innermost black circles. L. gelidum subsp. gasicomitatum strains include LMG 18811T, KG16-1, C120c, C122c, KSL4-2, PB1a, PB1e, PL111, LMG 22919 (*genome erroneously assigned to L. inhae LMG 22919 T) and L. gelidum subsp. gelidum strains include LMG 18297T and JB7. One genome per Leuconostoc species (other than L. gelidum) was chosen for the comparisons. The outmost tracks show the location of prophage regions (designated as Ph) and other functionally interesting genes (designations used: MBP - mucus-binding protein LEGAS_0414; LBC - lacticin biosynthesis cluster; AccSecSyst – accessory Sec system; icaAB - biofilm formation genes; cna - collagen-binding protein; xylT - xylose-proton symporter; poxB - pyruvate oxidase)

Pangenome tree of the Leuconostoc genus, constructed based on information on presence/absence of orthologs. The tree was inferred using PARS program from the PHYLIP package [66] and visualized by the Tree viewer from T-REX web server [61]. *Phylogenetic analysis indicates that the genome assigned to L. inhae actually belongs to L. gelidum subsp. gasicomitatum. The clustering of this genome together with the L. fallax LMG 18975T genome was most likely caused by the numerous sequencing errors in this genome, as well as high genome fragmentation Genome comparisons between L. gelidum strains and other leuconostocs. The comparison was done based on the BLASTN alignment and visualized using BRIG [67]. The genomes of L. gelidum subsp. gasicomitatum LMG 18811T (a) and KG16-1 (b) were used as references and are represented by the innermost black circles. L. gelidum subsp. gasicomitatum strains include LMG 18811T, KG16-1, C120c, C122c, KSL4-2, PB1a, PB1e, PL111, LMG 22919 (*genome erroneously assigned to L. inhae LMG 22919 T) and L. gelidum subsp. gelidum strains include LMG 18297T and JB7. One genome per Leuconostoc species (other than L. gelidum) was chosen for the comparisons. The outmost tracks show the location of prophage regions (designated as Ph) and other functionally interesting genes (designations used: MBP - mucus-binding protein LEGAS_0414; LBC - lacticin biosynthesis cluster; AccSecSyst – accessory Sec system; icaAB - biofilm formation genes; cna - collagen-binding protein; xylT - xylose-proton symporter; poxB - pyruvate oxidase) KG16-1 genome contained 75 unique genes that are not present in other genomes. Of these genes, uncharacterized and phage protein-coding genes constituted 77 % (58) (Fig. 5b). The genome contained the cluster for lacticin-481 biosynthesis (LEKG_0458-0465), which was also present only in strain C120c and homologies to that in [42, 43]. The lacticin-481 type bacteriocin has been shown to be active against LAB and food-spoilage bacterium [43]. Genes involved in catabolism of type I galactan were present in other vegetable strains of besides KG16-1 (KSL4-2, PL111 and PB1e). Peculiarly, the KG16-1 genome was lacking several functionally interesting genes that are present in other strains (Table 6, Fig. 5a). They included a polyketide biosynthesis cluster and the genome locus containing pyruvate oxidase poxB, collagen-binding protein cna, biofilm formation genes icaAB and, as already mentioned, xylose transporter xylT. Besides seven strains, the same polyketide biosynthesis cluster (in terms of domain architecture and gene synteny) was found only in JIM 8232 (genes pig-1,2,3,4). Polyketides are bioactive compounds that can exhibit antibacterial, immunosuppressive and antitumor activities [44]. Pyruvate oxidase is a hydrogen peroxide-producing enzyme and was speculated to be associated with meat discoloration [14]. Genes can and icaAB were suggested to mediate adhesion and, hence, better survival in meat environment of LMG 18811 [14]. They exhibit homology to collagen adhesin and polysaccharide adhesin biosynthesis genes, respectively, from [45, 46]. Our analysis shows that, except for KG16-1, they were also present in vegetable strains. The orthologs of another putative adhesin, mucus-binding protein (LPxTG-like motif-containing), were found only in strains. In the KG16-1 strain, this gene contained a frameshift due to the insertion of A nucleotide at the position 416,796 (confirmed by PCR and Sanger sequencing). Finally, an accessory Sec system involved in the export and glycosylation of serine-rich adhesins [47] was only detected in a few species, including strains (except for LMG 18297). Serine-rich proteins were found in the vicinity of this system in two complete genomes of LMG 18811 and KG16-1. They were also present in other draft genomes and might be involved in adhesion. The latter might play an important role in the survival and persistence of the bacteria considered in a food-processing environment.

Conclusions

Seven vegetable spoilage-associated strains of were sequenced, and one (KG16-1) was fully assembled, functionally annotated and described in detail in this paper. The gene contents were compared between these vegetable strains, the meat-spoilage-associated strain LMG 18811 of the same species and 33 other species sequenced to date. As a result, no obvious differences in gene contents between the meat strain and vegetable strains of were found that would explain their adaptation to different ecological niches. Therefore, the absence of cross-contamination between vegetable- and meat-processing chains seems to be the more likely factor explaining strain segregation between vegetable- and meat-based food products. Finally, the distribution of functionally interesting genes (spoilage-, adhesion- and bacteriocin-related) was determined across the strains and other leuconostocs.

Abbreviations

LAB, Lactic acid bacteria; MAP, Modified-atmosphere packaged; MRS, de Man-Rogosa-Sharpe (medium); RM, restriction-modification
  54 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

3.  The Staden package, 1998.

Authors:  R Staden; K F Beal; J K Bonfield
Journal:  Methods Mol Biol       Date:  2000

4.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

5.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

6.  PANNZER: high-throughput functional annotation of uncharacterized proteins in an error-prone environment.

Authors:  Patrik Koskinen; Petri Törönen; Jussi Nokso-Koivisto; Liisa Holm
Journal:  Bioinformatics       Date:  2015-01-08       Impact factor: 6.937

Review 7.  Emerging themes in SecA2-mediated protein export.

Authors:  Meghan E Feltcher; Miriam Braunstein
Journal:  Nat Rev Microbiol       Date:  2012-09-24       Impact factor: 60.633

8.  Spoilage of value-added, high-oxygen modified-atmosphere packaged raw beef steaks by Leuconostoc gasicomitatum and Leuconostoc gelidum.

Authors:  Elina J Vihavainen; K Johanna Björkroth
Journal:  Int J Food Microbiol       Date:  2007-09-01       Impact factor: 5.277

9.  PHAST: a fast phage search tool.

Authors:  You Zhou; Yongjie Liang; Karlene H Lynch; Jonathan J Dennis; David S Wishart
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

10.  A Year of Infection in the Intensive Care Unit: Prospective Whole Genome Sequencing of Bacterial Clinical Isolates Reveals Cryptic Transmissions and Novel Microbiota.

Authors:  David J Roach; Joshua N Burton; Choli Lee; Bethany Stackhouse; Susan M Butler-Wu; Brad T Cookson; Jay Shendure; Stephen J Salipante
Journal:  PLoS Genet       Date:  2015-07-31       Impact factor: 5.917

View more
  6 in total

1.  Food Spoilage-Associated Leuconostoc, Lactococcus, and Lactobacillus Species Display Different Survival Strategies in Response to Competition.

Authors:  Margarita Andreevskaya; Elina Jääskeläinen; Per Johansson; Anne Ylinen; Lars Paulin; Johanna Björkroth; Petri Auvinen
Journal:  Appl Environ Microbiol       Date:  2018-06-18       Impact factor: 4.792

2.  Transcriptomic time-series analysis of cold- and heat-shock response in psychrotrophic lactic acid bacteria.

Authors:  Ilhan Cem Duru; Anne Ylinen; Sergei Belanov; Alan Avila Pulido; Lars Paulin; Petri Auvinen
Journal:  BMC Genomics       Date:  2021-01-07       Impact factor: 3.969

3.  Complete genome sequence of Leuconostoc suionicum DSM 20241T provides insights into its functional and metabolic features.

Authors:  Byung Hee Chun; Se Hee Lee; Hye Hee Jeon; Dong-Woon Kim; Che Ok Jeon
Journal:  Stand Genomic Sci       Date:  2017-07-17

4.  Metagenomic approach reveals microbial diversity and predictive microbial metabolic pathways in Yucha, a traditional Li fermented food.

Authors:  Jiachao Zhang; Xiaoru Wang; Dongxue Huo; Wu Li; Qisong Hu; Chuanbiao Xu; Sixin Liu; Congfa Li
Journal:  Sci Rep       Date:  2016-08-31       Impact factor: 4.379

5.  Deciphering intra-species bacterial diversity of meat and seafood spoilage microbiota using gyrB amplicon sequencing: A comparative analysis with 16S rDNA V3-V4 amplicon sequencing.

Authors:  Simon Poirier; Olivier Rué; Raphaëlle Peguilhan; Gwendoline Coeuret; Monique Zagorec; Marie-Christine Champomier-Vergès; Valentin Loux; Stéphane Chaillou
Journal:  PLoS One       Date:  2018-09-25       Impact factor: 3.240

6.  Bacterial Communities and Prediction of Microbial Metabolic Pathway in Rice Wine Koji From Different Regions in China.

Authors:  Xinxin Zhao; Fanshu Xiang; Fengxian Tang; Wenchao Cai; Zhuang Guo; Qiangchuan Hou; Xinquan Yang; Wen Song; Chunhui Shan
Journal:  Front Microbiol       Date:  2022-01-03       Impact factor: 5.640

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.