| Literature DB >> 21338475 |
Roland J Siezen1, Jumamurat R Bayjanov, Giovanna E Felis, Marijke R van der Sijde, Marjo Starrenburg, Douwe Molenaar, Michiel Wels, Sacha A F T van Hijum, Johan E T van Hylckama Vlieg.
Abstract
Lactococcus lactis produces lactic acid and is widely used in the manufacturing of various fermented dairy products. However, the species is also frequently isolated from non-dairy niches, such as fermented plant material. Recently, these non-dairy strains have gained increasing interest, as they have been described to possess flavour-forming activities that are rarely found in dairy isolates and have diverse metabolic properties. We performed an extensive whole-genome diversity analysis on 39 L. lactis strains, isolated from dairy and plant sources. Comparative genome hybridization analysis with multi-strain microarrays was used to assess presence or absence of genes and gene clusters in these strains, relative to all L. lactis sequences in public databases, whereby chromosomal and plasmid-encoded genes were computationally analysed separately. Nearly 3900 chromosomal orthologous groups (chrOGs) were defined on basis of four sequenced chromosomes of L. lactis strains (IL1403, KF147, SK11, MG1363). Of these, 1268 chrOGs are present in at least 35 strains and represent the presently known core genome of L. lactis, and 72 chrOGs appear to be unique for L. lactis. Nearly 600 and 400 chrOGs were found to be specific for either the subspecies lactis or subspecies cremoris respectively. Strain variability was found in presence or absence of gene clusters related to growth on plant substrates, such as genes involved in the consumption of arabinose, xylan, α-galactosides and galacturonate. Further niche-specific differences were found in gene clusters for exopolysaccharides biosynthesis, stress response (iron transport, osmotolerance) and bacterial defence mechanisms (nisin biosynthesis). Strain variability of functions encoded on known plasmids included proteolysis, lactose fermentation, citrate uptake, metal ion resistance and exopolysaccharides biosynthesis. The present study supports the view of L. lactis as a species with a very flexible genome.Entities:
Mesh:
Year: 2011 PMID: 21338475 PMCID: PMC3818997 DOI: 10.1111/j.1751-7915.2011.00247.x
Source DB: PubMed Journal: Microb Biotechnol ISSN: 1751-7915 Impact factor: 5.813
Strains included in the analysis.
| Strain code | Internal collection code | Isolation source | Other information |
|---|---|---|---|
|
| |||
| ATCC19435T | NIZO 29T | Milk (dairy starter) | |
| Li‐1 | NIZO 1156 | Grass | |
| E34 | NIZO 1173 | Silage | |
| N42 | NIZO 1230 | Soil and grass | |
| DRA4 | NIZO 1592 | Dairy starter A | |
| ML8 | NIZO 20 | Dairy starter | |
| LMG9446, NCFB1867 | NIZO 2123 | Frozen peas | |
| LMG9449, NCFB1868 | NIZO 2124 | Frozen peas | |
| K231 | NIZO 2199 | White kimchii | |
| K337 | NIZO 2202 | White kimchii | |
| NCDO895, NCIMB700895 | NIZO 2211 | Dairy starter | |
| KF7 | NIZO 2219 | Alfalfa sprouts | |
| KF24 | NIZO 2220 | Alfalfa sprouts | |
| KF67 | NIZO 2223 | Grapefruit juice | |
| KF134 | NIZO 2226 | Alfalfa and radish sprouts | |
| KF146 | NIZO 2229 | Alfalfa and radish sprouts | |
| KF147 | NIZO 2230 | Mung bean sprouts | |
| KF196 | NIZO 2236 | Japanese kaiwere shoots | |
| KF201 | NIZO 2238 | Sliced mixed vegetables | |
| B2244B | NIZO 3919 | Mustard and cress | |
| KF282 | NIZO 3920 | Mustard and cress | |
| LMG14418 | NIZO 2424 | Bovine milk | |
| IL1403 | NIZO 2441 | Dairy starter | Plasmid‐free derivative of |
| LMG8526, NCFB2091 | NIZO 26 | Chinese radish seeds | |
| UC317 | NIZO 644 | Dairy starter | |
| M20 | NIZO 844 | Soil | |
| P7304 | NIZO 2207 | Litter on pastures | rRNA most related to isolates from prawns |
| P7266 | NIZO 2206 | Litter on pastures | rRNA most related to isolates from prawns |
|
| |||
| V4 | NIZO 1157 | Raw sheep milk | |
| KW10 | NIZO 2249 | Kaanga way | |
| NCDO763, ML3 | NIZO 643 | Dairy starter | Derivative of NCDO712 |
| MG1363 | NIZO 1492 | Cheese starter | Plasmid‐free derivative of NCDO712 |
| N41 | NIZO 1175 | Soil and grass | |
|
| |||
| LMG6897T | NIZO 2418T | Cheese starter | Subculture of strain HP |
| FG2 | NIZO 2252 | Dairy starter | |
| AM2 | NIZO 33 | Dairy starter | |
| HP | NIZO 42 | Dairy starter | |
| SK11 | NIZO 32 | Dairy starter | Phage‐resistant derivative of AM1 |
|
| |||
| LMG8520T | NIZO 24T | Leaf hopper (insect) | |
Hybridization and genotyping accuracy for the four reference strains.
| Genotyping | IL1403 | KF147 | MG1363 | SK11 |
|---|---|---|---|---|
| OGs with at least one gene from reference strain | 2286 | 2428 | 2406 | 2289 |
| OGs with score NA | 132 | 181 | 274 | 109 |
| OGs correctly identified as ‘present’ (true positives) | 2101 | 2226 | 2056 | 2130 |
| OGs incorrectly identified as ‘absent’ (false negatives) | 53 | 21 | 76 | 50 |
| True‐positive rate | 97.5% | 99.1% | 96.4% | 97.7% |
| False‐negative rate | 2.5% | 0.9% | 3.6% | 2.3% |
NA means that the presence/absence of an OG could not be calculated, either because the corresponding genes were not represented on the microarray, or due to an insufficient number of probes matching to members of this OG (by default at least 10 probes must be aligned).
Note that strain MG1363 was not used in the CGH array design, and therefore the positive recall for this strain was slightly lower than for the other three reference strains.
Chromosomal orthologous groups (chrOGs), derived from pan‐genome CGH analysis, and their presence in L. lactis strains according to different criteria.
| Analysed groups | Number | Other information |
|---|---|---|
| Total orthologous groups | 3877 | Based on four sequenced |
| Core chrOGs for sequenced genomes | 1513 | Based on four sequenced |
| Number of groups reliably analysable by CGH | 3255 | 622 OGs not on array or not analysed |
| Core chrOGs for the species | 1121 | Strains P7266 and P7304 omitted |
| Core chrOGs for the species | 1268 | Strains KW10 and KF282 also omitted; see Table S1 |
| Core chrOGs linked to LaCOGs | 1246 | Table S1 |
| Core chrOGs only in | 72 | Table S2; not in other LAB |
| Variable chrOGs in 35 strains | 1987 | See distribution in |
Lactococcus lactis ssp. cremoris strains SK11 and MG1363, L. lactis ssp. lactis strains IL1403 and KF147.
Figure 1Whole‐genome content‐based tree. Hierarchical clustering tree of L. lactis strains based on presence/absence of all chromosomal orthologous groups (chrOGs) in these strains. The binary distance metric was used in combination with the average linkage clustering algorithm. Solid rectangles signify dairy isolates, while the other strains signify mainly plant origin. The top clade of 10 strains corresponds to ssp. cremoris genotype, further divided into two subclades, corresponding to the two phenotypes, i.e. cremoris‐like (upper subclade) and lactis‐like phenotype (lower subclade). The lower clade of 27 strains contains only L. lactis ssp. lactis and ssp. hordniae type strain LMG8520T. This clade grouping ssp. lactis strains contains subclades corresponding to isolation source (dairy versus non‐dairy). Strains P7266 and P7304 are clustered far apart from the other subspecies with a lactis genotype.
Figure 2Distribution of chrOGs in the strains. Distribution of chromosomal orthologous groups (chrOGs) in 37 strains (A) and in 35 strains (B). Strains P7304 and P7266 are omitted in (A) and strains KW10 and KF282 are also omitted in (B), due to ambiguities in hybridization efficiencies (see text). The bar on the outer right represents the total number of chrOGs in the core genome. Shading indicates whether the chrOGs are present only in ssp. cremoris strains (black), only in ssp. lactis strains (white) or in both subspecies (grey).
Lactococcus lactis specific core genes with predicted functionsa in 35 strains.
| chrOG id | LaCOG id | Size (AA) | Consensus function | Best hit in non‐LAB organism |
|---|---|---|---|---|
| 1626 | LaCOG02385 | 162–180 | Acetyltransferase, GNAT family | |
| 336 | LaCOG02698 | 152 | Acetyltransferase, GNAT family | |
| 1134 | LaCOG02731 | 1436 | Activator of (R)‐2‐hydroxyglutaryl‐CoA dehydratase | |
| 1884 | LaCOG02722 | 213 | Aminoglycoside phosphotransferase | |
| 350 | LaCOG02578 | 379–383 | ATP/GTP‐binding protein | |
| 202 | LaCOG02425 | 784 | Carbon starvation protein A | |
| 1125 | LaCOG02464 | 134–151 | Dinucleoside polyphosphate hydrolase | |
| 262 | LaCOG02721 | 251–261 | Metallophosphoesterase | |
| 463 | LaCOG02619 | 462–465 | MF superfamily multidrug resistance protein | |
| 174 | LaCOG02554 | 443 | NAD(FAD)‐utilizing dehydrogenase | |
| 2067 | LaCOG02712 | 535 | NADH dehydrogenase | |
| 1192 | LaCOG02661 | 101 | O6‐methylguanine‐DNA methyltransferase | |
| 339 | LaCOG02380 | 145 | Osmotically inducible protein C | |
| 1256 | LaCOG02428 | 1190–1223 | Pyruvate‐flavodoxin oxidoreductase | |
| 1483 | LaCOG02566 | 276–296 | Rgg/GadR/MutR family transcriptional regulator | |
| 2408 | LaCOG02658 | 160–163 | SUF system FeS assembly protein | |
| 1011 | LaCOG02734 | 151 | Transporter | None |
| 2258 | LaCOG02509 | 143 | Universal stress protein | |
| 636 | LaCOG02670 | 141 | Universal stress protein A | |
| 1370 | LaCOG02404 | 269–303 | Zinc‐binding dehydrogenase |
For a full list of the 72 L. lactis‐specific chrOGs see Table S2.
Size (in AA) of protein in four reference L. lactis genomes.
Main subspecies‐specific conserved gene clusters and functions.
| (A) Subspecies | |||
|---|---|---|---|
| Locus | Gene | Function | Comment |
| LLKF_0567 | ImpB/MucB/SamB family protein | ||
| LLKF_0568 | Acetyltransferase, GNAT family | ||
| LLKF_0569 | Transcriptional regulator, MarR family | ||
| LLKF_0570 | Organic hydroperoxide resistance family protein | ||
| LLKF_1314 | NhaP‐type Na+/H+ and K+/H+ antiporter | Cluster not in UC317, LMG8520 | |
| LLKF_1315 | Hypothetical protein | ||
| LLKF_1316 | Alpha‐amylase | ||
| LLKF_1317 | |||
| LLKF_1605 | Endo‐beta‐ | Arabinose gene cluster is inserted between | |
| LLKF_1606 | Glucan 1,6‐alpha‐glucosidase (EC 3.2.1.70) | ||
| LLKF_1607 | Lacto‐ | ||
| LLKF_1608 | Sugar ABC transporter, substrate‐binding protein | ||
| LLKF_1609 | Sugar ABC transporter, permease protein | ||
| LLKF_1610 | Sugar ABC transporter, permease protein | ||
| LLKF_1611 | Alpha‐mannosidase (EC 3.2.1.24) | ||
| LLKF_1612 | Hypothetical protein | ||
| LLKF_1613 | Transcriptional regulator, GntR family | ||
| LLKF_1614 | Alpha‐1,2‐mannosidase (EC 3.2.1.24) | ||
| LLKF_1615 | Phosphoketolase (EC 4.1.2.9) | ||
| LLKF_1623 | |||
| LLKF_1624 | Acetyltransferase (EC 2.3.1.‐) | ||
| LLKF_1625 | Beta‐1,4‐xylosidase | ||
| LLKF_1626 | Xyloside transporter | ||
| LLKF_1627 | Aldose‐1‐epimerase (EC 5.1.3.3) | ||
| LLKF_1628 | Xylulose kinase (EC 2.7.1.17) | ||
| LLKF_1859 | Carbamate kinase (EC 2.7.2.2) | Cluster partially absent in LMG9449; there are other copies of carbamate kinase | |
| LLKF_1860 | Agmatine deiminase (EC 3.5.3.12) | ||
| LLKF_1861 | Agmatine/putrescine antiporter | ||
| LLKF_1862 | Putrescine carbamoyltransferase (EC 2.1.3.6) | ||
| LLKF_1863 | Transcriptional regulator, LuxR family | ||
| LLKF_2026 | Magnesium and cobalt efflux protein | ||
| LLKF_2027 | Penicillin acylase (EC 3.5.1.11) | ||
| LLKF_2028 | Protein‐tyrosine phosphatase (EC 3.1.3.48) | ||
| LLKF_2164 | Beta‐galactosidase (EC 3.2.1.23) | ||
| LLKF_2165 | Galactoside | ||
| (B) Subspecies | |||
| LACR_0451 | Antibiotic export permease protein | Inserted relative to IL1403, KF147 | |
| LACR_0452 | Antibiotic export ATP‐binding protein | ||
| LACR_0453 | Transcriptional regulator, MarR family | ||
| LACR_0498 | Hypothetical protein | Cluster unique for | |
| LACR_0501 | Hypothetical protein | Gene absent in FG2, HP | |
| LACR_0502 | Hypothetical protein | ||
| LACR_0505 | Hypothetical protein | ||
| LACR_0506 | Hypothetical protein | ||
| LACR_0507 | Hypothetical protein | ||
| LACR_0508 | Hypothetical protein | ||
| LACR_0509 | Hypothetical protein | ||
| LACR_0754 | Hypothetical protein | ||
| LACR_0755 | Cold‐shock DNA‐binding protein family protein | ||
| LACR_0756 | Cold‐shock DNA‐binding protein family protein | ||
| LACR_0761 | Sugar ABC transporter permease | In IL1403 a transposase at this position | |
| LACR_0762 | Sugar ABC transporter permease | ||
| LACR_0763 | Oligosaccharide‐binding protein | ||
| LACR_0764 | Integral membrane protein | ||
| LACR_0765 | Alpha‐glucosidase (EC 3.2.1.30) | ||
| LACR_1288 | Transcriptional regulator, AraC family | Glycan degradation; similar clusters in | |
| LACR_1289 | Major facilitator superfamily permease | Gene absent in FG2, HP | |
| LACR_1290 | Glucan 1,3‐beta‐glucosidase (EC 3.2.1.58) | Gene absent in FG2, HP, LMG6897T | |
| LACR_1291 | Beta‐xylosidase (EC 3.2.1.37) | ||
| LACR_1632 | PTS system cellobiose‐specific, IIC component | Whole gene cluster absent in V4, KW10 | |
| LACR_1633 | Transcriptional regulator, AraC family | Gene absent in FG2, HP, LMG6897T | |
| LACR_1636 | Glucokinase (EC 2.7.1.2)/transcription regulator | Gene absent in FG2, HP, LMG6897T | |
| LACR_1637 | 6‐Phospho‐beta‐glucosidase (EC 3.2.1.86) | ||
| LACR_1638 | Ribose‐5‐phosphate isomerase B (EC 5.3.1.6) | ||
| LACR_1639 | Ribulose‐5‐phosphate 3‐epimerase (EC 5.1.3.1) | ||
| LACR_1640 | Transcription regulator, LacI family | ||
| LACR_2591 | Hypothetical protein | ||
| LACR_2592 | Hypothetical protein | ||
| LACR_2593 | Hypothetical protein | ||
| LACR_2594 | Hypothetical protein | ||
For the conserved OGs, members from a reference genome are listed, i.e. LLKF = L. lactis ssp. lactis KF147; LACR = L. lactis ssp. cremoris SK11. Numbering indicates genomic position relative to other chromosomal genes, where consecutively numbered genes are generally in an operon.
These genes are predicted to be present in all strains of a subspecies, either lactis or cremoris, and absent in all strains of the other subspecies. Exceptions are indicated.
Diversity of chromosomally encoded gene clusters and functions.
| Strain | Subspecies | Dairy | Arabinose metabolism | Sucrose metabolism | Galacturonate metabolism | α‐Galactoside metabolism | Xylan breakdown | Starch/maltose breakdown | Trp metabolism | Leu/Ile/Val metabolism | Citrate metabolism | High‐affinity K+ transport | Nisin production/immunity | EPS biosynthesis ( | Teichoic acid biosynthesis |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LMG6897T | C | D | + | S+ | |||||||||||
| HP | C | D | + | S+ | |||||||||||
| FG2 | C | D | + | S+ | |||||||||||
| SK11 | C | D | + | + | + | S+ | |||||||||
| AM2 | C | D | + | + | + | +/− | S+ | ||||||||
| NCD0763 | C | D | + | + | + | M+ | |||||||||
| MG1363 | C | D | + | + | + | M+ | |||||||||
| N41 | C | + | + | + | +/− | +/− | M+ | ||||||||
| V4 | C | D | + | + | + | + | + | M+ | |||||||
| KW10 | C | + | + | +/− | + | ? | |||||||||
| B2244B | L | + | + | + | + | +/− | + | + | + | + | |||||
| LMG8526 | L | + | + | + | + | + | + | + | + | + | +/− | K+/− | |||
| Li‐1 | L | +/− | + | + | +/− | + | + | + | + | I+/− | |||||
| K231 | L | +/− | + | +/− | +/− | + | + | + | + | I+/− | |||||
| KF7 | L | +/− | + | + | + | + | + | + | +/− | K+/− | |||||
| LMG9449 | L | +/− | + | + | + | +/− | + | + | + | I+/− | |||||
| KF24 | L | +/− | + | +/− | + | + | +/− | + | |||||||
| KF146 | L | + | + | + | + | + | + | + | + | + | |||||
| KF134 | L | + | + | + | + | + | + | +/− | |||||||
| KF196 | L | + | + | + | + | + | + | + | +/− | ||||||
| KF67 | L | + | + | + | + | + | + | + | +/− | ||||||
| KF201 | L | + | + | + | + | + | + | I+/− | |||||||
| E34 | L | +/− | + | + | + | + | |||||||||
| K337 | L | +/− | + | +/− | +/− | + | + | + | + | I+/− | |||||
| M20 | L | +/− | + | +/− | + | + | + | + | K+/− | ||||||
| LMG8520 | H | + | + | + | +/− | ||||||||||
| UC317 | L | D | + | + | + | + | I+ | ||||||||
| NCD0895 | L | D | + | + | + | + | + | + | I+ | ||||||
| ML8 | L | D | + | + | + | I+ | |||||||||
| LMG14418 | L | D | + | + | +/− | + | + | + | +/− | I+ | |||||
| N42 | L | + | + | + | + | + | I+ | ||||||||
| IL1403 | L | D | + | + | + | + | I+ | ||||||||
| DRA4 | L | D | + | + | + | + | I+ | ||||||||
| LMG9446 | L | + | +/− | + | + | + | + | +/− | |||||||
| KF147 | L | + | + | + | + | + | + | + | + | + | + | + | K+/− | ||
| ATCC19435T | L | D | +/− | +/− | +/− | + | + | + | K+/− | ||||||
| KF282 | L | + | + | +/− | +/− | + | + | + | + | K+/− |
Predicted presence of chromosomally encoded gene clusters and their functions in the L. lactis strains. L: ssp. lactis; C: ssp. cremoris; D: dairy;
denotes plasmid‐cured strain; + denotes presence of all of the required genes; +/− denotes presence of some of the required genes. Teichoic acid biosynthesis: I = IL1403 type, M = MG1363 type, S = SK11 type, K = KF147 type. Strains P7266 and P7304 were omitted from this analysis.
Figure 3Variable gene clusters involved in sugar breakdown. As no gene order is known for the query strains, the representative clusters present in the reference genome L. lactis KF147 are shown. (A) Arabinose metabolism; (B) sucrose metabolism; (C) galacturonate metabolism; (D) α‐galactoside metabolism; (E) xylan breakdown; (F) starch breakdown. Coloured bars indicate operon predictions of two or more genes; stalks indicate predicted terminators. Images made using MINOMICS (Brouwer ). Gene annotations are in Table S4.
Figure 4Variable gene clusters for cell‐envelope biosynthesis. As no gene order is known for the query strains, the representative clusters present in the reference genome L. lactis KF147 are shown. (A) Exopolysaccharide (EPS) biosynthesis cluster I; (B) exopolysaccharide (EPS) biosynthesis cluster II; (C) teichoic acid biosynthesis cluster. Coloured bars indicate operon predictions of two or more genes; stalks indicate predicted terminators. Images made using MINOMICS (Brouwer ). Gene annotations are in Table S4.
Diversity of putative plasmid‐encoded genes and functions.
| Strain | Subspecies | Dairy | Replication/partitioning | Mobilization/conjugation | Proteolysis ( | Copper resistance | Cadmium resistance | Manganese transport | Lactose metabolism | Citrate uptake ( | Glu dehydrogenase | EPS synthesis |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LMG6897T | C | D | + | +/− | + | + | + | |||||
| HP | C | D | + | + | + | + | ||||||
| FG2 | C | D | + | + | + | + | + | |||||
| SK11 | C | D | + | +/− | + | + | + | + | ||||
| AM2 | C | D | + | +/− | + | + | + | + | + | |||
| NCD0763 | C | D | + | + | + | + | ||||||
| MG1363 | C | D | ||||||||||
| N41 | C | + | + | + | + | + | + | + | +/− | |||
| V4 | C | D | + | |||||||||
| KW10 | C | + | +/− | +/− | +/− | + | +/− | |||||
| B2244B | L | +/− | ||||||||||
| LMG8526 | L | +/− | + | +/− | ||||||||
| Li‐1 | L | + | +/− | + | + | + | ||||||
| K231 | L | + | +/− | + | ||||||||
| KF7 | L | + | + | +/− | ||||||||
| LMG9449 | L | + | + | +/− | + | |||||||
| KF24 | L | + | ||||||||||
| KF146 | L | +/− | ||||||||||
| KF134 | L | +/− | ||||||||||
| KF196 | L | + | +/− | |||||||||
| KF67 | L | +/− | ||||||||||
| KF201 | L | |||||||||||
| E34 | L | |||||||||||
| K337 | L | |||||||||||
| M20 | L | |||||||||||
| LMG8520 | H | + | ||||||||||
| UC317 | L | D | + | + | + | + | + | + | + | +/− | ||
| NCD0895 | L | D | + | + | + | + | + | |||||
| ML8 | L | D | + | + | + | + | + | + | ||||
| LMG14418 | L | D | ||||||||||
| N42 | L | + | + | + | +/− | + | + | + | ||||
| IL1403 | L | D | ||||||||||
| DRA4 | L | D | + | +/− | + | + | + | + | +/− | |||
| LMG9446 | L | + | +/− | +/− | + | + | ||||||
| KF147 | L | + | +/− | |||||||||
| ATCC19435T | L | D | + | + | ||||||||
| KF282 | L | + | ||||||||||
| P7304 | L | + | +/− | + | +/− | + | +/− | |||||
| P7266 | L | + | +/− | +/− | + | + | +/− |
Predicted presence of plasmid‐encoded genes and their functions in the L. lactis strains. L: ssp. lactis; C: ssp. cremoris; D: dairy;
denotes plasmid‐cured strain; + denotes the presence of all or most of the required genes, +/− denotes the presence of some of the required genes. Genes that are known to be both chromosomally and plasmid‐encoded are not included in this analysis, e.g. transposases, intergrases/recombinases, restriction/modification system (hsdM, hsdR, hsdS), proteolytic system (pcp, pepO, pepF, oppACBFD), cold shock proteins and all plasmid‐encoded genes that hybridized with the plasmid‐free strains IL1403 or MG1363.
Figure 5Polyacrylamide gel electrophoresis of plasmid DNA in L. lactis strains. Far left and right lanes contain molecular weight markers. The lower three panels are Southern blots of the same gel as at top, using probes for the citP, lacG and prtP genes. The arrow indicates an artefact band, present in all lanes, and presumably due to contaminating chromosomal DNA.