| Literature DB >> 24116025 |
Hidehiro Toh1, Kenshiro Oshima, Akiyo Nakano, Muneaki Takahata, Masaru Murakami, Takashi Takaki, Hidetoshi Nishiyama, Shizunobu Igimi, Masahira Hattori, Hidetoshi Morita.
Abstract
Lactobacillus casei, L. paracasei, and L. rhamnosus form a closely related taxonomic group (Lactobacillus casei group) within the facultatively heterofermentative lactobacilli. Here, we report the complete genome sequences of L. paracasei JCM 8130 and L. casei ATCC 393, and the draft genome sequence of L. paracasei COM0101, all of which were isolated from daily products. Furthermore, we re-annotated the genome of L. rhamnosus ATCC 53103 (also known as L. rhamnosus GG), which we have previously reported. We confirmed that ATCC 393 is distinct from other strains previously described as L. paracasei. The core genome of 10 completely sequenced strains of the L. casei group comprised 1,682 protein-coding genes. Although extensive genome-wide synteny was found among the L. casei group, the genomes of ATCC 53103, JCM 8130, and ATCC 393 contained genomic islands compared with L. paracasei ATCC 334. Several genomic islands, including carbohydrate utilization gene clusters, were found at the same loci in the chromosomes of the L. casei group. The spaCBA pilus gene cluster, which was first identified in GG, was also found in other strains of the L. casei group, but several L. paracasei strains including COM0101 contained truncated spaC gene. ATCC 53103 encoded a higher number of proteins involved in carbohydrate utilization compared with intestinal lactobacilli, and extracellular adhesion proteins, several of which are absent in other strains of the L. casei group. In addition to previously fully sequenced L. rhamnosus and L. paracasei strains, the complete genome sequences of L. casei will provide valuable insights into the evolution of the L. casei group.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24116025 PMCID: PMC3792948 DOI: 10.1371/journal.pone.0075073
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Circular representations of the chromosomes of L. rhamnosus ATCC 53103, L. paracasei JCM 8130, and L. casei ATCC 393.
From the outside: circles 1 and 2 of the chromosome show the positions of protein-coding genes on the positive and negative strands, respectively. Circle 3 shows the positions of protein-coding genes that are shared among the 10 completely sequenced genomes of the L. casei group. Circle 4 shows the positions of tRNA genes (orange) and rRNA genes (blue). Circle 5 shows a plot of GC skew [(G − C)/(G+C); orange indicates values >0; blue indicates values <0]. Circle 6 shows a plot of G+C content (outward: higher values than the average). The genomic islands in each strain are boxed: regions including carbohydrate utilization gene cluster (pink), prophage-like regions (green), and the others (blue).
Figure 2Genome-based phylogenetic analysis of the L. casei group.
(A) Phylogenetic relationships between the genomes of sequenced lactobacilli inferred from 34 concatenated ribosomal protein amino acid sequences. The scale bar represents an evolutionary distance. Sequences were aligned with ClustalW with a bootstrap trial of 1,000 and bootstrap values (%) are indicated at the nodes. An unrooted tree was generated using NJplot. The chromosome size is shown in parentheses. (B) Three-way comparisons between L. casei ATCC 393 with L. rhamnosus ATCC 53103 and L. paracasei ATCC 334. The 2,191 genes shared by the three strains were classified into three categories on the basis of the BLAST score ratio analysis [23]. (C) Venn diagram comparing the gene inventories of four strains of the L. casei group. Data resulted from reciprocal BLASTP analysis. The numbers of shared and unique genes are shown.
Figure 3Abundance of genes related to carbohydrate transport and metabolism in L. rhamnosus ATCC 53103.
(A) Comparative analysis by functional categories of the gene repertoires of sequenced intestinal lactobacilli. The number of genes assigned to COG category G in each genome is shown. (B) Carbohydrate utilization gene clusters of L. rhamnosus ATCC 53103. Genes and their orientations are depicted with arrows. Regions -5 and -6 are compared with the corresponding genomic locations in L. rhamnosus Lc 705. Gray bars indicate orthologous regions.
Genomic islands in L. rhamnosus ATCC 53103, L. paracasei JCM 8130, and L. casei ATCC 393.
|
|
|
| ||||||
| Locus | Size (kb) | Product description | Locus | Size (kb) | Product description | Locus | Size (kb) | Product description |
| LRHM_0019– LRHM_0031 | 12.5 | ammonium transporter protein,hypothetical protein | LBPC_0071– LBPC_0078 | 6.1 | conserved hypothetical protein | LBCZ_0041– LBCZ_0051 | 8.3 | hypothetical protein, transposase |
| LRHM_0044– LRHM_0073 | 39.7 | fibronectin-binding protein, beta-glucuronidase, 2-dehydro-3-deoxygluconokinase, mannonate dehydratase, fructuronate reductase | LBPC_0157– LBPC_0170 | 15.5 | conserved hypothetical protein | LBCZ_0065– LBCZ_0076 | 12.6 | transposase, conserved hypothetical protein |
| LRHM_0086– LRHM_0096 | 10.5 | carbohydrate utilization gene cluster (region-1) | LBPC_0276– LBPC_0297 | 23.8 | carbohydrate utilization gene cluster | LBCZ_0159– LBCZ_0174 | 15.7 | myo-inositol catabolism protein |
| LRHM_0149– LRHM_0156 | 6.1 | carbohydrate transporter, two-componentsystem | LBPC_0331– LBPC_0359 | 30.6 | PTS transporter, amino acid ABC transporter | LBCZ_0223– LBCZ_0252 | 33.8 | carbohydrate utilization gene cluster |
| LRHM_0172– LRHM_0177 | 6.6 | taurine ABC transporter | LBPC_0470– LBPC_0499 | 33.4 | hypothetical protein | LBCZ_0277– LBCZ_0286 | 8.9 | conserved hypothetical protein |
| LRHM_0256– LRHM_0268 | 14.8 | myo-inositol catabolism protein | LBPC_0579– LBPC_0584 | 6.7 | PTS transporter, 6-phospho-beta-galactosidase | LBCZ_0338– LBCZ_0388 | 41.1 | prophage region I |
| LRHM_0319– LRHM_0350 | 34.6 | carbohydrate utilization gene cluster (region-2) | LBPC_0636– LBPC_0648 | 12.1 | prophage region I | LBCZ_0605– LBCZ_0617 | 11.9 | hypothetical protein |
| LRHM_0376– LRHM_0466 | 97.8 | carbohydrate utilization gene cluster (region-3), amino acid ABC transporter, beta-N-acetylglucosaminidase, N-acylamino acid racemase, cell surface protein, transposase | LBPC_0763– LBPC_0817 | 41.2 | prophage region II | LBCZ_0620– LBCZ_0675 | 39.9 | prophage region II |
| LRHM_0493– LRHM_0500 | 8.0 | hypothetical protein | LBPC_1168– LBPC_1176 | 9.2 | conserved hypothetical protein | LBCZ_0685– LBCZ_0742 | 54.2 | prophage region III |
| LRHM_0624– LRHM_0631 | 7.3 | carbohydrate utilization gene cluster (region-4) | LBPC_1739– LBPC_1789 | 42.7 | prophage region III | LBCZ_0821– LBCZ_0832 | 13.4 | prophage region IV |
| LRHM_1038– LRHM_1090 | 39.7 | prophage region I | LBPC_1864– LBPC_1906 | 36.7 | prophage region IV | LBCZ_1343– LBCZ_1372 | 36.1 | prophage region V |
| LRHM_1192– LRHM_1199 | 10.4 | amino acid transporter, hypothetical protein | LBPC_1988– LBPC_1998 | 10.9 | glycosyltransferase, transposase | LBCZ_1552– LBCZ_1559 | 7.8 | truncated formate C-acetyltransferase, transcriptional regulator |
| LRHM_1455– LRHM_1484 | 36.5 | prophage region II | LBPC_2364– LBPC_2427 | 70.9 | glycosyltransferase, cell surface protein, conserved hypothetical protein | LBCZ_1571– LBCZ_1577 | 9.9 | putative cell surface protein |
| LRHM_1518– LRHM_1530 | 24.9 | cell surface protein, glycosyltransferase | LBPC_2603– LBPC_2630 | 27.3 | carbohydrate utilization gene cluster | LBCZ_1817– LBCZ_1825 | 9.7 | glycosyltransferase |
| LRHM_1699– LRHM_1703 | 7.7 | cell surface protein | LBPC_2661– LBPC_2670 | 8.4 | putative cell surface protein, transposase | LBCZ_1857– LBCZ_1870 | 23.6 | conserved hypothetical protein |
| LRHM_1877– LRHM_1891 | 13.2 | conserved hypothetical protein, transposase | LBCZ_2040– LBCZ_2046 | 9.2 | conserved hypothetical protein | |||
| LRHM_1959– LRHM_1977 | 19.3 | glycosyltransferase | LBCZ_2167– LBCZ_2179 | 12.2 | conserved hypothetical protein, ABC transporter | |||
| LRHM_2012– LRHM_2019 | 16.4 | conserved hypothetical protein | LBCZ_2185– LBCZ_2247 | 81.7 | putative cell surface protein, conjugative transposon protein | |||
| LRHM_2085– LRHM_2097 | 12.3 | conserved hypothetical protein | LBCZ_2402– LBCZ_2414 | 12.5 | carbohydrate utilization gene cluster | |||
| LRHM_2115– LRHM_2119 | 8.3 | CRISPR-associated protein | LBCZ_2437– LBCZ_2492 | 66.7 | putative cell surface protein, carbohydrate utilization gene cluster | |||
| LRHM_2193– LRHM_2198 | 11.8 | cell surface protein, glycosyltransferase | LBCZ_2499– LBCZ_2517 | 21.1 | transposase, conserved hypothetical protein | |||
| LRHM_2223– LRHM_2230 | 7.3 | multidrug ABC transporter, hypothetical protein | LBCZ_2616– LBCZ_2643 | 31.3 | carbohydrate utilization gene cluster, transposase | |||
| LRHM_2351– LRHM_2356 | 8.1 | multidrug ABC transporter | LBCZ_2678– LBCZ_2694 | 15.0 | transposase | |||
| LRHM_2545– LRHM_2597 | 57.7 | carbohydrate utilization gene cluster (region-5) | LBCZ_2698– LBCZ_2704 | 7.6 | PTS transporter | |||
| LRHM_2635– LRHM_2651 | 15.4 | carbohydrate utilization gene cluster (region-6) | ||||||
| LRHM_2779– LRHM_2793 | 12.5 | prophage region III | ||||||
Figure 4Gene clusters encoding cell surface proteins in L. rhamnosus ATCC 53103.
(A) WxL clusters. (B) Putative glycosylated cell-surface protein clusters. Genes and their orientations are depicted with arrows. Gray bars indicate orthologous regions between L. rhamnosus ATCC 53103 and L. paracasei ATCC 334.
Putative cell surface adherence proteins of L. rhamnosus ATCC 53103.
| Locus | Size (aa) | Contained domain | SignalP |
|
|
|
|
|
|
|
| LRHM_0051 | 1,492 | fibronectin-binding | + | − | − | − | − | − | + | + |
| LRHM_0182 | 106 | LPXTG | + | − | − | − | − | − | + | + |
| LRHM_0183 | 268 | WxL | + | + | + | + | + | + | + | + |
| LRHM_0184 | 359 | DUF916 | + | + | + | + | + | + | + | + |
| LRHM_0426 | 334 | LPXTG (SpaA) | + | − | + | + | + | − | − | − |
| LRHM_0427 | 241 | LPXTG (SpaB) | + | − | + | + | + | − | − | − |
| LRHM_0428 | 895 | LPXTG (SpaC) | + | + | + | + | − | − | − | − |
| LRHM_0555 | 220 | WxL1 | + | + | + | + | + | + | + | + |
| LRHM_0556 | 340 | DUF916 | + | + | + | + | + | + | + | + |
| LRHM_0557 | 118 | LPXTG | + | + | + | + | + | + | + | + |
| LRHM_0558 | 688 | WxL2 | + | − | + | + | − | + | + | + |
| LRHM_0561 | 238 | WxL1 | + | + | + | + | + | − | + | + |
| LRHM_0562 | 124 | LPXTG | + | + | + | + | + | − | + | + |
| LRHM_0563 | 229 | WxL1 | + | + | + | + | + | − | + | + |
| LRHM_0564 | 679 | WxL2 | + | + | + | + | + | − | + | + |
| LRHM_1138 | 401 | LPXTG | − | + | + | + | + | + | + | + |
| LRHM_1331 | 213 | LysM | − | + | + | + | + | + | + | + |
| LRHM_1393 | 567 | fibronectin-binding | − | + | + | + | + | + | + | + |
| LRHM_1528 | 913 | Ig-like fold | + | − | − | − | − | + | + | + |
| LRHM_1529 | 3,275 | LPXTG | + | − | − | − | − | + | + | + |
| LRHM_1699 | 351 | DUF916 | + | − | − | − | − | + | + | + |
| LRHM_1700 | 114 | LPXTG | + | − | − | − | − | − | + | + |
| LRHM_1701 | 262 | WxL | + | − | – | − | − | + | + | + |
| LRHM_1702 | 1,131 | WxL | − | − | − | − | − | + | + | + |
| LRHM_1797 | 2,357 | LPXTG | − | − | − | − | − | + | + | + |
| LRHM_2006 | 1,561 | LPXTG | + | − | − | − | − | − | + | − |
| LRHM_2185 | 1,973 | LPXTG | + | + | + | + | + | + | + | + |
| LRHM_2193 | 1,653 | LPXTG | − | − | − | − | − | + | + | + |
| LRHM_2248 | 388 | LPXTG, mucin-binding domain | − | + | + | + | + | + | + | + |
| LRHM_2279 | 517 | LPXTG (SpaD) | + | + | + | + | + | − | + | + |
| LRHM_2281 | 983 | LPXTG (SpaF) | + | + | + | + | + | − | + | + |
| LRHM_2626 | 1,494 | LPXTG | + | − | − | − | − | − | + | + |
| LRHM_2815 | 2,603 | LPXTG | + | + | + | + | + | − | + | + |
‘+’ indicates that the orthologous gene is present, and ‘−’ indicates that the orthologous gene is absent.