| Literature DB >> 35291951 |
Emeline Roux1,2,3, Aurélie Nicolas1, Florence Valence1, Grégoire Siekaniec1,3, Victoria Chuat1, Jacques Nicolas3, Yves Le Loir1, Eric Guédon4.
Abstract
BACKGROUND: Streptococcus thermophilus is a Gram-positive bacterium widely used as starter in the dairy industry as well as in many traditional fermented products. In addition to its technological importance, it has also gained interest in recent years as beneficial bacterium due to human health-promoting functionalities. The objective of this study was to inventory the main health-promoting properties of S. thermophilus and to study their intra-species diversity at the genomic and genetic level within a collection of representative strains.Entities:
Keywords: Bacteriocin; Comparative genomics; Conjugated linoleic acid; GABA; Galactose; Lactose; Pangenome; PrtS; Vitamin
Mesh:
Substances:
Year: 2022 PMID: 35291951 PMCID: PMC8925076 DOI: 10.1186/s12864-022-08459-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Health-promoting properties of S. thermophilus. SCFAs, short chain fatty acids; CLA, conjugated linoleic acid; CLNA, conjugated linolenic acid; EPS, exopolysaccharides; GABA, gamma-aminobutyric acid
Fig. 2Genomic and phylogenomic features of S. thermophilus. A Core and pan-genome size evolution in S. thermophilus isolates according to the number of genomes. Each point represents the number of protein families conserved between genomes. All of the points are plotted as a function of the strain number (x). The deduced pan-genome size is y = 1920.3 × 0.3052 (R2 = 0.9972). B. The distribution of core and accessory proteins according to the COG classification. The x-axis indicates the percentage of proteins in various COG categories. (A) RNA processing and modification; (C) Energy production and conversion; (D) Cell cycle control, cell division, chromosome partitioning; (E) Amino acid transport and metabolism; (F) Nucleotide transport and metabolism; (G) Carbohydrate transport and metabolism; (H) Coenzyme transport and metabolism; (I) Lipid transport and metabolism; (J) Translation, ribosomal structure and biogenesis; (K) Transcription; (L) Replication, recombination, and repair; (M) Cell wall/membrane/envelope biogenesis; (N) Cell motility; (O) Posttranslational modification, protein turnover, chaperones; (P) Inorganic ion transport and metabolism; (Q) Secondary metabolites biosynthesis, transport, and catabolism; (S) Function unknown; (T) Signal transduction mechanisms; (U) Intracellular trafficking and secretion; (V) Defense mechanisms. C Word cloud of pseudogenes annotations. Annotation of pseudogenes provided by MicroScope was filtered to remove non-informative words (see Methods section for the complete list of removed words). Word size is calculated accordingly to its occurrence. D Heatmap showing the pairwise percentage of differences (% differences) between core proteome of the 79 STH strains. Strains were clustered into three clusters (a, b, c). Subgroups within the clusters are also identified (b1, b2, c1, c2, and c3)
Fig. 3Metabolism of galactose and lactose in S. thermophilus. A Schematic representation of the main steps of lactose and galactose catabolism in STH. LacEF, lactose/galactose PTS system; LacG, 6-phospho-beta-galactosidase; LacD, tagatose 1,6-biphosphate aldolase; LacC, tagatose-6-phosphate kinase; LacAB, galactose-6-phosphate isomerase subunits A and B; LacS, lactose permease; LacZ, beta-galactosidase; GalM, galactose mutarotase; LacX, aldolase 1-epimerase; GalK, galactokinase; GalT, galactose-1-phosphate uridylyltransferase; GalE, UDP-glucose 4-epimerase; PgmA, phosphoglucomutase. B Nucleotide sequences alignment of the galR-galK intergenic region of various STH strains. Promoter sequences were annotated according to Vaughan et al. (2001) and Vaillancourt et al. (2002). The start codon of both galR and galK is underlined. The − 35 and − 10 sequences are grey-boxed and the ribosome binding sites are in bold. The transcription starting sites are in red. Nucleotide substitutions and indels are yellow- and blue-boxed, respectively. Stars indicate point mutations in the galK promoter region of Gal+ mutants. 1: Promoter sequence of strains DGCC 7710, 05–34, C106, ND07, MN-BM-A02, ASCC 1275, N4L, MTH17CL396, JIM8232, CIRM-BIA1048, CIRM-BIA1049, ST109, CAG:236, CIRM-BIA1047, MTCC 5461 and CIRM-BIA772; 2: Promoter sequence of strain CIRM-BIA1121; 3: Promoter sequence of strain B59671; 4: Promoter sequence of strain TH982; 5: Promoter sequence of strain 1F8CT; 6: Promoter sequence of strains LMG 18311, BIO1488, KU30, KLDS3.1012, CIRM-BIA1125, CIRM-BIA1116, GABA, LMG S-29186, LMD-9, CIRM-BIA1051, CIRM-BIA1055, CIRM-BIA2101, CIRM-BIA67, CIRM-BIA1050, CIRM-BIA36, TH985, FAM 13496, CIRM-BIA16, IDCC2201, ACA-DC 2, CIRM-BIA18, CIRM-BIA32, CIRM-BIA1358, CIRM-BIA19, APC151, CIRM-BIA1122, MN-ZLW-002, ND03, SMQ-301, MN-BM-A01, and S9; 7: Promoter sequence of strain EPS; 8: Promoter sequence of strain CIRM-BIA65; 9: Promoter sequence of STH CNRZ1066; 10: Promoter sequence of S. thermophilus CS8; 11: Promoter sequence of strain CIRM-BIA1046; 12: Promoter sequence of strain TH1477; 13: Promoter sequence of strains CIRM-BIA23, CIRM-BIA29, CIRM-BIA30, and CIRM-BIA1035; 14: Promoter sequence of S. thermophilus KLDS SM; 15: Promoter sequence of S. thermophilus KLDS 3.1003; 16: Promoter sequence of strain ST3; 17: Promoter sequence of strain TH1436; 18: Promoter sequence of strains M17PTZA496, ATCC 19258, and NCTC12958; 19: Promoter sequence of strains JIM1081, CIRM-BIA961, CIRM-BIA967, JIM10116, and CIRM-BIA336; 20: Promoter sequence of strains CIRM-BIA368, CIRM-BIA956, and CIRM-BIA998. C Genetic organization of genes coding for the tagatose 6P pathway in strain JIM10116 and CIRM-BIA336. The lengths of genes (pentagons) and intergenic regions are drawn to scale. Blue-colored pentagons code for proteins involved in galactose/lactose assimilation. The underlined part of the cluster corresponds to the region with high nucleotide identity (> 96%) with gene clusters found in various Enterococcus and Lactococcus species
Fig. 4PrtS protease domain alignments and amino acid sequence identity percentages (reference: PrtS from STH LMD-9). The domains of PrtS are: SS, signal sequence; PP, propeptide; PR, catalytic domain; A, globular domain; H, helical domain; W, cell wall domain; AN, cell wall anchor. Domain length is indicated in number of amino acids (aa). The two biochemically characterized sequences of PrtS from strains CNRZ385 [113] and 4F44 [112] were added
Fig. 5Genomic features of GABA biosynthetic genes in S. thermophilus. A Comparison of the genomic region harboring the gadB and gadC genes coding for GABA production in 21 S. thermophilus strains and S. salivarius strain JIM8777. The relative lengths of genes (pentagons) and intergenic regions are drawn to scale. *, strain not belonging to the species STH. The gadBC containing strains 1F8CT, TH985 and TH1477 were excluded from this comparative analysis due to the location of gadBC on too short contigs. B Nucleotide sequences alignment of the gadB promoter region of 23 S. thermophilus strains. The deduced − 35 and extended − 10 sequences are grey-boxed and variable nucleotides are in red
Fig. 6Heatmap of the presence or absence of bacteriocin clusters in 79 S. thermophilus strains (predicted by BAGEL4). The color gradient represents the number of predicted clusters per strain, from 0 to 4. LAPs: Linear azol (in)e-containing peptides
Fig. 7Comparison of the genomic region harboring the gpx gene in various S. thermophilus strains, S. vestibularis NCTC12167 and S. salivarius CCHSS3. *, strains not belonging to the species STH. The relative lengths of genes (pentagons) and intergenic regions are drawn to scale