Literature DB >> 29038772

Revealing the genomic differences between two subgroups in Lactobacillus gasseri.

Ipputa Tada1, Yasuhiro Tanizawa2, Akihito Endo3, Masanori Tohno4, Masanori Arita2,5.   

Abstract

Being an autochthonous species in humans, Lactobacillus gasseri is widely used as a probiotic for fermented products. We thoroughly compared the gene contents of 75 L. gasseri genomes and identified two intraspecific groups by the average nucleotide identity (ANI) threshold of 94%. Group I, with 48 strains, possessed 53 group-specific genes including the gassericin T cluster (9 genes) and N-acyl homoserine lactone lactonase. Group II, with 27 strains, including the type strain ATCC 33323, possessed group-specific genes with plasmid- or phage-related annotations. The genomic differences provide evidences for demarcating a new probiotic group within L. gasseri.

Entities:  

Keywords:  Lactobacillus gasseri; comparative genomics; gassericin T; quorum sensing

Year:  2017        PMID: 29038772      PMCID: PMC5633531          DOI: 10.12938/bmfh.17-006

Source DB:  PubMed          Journal:  Biosci Microbiota Food Health        ISSN: 2186-3342


Lactobacillus gasseri is an autochthonous species of lactic acid bacteria (LAB) that colonizes in the human gastrointestinal tract, vaginal tract, and oral cavity. Its health benefits, such as its antimicrobial activity and probiotic properties, have been well documented [1], making L. gasseri distinct as a probiotic yoghurt inoculum in Japan. Previously, we reported the existence of two subtypes in L. gasseri by using the average nucleotide identity (ANI), the statistical similarity computed from whole genome sequences [2]. An ANI threshold of 95% corresponds to an experimental DNA-DNA hybridization (DDH) value of 70%, which is the general criterion for a species-level difference [3,4,5]. To reveal genomic characteristics within the two L. gasseri groups, we here report detailed analysis of them focusing on their gene contents. In total, 75 draft genomes of L. gasseri were downloaded from our DFAST Archive of Genome Annotation (DAGA), the curated genome repository of Lactobacillus and Pediococcus from the DDBJ/ENA/GenBank, and Sequence Read Archive (SRA) [2]. They all satisfied a quality rating of ≥4 (out of 5) in our database, meaning that their genome completeness is ≥95% and contamination level is ≤5% as computed by the CheckM software [6]. When ANI values were calculated for all pairs of the obtained genomes by an open-source Python script (https://github.com/widdowquinn/pyani), the 75 genomes were cleanly classified into two groups at the 94% threshold: Group I, consisted of 48 strains, and Group II, consisting of the remaining 27 strains including the type strain (ATCC 33323T, Fig. 1). The genome size ranged from 1.86–2.14 Mb in Group I (average 1.98 Mb) and ranged from 1.78–2.01 Mb in Group II (average 1.89 Mb). The average numbers of protein sequences were 1920 for Group I, 1844 for Group II, and 1893 for both groups (see Supplementary Table for details).
Fig. 1.

The results of hierarchical clustering for 75 L. gasseri strains.

The genome distance is (1-ANI).

The results of hierarchical clustering for 75 L. gasseri strains. The genome distance is (1-ANI). To elucidate the differences between Groups I and II from their gene contents, we computed orthologous gene groups from the overall 141,948 protein sequences using the GET_HOMOLOGUES software (version 1.3) with default settings [7]. Among the 6142 gene groups obtained, 3946 groups contained multiple genes, and 2196 were genome-specific genes, i.e., singletons (Fig. 2). Although the number of genes in each genome was not much different between Group I and Group II, the number of singletons in Group II (27 strains) is far more than in Group I (48 strains). After removing genes of hypothetical/unknown functions, we selected genes whose conservation rates between the two gasseri groups differed by more than 80%. The numbers of Group I- and Group II-specific genes became 53 and 46, respectively. For these genes, we manually verified with the Mauve software whether the selected genes form a gene cluster [8] and identified 5 conserved clusters in Group I and 4 conserved clusters in Group II, respectively (Table 1, 2 and Fig. 3). Notably, we observed a highly conserved gassericin T cluster (cluster ID: G1C1 in Table 1) and a putative N-acyl homoserine lactone (AHL) lactonase in Group I.
Fig. 2.

Venn diagram of the homologous genes in Group I and Group II.

Table 1.

Group I-specific orthologous gene groups

Cluster IDNumber of genesGene namesConservation (%)
Group IGroup II
G1C110lactacin F two-component system inducer peptide precursor (gatP)97.9100.0
histidine kinase (gatK)95.83.7
two-component system response regulator (gatR)95.80.0
peptide ABC transporter ATP-binding protein (gatT)95.83.7
lactacin F transporter auxillary protein (gatC)95.83.7
bacteriocin (gatA)81.30.0
bacteriocin (gatX)81.30.0
lactacin F immunity protein (gatI)87.53.7
enterocin A immunity protein95.83.7
pediocin immunity protein PedB97.996.3

G1C26peptidase C45100.00.0
adenine deaminase93.80.0
spermidine/putrescine ABC transporter substrate-binding protein93.80.0
spermidine/putrescine ABC transporter ATP-binding protein93.80.0
spermidine/putrescine ABC transporter permease protein93.80.0
amino acid permease100.00.0

G1C34poly(glycerol-phosphate) alpha-glucosyltransferase89.60.0
accessory Sec system protein Asp287.50.0
preprotein translocase subunit SecA87.57.4
preprotein translocase subunit SecY83.30.0

G1C43arginine/ornithine antiporter87.50.0
phosphatidylserine decarboxylase100.00.0
phosphatidylserine decarboxylase87.50.0

G1C52acetolactate synthase large subunit100.03.7
alpha-acetolactate decarboxylase100.03.7

Gene names follow the output of DAGA annotation, and the cluster ID is our tentative assignment in this Table.

Fig. 3.

Genome alignment by the Mauve software between strain K7 (Group I) and strain ATCC 33323T (Group II).

Positions of group-specific clusters are indicated by Cluster IDs (also see Table 1).

Venn diagram of the homologous genes in Group I and Group II. Gene names follow the output of DAGA annotation, and the cluster ID is our tentative assignment in this Table. Genome alignment by the Mauve software between strain K7 (Group I) and strain ATCC 33323T (Group II). Positions of group-specific clusters are indicated by Cluster IDs (also see Table 1). L. gasseri has been reported to produce several bacteriocins, i.e., antimicrobial proteins against its phylogenetic relatives. Gassericin A was reported in L. gasseri LA39 [9], and analogs of gassericin T were reported in L. gasseri SBT2055 (gassericin T) [10], LA158 (gassericin T) [11], LF221 (gassericin K7 B) [12, 13], and EV1461 (gassericin E) [14]. The former, gassericin A, is a rare Class IIc circular bacteriocin. Only the G256_6_33 strain in Group I (SRA: ERR570193) and 4 strains in Group II (G278_2_12, G278_5_18, G287_2_14, and G287_5_2; SRA: ERR570265, ERR570270, ERR900639, and ERR900640, respectively) possessed its close homologs. On the other hand, gassericin T and its analogs are Class IIb two-peptide bacteriocins and have been widely found within L. gasseri. The multiple analogs presumably result from their promiscuous inhibitory spectra that depend on subtle amino acid substitutions or modifications. The production of gassericin T requires 9 related genes in a 7 kb genomic region in L. gasseri LA158 (GenBank: AB710328.1) [11]. In Group I, this region was completely conserved in 39 out of 48 strains. Among the remaining 9 strains, seven conserved the partial region (5.2 kb) and lacked the two gassericin T peptide genes (gatA and gatX). Only two out of 48 strains, strain G277_2_5 (SRA: ERR570252) and strain 130918 (GenBank: GCA _000814885.1), lacked the whole region. The regional alignment, created by the Easyfig visualizer (version 2.2.2), is depicted in Fig. 4 [15]. In contrast, all strains in Group II lacked the region except strain G257_1_23 (SRA: ERR570195), which retained a partial region (4.3 kb) without the two peptide genes. In most strains in Group II, however, the two bordering genes, gatP and pedB, were well conserved (Tables 1, 2). GatP is an autoinducer for gassericin T (see later), whereas the function of pedB is has not been determined, and it is hypothetically annotated as “pediocin immunity protein” in the DAGA database.
Fig. 4.

The structure of the gassericin T cluster.

Strain LA158 is used as a reference. See Table 1 for details of the gene functions.

Table 2.

Group II-specific orthologous gene groups

Cluster IDNumber of genesGene namesConservation (%)
Group IGroup II
G2C13death-on-curing family protein4.2100.0
NADPH-quinone reductase12.592.6
TetR family transcriptional regulator2.1100.0

G2C23integrase0.096.3
dephospho-CoA kinase2.192.6
type III restriction protein, res subunit0.081.5

G2C33RelE/StbE family addiction module toxin0.085.2
DNA-damage-inducible protein J0.088.9
protein-tyrosine phosphatase0.081.5

G2C42flavoprotein4.2100.0
LysR family transcriptional regulator4.292.6

Gene names follow the output of DAGA annotation.

The structure of the gassericin T cluster. Strain LA158 is used as a reference. See Table 1 for details of the gene functions. Among the 48 Group I strains, an additionally conserved gene was the AHL lactonase related to quorum sensing. It is a communication system of bacteria to coordinate population. Gram-positive bacteria typically use secreted peptides as autoinducers, and gatP is known as the autoinducer gene for gassericin T production [14]. On the other hand, Gram-negative bacteria often use small molecules such as AHLs and Autoinducer-2 (AI-2 or furanosyl borate diester) [16]. The AHL lactonase is therefore a quorum-quenching enzyme, hydrolyzing the lactone ring of AHLs. As a member of the metallo-hydrolase superfamily [17], this lactonase has been found in various genera such as Bacillus [18], Agrobacterium [19], Rhodococcus [20], and Streptomyces [21]. In Group I, the AHL lactonase was fully conserved in 42 strains, and 4 strains conserved slightly shorter protein sequences. Two strains, G277_2_5 and UMB0099 (SRA: ERR1045819), lacked the gene, and the former strain did not possess the gassericin T cluster either. The amino acid sequence similarity among the 42 Group I strains was 91.1% (256/281 residues). The amino acid similarity between AHL lactonases in L. gasseri K7 and Bacillus sp. 240B1 (GenBank: AF196486.1) was 23.5% (66/281 residues). The gene was not found in Group II. The extremely high correlation between the gassericin T gene cluster and the AHL lactonase gene in Group I is noteworthy. The gene cluster is chromosomal (at least in complete genomes), showing a good contrast to the gassericin A gene cluster (4 kb) on a plasmid of the producer strain L. gasseri LA39 (JCM11657; the strain was not included in our study due to the absence of the whole genome sequence) [22]. In our study, the gassericin A cluster was found in only one strain in Group I (3 kb partial match in ERR570193) and 4 strains in Group II (a 4 kb complete match in ERR570265 and ERR570270 and a 3.7 kb partial match in ERR900639 and ERR900640). From the draft sequence information, it is hard to tell whether they are plasmidal or chromosomal. However, Group II strains possess more genome-specific genes, and their conserved clusters also contain plasmid- or phage-related annotations such as “TetR transcriptional regulation,” “integrase,” or “RelE/StbE toxin-antitoxin” (Table 2). Gene names follow the output of DAGA annotation. These facts together with the ANI analysis demarcate a new probiotic group within L. gasseri. To investigate the functionality of the gassericin T cluster and to biochemically characterize Group I strains, we are conducting an investigation with a polyphasic taxonomic approach (in preparation).

Funding

This study was supported by the Database Integration Coordination Program of the National Bioscience Database Center (Japan), the commission for the Development of Artificial Gene Synthesis Technology for Creating Innovative Biomaterial from the Ministry of Economy, Trade and Industry (METI), Japan, and NIG-JOINT (grant number 2016-53, 2016-54).
  19 in total

Review 1.  Quorum-quenching microbial infections: mechanisms and implications.

Authors:  Y-h Dong; L-y Wang; L-H Zhang
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2007-07-29       Impact factor: 6.237

2.  Identification of quorum-quenching N-acyl homoserine lactonases from Bacillus species.

Authors:  Yi-Hu Dong; Andi R Gusti; Qiong Zhang; Jin-Ling Xu; Lian-Hui Zhang
Journal:  Appl Environ Microbiol       Date:  2002-04       Impact factor: 4.792

3.  Primary amino acid and DNA sequences of gassericin T, a lactacin F-family bacteriocin produced by Lactobacillus gasseri SBT2055.

Authors:  Y Kawai; B Saitoh; O Takahashi; H Kitazawa; T Saito; H Nakajima; T Itoh
Journal:  Biosci Biotechnol Biochem       Date:  2000-10       Impact factor: 2.043

4.  Identification of extracellular N-acylhomoserine lactone acylase from a Streptomyces sp. and its application to quorum quenching.

Authors:  Sun-Yang Park; Hye-Ok Kang; Hak-Sun Jang; Jung-Kee Lee; Bon-Tag Koo; Do-Young Yum
Journal:  Appl Environ Microbiol       Date:  2005-05       Impact factor: 4.792

5.  N-acylhomoserine lactonase producing Rhodococcus spp. with different AHL-degrading activities.

Authors:  Sun-Yang Park; Byung-Joon Hwang; Min-Ho Shin; Jung-Ae Kim; Ha-Kun Kim; Jung-Kee Lee
Journal:  FEMS Microbiol Lett       Date:  2006-08       Impact factor: 2.742

Review 6.  Gassericin A: a circular bacteriocin produced by lactic acid bacteria Lactobacillus gasseri.

Authors:  Neha Pandey; R K Malik; J K Kaushik; Garima Singroha
Journal:  World J Microbiol Biotechnol       Date:  2013-05-28       Impact factor: 3.312

7.  DNA sequencing and homologous expression of a small peptide conferring immunity to gassericin A, a circular bacteriocin produced by Lactobacillus gasseri LA39.

Authors:  Yasushi Kawai; Joni Kusnadi; Rober Kemperman; Jan Kok; Yoshiyuki Ito; Mikiko Endo; Kensuke Arakawa; Hideaki Uchida; Junko Nishimura; Haruki Kitazawa; Tadao Saito
Journal:  Appl Environ Microbiol       Date:  2008-12-29       Impact factor: 4.792

8.  Easyfig: a genome comparison visualizer.

Authors:  Mitchell J Sullivan; Nicola K Petty; Scott A Beatson
Journal:  Bioinformatics       Date:  2011-01-28       Impact factor: 6.937

9.  DFAST and DAGA: web-based integrated genome annotation tools and resources.

Authors:  Yasuhiro Tanizawa; Takatomo Fujisawa; Eli Kaminuma; Yasukazu Nakamura; Masanori Arita
Journal:  Biosci Microbiota Food Health       Date:  2016-07-14

Review 10.  Quorum sensing signal-response systems in Gram-negative bacteria.

Authors:  Kai Papenfort; Bonnie L Bassler
Journal:  Nat Rev Microbiol       Date:  2016-08-11       Impact factor: 60.633

View more
  2 in total

1.  Genomic insights into Lactobacillus gasseri and Lactobacillus paragasseri.

Authors:  Adriana Ene; Natalie Stegman; Alan Wolfe; Catherine Putonti
Journal:  PeerJ       Date:  2022-06-07       Impact factor: 3.061

2.  Comparative analysis of Lactobacillus gasseri from Chinese subjects reveals a new species-level taxa.

Authors:  Xingya Zhou; Bo Yang; Catherine Stanton; R Paul Ross; Jianxin Zhao; Hao Zhang; Wei Chen
Journal:  BMC Genomics       Date:  2020-02-03       Impact factor: 3.969

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.