| Literature DB >> 35327997 |
Abstract
In this study, whole-genome resequencing of two native probiotic Lactiplantibacillus plantarum strains-UTNGt21A and UTNGt2-was assessed in order to identify variants and perform annotation of genes involved in bacterial adaptability to different stressors, as well as their antimicrobial strength. A total of 21,906 single-nucleotide polymorphisms (SNPs) were detected in UTNGt21A, while 17,610 were disclosed in the UTNGt2 genome. The comparative genomic analysis revealed a greater number of deletions, transversions, and transitions within the UTNGt21A genome, while a small difference in the number of insertions was detected between the strains. A divergent number of types of variant annotations were detected in both strains, and categorized in terms of low, moderate, and high modifier impact on the protein effectiveness. Although both native strains shared common specific genes involved in the stress response to the gastrointestinal environment, which may qualify as a putative probiotic (bile salt, acid, temperature, osmotic stress), they were different in their antimicrobial gene cluster organization, with UTNGt21A displaying a complex bacteriocin gene arrangement and dissimilar gene variants that might alter their defense mechanisms and overall inhibitory capacity. The genome comparison revealed 34 and 9 genomic islands (GIs) in the UTNGt21A and UTNGt2 genomes, respectively, with the overrepresentation of genes involved in defense mechanisms and carbohydrate utilization. In addition, pan-genome analysis disclosed the presence of various strain-specific genes (shell genes), suggesting a high genome variation between strains. This genome analysis illustrates that the bacteriocin signature and gene variants reflect a niche-inherent pattern. These extensive genomic datasets will guide us to understand the potential benefits of the native strains and their utility in the food or pharmaceutical sectors.Entities:
Keywords: Lactiplantibacillus plantarum; SNPs; gene variants; genomic variation; next-generation sequencing; pan-genome; probiotics
Mesh:
Substances:
Year: 2022 PMID: 35327997 PMCID: PMC8953754 DOI: 10.3390/genes13030443
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Mapping data stats.
| Library Name | Ref. Length | Mapped Sites (≥1x) | Total Reads | Mapped Reads | Mapped Bases | Mean Depth |
|---|---|---|---|---|---|---|
| UTNGt21A | 3,348,624 | 3,028,007 (90.43%) | 13,121,820 | 10,291,789 (78.43%) | 941,582,288 | 281.18 |
| UTNGt2 | 3,348,624 | 2,979,050 (88.96%) | 11,733,026 | 10,428,975 (88.89%) | 957,588,188 | 285.96 |
Note—Library name: sample name; Ref. Length: length of reference genome; Mapped Sites: length of mapped site; Total Reads: number of the total reads; Mapped Reads: number of reads mapped to the reference; Mapped Bases: number of bases in reads mapped to the reference; Mean Depth: average alignment depth.
Figure 1Whole-genome alignments performed using the Mauve contig mover module. The genome of reference strain WCFS1 was aligned with the (A) UTNGt21A and (B) UTNGt2 draft genomes. White areas indicate low-identity regions between strains. Regions with the same color indicate high-similarity syntenic blocks, and are connected by the same color bars. Red bars indicate boundaries of the original contigs. The numbers above the alignments indicate the nucleotide positions in the WCFS1 genome.
Figure 2Gene content comparison of the three L. plantarum strains: (A) The pie chart shows the number of genes belonging to the core, the soft core, the shell, or the cloud. (B) The Roary matrix shows genes typical of each strain and those conserved in all strains. The red rectangular frame indicates several distinct cluster genes of the UTNGT21A and UTNGt2 strains.
Figure 3Circular plots visualizing genome islands predicted in the (A) UTNGt21A and (B) UTNGt2 genomes, aligned against the complete reference genome L. plantarum WCFS1, with blocks colored according to the prediction method. IslandPath-DIMOB (blue), SIGI-HMM (orange), and the integrated results (dark red).
Figure 4Cumulative summary of variant calling categories detected in the UTNGt21A and UTNGt2 genomes. Legend—Library name: sample name; Number of SNPs: number of SNPs in sample; Number of insertions: number of insertions in sample; Number of deletions: number of deletions in sample; Number of Transition (Ts): number of transitions in sample; Number of Transversion (Tv): number of transversions in sample; Syn: number of synonymous variants in sample; non-syn: number of non-synonymous variants in sample.
Cumulative types of variant annotation, descriptions, and impacts.
| Type of Variant Annotation | Description | Impact * | Library Name | |||
|---|---|---|---|---|---|---|
| UTNGt21A | UTNGt2 | |||||
| Count | Ratio (%) | Count | Ratio (%) | |||
| synonymous_variant | Variant causes a codon that produces the same amino acid (e.g., Ttg/Ctg, L/L) | Low | 12,350 | 71.51 | 9772 | 72.01 |
| missense_variant | Variant causes a codon that produces a different amino acid (e.g., Tgg/Cgg, W/R) | Moderate | 4740 | 27.45 | 3664 | 27 |
| frameshift_variant | Insertion or deletion causes a frame shift (e.g., an indel’s size is not a multiple of 3). | High | 61 | 0.35 | 33 | 0.24 |
| stop_gained | Variant causes a STOP codon (e.g., Cag/Tag, Q/*) | High | 32 | 0.19 | 21 | 0.16 |
| splice_region and stop_retained_variant | A sequence variant in which a change has occurred within the region of the splice site, either within 1–3 bases of the exon or 3–8 bases of the intron/Variant causes stop codon to be mutated into another stop codon (the new codon produces a different AA). (e.g., Atg/Ctg, M/L (ATG and CTG can be START codons)) | Low | 18 | 0.1 | 22 | 0.15 |
| conservative_inframe_deletion | One or many codons are deleted (e.g., a deletion multiple of three at a codon boundary). | Moderate | 15 | 0.09 | 10 | 0.15 |
| disruptive_inframe_insertion | One or many codons are inserted (e.g., an insertion multiple of three at a codon boundary). | Moderate | 12 | 0.07 | 9 | 0.08 |
| stop_lost and splice_region_variant | Variant causes stop codon to be mutated into a non-stop codon (e.g., Tga/Cga, */R)/A sequence variant in which a change has occurred within the region of the splice site, either within 1–3 bases of the exon or 3–8 bases of the intron. | High | 9 | 0.05 | 5 | 0.07 |
| disruptive_inframe_deletion | One codon is changed and one or many codons are inserted (e.g., an insert of a multiple of three in size, not at a codon boundary). | Modifier | 6 | 0.03 | 6 | 0.04 |
| conservative_inframe_insertion | Inversion of a large chromosome segment (over 1%, or 1,000,000 bases). | Moderate | 6 | 0.03 | 6 | 0.04 |
| non_coding_transcript_exon_variant | Region that does not code for any protein or does not carry genetic code. | Low | 0 | 0 | 6 | 0.04 |
Note: * Impact—High: the variant is assumed to have a high (disruptive) impact on the protein, probably causing protein truncation, loss of function, or triggering nonsense-mediated decay; Moderate: a non-disruptive variant that might change protein effectiveness; Low: assumed to be mostly harmless or unlikely to change protein behavior; Modifier: usually noncoding variants or variants affecting noncoding genes, where predictions are difficult or there is no evidence of impact.
Figure 5(A) Genetic organization of the pln loci of (a) UTNGt21A; (b) UTNGt2; and (c) WCFS1. (B) Additional bacteriocin cluster genes detected on (d, e) UTNGt21A and (f) UTNGt2. Legend—A: plnA; J: pln J; N: plnN; K: plnK; E: plnE; F: plnF; NCαβ: NCα and NCβ; enterocinX (β): enterocin X (chain beta); GlyS: glycotransferase family 2 protein (PlnO), LamM: lantibiotic mersacidin-modifying enzyme; Wαβ: Plantaricin_W (alpha), Plantaricin W (beta); 183.2; Plantaricin_W (beta); ABC: ABC-type bacteriocin transporter. Red blocks: immunity and transport; green arrow: core peptide.
List of genes and annotated variants associated with bacterial adaptation to different stressors in specific microenvironments.
| Stress Factor | Gene (locus WCFS1) | % Identity (EggNOG Annotation)/No. of Variants Relative to the REFERENCE WCFS1 | ||||
|---|---|---|---|---|---|---|
| Protein Product | UTNGt21A | UTNGt2 | ||||
| pH | ATP synthase epsilon chain | 67.60 | (-) | 67.60 | 1 | |
| ATP synthase subunit beta | 84.79 | 2 | 84.79 | 1 | ||
| ATP synthase gamma chain | 64.19 | (-) | 64.19 | 2 | ||
| ATP synthase subunit alpha | 81.34 | 3 | 81.34 | 2 | ||
| ATP synthase subunit delta | 45.55 | (-) | 45.55 | 1 | ||
| ATP synthase subunit b | 57.64 | (-) | 57.64 | 1 | ||
| ATP synthase subunit c | 82.69 | (-) | 82.69 | (-) | ||
| ATP synthase subunit a | 54.85 | (-) | 54.85 | (-) | ||
| Elongation factor 4 | 56.47 | 3 | 82.75 | 5 | ||
| Elongation factor 4 | 82.75 | (-) | 56.63 | 12 | ||
| Bile salt hydrolase |
| Putative protein YxeI (Choloylglycine hydrolase) | 42.98 | (-) | 42.98 | (-) |
|
| Putative protein YxeI (Choloylglycine hydrolase) | 40.54 | (-) | 34.85 | (-) | |
|
| Putative protein YxeI (Choloylglycine hydrolase) | 34.85 | (-) | 40.55 | (-) | |
| Conjugated bile acid hydrolase | 67.28 | (-) | 67.28 | (-) | ||
| Temperature | 18 kDa heat shock protein | 44.96 | 3 | 42.05 | 3 | |
| Heat-inducible transcription repressor HrcA | 58.90 | (-) | 58.90 | (-) | ||
| Protein GrpE | 58.89 | (-) | 58.89 | 1 | ||
| Chaperone protein DnaK | 84.33 | (-) | 84.33 | 3 | ||
| Chaperone protein DnaJ | 71.12 | 3 | 71.12 | 3 | ||
|
| 18 kDa heat shock protein | 44.9633.82 | (-) | 44.96 | (-) | |
|
| Heat shock protein 15 | 70.79 | (-) | 71.91 | (-) | |
| 60 kDa chaperonin | 84.89 | 2 | 84.89 | 2 | ||
| 10 kDa chaperonin | 69.14 | (-) | 69.14 | 1 | ||
| 33 kDa chaperonin | 69.61 | 2 | 69.61 | (-) | ||
| Hypothetical small heat shock protein | 45.28 | 1 | 45.28 | 3 | ||
|
| Catabolite control protein A | 49.33 | 3 | 49.33 | 4 | |
|
| Catabolite control protein A | 44.09 | 44.09 | |||
|
| Catabolite control protein A | 65.76 | 65.76 | |||
|
| Catabolite control protein A | (-) | 44.14 | |||
|
| Catabolite control protein B | 44.92 | 1 | 47.60 | 2 | |
| Cold shock protein 1 | 78.78 | 1 | 78.78 | 1 | ||
| Cold shock protein 2 | 81.81 | (-) | 81.81 | (-) | ||
|
| Cold shock-like protein CspLA | 86.36 | (-) | 86.36 | (-) | |
| Osmosis | Carnitine transport permease protein OpuCD | 73.15 | 2 | 73.17 | 2 | |
| Glycine betaine/carnitine/choline-binding protein OpuCC | 63.10 | 5 | 63.49 | 2 | ||
| Carnitine transport permease protein OpuCB | 98.21 | 5 | 75.00 | 3 | ||
| Carnitine transport ATP-binding protein OpuCA | 68.62 | 3 | 68.62 | 3 | ||
|
| Carnitine transport permease protein OpuCB | 75.00 | (-) | (-) | (-) | |
| Glycine betaine/carnitine/choline-binding protein | 73.14 | 6 | 73.14 | 24 | ||
| Glycine betaine/carnitine/choline-binding protein | 68.62 | 3 | 68.62 | (-) | ||
(-): No variant detected.