| Literature DB >> 34220781 |
Minenosuke Matsutani1, Takura Wakinaka2, Jun Watanabe2, Masafumi Tokuoka3, Akihiro Ohnishi3.
Abstract
Tetragenococcus halophilus - a halophilic lactic acid bacterium - is frequently used as a starter culture for manufacturing fermented foods. Tetragenococcus is sometimes infected with bacteriophages during fermentation for soy sauce production; however, bacteriophage infection in starter bacteria is one of the major causes of fermentation failure. Here, we obtained whole-genome sequences of the four T. halophilus strains YA5, YA163, YG2, and WJ7 and compared them with 18 previously reported genomes. We elucidated five types of clustered regularly interspaced short palindromic repeat (CRISPR) loci in seven genomes using comparative genomics with a particular focus on CRISPR elements. CRISPR1 was conserved in the four closely related strains 11, YA5, YA163, and YG2, and the spacer sequences were partially retained in each strain, suggesting that partial deletions and accumulation of spacer sequences had occurred independently after divergence of each strain. The host range for typical bacteriophages is narrow and strain-specific thus these accumulation/deletion events may be responsible for differences in resistance to bacteriophages between bacterial strains. Three CRISPR elements, CRISPR1 in strains 11, YA5, YA163, and YG2, CRISPR2 in strain WJ7, and CRISPR2 in strain MJ4, were inserted in almost the same genomic regions, indicating that several independent insertions had occurred in this region. As these elements belong to class 1 type I-C CRISPR group, the results suggested that this site is a hotspot for class 1, type I-C CRISPR loci insertion. Thus, T. halophilus genomes may have acquired strain-specific bacteriophage-resistance through repeated insertion of CRISPR loci and accumulation/deletion events of their spacer sequences.Entities:
Keywords: CRISPR elements; Tetragenococcus halophilus; bacteriophage-resistance; comparative genomics; microevolution
Year: 2021 PMID: 34220781 PMCID: PMC8249745 DOI: 10.3389/fmicb.2021.687985
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Characteristics of whole genome sequences of the four Tetragenococcus halophilus strains assembled in this study.
| Strain name | YA5 | YA163 | YG2 | WJ7 |
| Assembly size (bp) | 2,318,625 | 2,420,090 | 2,372,244 | 2,443,531 |
| Total number of contigs | 122 | 132 | 117 | 113 |
| GC content (%) | 35.72 | 35.70 | 35.67 | 35.56 |
| Longest contig (bp) | 119,288 | 125,671 | 158,547 | 213,609 |
| N50 (bp) | 41,027 | 43,881 | 44,018 | 44,198 |
| L50 | 18 | 19 | 17 | 16 |
| Estimated fold coverage | 440 | 477 | 494 | 414 |
| CDS | 2,254 | 2,375 | 2,317 | 2,400 |
| rRNA genes | 3 | 3 | 3 | 3 |
| tRNA genes | 53 | 54 | 54 | 55 |
| tmRNA genes | 1 | 1 | 1 | 1 |
| DRA accession no. | ||||
| GenBank accession no. |
FIGURE 1Matrix and clustering of average nucleotide identity based on MUMmer (ANIm) identification of 22 Tetragenococcus strains. Red color in the heatmap indicate highly conserved genome pairs.
FIGURE 2Graphical representation of the CRISPR gene clusters for CRISPR1 (strain YA163) and CRISPR2 (strain WJ7). The figure was produced using genoPlotR 0.8.9 (Guy et al., 2010). Spacer sequences completely conserved in more than two strains are highlighted with yellow. Spacer numbers used in this figure are also listed in Supplementary Table 4.
FIGURE 3Graphical representation of CRISPR inserted regions. Insertion points were identified with NUCmer alignment (Marçais et al., 2018). Each query genome was independently aligned against the complete genome sequence (GenBank acc. No.: AP012046.1) of the Tetragenococcus halophilus NBRC 12172 strain. The illustration was produced using CG-view (Stothard and Wishart, 2005).
FIGURE 4Maximum-likelihood phylogenetic tree of 22 Tetragenococcus halophilus strains. The tree was prepared based on the nucleotide sequences from 1,292 orthologous gene sets. The strains that retained CRISPR loci in the genome sequence are indicated by a curved line.