| Literature DB >> 29037147 |
Paulina Deptula1, Pia K Laine2, Richard J Roberts3, Olli-Pekka Smolander2, Helena Vihinen2, Vieno Piironen4, Lars Paulin2, Eija Jokitalo2, Kirsi Savijoki4, Petri Auvinen2, Pekka Varmanen5.
Abstract
BACKGROUND: Propionibacterium freudenreichii is an industrially important bacterium granted the Generally Recognized as Safe (the GRAS) status, due to its long safe use in food bioprocesses. Despite the recognized role in the food industry and in the production of vitamin B12, as well as its documented health-promoting potential, P. freudenreichii remained poorly characterised at the genomic level. At present, only three complete genome sequences are available for the species.Entities:
Keywords: Bacteriophage; CRISPR-Cas; Comparative genomics; Complete genome; Gras; Mobile elements; PacBio; Pilus; Propionibacterium freudenreichii; Restriction and modification; Vitamin B12
Mesh:
Year: 2017 PMID: 29037147 PMCID: PMC5644110 DOI: 10.1186/s12864-017-4165-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
P. freudenreichii strains included in this study. Summary of the genome sequences
| Strain | Sequence name | Accession number | Genome size (bp) | Genome coverage | GC% | No. of predicted genes | Note |
|---|---|---|---|---|---|---|---|
| JS | PFREUDJS001 | LN997841 | 2,675,045 | 691 | 67% | 2382 | Sequenced previously [ |
| JS2 | PFRJS2 | LT576032 | 2,655,351 | 317 | 67% | 2310 | 257 a |
| JS4 | PFRJS4 | LT576033 | 2,654,663 | 158 | 67% | 2366 | 259 a |
| JS7 | PFRJS7–1 | LT618776 | 2,738,418 | 300 | 67% | 2412 | Genome with prophage; 263 a |
| PFRJS7–2 | LT618777 | 2,700,482 | 50 | 67% | 2355 | Genome without phage | |
| PFRJS7-ph | LT618778 | 37,936 | 60 | 65% | 59 | Circular phage | |
| JS8 | PFRJS8 | LT576042 | 2,655,373 | 392 | 67% | 2324 | 264 a |
| JS9 | PFRJS9–1 | LT618785 | 2,720,049 | 219 | 67% | 2405 | Additional transposase gene; 265 a |
| PFRJS9–2 | LT618786 | 2,718,592 | 219 | 67% | 2401 | ||
| JS10 | PFRJS10 | LT576035 | 2,626,110 | 208 | 67% | 2329 | 266 a |
| JS11 | PFRJS11 | LT576038 | 2,537,402 | 501 | 67% | 2200 | 274 a |
| JS12 | PFRJS12–1 | LT604998 | 2,615,181 | 250 | 67% | 2275 | Additional transposase gene; 275 a |
| PFRJS12–2 | LT576787 | 2,613,734 | 250 | 67% | 2274 | ||
| PFRJS12–3 | LT604882 | 24,909 | 400 | 64% | 32 | Putative conjugative plasmid | |
| JS13 | PFRJS13–1 | LT618779 | 2,537,370 | 210 | 67% | 2201 | 11-gene insertion; 276 a |
| PFRJS13–2 | LT618780 | 2,520,651 | 90 | 67% | 2189 | ||
| JS14 | PFRJS14 | LT593929 | 2,507,188 | 330 | 67% | 2180 | 277 a |
| JS15 | PFRJS15–1 | LT618787 | 2,621,081 | 94 | 67% | 2322 | 3 additional transposase genes; 281 a |
| PFRJS15–2 | LT618788 | 2,616,005 | 94 | 67% | 2320 | Transposase gene disrupting a Type III RM methylase | |
| JS16 | RM25 | CP010341 | 2,649,163 | 190 | 67% | 2321 | Sequenced previously [ |
| JS17 | PFRJS17–1 | LT618789 | 2,755,516 | 192 | 67% | 2455 | Duplicated transposase gene; 283 a |
| PFRJS17–2 | LT618790 | 2,754,069 | 192 | 67% | 2454 | ||
| JS18 | PFRJS18 | LT576034 | 2,661,974 | 190 | 67% | 2358 | 284 a |
| JS20 | PFRJS20–1 | LT618791 | 2,678,207 | 106 | 67% | 2384 | 286 a |
| PFRJS20–2 | LT618792 | 2,682,327 | 106 | 67% | 2376 | 3 additional transposase genes; one disrupting another transposase gene | |
| JS21 | PFRJS21–1 | LT618781 | 2,659,993 | 222 | 67% | 2330 | Additional transposase genes; 287 a |
| PFRJS21–2 | LT618782 | 2,658,550 | 222 | 67% | 2329 | ||
| JS22 | PFRJS22–1 | LT599498 | 2,633,661 | 190 | 67% | 2326 | Genome with prophage; 288 a |
| PFRJS22-ph | LT615138 | 39,309 | 102 | 66% | 61 | Circular phage | |
| JS23 | PFRJS-23 | LT618793 | 2,630,698 | 210 | 67% | 2335 | Genome with prophage; 289 a |
| PFRJS-23-ph | LT618794 | 42,723 | 25 | 65% | 66 | Circular phage | |
| JS25 | PFRJS25–1 | LT618783 | 2,666,517 | 400 | 67% | 2336 | 291 a |
| PFRJS25–2 pl | LT618784 | 35,640 | 800 | 64% | 46 | Putative conjugative plasmid |
a Strain number in the previous study [48]
Sequencing summary
| Sample Name | Sequencing Chemistry | SMRTcells | Movie Time (min) a | Total Number of Subreads | Total Number of Bases | Mean Subread Length (bp) | N50 Subread Length (bp) b |
|---|---|---|---|---|---|---|---|
| JS4 | P4/C2 | 2 | 120 | 142,306 | 465,129,578 | 3268 | 4131 |
| JS10 | P4/C2 | 2 | 120 | 205,113 | 606,802,054 | 2958 | 3733 |
| JS15 | P4/C2 | 2 | 120 | 179,642 | 540,206,310 | 3007 | 3653 |
| JS18 | P4/C2 | 2 | 120 | 194,246 | 550,199,736 | 2832 | 3679 |
| JS20 | P4/C2 | 2 | 120 | 230,371 | 632,635,668 | 2746 | 3456 |
| JS22 | P4/C2 | 2 | 120 | 181,361 | 563,541,807 | 4090 | 3107 |
| JS23 | P4/C2 | 2 | 120 | 192,373 | 617,310,336 | 3208 | 4222 |
| JS2 | P5/C3 | 2 | 240 | 100,864 | 719,516,424 | 7133 | 8940 |
| JS7 | P5/C3 | 2 | 240 | 107,194 | 1,002,978,417 | 9356 | 13,043 |
| JS8 | P5/C3 | 2 | 240 | 139,418 | 1,149,552,255 | 8245 | 11,148 |
| JS9 | P5/C3 | 2 | 240 | 160,808 | 1,316,084,462 | 8184 | 11,023 |
| JS11 | P5/C3 | 2 | 240 | 184,744 | 1,422,323,823 | 7698 | 10,498 |
| JS12 | P5/C3 | 2 | 240 | 207,391 | 1,490,421,212 | 7186 | 9765 |
| JS13 | P5/C3 | 2 | 240 | 103,466 | 793,823,369 | 7672 | 10,156 |
| JS14 | P5/C3 | 2 | 240 | 125,130 | 966,521,587 | 7724 | 10,326 |
| JS17 | P5/C3 | 2 | 240 | 151,002 | 1,149,295,071 | 7611 | 10,232 |
| JS21 | P5/C3 | 2 | 240 | 173,843 | 1,298,367,718 | 7468 | 9906 |
| JS25 | P5/C3 | 2 | 240 | 192,104 | 1,293,994,542 | 6735 | 8805 |
a run time during which nucleotides added by the polymerase are recorded in real-time
b minimum length of subreads in which half of the sequencing data is found
P. freudenreichii strains included in this study. The details of differences between the genome sequences within strains
| Strain | Accession number | GC% | Note | Details |
|---|---|---|---|---|
| JS7 | LT618776 | 67% | Genome with prophage | PFR_JS7–1_1810 HTH-type transcriptional regulator KmtR:PFR_JS7–1_1869 Transcriptional regulator MtrR (preceeded by tRNA-Lys) |
| LT618777 | 67% | Genome without phage | ||
| JS9 | LT618785 | 67% | Genome with an additional transposase gene | PFR_JS9–1_62 Uma4 protein |
| LT618786 | 67% | |||
| JS12 | LT604998 | 67% | Genome with an additional transposase gene | PFR_JS12–1_615 a Transposase of ISAar20, ISL3 family and PFR_JS12–1_616 Hypothetical protein (ahead of Carbon starvation protein) |
| LT576787 | 67% | |||
| JS13 | LT618779 | 67% | Genome with 11-gene insertion or deletion b | JS13_289 Hypothetical protein: JS13_299 Transposase for insertion sequence element IS1001 |
| LT618780 | 67% | |||
| JS15 | LT618787 | 67% | 3 transposase genes absent from the other genome | PFR_JS15–1_878 Uma4 protein and PFR_JS15–1_879 Hypothetical protein; PFR_JS15–1_1737 Transposase of ISAar20, ISL3 family (Transposase gene disrupting gene coding for CitT); PFR_JS15–1_2045 a Transposase of ISAar20, ISL3 family |
| LT618788 | 67% | Transposase gene disrupting a Type III restriction enzyme | PFR_JS15–2_359 Transposase of ISAar43,IS3 family, IS407 group, orfA and PFR_JS15–2_360 c Insertion sequence IS407 orfB | |
| JS17 | LT618789 | 67% | A duplicated transposase gene | PFR_JS17–1_657 and PFR_JS17–1_658 a Transposase of ISAar20, ISL3 family |
| LT618790 | 67% | |||
| JS20 | LT618792 | 67% | 3 transposase genes absent from the other genome | PFR_JS20–2_544 Insertion sequence IS407 OrfB and PFR_JS20–2_545 c Transposase of ISAar43,IS3 family, IS407 group, orfA (disrupting Transposase IS30 family gene); PFR_JS20–2_568 Transposase of ISAar20, ISL3 family (ahead of Carbon starvation protein); PFR_JS20–2_1234 Transposase of ISAar20, ISL3 family (ahead of LuxS) |
| LT618791 | 67% | |||
| JS21 | LT618781 | 67% | Transposase gene ahead of putative aminotransferase biotin synthesis related protein | PFR_JS21–2_248 Transposase of ISAar20, ISL3 family |
| LT618782 | 67% |
a Transposase genes PFR_JS12–1_615, PFR_JS15–1_2045, PFR_JS17–1_657 and PFR_JS17–1_658 found in strains JS12, JS15 and JS17, respectively, are 100% identical
b The same element is present in the strain JS11 in the same location, but in all of the sequences
c Transposase genes PFR_JS15–2_359 and PFR_JS15–2_360 are identical to PFR_JS20–2_544 and PFR_JS20–2_545. The transposase genes do not appear to be acquired through horisontal transfer, as each of them is present in their respective genomes in at least one more location. All of the transposase genes have counterparts in multiple strains, suggesting their intrinsic character for the species
Fig. 1Genome composition and organisation. Panel a) Average Nucleotide Identity (%) calculated based on pairwise BLAST alignment (ANIb). The levels of similarity are highlighted by coloring from green for the most similar to red for the most dissimilar. The strains JS4, JS15 and JS17 are on average the most similar to all other strains, while the strains JS9 and JS20 are the most dissimilar to all other strains and only slightly more similar to one another. The strains of cereal origin (JS11-JS14) are more similar to each other than to other strains. Panel b) Whole genome alignments generated with ProgressiveMauve. The genomes are arranged according to the phylogenetic tree generated from core genome alignments (see below). The distinct organisation of the genomes of closely related strains can be observed, most clearly between strains JS and JS10, JS15 and JS23 as well as JS4 and JS21. The regions of genome rearrangements in these strains are indicated with matching lines (solid, dot-dash or dash)
Fig. 2Core genome and pan genome of the P. freudenreichii species. The core genome (a) and the pan genome (b) are represented as a variation of the gene pools after sequential addition of 20 P. freudenreichii genomes. The summary of ortholog group distribution between the strains is presented in a pie chart (c). Core genes- present in all of the strains; Soft core genes- present in 19 of the strains; Shell genes- present in 3-18 of the strains; Cloud genes- present in one or two strains only
Fig. 3Flower plot representing comparative analysis of the genome. The orthologous groups shared between the strains are indicated in the center. The number of accessory genes for each strain are indicated on each petal. In the brackets are the genes unique to that strain. The petals are colored based on the degree of relatedness of the strains. The unrooted phylogenetic tree was created based on the core genome alignments. *Type strain P. freudenreichii DSM 4902; **Type strain P. freudenreichii DSM 20271 (CP010341)
Fig. 4Map of accessory genome alignments generated by Roary. Gene clusters unique to individual strains are marked in red and numbered. 1) Genomic island with genes coding for the CASCADE-like CRISPR-Cas systems in strains JS2, JS7 and JS9; 2) Genomic island unique to strains JS4, JS21 and JS25. Genes located on this island include transposase genes with 96–98% sequence identity to those from Corynebacterium urealyticum DSM 7111 and a gene coding for an additional Cobyrinic acid A,C-diamide synthase; 3) Heat shock island unique to strains JS9 and JS20. Genes on the island include an 18-kDa heat shock protein, DnaK, GrpE, CbpM, ClpB and others; Features 4–16 are gene clusters unique to their respective strains. These include complete prophages (8, 12, 13, and 14), phage remnants (6, 8, 9 and 12), predicted genomic islands with genes encoding various functions: resistance to heavy metals (7), possible antibiotic resistance (15), genetic loci with genes coding for restriction and modification systems (7, 11, 12 and 14), and the pilus locus (9). Unique gene clusters 4, 9, 11, 14 and 16, despite sequence differences share structural similarities, including the presence of genes coding for Single-stranded DNA-binding protein, TraM recognition site of TraD and TraG, AAA-like domain protein (VirB4-like), Multifunctional conjugation protein TraI (TrwC or TraA relaxase), Type IV secretory system Conjugative DNA transfer (TraG-like), ParB-like nuclease domain protein, Bifunctional DNA primase/polymerase and Murein DD-endopeptidase MepM. The presence of TraA, TraG and VIrB4 are indicative of Integrative and Conjugative Elements (ICEs) type T4SS. The majority of the unique gene clusters have regions with a high degree of sequence identity to other Actinobacteria, including Propionibacterium acidipropionici, Corynebacterium falsenii, Cutibacterium avidum and Microbacterium sp. Details can be viewed in the Additional tables of the respective strains in the column “Note”
Fig. 5Summary of the genomic features. Core genome alignment phylogenetic tree with the genomic features displayed on a multibar chart, including detected prophages, plasmid-like elements, complete CRISPR-Cas systems, RM systems for which methylases were unambiguously matched with recognition sequences and the genomic islands predicted by at least one method. The strains for which more than one version of the genome was detected are marked with an asterisk
Fig. 6Bacteriophages identified in this study. PJS7 is 99% identical with the recently published genome of Propionibacterium phage PFR1 (NC_031076.1), but different from the other phages identified in this study. Phages PJS22 and PJS23 are similar to each other and to closely related Propionibacterium phages B22 (KX620750.1) and Doucette (KX620751.1). Part of the annotations was derived from the most closely related phages and can be viewed in Additional Tables JS7, JS22 and JS23
Fig. 7Putative conjugative plasmids identified in this study. *Type II restriction-modification system with the recognition motif CTCGAG. **DNA stretch with 88% nucleotide identity to Propionibacterium phages PFR1 (NC_031076.1), PFR2 (KU984980.1) and G4 (KX620754.1)
Fig. 8CRISPR-Cas systems detected in the sequenced strains. Strains JS9, JS2 and JS7 possess CRISPR-Cas sytem type IE (CASCADE), while all the other strains possess system type IU. Only strain JS2 possessess both types of the CRISPR-Cas systems. Green-purple-green boxes indicate presence of repeats and spacers. tn- transposase; hp.- hypothetical proteins
CRISPR-Cas systems identified in the studied strains by CRISPR-finder
| Strain | Predicted CRISPR | Start | End | Size | DR (nt) | No. spacers | Type | Immunity to identified phages | Immunity to mobile elements identified in this study | Locus ID | Note |
|---|---|---|---|---|---|---|---|---|---|---|---|
| JS | Crispr_2 | 2,424,099 | 2,427,021 | 2922 | 36 | 40 | IU | Anatole, B3, B22, E1, E6, G4, PFR1, PFR2 | PJS22, PJS23, PJS7, pJS12 | PFREUDJS001_002141-PFREUDJS001_002147 | – |
| JS2 | Crispr_3 | 2,082,550 | 2,088,376 | 5826 | 28 | 95 | IE | B22, B3, E1,E6, G4, Anatole, Doucette, PFR1, PFR2 | PJS22, PJS23, PJS7, pJS25 | PFR_JS2_1825-PFR_JS2_1832 | – |
| Crispr_4 | 2,120,759 | 2,122,610 | 1851 | 36 | 25 | IU | PFR1, PFR2 | PJS7 | PFR_JS2_1843-PFR_JS2_1849 | – | |
| JS4 | Crispr_3 | 1,825,170 | 1,827,746 | 2576 | 36 | 35 | IU | Doucette, E6, G4, PFR1, PFR2 | PJS22, PJS7, pJS25 | PFR_JS4_1650-PFR_JS4_1655 | One system, negative strand, spacers separated by a transposase gene and then by a four-gene integron. |
| Crispr_2 | 1,823,162 | 1,823,697 | 535 | 36 | 7 | – | – | ||||
| Crispr_1 | 1,818,598 | 1,819,215 | 617 | 36 | 8 | – | – | ||||
| JS7 | Crispr_2 | 2,203,337 | 2,207,335 | 3998 | 28 | 65 | IE | B22, B3, E1,E6, G4, Anatole, Doucette, PFR1, PFR2 | PJS22, PJS7, pJS12 | PFR_JS7–1_1964-PFR_JS7–1_1971 | No |
| JS8 | Crispr_3 | 2,192,257 | 2,196,491 | 4234 | 36 | 58 | IU | Anatole, Doucette, B3, B22, E1, E6, G4, PFR1, PFR2, phiB5 | PJS22, PJS23, PJS7, pJS25 | PFR_JS8_1930-PFR_JS8_1935 | – |
| JS9 | Crispr_3 | 2,260,079 | 2,260,655 | 576 | 28 | 9 | IE | – | – | PFR_JS9–1_2018-PFR_JS9–1_2025 | One system, negative strand, spacers separated by an integrase |
| Crispr_2 | 2,253,046 | 2,258,937 | 5891 | 28 | 96 | B22, B3, E1,E6, G4, Anatole, Doucette, PFR1, PFR2, phiB5 | – | ||||
| Crispr_4 | 2,288,755 | 2,292,746 | 3991 | 29 | 65 | N/A | Doucette, B3, B22, G4, PFR1, PFR2 | PJS23, PJS7 | Remnant of a system, no Cas. Spacers separated by an integrase. | ||
| Crispr_5 | 2,293,887 | 2,295,016 | 1129 | 29 | 18 | PFR1, PFR2 | PJS7 | ||||
| JS10 | Crispr_2 | 2,029,533 | 2,032,236 | 2703 | 36 | 37 | IU | Anatole, B3, B22, E1, E6, G4, PFR1, PFR2 | PJS22, PJS23, PJS7, pJS12 | PFR_JS10_1797-PFR_JS10_1803 | |
| JS12 | Crispr_1 | 2,181,462 | 2,185,411 | 3949 | 36 | 54 | IU | Doucette, B3, B22, E6, G4, PFR1, PFR2, phiB5 | PJS22, PJS23, PJS7 | PFR_JS12–1_1906-PFR_JS12–1_1911 | |
| JS14 | PossibleCrispr_1 | 1,595,184 | 1,595,290 | 106 | 36 | 1 | N/A | Anatole, E1 | PFR_JS14_1397 | ||
| JS15_1 | Crispr_1 | 441,841 | 444,266 | 2425 | 36 | 33 | IU | Anatole, B3, B22, E1, E6, G4, phiB5 | PJS23, pJS12 | PFR_JS15–1_371-PFR_JS15–1_376 | |
| JS16 | Crispr_2 | 417,820 | 419,522 | 1702 | 36 | 23 | IU | B22, E6, G4, PFR1, PFR2 | PJS22, PJS23, PJS7 | RM25_0342-RM25_0347 | One system, negative strand, spacers separated by a transposase gene. Both CRISPR stretches carry immunity to known phages. |
| Crispr_1 | 413,342 | 416,347 | 3005 | 36 | 41 | Anatole, B3, B22, E1, E6 | PJS22 | ||||
| JS17_1 | Crispr_2 | 2,220,735 | 2,225,402 | 4667 | 36 | 64 | IU | Anatole, Doucette, B3, B22, E1, PFR1, PFR2 | PJS22, PJS23, pJS12, pJS25 | PFR_JS17–1_1953-PFR_JS17–1_1958 | – |
| JS20_1 | Crispr_3 | 2,034,463 | 2,037,183 | 2720 | 36 | 37 | IU | Doucette, B22, E6, G4, PFR1, PFR2 | PJS23, PJS7 | PFR_JS20–1_1815-PFR_JS20–1_1820 | One system, spacers separated by a transposase gene and then by an eight-gene integron. |
| Crispr_4 | 2,038,656 | 2,039,048 | 392 | 36 | 5 | – | – | ||||
| Crispr_5 | 2,044,247 | 2,044,936 | 689 | 36 | 9 | – | – | ||||
| JS21_1 | Crispr_5 | 2,189,758 | 2,192,624 | 2866 | 36 | 39 | IU | Doucette, E6, G4, PFR1, PFR2 | PJS22, PJS7, pJS25 | PFR_JS21–1_1963-PFR_JS21–1_1968 | One system, negative strand, the spacers separated by a transposase gene and then by a four-gene integron. |
| Crispr_4 | 2,187,750 | 2,188,285 | 535 | 36 | 7 | – | – | ||||
| Crispr_3 | 2,183,185 | 2,183,803 | 618 | 36 | 8 | – | – | ||||
| JS22 | Crispr_2 | 2,150,779 | 2,151,318 | 539 | 36 | 7 | IU | – | – | Remnant of the same IU system, but no Cas. No immunity to known phages found. | |
| JS23 | Crispr_2 | 2,124,520 | 2,126,512 | 1992 | 36 | 27 | IU | Anatole, B3, B22, E1, E6, G4, phiB5 | PJS23 | PFR_JS23_1889-PFR_JS23_1894 | – |
| JS25 | Crispr_2 | 2,042,081 | 2,045,018 | 2937 | 36 | 40 | IU | Doucette, E6, PFR1, PFR2 | PJS22,PJS7,pJS25 | PFR_JS25–1_1807-PFR_JS25–1_1812 | One system, spacers separated by a transposase gene and then by a four-gene integron. |
| Crispr_3 | 2,046,491 | 2,046,954 | 463 | 36 | 6 | – | – | ||||
| Crispr_4 | 2,050,901 | 2,051,519 | 618 | 36 | 8 | – | – |
Methylation motifs and the responsible methylases identified in P. freudenreichii
Recognition sequences in parentheses indicates the methylase responsible cannot be assigned unambiguously. Modified bases are marked in colour; “not annotated” indicates there is no locus tag as the gene is not annotated in the GenBank file. Color coding for the methylation types Blue: 6 mA; Orange: 4mC; Purple: 5mC
*It is possible that it is really M.PfrJS17ORF2252P that is the active methylase in this organism. M.PfrJS17ORF2252P shares 99% aa sequence identity with M.PfrJS22I, but its corresponding S protein is truncated
**It is possible that it is really M.PfrJS22ORF850P that is the active methylase in this organism. M.PfrJS22ORF850P is 100% identical to M.PfrJS17I, as are their corresponding S proteins while the R proteins differ by one amino acid (475 Ala:Gly)
Fig. 9Pilus and mucus binding of P. freudenreichii. a Pilus operons predicted by LOCP. b Transmission electron microscopy (TEM) images of the strains with intact operons (JS18 and JS20) and control (JS16). c Adhesion assay of specific binding to porcine mucus and non-specific binding to BSA with cell-free PBS as control. The difference in specific binding of strain JS18 to mucus compared to non-specific binding to BSA and background PBS was statistically significant (p < 0.05) and is marked with asterisk