| Literature DB >> 27773878 |
Julieta Bonacina1, Nadia Suárez1, Ricardo Hormigo1, Silvina Fadda1, Marcus Lechner2, Lucila Saavedra1.
Abstract
The study of enterococcal genomes has grown considerably in recent years. While special attention is paid to comparative genomic analysis among clinical relevant isolates, in this study we performed an exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential to be used as probiotics. Beyond common genetic features, we especially aimed to identify those that are specific to enterococcal strains isolated from a certain food-related source as well as features present in a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7 different species, were examined and compared. Their phylogenetic relationship was reconstructed based on orthologous proteins and whole genomes. Likewise, markers associated with a successful colonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clustered regularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features. At the same time, a search for antibiotic resistance genes was carried out, since they are of big concern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteome universally present among the genera and to determine that most of the accessory genes code for hypothetical proteins, providing reasonable hints to support their functional characterization.Entities:
Keywords: Enterococcus; food; genomics; niche; probiotics
Mesh:
Substances:
Year: 2017 PMID: 27773878 PMCID: PMC5381348 DOI: 10.1093/dnares/dsw043
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Relevant genomic features and isolation source of the enterococcal species under study
| Origin/feature | Organism | Genbank | Status | Size | GC% | Genes (NCBI) | Proteins (NCBI) | CDS (RAST) | Source |
|---|---|---|---|---|---|---|---|---|---|
| Dairy isolates | AOSM00000000.1 | Contig | 3.1 | 37.7 | 2941 | 2789 | 2978 | Cheese | |
| JMIG00000000.1 | Scaffold | 2.8 | 36.8 | 2501 | 2404 | 2537 | Cultured milk | ||
| AOUK00000000.1 | Contig | 2.9 | 37.7 | 2931 | 2616 | 3118 | Artisanal cheese | ||
| AHXD00000000.1 | Scaffold | 2.9 | 38.3 | 2823 | 2749 | 2826 | Cheese | ||
| CP004063.1 (chromosome) | Complete | 2.8 | 37.85 | 2771 | 2669 | 2837 | Milk and dairy utensils | ||
| CP004064.1 (plasmid) | |||||||||
| AWWM00000000.1 | Contig | 2.8 | 38.1 | 2699 | 2507 | 2878 | Pannerone cheese | ||
| ASAM00000000.1 | Contig | 2.7 | 37.9 | 2674 | 2549 | 2702 | Fontina cheese | ||
| AWWN00000000.1 | Contig | 2.9 | 38.1 | 2814 | 2667 | 2899 | Cheese | ||
| ASWA00000000.1 | Scaffold | 4.6 | 39.9 | 4581 | 4499 | 4524 | Gouda cheese | ||
| ASWC00000000.1 | Scaffold | 3.1 | 38.4 | 2986 | 2902 | 2915 | Dairy products | ||
| AFWZ00000000.1 | Contig | 3.1 | 38.4 | 2785 | 2651 | 2765 | Stripping milk of an argentinean cow | ||
| JDFT00000000.1 | Contig | 2.9 | 38.3 | 2851 | 2757 | 2759 | Artisanal cheese | ||
| Meat isolates | AIIV00000000.1 | Scaffold | 3.0 | 37.5 | 2835 | 2788 | 2819 | Turkey meat | |
| AIIT00000000.1 | Scaffold | 3.0 | 37.3 | 2852 | 2797 | 2816 | Pork meat | ||
| AHXE00000000.1 | Scaffold | 2.9 | 38.2 | 2881 | 2801 | 2849 | Fish burguer | ||
| ASAL00000000.1 | Contig | 2.7 | 37.9 | 2598 | 2480 | 2680 | Fermented sausage | ||
| AWWO00000000.1 | Contig | 2.7 | 37.9 | 2628 | 2491 | 2748 | Fermented sausage | ||
| AWWP00000000.1 | Contig | 2.6 | 37.9 | 2477 | 2390 | 2525 | Llama sausage | ||
| Probiotics | JRGY00000000.1 | Contig | 2.7 | 38.3 | 2641 | 2442 | 2728 | Probiotic preparation | |
| HF558530.1 | Complete | 2.8 | 37.7 | 2810 | 2705 | 2724 | Stool specimen of a healthy human adult | ||
| ADKN00000000.1 | Contig | 2.8 | 37.6 | 2661 | 2560 | 2612 | Pooled fecal sample collected from healthy human | ||
| CP006030.1 (chromosome) | Complete | 2.7 | 38.46 | 2639 | 2512 | 2653 | Not specified | ||
| CP006031.1 (plasmid) | |||||||||
| ATKJ00000000.2 | Contig | 4.2 | 39.4 | 4129 | 4016 | 4180 | Infant fecal material | ||
| Dairy isolates and probiotics | JRGX00000000.1 | Scaffold | 2.6 | 38.0 | 2559 | 2367 | 2646 | Probiotic preparation | |
| JMEC00000000.1 | Contig | 3.1 | 37.5 | 3051 | 2734 | 3180 | Dairy origin |
All Accession numbers are from genbank.
Genomes sizes were calculated as the total length of contigs for each genome.
Figure 1Phylogenetic relationships among food isolated and probiotic enterococci. Strains belonging to different groups are indicated on the bottom left. As outgroups Lactococcus lactis subsp. cremoris MG1363, Lactococcus garvieae Lg2, Listeria monocytogenes HCC23 and Lactobacillus johnsonii NCC 533, were selected. The scale bar indicates the relative distance of sequences. Bootstrap support values are indicated at the nodes. A) Genome-based phylogenetic tree. Genomes were aligned using pMauve and phylogeny reconstructed using a rapid bootstrap analysis with 1000 replicates with RAxML in two steps. First representative strains for each species were used to retrieve the species topology then the procedure was repeated at strain level to maximize resolution here. B) Protein-based phylogenetic tree. The tree is based on all 1:1-orthologs identified with Proteinortho. As in A, a rapid bootstrap analysis with 1000 replicates to search for best-scoring maximum likelihood tree was performed using RAxML. For details, see Materials and Methods.
Figure 2Coding sequences dispensed in subsystems and their distribution in categories of metabolic function. A) Percentage of predicted CDS assigned and not assigned to subsystems. B) Distribution of metabolic functions in categories based on the subsystems features counts.
Figure 3Venn diagram showing i) the number of unique FIGfam protein classes that were present in at least 40% of the strains and in at least two different species of the respective group but in no species/strain of any other group (unique), ii) the number of classes that were similarly present in two or more groups (overlapped), iii) the number of protein classes that were similarly present in all groups (in the center of the diagram), and iv) the subset of classes that were found to be present in all groups and in at least four different Enterococci species rather than two (indicated in parentheses).
Number of antibiotic resistance and bacteriocin genes in the enterococcal genomes
| ARG for each antibiotic class | Bacteriocin class | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Group | Strain | AGly | Gly | MLS | Tet | Tmt | ∑ | I | II | III | ∑ |
| Dairy isolates | IPLA 655 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| INF E1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |
| CRL 1879 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 6 | 1 | 7 | |
| E1604 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 1 | 1 | 2 | |
| NRRL B-2354 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 6 | 1 | 7 | |
| UC7256 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 1 | 1 | |
| UC 7267 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 1 | 1 | |
| UC8668 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 1 | 1 | |
| ATCC 43197 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| ATCC 882 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 2 | |
| CRL1656 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | |
| CRL35 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | |
| Meat isolates | 2924 | 0 | 7 | 4 | 2 | 1 | 14 | 0 | 0 | 0 | 0 |
| 19116 | 5 | 0 | 4 | 1 | 0 | 10 | 2 | 1 | 1 | 4 | |
| E1613 | 1 | 0 | 1 | 0 | 0 | 2 | 1 | 4 | 1 | 6 | |
| UC7251 | 3 | 0 | 3 | 2 | 0 | 8 | 0 | 0 | 1 | 1 | |
| UC8733 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 8 | 2 | 10 | |
| UC10237 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 1 | 0 | 1 | |
| Probiotics | PC1.1 | 0 | 0 | 2 | 0 | 0 | 2 | 0 | 0 | 1 | 1 |
| Symbioflor 1 | 0 | 0 | 2 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | |
| L-X | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 3 | 1 | 4 | |
| T-110 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 2 | 1 | 3 | |
| CFTRI 2200 | 1 | 0 | 1 | 2 | 0 | 4 | 0 | 1 | 0 | 1 | |
| Dairy isolates and probiotics | L-3 | 1 | 0 | 1 | 0 | 0 | 2 | 0 | 3 | 1 | 4 |
| MB5259 | 0 | 0 | 2 | 0 | 0 | 2 | 2 | 1 | 0 | 3 | |
ARG: antibiotic resistance gene; AGly: aminoglycosides; Gly: glycopeptides; MLS: macrolide-lincosamidestreptogramin; Tet: tetracyclines; Tmt: trimethoprim.
Distribution of the intact prophage regions among the enterococcal strains
| Group | Strain | Region | Length | Nº CDS | GC% | Most common phage (hit genes count) |
|---|---|---|---|---|---|---|
| Dairy isolates | IPLA 655 | 1 | 40.7 | 55 | 35.4 | Lister 2389 (16) |
| 2 | 44.8 | 64 | 36.8 | Entero phiEf11 (13) | ||
| 3 | 16.5 | 27 | 35.1 | Bacill G (2) | ||
| INF E1 | 1 | 38.8 | 54 | 33.4 | Entero phiFL1A (10) | |
| CRL 1879 | 1 | 52.5 | 61 | 37.5 | Entero EfaCPT1 (11) | |
| E1604 | 1 | 37.6 | 51 | 36.8 | Entero IME EF4 (11) | |
| NRRL B-2354 | 1 | 56.8 | 59 | 36.64 | Lactob phig1e (12) | |
| 2 | 46 | 58 | 34.96 | Lactob phig1e (13) | ||
| UC7256 | 1 | 48.7 | 54 | 36.8 | Lactob phig1e (12) | |
| UC 7267 | 1 | 31 | 36 | 38.5 | Bacill BCJA1c (10) | |
| UC8668 | 1 | 25.8 | 31 | 36.8 | Lactob phig1e (8) | |
| 2 | 47 | 59 | 36.1 | Lister 2389 (15) | ||
| ATCC 43197 | — | — | — | — | — | |
| ATCC 882 | 1 | 46.5 | 66 | 36.7 | Bacill BCJA1c (10) | |
| CRL1656 | — | — | — | — | — | |
| CRL35 | 1 | 38.5 | 50 | 36.3 | Lactob PL 1 (13) | |
| 2 | 44.4 | 68 | 37.3 | Lactoc Tuc2009 (13). Entero phiEf11 (13) | ||
| 3 | 44.1 | 53 | 37.6 | Bacill BCJA1c (9) | ||
| Meat isolates | 2924 | 1 | 36.8 | 46 | 34.5 | Strept phi3396 (8). Lister LP 101 (8) |
| 19116 | 1 | 68.8 | 72 | 36.2 | Entero EFC 1 (38) | |
| E1613 | 1 | 37.1 | 50 | 35.4 | Lister 2389 (14) | |
| UC7251 | 1 | 70.1 | 64 | 35.9 | Lister 2389 (16) | |
| UC8733 | 1 | 31.9 | 35 | 37.9 | Entero IME EF4 (12) | |
| UC10237 | 1 | 21.8 | 23 | 37.5 | Lister B025 (5) | |
| Probiotics | PC1.1 | 1 | 21.6 | 27 | 36.8 | Lister LP 101 (9) |
| Symbioflor 1 | 1 | 41.6 | 54 | 35.83 | Entero phiEf11 (12). Temper phiNIH1 1 (12) | |
| 2 | 45.2 | 66 | 34.63 | Entero phiEf11 (63) | ||
| L-X | 1 | 25.5 | 30 | 34.4 | Bacill BCJA1c (7) | |
| 2 | 41.9 | 55 | 36.1 | Lactoc Tuc2009 (13). Entero phiEf11 (13). Lactoc TP901 1 (13) | ||
| T-110 | 1 | 41.9 | 56 | 36.16 | Lactoc Tuc2009 (13). Entero phiEf11 (13) | |
| CFTRI 2200 | 1 | 50.1 | 71 | 37.1 | Entero phiFL3A (18) | |
| 2 | 18.8 | 32 | 40.5 | Entero EFC 1 (3) | ||
| Dairy isolates and probiotics | L-3 | — | — | — | — | — |
| MB5259 | 1 | 30.8 | 22 | 33 | Lactob phiAT3 (2) | |
| 2 | 43.9 | 68 | 34.4 | Entero phiEf11 (63) | ||
| 3 | 42.5 | 60 | 34.5 | Entero phiFL1A (15) | ||
| 4 | 35.3 | 55 | 35.2 | Entero phiFL3A (13) | ||
| 5 | 41.7 | 31 | 37 | Staphy StauST398 4 (2) |
Intact prophage region.
Intact prophage region length.
Phage with the highest number of CDS in the region and the number og gene counts in brackets.
Confirmed CRISPR structures in the enterococcal genomes predicted with CRISPRFinder
| Group | Strain | Confirmed CRISPR | Length | DR length | Nº of spacers |
|---|---|---|---|---|---|
| Dairy isolates | IPLA 655 | — | — | — | — |
| INF E1 | 1 | 300 | 37 | 4 | |
| 2 | 683 | 37 | 9 | ||
| CRL 1879 | — | — | — | — | |
| E1604 | 1 | 199 | 24 | 3 | |
| NRRL B-2354 | — | — | — | — | |
| UC7256 | 1 | 199 | 24 | 3 | |
| 2 | 201 | 28 | 3 | ||
| UC7267 | — | — | — | — | |
| UC8668 | 1 | 201 | 28 | 3 | |
| ATCC 43197 | — | — | — | — | |
| ATCC 882 | — | — | — | — | |
| CRL1656 | — | — | — | — | |
| CRL35 | 1 | 227 | 28 | 3 | |
| Meat isolates | 2924 | 1 | 365 | 36 | 5 |
| 19116 | — | — | — | — | |
| E1613 | 1 | 199 | 24 | 3 | |
| UC7251 | 1 | 199 | 24 | 3 | |
| UC8733 | — | — | — | — | |
| UC10237 | — | — | — | — | |
| Probiotics | PC1.1 | 1 | 365 | 36 | 5 |
| Symbioflor 1 | 1 | 629 | 36 | 9 | |
| L-X | 1 | 199 | 24 | 3 | |
| T-110 | — | — | — | — | |
| CFTRI 2200 | 1 | 484 | 23 | 6 | |
| 2 | 221 | 27 | 3 | ||
| 3 | 1215 | 30 | 18 | ||
| 4 | 537 | 24 | 8 | ||
| Dairy isolates and probiotics | L-3 | 1 | 199 | 24 | 3 |
| MB5259 | 1 | 431 | 36 | 6 |
Total length of the CRISPR cassette.
Length of the repeated sequences.
Figure 4Heat map showing the percentage of shared proteins between the genomic islands identified with IslandViewer3. All islands with at least 50% shared proteins to any other island were considered. Dark colour indicates nearly perfect overlaps while white means no detectable relation at all.