| Literature DB >> 22031452 |
Oksana Lukjancenko1, David W Ussery, Trudy M Wassenaar.
Abstract
Six bacterial genera containing species commonly used as probiotics for human consumption or starter cultures for food fermentation were compared and contrasted, based on publicly available complete genome sequences. The analysis included 19 Bifidobacterium genomes, 21 Lactobacillus genomes, 4 Lactococcus and 3 Leuconostoc genomes, as well as a selection of Enterococcus (11) and Streptococcus (23) genomes. The latter two genera included genomes from probiotic or commensal as well as pathogenic organisms to investigate if their non-pathogenic members shared more genes with the other probiotic genomes than their pathogenic members. The pan- and core genome of each genus was defined. Pairwise BLASTP genome comparison was performed within and between genera. It turned out that pathogenic Streptococcus and Enterococcus shared more gene families than did the non-pathogenic genomes. In silico multilocus sequence typing was carried out for all genomes per genus, and the variable gene content of genomes was compared within the genera. Informative BLAST Atlases were constructed to visualize genomic variation within genera. The clusters of orthologous groups (COG) classes of all genes in the pan- and core genome of each genus were compared. In addition, it was investigated whether pathogenic genomes contain different COG classes compared to the probiotic or fermentative organisms, again comparing their pan- and core genomes. The obtained results were compared with published data from the literature. This study illustrates how over 80 genomes can be broadly compared using simple bioinformatic tools, leading to both confirmation of known information as well as novel observations.Entities:
Mesh:
Year: 2011 PMID: 22031452 PMCID: PMC3324989 DOI: 10.1007/s00248-011-9948-y
Source DB: PubMed Journal: Microb Ecol ISSN: 0095-3628 Impact factor: 4.552
Genomes selected for analysis
| GPID | Strain namea | Size, bp or Mb | % CG | Contigs | Number of genes | Strain characteristics |
|---|---|---|---|---|---|---|
| 82 |
| 1,993,560 | 34.7 | 1 | 1,862 | Commercial strain for yogurt, fluid milk production |
| 404 |
| 2,340,228 | 46.1 | 3 | 2,218 | Starter culture for beer, sourdough, and silage |
| 402 |
| 2,924,325 | 46.6 | 2 | 2,771 | Starter culture for milk fermentation and flavour development of cheese |
| 30359 |
| 3,079,196 | 46.3 | 1 | 3,044 | Probiotic strain |
| 46813 |
| 2,043,161 | 36.9 | 1 | 2,024 | Normal oral/vaginal flora, chicken isolate |
| 16871 |
| 1,864,998 | 49.7 | 1 | 2,096 | Yogurt |
| 403 |
| 1,856,951 | 49.7 | 1 | 1,721 | Thermophilic starter culture for yogurt, Swiss and Italian-type cheeses |
| 18979 |
| 2,098,685 | 51.5 | 1 | 1,843 | Not specified |
| 84 |
| 1,894,360 | 35.3 | 1 | 1,755 | Human isolate, type strain |
| 17811 |
| 2,080,931 | 37.1 | 1 | 1,610 | Cheese culture |
| 36575 |
| 1,785,116 | 34.4 | 1 | 1,737 | Competitive exclusion strain in chicken |
| 9638 |
| 1,992,676 | 34.6 | 1 | 1,821 | Probiotic strain |
| 32969 |
| 3,197,759 | 44.7 | 1 | 2,948 | Probiotic strain |
| 356 |
| 3,348,625 | 44.4 | 4 | 3,101 | Human saliva |
| 15766 |
| 1,999,618 | 38.9 | 1 | 1,900 | Type strain, human isolate |
| 19011 |
| 2,039,414 | 38.9 | 1 | 1,820 | Human isolate |
| 32195 |
| 3,010,111 | 46.7 | 1 | 2,944 | Probiotic strain |
| 40637 |
| 3,005,051 | 46.7 | 1 | 2,834 | Human isolate |
| 32197 |
| 3,033,106 | 46.7 | 2 | 2,992 | Probiotic strain |
| 13435 |
| 1,884,661 | 41.3 | 1 | 1,885 | Fermenting |
| 13280 |
| 2,133,977 | 33.0 | 4 | 2,014 | Probiotic strain |
| 18797 |
| 2,529,478 | 35.7 | 1 | 2,516 | Plasmid-cured NCDO712, lab strain |
| 401 |
| 2,598,348 | 35.8 | 6 | 2,504 | Cheese production |
| 72 |
| 2,365,589 | 35.3 | 1 | 2,266 | Laboratory strain |
| 41115 |
| 2,635,654 | 34.9 | 1 | 2,575 | Fermenting, non-dairy |
| 16062 |
| 1,896,614 | 38.9 | 5 | 1,823 | Kimchi (food, Korea) |
| 40837 |
| 2,101,787 | 37.0 | 1 | 2,130 | Kimchi? not specified |
| 315 |
| 2,075,763 | 37.7 | 2 | 2,005 | Food fermentation, not specified |
| 70 |
| 3,359,974 | 37.4 | 4 | 3,265 | Clinical, blood isolate, vancomycin resistant |
| 32949 |
| 2,729,089 | 37.7 | 49 | 2,522 | Urine isolate |
| 32941 |
| 2,853,151 | 37.5 | 75 | 2,737 | Faecal isolate, antibiotic-naïve, normal flora |
| 20843 |
| 2,739,625 | 37.7 | 1 | 2,515 | No info - lab strain? |
| 32919 |
| 2,821,089 | 37.6 | 40 | 2,603 | Urine isolate |
| 32927 |
| 3,134,429 | 40.6 | 49 | 2,985 | No info |
| 32931 |
| 3,423,270 | 42.5 | 54 | 3,243 | No info |
| 32935 |
| 3,392,502 | 42.8 | 57 | 3,121 | No info |
| 46979 |
| 2,811,160 | 37.9 | 78 | 2,705 | Human microbiome, normal flora |
| 32965 |
| 2,685,402 | 38.1 | 67 | 2,573 | No info |
| 32967 |
| 2,771,455 | 38.3 | 70 | 2,698 | No info |
| 330 |
| 2,160,267 | 35.6 | 1 | 2,124 | Clinical isolate, common in adults |
| 326 |
| 2,127,839 | 35.6 | 1 | 1,996 | No info |
| 334 |
| 2,211,485 | 35.6 | 1 | 2,134 | Blood isolate |
| 27849 |
| 2,106,340 | 39.6 | 1 | 2,100 | No info |
| 34729 |
| 2,350,911 | 37.6 | 1 | 2,261 | Normally rumen flora, this is a clinical human isolate from endocarditis |
| 66 |
| 2,196,662 | 40.5 | 1 | 2,051 | Causes caries and periodontal diseases |
| 20527 |
| 1,925,087 | 37.6 | 22 | 1,962 | Human microbiome project, normal flora |
| 16302 |
| 2,146,611 | 40.0 | 1 | 2,018 | Clinical isolate |
| 28997 |
| 2,013,587 | 36.8 | 1 | 1,895 | Normally oral flora, can cause caries, endocarditis. Clinical isolate |
| 333 |
| 2,030,921 | 36.8 | 1 | 1,960 | Oral flora, can cause caries, caries isolate |
| 31233 |
| 2,221,315 | 39.5 | 1 | 2,135 | Alternative name Spain 23FST81. Pandemic, high prevalence, invasive |
| 29047 |
| 2,078,953 | 39.7 | 1 | 2,115 | Resistant clinical isolate |
| 277 |
| 2,160,842 | 39.7 | 1 | 2,125 | Virulent clinical isolate |
| 269 |
| 1,852,441 | 38.5 | 1 | 1,696 | Group A |
| 16364 |
| 1,928,252 | 38.4 | 1 | 1,987 | Sequenced for comparative genome analysis |
| 286 |
| 1,895,017 | 38.5 | 1 | 1,845 | Serotype M18 |
| 13942 |
| 2,388,435 | 43.4 | 1 | 2,270 | Indigenous oral bacteria, causes dental decay, oral plaque isolate |
| 17153 |
| 2,096,309 | 41.1 | 1 | 2,186 | Causes disease in pigs and occasionally humans |
| 32237 |
| 2,170,808 | 41.0 | 2 | 2,058 | Human clinical isolate |
| 18737 |
| 2,038,034 | 41.4 | 1 | 1,978 | Causes meningitis, arthritis, pneumonia in pigs human epidemic in China |
| 13163 |
| 1,796,226 | 39.1 | 1 | 1,915 | Isolated from yogurt for industrial dairy fermentations |
| 13773 |
| 1,864,178 | 39.1 | 3 | 1,716 | Used in the manufacture of fermented dairy foods |
| 13162 |
| 1,796,846 | 39.1 | 1 | 1,889 | Isolated from yogurt for industrial dairy fermentations |
| 16321 |
| 2,089,645 | 59.2 | 1 | 1,631 | Normal gut flora |
| 19423 |
| 1,933,695 | 60.5 | 1 | 1,528 | Normal gut flora |
| 42883 |
| 1,942,198 | 60.5 | 1 | 1,642 | Normal gut flora |
| 32897 |
| 1,938,709 | 60.5 | 1 | 1,567 | Normal gut flora |
| 32893 |
| 1,938,483 | 60.5 | 1 | 1,566 | Normal gut flora |
| 32515 |
| 1,944,050 | 60.4 | 1 | 1,572 | Normal gut flora |
| 28807 |
| 1,915,892 | 60.4 | 28 | 1,578 | Normal gut flora |
| 17583 |
| 2,636,367 | 58.5 | 1 | 2,129 | Normal oral and gut flora, can cause caries, caries isolate |
| 20555 |
| 2,642,081 | 58.5 | 2 | 2,151 | Human microbiome, faeces isolate |
| 18773 |
| 2,389,526 | 60.2 | 3 | 2,003 | Normal gut flora, probiotic |
| 328 |
| 2,260,266 | 60.1 | 2 | 1,729 | Normal gut flora, probiotic |
| 17189 |
| 2,832,748 | 59.9 | 1 | 2,416 | Normal gut flora, probiotic |
| 30065 |
| 2,453,376 | 60.2 | 55 | 2,085 | Normal gut flora, human microbiome project |
| 47579 |
| 2,477,838 | 59.8 | 1 | 1,959 | Normal gut flora, probiotic |
| 29261 |
| 2,007,108 | 59.4 | 17 | 1,586 | Normal gut flora, type strain |
| 30055 |
| 2,186,140 | 62.8 | 33 | 1,810 | Normal gut flora, probiotic |
| 30749 |
| 2,058,429 | 56.1 | 31 | 1,720 | Normal gut flora |
| 30751 |
| 2,019,802 | 57.5 | 27 | 1,580 | Human microbiome project |
| 30373 |
| 2,304,808 | 56.3 | 36 | 1,870 | Human microbiome project |
aThe official abbreviation ‘subsp.’ between species and subspecies name has been deleted throughout this contribution
GPID genome project identification number (NCBI: see http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi), NA not available
Average findings per genus and their pan- and core genome
| Genus | Number of genomes included | Number of species | Average genome size (kbp) | Average % CG | Average number of genes (min–max values) | Average number of gene families (min–max values) | Pan-genomea | Core genomea |
|---|---|---|---|---|---|---|---|---|
|
| 21 | 14 | 2,369 | 42.4 | 2,235 (1,562–3,059) | 2,071 (1,437–2,873) | 13,069 | 363 |
|
| 4 | 1 | 2,532 | 35.4 | 2,465 (2,266–2,504) | 2,238 (2,118–2,341) | 3,389 | 1,522 |
|
| 3 | 3 | 2,025 | 37.9 | 1,986 (1,820–2,130) | 1,896 (1,724–2,050) | 2,927 | 1,164 |
|
| 11 | 4 | 3,041 | 36.6 | 3,078 (2,573–2,515) | 2,707 (2,439–3,114) | 7,519 | 1,092 |
|
| 23 | 12 | 1,981 | 38.9 | 2,018 (1,696–2,270) | 1,923 (1,643–2,180) | 9,785 | 638 |
|
| 19 | 9 | 2,209 | 59.5 | 1,796 (1,528–2,416) | 1,746 (1,497–2,287) | 6,980 | 724 |
aNumber of gene families is given
Figure 1Pan- and core genome plots of the six analyzed genera. The genomes were analyzed in alphabetical order of species names
Figure 2BLAST Matrix for the Lactobacillus genomes. To the side, the total number of protein genes and gene families are listed for each genome. In the matrix cells, the shared protein genes are given as a percentage, based on the ratio of the core genome and pan-genome of each pair, as indicated
Figure 3BLAST Matrix for Bifidobacterium genomes. Note that the colour scale is different to that of Fig. 2
Figure 4Consensus tree of 243 core genes conserved in all analyzed Firmicutes. The number of genes supporting the branches is shown in red. Values for fewer than 100 genes are not shown
Figure 5In silico MLST of gene fragments extracted from the genomes of Bifidobacterium (a) and Enterococcus (b, c). b The genes selected for MLST of E. faecium; c the genes selected for E. faecalis were used
Figure 6Hierarchical clustering of the Bifidobacterium genomes (a) and the Firmicute genomes (b) based on their variable gene content. The scale at the bottom applies to both trees
Figure 7Blast Atlas of Bifidobacterium genomes with B. longum strain NCC2705 (top) and B. animalis lactis strain V8 (bottom) as the reference. To the right, the BLAST lanes for each atlas are listed. The four circles inwards of the annotation lane of the reference genome represent stacking energy, position preference, global direct repeats and GC skew (from out to in)
Figure 8BLAST Atlas of Lactobacillus with L. rhamnosus strain Lc705 (top) and Lb. johnsonii strain NCC533 (bottom) as the reference
Figure 9COG statistics for the genes found in the pan-genome (left) and core genome (right) of Bifidobacterium genomes. The key for the COG classes is explained below the pie charts. Percentages given in the pie chart are calculated by exclusion of classes R, S and X. Only values ≥5% are shown
Figure 10COG distribution of pan-genome genes (left) and core genes (right) for Lactobacillus (top), Lactococcus (middle) and Leuconostoc (bottom)
Relative fractions of COG groups within the functionally annotated genes for the six genera
| COG groups |
|
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pan (%) | Core (%) | Pan (%) | Core (%) | Pan (%) | Core (%) | Pan (%) | Core (%) | Pan (%) | Core (%) | Pan (%) | Core (%) | |
| Information storage | 30.0 | 33.9 | 34.0 | 49.1 ↑↑ | 50.5 | 30.4 ↓↓ | 28.1 | 31.0 | 26.6 | 33.8 ↑ | 34.7 | 42.6 ↑↑ |
| Cellular process, signalling | 21.9 | 20.2 | 22.7 | 20.3 | 17.1 | 19.1 | 19.1 | 20.0 | 24.4 | 18.9 ↓ | 26.3 | 20.3 ↓ |
| Metabolism | 48.1 | 45.9 | 44.3 | 30.6 ↓↓ | 32.2 | 50.6 ↑↑ | 52.7 | 49.1 | 50.0 | 47.8 | 39.2 | 36.9 |
All percentages are expressed as the fraction of all COG classes C to V. The arrows indicate significant shifts between the pan-genome genes and core genes for a given genus. Percentages do not always add up to 100% due to rounding effects
Figure 11COG statistics for the genes found in the pan-genome (left) and core genome (right) of the collection of genomes from all included organisms, divided into non-pathogenic isolates (probiotic, fermentative and normal human gut flora) at the top and pathogenic isolates at the bottom
Relative fractions of COG groups within the functionally annotated genes for non-pathogens/pathogens. The arrows indicate how the reported percentages increase or decrease in the core genome compared to the pan genome.
| COG groups | Non-pathogens | Pathogens | ||
|---|---|---|---|---|
| Pan (%) | Core (%) | Pan (%) | Core (%) | |
| Information storage | 33.5 | 64.4 ↑↑ | 29.3 | 42.4 ↑↑ |
| Cell. process, signalling | 22.0 | 16.6 ↓ | 25.7 | 18.9 ↓ |
| Metabolism | 44.5 | 20.2 ↓↓ | 44.9 | 38.7 ↓ |