| Literature DB >> 30275399 |
Vikas Sharma1, Fauzul Mobeen2, Tulika Prakash3.
Abstract
Members of the genus Bifidobacterium are found in a wide-range of habitats and are used as important probiotics. Thus, exploration of their functional traits at the genus level is of utmost significance. Besides, this genus has been demonstrated to exhibit an open pan-genome based on the limited number of genomes used in earlier studies. However, the number of genomes is a crucial factor for pan-genome calculations. We have analyzed the pan-genome of a comparatively larger dataset of 215 members of the genus Bifidobacterium belonging to different habitats, which revealed an open nature. The pan-genome for the 56 probiotic and human-gut strains of this genus, was also found to be open. The accessory- and unique-components of this pan-genome were found to be under the operation of Darwinian selection pressure. Further, their genome-size variation was predicted to be attributed to the abundance of certain functions carried by genomic islands, which are facilitated by insertion elements and prophages. In silico functional and host-microbe interaction analyses of their core-genome revealed significant genomic factors for niche-specific adaptations and probiotic traits. The core survival traits include stress tolerance, biofilm formation, nutrient transport, and Sec-secretion system, whereas the core probiotic traits are imparted by the factors involved in carbohydrate- and protein-metabolism and host-immunomodulations.Entities:
Keywords: Bifidobacterium; comparative genomics; host-microbe interaction; human-gut; immunomodulations; niche-specific adaptations; pan-genome; probiotic
Year: 2018 PMID: 30275399 PMCID: PMC6210967 DOI: 10.3390/genes9100477
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
List of the 56 probiotic and human-gut strains of bifidobacteria (PHBifs) used for the analyses.
| Assembly | Organism/Name | Strain | Isolation Source | Probiotic Potential [Reference] |
|---|---|---|---|---|
| GCA_000695895.1 | RH | Feces * | Yes [ | |
| GCA_000260715.1 | ATCC 25527 | Sewage | Yes [ | |
| GCA_000021425.1 | AD011 | Infant fecal sample * | Yes [ | |
| GCA_000022965.1 | DSM 10140 | Commercially available probiotic strain | Yes [ | |
| GCA_000224965.2 | BLC1 | Commercially available probiotic strain | Yes [ | |
| GCA_000277325.1 | B420 | Commercially available probiotic strain | Yes [ | |
| GCA_000414215.1 | Bl12 | Colonoscopic sample * | No | |
| GCA_000818055.1 | BF052 | Feces of breast-fed infant * | Yes [ | |
| GCA_000022705.1 | Bl-04; ATCC SD5219 | Fecal sample from a healthy adult * | Yes [ | |
| GCA_000277345.1 | Bi-07 | Commercially available probiotic strain | Yes [ | |
| GCA_000025245.1 | BB-12 | Commercially available probiotic strain | Yes [ | |
| GCA_000220885.1 | CNCM I-2494 | Commercially available probiotic strain | Yes [ | |
| GCA_000092765.1 | V9 | Feces of healthy Mongolian infants * | Yes [ | |
| GCA_000816205.1 | KLDS2.0603 | Adult feces * | Yes [ | |
| GCA_000817045.1 | A6 | Feces * | Yes [ | |
| GCA_001025155.1 | JCM 7096 | Human feces * | Yes [ | |
| GCA_000966445.2 | GT102 | Feces * | No | |
| GCA_001025195.1 | JCM 1194 | Human feces * | Yes [ | |
| GCA_000010425.1 | ATCC 15703 | Human adult intestine * | Yes [ | |
| GCA_000817995.1 | BBMN23 | Human feces * | Yes [ | |
| GCA_000164965.1 | S17 | Feces of a breast-fed infant * | Yes [ | |
| GCA_000737885.1 | 22L | Milk * | Yes [ | |
| GCA_001281345.1 | BF3 | Feces * | Yes [ | |
| GCA_001025135.1 | JCM 1255 | Stool of breast-fed infant * | Yes [ | |
| GCA_000165905.1 | PRL2010 | Infant stool samples * | Yes [ | |
| GCA_000265095.1 | BGN4 | Human feces * | Yes [ | |
| GCA_000568955.1 | 12L | Human milk * | No | |
| GCA_000007525.1 | NCC2705 | Infant feces * | Yes [ | |
| GCA_001719085.1 | 35624 | Ileal mucosa of an individual free of gastrointestinal disease * | Yes [ | |
| GCA_000166315.1 | BBMN68 | Long-lived man’s intestinal tract * | Yes [ | |
| GCA_001025175.1 | JCM 1192 | Infant feces * | Yes [ | |
| GCA_000568975.1 | JCM 7017 | Infant feces * | No | |
| GCA_000829295.1 | 105-A | Human feces * | Yes [ | |
| GCA_000347695.1 | RBL67 | Baby feces * | Yes [ | |
| GCA_000569075.1 | S27 | Infant feces * | No | |
| GCA_001025215.1 | JCM 1200 | Infant feces * | Yes [ | |
| GCA_000569035.1 | NCFB 2258 | Infant feces * | Yes [ | |
| GCA_000569055.1 | 689b | Infant feces * | No | |
| GCA_001042615.1 | JCM 15439 | Feces of a healthy Japanese infant * | No | |
| GCA_000772485.1 | GT15 | The gastrointestinal tract (GIT) of a healthy adult from Central region of Russia * | Yes [ | |
| GCA_001446255.1 | NCIMB8809 | Stool sample * | Yes [ | |
| GCA_000569015.1 | JCM 7019 | Adult feces * | No | |
| GCA_000196555.1 | JCM 1217 | Intestine of adult * | Yes [ | |
| GCA_000008945.1 | DJO10A | Healthy young adult’s feces * | Yes [ | |
| GCA_000219455.1 | KACC 91563 | Feces of neonates * | Yes [ | |
| GCA_000196575.1 | 157F | Human infant feces * | Yes [ | |
| GCA_001725985.1 | AH1206 | Stool sample * | Yes [ | |
| GCA_000220135.1 | UCC2003 | Infant nursing stool * | Yes [ | |
| GCA_001281425.1 | BR3 | Feces * | Yes [ | |
| GCA_001293145.1 | BG7 | Feces * | Yes [ | |
| GCA_001446275.1 | CCUG30698 | Human adult intestine * | No | |
| GCA_000092325.1 | JDM301 | Human infant feces * | Yes [ | |
| GCA_000730205.1 | BXY01 | Gut * | No | |
| GCA_001281305.1 | BT1 | Feces * | No | |
| GCA_000269965.1 | JCM 1222 | Intestine of infant * | Yes [ | |
| GCA_000020425.1 | ATCC 15697 | Human infant feces * | Yes [ |
* Represents that isolation source belongs to the human host.
Figure 1Plot of pan- and core-genomes of the genus Bifidobacterium (n = 215 members). Pan-genome estimate is shown after using 30 random samples of the 215 genomes. The plot represents a stabilized core structure but an open pan-genome.
Figure 2The relative entropy of chromosome versus core-genome for the 56 PHBifs. The relative entropy depicts the magnitude of the selection pressure. The core part has a higher selection pressure than that of the chromosome. Significant differences between the two groups were tested with the Welch’s t-test (p-value < 0.001).
Figure 3A plot of K versus K of the 613 gene groups of the orthologous genes from the 56 PHBifs. Straight line represents K = K. Each symbol (+) represents a gene group of 56 orthologous nucleotide sequences from the 56 bifidobacterial genomes. The figure implies that the rates of synonymous substitutions for 613 core genes is higher than that of the non-synonymous substitutions. The resulting core proteins will not be altered and remain conserved during evolution.
Figure 4An abundance heat map of the different COGs classes present in the genomes of the 56 PHBifs. The strains are sorted (top to bottom) according to increasing genome sizes. D: Cell cycle control, cell division, chromosome partitioning; M: Cell motility; N: Cell wall/membrane/envelope biogenesis; O: Posttranslational modification, protein turnover, chaperones; T: Signal transduction mechanisms; U: Intracellular trafficking, secretion, and vesicular transport; V: Defense mechanisms; A: RNA processing and modification; J: Translation, ribosomal structure and biogenesis; K: Transcription; L: Replication, recombination and repair; C: Energy production and conversion; E: Amino acid transport and metabolism; F: Nucleotide transport and metabolism; G: Carbohydrate transport and metabolism; H: Coenzyme transport and metabolism; I: Lipid transport and metabolism; P: Inorganic ion transport and metabolism; Q: Secondary metabolites biosynthesis, transport and catabolism; R: General function prediction only; S: Function unknown; MC: Multiple classes. The Spearman’s R statistic (p-value < 0.001) was used to estimate the significant correlation between the two groups, including COGs class and genome-size, star represents the status of correlation; for details refer to Table S9.
Summary of the genomic and functional features of the core-genome of the 56 PHBifs.
| Feature | Feature Count * | Sub-Feature | Sub Feature Count # |
|---|---|---|---|
| Protein Encoding Genes (PEGs) | 613 (100%) | ||
| PEGs predicted with the COGs functions | 442 (72.1%) | Cellular Processes and Signaling | 57 (9.3%) |
| Information Storage and Processing | 135 (22.02%) | ||
| Metabolism | 179 (29.2%) | ||
| Multiple Classes | 31 (5.06%) | ||
| Poorly Characterized | 40 (6.53%) | ||
| PEGs mapped to the KEGG functions | 488 (79.61%) | ||
| PEGs assigned to the Transporter Proteins | 60 (9.79%) | ||
| PEGs assigned to the Virulence Factors | 118 (19.25%) | ||
| Subcellular Localization of PEGs | 613 (100%) | Cell Wall | 1 (0.16%) |
| Cytoplasmic | 497 (81.08%) | ||
| Extracellular | 29 (4.73%) | ||
| Membrane | 86 (14.03%) | ||
| PEGs predicted with the Transmembrane Helices | 78 (12.72%) | ||
| PEGs predicted with the Signal Peptide Cleavage Sites | 9 (1.47%) | ||
| PEGs predicted with the Lipoprotein Signal Peptides | 613 (100%) | Cytoplasmic Proteins | 561 (91.52%) |
| SPaseI-cleaved Proteins | 12 (1.96%) | ||
| Lipoproteins (SPaseII-cleaved Proteins) | 1 (0.16%) | ||
| Transmembrane Proteins | 39 (6.36%) | ||
| PEGs predicted with the Non-Classical (Not Signal Peptide Triggered) Secretion | 94 (15.33%) | ||
| PEGs assigned to the Effector Proteins | 69 (11.26%) | Endoplasmic Reticulum as an Effector Target | 34 (5.55%) |
| Mitochondrion as an Effector Target | 7 (1.14%) | ||
| Endoplasmic Reticulum as a Possible Effector Target | 9 (1.47%) | ||
| Mitochondrion as a Possible Effector Target | 19 (3.1%) | ||
| PEGs assigned to the Essential Genes | 496 (80.91%) | ||
| PEGs assigned to the Types of Other DNA-binding Proteins | 1 (0.16%) | ||
| PEGs assigned to the Types of Transcription Factors | 19 (3.1%) | ||
| PEGs assigned to the Types of Two-Component Systems | 7 (1.14%) | ||
| PEGs assigned to the Carbohydrate Active Enzymes | 11 (1.8%) | Carbohydrate-Binding Modules | 2 (0.32%) |
| Glycoside Hydrolases | 6 (0.97%) | ||
| Glycosyl Transferases | 3 (0.48%) |
*, # are calculated out of 613 PEGs. KEGG: Kyoto Encyclopedia of Genes and Genomes.
Figure 5A bar plot of the KEGG pathways of the human innate immunity-related proteins, which are interacting with the core proteins of the 56 PHBifs. The pathway legends corresponding to the bars (from bottom to top) are given on the right panel.