| Literature DB >> 29310579 |
Silvia Arboleya1,2,3, Francesca Bottacini1,4, Mary O'Connell-Motherway1,4, C Anthony Ryan1,5, R Paul Ross1,2,4, Douwe van Sinderen1,4, Catherine Stanton6,7.
Abstract
BACKGROUND: Bifidobacterium longum is a common member of the human gut microbiota and is frequently present at high numbers in the gut microbiota of humans throughout life, thus indicative of a close symbiotic host-microbe relationship. Different mechanisms may be responsible for the high competitiveness of this taxon in its human host to allow stable establishment in the complex and dynamic intestinal microbiota environment. The objective of this study was to assess the genetic and metabolic diversity in a set of 20 B. longum strains, most of which had previously been isolated from infants, by performing whole genome sequencing and comparative analysis, and to analyse their carbohydrate utilization abilities using a gene-trait matching approach.Entities:
Keywords: Bifidobacterium longum; Carbohydrates; Comparative genomes; Gene-trait-matching; Microbiota; Pan-genome
Mesh:
Year: 2018 PMID: 29310579 PMCID: PMC5759876 DOI: 10.1186/s12864-017-4388-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Bifidobacterium longum genomes sequenced and analysed in this study
| Genomes | Source of Isolation | Contigs | No. of ORFs | Genome Size (bp) | GC content (%) | No. of unique genes | Plasmids (Rep+) | Reference | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Subject | Age | Delivery | Feeding | ||||||||
| A | 27 w | CS | BM | 37 | 1987 | 2,418,994 | 60.0 | 60 | No | [ | |
| B | 1 w | NB | BM | 27 | 1992 | 2,417,784 | 60.2 | 4 | Yes | [ | |
| B | 1 w | NB | BM | 31 | 1918 | 2,346,522 | 60.0 | 3 | Yes | [ | |
| C | 4 w | NB | BM | 57 | 2032 | 2,452,211 | 59.6 | 6 | Yes | [ | |
| C | 27 w | NB | BM | 51 | 2026 | 2,419,982 | 59.8 | 0 | Yes | [ | |
| C | 27 w | NB | BM | 45 | 1994 | 2,395,158 | 60.1 | 12 | Yes | [ | |
| E | 1 w | CS | BM | 50 | 1927 | 2,364,041 | 60.1 | 3 | Yes | [ | |
| F | 27 w | CS | BM | 39 | 1874 | 2,317,071 | 59.8 | 6 | No | [ | |
| G | 27 w | NB | BM | 48 | 2151 | 2,532,540 | 60.0 | 1 | Yes | [ | |
| H | 1 w | NB | BM | 24 | 1761 | 2,228,807 | 59.8 | 0 | No | [ | |
| H | 4 w | NB | BM | 21 | 1766 | 2,223,352 | 59.8 | 1 | No | [ | |
| J | 4 w | NB | BM | 27 | 2084 | 2,477,750 | 59.9 | 6 | No | [ | |
| D | 1 w | NB | FM | 72 | 1899 | 2,337,438 | 60.1 | 17 | No | [ | |
| D | 1 w | NB | FM | 39 | 2189 | 2,562,703 | 59.7 | 23 | No | [ | |
| I | 1 w | NB | BM | 51 | 1883 | 2,310,288 | 60.2 | 14 | No | [ | |
| K | 25 y | ND | ND | 32 | 1992 | 2,393,969 | 60.4 | 23 | Yes | [ | |
| L | 3 d | ND | ND | 20 | 1987 | 2,448,630 | 60.2 | 8 | No | [ | |
| M | 64 y | ND | ND | 25 | 1852 | 2,330,371 | 59.9 | 9 | No | [ | |
| N | 3 d | ND | ND | 28 | 1941 | 2,382,360 | 59.9 | 24 | Yes | [ | |
| O | 4 d | ND | ND | 52 | 1970 | 2,396,960 | 60.2 | 7 | Yes | [ | |
w weeks, y years, d days, CS C-Section, NB natural birth, BM breast-milk, FM formula milk, ND not determined or unknown
Fig. 1Comparative analysis of B. longum genomes. a Venn diagram displaying core gene families obtained by MCL clustering, and unique genes of B. longum APC/DPC strains and B. longum complete genomes. b Hierarchical clustering heatmap representing the variability of B. longum in terms of presence/absence of gene families. c Pie chart indicating the percentage of variable and core with respect to the total of gene families, resulting from the MCL clustering algorithm. APC 1461: B. longum APC 1461; APC 1462: B. longum APC 1462; APC 1464: B. longum APC 1464; APC 1465: B. longum APC 1465; APC 1466: B. longum APC 1466; APC 1468: B. longum APC 1468; APC 1472: B. longum APC 1472; APC 1473: B. longum APC 1473; APC 1476: B. longum APC 1476; APC 1477: B. longum APC 1477; APC 1478: B. longum APC 1478; APC 1480: B. longum APC 1480; APC 1482: B. longum APC 1482; APC 1503: B. longum APC 1503; APC 1504: B. longum APC 1504; BLIJ: B. longum ssp. infantis ATCC15697; BLJ: B. longum ssp. longum JDM301; BLIF: B. longum ssp. longum 157F; BBL306: B. longum ssp. longum CCUG30698; BLLJ: B. longum ssp. longum JCM1217; B8809: B. longum ssp. longum NCIMB8809; BBMN68: B. longum ssp. longum BBMN68; BL105A: B. longum ssp. longum 105A; BL2705: B. longum ssp. longum NCC2705; BLGT: B. longum ssp. longum GT15; BIL: B. longum ssp. longum F8; BLD: B. longum ssp. longum DJO10A; BLNIAS: B. longum ssp. longum KACC9156; DPC 6316: B. longum DPC 6316; DPC 6317: B. longum DPC 6317; DPC 6320: B. longum DPC 6320; DPC 6321: B. longum DPC 6321; DPC 6323: B. longum DPC 6323
Fig. 2Pan-genome and core-genome of B. longum. a The pan-genome plot is represented by the accumulated number of new genes against the number of genomes added. b The core-genome plot is represented by the accumulated number of genes attributed to the core-genomes against the number of added genomes. The deduced mathematical function is reported. The red line represents the pan-genome and core-genome attributed to the 20 B. longum APC/DPC strains
Fig. 3Phylogenetic analysis of B. longum. Phylogenetic supertree showing the relationship between 73 complete and incomplete B. longum strains and B. breve UCC 2003 as an outlier. B. longum APC/DPC strains are coloured in red. Grey circles with different fillings represent isolates from the same infant
Fig. 4The predicted glycosyl hydrolases in B. longum. a Heatmap displaying the in silico prediction of the GH family members identified in the B. longum genomes. b Pie chart indicating the percentage of each GH family identified only in B. longum APC/DPC genomes
Fig. 5Evaluation of carbohydrate utilization by B. longum strains. a Heatmap showing the growth performance of B. longum APC/DPC strains on different carbon sources at 12 h. b Heatmap displaying the in silico gene-trait matching exercise performed based on the association between the presence/absence of GH families predicted and growth/no growth phenotype of the B. longum APC/DPC strains
Gene-trait matching with functions resulting from hierarchical clustering analysis
| Carbohydrate | Gene cluster | Functions |
|---|---|---|
| XYLO-OLIGOSACCHARIDES (XOS) | A | Hypothetical protein |
| Putative outer membrane protein | ||
| Galactoside O-acetyltransferase | ||
| Alpha-L-arabinofuranosidase | ||
|
| ||
| ABC transporter permease | ||
| ABC transporter permease | ||
| Lactose ABC transporter substrate-binding protein | ||
| LacI family transcriptional regulator | ||
| NADH-dependent butanol dehydrogenase 1 | ||
| ARABINAN | B |
|
|
| ||
|
| ||
|
| ||
| ARABINOXYLAN | C | ABC transporter, permease protein, probably fructooligosaccharide porter |
| ABC transporter, permease protein, probably fructooligosaccharide porter | ||
| ABC transporter, extracellular SBP, probably fructooligosaccharide porter | ||
|
| ||
| Lipase | ||
| LacI family transcriptional regulator | ||
|
| ||
|
| ||
|
| ||
|
| ||
| GALACTAN | D | Solute-binding protein of ABC transporter system for sugars galactan metabolism |
| ABC transporter permease | ||
| ABC transporter permease | ||
| Beta-galactosidase galactan metabolism | ||
| Transcriptional regulator LacI family galactan metabolism | ||
|
| ||
| 2,5-diketo-D-gluconic acid reductase | ||
| FUCOSYLLACTOSE (FL) | E | LacI family transcriptional regulator |
| putative ABC transporter permease | ||
| ABC transporter permease | ||
| ABC transporter substrate binding component | ||
| Mandelate racemase/muconate lactonizing protein | ||
| Short chain dehydrogenase | ||
| Hypothetical protein | ||
| Dihydrodipicolinate synthase | ||
| Predicted fucose isomerase | ||
|
| ||
|
|
Glycosyl hydrolases involved in carbohydrate utilization of each cluster are italicized
Fig. 6Carbohydrate utilization clusters predicted by GTM. Locus map showing gene clusters putatively involved in the utilization of various sugars by certain B. longum strains. Glycosyl hydrolases involved in carbohydrate utilization of each cluster are also highlighted (*)