| Literature DB >> 31509535 |
Charles H King1,2, Hiral Desai1, Allison C Sylvetsky3, Jonathan LoTempio4,5, Shant Ayanyan1, Jill Carrie1, Keith A Crandall6, Brian C Fochtman1, Lusine Gasparyan1, Naila Gulzar1, Paul Howell7, Najy Issa3, Konstantinos Krampis8, Lopa Mishra9, Hiroki Morizono5, Joseph R Pisegna10, Shuyun Rao9, Yao Ren1, Vahan Simonyan1, Krista Smith1, Sharanjit VedBrat7, Michael D Yao11,12, Raja Mazumder1,12.
Abstract
A comprehensive knowledge of the types and ratios of microbes that inhabit the healthy human gut is necessary before any kind of pre-clinical or clinical study can be performed that attempts to alter the microbiome to treat a condition or improve therapy outcome. To address this need we present an innovative scalable comprehensive analysis workflow, a healthy human reference microbiome list and abundance profile (GutFeelingKB), and a novel Fecal Biome Population Report (FecalBiome) with clinical applicability. GutFeelingKB provides a list of 157 organisms (8 phyla, 18 classes, 23 orders, 38 families, 59 genera and 109 species) that forms the baseline biome and therefore can be used as healthy controls for studies related to dysbiosis. This list can be expanded to 863 organisms if closely related proteomes are considered. The incorporation of microbiome science into routine clinical practice necessitates a standard report for comparison of an individual's microbiome to the growing knowledgebase of "normal" microbiome data. The FecalBiome and the underlying technology of GutFeelingKB address this need. The knowledgebase can be useful to regulatory agencies for the assessment of fecal transplant and other microbiome products, as it contains a list of organisms from healthy individuals. In addition to the list of organisms and their abundances, this study also generated a collection of assembled contiguous sequences (contigs) of metagenomics dark matter. In this study, metagenomic dark matter represents sequences that cannot be mapped to any known sequence but can be assembled into contigs of 10,000 nucleotides or higher. These sequences can be used to create primers to study potential novel organisms. All data is freely available from https://hive.biochemistry.gwu.edu/gfkb and NCBI's Short Read Archive.Entities:
Mesh:
Year: 2019 PMID: 31509535 PMCID: PMC6738582 DOI: 10.1371/journal.pone.0206484
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Human Microbiome Project (HMP) and GW participant statistics.
| Feature | White | Other | Asian | Black | Male | Female |
|---|---|---|---|---|---|---|
| HMP samples | 39 | 2 | 7 | 2 | 30 | 20 |
| GW samples | 24 | 0 | 6 | 18 | 21 | 27 |
Fig 1Metagenomic analysis pipeline for 3 samples.
Step 1: CensuScope is run for each read file against Filtered-nt. Each of the aligned organism approved by manually check is added to the GutFeelingKB and it is versioned. Step 2: For the final analysis the raw read files are mapped against GutFeelingKB organism sequences using HIVE-hexagon. Outputs are tabulated as relative abundance percentages. Unaligned reads from each sample were assembled using IDBA-UD. Contigs that were over 10,000 nucleotides long had their headers modified to include the following: sample ID, numbered according to length (long to short), and additional metadata data about the participant. These contigs are available as a download at (https://hive.biochemistry.gwu.edu/gfkb).
List of 109 baseline species and their GenBank accessions found in healthy human gut.
| Organism name | GenBankAC | Organism name | GenBankAC | Organism name | GenBankAC |
|---|---|---|---|---|---|
| Acidaminococcus fermentans (Bac/Firmicute) | CP001859 | Clostridium saccharolyticum (Bac/Firmicute) | CP002109, | Odoribacter splanchnicus (Bac/CFB_bac) | CP002544 |
| Acidaminococcus intestine (Bac/Firmicute) | CP003058 | Coprococcus catus (Bac/Firmicute) | FP929038 | Ornithobacterium rhinotracheale (Bac/CFB_bac) | CP006828 |
| Acidovorax sp KKS102 (Bac/Beta-proteo) | CP003872 | Coprococcus sp ART55/1 (Bac/Firmicute) | FP929039 | Oscillibacter valericigenes (Bac/Firmicute) | AP012044 |
| Adlercreutzia equolifaciens (Bac/ActnBac) | AP013105 | Cutibacterium acnes (Bac/ActnBac) (100;0.004) | CP003084 | Paenibacillus sabinae (Bac/Firmicute) | CP004078 |
| Akkermansia muciniphila (Other Bacteria) | CP001071 | Eggerthella lenta (Bac/ActnBac) | CP001726 | Paeniclostridium sordellii (Bac/Firmicute) | LN679998, |
| Alistipes finegoldii (Alistipes finegoldii) | CP003274 | Eggerthella sp. YY7918 (Bac/ActnBac) | AP012211 | Parabacteroides distasonis (Bac/CFB_bac) | CP000140 |
| Alistipes shahii (Bac/CFB_bac) | FP929032 | Enterococcus faecium (Bac/Firmicute) | CP003351, | Parvimonas micra (Bac/Firmicute) | CP009761 |
| Anaerococcus prevotii (Bac/Firmicute) | CP001708 | Enterococcus hirae (Bac/Firmicute) | CP003504 | Porphyromonas asaccharolytica (Bac/CFB_bac) | CP002689 |
| Anaerostipes hadrus (Bac/Firmicute) | FP929061 | Escherichia coli (Bac/Gamma-proteo) | CP009859, | Porphyromonas gingivalis (Bac/CFB_bac) | AP009380 |
| Bacillus methanolicus (Bac/Firmicute) | CP007739 | Escherichia coli O104:H4 (Bac/Gamma-proteo) | CP004009 | Prevotella dentalis (Bac/CFB_bac) | CP003368, |
| Bacteroides cellulosilyticus (Bac/CFB_bac) | CP012801 | Escherichia coli O83:H1 (Bac/Gamma-proteo) | CU651637 | Prevotella denticola (Bac/CFB_bac) | CP002589 |
| Bacteroides dorei (Bac/CFB_bac) | CP007619, | Ethanoligenens harbinense (Bac/Firmicute) | CP002400 | Prevotella intermedia (Bac/CFB_bac) | AP014597, |
| Bacteroides fragilis (Bac/CFB_bac) | FQ312004, | Eubacterium eligens (Bac/Firmicute) | CP001104, | Prevotella melaninogenica (Bac/CFB_bac) | CP002122, |
| Bacteroides helcogenes (Bac/CFB_bac) | CP002352 | Eubacterium limosum (Bac/Firmicute) | CP002273 | Prevotella ruminicola (Bac/CFB_bac) | CP002006 |
| Bacteroides ovatus (Bac/CFB_bac) | CP012938 | [Eubacterium] rectale (Bac/Firmicute) | FP929042, | Prevotella sp oral taxon 299 (Bac/CFB_bac) (100;0.06) | CP003666 |
| Bacteroides salanitronis (Bac/CFB_bac) (100;0.48) | CP002530 | [Eubacterium] siraeum (Bac/Firmicute) (100;0.75) | FP929044, | Raoultella ornithinolytica (Bac/Gamma-proteo) | CP004142 |
| Bacteroides sp. CAG:98 (Bac/CFB_bac) | CP008741 | Faecalibacterium prausnitzii (Bac/Firmicute) | FP929045, | Roseburia hominis (Bac/Firmicute) | CP003040 |
| Bacteroides thetaiotaomicron (Bac/CFB_bac) | AE015928, | Faecalitalea cylindroides (Bac/Firmicute) | FP929041 | Roseburia intestinalis (Bac/Firmicute) | FP929049, |
| Bacteroides vulgatus (Bac/CFB_bac | CP000139 | Fermentimonas caenicola (Bac/CFB_bac) | LN515532 | Rubinisphaera brasiliensis (Bac/Plnctmy) | CP002546 |
| Bacteroides xylanisolvens (Bac/CFB_bac) | FP929033 | Gardnerella vaginalis (Bac/ActnBac) | CP001849 | Ruminococcus bicirculans (Bac/Firmicute) | HF545616, |
| Barnesiella viscericola (Bac/CFB_bac) | CP007034 | Gordonibacter pamelaeae (Bac/ActnBac) | FP929047 | Ruminococcus bromii (Bac/Firmicute) | FP929051 |
| Bifidobacterium adolescentis (Bac/ActnBac) | CP007443, | Haemophilus parainfluenzae (Bac/Gamma-proteo) | FQ312002 | Ruminococcus champanellensis (Bac/Firmicute) | FP929052 |
| Bifidobacterium animalis (Bac/ActnBac) | CP009045 | Intestinimonas butyriciproducens (Bac/Firmicute) | CP011307 | Ruminococcus sp SR1/5 (Bac/Firmicute) | FP929053 |
| Bifidobacterium bifidum (Bac/ActnBac) | CP010412, | Klebsiella aerogenes (Bac/Gamma-proteo) | FO203355, | Ruminococcus torques (Bac/Firmicute) | FP929055 |
| Bifidobacterium breve (Bac/ActnBac) | CP006715, | Klebsiella michiganensis (Bac/Gamma-proteo) | CP004887 | Sphingobacterium faecium (Bac/CFB_bac) | LK931720 |
| Bifidobacterium dentium (Bac/ActnBac) | AP012326 | Klebsiella pneumoniae (Bac/Gamma-proteo) | CP009208 | Streptococcus mitis (Bac/Firmicute) | FN568063 |
| Bifidobacterium kashiwanohense (Bac/ActnBac) | AP012327, | Klebsiella variicola (Bac/Gamma-proteo) | CP001891 | Streptococcus parasanguinis (Bac/Firmicute) | CP002843, |
| Bifidobacterium longum (Bac/ActnBac) | AP014658, | Lachnoclostridium phytofermentans (Bac/Firmicute) | CP000885 | Streptococcus pasteurianus (Bac/Firmicute) | AP012054 |
| Bifidobacterium thermophilum (Bac/ActnBac) | CP004346 | Lactobacillus acidophilus (Bac/Firmicute) | CP005926 | Streptococcus salivarius (Bac/Firmicute) | CP009913, |
| Blautia obeum (Bac/Firmicute) | FP929054 | Lactobacillus paracasei (Bac/Firmicute) | AP012541 | Streptococcus sp I-P16 (Bac/Firmicute) | CP006776 |
| butyrate-producing bacterium SM4/1 (Bac/Firmicute) | FP929060 | Lactobacillus rhamnosus (Bac/Firmicute) | CP003094 | Streptococcus suis (Bac/Firmicute) | CP000837 |
| butyrate-producing bacterium SS3/4 (Bac/Firmicute) | FP929062 | Lactobacillus ruminis (Bac/Firmicute) | CP003032 | Streptococcus thermophilus (Bac/Firmicute) | CP000024, |
| Campylobacter coli (Bac/Delta-Epsilon-proteo) | CP007180 | Lactococcus lactis (Bac/Firmicute) | CP006766 | Tannerella forsythia (Bac/CFB_bac) | CP003191 |
| Campylobacter hominis (Bac/Delta-Epsilon-proteo) | CP000776 | Leuconostoc citreum (Bac/Firmicute) | DQ489736 | Treponema succinifaciens (Other Bacteria) | CP002631 |
| Candidatus Methanomassiliicoccus intestinalis (Arch/Euryar) | CP005934 | Mageeibacillus indolicus (Bac/Firmicute) | CP001850 | Veillonella parvula (Bac/Firmicute) | CP001820 |
| Citrobacter freundii (Bac/Gamma-proteo) | CP007557 | Megamonas sp Calf98-2 (Bac/Firmicute) | FP929048 | ||
| Clostridioides difficile (Bac/Firmicute) | CP003939, | Methanobrevibacter smithii (Arch/Euryar) | CP000678 |
1 Percentage of samples this organism is present in.
2 Average percent relative abundance of this organism.
Fig 2Stacked bar plot of phylogenetic composition of all microbiome taxa in this study collapsed at the phyla level in fecal samples.
Green bars represent Firmicutes and the blue represent Bacteroidetes, the two most abundant bacterial families. For aesthetic purposes the samples (n = 98, bottom) were sorted according to their composition of Bacteroidetes and Firmicutes to demonstrate how the baseline gut microbiome results from this study could be used in conjunction with results from past studies.
Fig 3Correlation between bacterial organism and nutrient data.
(A) Bifidobacterium is positively correlated with dietary protein intake, specifically vegetable protein, present in vegetables such as broccoli, brussel sprouts, beans, peas, asparagus and beans. (B) Akkermansia is positively associated with body mass index (BMI). (C) Bacteriodes ovatus is positively correlated with daily calorie intake. (D) Bacteriodes ovatus is negatively correlated with body weight.
Fig 4The range of correlation for all features that have been measured for each of the GW samples.
Each line is a graph of the min and max values using a Cosine Similarity coefficient correlation. A positive value means strong correlation, and a negative value means strong anticorrelation, whereas zero means absolutely no correlation. Given the size of sample pool of 16, 0.7 is taken as the marginal threshold for evidence of some degree of correlation. Each feature that had a correlation with any organism is highlighted in blue. For example, some characteristics such as fat intake have anticorrelation with members of Campulobacter jejuni and Eubacterium family.
Fig 5FecalBiome Reporting Template.
Personal Information section of the report contains information about the individual who had a sample sequenced, as well as the individual who ordered the sequence. It contains information about the pipeline used for analysis, as well as the sample number for ease of retrieval. Result section contains microbes representing the most abundant organisms which comprise the top 50% of inhabitants. Organismal Comment section includes information from the GutFeelingKB which pertains to the potential function of that organism.