| Literature DB >> 28595639 |
Sara M Wolff1, Melinda J Ellison2, Yue Hao3, Rebecca R Cockrum4, Kathy J Austin5, Michael Baraboo6, Katherine Burch7, Hyuk Jin Lee8, Taylor Maurer9, Rocky Patil1, Andrea Ravelo8, Tasia M Taxis10, Huan Truong3, William R Lamberson1, Kristi M Cammack11, Gavin C Conant12,13,14,15,16.
Abstract
BACKGROUND: Grazing mammals rely on their ruminal microbial symbionts to convert plant structural biomass into metabolites they can assimilate. To explore how this complex metabolic system adapts to the host animal's diet, we inferred a microbiome-level metabolic network from shotgun metagenomic data.Entities:
Keywords: metabolic network; metagenomics; vertebrate microbiome
Mesh:
Year: 2017 PMID: 28595639 PMCID: PMC5465553 DOI: 10.1186/s40168-017-0274-6
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1Merged host/microbe metabolic network. a Each node (circle) is a reaction in the host genome (left) or microbial metagenomes (right). Host nodes are colored purple if derived from an orthology association between sheep and humans, tan if from an ovine/bovine relationship or blue for the case of the added buyrate-employing pseudo-reaction (Materials and Methods). Edges are shared metabolites (network N ). In the center are nodes employing 23 potentially shared compounds between the host and microbes (set VFA + AA; Materials and Methods): the 10 most frequent metabolites (by microbe read count to their respective reactions) are individually colored. All nodes are organized by their distance from the other subnetwork: hence nodes employing an interface metabolite are at the center with a distance of 0. Microbial nodes are colored based on the normalized log2-fold difference in read count between the two diets (green: overabundant in the FORG diet; red: overabundant in CONC). Nodes whose normalized read counts did not differ significantly between the diets are shown in black (Materials and Methods). b The right half of part a, recolored based on the normalized animal-to-animal variance in read count for the FORG animals (see scale bar). c Same as for b but for the CONC animals. d Histogram giving the cumulative proportion of the total mapped reads for the two diets (green: FORG; red: CONC) at each level of the network. Note that the FORG animals have proportionally more reads mapped to more distant layers of the network
Read Statistics by Animal
| ID | Diet | Total readsa | Reads with valid paired ORFsb | # (%) reads that hit to nodesc | # of reads hitting multiple nodes | # (%) reads hit to sheep genomed | #OTU founde |
|---|---|---|---|---|---|---|---|
| 1003 | FORG | 16,779,099 | 14,521,805 | 271,571 (1.87%) | 220,794 | 58,830 (0.35%) | 109 |
| 1009 | FORG | 35,930,923 | 30,282,660 | 761,728 (2.52%) | 548,931 | 1,855,867 (5.17%) | 161 |
| 1127 | FORG | 41,120,479 | 35,920,407 | 570,177 (1.59%) | 459,714 | 63,121 (0.15%) | 137 |
| 1208 | FORG | 44,823,544 | 39,038,632 | 1,012,146 (2.59%) | 790,442 | 18,155 (0.04%) | 140 |
| 1248 | FORG | 22,698,997 | 19,869,280 | 673,066 (3.39%) | 441,001 | 2,804 (0.01%) | 127 |
| 1366 | FORG | 18,124,115 | 14,952,153 | 396,240 (2.65%) | 275,291 | 42,582 (0.23%) | 119 |
| 1397 | FORG | 32,221,706 | 28,176,562 | 657,645 (2.33%) | 486,817 | 62,527 (0.19%) | 137 |
| 7505 | FORG | 47,234,706 | 38,189,761 | 651,061 (1.70%) | 478,999 | 1,175,563 (2.49%) | 177 |
| 1026 | CONC | 29,835,213 | 24,169,312 | 642,601 (2.66%) | 780,456 | 7,564 (0.03%) | 108 |
| 1101 | CONC | 54,927,600 | 47,578,081 | 1,008,933 (2.12%) | 1,059,980 | 44,188 (0.08%) | 142 |
| 1111 | CONC | 26,710,771 | 23,016,865 | 362,506 (1.57%) | 486,293 | 167,325 (0.63%) | 137 |
| 1220 | CONC | 7,800,938 | 6,274,376 | 109,479 (1.74%) | 126,946 | 1,226 (0.02%) | 75 |
| 1239 | CONC | 42,216,924 | 36,551,070 | 751,236 (2.06%) | 775,268 | 24,083 (0.06%) | 138 |
| 1348 | CONC | 13,577,697 | 11,843,596 | 245,424 (2.07%) | 246,823 | 12,233 (0.09%) | 102 |
| 1396 | CONC | 18,274,753 | 15,860,653 | 278,963 (1.76%) | 339,002 | 7,330 (0.04%) | 124 |
| 7429 | CONC | 30,236,139 | 26,315,246 | 553,079 (2.10%) | 537,174 | 65,498 (0.22%) | 135 |
aTotal paired reads sequenced prior to quality filtering
bTotal number of paired reads passing read quality filtering and having a sufficiently long ORF in both members (Materials and Methods)
cNumber and percent of valid reads (previous column) that were mapped to nodes according to our criteria (Materials and Methods)
dNumber and percent of total reads that mapped to the sheep genome at 80% percent identity
eNumber of distinct OTUs identified previously in these sequences [18]
Diet and network position
| Groupa | Currency Cutoffb | Mean layer: FORGc | Mean layer: CONCc | Real Diff.: mean layersd | Max. Random differencee |
|
|---|---|---|---|---|---|---|
| VFA | N25 | 1.99 | 1.82 | 0.17 | 0.006 |
|
| N50 | 1.87 | 1.74 | 0.13 | 0.006 |
| |
| N100 | 1.78 | 1.64 | 0.14 | 0.005 |
| |
| VFA_AA | N25 | 0.828 | 0.66 | 0.16 | 0.003 |
|
| N50 | 0.87 | 0.75 | 0.12 | 0.004 |
| |
| N100 | 0.91 | 0.80 | 0.11 | 0.004 |
| |
| ALL | N25 | 0.51 | 0.45 | 0.06 | 0.002 |
|
| N50 | 0.46 | 0.41 | 0.04 | 0.002 |
| |
| N100 | 0.50 | 0.47 | 0.04 | 0.001 |
|
aInterface metabolite set (Materials and Methods)
bNetwork (e.g., currency cutoff; Materials and Methods)
cMean layer number for the reads mapped from FORG or CONC animals, respectively.
dDifference between the mean layer for FORG and CONC
eMaximum difference in the mean layer for the two diets seen when reads were randomized between the diets
f P-value for the test of the hypothesis that the two diets do not differ in mean layer. For this test, reads were randomly reassigned to diets and the mean layers recomputed 1000 times (Materials and Methods ). Values significant at P = 0.05 shown in bold.
Fig. 3Animal-to-animal taxonomic and network differences. a Principal component analysis of the OTU distributions across the 16 animals. The first two principal components (PCs) are shown, comprising 92% of the total variance. FORG animals are shown in green and CONC in red. Visually, it seems clear that the diet difference explains most of the variation in OTU distributions. An animated 3-dimensional version of this plot that includes PC#3 (3% of variance) is presented as Additional file 2. Each point is labeled with the breed of the animal in question: Su: Suffolk, Ra: Rambouillet, Ha: Hampshire. b Principal component analysis of the distribution of reads mapped to metabolic network nodes. The first two principal components (PCs) are shown, comprising 98% of the total variance. However, diet is no longer the main source of variation. Instead, principal component 1 separates three CONC animals (numbers 1220, 1239 and 1348; high RFI) from the rest of the dataset. Inspection of the node-level data suggests that these three animals are unusual in that they have higher than usual node-to-node variation in the number of mapped reads (namely a few nodes with a large number of mapped reads) and are also highly correlated with each other, unlike some of the other CONC animals with rather different profiles. An animated 3-dimensional version of this plot that includes PC#3 (1% of variance) is presented as Additional file 3. c Minimum and maximum pairwise node distances seen when reads were randomly and proportionally reassigned to each animal. On the y-axis is the same distance scale as y in panel D, on the x-axis is the proportion of simulations with a given minimum/maximum (Materials and Methods). The color scheme is as for (d) Dashed lines give the minimums and maximums seen in the real data of d. d Pairwise differences in distribution of reads mapped to OTUs (x-axis) and nodes (y-axis). FORG to FORG comparisons are shown in green, CONC to CONC in red, and FORG to CONC in blue. For each animal, a vector representing all mapped reads was normalized to unit length and then standard Euclidian distances computed between it and all other animals (Materials and Methods). For the FORG to FORG and CONC to CONC pairs, we computed the Pearson’s correlation of OTU and node distance and compared that value to that seen from randomized datasets (Materials and Methods). e As for c, except with the OTU distances. The x-axis gives OTU distances and the y-axis simulation frequencies
Fig. 2Association of the concentrations of three volatile fatty acids (VFAs) and of reads mapping to reactions involving them. On x is the concentration (mg/ml) of the VFA, on y is the fraction of reads mapping to reactions using that VFA relative to the total number of reads mapping uniquely to any reaction (e.g., the proportion of all mapped reads that involve that VFA). (a) Acetate, (b) Propionate, (c) Butyrate
Fig. 4Simulation of a shift in diet. We used simulated annealing to model a shift in diet from FORG to CONC (red hues). The average compound distribution of the eight CONC animals was used as a target and the eight FORG animals as the starting points (Materials and Methods). We used two different sampling schemes in the simulated annealing: Single-read exchanges moved reads from one node to another until the resulting compound vector was minimally different from the target (triangles), while paired read exchanges moved two reads at a time under the constraint that the two nodes that both reads originated from and the two nodes that they moved to had to, in each case, be connected by an edge (circles). Shown in green hues are the parallel simulations that started with FORG animals but used the average FORG compound vector as a target. On the x-axis we show the results of pairwise comparisons in compound distance between all combinations the eight simulated animals, using for each its best simulated annealing run. The open points show, for reference, the distribution seen in the real animals. On the y-axis are the pairwise node distances for these same comparisons