Literature DB >> 21383980

Host-associated and free-living phage communities differ profoundly in phylogenetic composition.

J Gregory Caporaso1, Rob Knight, Scott T Kelley.   

Abstract

Phylogenetic profiling has been widely used for comparing bacterial communities, but has so far been impossible to apply to viruses because of the lack of a single marker gene analogous to 16S rRNA. Here we developed a reference tree approach for matching viral sequences and applied it to the largest viral datasets available. The resulting technique, Shotgun UniFrac, was used to compare host-associated and non-host-associated phage communities (130 total metagenomes), and revealed a profound split similar to that found with bacterial communities. This new informatics approach complements analysis of bacterial communities and promises to provide new insights into viral community dynamics, such as top-down versus bottom-up control of bacterial communities by viruses in a range of systems.

Entities:  

Mesh:

Year:  2011        PMID: 21383980      PMCID: PMC3044705          DOI: 10.1371/journal.pone.0016900

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The phylogenetic composition of bacterial communities is primarily determined by whether they are found in host-associated or free-living environments [1]. Much less is known about the phylogenetic composition of viral communities, which may comprise most of the genetic diversity on Earth. If viral communities follow this pattern, microbial and viral community composition should be correlated, adding to recent evidence that phage predation can exert top-down control on microbial communities [2], [3]. The lack of a single marker gene in viral genomes complicates phylogenetic profiling of viral communities, a powerful technique for studying microbial communities, and previous studies have focused on profiling viral gene functions [4]. To complement these data with phylogenetic profiles of phage community composition, we developed Shotgun UniFrac (Figure 1). Shotgun UniFrac matches metagenomic reads against full phage genomes from the Phage Proteomic Tree [5] using BLAST. OTUs are assigned to reads by best hit, discarding reads with no significant hit, and UniFrac is applied using QIIME [6] and the Phage Proteomic Tree.
Figure 1

Schematic of the Shotgun UniFrac analysis pipeline.

Results

We applied Shotgun UniFrac to 130 phage metagenomes from diverse environments. As observed with microbial communities, the primary factor separating metagenomes was whether they were derived from a free-living or host-associated environment. Host-associated environments vary more than a variety of free-living communities (considering only matches to the subset of viruses in the reference tree), and phage communities from the same host species tended to cluster (Figure 2a).
Figure 2

Principal Coordinates plot of weighted Shotgun UniFrac distances between viral communities where each point represents a metagenome colored by (a) host type and (b) data source.

Our analysis also included 26 human feces phage metagenomes from 12 individuals with between 1 and 4 metagenomes per individual (recently presented in [7]). To include a metagenome in this analysis, we required a minimum of 200 reads assignable to a viral genome. We observed clustering of metagenomes by individual, although some aberrant clustering occurred (Figure 3a). This is likely due to the limited number of phage genomes currently available, which limits the resolution of Shotgun UniFrac (see Discussion). Confirming the observations of [7], [8] we found between-individual Shotgun UniFrac distances to be significantly greater than within-individual distances (Figure 3b; p = 3×10−23, one-tailed t-test; p<0.001, Monte Carlo t-test with 1000 iterations), suggesting stability in distal gut phage community membership over time.
Figure 3

(a) UPGMA clustering of individuals by weighted Shotgun UniFrac distances between metagenomes.

Cases where metagenomes from a single individual cluster monophyletically are highlighted in red. Cases where only a single metagenome for an individual was included are highlighted in blue. 1000 jackknife iterations were performed at a depth of 200 sequences per metagenome, and jackknife support values are provided for each node. The Reyes et al. analysis from which these samples were derived studied gut microbial communities from human twins and their mothers. The labels for each sample indicate the individual where: Fn corresponds to family number n; M corresponds to mother; and T1 and T2 refer to twin 1 and twin 2, respectively. (b) Histograms of within individual (grey) and between individual (pink) Shotgun UniFrac distances.

(a) UPGMA clustering of individuals by weighted Shotgun UniFrac distances between metagenomes.

Cases where metagenomes from a single individual cluster monophyletically are highlighted in red. Cases where only a single metagenome for an individual was included are highlighted in blue. 1000 jackknife iterations were performed at a depth of 200 sequences per metagenome, and jackknife support values are provided for each node. The Reyes et al. analysis from which these samples were derived studied gut microbial communities from human twins and their mothers. The labels for each sample indicate the individual where: Fn corresponds to family number n; M corresponds to mother; and T1 and T2 refer to twin 1 and twin 2, respectively. (b) Histograms of within individual (grey) and between individual (pink) Shotgun UniFrac distances.

Discussion

Taken together, our results suggest that phage communities mirror microbial communities, and that comparison of phage communities by phylogenetic identity of viral types, even with relatively few sequenced phage genomes available to assign sequences, can be a powerful complement to functional profiles of the communities. Collecting viral metagenomes, microbial metagenomes, and 16S reads from the same samples and comparing these data with techniques such as Procrustes analysis [9] will provide insight into fundamental parameters of microbial ecosystems, such as whether control occurs in a top-down or bottom-up manner. Currently the limiting factor in applying Shotgun UniFrac to phage data is the availability of phage genomes, because sequences not matching known genomes are excluded from the analysis. For some metagenome types less than 1% of the viral metagenomic sequences could be classified (Table 1, Table S1) resulting in relatively few sequences per metagenome for comparing communities. The UniFrac results presented in Figures 2 and 3 are based on exactly 200 sequences per metagenome. Data sets of this size are useful for comparing microbial communities [10] and phage communities (Figure 2), but increasing the database of sequenced phage genomes and their phylogenies will further enhance the resolution of these techniques. Better resolution will aid understanding the complex dynamics and large compositional shifts seen in the human infant microbiome and virome [11], [12] that might be due to predator-prey cycling leading to chaos. Understanding such disruptions might be key to developing an understanding of probiotics and a wide range of time-variable diseases, such as Crohn's disease.
Table 1

OTU assignment statistics by metagenome type.

Metagenome TypenMean fraction failed OTU assignmentsSt. Dev. fraction failed OTU assignmentsMedian fraction failed OTU assignmentsMin fraction failed OTU assignmentsMax fraction failed OTU assignmentsSequences (OTU assignment input)Sequences (OTU assignment output)
Free-living (thermophilic) 20.96750.00400.96750.96350.971530,624939
Northern Islands Coral 40.98510.00380.98480.98130.98931,079,05717,433
Mosquito 30.98980.00160.99090.98760.99101,612,87816,814
Human Feces 810.99080.01040.99290.94181.00001,357,35312,616
Porites compressa (coral) 60.98900.00680.99310.97600.9941238,1232,567
Free-living (mesophilic) 320.99310.00370.99340.98191.00007,471,89052,432
Human Lung 50.99700.00010.99700.99700.99711,728,3785,112

Materials and Methods

Viral community metagenomic data was compiled from CAMERA [13], MG-RAST [14], and study authors [7] (Table S2, Table S3). There was no community clustering by data source (Figure 2b). Sequences were assigned to source viral genomes using Shotgun UniFrac, an extension of the reference-based OTU picking strategy presented by [15], using the open source QIIME and PyCogent [16] toolkits. Shotgun UniFrac was applied against full phage genomes from the Phage Proteomic Tree, and the associated reference tree was used for phylogenetic beta diversity analysis. Sequences were assigned to a viral genome if they achieved an E-value of less than 0.001, resulting in the viral OTU table (Table S4). The viral OTU table was then sub-sampled to 200 sequences per metagenome (Table S5) to control for depth of coverage. The UniFrac diversity metric was applied to the sub-sampled viral OTU table using the Phage Proteomic Tree. The version of the Phage Proteomic Tree used here contains 651 tips built from fully sequenced phage genomes as described in [5]. Community clustering and within- versus between-individual Shotgun UniFrac distances were calculated using Weighted UniFrac. Shotgun UniFrac analysis, Principal Coordinates Analysis, distance calculations and plotting were all performed using QIIME, and Shotgun UniFrac is accessible in QIIME v1.2.0-dev using the pick_reference_otus_through_otu_table.py workflow. The number of input metagenomes by type were: Reclaimed water at discharge point (n = 1); Reclaimed water at point-of-use (n = 2); Freshwater stromatolite (n = 2); Hot Spring, Yellowstone National Park (n = 2); Potable water (n = 1); Saltern (medium salinity) (n = 5); Ocean (db:MG-RAST) (n = 4); Saltern (high salinity) (n = 3); Northern Islands Coral (n = 4); Marine stromatolite (n = 1); Ocean (db:CAMERA) (n = 4); Freshwater (n = 4); Human feces (n = 80); Saltern (low salinity) (n = 3); Healthy human lung (n = 2); Mosquito-associated (n = 3); Cystic fibrosis human lung (n = 3); Porites compressa (coral, wild and experimentally treated) (n = 6). Four overlapping metagenomes (Ocean (db:MG-RAST) and Ocean (db:CAMERA)), were used as controls to ensure that the source database did not affect the clustering results which is possible, for example, if one required preprocessing that the other did not. OTU assignment statistics by metagenome. (XLS) Click here for additional data file. Description of metagenome types and sources. (XLS) Click here for additional data file. Full QIIME metadata mapping file. (XLS) Click here for additional data file. Full viral OTU table (i.e., metagenome × viral OTU abundance matrix). These data were used in jackknifed weighted Shotgun UniFrac calculations (Figure 3a). (XLS) Click here for additional data file. Viral OTU table sub-sampled to 200 sequences per metagenome. These data were used in weighted UniFrac calculations (Figure 2 and Figure 3b). (XLS) Click here for additional data file.
  15 in total

1.  The Phage Proteomic Tree: a genome-based taxonomy for phage.

Authors:  Forest Rohwer; Rob Edwards
Journal:  J Bacteriol       Date:  2002-08       Impact factor: 3.490

2.  Functional metagenomic profiling of nine biomes.

Authors:  Elizabeth A Dinsdale; Robert A Edwards; Dana Hall; Florent Angly; Mya Breitbart; Jennifer M Brulc; Mike Furlan; Christelle Desnues; Matthew Haynes; Linlin Li; Lauren McDaniel; Mary Ann Moran; Karen E Nelson; Christina Nilsson; Robert Olson; John Paul; Beltran Rodriguez Brito; Yijun Ruan; Brandon K Swan; Rick Stevens; David L Valentine; Rebecca Vega Thurber; Linda Wegley; Bryan A White; Forest Rohwer
Journal:  Nature       Date:  2008-03-12       Impact factor: 49.962

3.  Viral diversity and dynamics in an infant gut.

Authors:  Mya Breitbart; Matthew Haynes; Scott Kelley; Florent Angly; Robert A Edwards; Ben Felts; Joseph M Mahaffy; Jennifer Mueller; James Nulton; Steve Rayhawk; Beltran Rodriguez-Brito; Peter Salamon; Forest Rohwer
Journal:  Res Microbiol       Date:  2008-05-01       Impact factor: 3.992

4.  Viral control of bacterial biodiversity--evidence from a nutrient-enriched marine mesocosm experiment.

Authors:  Ruth-Anne Sandaa; Laura Gómez-Consarnau; Jarone Pinhassi; Lasse Riemann; Andrea Malits; Markus G Weinbauer; Josep M Gasol; T Frede Thingstad
Journal:  Environ Microbiol       Date:  2009-06-24       Impact factor: 5.491

5.  Microbial community resemblance methods differ in their ability to detect biologically relevant patterns.

Authors:  Justin Kuczynski; Zongzhi Liu; Catherine Lozupone; Daniel McDonald; Noah Fierer; Rob Knight
Journal:  Nat Methods       Date:  2010-09-05       Impact factor: 28.547

6.  Development of the human infant intestinal microbiota.

Authors:  Chana Palmer; Elisabeth M Bik; Daniel B DiGiulio; David A Relman; Patrick O Brown
Journal:  PLoS Biol       Date:  2007-06-26       Impact factor: 8.029

7.  The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes.

Authors:  F Meyer; D Paarmann; M D'Souza; R Olson; E M Glass; M Kubal; T Paczian; A Rodriguez; R Stevens; A Wilke; J Wilkening; R A Edwards
Journal:  BMC Bioinformatics       Date:  2008-09-19       Impact factor: 3.169

8.  A core gut microbiome in obese and lean twins.

Authors:  Peter J Turnbaugh; Micah Hamady; Tanya Yatsunenko; Brandi L Cantarel; Alexis Duncan; Ruth E Ley; Mitchell L Sogin; William J Jones; Bruce A Roe; Jason P Affourtit; Michael Egholm; Bernard Henrissat; Andrew C Heath; Rob Knight; Jeffrey I Gordon
Journal:  Nature       Date:  2008-11-30       Impact factor: 49.962

9.  PyCogent: a toolkit for making sense from sequence.

Authors:  Rob Knight; Peter Maxwell; Amanda Birmingham; Jason Carnes; J Gregory Caporaso; Brett C Easton; Michael Eaton; Micah Hamady; Helen Lindsay; Zongzhi Liu; Catherine Lozupone; Daniel McDonald; Michael Robeson; Raymond Sammut; Sandra Smit; Matthew J Wakefield; Jeremy Widmann; Shandy Wikman; Stephanie Wilson; Hua Ying; Gavin A Huttley
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

Review 10.  Worlds within worlds: evolution of the vertebrate gut microbiota.

Authors:  Ruth E Ley; Catherine A Lozupone; Micah Hamady; Rob Knight; Jeffrey I Gordon
Journal:  Nat Rev Microbiol       Date:  2008-10       Impact factor: 60.633

View more
  13 in total

Review 1.  Sequencing our way towards understanding global eukaryotic biodiversity.

Authors:  Holly M Bik; Dorota L Porazinska; Simon Creer; J Gregory Caporaso; Rob Knight; W Kelley Thomas
Journal:  Trends Ecol Evol       Date:  2012-01-11       Impact factor: 17.712

2.  Modeling the infection dynamics of bacteriophages in enteric Escherichia coli: estimating the contribution of transduction to antimicrobial gene spread.

Authors:  Victoriya V Volkova; Zhao Lu; Thomas Besser; Yrjö T Gröhn
Journal:  Appl Environ Microbiol       Date:  2014-05-09       Impact factor: 4.792

3.  Microbial Signatures of Cadaver Gravesoil During Decomposition.

Authors:  Sheree J Finley; Jennifer L Pechal; M Eric Benbow; B K Robertson; Gulnaz T Javan
Journal:  Microb Ecol       Date:  2016-01-09       Impact factor: 4.552

4.  Composition and function of the pediatric colonic mucosal microbiome in untreated patients with ulcerative colitis.

Authors:  Rajesh Shah; Julia L Cope; Dorottya Nagy-Szakal; Scot Dowd; James Versalovic; Emily B Hollister; Richard Kellermayer
Journal:  Gut Microbes       Date:  2016-05-23

Review 5.  Experimental and analytical tools for studying the human microbiome.

Authors:  Justin Kuczynski; Christian L Lauber; William A Walters; Laura Wegener Parfrey; José C Clemente; Dirk Gevers; Rob Knight
Journal:  Nat Rev Genet       Date:  2011-12-16       Impact factor: 53.242

6.  Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes.

Authors:  Ramy K Aziz; Bhakti Dwivedi; Sajia Akhter; Mya Breitbart; Robert A Edwards
Journal:  Front Microbiol       Date:  2015-05-08       Impact factor: 5.640

7.  Elviz - exploration of metagenome assemblies with an interactive visualization tool.

Authors:  Michael Cantor; Henrik Nordberg; Tatyana Smirnova; Matthias Hess; Susannah Tringe; Inna Dubchak
Journal:  BMC Bioinformatics       Date:  2015-04-28       Impact factor: 3.169

8.  Environmental genes and genomes: understanding the differences and challenges in the approaches and software for their analyses.

Authors:  Marie Lisandra Zepeda Mendoza; Thomas Sicheritz-Pontén; M Thomas P Gilbert
Journal:  Brief Bioinform       Date:  2015-02-11       Impact factor: 11.622

9.  Conducting a microbiome study.

Authors:  Julia K Goodrich; Sara C Di Rienzi; Angela C Poole; Omry Koren; William A Walters; J Gregory Caporaso; Rob Knight; Ruth E Ley
Journal:  Cell       Date:  2014-07-17       Impact factor: 41.582

10.  Genome signature-based dissection of human gut metagenomes to extract subliminal viral sequences.

Authors:  Lesley A Ogilvie; Lucas D Bowler; Jonathan Caplin; Cinzia Dedi; David Diston; Elizabeth Cheek; Huw Taylor; James E Ebdon; Brian V Jones
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.