| Literature DB >> 27375355 |
Katherine Bruder1, Kema Malki1, Alexandria Cooper1, Emily Sible1, Jason W Shapiro2, Siobhan C Watkins1, Catherine Putonti3.
Abstract
Advances in bioinformatics and sequencing technologies have allowed for the analysis of complex microbial communities at an unprecedented rate. While much focus is often placed on the cellular members of these communities, viruses play a pivotal role, particularly bacteria-infecting viruses (bacteriophages); phages mediate global biogeochemical processes and drive microbial evolution through bacterial grazing and horizontal gene transfer. Despite their importance and ubiquity in nature, very little is known about the diversity and structure of viral communities. Though the need for culture-based methods for viral identification has been somewhat circumvented through metagenomic techniques, the analysis of metaviromic data is marred with many unique issues. In this review, we examine the current bioinformatic approaches for metavirome analyses and the inherent challenges facing the field as illustrated by the ongoing efforts in the exploration of freshwater phage populations.Entities:
Keywords: bacteriophages; environmental metagenomics; freshwater; metaviromics
Year: 2016 PMID: 27375355 PMCID: PMC4915788 DOI: 10.4137/EBO.S38549
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1General protocol for metaviromic data analyses. Commonly used resources are listed for individual steps.
Figure 2Composition of the annotated host for the current collection of complete phage genomes available. Other taxa include all genera/families not listed here.
Published freshwater metavirome studies of phages.
| SAMPLE SITE | DATA LOCATION | SEQUENCING TECHNOLOGY | DATASET SIZE (Mbp) | ANALYSIS METHOD | % OF SEQUENCES ABLE TO BE IDENTIFIED |
|---|---|---|---|---|---|
| Lake Limnopolar (RNA) | SRA: SRP044919 | 454 | 162.9 | BLAST | 5.3–15.3 |
| Lake Limnopolar (DNA) | SRA: SRP000593 | 454 | 24.6 | BLAST | 12.4 |
| Lake Bourget and Lake Pavin (DNA) | SRA: ERP000339 | 454 | 706.2 | MetaVir | 14.3–26.4 |
| Lake Needwood (RNA) | NCBI: ADVU00000000.1, ADVT00000000.1 | 454 | 6.7 | BLAST | 34 (included hits to viral, bacterial, archaeal, and eukaryotic sequences) |
| Lake Michigan (DNA) | SRA: SRP042189 | Illumina | 23.7 | 2013 Samples: MetaVir | 2013 Samples: 6.9 |
| Florida potable water and reclaimed water (DNA and RNA) | SRA: SRP000673 | 454 | 371.6 | BLAST | Potable DNA: 56 |
| Feitsui Reservoir (DNA) | SRP009395 | 454 | 119.9 | BLAST | 31.7 |
| 4 Kent Sea Tilapia Ponds (DNA) | SRA: SRP000139, SRP000138, SRP000163 | 454 | 73.1 | BLAST | 2.9 |
| Ponds of the Mauritanian Sahara (DNA) | MG-RAST: 4446033.3, 4445718.3, 4445716.3, 4445715.3 | 454 | 82.8 | MG-RAST | 16.79–29.5 |
| Lake Ontario and Lake Erie (DNA) | SRA: SRP060006 | Illumina | 4986 | MetaVir | ∼25 |
| Artic fresh water viromes (DNA) | SRA: ERS396648 | Illumina/454 | ∼7300 | BLAST | 9.8 |
| Lake Matoaka (DNA) | MetaVir | 454 | 48.4 | MG-RAST | 23.9–25.7 |
| Lake Lough Neagh (DNA) | SRA: SRP062094 | Illumina | ∼1200 | MetaVir | 14.6 (MetaVir) ∼15% (MG-RAST) |
Note: Locations of sites sampled are shown in Figure 3.
Figure 3Locations of the freshwater metaviromes in Table 1.
Figure 4Phage–host populations within Lake Michigan nearshore waters.
Notes: Left: prevalence of bacterial phyla as determined by 16S rRNA gene sequencing.70 Right: expected prevalence of bacterial phyla based on the detection of phages annotated as infecting particular bacterial host species.66 The two sampling sites, along Chicago’s shoreline, are Montrose Beach (located 6.5 miles north of downtown Chicago) and 57th Street Beach (located 7.5 miles south of downtown Chicago). Sampling was conducted within the recreational beach waters.