| Literature DB >> 18398449 |
Stacy R Finkbeiner1, Adam F Allred, Phillip I Tarr, Eileen J Klein, Carl D Kirkwood, David Wang.
Abstract
Worldwide, approximately 1.8 million children die from diarrhea annually, and millions more suffer multiple episodes of nonfatal diarrhea. On average, in up to 40% of cases, no etiologic agent can be identified. The advent of metagenomic sequencing has enabled systematic and unbiased characterization of microbial populations; thus, metagenomic approaches have the potential to define the spectrum of viruses, including novel viruses, present in stool during episodes of acute diarrhea. The detection of novel or unexpected viruses would then enable investigations to assess whether these agents play a causal role in human diarrhea. In this study, we characterized the eukaryotic viral communities present in diarrhea specimens from 12 children by employing a strategy of "micro-mass sequencing" that entails minimal starting sample quantity (<100 mg stool), minimal sample purification, and limited sequencing (384 reads per sample). Using this methodology we detected known enteric viruses as well as multiple sequences from putatively novel viruses with only limited sequence similarity to viruses in GenBank.Entities:
Mesh:
Year: 2008 PMID: 18398449 PMCID: PMC2290972 DOI: 10.1371/journal.ppat.1000011
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Sample information.
| Sample | Year Collected | Age of Patient | # of High Quality Sequence Reads | # of Unique Reads | Average Unique Read Length (bp) |
| D01 | 2005 | 14 mo | 365 | 166 | 526 |
| D02 | 1998 | 10 mo | 193 | 87 | 536 |
| D03 | 1984 | NA | 302 | 281 | 506 |
| D04 | 1984 | 4 mo | 311 | 154 | 626 |
| D05 | 1980 | NA | 243 | 168 | 563 |
| D06 | 2003 | 11 mo | 153 | 132 | 393 |
| D07 | 1999 | 23 mo | 352 | 186 | 617 |
| D08 | 1999 | 35 mo | 302 | 167 | 255 |
| D09 | 1981 | NA | 302 | 294 | 491 |
| D10 | 1983 | 20 mo | 195 | 146 | 447 |
| D11 | 1978 | NA | 253 | 103 | 367 |
| D12 | 2005 | 8 mo | 198 | 129 | 300 |
Figure 1Categorization of sequence reads based on best tBLASTX scores (E-value: <10−5).
Pies on the left side of each box depict the categorization of sequences from individual samples by phylotype: viral (V); phage (P); bacterial (B); human (H); fungal (F); other (O); and unassigned (U). Pies on the right side of each box depict further characterization of viral sequences by viral families/taxa: Reoviridae (Reo); Caliciviridae (Calici); Astroviridae (Astro); anellovirus (Anello); picobirnavirus (Picobirna); Picornaviridae (Picorna); mimivirus (Mimi); Nodaviridae (Noda); Adenoviridae (Adeno); Parvoviridae (Parvo). Numbers in parentheses indicate the number of sequence reads in each category.
Selected sequence reads with limited BLAST identity to known viruses.
| Sample | Sequence Read Accession # | Identity to Top Hit | Top Hit (Accession #) | Virus Family/Taxa |
| D03 | ET065742 | 78% | Human picobirnavirus strain 1-CHN-97 (AF246939) | Picobirnavirus |
| D03 | ET065743 | 90% | Human coxsackievirus A19 (AF499641) | Picornaviridae |
| D06 | ET067042 | 74% | Human enterovirus 91 (AY697476) | Picornaviridae |
| D06 | ET067045 | 66% | TTV-like mini virus (AB026931) | Anellovirus |
| D06 | ET067040 | 79% | Snow Mountain virus (AY134748.1) | Caliciviridae |
| D06 | ET067041 | 88% | Norovirus C14 (AY845056.1) | Caliciviridae |
| D08 | ET065575 | 57% | Human astrovirus 4 (AY720891) | Astroviridae |
| D08 | ET065582 | 67% | Human astrovirus 5 (DQ028633) | Astroviridae |
| D08 | ET065578 | 45% | TT virus (AB041963) | Anellovirus |
| D09 | ET066010 | 35% | Epinephelus septemfasciatus nervous necrosis virus (AM085331) | Nodaviridae |
| D10 | ET066456 | 81% | Human picobirnavirus 2-GA-91 (AF245701) | Picobirnavirus |
Figure 2Phylogenetic analysis of highly divergent astrovirus-like sequence reads.
Maximum parsimony phylogenetic trees were generated by comparing the translated amino acid sequence of individual sequence reads to the corresponding sequences from known astroviruses. 1,000 replicates were generated with bootstrap values over 700 shown. A) Representative sequence read mapping to astrovirus serine protease ORF (Accession number ET065575); B) Representative sequence read mapping to astrovirus RNA polymerase (Accession number ET065582).
Figure 3Phylogenetic analysis of a highly divergent nodavirus-like sequence read.
Maximum parsimony phylogenetic trees were generated by comparing the translated amino acid sequence of one sequence read (Accession number ET066010) to the corresponding RNA polymerase sequences of nodaviruses. 1,000 replicates were generated with bootstrap values over 700 shown.