| Literature DB >> 21272046 |
Abstract
AIMS: The aim of this study was to develop and demonstrate an approach for describing the diversity of human pathogenic viruses in an environmentally isolated viral metagenome. METHODS ANDEntities:
Mesh:
Year: 2011 PMID: 21272046 PMCID: PMC3055918 DOI: 10.1111/j.1472-765X.2011.03014.x
Source DB: PubMed Journal: Lett Appl Microbiol ISSN: 0266-8254 Impact factor: 2.858
Human viral pathogens included in the in silico study
| Virus | Nucleic acid | Genome size (nt) | Accession number |
|---|---|---|---|
| Adenovirus | dsDNA | 34 794 | AC_000019 |
| Astrovirus | ssRNA | 6813 | NC_001943 |
| Coronavirus | ssRNA | 27 317 | NC_002645 |
| Hepatitis A virus | ssRNA | 7478 | NC_001489 |
| Norovirus | ssRNA | 7654 | NC_001959 |
| Parechovirus | ssRNA | 7348 | NC_001897 |
| Polyomavirus JC | dsDNA | 5130 | NC_001699 |
| Respiratory Syncytial virus | ssRNA | 15 225 | NC_001781 |
| Rhinovirus | ssRNA | 7152 | NC_001617 |
| Rotavirus | dsRNA | 17 448 | NC_011507* |
*Segment 1, successive segments also included.
Figure 1Box plot of total classification errors for ten human viruses according to read length and annotation method. Total errors include both ambiguous and missing identifications. Groupings are by the read length (100, 200, 400 nt), the BLAST search program (BLASTn, BLASTx, tBLASTx), and the database where ‘nt’ represents nucleotide database, ‘nr’ represents amino acid database, and ‘v’ represents virus only database. In each box the centreline represents the median, the top and bottom of the box represent the 25th and 75th error percentiles, and the lines represent the data spread. Outliers, marked by circles, were outside three standard deviations of the median. Outliers for Rotavirus were >80% error for the BLASTx nr and tBLASTx nt cases and were excluded from the graph. Complete results for each individual virus are listed in Table S2.
Human pathogenic viruses identified in the class B biosolid virome
| Virus | Nucleic acid | Genome length (nt) | Number of sequences identified |
|---|---|---|---|
| Human herpesvirus 2 | dsDNA | 154 746 | 46 |
| Human herpesvirus 8 type P | dsDNA | 137 868 | 12 |
| Human herpesvirus 1 | dsDNA | 152 261 | 10 |
| Human herpesvirus 4 | dsDNA | 171 823 | 3 |
| Human herpesvirus 6A | dsDNA | 159 322 | 1 |
| Human coronavirus 229E | ssRNA | 27 317 | 9 |
| Human coronavirus HKU1 | ssRNA | 29 926 | 1 |
| Tanapox virus | dsDNA | 144 565 | 9 |
| Orf virus | dsDNA | 139 962 | 8 |
| Human parechovirus | ssRNA | 7348 | 7 |
| Human adenovirus D | dsDNA | 35 083 | 2 |
| Human adenovirus E | dsDNA | 35 994 | 1 |
| Human adenovirus type 1 | dsDNA | 36 001 | 1 |
| Aichi virus | ssRNA | 8521 | 1 |
| Hepatitis C virus genotype 1 | ssRNA | 9646 | 1 |
| Torque Teno Virus‐like minivirus | ssDNA | 2916 | 1 |
Figure 2Relative abundance of pathogenic viruses in biosolid virome normalized by genome size and the abundance of adenovirus. Inset: Pie chart of sequence identifications (n = 51 000). Human viral pathogens represent <0·1% of total sequences.