| Literature DB >> 23562481 |
Kate S Baker1, Richard M Leggett, Nicholas H Bexfield, Mark Alston, Gordon Daly, Shawn Todd, Mary Tachedjian, Clare E G Holmes, Sandra Crameri, Lin-Fa Wang, Jonathan L Heeney, Richard Suu-Ire, Paul Kellam, Andrew A Cunningham, James L N Wood, Mario Caccamo, Pablo R Murcia.
Abstract
Viral emergence as a result of zoonotic transmission constitutes a continuous public health threat. Emerging viruses such as SARS coronavirus, hantaviruses and henipaviruses have wildlife reservoirs. Characterising the viruses of candidate reservoir species in geographical hot spots for viral emergence is a sensible approach to develop tools to predict, prevent, or contain emergence events. Here, we explore the viruses of Eidolon helvum, an Old World fruit bat species widely distributed in Africa that lives in close proximity to humans. We identified a great abundance and diversity of novel herpes and papillomaviruses, described the isolation of a novel adenovirus, and detected, for the first time, sequences of a chiropteran poxvirus closely related with Molluscum contagiosum. In sum, E. helvum display a wide variety of mammalian viruses, some of them genetically similar to known human pathogens, highlighting the possibility of zoonotic transmission.Entities:
Mesh:
Year: 2013 PMID: 23562481 PMCID: PMC3667569 DOI: 10.1016/j.virol.2013.03.014
Source DB: PubMed Journal: Virology ISSN: 0042-6822 Impact factor: 3.616
Fig. 1Bioinformatic analysis pipeline for 15,155,461 assembled contigs. (A) Contigs assembled from sequencing reads by different de novo assemblers (starred, denoted by colours) were consolidated by sequential comparisons (numbered, curved green arrows) and removal (red arrows) of duplicate sequences. The size of charts is proportional to the number of contigs. (B) Consolidated sequences subject to sequential BLAST comparison with automated taxonomic classification to identify suspect-viral sequences. Then sequences were excluded manually on the basis of related viral family and curation to identify a final set of viral sequences. Proportions of sequences assembled by each algorithm (coloured as in A) and from each sample type (coloured as in the inset key) before and after length-exclusion are shown in the stacked charts. (C) Proportions of 1,363 mammalian-virus related contigs by assembly algorithm, sample-type and identification algorithm (shaded as in B) are shown in the stacked chart. Proportions by viral family are shown in the chart on the right. Number of contigs related to members of each family are shown in parentheses after the viral family names, which are grouped by genome type. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Number and length distributions of contiguous sequences by assembler. Metrics for total assembled sequences are shown for each sample type.
| Sample type | Contiguous sequences | Assembler | |||
|---|---|---|---|---|---|
| ABySS | Velvet | MetaIDBA | MetaCortex | ||
| Urine | Number | 157,518 | 28,886 | 4,597 | 819,081 |
| Average length (bp) | 91 | 222 | 94 | 79 | |
| Largest (bp) | 6,166 | 4,127 | 2,744 | 5,704 | |
| N50 (bp) | 119 | 244 | 78 | 76 | |
| Throat | Number | 621,363 | 113,317 | 217,483 | 1,917,401 |
| Average length (bp) | 67 | 160 | 100 | 79 | |
| Largest (bp) | 14,875 | 5,392 | 14,173 | 15,292 | |
| N50 (bp) | 75 | 175 | 78 | 76 | |
| Lung | Number | 1,496,223 | 706,515 | 14,007 | 9,358,870 |
| Average length (bp) | 69 | 118 | 86 | 77 | |
| Largest (bp) | 3,955 | 783 | 2,745 | 1,980 | |
| N50 (bp) | 76 | 116 | 73 | 76 | |
Number of contiguous sequences (by assembler and sample type) assembled, removed and retained during contig consolidation. The final four rows represent those sequences in the consolidated contiguous set, and the percentage of these >76 bp in length that underwent BLAST identification is shown in parentheses in the table.
| Number of contiguous sequences | Assembler | Sample | |||
|---|---|---|---|---|---|
| Process | Outcome | Urine | Throat | Lung | |
| Assembly | Total generated (% in consolidated) | Velvet | 28,886 (60) | 113,317 (60) | 706,515 (87) |
| ABySS | 157,518 (22) | 621,363 (14) | 1,496,223 (16) | ||
| MetaIDBA | 4,597 (97) | 217,483 (94) | 14,007 (100) | ||
| MetaCortex | 819,081 (63) | 1,917,401 (63) | 9,358,870 (83) | ||
| Comparison 1 Velvet vs. ABySS | Removed | Velvet | 9,268 | 23,593 | 51,218 |
| ABySS | 46,253 | 165,256 | 439,302 | ||
| Retained | Velvet | 19,618 | 89,724 | 655,297 | |
| ABySS | 111,265 | 456,107 | 1,056,921 | ||
| Total retained | 130,883 | 545,831 | 1,712,218 | ||
| Comparison 2: Velvet/ABySS vs. MetaIDBA | Removed | Velvet | 550 | 13,315 | 877 |
| ABySS | 17,516 | 233,758 | 26,886 | ||
| MetaIDBA | 45 | 7,463 | 0 | ||
| Retained | Velvet | 19,068 | 76,409 | 654,420 | |
| ABySS | 93,749 | 222,349 | 1,030,035 | ||
| MetaIDBA | 4,552 | 210,020 | 14,007 | ||
| Total retained | 117,369 | 508,418 | 1,698,462 | ||
| Comparison 3: Velvet/ABySS/MetaIDBA vs. MetaCortex | Removed | Velvet | 1,776 | 8,648 | 41,201 |
| ABySS | 58,594 | 137,011 | 786,165 | ||
| MetaIDBA | 85 | 5,871 | 49 | ||
| MetaCortex | 303,863 | 701,171 | 1,620,716 | ||
| Retained (%>76 bp) | Velvet | 17,292 (100) | 67,761 (99) | 613,219 (98) | |
| ABySS | 35,155 (63) | 85,338 (42) | 243,870 (51) | ||
| MetaIDBA | 4,467 (33) | 204,149 (30) | 13,958 (36) | ||
| MetaCortex | 515,218 (7) | 1,216,230 (9) | 7,738,154 (6) | ||
| Total retained | 572,132 | 1,573,478 | 8,609,201 | ||
Identification of viral sequences (divided by viral family) according to sample type, assembler and identification algorithm.
| Herpes | Papilloma | Adeno | Pox | Polyoma | Picorna | Retro | Parvo | Total (%) | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Assembly algorithm | ABySS | 188 | 111 | 18 | 12 | 3 | 74 | 3 | 409 (30%) | |
| MetaCortex | 113 | 94 | 29 | 9 | 1 | 69 | 315 (23%) | |||
| Meta-IDBA | 27 | 14 | 2 | 4 | 2 | 49 (4%) | ||||
| Velvet | 211 | 189 | 21 | 15 | 1 | 3 | 145 | 5 | 590 (43%) | |
| Sample | Lung | 8 | 1 | 18 | 1 | 235 | 263 (19%) | |||
| Throat sw. | 531 | 405 | 18 | 38 | 1 | 53 | 8 | 1054 (77%) | ||
| Urine | 2 | 32 | 6 | 4 | 2 | 46 (4%) | ||||
| ID algorithm | BLASTn | 14 | 9 | 36 | 5 | 4 | 6 | 1 | 75 (6%) | |
| BLASTx | 454 | 343 | 22 | 30 | 1 | 3 | 258 | 4 | 1115 (82%) | |
| tBLASTx | 71 | 56 | 10 | 3 | 28 | 5 | 173 (13%) | |||
| 539 (40%) | 408 (30%) | 68 (5%) | 38 (3%) | 1 (0%) | 7 (0%) | 292 (21%) | 10 (1%) | 1363 (100%) | ||
Fig. 2Presence of diverse herpesviruses in throat swab samples fromEidolon . Midpoint-rooted phylogenetic tree based on 166aa of the DNA polymerase protein encoded by three contigs (starred) and representatives from the Herpesviridae. Herpesvirinae subfamily clusters are shown. Posterior probability values are shown and the bar represents the expected number of amino acid substitutions per site.
Fig. 3Gene origin and diversity of papillomavirus sequences present in. (A) Top: schematic representation of Rousettus aegyptiacus papillomavirus 1 (RaPV-1) genome. Bottom: gene classification of 408 contigs identified as papillomaviruses. The pie chart shows the proportion of contigs related to each gene. The number of contigs is shown in brackets after the gene name. (B) Midpoint-rooted phylogenetic tree based on 157 aa of the major capsid (L1) protein encoded by six contigs (starred) and representatives from the Papillomaviridae. Posterior probability values are shown at each node and the bar represents the expected number of amino acid substitutions per site.
Fig. 4Isolation ofadenovirus 1 (AdV1). (A) Pteropus alecto primary kidney cells showing cytopathic effect (and control) at day 6 post infection with E. helvum AdV1. Magnification is 200 X (B) Transmission electron micrograph of a Pteropus alecto cell infected with E. helvum AdV1. Arrow indicates adenovirus particles observed in the nucleus (N). Bar represents 200 nm. Insert adenovirus particle negative stained from supernatant of the same culture. Insert bar represents 100 nm. (C) Midpoint-rooted phylogenetic tree based on alignment of hexon proteins (915aa) of E. helvum AdV1 (starred) and representative adenoviruses. Genera are demarcated by adjacent lines and names. Posterior probability values are shown and the bar represents the expected number of amino acid substitutions per site.
Fig. 5Phylogenetic analysis of anpoxviral sequence. Midpoint-rooted phylogenetic tree based on 286 aa of the major core protein encoded by a contig from the throat sample (starred) and representatives from the Poxviridae. Posterior probability values are shown and the bar represents the expected number of amino acid substitutions per site.
Fig. 6Classification and phylogenetic analysis of retroviral sequences present in. (A) The chart shows proportions of retroviral sequences related to different retroviral genera. The number of sequences related to each genus is shown in parentheses. (B) Retroviral sequences classified by the gene they encode. The top bar represents the genome organization of Friend-murine leukemia virus, with numbers representing genome nucleotide positions. The lower bar shows the proportion of 292 retroviral contigs that were related to each gene. The number of sequences related to each gene is shown in parentheses. (C) Midpoint-rooted phylogenetic tree based on 176aa of the Pro-Pol gene of representative retroviruses and a contig from the throat sample (starred), posterior probability values are shown at each node and the bar represents the expected number of amino acid substitutions per site.