| Literature DB >> 29875243 |
Tristan P W Dennis1, Peter J Flynn2,3, William Marciel de Souza1,4, Joshua B Singer1, Corrie S Moreau2, Sam J Wilson1, Robert J Gifford5.
Abstract
A diverse range of DNA sequences derived from circoviruses (family Circoviridae) has been identified in samples obtained from humans and domestic animals, often in association with pathological conditions. In the majority of cases, however, little is known about the natural biology of the viruses from which these sequences are derived. Endogenous circoviral elements (CVe) are DNA sequences derived from circoviruses that occur in animal genomes and provide a useful source of information about circovirus-host relationships. In this study, we screened genome assemblies of 675 animal species and identified numerous circovirus-related sequences, including the first examples of CVe derived from cycloviruses. We confirmed the presence of these CVe in the germ line of the elongate twig ant (Pseudomyrmex gracilis), thereby establishing that cycloviruses infect insects. We examined the evolutionary relationships between CVe and contemporary circoviruses, showing that CVe from ants and mites group relatively closely with cycloviruses in phylogenies. Furthermore, the relatively random interspersion of CVe from insect genomes with cyclovirus sequences recovered from vertebrate samples suggested that contamination might be an important consideration in studies reporting these viruses. Our study demonstrates how endogenous viral sequences can inform metagenomics-based virus discovery. In addition, it raises doubts about the role of cycloviruses as pathogens of humans and other vertebrates.IMPORTANCE Advances in DNA sequencing have dramatically increased the rate at which new viruses are being identified. However, the host species associations of most virus sequences identified in metagenomic samples are difficult to determine. Our analysis indicates that viruses proposed to infect vertebrates (in some cases being linked to human disease) may in fact be restricted to arthropod hosts. The detection of these sequences in vertebrate samples may reflect their widespread presence in the environment as viruses of parasitic arthropods.Entities:
Keywords: EVE; circovirus; cyclovirus; diversity; endogenous; evolution; metagenomics
Mesh:
Year: 2018 PMID: 29875243 PMCID: PMC6069186 DOI: 10.1128/JVI.00145-18
Source DB: PubMed Journal: J Virol ISSN: 0022-538X Impact factor: 5.103
Novel CVe identified in this study
| Sequence source and common name | Scientific name | Class | Order | No. of sequences | Status | Intact |
|---|---|---|---|---|---|---|
| Tomato clownfish | Vertebrata | Perciformes | 1 | CVe | No | |
| Elephant fish | Vertebrata | Osteoglossiformes | 1 | CVe | No | |
| Asian bee mite | Arthropoda | Arachnida | 7 | CVe | No | |
| Varroa mite | Arthropoda | Arachnida | 19 | CVe | Yes | |
| Elongate twig ant | Arthropoda | Insecta | 1 | CVe | Yes | |
| CRESS-DNA group | ||||||
| Myxosporean parasite | Cnidaria | Myxosporea | 1 | CVe | Yes | |
| Philippine horse mussel | Mollusca | Bivalvia | 4 | CVe | Yes | |
| Mediterranean mussel | Mollusca | Bivalvia | 4 | Yes | ||
| Freshwater snail | Mollusca | Gastropoda | 1 | Yes | ||
| Tribble's cone | Mollusca | Gastropoda | 3 | Yes | ||
| Western predatory mite | Arthropoda | Arachnida | 1 | Yes | ||
| Phytoseiid predatory mite | Arthropoda | Arachnida | 1 | Yes | ||
| Brown recluse spider | Arthropoda | Arachnida | 19 | CVe | Yes | |
| Scarab beetle | Arthropoda | Insecta | 1 | CVe | Yes | |
| Drifting brine fly | Arthropoda | Insecta | 10 | Yes | ||
| Alkali fly | Arthropoda | Insecta | 8 | Yes | ||
| Amphipod crustacean | Arthropoda | Malacostraca | 3 | CVe | No | |
| Sea louse | Arthropoda | Maxillopoda | 4 | Yes | ||
| Tadpole shrimp | Arthropoda | Branchiopoda | 1 | Yes | ||
| Pork tapeworm | Cestoda | Cyclophyllidea | 3 | CVe | Yes |
Number of distinct sequences disclosing similarity to circovirus proteins that were identified in species WGS data.
Species genomes that were confirmed as containing CVe are indicated, based on the presence in WGS assemblies of at least one contig containing regions of circovirus homology flanked by >3 kb of genomic sequence.
Data indicate which species contained circovirus-derived sequences in which protein-coding potential was maintained across the entire length of the detected region of circovirus homology, and this region was at least 200 nucleotides in length.
FIG 1Phylogeny of exogenous and endogenous circovirus Rep sequences. Maximum-likelihood phylogeny reconstructed from an alignment of replication-associated protein (Rep) sequences. The tree is midpoint rooted; asterisks indicate nodes with >70% bootstrap support. The scale bar indicates evolutionary distance in the number of substitutions per site. Sequences derived from metagenomic samples are indicated by colored circles. Taxon names are shown for sequences derived from viruses and CVe. All taxa are colored to indicate associations with host species groups, as shown in the key. Stars indicate viral taxa that have been linked to human disease. See Fig. S1 in the supplemental material for accession numbers of all taxa shown here. The arrow indicates an age calibration inferred for a clade within the Circovirus genus. Mya, million years ago; CaCV, canary circovirus; RaCV, raven circovirus; FiCV, finch circovirus; StCV, starling circovirus; CoCV, columbid circovirus; BFDV, beak and feather disease virus; MiCV, mink circovirus; PCV-2, porcine circovirus 2; CfCV, canine circovirus 1; DuCV, duck circovirus; SwCV, swan circovirus; GoCV, goose circoviruses; SgCV, wels catfish circovirus; BarbCV, barbel circovirus.
FIG 2PCR confirmation of CVe-Pseudomyrmex presence in three populations of Pseudomyrmex gracilis. (a) Results of amplification using primer pair 1 (694-bp product). (b) Results of amplification using primer pair 2 (286-bp product). Lane 1, negative control; lane 2, Pseudomyrmex gracilis from the Florida Keys; lane 3, P. gracilis from mainland Florida; lane 4, P. gracilis from Texas; lane 5, P. elongatus from the Florida Keys; lane 6, P. spinicola from Guanacaste Province, Costa Rica; lane 7, P. oculatus from Cusco, Peru; lane 8, Cephalotes atratus from Cusco, Peru; lane 9, negative control; lane 10, ladder.