Literature DB >> 29875243

Insights into Circovirus Host Range from the Genomic Fossil Record.

Tristan P W Dennis1, Peter J Flynn2,3, William Marciel de Souza1,4, Joshua B Singer1, Corrie S Moreau2, Sam J Wilson1, Robert J Gifford5.   

Abstract

A diverse range of DNA sequences derived from circoviruses (family Circoviridae) has been identified in samples obtained from humans and domestic animals, often in association with pathological conditions. In the majority of cases, however, little is known about the natural biology of the viruses from which these sequences are derived. Endogenous circoviral elements (CVe) are DNA sequences derived from circoviruses that occur in animal genomes and provide a useful source of information about circovirus-host relationships. In this study, we screened genome assemblies of 675 animal species and identified numerous circovirus-related sequences, including the first examples of CVe derived from cycloviruses. We confirmed the presence of these CVe in the germ line of the elongate twig ant (Pseudomyrmex gracilis), thereby establishing that cycloviruses infect insects. We examined the evolutionary relationships between CVe and contemporary circoviruses, showing that CVe from ants and mites group relatively closely with cycloviruses in phylogenies. Furthermore, the relatively random interspersion of CVe from insect genomes with cyclovirus sequences recovered from vertebrate samples suggested that contamination might be an important consideration in studies reporting these viruses. Our study demonstrates how endogenous viral sequences can inform metagenomics-based virus discovery. In addition, it raises doubts about the role of cycloviruses as pathogens of humans and other vertebrates.IMPORTANCE Advances in DNA sequencing have dramatically increased the rate at which new viruses are being identified. However, the host species associations of most virus sequences identified in metagenomic samples are difficult to determine. Our analysis indicates that viruses proposed to infect vertebrates (in some cases being linked to human disease) may in fact be restricted to arthropod hosts. The detection of these sequences in vertebrate samples may reflect their widespread presence in the environment as viruses of parasitic arthropods.
Copyright © 2018 Dennis et al.

Entities:  

Keywords:  EVE; circovirus; cyclovirus; diversity; endogenous; evolution; metagenomics

Mesh:

Year:  2018        PMID: 29875243      PMCID: PMC6069186          DOI: 10.1128/JVI.00145-18

Source DB:  PubMed          Journal:  J Virol        ISSN: 0022-538X            Impact factor:   5.103


INTRODUCTION

Circoviruses (family Circoviridae) are small, nonenveloped viruses with circular, single-stranded DNA (ssDNA) genomes ∼1.8 to ∼2.1 kb in length. Circovirus genomes encode two major proteins: replication-associated protein (Rep) and capsid (Cap), responsible for genome replication and particle formation, respectively. Transcription is bidirectional, with the rep gene being encoded in the forward sense and the cap gene being encoded in the complementary sense (1, 2). The family Circoviridae contains two recognized genera: Circovirus and Cyclovirus (1). The genus Circovirus includes pathogenic viruses of vertebrates, such as porcine circovirus 2 (PCV-2), which causes postweaning multisystemic wasting syndrome in swine. The genus Cyclovirus, in contrast, is comprised entirely of viruses that have been identified only via sequencing and for which host species associations are less clear. Nevertheless, cycloviruses have frequently been associated with pathogenic conditions in humans and domestic mammals. For example, cyclovirus sequences have been detected in the cerebrospinal fluid of humans suffering from acute central nervous system disease in Vietnam and Malawi (3, 4). Cyclovirus sequences have also been reported in association with numerous other outbreaks of disease in humans and domestic mammals (5–7). Sequences derived from circoviruses have been shown to be present in the genomes of many eukaryotic species (8, 9). These endogenous circoviral elements (CVe) are thought to be derived from the genomes of ancient circoviruses that were, by one means or another, ancestrally integrated into the nuclear genome of germ line cells (10). CVe can provide unique information about the long-term coevolutionary relationships between viruses and hosts; for example, the identification of ancient CVe in vertebrate genomes shows that viruses in the genus Circovirus have been coevolving with vertebrate hosts for millions of years (11). We recently reported the results of a study in which we systematically screened vertebrate whole-genome sequence (WGS) data for CVe (11). Here, we expanded this screen to include a total of 675 animal genomes, including 307 invertebrate species. Via screening, we identified novel examples of sequences derived from circoviruses, cycloviruses, and the more divergent circular Rep-encoding single-stranded DNA (CRESS-DNA) group. We examine the phylogenetic relationships between these sequences, well-studied circovirus isolates, and circovirus-related sequences recovered via metagenomic sequencing of environmental samples or animal tissues. Our analysis raises important questions about the origins of cyclovirus sequences in samples derived from humans and other mammals and about their role in causing disease in these hosts. (This article was submitted to an online preprint archive [12].)

RESULTS

Identification of CVe in animal genomes.

We screened WGS data of 675 animal species (see Table S1 in the supplemental material) in silico to identify sequences related to circoviruses. We identified 300 circovirus-related sequences in total, 76 of which have not been reported previously (Tables 1 and S3). To investigate the novel sequences identified in our screen, each sequence was virtually translated and incorporated into a multiple-sequence alignment that included a representative set of previously reported circoviruses and CVe (Table S2). Incorporation of CVe sequences into an alignment provided a basis for determining their genetic structures and investigating their phylogenetic relationships to circoviruses (Fig. 1).
TABLE 1

Novel CVe identified in this study

Sequence source and common nameScientific nameClassOrderNo. of sequencesaStatusbIntactc
Circovirus
    Tomato clownfishAmphiprion frenatusVertebrataPerciformes1CVeNo
    Elephant fishParamormyrops kingsleyaeVertebrataOsteoglossiformes1CVeNo
Cyclovirus
    Asian bee miteTropilaelaps mercedesaeArthropodaArachnida7CVeNo
    Varroa miteVarroa destructorArthropodaArachnida19CVeYes
    Elongate twig antPseudomyrmex gracilisArthropodaInsecta1CVeYes
CRESS-DNA group
    Myxosporean parasiteThelohanellus kitaueiCnidariaMyxosporea1CVeYes
    Philippine horse musselModiolus philippinarumMolluscaBivalvia4CVeYes
    Mediterranean musselMytilus galloprovincialisMolluscaBivalvia4Yes
    Freshwater snailBiomphalaria glabrataMolluscaGastropoda1Yes
    Tribble's coneConus tribbleiMolluscaGastropoda3Yes
    Western predatory miteGalendromus occidentalisArthropodaArachnida1Yes
    Phytoseiid predatory miteMetaseiulus occidentalisArthropodaArachnida1Yes
    Brown recluse spiderLoxosceles reclusaArthropodaArachnida19CVeYes
    Scarab beetleOryctes borbonicusArthropodaInsecta1CVeYes
    Drifting brine flyEphydra gracilisArthropodaInsecta10Yes
    Alkali flyEphydra hiansArthropodaInsecta8Yes
    Amphipod crustaceanParhyale hawaiensisArthropodaMalacostraca3CVeNo
    Sea louseCaligus rogercresseyiArthropodaMaxillopoda4Yes
    Tadpole shrimpTriops cancriformisArthropodaBranchiopoda1Yes
    Pork tapewormTaenia soliumCestodaCyclophyllidea3CVeYes

Number of distinct sequences disclosing similarity to circovirus proteins that were identified in species WGS data.

Species genomes that were confirmed as containing CVe are indicated, based on the presence in WGS assemblies of at least one contig containing regions of circovirus homology flanked by >3 kb of genomic sequence.

Data indicate which species contained circovirus-derived sequences in which protein-coding potential was maintained across the entire length of the detected region of circovirus homology, and this region was at least 200 nucleotides in length.

FIG 1

Phylogeny of exogenous and endogenous circovirus Rep sequences. Maximum-likelihood phylogeny reconstructed from an alignment of replication-associated protein (Rep) sequences. The tree is midpoint rooted; asterisks indicate nodes with >70% bootstrap support. The scale bar indicates evolutionary distance in the number of substitutions per site. Sequences derived from metagenomic samples are indicated by colored circles. Taxon names are shown for sequences derived from viruses and CVe. All taxa are colored to indicate associations with host species groups, as shown in the key. Stars indicate viral taxa that have been linked to human disease. See Fig. S1 in the supplemental material for accession numbers of all taxa shown here. The arrow indicates an age calibration inferred for a clade within the Circovirus genus. Mya, million years ago; CaCV, canary circovirus; RaCV, raven circovirus; FiCV, finch circovirus; StCV, starling circovirus; CoCV, columbid circovirus; BFDV, beak and feather disease virus; MiCV, mink circovirus; PCV-2, porcine circovirus 2; CfCV, canine circovirus 1; DuCV, duck circovirus; SwCV, swan circovirus; GoCV, goose circoviruses; SgCV, wels catfish circovirus; BarbCV, barbel circovirus.

Novel CVe identified in this study Number of distinct sequences disclosing similarity to circovirus proteins that were identified in species WGS data. Species genomes that were confirmed as containing CVe are indicated, based on the presence in WGS assemblies of at least one contig containing regions of circovirus homology flanked by >3 kb of genomic sequence. Data indicate which species contained circovirus-derived sequences in which protein-coding potential was maintained across the entire length of the detected region of circovirus homology, and this region was at least 200 nucleotides in length. Phylogeny of exogenous and endogenous circovirus Rep sequences. Maximum-likelihood phylogeny reconstructed from an alignment of replication-associated protein (Rep) sequences. The tree is midpoint rooted; asterisks indicate nodes with >70% bootstrap support. The scale bar indicates evolutionary distance in the number of substitutions per site. Sequences derived from metagenomic samples are indicated by colored circles. Taxon names are shown for sequences derived from viruses and CVe. All taxa are colored to indicate associations with host species groups, as shown in the key. Stars indicate viral taxa that have been linked to human disease. See Fig. S1 in the supplemental material for accession numbers of all taxa shown here. The arrow indicates an age calibration inferred for a clade within the Circovirus genus. Mya, million years ago; CaCV, canary circovirus; RaCV, raven circovirus; FiCV, finch circovirus; StCV, starling circovirus; CoCV, columbid circovirus; BFDV, beak and feather disease virus; MiCV, mink circovirus; PCV-2, porcine circovirus 2; CfCV, canine circovirus 1; DuCV, duck circovirus; SwCV, swan circovirus; GoCV, goose circoviruses; SgCV, wels catfish circovirus; BarbCV, barbel circovirus. All of the newly identified sequences were derived from rep; no novel sequences derived from circovirus cap genes were detected. We identified two novel CVe derived from viruses in the genus Circovirus in fish genomes (Table 1). One of these, identified in the tomato clownfish (Amphiprion frenatus), appeared to be an ortholog of a CVe locus previously identified in other perciform fish (11). The other, identified in a mormyrid fish, was clearly related to CVe previously identified in ray-finned fish (11, 13), but as it comprised a relatively short fragment of the rep gene, its more precise phylogenetic relationship to these CVe could not be determined with confidence. We identified 93 circovirus-related sequences in invertebrate genome assemblies, 71 of which have not been reported previously (Tables 1 and S3). Of these, a relatively high proportion exhibited coding potential. Some occurred on short contigs and could potentially have been derived from contaminating virus. However, we found that, in many cases, at least one of the circovirus-related sequences identified in a WGS assembly was incorporated into a contig that was easily large enough to contain an entire circovirus genome and was flanked by >3 kb of genomic sequence and thus likely to represent CVe. On this basis, we estimate that 60 of the 93 sequences we identified in invertebrate genomes are likely to be derived from CVe. Sequences that occurred on short contigs, particularly those that lacked any in-frame stop codons or frameshifts (Table 1), might instead be derived from contaminating virus. Maximum likelihood (ML) phylogenies were reconstructed using an alignment of Rep proteins and disclosed two robustly supported, monophyletic clades corresponding to the Circovirus and Cyclovirus genera (1). In line with our previous investigations (11), we found that all Rep-related sequences from vertebrate WGSs grouped with circoviruses, with the exception of a highly divergent sequence identified in the genome of a jawless vertebrate (the hagfish Eptatretus burgeri). All sequences derived from invertebrate WGSs grouped with cycloviruses or with divergent CRESS-DNA viruses (e.g., Avon-Heathcote Estuary-associated circular virus 24 [14]). CRESS-DNA virus-like sequences from distinct species tended to emerge on relatively long branches, and bootstrap support for branching patterns in this region of the phylogeny were generally quite low. The low resolution in this part of the phylogeny likely reflects the lack of adequate sampling of viruses from invertebrate species. Some sequences from invertebrate WGSs were observed to cluster with cycloviruses in phylogenies, including CVe detected in two parasitic mite species (Varroa destructor and Tropilaelaps mercedesae) and a third detected in the genome of the elongate twig ant (Pseudomyrmex gracilis). The last element, here referred to as CVe-Pseudomyrmex, appeared to be no more distantly related to contemporary cycloviruses than many of them are to one another, including some that are associated with vertebrates (at least superficially) (Fig. 1). Because this seemed a little surprising, we sought to confirm the presence of CVe-Pseudomyrmex in the twig ant germ line. We obtained genomic DNA from four species of ant within the Pseudomyrmex genus (P. gracilis, Pseudomyrmex elongatus, Pseudomyrmex spinicola, and Pseudomyrmex oculatus), including three distinct populations of P. gracilis. We then used PCR to amplify a region encompassing part of the CVe and part of the genomic flanking sequence. We obtained an amplicon of the expected size in all three DNA samples of P. gracilis; all other samples were negative (Fig. 2). Sequencing of the amplicon confirmed that it was derived the genomic locus containing the CVe and contained both a portion of the CVe and a region of genomic flanking sequence.
FIG 2

PCR confirmation of CVe-Pseudomyrmex presence in three populations of Pseudomyrmex gracilis. (a) Results of amplification using primer pair 1 (694-bp product). (b) Results of amplification using primer pair 2 (286-bp product). Lane 1, negative control; lane 2, Pseudomyrmex gracilis from the Florida Keys; lane 3, P. gracilis from mainland Florida; lane 4, P. gracilis from Texas; lane 5, P. elongatus from the Florida Keys; lane 6, P. spinicola from Guanacaste Province, Costa Rica; lane 7, P. oculatus from Cusco, Peru; lane 8, Cephalotes atratus from Cusco, Peru; lane 9, negative control; lane 10, ladder.

PCR confirmation of CVe-Pseudomyrmex presence in three populations of Pseudomyrmex gracilis. (a) Results of amplification using primer pair 1 (694-bp product). (b) Results of amplification using primer pair 2 (286-bp product). Lane 1, negative control; lane 2, Pseudomyrmex gracilis from the Florida Keys; lane 3, P. gracilis from mainland Florida; lane 4, P. gracilis from Texas; lane 5, P. elongatus from the Florida Keys; lane 6, P. spinicola from Guanacaste Province, Costa Rica; lane 7, P. oculatus from Cusco, Peru; lane 8, Cephalotes atratus from Cusco, Peru; lane 9, negative control; lane 10, ladder.

Mapping the host associations of circoviruses and cycloviruses.

In phylogenies based on Rep, clades corresponding to the Circovirus and Cyclovirus genera contained a mixture of (i) CVe from WGS assemblies, (ii) sequences obtained from virus isolates, and (iii) sequences obtained from metagenomic samples (Fig. 1 and Table S1). Among circoviruses (genus Circovirus), host associations at the level of class appear relatively stable. For example, beak and feather disease virus (BFDV) groups robustly with a CVe that entered the germ line of birds of the order Passeriformes ∼38 million years ago (mya) (11), while barbel circovirus (BarbCV) groups robustly with CVe from the genome of the golden-line barbel in a well-supported clade containing numerous CVe from ray-finned fish. The only sequence that superficially seems to contradict this pattern is “chimpanzee” circovirus, which groups robustly with avian viruses. However, the name of this sequence is misleading: it was recovered from chimpanzee feces, but no host association is known. Indeed, the possibility that it might derive from an avian circovirus was noted at the time it was reported (15). We observed three well-supported sublineages in the clade corresponding to the Cyclovirus genus, here termed cycloviruses 1 to 3 (Fig. 1). For cycloviruses, the only confirmed host associations come from the CVe-Pseudomyrmex sequence reported above. Many of the cyclovirus sequences that have been identified via metagenomic sequencing are associated with arthropod species, such as dragonflies (16). However, others are associated with vertebrates and have names such as bat cyclovirus. The cyclovirus 1 group is exclusively comprised of viruses from vertebrate samples. In cyclovirus groups 2 and 3, however, sequences from vertebrate and invertebrate samples are extensively intermingled (Fig. 1), and clade structure does not reflect these host associations in any obvious way. Sequences from each host group appear to be dispersed randomly, and the branch lengths separating vertebrate from invertebrate viruses (and CVe) are relatively short in many cases.

DISCUSSION

In this study, we screened in silico 675 animal genomes and identified numerous sequences related to circoviruses, including many that have not been reported previously. We examined the phylogenetic relationships between these sequences, well-studied circovirus isolates, and circovirus sequences recovered via metagenomic sequencing of environmental samples or animal tissues. Most of the novel circovirus sequences reported here were identified in invertebrate genome assemblies. Many were highly divergent and are likely derived from uncharacterized CRESS-DNA virus lineages that infect invertebrate species. All of the newly identified sequences identified in our study were derived from rep genes; we did not detect any novel CVe with homology to circovirus cap genes. The preponderance of CVe derived from rep versus those derived from cap might reflect the greater heterogeneity of capsid sequences in general, which might lead to these sequences being generally harder to detect. Certainly, in the case of the more divergent invertebrate viruses, it is possible that the cap genes found in some lineages might not share any sequence homology with those sequenced previously. However, we note that even among CVe derived from viruses in the genus Circovirus, within which capsid sequences are comparatively conserved, sequences derived from rep are approximately twice as common as those derived from cap. Among the factors that may have influenced the structures and types of CVe that we observe in animal germ lines are selection pressures that have led to these sequences being co-opted or exapted by host species. Interestingly, several of the confirmed CVe in our study lacked frameshifting mutations or in-frame stop codons (Table 1), indicating that they have been evolving under purifying selection relatively recently. We confirmed the presence of one such CVe (CVe-Pseudomyrmex) in three populations of Pseudomyrmex gracilis (Fig. 2). The fact that we did not detect the CVe-Pseudomyrmex sequence in other members of the genus suggests that it was incorporated into the P. gracilis germ line after this species diverged from P. elongatus, P. spinicola, and P. oculatus in the Miocene epoch (17). However, we cannot completely rule out that Pseudomyrmex is actually older since the failure to obtain an amplicon in other Pseudomyrmex species could be accounted for in other ways (e.g., sequence divergence in the regions targeted by PCR primers). Nevertheless, the occurrence of an apparently fixed, intact, and expressed circovirus rep gene in an ant genome provides further evidence that these genes have been co-opted or exapted by host species for as yet unknown functions. Functional genomic studies in insects indicate that endogenous viral element (EVE) sequences have been co-opted into RNA-based systems of antiviral immunity (18). Thus, one possible explanation accounting for the conservation of this sequence in CVe-Pseudomyrmex is that it is involved in immune defense although this would not necessarily require maintenance of an intact coding sequence. Our study allowed the host associations of circoviruses and CVe to be examined in the context of their evolutionary relationships. With respect to this, the grouping of sequences for which the host associations are well established (i.e., CVe and viruses that have been investigated using methods in addition to sequencing) relative to sequences recovered from metagenomic samples was revealing. Prior to this study, the only host associations that had been robustly demonstrated were within the genus Circovirus. Circoviruses have been isolated from vertebrates, and in phylogenies based on Rep proteins, these isolates group together with vertebrate CVe in a well-supported clade. Furthermore, the host associations of circoviruses appear quite stable, with ancient CVe from particular host groups (e.g., orders or classes) sometimes seen grouping together with contemporary viruses from the same host groups (Fig. 1). Within the Circovirus clade there is only one sequence that seems to contradict this pattern. This sequence was recovered from a stool sample and, thus, as was observed when it was first reported (15), is likely to reflect environmental contamination. The limited evidence available regarding the zoonotic potential of circoviruses suggests that they lack the capacity to be transmitted between relatively distantly related hosts (i.e., hosts in distinct classes or orders). For example, during the 1990s and early 2000s, porcine circovirus 1 (PCV-1) was inadvertently introduced into batches of live attenuated rotavirus vaccine as an adventitious agent. These vaccines were administered to millions of people (19), yet PCV-1 is not thought to have infected any humans as a result, indicating that powerful barriers to cross-species transmission are probably in effect. Nevertheless, recent studies have identified some surprising cases wherein phylogenetic trees indicate apparent transmission of viruses between vertebrate classes (20). We see evidence for potential interclass transmission within one Circovirus subclade that contains sequences obtained from waterfowl and mammals (including mink, bats, dogs, and pigs) as well as CVe from reptile genomes (Fig. 1). Within this subclade, viruses of mink are robustly separated from porcine, canine, and waterfowl viruses by a CVe that was incorporated into the serpentine germ line >72 million years ago (11). Thus, the phylogeny indicates that at least one interclass transmission event is likely to have occurred within this clade. However, it should be noted that while the clustering patterns observed in this clade do suggest potential transmission of circoviruses between vertebrate classes, they also indicate that such events have occurred relatively infrequently during evolution since the CVe in snake genomes provide a minimum for the entire clade (assuming the root of this clade is as depicted in Fig. 1). Moreover, since clustering patterns that superficially appear to indicate host switches can be accounted for by multiple alternative evolutionary scenarios (21, 22), caution is advisable when phylogenetic approaches are used to infer the relationships between parasites and their hosts, especially when sampling is limited. If cross-species transmission of circoviruses between distinct mammalian orders does not occur readily, then transmission between arthropod and vertebrate hosts appears extremely unlikely. However, if we take the reported host species associations of cycloviruses at face value, we might conclude that transmission between distantly related species groups occurs frequently, particularly within the cyclovirus 3′ lineage (Fig. 1). Importantly, CVe-Pseudomyrmex groups robustly within this lineage, and since all other taxa within the Cyclovirus clade have been recovered via metagenomic screening, this provides the first unambiguous evidence of a host association for cycloviruses, establishing with a high degree of confidence that they do indeed infect arthropods or, at the very least, have done so in the past. Furthermore, since the only proven associations for cycloviruses so far are with arthropods, contamination of vertebrate samples with viruses derived from arthropods is perhaps the most parsimonious explanation for the host associations observed here. Contamination from arthropod sources such as dust mites can presumably occur fairly easily, given their ubiquity (23). Intriguingly, with respect to this we identified putative cyclovirus CVe in the genomes of two distinct mite species (Fig. 1 and Table 1). Since there is always a risk of being misled by contamination when viruses are identified via sequencing-based approaches, we propose that host associations of circoviruses identified via sequencing should be viewed with caution where they are found to strongly contradict established host associations within well-defined clades, particularly at higher taxonomic levels (e.g., phylum, class, or order). Whereas the weight of evidence may favor cyclovirus groups 2 and 3 being exclusively arthropod viruses that frequently contaminate vertebrate samples, the status of the cyclovirus 1 lineage is perhaps more equivocal. This basal lineage is comprised exclusively of sequences obtained from mammalian samples and includes cycloviruses proposed to cause disease in humans (cyclovirus VN and human cyclovirus VS5700009). Conceivably, these sequences could represent a mammal- or vertebrate-specific lineage of cycloviruses that is distinct from arthropod-infecting lineages. Notably, however, false-positive detection of human cyclovirus VS5700009 has been reported (24). Virus sequences recovered from metagenomic samples can be investigated by examining their phylogenetic relationships to other viruses for which host associations have been established. The work performed here demonstrates the utility of endogenous virus sequences in this process. This approach can be generalized to inform metagenomics-based virus discovery and diversity mapping efforts for any virus group that has generated endogenous sequences.

MATERIALS AND METHODS

Sequence data.

Whole-genome sequence (WGS) assemblies of 675 species (see Table S1 in the supplemental material) were downloaded from the National Center for Biotechnology Information (NCBI) website. We obtained a representative set of sequences for the genus Circovirus and a nonredundant set of vertebrate CVe sequences from an openly accessible data set we compiled in our previous work (11). This data set was expanded to include a broader range of sequences in the family Circoviridae, including representative species in the Cyclovirus genus and the more distantly related CRESS-DNA viruses (Table S2). We used GLUE, an open, data-centric software environment specialized in capturing and processing virus genome sequence data sets (25), to collate the sequences, alignments, and associated data used in this investigation. These data are available in the publicly accessible DIGS-for-EVEs (database-integrated genome screening for EVEs) online repository (https://github.com/giffordlabcvr/DIGS-for-EVEs).

Genome screening in silico.

Genome screening in silico was performed using the DIGS tool (26). The DIGS procedure comprises two steps. In the first step, the basic local alignment search tool (BLAST) program (27) is used to search a genome assembly file for similar to a particular probe (i.e., a circovirus Rep or Cap polypeptide sequence). In the second, sequences that produce statistically significant matches to the probe are extracted and classified by BLAST-based comparison to a set of virus reference genomes (Table S2). Results are captured in a MySQL database. Newly identified CVe identified in this study were assigned a unique identifier (ID) according to a convention we established previously (11). The first component of the ID is the classifier CVe. The second is a composite of two distinct subcomponents separated by a period: the name of the CVe group (usually derived from the host group in which the element occurs, e.g., Carnivora) and a numeric ID that uniquely identifies the insertion. Orthologous copies in different species are given the same number but are differentiated using the third component of the ID that uniquely identifies the species from which the sequence was obtained. Unique numeric IDs were assigned to novel CVe with reference to those used in the previously assembled data set (11).

Alignments and phylogenetic analysis.

Multiple sequence alignments were constructed using MUSCLE (28), RevTrans, version 2.0 (29), MACSE (30), and PAL2NAL (31). Manual inspection and adjustment of alignments were performed in Se-Al (32) and AliView (33). Phylogenies were reconstructed using maximum likelihood as implemented in IQ-TREE (34) and the VT+ G4 protein substitution model (35) as selected using ProTest (36), with support assessed using 1,000 nonparametric bootstrap replicates.

Amplification and sequencing.

Genomic DNA was extracted from ant tissue samples following the Moreau protocol (37) and a DNeasy blood and tissue kit (Qiagen). PCR amplification of CVe-Pseudomyrmex was performed using two sets of primer pairs designed with Primer3 (http://bioinfo.ut.ee/primer3-0.4.0/), each comprising one primer anchored in the CVe sequence and another anchored in the genomic flanking sequence. Primer pair 1 amplified a sequence that was 694 bp long, and primer pair 2 amplified a sequence that was 286 bp long. Primers were tested using illustra PuReTaq Ready-To-Go PCR beads (GE Healthcare). A temperature gradient PCR was performed to assess the optimum annealing temperature for the specific primer pairs. PCR was then performed using the genomic DNA ant extractions. The PCR conditions for this run were the following: an initial denaturation stage of 5 min at 95°C, 30 cycles of 30 s of denaturing at 95°C, 30 s of annealing at 49.7°C for primer pair 1 and at 62°C for primer pair 2, and extension at 72°C for 1 min, with a final extension after this program at 72°C for 5 min. Each run included a negative control. Amplification products (800 to 1,000 bp) for each PCR were excised and run on agarose gels. Bands of the expected size were excised, purified, and sequenced via Sanger sequencing (38).
  34 in total

1.  Small circular single stranded DNA viral genomes in unexplained cases of human encephalitis, diarrhea, and in untreated sewage.

Authors:  Tung Gia Phan; Daisuke Mori; Xutao Deng; Shaman Rajindrajith; Udaya Ranawaka; Terry Fei Fan Ng; Filemon Bucardo-Rivera; Patricia Orlandi; Kamruddin Ahmed; Eric Delwart
Journal:  Virology       Date:  2015-04-01       Impact factor: 3.616

2.  Diverse small circular DNA viruses circulating amongst estuarine molluscs.

Authors:  Anisha Dayaram; Sharyn Goldstien; Gerardo R Argüello-Astorga; Peyman Zawar-Reza; Christopher Gomez; Jon S Harding; Arvind Varsani
Journal:  Infect Genet Evol       Date:  2015-02-17       Impact factor: 3.342

3.  MACSE: Multiple Alignment of Coding SEquences accounting for frameshifts and stop codons.

Authors:  Vincent Ranwez; Sébastien Harispe; Frédéric Delsuc; Emmanuel J P Douzery
Journal:  PLoS One       Date:  2011-09-16       Impact factor: 3.240

4.  PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.

Authors:  Mikita Suyama; David Torrents; Peer Bork
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

5.  AliView: a fast and lightweight alignment viewer and editor for large datasets.

Authors:  Anders Larsson
Journal:  Bioinformatics       Date:  2014-08-05       Impact factor: 6.937

6.  Detection of a novel circovirus PCV3 in pigs with cardiac and multi-systemic inflammation.

Authors:  Tung Gia Phan; Federico Giannitti; Stephanie Rossow; Douglas Marthaler; Todd P Knutson; Linlin Li; Xutao Deng; Talita Resende; Fabio Vannucci; Eric Delwart
Journal:  Virol J       Date:  2016-11-11       Impact factor: 4.099

7.  ICTV Virus Taxonomy Profile: Circoviridae.

Authors:  Mya Breitbart; Eric Delwart; Karyna Rosario; Joaquim Segalés; Arvind Varsani
Journal:  J Gen Virol       Date:  2017-08-08       Impact factor: 3.891

8.  Insights into Circovirus Host Range from the Genomic Fossil Record.

Authors:  Tristan P W Dennis; Peter J Flynn; William Marciel de Souza; Joshua B Singer; Corrie S Moreau; Sam J Wilson; Robert J Gifford
Journal:  J Virol       Date:  2018-07-31       Impact factor: 5.103

9.  Identification of a new cyclovirus in cerebrospinal fluid of patients with acute central nervous system infections.

Authors:  Le Van Tan; H Rogier van Doorn; Ho Dang Trung Nghia; Tran Thi Hong Chau; Le Thi Phuong Tu; Michel de Vries; Marta Canuti; Martin Deijs; Maarten F Jebbink; Stephen Baker; Juliet E Bryant; Nguyen Thi Tham; Nguyen Thi Thuy Chinh BKrong; Maciej F Boni; Tran Quoc Loi; Le Thi Phuong; Joost T P Verhoeven; Martin Crusat; Rienk E Jeeninga; Constance Schultsz; Nguyen Van Vinh Chau; Tran Tinh Hien; Lia van der Hoek; Jeremy Farrar; Menno D de Jong
Journal:  mBio       Date:  2013-06-18       Impact factor: 7.867

10.  The evolution, distribution and diversity of endogenous circoviral elements in vertebrate genomes.

Authors:  Tristan P W Dennis; William Marciel de Souza; Soledad Marsile-Medun; Joshua B Singer; Sam J Wilson; Robert J Gifford
Journal:  Virus Res       Date:  2018-03-27       Impact factor: 3.303

View more
  12 in total

Review 1.  ssDNA viruses: key players in global virome.

Authors:  V G Malathi; P Renuka Devi
Journal:  Virusdisease       Date:  2019-04-19

Review 2.  Beyond Cytomegalovirus and Epstein-Barr Virus: a Review of Viruses Composing the Blood Virome of Solid Organ Transplant and Hematopoietic Stem Cell Transplant Recipients.

Authors:  Marie-Céline Zanella; Samuel Cordey; Laurent Kaiser
Journal:  Clin Microbiol Rev       Date:  2020-08-26       Impact factor: 26.132

3.  Diversity of CRESS DNA Viruses in Squamates Recapitulates Hosts Dietary and Environmental Sources of Exposure.

Authors:  Paolo Capozza; Gianvito Lanave; Georgia Diakoudi; Francesco Pellegrini; Roberta Cardone; Violetta Iris Vasinioti; Nicola Decaro; Gabriella Elia; Cristiana Catella; Alberto Alberti; Krisztián Bányai; Jairo Alfonso Mendoza-Roldan; Domenico Otranto; Canio Buonavoglia; Vito Martella
Journal:  Microbiol Spectr       Date:  2022-05-26

4.  Virus discovery in all three major lineages of terrestrial arthropods highlights the diversity of single-stranded DNA viruses associated with invertebrates.

Authors:  Karyna Rosario; Kaitlin A Mettel; Bayleigh E Benner; Ryan Johnson; Catherine Scott; Sohath Z Yusseff-Vanegas; Christopher C M Baker; Deby L Cassill; Caroline Storer; Arvind Varsani; Mya Breitbart
Journal:  PeerJ       Date:  2018-10-11       Impact factor: 2.984

5.  Insights into Circovirus Host Range from the Genomic Fossil Record.

Authors:  Tristan P W Dennis; Peter J Flynn; William Marciel de Souza; Joshua B Singer; Corrie S Moreau; Sam J Wilson; Robert J Gifford
Journal:  J Virol       Date:  2018-07-31       Impact factor: 5.103

6.  Assessing the Diversity of Endogenous Viruses Throughout Ant Genomes.

Authors:  Peter J Flynn; Corrie S Moreau
Journal:  Front Microbiol       Date:  2019-05-22       Impact factor: 5.640

7.  Viruses in Vietnamese Patients Presenting with Community-Acquired Sepsis of Unknown Cause.

Authors:  Nguyen To Anh; Nguyen Thi Thu Hong; Le Nguyen Truc Nhu; Tran Tan Thanh; Chuen-Yen Lau; Direk Limmathurotsakul; Xutao Deng; Motiur Rahman; Nguyen Van Vinh Chau; H Rogier van Doorn; Guy Thwaites; Eric Delwart; Le Van Tan
Journal:  J Clin Microbiol       Date:  2019-08-26       Impact factor: 5.948

Review 8.  Virus-Host Coevolution with a Focus on Animal and Human DNA Viruses.

Authors:  Győző L Kaján; Andor Doszpoly; Zoltán László Tarján; Márton Z Vidovszky; Tibor Papp
Journal:  J Mol Evol       Date:  2019-10-10       Impact factor: 2.395

Review 9.  Apoptosis Triggered by ORF3 Proteins of the Circoviridae Family.

Authors:  Yanting Zhang; Xingcui Zhang; Anchun Cheng; Mingshu Wang; Zhongqiong Yin; Juan Huang; Renyong Jia
Journal:  Front Cell Infect Microbiol       Date:  2021-02-02       Impact factor: 5.293

Review 10.  A Review on Viral Metagenomics in Extreme Environments.

Authors:  Sonia Dávila-Ramos; Hugo G Castelán-Sánchez; Liliana Martínez-Ávila; María Del Rayo Sánchez-Carbente; Raúl Peralta; Armando Hernández-Mendoza; Alan D W Dobson; Ramón A Gonzalez; Nina Pastor; Ramón Alberto Batista-García
Journal:  Front Microbiol       Date:  2019-10-18       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.