| Literature DB >> 33177604 |
Mathew Fisher1, Thomas M R Harrison1,2, Michelle Nebroski1, Peter Kruczkiewicz1, Jamie L Rothenburger3, Aruna Ambagala1, Bryan Macbeth4, Oliver Lung5,6.
Abstract
The complete genome sequence of a novel circovirus (elk circovirus (ElkCV) Banff/2019) was determined via high throughput sequencing of liver tissue from a euthanized Rocky Mountain elk (Cervus canadensis nelsoni) from Alberta, Canada. The genome is circular and 1,787 nucleotides long, with two major ORFs encoding predicted proteins. Comparative genomic analysis to 4,164 publicly available complete and near complete circovirus genomes showed that ElkCV shares approximately 65% pairwise genome-wide nucleotide identity with the most closely related circovirus species, porcine circoviruses (PCV) 1 and 2 and bat-associated circovirus (BatACV) 11. ElkCV features a stem-loop within the origin of replication region characteristic of circoviruses. However, it differs from those found in PCV1, PCV2 and BatACV11 since it has a longer stem and contains hexamer repeats that overlap the stem in opposing orientations. Interestingly, stem-loop structures of similar length featuring repeats in a similar position and orientation are also seen in some avian circoviruses. Based on the demarcation threshold established by the International Committee on Taxonomy of Viruses (ICTV) for members of Circoviridae (80% pairwise genome-wide nucleotide identity), ElkCV represents a novel species and is the first complete circovirus genome reported from a cervid host.Entities:
Mesh:
Year: 2020 PMID: 33177604 PMCID: PMC7659335 DOI: 10.1038/s41598-020-75577-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Structure of the elk circovirus Banff/2019 genome. Bottom: Complete circular genome map and coverage plot. On the outside, start and end nucleotide position of ORFs and motifs are indicated in italics. Amino acid sequences (5′–3′) are shown in single letter code beside motif names and positions. ORFs were visualized using SnapGene Viewer v5.0.7 (snapgene.com). Inside the annotated circular genome is a circular plot (generated using GView v1.7[38]) showing coverage at each nucleotide position based on reference mapping of the liver-derived metagenomic sequencing data to the assembled genome. Inside the coverage plot, an 874 nt PCR product (from nucleotide positon 963 to 49) that was subsequently sequenced in order to confirm a region with low relative coverage, is indicated (coverage data not shown). Top: Expanded view of the origin of replication stem-loop region. Nucleotide position and features of note are labelled. Hexamer repeats are labelled as H1-3 with a reverse orientation repeat on the opposite stem of the stem-loop compared to H1 labelled as H1R. A single base insertion located in the H2 hexamer repeat sequence that is also found in BatACV11 is indicated in bold and italics.
Figure 2Maximum likelihood phylogenetic trees of representative sequences within the genus Circovirus. Sequences were aligned using MAFFT[24], trees generated using IQ-Tree[23] on find best model setting with ModelFinder[25] with 1000 ultrafast bootstraps[26] and visualized with iTOL[27]. ElkCV is indicated in bold text and with “•” before the sequence name in all trees. (a) Full mid-point rooted phylogenetic tree generated from representative Circovirus complete genome nucleotide sequences (model TIM + F + R5). (b) Pruned trees showing only ElkCV and sequences on closely related branches from complete genome nucleotide sequences (top black box), Rep protein amino acid sequences (middle gray box, model LG + I + G4) and Cp protein amino acid sequences (bottom gray box, model PMB + R3).
Comparison of the elk circovirus Banff/2019 genome to the most closely related circovirus genomes currently known.
Percent identity is shown between the elk circovirus genome and the genomes of closely related species based on nucleotide alignments of the whole genome (WG) as well as nucleotide (nt) and amino acid (aa) alignments of the Rep and Cp genes. The sequences of the conserved motifs found within Rep and hexamer repeats found near the origin of replication are shown. Nucleotide or amino acid resides identical to elk circovirus are indicated with a period and any differences are shown. A single base insertion in the H2 repeat of both elk circovirus and BatACV11 is bolded and underlined, while the gap in other sequences where this base is absent are indicated with “–”.