| Literature DB >> 28458911 |
Arvind Varsani1,2, Mart Krupovic3.
Abstract
With the advent of metagenomics approaches, a large diversity of known and unknown viruses has been identified in various types of environmental, plant, and animal samples. One such widespread virus group is the recently established family Genomoviridae which includes viruses with small (∼2-2.4 kb), circular ssDNA genomes encoding rolling-circle replication initiation proteins (Rep) and unique capsid proteins. Here, we propose a sequence-based taxonomic framework for classification of 121 new virus genomes within this family. Genomoviruses display ∼47% sequence diversity, which is very similar to that within the well-established and extensively studied family Geminiviridae (46% diversity). Based on our analysis, we establish a 78% genome-wide pairwise identity as a species demarcation threshold. Furthermore, using a Rep sequence phylogeny-based analysis coupled with the current knowledge on the classification of geminiviruses, we establish nine genera within the Genomoviridae family. These are Gemycircularvirus (n = 73), Gemyduguivirus (n = 1), Gemygorvirus (n = 9), Gemykibivirus (n = 29), Gemykolovirus (n = 3), Gemykrogvirus (n = 3), Gemykroznavirus (n = 1), Gemytondvirus (n = 1), Gemyvongvirus (n = 1). The presented taxonomic framework offers rational classification of genomoviruses based on the sequence information alone and sets an example for future classification of other groups of uncultured viruses discovered using metagenomics approaches.Entities:
Keywords: CRESS DNA viruses; Genomoviridae; replication-associated protein; ssDNA viruses.
Year: 2017 PMID: 28458911 PMCID: PMC5399927 DOI: 10.1093/ve/vew037
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Details of all members of the Genomoviridae.
| Genus | Species | Accession # | Sequence ID | Isolation source | Common name | Sample type | Country | Reference |
|---|---|---|---|---|---|---|---|---|
| KF371641 | P9 | Blackbird | Faeces | New Zealand | ||||
| KF371642 | P22 | Blackbird | Faeces | New Zealand | ||||
| KF371643 | as41 | Sheep | Faeces | New Zealand | ||||
| KT862253 | 52 Fec78023 cow | Cow | Faeces | New Zealand | ||||
| KM510192 | BasCV-3 NZ-NZG01 Sef-2012 | Soft brome/Bull grass | Leaf | New Zealand | ||||
| JQ412056 | G14 | Cassava | Leaf | Ghana | ||||
| JQ412057 | G5 | Cassava | Leaf | Ghana | ||||
| KT309029 | 254065908 | Black-capped chickadee | Buccal and cloacal swab | USA | ||||
| KT862243 | 27 Fec79971 chicken | Chicken | Faeces | New Zealand | ||||
| KT862244 | 29 Fec79971 Ilama | Llama | Faeces | New Zealand | ||||
| KT862246 | 30 Fec79971 horse | Horse | Faeces | New Zealand | ||||
| KT862242 | 27 Fec16497 chicken | Chicken | Faeces | New Zealand | ||||
| JX185429 | FL2-5X-2010 | Dragonfly | Abdomen | USA | ||||
| KT862248 | 30 Fec80061 horse | Horse | Faeces | New Zealand | ||||
| KF371638 | as50 | New Zealand fur seal | Faeces | New Zealand | ||||
| KT862241 | 27 Fec1 chicken | Chicken | Faeces | New Zealand | ||||
| KF371636 | P24a | Chatham Island warbler | Faeces | New Zealand | ||||
| KF371637 | P24b | Chatham Island warbler | Faeces | New Zealand | ||||
| KF371639 | P24c | Chatham Island warbler | Faeces | New Zealand | ||||
| KF413620 | VNHJ1W | Hypericum | Leaf | Vietnam | ||||
| KT862245 | 29 Fec80018 llama | Llama | Faeces | New Zealand | ||||
| KT862247 | 30 Fec80018 horse | Horse | Faeces | New Zealand | ||||
| KF371635 | as24 | Mallard duck | Faeces | New Zealand | ||||
| KJ641719 | BtMf-CV-23/GD2012 | Bat | Pharyngeal & rectal swabs | China | ||||
| KP263547 | 478d | Egyptian mongoose | Faeces | Portugal | ||||
| HQ335086 | SDBVL G | Mosquito | Mosquito samples | USA | ||||
| KM598385 | OdaGmV-1-US-260BC-12 | Damselfly | Abdomen | USA | ||||
| KM598386 | OdaGmV-1-US-260SR1-12 | Dragonfly | Abdomen | USA | ||||
| KM598387 | OdaGmV-2-US-1642KW-12 | Dragonfly | Abdomen | USA | ||||
| KM598388 | OdaGmV-2-US-1634LM2-12 | Dragonfly | Abdomen | USA | ||||
| KT253577 | PaGmV-1 TO STO14-29204 2014 | Signalgrass | Leaf | Tonga | ||||
| KT253578 | PaGmV-1 TO STO15-29204 2014 | Signalgrass | Leaf | Tonga | ||||
| KT253579 | PaGmV-1 TO STO18-29204 2014 | Signalgrass | Leaf | Tonga | ||||
| KT862250 | 49 Fec80061 pig | Pig | Faeces | New Zealand | ||||
| KF371640 | as5 | Domestic pig | Faeces | New Zealand | ||||
| KT732804 | Tbat 45285 | Bat | Faeces | Tonga | ||||
| KT732805 | Tbat 47364 | Bat | Faeces | Tonga | ||||
| KT732792 | Tbat 103791 | Bat | Faeces | Tonga | ||||
| KT732793 | Tbat A 103791 | Bat | Faeces | Tonga | ||||
| KT732797 | Tbat A 103852 | Bat | Faeces | Tonga | ||||
| KT732814 | Tbat H 103806 | Bat | Faeces | Tonga | ||||
| KT732801 | Tbat 12377 | Bat | Faeces | Tonga | ||||
| KT732802 | Tbat H 12377 | Bat | Faeces | Tonga | ||||
| KT732796 | Tbat H 103639 | Bat | Faeces | Tonga | ||||
| KT732803 | Tbat 103951 | Bat | Faeces | Tonga | ||||
| KT732807 | Tbat A 103746 | Bat | Faeces | Tonga | ||||
| KT732808 | Tbat A 103909 | Bat | Faeces | Tonga | ||||
| KT732809 | Tbat H 103746 | Bat | Faeces | Tonga | ||||
| KT732810 | Tbat H 103909 | Bat | Faeces | Tonga | ||||
| KT732811 | Tbat L 103746 | Bat | Faeces | Tonga | ||||
| KT732812 | Tbat L 103909 | Bat | Faeces | Tonga | ||||
| KT732806 | Tbat 31579 | Bat | Faeces | Tonga | ||||
| KT732795 | Tbat 21383 | Bat | Faeces | Tonga | ||||
| KT732794 | Tbat H 103958 | Bat | Faeces | Tonga | ||||
| KR912221 | Ch-zjrat-01 | Rat | Blood | China | ||||
| GQ365709 | SsHADV-1 CN | Sclerotinia | Mycelial samples | China | ||||
| KF268025 | SsHADV-1 NZ H6 2012 | – | River Sediments | New Zealand | ||||
| KF268026 | SsHADV-1 NZ SR1 2012 | – | River Sediments | New Zealand | ||||
| KF268027 | SsHADV-1 NZ SR3 2012 | – | River Sediments | New Zealand | ||||
| KF268028 | SsHADV-1 NZ SR5 2012 | – | River Sediments | New Zealand | ||||
| KM598382 | SsHADV-1-US-549LB-12 | Damselfly | Abdomen | USA | ||||
| KM598383 | SsHADV-1-US-549DFS-12 | Dragonfly | Abdomen | USA | ||||
| KM598384 | SsHADV-1-US-549SR-12 | Dragonfly | Abdomen | USA | ||||
| KJ547638 | BS3917 | – | Sewage | New Zealand | ||||
| KM821747 | SaGmV-1 NZ-BS3970-2012 | – | Sewage | New Zealand | ||||
| KJ547641 | BS4117 | – | Sewage | New Zealand | ||||
| KJ547636 | BS4014 | – | Sewage | New Zealand | ||||
| KJ547637 | BS3939 | – | Sewage | New Zealand | ||||
| KJ547640 | BS3972 | – | Sewage | New Zealand | ||||
| KJ547639 | BS3970 | – | Sewage | New Zealand | ||||
| KT862249 | 47 Fec80064 sheep | Sheep | Faeces | New Zealand | ||||
| KT862251 | 51 Fec80064 sheep | Sheep | Faeces | New Zealand | ||||
| KT598248 | SlaGemV1-1 | Soybean | Leaf | USA | ||||
| JX185428 | TO-DFS3B2-2010 | Dragonfly | Abdomen | Tonga | ||||
| KT862254 | 53 Fec7 dog | Dog | Faeces | New Zealand | ||||
| JN704610 | VS4700006 | European badger | Rectal swab | Netherlands | ||||
| KT862238 | 4 Fec7 duck | Duck | Faeces | New Zealand | ||||
| KT862239 | 24 Fec7 duck | Duck | Faeces | New Zealand | ||||
| KT732790 | Tbat A 103952 | Bat | Faeces | Tonga | ||||
| KT732791 | Tbat H 103952 | Bat | Faeces | Tonga | ||||
| KJ413144 | 349 | Human | Cervical sample | South Africa | ||||
| KJ547635 | BS3963 | – | Sewage | New Zealand | ||||
| KF371632 | P14 | European starling | Faeces | New Zealand | ||||
| KP263543 | 588t | European badger | Faeces | Portugal | ||||
| KF371634 | P21 | Chatham Island black robin | Faeces | New Zealand | ||||
| KF371633 | P22 | Blackbird | Faeces | New Zealand | ||||
| LK931483 | HCBI8.215 | Cow | Serum | Germany | ||||
| JX185430 | FL1-2X-2010 | Dragonfly | Abdomen | USA | ||||
| KJ547644 | BS3980 | – | Sewage | New Zealand | ||||
| KJ547645 | BS3849 | – | Sewage | New Zealand | ||||
| KP974694 | DB2 | Human | Plasma | Germany | ||||
| LK931485 | MSSI2.225 | Human | Blood | Germany | ||||
| KP133075 | SL1 | Human | Cerebrospinal fluid | Sri Lanka | ||||
| KP133076 | SL2 | Human | Cerebrospinal fluid | Sri Lanka | ||||
| KP133077 | SL3 | Human | Cerebrospinal fluid | Sri Lanka | ||||
| KP133078 | BZ1 | Human | Faeces | Brazil | ||||
| KP133079 | BZ2 | Human | Faeces | Brazil | ||||
| KP133080 | NP | – | Sewage | Nepal | ||||
| KP263546 | 541c | Egyptian mongoose | Faeces | Portugal | ||||
| KP987887 | GemyC1c | Human | Plasma | France | ||||
| KT363839 | GeTz1 | Human | Cerebrospinal fluid | China | ||||
| KU343137 | HV-GcV2 | Human | Pericardial fluid | France | ||||
| KP263545 | 160b | Egyptian mongoose | Faeces | Portugal | ||||
| KT732813 | Tbat A 64418 | Bat | Faeces | Tonga | ||||
| KJ641737 | BtRh-CV-6/Tibet2013 | Bat | Pharyngeal & rectal swabs | China | ||||
| KP263544 | 181a | Egyptian mongoose | Faeces | Portugal | ||||
| KJ641726 | BtRf-CV-8/NM2013 | Bat | Pharyngeal & rectal swabs | China | ||||
| KJ547643 | BS4149 | – | Sewage | New Zealand | ||||
| KT862240 | 27 BS14149 chicken | Chicken | Faeces | New Zealand | ||||
| KT862252 | 52 BS14149 cow | Cow | Faeces | New Zealand | ||||
| KT862255 | 56 BS14149 hare | Hare | Faeces | New Zealand | ||||
| KJ547642 | BS3911 | – | Sewage | New Zealand | ||||
| KT732798 | Tbat A 103779 | Bat | Faeces | Tonga | ||||
| KT732799 | Tbat H 103779 | Bat | Faeces | Tonga | ||||
| KT732800 | Tbat H 103921 | Bat | Faeces | Tonga | ||||
| LK931484 | HCBI9.212 | Cow | Serum | Germany | ||||
| KJ938717 | FaGmCV-13 | Caribou | Faeces | Canada | ||||
| KJ547634 | BS3913 | – | Sewage | New Zealand | ||||
| KF371631 | as35 | Rabbit | Faeces | New Zealand | ||||
| KF371630 | as3 | Ostrich | Faeces | New Zealand | ||||
| KP974693 | DB1 | Human | Plasma | Germany |
Figure 1.Distribution of (A) genome-wide, (B) Rep and (C) CP pairwise identities (121 taxa) of genomoviruses calculated using SDT v1.2 (Muhire, Varsani, and Martin 2014).
Figure 2.Maximum likelihood phylogenetic tree of the Rep amino acid sequences inferred using PHYML (Guindon et al. 2010) with LG + G+I substitution model and rooted with geminivirus sequences. The sequences of geminiviruses labeled with the corresponding genera names are used as a guide to identify genera within the Genomoviridae family. The cyan line shows a proposed demarcation of genera for both Genomoviridae and Geminiviridae. Branches with <75% SH-like branch support have been collapsed.
Figure 3.Maximum likelihood phylogenetic tree of the genomes of viruses in the Genomoviridae family. The tree was inferred using FastTree (Price, Dehal, and Arkin 2010) (GTR + CAT). The numbers at the branches indicate SH-like support values. The topology of tree supports the proposed genera demarcation at the genome level, despite there being evidence of recombination within the genomes. Branches with <75% SH-like branch support have been collapsed.
Figure 4.Maximum likelihood phylogenetic tree of the CP amino acid sequences inferred using PHYML (Guindon et al. 2010) with LG + G+I substitution models and rooted with geminivirus sequences. Branches with <75% SH-like branch support have been collapsed.
Figure 5.Summary of genera and the associated species and their diversity (within genera) within the Genomoviridae family.
Figure 6.Summary of conserved motifs, that is nonanucleotide and Rep motifs illustrated using WebLogo3 (Crooks et al. 2004) identified in the family Genomoviridae as a whole and its nine genera separately. Note the highly derived Walker A motif (GPHRRRRT) in the sole member of the genus Gemytondvirus.