| Literature DB >> 31991902 |
Joseph P Orton1, Matheo Morales1, Rafaela S Fontenele1,2, Kara Schmidlin1,2, Simona Kraberger2, Daniel J Leavitt3, Timothy H Webster1,4, Melissa A Wilson1,5, Kenro Kusumi1, Greer A Dolby1, Arvind Varsani1,2,5,6.
Abstract
The Sonoran Desert tortoise Gopherus morafkai is adapted to the desert, and plays an important ecological role in this environment. There is limited information on the viral diversity associated with tortoises (family Testudinidae), and to date no DNA virus has been identified associated with these animals. This study aimed to assess the diversity of DNA viruses associated with the Sonoran Desert tortoise by sampling their fecal matter. A viral metagenomics approach was used to identify the DNA viruses in fecal samples from wild Sonoran Desert tortoises in Arizona, USA. In total, 156 novel single-stranded DNA viruses were identified from 40 fecal samples. Those belonged to two known viral families, the Genomoviridae (n = 27) and Microviridae (n = 119). In addition, 10 genomes were recovered that belong to the unclassified group of circular-replication associated protein encoding single-stranded (CRESS) DNA virus and five circular molecules encoding viral-like proteins.Entities:
Keywords: Arizona; CRESS DNA viruses; Genomoviridae; Gopherus morafkai; Microviridae
Mesh:
Substances:
Year: 2020 PMID: 31991902 PMCID: PMC7077246 DOI: 10.3390/v12020143
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Summary of the rolling circle replication (RCR) endonuclease motifs (Motif I, II, II) and superfamily 3 (SF3) helicase motifs (Walker A, Walker B, and Motif C) encoded genomoviruses, unclassified CRESS DNA viruses and Rep-encoding circular molecules identified in this study.
| Rolling Circle Replication (RCR) Endonuclease | Superfamily 3 (SF3) Helicase Motifs | ||||||
|---|---|---|---|---|---|---|---|
| Genus/Group | Accession # | Motif I | Motif II | Motif III | Walker A | Walker B | Motif C |
| Gemykolovirus | MK570209 | MLTYSD | PHFHC | RRWDYVGK | GATRLGKTVWAR | VFDDI | WLCN |
| MK570213 | FLTYSN | PHFHC | RRWDYVGK | GETRLGKTVWAR | IFDDI | WICN | |
| Gemykibivirus | MK570202 | LLTYPQ | VHLHA | KGAAYAIK | GGTRLGKTLWAR | VFDDM | YISN |
| MK570214 | LLTYPQ | VHLHA | KGAAYAIK | GGTRLGKTLWAR | IFDDM | YISN | |
| MK570215 | LLTYPQ | VHLHA | KGAAYAIK | GGTRLGKTLWAR | IFDDM | YISN | |
| MK570205 | LLTYPQ | IHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MK570211 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MK570216 | LLTYPQ | VHLHA | KGYAYAVK | GPTRLGKTLWAR | VFDDM | YISN | |
| MK570207 | LLTYPQ | IHLHA | KGYAYATK | GPTRLGKTLWAR | VFDDM | YISN | |
| MK570208 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MK570201 | LLTYPQ | IHLHA | KGYAYAIK | GPTRLGKTLWAR | IFDDM | YISN | |
| MK570203 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | IFDDM | YISN | |
| MF373640 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MK570206 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MF373641 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MF373638 | LLTYPQ | VHLHA | KGYAYAIK | GPTRLGKTLWAR | VFDDM | YISN | |
| MF373639 | LLTYPQ | VHLHA | KGYAYAVK | GPTRLGKTLWAR | VFDDM | YISN | |
| Gemycircularvirus | MK570218 | LITYAQ | VHLHA | KGYDYAIK | GDSQLGKTVWAR | IFDDM | WLCN |
| MK570204 | LLTYPQ | IHLHA | KGYDYAIK | GDSQLGKTVWAR | VFDDM | WLCN | |
| MK570212 | LLTYPQ | FHLHA | KGYDYAIK | GDSQLGKTLWAR | VFDDM | WLCN | |
| MK570210 | LLTYAQ | IHLHV | TMYDYAIK | GESRLGKTVWAR | VFDDM | WLAN | |
| MK570223 | LFTYAQ | IHFHV | TAYDYACK | GPYGCGKTVWAR | IFDDW | WLCN | |
| MK570217 | LITYSQ | VHLHC | KGWDYACK | GASQTGKTLWAR | VFDDI | WLSN | |
| MK570222 | LITYSQ | IHLHC | KGWDYACK | GASQTGKTLWAR | VFDDI | WLSN | |
| MK570219 | LITYSQ | IHLHC | KGYDYAIK | GASQTGKTLWAR | VFDDI | WLSN | |
| MK570220 | LITYSQ | IHLHC | KGYDYAIK | GASQTGKTLWAR | VFDDI | WLSN | |
| MK570221 | LLTYAQ | SHLHC | AGFDYACK | GEPLTGKTDWAR | IFDDI | WCAN | |
| Unclassified CRESS DNA viruses | MK858252 | FLTYPQ | DHLHA | DVYNYVIK | GPSKTGKTQWAR | VIDDM | ILCN |
| MK858253 | FLTYPQ | DHLHA | DVYNYVIK | GPSKTGKTQWAR | VIDDM | ILCN | |
| MK858254 | FLTYPQ | DHLHA | DVYNYVTK | GPSKTGKTAWAR | VIDDM | ILCN | |
| MK858255 | FLTYPQ | NHLHV | DVYAYITK | GASKTGKTQWAR | VLDDL | ILCN | |
| MK858256 | FLTYPR | DHLHV | HVYRYVRK | GPSKTGKTEWAR | IFDDL | ILCN | |
| MK858257 | FLTFAR | DHRHV | GARQYTQK | GPSKTGKTHWAR | VFDDL | ILCN | |
| MF373642 | FLTYPQ | PHLHC | AVRRYCSK | GNTETGKTTLAK | ILDDM | ITSN | |
| MK858265 | LVTWSQ | LHYHA | DALAYVKK | GPTGSGKTRCAI | IFDDM | - | |
| MK858262 | CKKYRR | PHIQG | ECVTYCKK | GPSGVGKTREVE | VFDDF | RITN | |
| MK858258 | - | - | - | GGSNTGKTTYLR | WIDEF | TLKN | |
| Rep-encoding circular molecules | MK858259 | CFTWNN | PHIQG | DNFKYCTK | GPAGTGKTTWGR | CIEDY | VTSN |
| MK858260 | CFTWNN | PHIQG | DNFKYCTK | GPAGTGKTTWGR | CIEDY | VTSN | |
| MK858263 | LLTFNN | YHTHL | ENRAYVLK | GETGTGKTSSVM | LFDEF | LVSN | |
| MK858264 | - | YHTHL | ENREYIRK | GSTGTGKTSYVM | LFDEF | IISN | |
Figure 1A. Phylogenetic relationship of the Rep protein sequences of genomoviruses with genus level clustering. B. Maximum likelihood phylogenetic trees of the Reps sequences of genomoviruses identified in this study that belong to the genera Gemykolovirus, Gemykibivirus and Gemycircularvirus. For each tree, a detailed view of the clusters that have the Reps from this study is provided to the right of the genus level trees. Next to each accession number the source of the sequence is provided.
Pairwise identity comparisons (showing highest identity) of the genome, and the replication associated protein (Rep) and capsid protein (CP) amino acid sequences of the genomoviruses identified in this study against themselves and those available in GenBank.
| Genomoviruses from This Study | Other Genomoviruses | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Query | Genome | Rep | CP | Genome | Rep | CP | |||||||
| Genus | Accession | % ID | Accession # | % ID | Accession # | % ID | Accession # | % ID | Accession # | % ID | Accession # | % ID | Accession # |
| Gemykolovirus | MK570209 | 65.21 | MK570213 | 69.97 | MK570213 | 45.45 | MK570221 | 70.08 | MK939374 | 81.38 | MK939374 | 49.16 | KT862242 |
| MK570213 | 65.21 | MK570209 | 69.97 | MK570209 | 44.44 | MK570223 | 66.96 | MH545501 | 71.26 | MK939374 | 47.42 | MH545501 | |
| Gemykibivirus | MF373638 | 99.82 | MF373641 | 99.69 | MF373641 | 100.00 | MF373641 | 99.22 | MK249293 | 98.46 | MK249293 | 99.67 | MK249293 |
| MF373639 | 99.73 | MF373641 | 99.69 | MF373641 | 99.67 | MF373640 | 99.31 | MK249293 | 98.46 | MK249293 | 100.00 | MK249293 | |
| MF373640 | 96.97 | MF373638 | 97.23 | MF373641 | 100.00 | MF373641 | 96.87 | MK249293 | 97.23 | MK249293 | 99.67 | MK249293 | |
| MF373641 | 99.82 | MF373638 | 99.69 | MF373638 | 100.00 | MF373638 | 99.22 | MK249293 | 98.77 | MK249293 | 99.67 | MK249293 | |
| MK570201 | 96.78 | MF373639 | 98.77 | MK570203 | 99.67 | MF373639 | 96.84 | MK249293 | 94.98 | MK947374 | 99.67 | MK249293 | |
| MK570202 | 99.47 | MK570214 | 99.08 | MK570214 | 100.00 | MK570214 | 90.70 | MK249269 | 83.69 | MK249269 | 98.38 | MK249239 | |
| MK570203 | 94.34 | MK570201 | 98.77 | MK570201 | 98.36 | MK570208 | 95.39 | MK249300 | 95.32 | MK947374 | 99.34 | MK249300 | |
| MK570205 | 95.57 | MK570216 | 95.08 | MK570211 | 99.68 | MK570215 | 97.41 | MK249307 | 99.08 | MK249307 | 98.70 | MK249239 | |
| MK570206 | 96.55 | MF373639 | 99.38 | MF373641 | 99.67 | MK570211 | 91.00 | MK947374 | 98.77 | MK249293 | 98.36 | MK249293 | |
| MK570207 | 97.47 | MK570208 | 97.23 | MK570208 | 97.94 | MK570208 | 92.44 | MK249293 | 96.92 | MK249293 | 97.25 | MK249300 | |
| MK570208 | 97.47 | MK570207 | 97.23 | MK570207 | 98.36 | MK570203 | 92.94 | MK249293 | 96.92 | MK249293 | 98.36 | MK249300 | |
| MK570211 | 94.98 | MK570206 | 95.08 | MK570205 | 99.67 | MK570206 | 92.59 | MK249293 | 97.23 | MK249269 | 98.03 | MK249293 | |
| MK570214 | 99.47 | MK570202 | 99.69 | MK570215 | 100.00 | MK570202 | 90.62 | MK249269 | 83.38 | MK249269 | 98.38 | MK249239 | |
| MK570215 | 98.67 | MK570214 | 99.69 | MK570214 | 100.00 | MK570216 | 90.35 | MK249269 | 83.08 | MK249269 | 99.03 | MK249239 | |
| MK570216 | 95.57 | MK570205 | 94.77 | MF373639 | 100.00 | MK570215 | 94.14 | MK249307 | 94.46 | MK249236 | 99.03 | MK249239 | |
| Gemycircularvirus | MK570204 | 81.28 | MK570212 | 93.99 | MK570212 | 61.76 | MK570210 | 73.52 | MK947372 | 82.53 | MK947372 | 70.10 | MG641202 |
| MK570210 | 67.50 | MK570204 | 58.66 | MK570212 | 61.76 | MK570204 | 76.02 | MK939384 | 92.42 | MK939384 | 75.40 | JQ412056 | |
| MK570212 | 81.28 | MK570204 | 93.99 | MK570204 | 59.28 | MK570218 | 74.28 | MK947372 | 82.83 | MK947372 | 69.81 | MG571096 | |
| MK570217 | 97.56 | MK570222 | 94.80 | MK570222 | 99.02 | MK570222 | 68.13 | KM821747 | 76.76 | KJ547638 | 50.00 | KM510192 | |
| MK570218 | 80.78 | MK570212 | 87.39 | MK570204 | 59.28 | MK570212 | 74.01 | MK947372 | 80.72 | MK947372 | 63.61 | MG571100 | |
| MK570219 | 97.12 | MK570220 | 98.78 | MK570220 | 99.03 | MK570220 | 68.06 | KM821747 | 74.31 | KJ547638 | 53.87 | MK939446 | |
| MK570220 | 97.12 | MK570219 | 98.78 | MK570219 | 99.03 | MK570219 | 67.97 | MK939432 | 74.01 | KJ547638 | 55.08 | MK939446 | |
| MK570221 | 66.65 | MK570204 | 52.00 | MK570210 | 50.33 | MK570204 | 65.90 | MK939384 | 66.03 | MG641197 | 58.36 | KT732806 | |
| MK570222 | 97.56 | MK570217 | 96.94 | MK570219 | 99.02 | MK570217 | 67.91 | KM821747 | 74.62 | KM821747 | 50.43 | KM510192 | |
| MK570223 | 61.64 | MK570210 | 50.76 | MK570210 | 45.54 | MK570204 | 82.49 | MG571087 | 94.22 | MG571087 | 70.39 | KF413620 | |
Figure 2Illustration of the nine events of recombination detected by RDP4 in the genomes of gemycircularviruses (n = 6) and gemykibiviruses (n = 15). The black regions depict the recombinant region, with I–IX denoting the recombination events (see Table 3 for details of recombination). The genome organization of genomoviruses is provided at the top showing the cp and rep genes.
Summary of recombination events identified in the genomes of gemycircularviruses and gemykibiviruses from this study. Major and minor parents indicate sequences (GenBank accession # provided) related to parental sequences that respectively donated the larger and smaller regions of the recombinant genome. For each event the recombination detection method with the most significant associated p-value is indicated in bold. Recombination detection methods: RDP (R), GENCONV (G), BOOTSCAN (B), MAXCHI (M), CHIMERA (C), SISCAN (S) and 3SEQ (T). Sites where the actual breakpoint is undetermined are marked with *.
| Event | Begin | End | Recombinant Sequence(s) | Minor Parental Sequence(s) | Major Parental Sequence(s) | Method | |
|---|---|---|---|---|---|---|---|
| I | 190 | 1017 | MK570210 | Unknown | MH939384 | 2.71 × 10−25 | MCS |
| II | 249 * | 1050 | MK570204, MK570212, MK570218 | Unknown | MK570204 | 6.55 × 10−16 | GBMCST |
| III | 243 * | 1009 | MK570218 | Unknown | MK570204 | 5.10 × 10−14 | GBMCT |
| IV | 1539 * | 1591 | MK570222, MK570217 | MF173067 | MK570220, MK570219 | 7.12 × 10−8 | RGBT |
| V | 23 | 1144 | MK570202, MK570214, MK570215 | MK249302, MK249239, MK249243, MK249246, MK249252, MK249256, MK249268, MK249269, MK249275, MK249287, MK249288, MK249307, MK570205, MK570216 | Unknown | 1.67 × 10−52 | RGBMCST |
| VI | 1275 | 2192 | MK570211 | MK249239, MK249243, MK249246, MK249252, MK249256, MK249268, MK249269, MK249275, MK249287, MK249288, MK249302, MK249307 | MF373638, MF373639, MF373640, MF373641, MK249293, MK570206 | 3.64 × 10−86 | GBMCST |
| VII | 333 | 1113 | MF373638, MF373639, MF373640, MF373641, MK570201, MK570203, MK570206, MK570207, MK570208, MK570211 | MK483084 | MH973737, MH973738, MH973739, MH973740, MK249236, MK249241, MK249251, MK249255, MK249271, MK249276, MK249279, MK249286, MK249301 | 7.69 × 10−30 | GBMCST |
| VIII | 1643 | 1958 | MK570216 | MH973737, MH973738, MH973739, MH973740, MK249236, MK249241, MK249251, MK249255, MK249271, MK249276, MK249279, MK249286, MK249293, MK249301, MK570206 | MK249239, MK249243, MK249246, MK249252, MK249256, MK249268, MK249269, MK249275, MK249287, MK249288, MK249302, MK249307, MK570205 | 5.32 × 10−25 | RGBMCST |
| IX | 1993 | 2211 | MK570205, MK570216 | MF373638, MF373639, MF373640, MF373641, MH973737, MH973738, MH973739, MH973740, MK249236, MK249241, MK249251, MK249255, MK249271, MK249276, MK249279, MK249286, MK249293, MK249296, MK249301, MK570206, MK947374 | MK249239, MK249243, MK249246, MK249252, MK249256, MK249268, MK249269, MK249275, MK249287, MK249288, MK249302 | 7.83 × 10−22 | GBMCST |
Figure 3Sequence similarity networks showing family-level clustering of the Reps of CRESS DNA viruses and Rep-encoding circular molecules. For each cluster that has a Rep from this study (colored in red), a phylogenetic tree (midpoint rooted) has been inferred for all sequences belonging to the cluster.
Pairwise identities (showing highest identity) of the replication associated protein (Rep) and capsid protein (CP) amino acid sequences of the unclassified CRESS DNA viruses and circular molecules identified in this study against themselves and against those available in GenBank.
| Unclassified CRESS DNA Viruses from This Study | Other Unclassified CRESS DNA Viruses | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Query | Rep | CP | Rep | CP | |||||
| Virus Group/Molecules | Accession # | Accession # | %ID | Accession # | %ID | Accession # | %ID | Accession # | %ID |
| Unclassified CRESS DNA virus | MF373642 | MK858254 | 29.76 | - | - | KY487833 | 42.00 | - | - |
| MK858252 | MK858253 | 100.00 | MK858253 | 99.00 | MK570182 | 78.42 | MK858254 | 38.68 | |
| MK858253 | MK858252 | 100.00 | MK858253 | 99.00 | MK570182 | 78.42 | MK858254 | 38.68 | |
| MK858254 | MK858252 | 81.16 | MK858255 | 39.16 | MK570182 | 75.99 | MK570182 | 39.25 | |
| MK858255 | MK858252 | 79.03 | MK858254 | 39.25 | MK570182 | 83.57 | MK570165 | 53.64 | |
| MK858256 | MK858255 | 65.75 | MK858257 | 33.47 | MK570182 | 64.44 | MK570168 | 33.46 | |
| MK858257 | MK858256 | 56.23 | MK858254 | 25.21 | MK570165 | 54.08 | MK570168 | 42.45 | |
| MK858258 | MK858260 | 35.57 | MK858261 | 52.65 | MK570170 | 45.86 | - | - | |
| MK858262 | MK858265 | 30.80 | MH616676 | 46.77 | KY302869 | 28.12 | |||
| MK858265 | MK858262 | 30.80 | - | - | KM821751 | 30.80 | - | - | |
| Circular molecule | MK858259 | MK858260 | 99.70 | - | - | MF118167 | 49.34 | - | - |
| MK858260 | MK858259 | 99.70 | - | - | MF118167 | 49.34 | - | - | |
| MK858261 | - | - | MK858258 | 52.65 | - | - | - | - | |
| MK858263 | MK858264 | 72.53 | - | - | MH616676 | 47.17 | - | - | |
| MK858264 | MK858263 | 72.53 | - | - | MH616676 | 47.17 | - | - | |
Figure 4A. Linearized genome representation of the genomes of microviruses identified in this study with color coded open reading frames. B. Pairwise identities of the MCP amino acid sequences (showing highest identity) of the microviruses identified in this study against themselves and against those available in GenBank. Gokushovirinae is an officially recognized sub-family of the family Microviridae. Alphavirinae, Pichovirinae and Parabacteroides group are proposed groups within the family Microviridae [50,51]. Clade group is based on the phylogenetic analysis of the MCPs (Figure 5; Supplementary Data 1).
Figure 5Approximately Maximum-Likelihood cladogram of the MCP sequences (n = 2797). Branches are color coded based on sub-families (Bullavirinae and Gokushovirinae) and Alphavirinae-, Parabacteroides- and Pichovirinae-clades. In addition to these, 22 unique clades are marked with numbers. Branches in grey represent an additional nine singletons and 13 clades. Branches in red denote sequences identified in this study. The outer circle represents taxa with some level of classification assigned prior to this study. Branch support with >0.8 aLRT is shown. Detailed cladogram and phylogram are provided in Supplementary Figures S1 and S2 and taxa names and assignment are provided in Supplementary Data 1.