| Literature DB >> 30962460 |
Paulina D Anindita1, Michihito Sasaki2, Gabriel Gonzalez3, Wallaya Phongphaew1, Michael Carr4,5, Bernard M Hang'ombe6,7,8, Aaron S Mweene9,7,8, Kimihito Ito3,4, Yasuko Orba1, Hirofumi Sawa1,4,7,8,10.
Abstract
The Smacoviridae has recently been classified as a family of small circular single-stranded DNA viruses. An increasing number of smacovirus genomes have been identified exclusively in faecal matter of various vertebrate species and from insect body parts. However, the genetic diversity and host range of smacoviruses remains to be fully elucidated. Herein, we report the genetic characterization of eleven circular replication-associated protein (Rep) encoding single-stranded (CRESS) DNA viruses detected in the faeces of Zambian non-human primates. Based on pairwise genome-wide and amino acid identities with reference smacovirus species, ten of the identified CRESS DNA viruses are assigned to the genera Porprismacovirus and Huchismacovirus of the family Smacoviridae, which bidirectionally encode two major open reading frames (ORFs): Rep and capsid protein (CP) characteristic of a type IV genome organization. The remaining unclassified CRESS DNA virus was related to smacoviruses but possessed a genome harbouring a unidirectionally oriented CP and Rep, assigned as a type V genome organization. Moreover, phylogenetic and recombination analyses provided evidence for recombination events encompassing the 3'-end of the Rep ORF in the unclassified CRESS DNA virus. Our findings increase the knowledge of the known genetic diversity of smacoviruses and highlight African non-human primates as carrier animals.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30962460 PMCID: PMC6453971 DOI: 10.1038/s41598-019-41358-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sample information and results of PCR screening.
| Sample | Non-human primate species (common name) | Corresponding primer | Virus | Abbreviation | Accession number |
|---|---|---|---|---|---|
| ZM09#51 | ssDNAV-3 | PcSmV3-ZM09-51 | LC386205 | ||
| ZM09#64 | ssDNAV-1 | PkSmV1-ZM09-64 | LC386203 | ||
| ZM09#71 | ssDNAV-2 | PcSmV2-ZM09-71 | LC386204 | ||
| ZM09#72 | ssDNAV-6 | PcSmV6-ZM09-72 | LC386201 | ||
| ZM09#74 | ssDNAV-3 | PcSmV3-ZM09-74 | LC386197 | ||
| ZM09#76 | ssDNAV-3 | PkSmV3-ZM09-76 | LC386198 | ||
| ZM09#83 | ssDNAV-5 | CcSmV5-ZM09-83 | LC386200 | ||
| ZM09#86 | ssDNAV-1 | CcSmV1-ZM09-86 | LC386195 | ||
| ZM09#95 | ssDNAV-4 | CcSmV4-ZM09-95 | LC386199 | ||
| ZM09#96 | ssDNAV-1 | CcSmV1-ZM09-96 | LC386196 | ||
| ZM09#96 | ssDNAV-6 | CcSmV6-ZM09-96 | LC386202 |
Figure 1Genome organization of the representative smacovirus carrying a ambisense type IV genome (A) and CRESS DNA virus carrying a unisense type V genome (B) from Zambian NHPs showing ORFs (Rep, rolling replication-associated protein; CP, capsid protein), predicted stem loop and nonanucleotide.
Genome features of Zambian non-human primate CRESS DNA viruses.
| Virus | Genome length (nt) | Rep (aa) | CP (aa) | Nonanucleotide motif | RCR motif I | RCR motif II | RCR motif III | Walker A | Walker B | Walker C |
|---|---|---|---|---|---|---|---|---|---|---|
| PkSmV1-ZM09-64 | 2544 | 259 | 368 | AGGTCTTAC | ATVWVD | HFQFRG | YVYK | SRGNWGKT | LVDTP | VLCN |
| CcSmV1-ZM09-86 | 2691 | 284 | 369 | AATGATTAC | ATIDR | HWQCRF | YETK | PTGNIGKS | IIDIP | VMTN |
| CcSmV1-ZM09-96 | 2659 | 268 | 182 | AAAAATTAC | VTMGR | HWQVRF | YEAK | ETGNRGKS | VIDIP | VLTN |
| PcSmV2-ZM09-71 | 2766 | 271 | 359 | AATGATTAC | CTVPN | HFQLRC | YVEK | RVGGRGKT | VIDMP | VITN |
| PcSmV3-ZM09-51 | 2540 | 269 | 312 | TAGTGTTAC | MTAPR | HWQCRI | YEAK | ETGNVGKS | IIDVP | VMTN |
| PcSmV3-ZM09-74 | 2488 | 245 | 330 | TAGTATTAC | GTAPR | HWQIRI | YEAK | ETGNVGKS | IIDVP | VMTN |
| PkSmV3-ZM09-76 | 2542 | 269 | 330 | TAGTGTTAC | MTAPR | HWQCRI | YEAK | ETGNRGKS | IIDVP | VMTN |
| CcSmV4-ZM09-95 | 2656 | 260 | 271 | TAGTATTAC | LTIPR | HWQVVV | YCRK | VRGGHGKT | WIDLP | VTTN |
| CcSmV5-ZM09-83 | 2617 | 261 | 354 | TAGTATTAC | GTISA | HYQYTV | YCKK | GGGHGKT | WIDLP | VTTN |
| PcSmV6-ZM09-72 | 2613 | 269 | 353 | TAGTATTAC | GTISA | HFQYYI | YCKK | VGGAHGKT | WIDIP | VTTN |
| CcSmV6-ZM09-96 | 2537 | 269 | 354 | TAGTATTAC | GTISA | HFQYCI | YCKK | VGGAHGKT | WIDLP | VTTN |
Pairwise sequence identity among Zambian non-human primate CRESS DNA viruses and known smacoviruses.
| Species | Isolate | Closest genome sequence | Identity (%) | Closest genome sequence of known smacovirus species | Identity (%) | Closest Rep sequence of known smacovirus species | Identity (%) | Genus |
|---|---|---|---|---|---|---|---|---|
| 1 | (LC386203) PkSmV1-ZM09-64 | * | * | (KT862223) Bovine faeces associated smacovirus 1 | 42.73 | unassigned | ||
| 2 | (LC386204) PcSmV2-ZM09-71 | (LC386198) PkSmV3-ZM09-76 | 56.58 | (KP233194) Lemur smacovirus | 55.46 | (KT862223) Bovine faeces associated smacovirus 1 | 46.86 |
|
| 3 | (LC386195) CcSmV1-ZM09-86 | (LC386196) CcSmV1-ZM09-96 | 82.29 | (KT600068) Human feces smacovirus 2 | 64.69 | (KT600068) Human feces smacovirus 2 | 75.18 |
|
| 3 | (LC386196) CcSmV1-ZM09-96 | (LC386195) CcSmV1-ZM09-86 | 82.29 | (KT600068) Human feces smacovirus 2 | 62.44 | (GQ351274) Chimpanzee stool associated circular ssDNA virus | 64.48 |
|
| 4 | (LC386205) PcSmV3-ZM09-51 | (LC386198) PkSmV3-ZM09-76 | 99.80 | (KJ577810) Porcine stool associated circular virus 1 | 67.15 | (JX274036) Porcine stool associated circular virus | 81.07 |
|
| 4 | (LC386197) PcSmV3-ZM09-74 | (LC386198) PkSmV3-ZM09-76 | 92.86 | (KJ577810) Porcine stool associated circular virus 1 | 67.84 | (JX274036) Porcine stool associated circular virus | 83.95 |
|
| 4 | (LC386198) PkSmV3-ZM09-76 | (LC386205) PcSmV3-ZM09-51 | 99.80 | (KJ577810) Porcine stool associated circular virus 1 | 67.32 | (JX274036) Porcine stool associated circular virus | 81.48 |
|
| 5 | (LC386199) CcSmV4-ZM09-95 | (LC386201) PcSmV6-ZM09-72 | 79.62 | (KT600068) Human feces smacovirus 2 | 55.70 | (KT862221) Sheep faeces associated smacovirus 2 | 42.08 |
|
| 5 | (LC386201) PcSmV6-ZM09-72 | (LC386199) CcSmV4-ZM09-95 | 79.62 | (KT600068) Human feces smacovirus 2 | 55.12 | (KT862218) Bovine faeces associated smacovirus 2 | 39.24 |
|
| 6 | (LC386200) CcSmV5-ZM09-83 | (LC386202) CcSmV6-ZM09-96 | 80.14 | (KT600068) Human feces smacovirus 2 | 55.00 | (KT862218) Bovine faeces associated smacovirus 2 | 40.92 |
|
| 6 | (LC386202) CcSmV6-ZM09-96 | (LC386200) CcSmV5-ZM09-83 | 80.14 | (KP233194) Lemur smacovirus | 56.65 | (KT862218) Bovine faeces associated smacovirus 2 | 39.66 |
|
*Not comparable due to the inversion of the replication-associated protein ORF.
Figure 2Phylogenetic tree constructed based on the whole genome nucleotide sequences of Zambian non-human primate CRESS DNA viruses and reference smacoviruses. Tree topology following the maximum likelihood (ML) approach is shown and annotated with the support of the posterior probability from the Bayesian inference approach. Smacoviruses identified in the study are shown in red color. The sequence of PkSmV1-ZM09-64 was not included in the analysis due to the unisense genome organization of the ORFs (Rep and CP).
Figure 3Phylogenetic tree constructed based on the CP amino acid sequences of Zambian non-human primate CRESS DNA viruses and reference smacoviruses. Tree topology following the ML approach is shown and annotated with the support of the posterior probability from the Bayesian inference approach. Smacoviruses identified in this study are shown in red color.
Figure 4Phylogenetic tree constructed based on the Rep amino acid sequences of Zambian non-human primate CRESS DNA viruses and other known smacoviruses. Tree topology following the ML approach is shown and annotated with the support of the posterior probability from the Bayesian inference approach. Smacoviruses identified in the study are shown in red color.
Figure 5Recombination analysis of smacovirus species. (A) Phylogenetic compatibility matrix over the species of smacoviruses including ZM-SmVs. The respective regions in the smacoviruses alignment correspond to the normalized Robinson-Foulds distance between two neighbor-joining (NJ) trees. (B) Distribution of recombination breakpoints within the smacoviruses alignment. Breakpoints detected by recombination detection program (RDP) analysis are shown at top of the panel. The vertical axis is the number of recombination boundaries per window. The solid black line indicates the number of observed breakpoints. The grey and red areas indicate 95 and 99% confidence intervals of each breakpoint, respectively.