| Literature DB >> 35891454 |
Fabiola Villanova1, Flávio Augusto de Padua Milagres2,3, Rafael Brustulin2, Emerson Luiz Lima Araújo4, Ramendra Pati Pandey5, V Samuel Raj5, Xutao Deng6,7, Eric Delwart6,7, Adriana Luchs8, Antonio Charlys da Costa9, Élcio Leal1.
Abstract
Putative replication-associated protein (REP) and capsid-like (CAP) proteins are encoded by circular single-stranded DNA viruses (CRESS DNA), which have been found in samples from most eukaryotic groups. However, the details of these viruses' life cycles and their significance in diseases have yet to be established. We presented and analyzed two full-length CRESS DNA genomes acquired from two children diagnosed with acute gastroenteritis (GI) in the northeast state of Tocantins, Brazil, using next-generation sequencing and a virus-like filtration approach. Both sequences (named SmaCV3BR08 and SmaCV3BR291) are closely similar to a prior CRESS DNA sequence discovered in the feces of a new world monkey (Alouatta caraya) from the United States in 2009 and termed Howler monkey-associated porprismacovirus 1 (Genbank ID: NC 026317). According to our comparative study, these porprismacovirus genomes deviate by 10% at the nucleotide level. For comparative reasons, the divergence between our sequences (SmaCV3BR08 and SmaCV3BR291) and a porprismacovirus recently identified in a human fecal sample from Peru is 37%. These data suggest that there is a great diversity of porprismacoviruses in South America, perhaps more than two species. In addition, the finding of closely related sequences of porprismacoviruses in humans and native monkeys highlights the zoonotic potential of these viruses.Entities:
Keywords: CRESS DNA; Smacoviridae; metagenomics; porprismacovirus; virus-like particles
Mesh:
Substances:
Year: 2022 PMID: 35891454 PMCID: PMC9319269 DOI: 10.3390/v14071472
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Nucleotide identity of complete genomes of members of smacovirus.
| Genbank ID | Host */Date/Location | SmaCV3BR08 | SmaCV3BR291 |
|---|---|---|---|
| Identity | |||
| NC026317 (2485) | 0.874 | 0.884 | |
| MG522850 (2511) | 0.650 | 0.643 | |
| KX838317 (2581) | 0.640 | 0.643 | |
| NC026318 (2529) | 0.660 | 0.654 | |
| MH933796 (2538) | 0.650 | 0.651 | |
| MF347417 (2560) | 0.644 | 0.646 | |
* this refers to the species in which fecal samples were collected. 1 Base pairs.
Figure 1Genomic annotation of Brazilian porprismacoviruses. (a) Genomic map of the sequence SmaCV3BR08 and (b) map of the sequence SmaCV3BR291. Both sequences were annotated according to the predicted location of open reading frames of the reference sequence NC_026317 of CRESS-DNA viruses (Family: Cremevirales; Genera: Smacoviridae; species: Porprismacovirus). Rep denotes the replicase protein, and CAP denotes the capsid protein of these viruses. The sizes of Rep and Cap proteins are indicated in base pairs in the gray rectangles. The TAGTGTTAC loop is indicated in a magenta rectangle. Inverted repeats are indicated in green rectangles. The blue line perpendicular to the circular genome indicates the first nucleotide of the sequence.
Figure 2Amino acid composition of CAP and REP. Comparison of the predicted CAP and REP proteins between sequences generated in this study (SmaCV3BR08 and SmaCV3BR291) and the Howler monkey-associated porprismacovirus 1 (NC_026317). Upper panels show a matrix indicating number of distinct acid acids and the sequence of residues in the homolog region of the predicted CAP protein. The lower panels show the differences in amino acid composition and the sequences of predicted REP protein in Howler monkey-associated porprismacovirus, SmaCV3BR08 and SmaCV3BR291.
Figure 3Phylogenetic analysis of complete genomes of smacoviruses. Maximum likelihood tree inferred assuming the GTR model. The tree is mid-rooted, and the statistical branch support (based on the Shimodaira–Hasegawa test) is indicated in a blue color scale. Lower horizontal bar indicates the nucleotide substitutions per site. Porprismacoviruses identified in this study are indicated by blue dots. Red dots indicate sequences used for nucleotide identity (Table 1 in the main text). The genetic divergence of some clades is indicated in the gray rectangles. Sequences are colored according to their hosts (upper left panel).