| Literature DB >> 34834982 |
Marie-Thérèse König1, Robert Fux2, Ellen Link2, Gerd Sutter2, Erwin Märtlbauer1, Andrea Didier1.
Abstract
In recent years, a variety of circular replicase-encoding single-stranded (CRESS) DNA viruses and unclassified virus-like DNA elements have been discovered in a broad range of animal species and environmental samples. Key questions to be answered concern their presence in the human diet and their potential impact on disease emergence. Especially DNA elements termed bovine meat and milk factors (BMMF) are suspected to act as co-factors in the development of colon and breast cancer. To expand our knowledge on the occurrence of these potential pathogens in human nutrition, a total of 73 sheep and 40 goat milk samples were assayed by combining rolling circle amplification (RCA), PCR and Sanger sequencing. The present study further includes retail milk from the aforementioned species. We recovered 15 single stranded (ss) circular genomes. Of those, nine belong to the family Genomoviridae and six are members of the unclassified group of BMMF. Thus, dairy sheep and goats add to dispersal of CRESS viruses and circular ssDNA elements, which enter the food chain via milk. The presence of these entities is therefore more widespread in Bovidae than initially assumed and seems to be part of the common human nutrition.Entities:
Keywords: BMMF; Cressdnaviricota; Genomoviridae; circular ssDNA; colon/breast cancer; milk; small ruminants
Mesh:
Substances:
Year: 2021 PMID: 34834982 PMCID: PMC8621823 DOI: 10.3390/v13112176
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Overview of all circular genomes detected in this study with accession numbers, sequence length, DNA-types, species and origin of isolation. Interestingly, two genomoviral sequences were isolated from sheep No. 115.
| Isolate | Accession | Length in Nucleotides (nt) | DNA-Type | Species | Origin |
|---|---|---|---|---|---|
| SmGV1 | OK148616 | 2191 | Genomovirus | Sheep No. 109 | Flock B |
| SmGV2 | OK148617 | 2192 | Genomovirus | Sheep No. 110 | Flock B |
| SmGV3 | OK148618 | 2190 | Genomovirus | Sheep No. 111 | Flock B |
| SmGV4 | OK148619 | 2190 | Genomovirus | Sheep No. 113 | Flock B |
| SmGV5 | OK148620 | 2191 | Genomovirus | Sheep No. 115 | Flock B |
| SmGV6 | OK148621 | 2124 | Genomovirus | Sheep No. 115 | Flock B |
| SmGV7 | OK148622 | 2190 | Genomovirus | Sheep No. 117 | Flock B |
| SmGV8 | OK148623 | 2125 | Genomovirus | Sheep No. 151 | Flock C |
| GmGV9 | OK148624 | 2122 | Genomovirus | Goat | Retail |
| SmI1 | OK148625 | 2036 | Sphinx 1.76-like | Sheep No. 153 | Flock C |
| GmI2 | OK148626 | 2092 | Sphinx 1.76-like | Goat | Retail |
| SmI3 | OK148627 | 2522 | Sphinx 1.76-like | Sheep | Retail |
| SmI4 | OK148628 | 2093 | Sphinx 1.76-like | Sheep | Retail |
| GmI5 | OK148629 | 1765 | Sphinx 1.76-like | Goat | Retail |
| GmI6 | OK148630 | 2522 | Sphinx 1.76-like | Goat | Retail |
Abbreviations: SmGV—sheep milk genomovirus; GmGV-goat milk genomovirus; SmI-sheep milk isolate: GmI–goat milk isolate.
Figure 1(A) Genome-wide pairwise nucleotide similarity score matrix including full-length genomoviruses from this study and representative members from nine genera of the family Genomoviridae. Sequences from this study are highlighted in red. (B) Schematic genome organization of the herein identified genomoviral sequences shows at least two major open reading frames and a stem-loop structure in the intergenic region.
Figure 2Maximum Likelihood phylogenetic tree of representative sequences from nine genomoviral genera and genomoviral sequences from this study. The evolutionary history was inferred by using the Maximum Likelihood method and Tamura-Nei model [37]. The bootstrap consensus tree inferred from 500 replicates is taken to represent the evolutionary history of the taxa analyzed [38]. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches [38]. Branch support values lower than 60% were not included. This analysis involved 39 nucleotide sequences. There were a total of 3641 positions in the final dataset. Evolutionary analyses were conducted in MEGA software. All taxa are indicated by name and the corresponding GenBank accession number.
Summary of conserved motifs in genomoviral sequences after intron clearing and in silico translation.
| Isolate | Nonanucleotide | Motif I | Motif II | GRS Motif | Motif III | Walker A | Walker B | Walker C |
|---|---|---|---|---|---|---|---|---|
| SmGV1 | TAATATTAT | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | YVGK | GPTRTGKT | VFDDI | MCMN |
| SmGV2 | TAATATTAT | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | YVGK | GPTRTGKT | VFDDI | MCMD |
| SmGV3 | TAATATTAT | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | YVGK | GPTRTGKT | VFDDI | MCMN |
| SmGV4 | TAATATTAT | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | YVGK | GPTRTGKT | VFDDI | MCMN |
| SmGV5 | TAATATTAT | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | YVGK | GPTRTGKT | VFDDI | MCMN |
| SmGV6 | TAATATTAC | LFTYSQ | THLHA | RKFDVVGFHPNIISTI | YATK | GPSRTGKT | VFDDI | WLSN |
| SmGV7 | TAATATTAT | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | YVGK | GPTRTGKT | VFDDI | MCMN |
| SmGV8 | TAATATTAC | LFTYSQ | THLHA | RKFDVEGFHPNIISTI | YATK | GPSRTGKT | VFDDI | WLSN |
| GmGV9 | TAATATTAT | IIMFPQ | IHYHI | TAFDYFGAHGNIKSIR | YVGK | GPTRTGKT | VFDDI | MCMN |
Figure 3Illustration of amino acid variations in conserved Rep motifs using WebLogo3 [50]. Reps of ICTV-listed gemykrog- and gemykibiviruses as well as from the nine genomoviral isolates identified herein have been included in the alignment.
Figure 4(A) Maximum Likelihood phylogenetic tree of novel full-length Sphinx-like DNA genomes, Sphinx 1.76, Sphinx 2.36, selected members of BMMF group 1 and 2 and recently published Sphinx-like DNA genomes isolated from water buffalo milk. The evolutionary history was inferred by using the Maximum Likelihood method and Tamura-Nei model [37]. The tree with the highest log likelihood (“-“50216.13) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches [38]. Branch support values lower than 60% were not included. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. This analysis involved 51 nucleotide sequences. There were a total of 4500 positions in the final dataset. Black dots mark sequences found in this study. All taxa are indicated by name followed by the corresponding GenBank accession number. (B) Genomic organization of the six new BMMF-like genomes from this study. Red and orange arrows indicate rep-genes, blue arrows show further potential ORFs and purple bars represent tandem repeat regions.
BMMF-like DNA isolates and inherent tandem repeat regions.
| Isolate | Period Size | Copy Number | Sequence | Nt between TR and RepA | Nt between TR and RepB |
|---|---|---|---|---|---|
| SmI1 | 22 | 3.5 | CCTACGTTTACCCATCAATACC | 60 | - |
| GmI2 | 22 | 3.5 | ACACCGTTTACCCATCAATATG | 59 | - |
| SmI3 | 22 | 3.8 | ATACCCCTACGTTTACCGATCA | 60 | - |
| SmI4 | 22 | 3.5 | CACCGTTTACCCATCAATATGA | 27 | 151 |
| GmI5 | 22 | 3.8 | ATACTCCTAGGTTTACCTACCA | 59 | - |
| GmI6 | 22 | 3.8 | ATACCCCTACGTTTACCGATCA | 60 | - |