| Literature DB >> 34946202 |
Klaudia Chrzastek1, Simona Kraberger2, Kara Schmidlin2, Rafaela S Fontenele2, Arun Kulkarni3, Len Chappell3, Louise Dufour-Zavala3, Darrell R Kapczynski1, Arvind Varsani2,4.
Abstract
High-throughput sequencing approaches offer the possibility to better understand the complex microbial communities associated with animals. Viral metagenomics has facilitated the discovery and identification of many known and unknown viruses that inhabit mucosal surfaces of the body and has extended our knowledge related to virus diversity. We used metagenomics sequencing of chicken buccal swab samples and identified various small DNA viruses with circular genome organization. Out of 134 putative circular viral-like circular genome sequences, 70 are cressdnaviruses and 26 are microviruses, whilst the remaining 38 most probably represent sub-genomic molecules. The cressdnaviruses found in this study belong to the Circoviridae, Genomoviridae and Smacoviridae families as well as previously described CRESS1 and naryavirus groups. Among these, genomoviruses and smacoviruses were the most prevalent across the samples. Interestingly, we also identified 26 bacteriophages that belong to the Microviridae family, whose members are known to infect enterobacteria.Entities:
Keywords: Cressdnaviricota; Microviridae; chicken; respiratory tract; single-stranded DNA viruses
Year: 2021 PMID: 34946202 PMCID: PMC8703526 DOI: 10.3390/microorganisms9122602
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Summary of the families and genera of the classified eukaryote-infecting viruses identified in or associated with chicken.
| Genome Type | Family | Genera |
|---|---|---|
| dsDNA |
|
|
|
|
| |
|
|
| |
| ssDNA |
|
|
|
|
| |
|
|
| |
|
|
| |
|
|
| |
| dsRNA |
|
|
|
|
| |
|
|
| |
| ssRNA (+) |
|
|
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
| ssRNA (-) |
|
|
|
|
| |
|
|
| |
|
|
| |
|
|
| |
| RNA-RT |
|
|
Figure 1A maximum-likelihood phylogenetic tree of Rep sequences of members of the cressdnavirus families (Bacillidnaviridae, Circoviridae, Geminiviridae, Genomoviridae, Nanoviridae, Redondoviridae and Smacoviridae), Alphasatellitidae and Metaxyviridae, CRESS groups 1–6 (unclassified) and putative family-level virus groups naryavirus, nenyavirus and vilyavirus. To the right of the phylogenetic tree, a sequence similarity network of the Rep sequences of the cressdnaviruses determined using EFI-EST [30] is shown. Nodes with red fill represent Rep sequences of viruses from the six chicken samples. Thirteen Reps sequences do not cluster with established virus families in the phylum Cressdnaviricota but form four putative family level clusters (clusterI, clusterII, CRESS1, naryavirus) and one is a singleton.
Summary of the number of de novo assembled viral-like contigs (>750 nts in length) from the six samples based on BLASTx analysis. The likely taxonomy assignment of these contigs is based on the top BLASTx hit.
| Contigs in Each Sample | ||||||||
|---|---|---|---|---|---|---|---|---|
| Family | Length of Contigs (nt) | C2 | C4 | C5 | C6 | C7 | C8 | Total Contigs |
|
| 762–8399 | - | - | 1 | 1 | - | - | 2 |
|
| 815 | - | - | - | - | 1 | - | 1 |
|
| 1170–1440 | 1 | - | 2 | - | - | - | 3 |
|
| 771–2329 | - | 2 | 3 | 7 | 2 | 3 | 17 |
|
| 772–2046 | - | 1 | - | - | 1 | 2 | 4 |
|
| 760–2347 | 5 | 11 | 4 | 4 | 11 | 8 | 43 |
|
| 751–6913 | 1 | - | - | 1 | 3 | 9 | 14 |
|
| 867–5794 | 2 | - | - | - | - | 2 | 4 |
|
| 781–6787 | 1 | 25 | 16 | 24 | 24 | 36 | 126 |
|
| 754–8471 | 23 | 3 | 17 | 11 | 9 | 56 | 119 |
|
| 785–2429 | - | 2 | - | - | - | - | 2 |
|
| 769–14,239 | 11 | 1 | 3 | 9 | 9 | 10 | 43 |
|
| 751–11,004 | 32 | 6 | 23 | 24 | 10 | 58 | 153 |
|
| 896–2385 | 3 | 3 | 7 | 4 | 4 | 4 | 25 |
| unclassified cressdnaviruses | 768–5193 | - | 25 | 20 | 24 | 15 | 25 | 109 |
Summary of the cressdnaviruses identified in this study and the conserved HUH and SF3 motifs in the Rep.
| Family | Genus/Group | Species | Accession | Name | Motif I | Motif II | GRS | Motif III | Walker A | Walker B | Motif C |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| MN379599 | Chicken cyclovirus mg5_2967 | SWTLNN | PHLQG | - | QNHDYCSK | GASGTGKSRRAA | IIDDF | ITSN |
| MN379600 | Chicken cyclovirus mg7_102 | SWTLNN | PHLQG | - | QNHDYCAK | GASGTGKSRRAA | IIDDF | ITSN | |||
|
| MN379598 | Chicken cyclovirus mg4_1122 | CFTYNN | KHLQG | - | QNYDYCTK | GETGTGKSRKCA | IIDDF | ITSN | ||
|
|
|
| MN379613 | Chicken genomovirus mg7_74 | LFTYSQ | THLHV | RKFDVEDFHPNIVPSL | GGWDYATK | GRSRTGKTILAR | VFDDI | WIMN |
| MN379617 | Chicken genomovirus mg2_274u | LFTYSQ | THLHV | RKFDVEGFHPNIVPSL | GGWDYATK | GRSRTGKTWLAR | VFDDI | WIMN | |||
|
| MN379606 | Chicken genomovirus mg4_1196 | LLTYAQ | THLHS | DAFDVGGYHPNISPSY | KGFDYTIK | GPSRLGKTLWAR | VFDDI | WLSN | ||
|
| MN379612 | Chicken genomovirus mg7_73 | LLTYAQ | THLHA | SVFDVAGFHPNISITK | IHYDYAIK | GKSRTGKTNYAR | VFDDI | WISN | ||
|
| MN379615 | Chicken genomovirus mg8_401 | LLTYSQ | THLHA | DVYDVDGFHPNISPSL | RGYDYAIK | GPTRTGKTMWSR | VFDDV | WLAN | ||
|
| MN379608 | Chicken genomovirus mg4_1218 | LLTYPQ | NHLHA | DVFDVDGRHPNIQSRL | AGYDYVIK | GDTLTGKTQWAR | IFDDL | YISN | ||
|
| MN379603 | Chicken genomovirus mg4_1107 | LLTYPQ | IHLHA | RAFDVEGCHPNVSPSR | DGYDYAIK | GPSRMGKTIWAR | IFDDF | WLSN | ||
|
| MN379602 | Chicken genomovirus mg2_77 | LFTYSQ | THLHA | RKFDVEGFHPNIISTI | GSWDYATK | GPSRTGKTMWAR | VFDDI | WLSN | ||
|
| MN379607 | Chicken genomovirus mg4_1210 | LFTYSQ | SHLHV | RKFDVEGFHPNIVPSL | GGWDYATK | GPSRTGKTMWAR | VFDDI | WLMN | ||
|
|
| MN379611 | Chicken genomovirus mg7_70 | FLTYSQ | HHFHA | SRLDFGCHHPNIQSVR | RTWDYVGK | GPTRTGKTANIL | VFDDI | MLMN | |
|
| MN379610 | Chicken genomovirus mg4_1259 | IITFPQ | VHYHV | TAFDYFGAHGNIKSVR | KVFDYVGK | GPTRTGKTLYAR | VFDDI | MCMN | ||
|
| MN379609 | Chicken genomovirus mg4_1247 | LLTYSQ | THFHV | RLFDFGSSHPNIQIIR | KAFDYAGK | GPTRSGKSVWPR | VFDDL | MCMN | ||
|
| MN379601 | Chicken genomovirus mg2_75 | FLTYSQ | SHLHC | SLFDYRGAHPNIKSIR | KPWNYAGK | GPSRTGKTVWAR | IFDDI | MCMN | ||
| MN379616 | Chicken genomovirus mg8_416 | FLTYSQ | SHLHC | SLFDYRGAHPNIKSIR | KPWNYAGK | GPSRTGKTVWAR | IFDDI | MCMN | |||
|
| MN379605 | Chicken genomovirus mg4_1173 | FLTYSQ | CHFHV | SRLDFGGHHPNIQSVR | RVWDYAGK | GPTRTGKTVWAR | VFDDI | MLMN | ||
|
| MN379614 | Chicken genomovirus mg7_78 | LLTYSK | VHVHC | FRFDFGGSHPNIRSVS | RTYDYAGK | GPTRTGKTVWAR | VFDDL | CLMN | ||
|
| MN379604 | Chicken genomovirus mg4_1165 | LLTYSQ | LHFHC | SRLDYNGSHPNIKPIR | RAWEYTGK | GESRTGKTIWAR | IFDDI | LLCN | ||
|
|
|
| MN379619 | Chicken smacovirus mg4_881 | MITMPR | QHWQC | - | DTWEYETK | PEGNHGKTWLVG | FIDIP | VMTN |
| MN379625 | Chicken smacovirus mg6_1052 | MITMPR | QHWQC | - | DTWDYETK | PEGNHGKTWLVG | FIDIP | VMTN | |||
| MN379626 | Chicken smacovirus mg7_57 | MITMPR | QHWQC | - | DTWDYETK | PEGNHGKTWLVG | FIDIP | VMTN | |||
|
| MN379623 | Chicken smacovirus mg5_1212 | MLTIPR | EHWQV | - | DKWEYETK | PKGNNGKSWLVG | VIDLP | VLTN | ||
|
| MN379620 | Chicken smacovirus mg4_885 | MLTIPR | KHWQV | - | EDSDYETK | PKGKAGKSWLIG | IIDMP | VLTN | ||
|
| MN379627 | Chicken smacovirus mg7_67 | MLTIPR | DHWQI | - | DKWEYERK | PIGNRGKSWLAG | VIDIP | ILTN | ||
|
| MN379621 | Chicken smacovirus mg4_964 | MLTIPR | EHWQV | - | DKWEYERK | PIGNRGKSWLAG | VIDIP | IMTN | ||
| MN379624 | Chicken smacovirus mg5_1444 | MMTIPR | DHWQV | - | DKWEYEKK | PIGNRGKSWLAG | VIDIP | IMTN | |||
| MN379628 | Chicken smacovirus mg8_345 | MLTIPR | EHWQI | - | DRWEYERK | PIGNRGKSWLAG | VIDIP | IMTN | |||
|
| MN379622 | Chicken smacovirus mg5_1081 | MLTIPQ | KHWQI | - | DVWDYERK | KSGNHGKTWLSQ | IIDIP | IFTN | ||
|
| MN379618 | Chicken smacovirus mg2_55 | IMTIPQ | EHWQI | - | DECLYERK | AGGNVGKSWFCG | IIDVP | CLTN | ||
| unclassified | ClusterI | MN379594 | Chicken virus mg5_1345 | DATIWI | RHYQF | - | RDFEYVYK | EVGNAGKTAWGM | IIDTP | VLCN | |
| ClusterI | MN379595 | Chicken virus mg6_1197 | DATIWC | RHFQF | - | RDFEYVYK | ERGNSGKTAWAM | IIDTP | ILCN | ||
| ClusterI | MN379596 | Chicken virus mg7_59 | DATIWC | RHFQF | - | RDFEYVYK | ERGNSGKTAWAM | IIDTP | VLCN | ||
| ClusterI | MN379597 | Chicken virus mg8_324 | DATIWC | RHFQF | - | RDFEYVYK | ERGNSGKTAWAM | IIDTP | VLCN | ||
| ClusterII | MN379584 | Chicken virus mg4_280 | QLTLNQ | EHIHI | - | QNIDYIRK | GAPGVGKTYSAY | VVEEF | FASV | ||
| ClusterII | MN379591 | Chicken virus mg6_2056 | QITLNE | KHIHI | - | QNIAYIEK | GDSGSGKTYKAY | VIEEF | IASI | ||
| ClusterII | MN379586 | Chicken virus mg4_1578 | QITLND | KHIHI | - | QNIDYIEK | GDSGTGKTYKAY | IVEEF | IASI | ||
| ClusterII | MN379589 | Chicken virus mg5_2676 | QLTLNQ | EHIHI | - | QNIEYIRK | GAPGVGKTYSAY | VVEEF | FASV | ||
| CRESS1 | MN379585 | Chicken virus mg4_657 | CFTSFK | KHLQG | - | KAIEYCKK | GQAGSGKSHHCY | WFDEF | ISTT | ||
| CRESS1 | MN379592 | Chicken virus mg8_273 | CFTSFK | KHLQG | - | KAIEYCKK | GQAGSGKSHHCY | WFDEF | ISTT | ||
| Naryavirus | MN379588 | Chicken virus mg5_2197 | QITLNE | EHIHC | - | QNVAYVKK | GPSGVGKTERAK | IYDDF | ITSV | ||
| Naryavirus | MN379590 | Chicken virus mg5_2876 | QLTLNE | EHIHI | - | QNIDYILK | GPSGVGKTNKAL | LYDDF | ITSV | ||
| Singleton | MN379587 | Chicken virus mg4_2302 | IFTINN | QHLQG | - | QAYNYATK | GDSGTGKSYSAR | VLDEF | ITTN |
Figure 2Genome organization of the cressdnaviruses (families Circoviridae, Genomoviridae and Smacoviridae as well as those that are unclassified) and microviruses. Distribution of the virus sequences across the chicken swab samples is shown with black filled boxes. A bar graph summarizes the number of viruses identified in each sample at a family level and those that are unclassified within the phylum Cressdnaviricota.
Figure 3Maximum-likelihood phylogenetic tree of the Rep amino acid sequences of the three unique viruses that belong to the genus Cyclovirus (family Circoviridae). The maximum-likelihood phylogenetic tree is rooted with representative sequences of Rep sequences in the genus Circovirus. Branches with <0.8 aLRT support are collapsed. The clade with the Reps of the cycloviruses from this study is expanded and shown on the right with accession numbers in red bold font.
Figure 4Maximum-likelihood phylogenetic tree of the Rep amino acid sequences of the seventeen unique genomoviruses that belong to the Gemykibivirus (n = 9) and Gemykrogvirus (n = 8) genera. The maximum-likelihood phylogenetic tree is rooted with representative Rep sequences of viruses in the family Geminiviridae. Branches with <0.8 aLRT support are collapsed. The clades with the Reps of gemykibiviruses and gemykrogviruses from this study are expanded and shown on the right with accession numbers in red bold font.
Figure 5Maximum-likelihood phylogenetic tree of the Rep amino acid sequences of the eleven unique viruses that belong to the genus Porprismacovirus (family Smacoviridae). The maximum-likelihood phylogenetic tree is rooted with representative Rep amino acid sequences of viruses in the family Nanoviridae. Branches with <0.8 aLRT support are collapsed. The clades with the Reps of porprismacoviruses from this study are expanded and shown on the right with accession numbers in red bold font.
Figure 6Maximum-likelihood phylogenetic trees (midpoint rooted) for all sequences belonging to the SSN clusters that cannot be assigned to currently established viral families (i.e., ClusterI, ClusterII, CRESS1, naryavirus). Branches with <0.8 aLRT support are collapsed. Accession numbers of Reps encoded by viruses identified in this study are in red bold font.
Figure 7An approximately maximum-likelihood cladogram of the MCP amino acid sequences of microvirus genomes available in GenBank and those identified in this study. The cladogram branches are colored based on sub-families (Bullavirinae and Gokushovirinae) and Alphavirinae, Parabacteroides and Pichovirinae clades. Branch support with >0.8 aLRT are shown. MCPs of sequences identified in this study are marked with red branches and red colored accession numbers.