| Literature DB >> 34199948 |
Christina M Lazov1, Graham J Belsham2, Anette Bøtner2, Thomas Bruun Rasmussen3.
Abstract
Bat species worldwide are receiving increased attention for the discovery of emerging viruses, cross-species transmission, and zoonoses, as well as for characterizing virus infections specific to bats. In a previous study, we investigated the presence of coronaviruses in faecal samples from bats at different locations in Denmark, and made phylogenies based on short, partial ORF1b sequences. In this study, selected samples containing bat coronaviruses from three different bat species were analysed, using a non-targeted approach of next-generation sequencing. From the resulting metagenomics data, we assembled full-genome sequences of seven distinct alphacoronaviruses, three astroviruses, and a polyomavirus, as well as partial genome sequences of rotavirus H and caliciviruses, from the different bat species. Comparisons to published sequences indicate that the bat alphacoronaviruses belong to three different subgenera-i.e., Pedacovirus, Nyctacovirus, and Myotacovirus-that the astroviruses may be new species in the genus Mamastrovirus, and that the polyomavirus could also be a new species, but unassigned to a genus. Furthermore, several viruses of invertebrates-including two Rhopalosiphum padi (aphid) viruses and a Kadipiro virus-present in the faecal material were assembled. Interestingly, this is the first detection in Europe of a Kadipiro virus.Entities:
Keywords: RNA viruses; insect viruses; virus excretion; virus taxonomy
Year: 2021 PMID: 34199948 PMCID: PMC8229204 DOI: 10.3390/v13061073
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Overview of bat faecal sample material used in the study. Previously published data mentioned with GenBank accession numbers consist of partial bat coronavirus ORF1b sequences of 208 or 130 nucleotides in length.
| Sample ID | Species | Collection Time | Location | Previous Data | Virus Sequences Assembled in This Study |
|---|---|---|---|---|---|
| 13585-35 |
| October 2014 | Mønsted | LR025670, LR025709 | BtCoV |
| 13585-58 |
| October 2014 | Mønsted | LR025680, LR025712 | BtCoV, BtAstV, BtPV, RAAV |
| 21164-6 |
| November 2015 | Mønsted | LR025732 | BtCoV, BtAstV, BtCV, BtPyV, KDV, BaV |
| OV-157 |
| October 2018 | Mønsted | - | BtCoV, BtCV, RhPV |
| 18802-1 |
| November 2016 | Mønsted | LR025723 | BtCoV, RVH |
| B40-5 |
| August 2013 | Sollerup | LR025698, LR025733 | BtCoV |
| 7542-55 |
| June 2014 | Vadum | LR025676, LR025706 | BtCoV |
Abbreviations of bat species names are: Myotis daubentonii (M. dau); Myotis dasycneme (M. das); and Pipistrellus pygmaeus (P. pyg). Abbreviations of virus names are: bat coronavirus (BtCoV); bat astrovirus (BtAstV); bat calicivirus/sapovirus (BtCV), rotavirus H (RVH); bat polyomavirus (BtPyV); bat picornavirus (BtPV); Basavirus (BaV); Kadipiro virus (KDV); Rhopalosiphum padi virus (RhPV); and rosy apple aphid virus (RAAV).
Overview of the output from Kaiju metagenomics analysis, with a focus on reads assigned to selected viruses. Read numbers highlighted in bold indicate that viral genome sequences were assembled.
| NGS Dataset | D28-7 | D28-9 | D32-7 | D32-9 | D29-6 + D32-10 | D32-6 | D32-4 |
|---|---|---|---|---|---|---|---|
| Bat species |
|
|
|
|
|
|
|
| Sample no. |
|
|
|
|
|
|
|
| Biosample ac. no. | SAMN | SAMN | SAMN | SAMN | SAMN | SAMN | SAMN |
| Assigned reads | 296,356 | 1,469,272 | 2,234,894 | 2,155,628 | 1,495,375 | 55,361 | 182,572 |
|
| 2221 | 503 | 132 | 352 | 2078 | 207 | 360 |
|
| 189,602 | 1,121,102 | 28,900 | 164,552 | 1,447,060 | 39,417 | 148,103 |
|
| 22,941 | 144,898 | 8868 | 18,813 | 18,251 | 2438 | 9956 |
|
| 72,440 | 189,014 | 2,195,308 | 1,961,910 | 8998 | 10,187 | 2871 |
| Unclassified vir. | 13,181 1 | 7865 1 | 497,671 1 | 6 | 8 | 3 | |
| Unclas. RNA vir. [ | 44,631 3 | 40,180 3 | 6584 4 | 654,384 3 | 264 | 401 | 5 |
| Unclas. bacterial vir. | 26 | 2997 | - | 13 | 22 | 8 | 20 |
| Retro-transcribing vir. | 386 | 44 | 31 | 66 | 276 | 113 | 115 |
| dsDNA viruses | 1012 | 208 | 97 | 1565 | 84 | 188 | |
| ssDNA viruses | - | 23 | - | - | 16 | 7 | - |
| dsRNA viruses | 458 6 | 29 | 770 8 | - | - | ||
|
| 9980 | 136,492 | 70,512 | 734,899 | 6708 | 9501 | 2517 |
| Unclassified vir. | 22 | - | 306 | - | - | - | |
|
|
|
|
|
|
|
|
|
|
| 8 |
|
| 2 | - | - | - |
|
| 87 | 2811 | 54 | 15 | 7 | 191 | - |
|
| 8 | - | 357 | - | 6 | - | - |
|
| 3166 | 1094 | 435 13 | 681,091 | 41 | 1825 | 14 |
|
| - | - | 42 | - | 2 | - | |
|
| 443 | 137 | - | 5063 15 | 12 | 1795 14 | - |
|
| 2679 11 | 17 | 48 | 26 | 19 | 10 | |
|
| - | - |
|
| - | - | - |
1 Predominant species: Aphis glycines virus 2; 2 predominant species: Basavirus sp.; 3 predominant species: Hubei permutotetra-like virus 4; 4 predominant species: Hubei partiti-like virus 22; 5 predominant species: Pomona leaf-nosed-bat-associated polyomavirus; 6 predominant species: unclassified Birnaviridae; 7 predominant species: Kadipiro virus; 8 predominant species: Totiviridae; 9 predominant species: rotavirus; 10 predominant species: rosy apple aphid virus; 11 predominant species: Rhopalosiphum padi virus; 12 predominant species: Washington bat picornavirus; 13 predominant species: Pow Burn virus; 14 predominant species: iflavirus; 15 predominant species: Brevicoryne brassicae picorna-like virus. Abbreviations used: viruses (vir.); unclassified (unclas.); double-stranded (ds); single-stranded (ss).
List of all full-length (highlighted in bold) and partial genome sequences assembled in this study and deposited in GenBank.
| Virus Description | Virus Strain Name | Mapped Reads | Average | Length nt. | Accession |
|---|---|---|---|---|---|
| Alphacoronavirus | BtCoV/13585-35/M.dau/DK/2014 | 8800 | 50 |
| MN535731 |
| BtCoV/13585-58/M.dau/DK/2014 | 140,000 | 1000 |
| MN535732 | |
| BtCoV/21164-6/M.dau/DK/2015 | 70,000 | 370 |
| MN543743 | |
| BtCoV/21164-6-alt/M.dau/DK/2015 | 20,000 | 120 |
| MZ218052 | |
| BtCoV/OV-157/M.dau/DK/2018 | 13,000 | 102 |
| MN535733 | |
| BtCoV/18802-1/M.das/DK/2016 | 9800 | 42 |
| MN535734 | |
| BtCoV/B40-5/P.pyg/DK/2013 | 9200 | 59 |
| MN482242 | |
| BtCoV/7542-55/P.pyg/DK/2014 | 1800 | 8 | 27,786 1 | MZ218060 | |
| Mamastrovirus | BtAstV/13585-58/M.dau/DK/2014 | 17,000 | 465 |
| MN832787 |
| BtAstV/21164-6-A/M.dau/DK/2015 | 630 | 15 | 6619 | MZ218053 | |
| BtAstV/21164-6-B/M.dau/DK/2015 | 540 | 13 | 6558 2 | MZ218054 | |
| Calicivirus | BtCV/OV-157/M.dau/DK/2018 | 352 | 8 | 7486 3 | MZ218056 |
| BtCV/21164-6-A/M.dau/DK/2015 | 2 | 1.5 | 396 | MZ218057 | |
| BtCV/21164-6-B/M.dau/DK/2015 | 6 | 1.4 | 706 | MZ218058 | |
| Rotavirus H | RVH/18802-1/M.das/DK/2016 | 98 | 2.2 | 58–2169 4,5 | MZ218062- |
| MZ218070 | |||||
| Polyomavirus | BtPyV/21164-6/M.dau/DK/2015 | 710 | 17 |
| MZ218055 |
| Picornavirus | BtPV/13585-58/M.dau/DK/2014 | 870 | 16 | 9300 | MZ218061 |
| Basavirus | BaV/21164-6/M.dau/DK/2015 | 8560 | 150 |
| MW929926 |
| Kadipiro virus | KDV/21164-6/M.dau/DK/2015 | 1,960,000 | 3700–16,500 | MN543741, | |
| MN543742, | |||||
| MN543744- | |||||
| MN543753 | |||||
| RhPV/OV-157/M.dau/DK/2018 | 440,000 | 6490 |
| MN535735 | |
| RhPV/OV-157-alt/M.dau/DK/2018 | 340,000 | 4600 |
| MZ218059 | |
| Rosy apple aphid virus | RAAV/13585-58/M.dau/DK/2014 | 5860 | 102 |
| MW929927 |
Some sequences contain NNNs to indicate unknown nucleotides. Numbers of NNNs are: 1 2037; 2 4; 3 70; 4 2558. 5 A total of 9 out of 11 partial genome segments, with a total length of 5542 nt (excluding NNNs), were assembled into gapped sequences for rotavirus H using both reference assemblies and de novo assembly. 6 All 12 genome segments of Kadipiro virus were assembled with a total length of 20,781 nt.
Figure 1Comparison of full-length genomes of bat coronaviruses (BtCoVs) to one another, and to the closest matching full-length sequences from GenBank, after aligning the sequences with MUSCLE. The grouping of sequences corresponds to the clustering seen in the phylogenetic tree (Figure 2). The numbers in the upper-right part of the table are the percentage nucleotide identities between the sequences, while the lower-left part shows the actual number of nucleotide differences between the sequences. The heat map shading indicates degree of identity, with increased similarity shown by darker colours. The incomplete genome sequence BtCoV/7542-55/P.pyg/DK/2014 was not included in this analysis. The new sequences reported here are marked in bold text.
Figure 2Phylogenetic tree of bat coronavirus sequences from Denmark. The sequences reported here (highlighted in orange) were compared to the closest matching sequences from GenBank and at least one ICTV exemplar isolate of each species of the Orthocoronavirinae subfamily. Bat coronavirus sequences are indicated with a bat symbol. The tree was generated using the PhyML maximum likelihood method with 100 bootstrap iterations on MUSCLE-aligned concatenated amino acid sequences predicted from the partial ORF1ab nt sequences (conserved regions of nsp5, nsp12, and nsp13), with a total length of 1224 amino acids. Only bootstrap values above 50 are displayed on the nodes. The evolutionary model for the tree was LG + G. The ICTV reference sequences have all been assigned to subgenera within the genera Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus. Clustering of the Danish bat coronavirus sequences indicates that they belong to the same subgenera as the reference sequences in the genus Alphacoronavirus, within the subgenera Pedacovirus, Nyctacovirus, and Myotacovirus (highlighted in green).
The closest matching sequences to the bat astroviruses found in this study are not complete genomes, but partial ORF1ab gene sequences generated by sequencing of RT-PCR products (around 381 nucleotides in length without primers) from the most commonly used assay for detection of astroviruses in bats [46]. This table summarizes the top three identity BLASTn search hits of these short sequences as accession numbers, with % identity/length in parentheses. The 1st BLASTn hits were described previously [45]. Abbreviations used for German regions: North Rhine-Westphalia (NRW); and Mecklenburg-Western Pomerania (MV).
| BtAstV Genomes | 1st BLASTn Hit | 1st BLASTn Hit Source | 2nd BLASTn Hit | 3rd BLASTn Hit |
|---|---|---|---|---|
| BtAstV/13585-58/M.dau/DK/2014 | KT894882 | EU847161 | EU847160 | |
| BtAstV/21164-6-A/M.dau/DK/2015 | KT894889 | KT894883 | KT894893 | |
| BtAstV/21164-6-B/M.dau/DK/2015 | KT894894 | KT894896 | KT894895 |
Figure 3Phylogenetic tree of bat astrovirus sequences from Denmark. The Danish sequences (indicated in bold text) were analysed with the closest matching sequences with full ORF2 coding sequences available in GenBank, and at least one ICTV exemplar isolate of each species of the genus Mamastrovirus in the Astroviridae family, with an exemplar isolate of the genus Avastrovirus chosen as an outgroup. Bat astrovirus sequences are indicated with a bat symbol. The tree was generated using the neighbour-joining method, with 1000 bootstrap iterations on MUSCLE-aligned predicted amino acid sequences from the complete ORF2 gene coding sequence, with a total length of 1144 amino acids. Bootstrap values above 50 are displayed.
Rhopalosiphum padi virus comparisons to one another, to the reference RhPV sequence NC_001874, and to the closely related sequence MF535298 from Hungary [51].
| RhPV/OV-157/M.dau/DK/2018 | RhPV/OV-157-alt/M.dau/DK/2018 | |||
|---|---|---|---|---|
| Nucleotide Comparison | (10,009 nt) | (10,010 nt) | ||
| Nt. Differences | Percent Identity | Nt. Differences | Percent Identity | |
| 351 | 96.5 | 357 | 96.4 | |
| - 5′-end (579 nt) | 47 | 91.9 | 48 | 91.7 |
| - ORF1 (5997 nt) | 230 | 96.2 | 231 | 96.2 |
| - IGR (533 nt) | 8 | 98.5 | 11 | 97.9 |
| - ORF2 (2451 nt) | 59 | 97.6 | 59 | 97.6 |
| - 3′-end (451 nt) | 7 | 98.4 | 8 | 98.2 |
| 50 | 99.5 | 76 | 99.2 | |
| - 5′-end (219 nt) | 0 | 100 | 0 | 100 |
| - ORF1 (5997 nt) | 19 | 99.7 | 48 | 99.2 |
| - IGR (533 nt) | 2 | 99.6 | 3 | 99.4 |
| - ORF2 (2451 nt) | 23 | 99.1 | 21 | 99.1 |
| - 3′-end (451 nt) | 6 | 98.7 | 4 | 99.1 |
| 106 | 98.9 | |||
| - 5′-end (579 and 580 nt) | 2 | 99.7 | ||
| - ORF1 (5997 nt) | 59 | 99.0 | ||
| - IGR (533 nt) | 5 | 99.1 | ||
| - ORF2 (2361 nt) | 31 | 98.7 | ||
| - 3′-end (539 nt) | 9 | 98.3 | ||
| Amino acid comparison: ORF1 | (1988 a.a.) | (1988 a.a.) | ||
| A.a. differences | Percent identity | A.a. differences | Percent identity | |
| 23 | 98.9 | 28 | 98.6 | |
| 3 | 99.9 | 4 | 99.8 | |
| 7 | 99.7 | |||
| Amino acid comparison: ORF2 | (788 a.a.) | (788 a.a.) | ||
| A.a. differences | Percent identity | A.a. differences | Percent identity | |
| 11 | 98.6 | 13 | 98.4 | |
| 6 | 99.2 | 6 | 99.2 | |
| 4 | 99.5 | |||
Assembled Kadipiro virus genome segments listed with the first three BLASTn search hits as accession numbers, with percent identities in parentheses. KX884650–KX884661: Kadipiro virus strain QTM27331 from China 2013; MG590140–MG590148, MG590150–MG590151: Kadipiro virus strain SDKL1625 from China 2016; AF133429, AF133509–AF133513, AF052019–AF052023: Kadipiro virus strain JKT-7075, an isolate originally from Java, Indonesia [30]; KX247778: Kadipiro virus isolate Kenya2 [59,60]; FJ159105: Kadipiro virus strain YN0557 from China 2005.
| Genome Segment | Segment Length | Accession Number | 1st BLASTn Hit | 2nd BLASTn Hit | 3rd BLASTn Hit |
|---|---|---|---|---|---|
| Segment 1 | 3799 | MN543741 | KX884650 (86) | MG590148 (86) | AF133429 (84) |
| Segment 2 | 3016 | MN543742 | MG590140 (85) | KX884651 (84) | AF134509 (82) |
| Segment 3 | 2353 | MN543744 | MG590141 (84) | KX884652 (84) | AF134510 (81) |
| Segment 4 | 2121 | MN543745 | KX884653 (86) | AF134511 (84) | KX247778 (74) |
| Segment 5 | 1895 | MN543746 | MG590142 (83) | KX884654 (83) | AF134512 (81) |
| Segment 6 | 1659 | MN543747 | KX884655 (87) | AF134513 (86) | MG590143 (86) |
| Segment 7 | 1232 | MN543748 | MG590144 (92) | KX884656 (92) | AF052023 (90) |
| Segment 8 | 1083 | MN543749 | KX884657 (88) | AF052022 (85) | MG590145 (86) |
| Segment 9 | 1041 | MN543750 | KX884658 (91) | MG590150 (90) | AF052021 (88) |
| Segment 10 | 914 | MN543751 | KX884659 (87) | MG590151 (86) | AF052020 (84) |
| Segment 11 | 875 | MN543752 | KX884660 (85) | AF052019 (83) | MG590146 (83) |
| Segment 12 | 754 | MN543753 | MG590147 (93) | FJ159105 (90) | KX884661 (89) |