| Literature DB >> 35208721 |
Elisa M Bolatti1,2, Gastón Viarengo3, Tomaz M Zorec4, Agustina Cerri1, María E Montani5,6,7, Lea Hosnjak4, Pablo E Casal2, Eugenia Bortolotto8, Violeta Di Domenica6, Diego Chouhy1,2,3, María Belén Allasia8, Rubén M Barquez6,7, Mario Poljak4, Adriana A Giri1,2.
Abstract
Bats are natural reservoirs of a variety of zoonotic viruses, many of which cause severe human diseases. Characterizing viruses of bats inhabiting different geographical regions is important for understanding their viral diversity and for detecting viral spillovers between animal species. Herein, the diversity of DNA viruses of five arthropodophagous bat species from Argentina was investigated using metagenomics. Fecal samples of 29 individuals from five species (Tadarida brasiliensis, Molossus molossus, Eumops bonariensis, Eumops patagonicus, and Eptesicus diminutus) living at two different geographical locations, were investigated. Enriched viral DNA was sequenced using Illumina MiSeq, and the reads were trimmed and filtered using several bioinformatic approaches. The resulting nucleotide sequences were subjected to viral taxonomic classification. In total, 4,520,370 read pairs were sequestered by sequencing, and 21.1% of them mapped to viral taxa. Circoviridae and Genomoviridae were the most prevalent among vertebrate viral families in all bat species included in this study. Samples from the T. brasiliensis colony exhibited lower viral diversity than samples from other species of New World bats. We characterized 35 complete genome sequences of novel viruses. These findings provide new insights into the global diversity of bat viruses in poorly studied species, contributing to prevention of emerging zoonotic diseases and to conservation policies for endangered species.Entities:
Keywords: Anelloviridae; Chiroptera; Cossaviricota; Cressdnaviricota; metagenomics; virome; virus identification
Year: 2022 PMID: 35208721 PMCID: PMC8880087 DOI: 10.3390/microorganisms10020266
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Bat samples included in each of the six sequenced sample pools by bat species, age, and collection site.
| Source | Location | Collection Date | Pool ID | Sample ID | Bat Species | Age |
|---|---|---|---|---|---|---|
| Bat colony | Rosario | 25 January 2017 | 1 | M51 |
| Young/adult |
| 25 January 2017 | M52 | |||||
| 25 January 2017 | M53 | |||||
| 25 January 2017 | M54 | |||||
| 25 January 2017 | M55 | |||||
| 25 January 2017 | 2 | M56 |
| Young/adult | ||
| 25 January 2017 | M57 | |||||
| 25 January 2017 | M58 | |||||
| 25 January 2017 | M59 | |||||
| 25 January 2017 | M60 | |||||
| Individual bats | Villarino Park in Zavalla | 3 February 2017 | 4 | M66 |
| Adult |
| 3 February 2017 | M68 | |||||
| 3 February 2017 | M73 | |||||
| 3 February 2017 | M74 |
| ||||
| 3 February 2017 | M75 | |||||
| 17 March 2017 | 6 | M76 |
| Adult | ||
| 17 March 2017 | M77 | |||||
| 17 March 2017 | M78 | |||||
| 17 March 2017 | M79 | |||||
| 12 April 2017 | 9 | M90 |
| Adult | ||
| 13 April 2017 | M93 | |||||
| 13 April 2017 | M95 | |||||
| 13 April 2017 | M96 | |||||
| 13 April 2017 | M103 | |||||
| 12 April 2017 | 11 | M87 |
| Adult | ||
| 12 April 2017 | M92 | |||||
| 13 April 2017 | M97 | |||||
| 13 April 2017 | M99 | |||||
| 13 April 2017 | M100 |
Numbers of read pairs and contigs obtained by metagenomic workflow analysis of fecal samples of five bat species from Argentina.
| Pool ID | Raw Read Pairs, | Read Pairs after Quality Filtering and Trimming, | Read Pairs after Host Subtraction, | Read Pairs after Subtraction of Bacterial Reads, | Viral Read Pairs, | Viral Read Pairs that Remapped to Viral Contigs, | Viral Read Pairs that Remapped to Viral Contigs, %(*) | Contigs | Viral Contigs | Viral Contigs, |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 991,903 | 716,256 | 711,479 | 591,522 | 92,981 | 92,859 | 15.7 | 797 | 19 | 2.38 |
| 2 | 880,008 | 784,591 | 784,248 | 777,013 | 487,928 | 487,872 | 62.8 | 303 | 16 | 5.28 |
| 4 | 736,734 | 641,183 | 641,088 | 587,045 | 37,660 | 37,399 | 6.37 | 8001 | 279 | 3.49 |
| 6 | 38,989 | 336,497 | 336,174 | 279,277 | 46,235 | 46,180 | 16.5 | 1910 | 115 | 6.02 |
| 9 | 753,848 | 648,621 | 628,771 | 589,624 | 42,214 | 41,876 | 7.10 | 5087 | 153 | 3.01 |
| 11 | 767,987 | 622,945 | 621,915 | 606,187 | 18,130 | 18,003 | 2.97 | 1565 | 109 | 6.97 |
| Total | 4,520,370 | 3,750,093 | 3,723,675 | 3,430,668 | 725,148 | 724,189 | 21.1 | 17,663 | 691 | 3.91 |
P1 and P2: T. brasiliensis; P4: E. diminutus and E. patagonicus; P6 and P9: M. molossus; P11: E. bonariensis. (*) Percentage of “Viral read pairs that remapped to viral contigs” with respect to “Read pairs after subtraction of bacterial reads”. (**) Percentage of “Viral contigs (>500 bp)” with respect to “Contigs (>500 bp)”.
Figure 1Abundance of viral families in pooled fecal samples of five bat species from Argentina identified by metagenomics. (A) Heatmap of relative abundances of viral families according to the number of reads found in each sample pool. The viral families are grouped according to host kingdom. (B) Relative abundance of viral reads classified at the kingdom level in each sample pool. Percentages higher than 1% are shown. P1 and P2: T. brasiliensis; P4: E. diminutus and E. patagonicus; P6 and P9: M. molossus; P11: E. bonariensis.
Rényi’s entropy indexes for different values of α (0–2) of fecal samples of five bat species from Argentina.
| P1 | P2 | P4 | P6 | P9 | P11 | |
|---|---|---|---|---|---|---|
| H0 | 1.085 | 1.004 | 1.847 | 1.928 | 1.810 | 1.468 |
| H0.25 | 1.028 | 1.001 | 1.278 | 1.376 | 1.293 | 1.145 |
| H0.5 | 1.946 | 1.792 | 3.664 | 2.833 | 3.136 | 2.890 |
| H0.75 | 0.914 | 0.444 | 2.674 | 1.938 | 2.222 | 2.002 |
| H1 | 0.361 | 0.068 | 1.684 | 1.295 | 1.424 | 1.190 |
| H2 | 0.156 | 0.012 | 0.984 | 0.897 | 0.890 | 0.656 |
P1 and P2: T. brasiliensis; P4: E. diminutus and E. patagonicus; P6 and P9: M. molossus; P11: E. bonariensis.
Bray–Curtis dissimilarity of fecal samples from five bat species from Argentina.
| P1 | P2 | P4 | P6 | P9 | |
|---|---|---|---|---|---|
| P2 | 0.684 | ||||
| P4 | 0.964 | 0.995 | |||
| P6 | 0.983 | 0.999 | 0.153 | ||
| P9 | 0.967 | 0.996 | 0.125 | 0.155 | |
| P11 | 0.977 | 0.999 | 0.369 | 0.461 | 0.419 |
P1 and P2: T. brasiliensis; P4: E. diminutus and E. patagonicus; P6 and P9: M. molossus; P11: E. bonariensis.
Novel bat-associated DNA viruses identified in fecal samples of five bat species from Argentina.
| Host | Viral Family | Genus | Novel Viral Species | Virus Name | MAVG | Sequencing Pool | Genbank Accession Number |
|---|---|---|---|---|---|---|---|
|
|
|
| Yes; KJ641727 ( | Eumops bonariensis associated circovirus 1 | MAVG08 | P11 | OL704833 |
|
| Yes; KJ641740 ( | Eumops bonariensis associated cyclovirus 1 | MAVG03 | P11 | OL704828 | ||
|
|
| Yes; MK483082 ( | Eumops bonariensis associated gemycircularvirus 4 | MAVG23 | P11 | OL704848 | |
|
| Yes; KT363839 ( | Eumops bonariensis associated gemykibivirus 1 * | MAVG24 | P11 | OL704849 | ||
|
|
| Possible; NC_001354.1 ( | Eumops bonariensis papillomavirus type 1 | MAVG33 | P11 | OL704824 | |
| Unclassified | Possible; KX812447 ( | Eumops bonariensis papillomavirus type 2 | MAVG34 | P11 | OL704825 | ||
|
|
| Yes; AB937980 ( | Bat associated cyclovirus 17 | MAVG01 | P4 | OL704826 | |
|
|
| Yes; KF371637 ( | Bat associated gemycircularvirus 1 ** | MAVG27 | P4 | OL704852 | |
|
| Yes; MK032742 ( | Bat associated gemykibivirus 2 | MAVG28 | P4 | OL704853 | ||
|
| Yes; MK483082 ( | Bat associated gemykronzavirus 2 | MAVG29 | P4 | OL704854 | ||
|
|
|
| Yes; JQ011377 ( | Molossus molossus associated circovirus 1 | MAVG04 | P9 | OL704829 |
| Yes; JQ011377 ( | Molossus molossus associated circovirus 2 | MAVG05 | P9 | OL704830 | |||
| Yes; JQ011377 ( | Molossus molossus associated circovirus 3 | MAVG06 | P9 | OL704831 | |||
| Yes; KJ641727 ( | Molossus molossus associated circovirus 4 | MAVG07 | P9 | OL704832 | |||
|
|
| No; MT138090 ( | Molossus molossus associated gemycircularvirus 3 | MAVG15 | P9 | OL704840 | |
| Possible; MH047857 ( | Molossus molossus associated gemycircularvirus 1 | MAVG13 | P9 | OL704838 | |||
| Yes; KY308268 ( | Molossus molossus associated gemycircularvirus 2 | MAVG14 | P9 | OL704839 | |||
| Yes; KT862242 ( | Molossus molossus associated gemycircularvirus 5 ** | MAVG25 | P6 | OL704850 | |||
| Yes; KF371637 ( | Molossus molossus associated gemycircularvirus 4 | MAVG19 | P9 | OL704844 | |||
|
| Yes; MH939362 ( | Molossus molossus associated gemygorvirus 1 | MAVG20 | P9 | OL704845 | ||
|
| Yes; MK032742 ( | Molossus molossus associated gemykibivirus 1 * | MAVG12 | P9 | OL704837 | ||
| Yes; MK032742 ( | Molossus molossus associated gemykibivirus 2 | MAVG16 | P9 | OL704841 | |||
| Yes; 63% similar to MK032742 ( | Molossus molossus associated gemykibivirus 3 | MAVG17 | P9 | OL704842 | |||
| Yes; KT363839 ( | Molossus molossus associated gemykibivirus 4 | MAVG18 | P9 | OL704843 | |||
| Yes; MK032742 ( | Molossus molossus associated gemykibivirus 5 | MAVG21 | P9 | OL704846 | |||
| Yes; KJ547642 ( | Molossus molossus associated gemykibivirus 6 | MAVG22 | P9 | OL704847 | |||
|
| Yes; MK483082 ( | Molossus molossus associated gemykronzavirus 1 | MAVG26 | P6 | OL704851 | ||
|
| Unclassified | Yes; MN582084 (CRESS virus sp. ctf7a5, complete genome); 58% | Molossus molossus associated CRESSDNA virus | MAVG30 | P9 | OL704855 | |
|
|
| Yes; KP860907 ( | Molossus molossus associated porprismacovirus 1 | MAVG11 | P9 | OL704836 | |
|
| Unclassified | No; KX812447 ( | Molossus molossus papillomavirus type 2 | MAVG32 | P9 | OL704823 | |
|
|
|
| Yes; GU226971 ( | Tadarida brasiliensis associated dependoparvovirus | MAVG31 | P1 | OL704856 |
|
|
| Yes; HQ738634 ( | Tadarida brasiliensis associated cyclovirus 1 | MAVG02 | P2 | OL704827 | |
|
| Yes; NC_040576 ( | Tadarida brasiliensis associated circovirus 1 | MAVG09, MAVG10 | P1, P2 | OL704834, OL704835 | ||
|
|
| Yes; MT010529 (unclassified canid anellovirus); 64% | Torque teno Tadarida brasiliensis virus 2 | MAVG35 | P2 | OL704857 | |
| Unclassified | Yes; HM633238 ( | Torque teno Tadarida brasiliensis virus 3 | MAVG36 | P2 | OL704858 | ||
| Total | 7 | 13 | 31 | 35 | 36 | 6 |
* and ** indicate that the virus belongs to the same viral species as another virus that was identified during this study. # Sequence identities were measured using SDT [32] at the genomic level relevant for species demarcation as prescribed by the ICTV: complete genome identities for Circo-, Smaco- and Genomoviridae and L1 and ORF1 gene nucleotide sequence identities for Papilloma- and Anelloviridae, respectively, and the amino acid sequence identity of NS1 for Parvoviridae. ## Proposition. ICTV handles the official taxonomical decisions regarding viral species definition and naming.
Figure 2Placement of 18 novel genomoviruses into the phylogenetic context of other viruses clustering to the family Genomoviridae. The phylogenetic tree was built based on the Rep protein multiple sequence alignment (aa), which was produced using mafft (v7.45) [30]. The sequences evaluated in the most recent taxonomic update of the family Genomoviridae [27] were downloaded from GenBank and used as context (n = 109). Newly identified viruses are marked with black dots, and genera are color-coded. Saturated node support values are shown with asterisks (*). The phylogenetic tree was constructed using iqtree v1.6 [35] with 1,000 UFBootstrap replicates [37] and the phylogenetic model LG+F+R9. Tree visualization was facilitated using Figtree v1.4.4 (https://github.com/rambaut/figtree.git, accessed on 25 September 2021), and the tree was rooted at midpoint.
Figure 3Placement of six novel circoviruses, three novel cycloviruses, and one novel circo-like virus into the phylogenetic context of other viruses clustering to the family Circoviridae. The phylogenetic tree was built based on the Rep protein multiple sequence alignment (aa), which was produced using mafft (v7.45) [33]. Context sequences were downloaded from the ICTV Circoviridae data resource (https://talk.ictvonline.org/cfs-file/__key/communityserver-wikis-components-files/00-00-00-00-83/OSD.Cir.Fig1A.Cyclovirus_5F00_circovirus_5F00_reps_5F00_aln_5F00_ed.fas, 25 September 2021). In addition, the first three most similar RefSeq sequences according to blastn searches not yet present among the primary ICTV sequences were added to the database of context sequences (n = 90). Newly identified viruses are marked with black dots, and genera are color-coded. Saturated node support values are shown with asterisks (*). The phylogenetic tree was constructed using iqtree v1.6 [35] with 1000 UFBootstrap replicates [37] and the phylogenetic model LG+F+R9. Tree visualization was facilitated using Figtree v1.4.4 (https://github.com/rambaut/figtree.git, accessed on 25 September 2021), and the tree was rooted in such a way that the two genera, Circovirus and Cyclovirus, were monophyletic.
Figure 4Placement of the three novel papillomaviruses (PVs) into the phylogenetic context of other viruses clustering to the family Papillomaviridae. The tree was built using the concatenation of the E1, E2, L2, and L1 gene sequences (nt) of 384 reference PV genomes and the corresponding genes from potentially novel PVs. The phylogenetic tree was constructed using iqtree v1.6 [35] with the GTR+F+R10 model, which was chosen as the best-fitting model according to the Bayesian information criterion using ModelFinder [36]. Tree visualization was facilitated using Figtree v1.4.4 (https:/github.com/rambaut/figtree.git, accessed on 25 September 2021) and rooted at SaPV1. Branches were annotated with UF bootstrap support (1000 replicates) values [37]. Node support values < 50 are not shown, and saturated node support values are shown with asterisks (*). Alpha-, Beta-, and Gammapapillomavirus genera were collapsed. Novel PV types, MmoPV2, EbonPV1, and EbonPV2, are marked with black dots. Bat PV types are depicted in blue. Un-PV = unclassified PV genera.
Figure 5Placement of two novel anelloviruses into the phylogenetic context of other viruses clustering to the family Anelloviridae. The phylogenetic tree was built using the ORF1 multiple nucleotide sequence alignment (nt), which was produced using mafft (v7.453) [33]. The anellovirus sequences evaluated in the most recent taxonomic update [29] were downloaded from GenBank and used as context (n = 987). Newly identified viruses are marked with black dots, and genera are color-coded. Iota-, Alep-, Rho-, and Kappatorquevirus genera were collapsed. Saturated node support values are shown with asterisks (*). Only a subtree with the 312 most relevant context sequences, rooted using the sequence MF187212, is shown. The phylogenetic tree was constructed using iqtree v1.6 [35] with 1000 UFBootstrap replicates [37] and the phylogenetic model GTR+F+R4, which was chosen as the best-fitting model according to the Bayesian information criterion using ModelFinder [36]. Tree visualization was facilitated using Figtree v1.4.4 (https://github.com/rambaut/figtree.git, accessed on 25 September 2021).
Figure 6Placement of the novel smacovirus into the phylogenetic context of other viruses clustering to the family Smacoviridae. The phylogenetic tree was built based on the Rep protein multiple sequence alignment (aa), which was produced using mafft (v7.45) [33]. The sequences evaluated in the most recent taxonomic update of the family Smacoviridae [28] were downloaded from GenBank and used as context (n = 84). Newly identified viruses are marked with black dots, and genera are color-coded. Saturated node support values are shown with asterisks (*). The phylogenetic tree was constructed using iqtree v1.6 [35] with 1,000 UFBootstrap replicates [37] and the phylogenetic model LG+F+R9. Tree visualization was facilitated using Figtree v1.4.4 (https://github.com/rambaut/figtree.git, accessed on 25 September 2021), and the tree was rooted in a way that allowed most of the genera to be monophyletic.
Figure 7Placement of the novel parvovirus into the phylogenetic context of other viruses clustering to the subfamily Parvovirinae. The phylogenetic tree was built using the NS1 protein multiple sequence alignment; as suggested by the most recent relevant taxonomic update [47], the Rep40 protein subsequence was used in the case of the novel parvovirus (aa). The multiple sequence alignment was produced using mafft (v7.453) [33]. Context sequences were downloaded from the ICTV official resource page for the subfamily Parvovirinae (https://talk.ictvonline.org/ictv-reports/ictv_online_report/ssdna-viruses/w/parvoviridae/1055/resources-parvoviridae; 25 September 2021; n = 59). The black dot marks the newly identified virus. Saturated node support values are shown with asterisks (*). The phylogenetic tree was constructed using iqtree v1.6 [35] with 1000 UFBootstrap replicates [37] and the phylogenetic model rtREV+F+I+G4, which was chosen as the best-fitting model according to the Bayesian information criterion using ModelFinder [36]. Tree visualization was facilitated using Figtree v1.4.4 (https://github.com/rambaut/figtree.git, accessed on 25 September 2021), and the tree was rooted at midpoint.