| Literature DB >> 34643419 |
Solomon K Langat1,2, Fredrick Eyase3,4, Wallace Bulimo2,5, Joel Lutomiah2, Samuel O Oyola6, Mabel Imbuga1, Rosemary Sang2.
Abstract
Vector-borne diseases (VBDs) cause enormous health burden worldwide, as they account for more than 17% of all infectious diseases and over 700,000 deaths each year. A significant number of these VBDs are caused by RNA virus pathogens. Here, we used metagenomics and metabarcoding analysis to characterize RNA viruses and their insect hosts among biting midges from Kenya. We identified a total of 15 phylogenetically distinct insect-specific viruses. These viruses fall into six families, with one virus falling in the recently proposed negevirus taxon. The six virus families include Partitiviridae, Iflaviridae, Tombusviridae, Solemoviridae, Totiviridae, and Chuviridae. In addition, we identified many insect species that were possibly associated with the identified viruses. Ceratopogonidae was the most common family of midges identified. Others included Chironomidae and Cecidomyiidae. Our findings reveal a diverse RNA virome among Kenyan midges that includes previously unknown viruses. Further, metabarcoding analysis based on COI (cytochrome c oxidase subunit 1 mitochondrial gene) barcodes reveal a diverse array of midge species among the insects used in the study. Successful application of metagenomics and metabarcoding methods to characterize RNA viruses and their insect hosts in this study highlights a possible simultaneous application of these two methods as cost-effective approaches to virus surveillance and host characterization. IMPORTANCE The majority of the viruses that currently cause diseases in humans and animals are RNA viruses, and more specifically arthropod-transmitted viruses. They cause diseases such as dengue, West Nile infection, bluetongue disease, Schmallenberg disease, and yellow fever, among others. Several sequencing investigations have shown us that a diverse array of RNA viruses among insect vectors remain unknown. Some of these could be ancient lineages that could aid in comprehensive studies on RNA virus evolution. Such studies may provide us with insights into the evolution of the currently pathogenic viruses. Here, we applied metagenomics to field-collected midges and we managed to characterize several RNA viruses, where we recovered complete and nearly complete genomes of these viruses. We also characterized the insect host species that are associated with these viruses. These results add to the currently known diversity of RNA viruses among biting midges as well as their associated insect hosts.Entities:
Keywords: RNA viruses; biting midges; metabarcoding; metagenomics
Mesh:
Year: 2021 PMID: 34643419 PMCID: PMC8513680 DOI: 10.1128/mSphere.00551-21
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Viruses identified in this study
| Strain | Site | Family classification | Accession no. | Closest hit | % identity (RdRp; aa) |
|---|---|---|---|---|---|
| Turkana_1 | Turkana |
|
| Medway virus | 59.05 |
| Turkana_2 | Turkana |
|
| Chuvirus Mos8Chu0 | 42.39 |
| Turkana_3 | Turkana | Tombus-like |
| Diaphorina citri associated C virus | 55.24 |
| Turkana_4 | Turkana |
|
| Lindangsbacken virus | 50.79 |
| Turkana_5 | Turkana |
|
| Heliconius erato iflavirus | 85.62 |
| Turkana_6 | Turkana |
|
| Lichen partiti-like RNA virus | 46.14 |
| Turkana_7 | Turkana | Partiti-like |
| Hubei partiti-like virus 45 | 52.74 |
| Turkana_8 | Turkana | Iflavi-like |
| Redbank virus | 66.3 |
| Turkana_9 | Turkana | Negevirus |
| Sandewavirus dungfly1 | 39.13 |
| Turkana_10 | Turkana | Picorna-like |
| Boghill Burn virus | 84.27 |
| Baringo_1 | Baringo | Partiti-like |
| Araticum virus | 56.01 |
| Budalangi_1 | Budalangi | Iflavi-like |
| Redbank virus | 70.84 |
| Isiolo_1 | Isiolo |
|
| Culex Iflavi-like virus 4 | 97.7 |
| Kacheliba_1 | Kacheliba | Iflavi-like |
| Hubei picorna-like virus 38 | 46.37 |
| Kacheliba_2 | Kacheliba |
|
| Grapevine partitivirus | 51.61 |
FIG 1Genome architectures of the representative genomes of complete iflavi-like viruses (A), complete novel negevirus genome (B), partial Chuviridae genome showing the RNA-dependent RNA polymerase coding segment (C), a novel Solemoviridae virus with the two putative segments (D), and the representative partial Partitiviridae genome showing the RNA-dependent RNA polymerase coding segment (E). Similar organization was observed in the other Partitiviridae genomes obtained in the study.
FIG 2Midpoint rooted phylogenies for negeviruses (A), Iflaviridae (B) Tombusviridae (C), and Picornaviridae (D) and related sequences. The trees were inferred based on 1,000 bootstrap replicates and an approximate-likelihood-ratio test. Confidence values are shown in the tree nodes, and the sequences obtained from the study are in red.
FIG 3Midpoint rooted phylogenies for Solemoviridae (A), Chuviridae (B), Partitiviridae (C), and Totiviridae (D) and related sequences. The trees were inferred based on 1,000 bootstrap replicates and an approximate-likelihood-ratio test. Confidence values are shown in the tree nodes, and the sequences obtained from the study are in red.
Community composition of the different vector pools for each of the sites
| Site | No. of specimens | Family | Species | Sequence abundance | Fraction of COI reads (%) | OTU abundance |
|---|---|---|---|---|---|---|
| Isiolo | 640 (114) |
| 637 | 21.03 | 21 | |
|
|
| 260 | 8.58 | 2 | ||
|
| 33 | 1.09 | 1 | |||
|
| 124 | 4.09 | 2 | |||
| 918 | 30.31 | 4 | ||||
|
| 53 | 1.75 | 3 | |||
| 343 | 11.32 | 8 | ||||
| 501 | 16.54 | 9 | ||||
| 160 | 5.28 | 1 | ||||
| Baringo | 892 (873) |
|
| 109 | 1.46 | 7 |
|
| 4,871 | 65.46 | 17 | |||
|
| 987 | 13.26 | 4 | |||
|
| 1,474 | 19.81 | 8 | |||
| Turkana | 1,063 (884) |
| 212 | 5.88 | 8 | |
|
|
| 260 | 7.21 | 6 | ||
|
| 1,010 | 27.99 | 9 | |||
| 102 | 2.83 | 1 | ||||
|
| 354 | 9.81 | 6 | |||
|
| 745 | 20.65 | 14 | |||
|
| 119 | 3.3 | 2 | |||
| 806 | 22.34 | 5 | ||||
| Kacheliba | 156 (6) |
| 1110 | 100 | 36 | |
| Budalangi | 600 (600) |
|
| 4940 | 100 | 13 |
The total number of specimens processed for each site is given, with the number of specimens morphologically classified as Culicoides sp. in parentheses.
Number of consensus sequences obtained after sequence clustering.