Kenneth G Frey, Tara Biser, Theron Hamilton, Carlos J Santos, Guillermo Pimentel, Vishwesh P Mokashi, Kimberly A Bishop-Lilly.
Abstract
Mosquitoes are efficient, militarily relevant vectors of infectious disease pathogens, including many RNA viruses. The vast majority of all viruses are thought to be undiscovered. Accordingly, recent studies have shown that viruses discovered in insects are very divergent from known pathogens and that many of them lack appropriate reference sequences in the public databases. Given that the majority of viruses are likely still undiscovered, environmental sampling stands to provide much-needed reference samples as well as genetic sequences for comparison. In this study, we sought to determine whether samples of mosquitoes collected from different sites (the Caribbean and locations on the US East Coast) could be differentiated using metagenomic analysis of the RNA viral fraction. We report here distinct virome profiles, even from samples collected short distances apart. In addition to profiling the previously known viruses from these samples, we detected a number of previously undiscovered viruses.
Keywords: Bunyaviridae; high-throughput sequencing; metagenome sequencing; metagenomics; mosquito virus; novel virus
Year: 2016 PMID: 27346944 PMCID: PMC4912310 DOI: 10.4137/EBO.S38518
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Sequencing statistics by location.
| SAMPLE NAME | SAMPLING LOCATIONS (# INSECTS) | TOTAL SEQUENCING READS | MEAN READ LENGTH (NT) | TOTAL CONTIGS | SRA EXPERIMENT |
|---|---|---|---|---|---|
| Frederick | Frederick, MD (50) | 19.5M | 131 | 32,791 | SRR3168926 |
| Germantown | Germantown, MD (60) | 9.1M | 129.6 | 35,345 | SRR3168927 |
| Jefferson | Jefferson, MD (50) | 29.1M | 140.3 | 6,646 | SRR3168918 |
| Martinsburg | Martinsburg, WV (50) | 24.2M | 137.6 | 1,936 | SRR3168919 |
| Cabo Rojo | Cabo Rojo, PR (14) | 10.1M | 131 | 60,178 | SRR3168917 |
| Lajas | Lajas, PR (25) | 10.2M | 131.1 | 57,421 | SRR3168920 |
| Moca Aguadilla | Aguadilla, PR (5); Moca, PR (11); Aguada, PR (9) | 9.9M | 131.1 | 5,999 | SRR3168921 |
| San German | San German, PR (18) | 9.7M | 154.8 | 65,655 | SRR3168922 |
| Arecibo | Arecibo, PR (25) | 6.8M | 145.8 | 26,810 | SRR3168923 |
| Hatillo | Hatillo, PR (25) | 10.0M | 136.5 | 22,487 | SRR3168924 |
| Mayaguez | Mayaguez, PR (25) | 13.0M | 133.4 | 33,800 | SRR3168916 |
| Ponce Adjuntas | Ponce, PR (10); Adjuntas, PR (12) | 8.7M | 146.8 | 4,967 | SRR3168925 |
Figure 1. Taxonomic profiling of contigs by location. The numbers indicate raw count of contigs in each category. The de novo assembled contigs were assigned to categories using BLAST.
Figure 2. Virome content by location. The proportion of the identifiable virome contributed by each viral family or group is estimated using a read mapping approach. Briefly, raw reads were mapped back to de novo assembled contigs using CLC Genomics Workbench, and for each of the contigs that was identified by BLAST as being of viral origin, the number of raw reads mapping to that contig was extracted and is presented here.
Figure 3. Contribution of virus families to total assembled sequences. The total number of raw reads mapped back to viral contigs using CLC Genomics Workbench per virus family or grouping is presented here, for all samples combined.
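The read-mapping tally described for Figures 2 and 3 can be illustrated with a minimal sketch. It assumes that per-contig mapped-read counts and BLAST-derived family labels have already been exported to plain dictionaries. The function name `family_proportions`, the contig IDs, and all counts are hypothetical placeholders rather than values from the study, and the sketch does not reproduce the CLC Genomics Workbench workflow itself.

```python
from collections import defaultdict


def family_proportions(contig_reads, contig_family):
    """Return each viral family's share of the reads mapped to viral contigs.

    contig_reads:  dict of contig ID -> number of raw reads mapped to it
    contig_family: dict of contig ID -> viral family or group; contigs that
                   were not identified as viral are simply absent
    """
    totals = defaultdict(int)
    for contig, n_reads in contig_reads.items():
        family = contig_family.get(contig)
        if family is None:
            # Contig was not identified as viral, so it is not part of
            # the identifiable virome and is skipped.
            continue
        totals[family] += n_reads

    viral_total = sum(totals.values())
    if viral_total == 0:
        return {}
    return {family: n / viral_total for family, n in totals.items()}


# Hypothetical example values for one sample (not data from the study).
contig_reads = {"c1": 600, "c2": 300, "c3": 100, "c4": 5000}
contig_family = {
    "c1": "Bunyaviridae",
    "c2": "Bunyaviridae",
    "c3": "Unclassified viruses",
}

proportions = family_proportions(contig_reads, contig_family)
for family, share in sorted(proportions.items()):
    print(f"{family}: {share:.1%}")
```

Running the function once per sample would give a per-location profile in the spirit of Figure 2, while summing the read counts across all samples before calling it would give a combined view in the spirit of Figure 3.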
Novel virus sequences discovered in Maryland and West Virginia mosquitoes.
| POOL | CONTIG DESIGNATION | CONTIG LENGTH (KB) | VIRAL GROUP | CLOSEST SEQUENCED RELATIVE | GENE PRODUCT(S) ENCODED | DEPTH |
|---|---|---|---|---|---|---|
| Frederick, MD | 5105 | 0.26 | | Yichang Insect Virus (AJG39334.1) | Nucleocapsid | 2.81 |
| Frederick, MD | 246 | 1.03 | | Yichang Insect Virus (AJG39334.1) | Glycoprotein | 12.66 |
| Frederick, MD | 838 | 0.32 | | Yichang Insect Virus (AJG39334.1) | Glycoprotein | 4.56 |
| Frederick, MD | 4315 | 0.40 | | Yichang Insect Virus (AJG39334.1) | Glycoprotein | 4.53 |
| Frederick, MD | 2895 | 0.37 | | Yichang Insect Virus (AJG39334.1) | Glycoprotein | 5.76 |
| Frederick, MD | 2560 | 0.25 | | Yichang Insect Virus (AJG39334.1) | RdRp | 5.01 |
| Frederick, MD | 562 | 0.34 | | Yichang Insect Virus (AJG39334.1) | RdRp | 4.43 |
| Frederick, MD | 4733 | 9.8 | Unclassified | Rosy apple aphid virus (DQ286292) | Structural protein, polyprotein | 338.97 |
| Frederick, MD | 13,993 | 12.4 | | Xincheng Mosquito virus (KM81766.1) | Nucleoprotein, Glycoprotein, RdRp | 30.28 |
| Martinsburg, WV | 825 | 0.59 | | Manawa Virus (AFN73042.1) | RdRp | 5.68 |
| Martinsburg, WV | 690 | 0.34 | | Manawa Virus (AFN73042.1) | RdRp | 4.17 |
| Martinsburg, WV | 464 | 1.24 | | Manawa Virus (AFN73042.1) | RdRp | 8.22 |
| Martinsburg, WV | 374 | 0.42 | | Manawa Virus (AFN73042.1) | RdRp | 7.35 |
| Martinsburg, WV | 567 | 0.34 | | Manawa Virus (AFN73042.1) | RdRp | 6.25 |
| Martinsburg, WV | 295 | 1.74 | | Manawa Virus (AFN73042.1) | RdRp | 8.40 |
| Jefferson, MD | 5816 | 9.68 | | Cricket Paralysis virus (NP_647481.1) and Aphid Lethal Paralysis virus (NP_733845.1) | 2 Polyproteins, NS and S | 45.10 |
Novel virus sequences discovered in mosquitoes from Puerto Rico.
| POOL | CONTIG DESIGNATION | CONTIG LENGTH (KB) | VIRAL GROUP | CLOSEST SEQUENCED RELATIVE | GENE PRODUCT(S) ENCODED | DEPTH |
|---|---|---|---|---|---|---|
| Arecibo, Puerto Rico | 14,385 | 0.21 | Unclassified viruses | Dansoman virus (KP714086.1) and/or Chronic Bee Paralysis virus | Replicase | 8.10 |
| Arecibo, Puerto Rico | 16,219 | 0.36 | Unclassified viruses | Dansoman virus (KP714086.1) and/or Chronic Bee Paralysis virus | Replicase | 2.84 |
| Arecibo, Puerto Rico | 24,685 | 0.23 | | Culex theileri flavivirus RP-2011 | Polyprotein | 1.94 |
| Cabo Rojo, Puerto Rico | 11,265 | 0.70 | | Wuhan Mosquito Virus 9 (AJG39218.1) | RdRp | 3.41 |
| Cabo Rojo, Puerto Rico | 36,482 | 0.31 | | Wuhan Mosquito Virus 9 | RdRp | 4.36 |
| Cabo Rojo, Puerto Rico | 43,737 | 0.25 | | Sanxia Water Strider virus 5 strain SX5SP11 (KM817634.1) and Wuhan Mosquito Virus 9 (AJG39218.1) | RdRp | 5.95 |
| Cabo Rojo, Puerto Rico | 5468 | 0.54 | | Wuhan Mosquito Virus 9 strain JX1-13 (KM817659.1) | RdRp | 3.39 |
| Cabo Rojo, Puerto Rico | 40,415 | 0.28 | | Culex pipiens densovirus (YP_002887624.1) | Nonstructural protein | 3.07 |
| Hatillo, Puerto Rico | 171 | 0.77 | Unclassified viruses | Ixodes scapularis associated virus 2 (AII01812.1) | RdRp | 50.18 |
| Hatillo, Puerto Rico | 174 | 0.77 | Unclassified viruses | Ixodes scapularis associated virus 2 (AII01812.1) | RdRp | 46.38 |
| Hatillo, Puerto Rico | 8438 | 0.23 | Unclassified viruses | Ixodes scapularis associated virus 1 (AII01797.1) | RdRp | 2.97 |
| Hatillo, Puerto Rico | 1161e | 1.06 | | Bat sobemovirus (AGN73380.1) | Capsid | 48.66 |
| Hatillo, Puerto Rico | 1162 | 1.35 | | Bat sobemovirus (AGN73380.1) | Capsid | 19.19 |
| Mayaguez, Puerto Rico | 213 | 1.78 | Unclassified viruses | Ixodes scapularis associated virus 2 (AII01812.1) | RdRp | 174.41 |
| Mayaguez, Puerto Rico | 371 | 0.52 | Unclassified viruses | Ixodes scapularis associated virus 2 (AII01811.1) | Protease | 181.93 |
| Mayaguez, Puerto Rico | 372 | 2.74 | Unclassified viruses | Ixodes scapularis associated virus 2 (AII01811.1 and AII01812.1) | RdRp and protease | 68.58 |
| Mayaguez, Puerto Rico | 10,131 | 0.22 | | Wuhan Mosquito Virus 5 strain XC1-7 (KM817624.1) | PB1 protein | 2.19 |
| Mayaguez, Puerto Rico | 12,947 | 0.45 | | Wuhan Mosquito Virus 5 strain XC1-7 (KM817624.1) | PB1 protein | 3.70 |
| Mayaguez, Puerto Rico | 15,309 | 0.28 | | Wuhan Mosquito Virus 5 strain XC1-7 (KM817624.1) | PB1 protein | 2.71 |
| Mayaguez, Puerto Rico | 17,647 | 0.33 | | Wuhan Mosquito Virus 5 strain XC1-7 (KM817624.1) | PB1 protein | 4.34 |
| Mayaguez, Puerto Rico | 1772 | 2.35 | | Wellfleet Bay virus | Nucleoprotein | 11.73 |
| San German, Puerto Rico | 3879 | 1.32 | | Wellfleet Bay virus (YP_00911683.1) | Nucleoprotein | 25.54 |
Notes:
e: Denotes edited sequence. The portion of the chimeric contig that corresponded to eukaryotic mitochondrial RNA was removed.
Figure 4. Novel bunyavirus example from Frederick, MD. In this schematic, contigs corresponding to various genes of a proposed novel bunyavirus from mosquitoes collected in Frederick, MD, are shown aligned to the three closest sequenced relatives. (A) The contigs corresponding to RdRp sequences, (B) contigs corresponding to glycoprotein sequences, and (C) contigs corresponding to nucleocapsid sequences. Percent amino acid identity for these contigs and for Cumuto and Gouleako sequences is shown with respect to Yichang insect virus protein sequences.
Figure 5. Schematic of virus genome encoded by Frederick contig 13,993. The four ORFs encoded by Frederick contig 13,993 are depicted here along with their putative protein products and the amino acid identity of those products to the sequences of the closest sequenced relative, Xincheng Mosquito virus.