| Literature DB >> 31429704 |
Eugeni Belda1,2,3, Ferdinand Nanfack-Minkeu1,2,4, Karin Eiglmeier1,2, Guillaume Carissimo5, Inge Holm1,2, Mawlouth Diallo6, Diawo Diallo6, Amélie Vantaux7, Saorin Kim7, Igor V Sharakhov8, Kenneth D Vernick9,10.
Abstract
BACKGROUND: Mosquitoes are colonized by a large but mostly uncharacterized natural virome of RNA viruses, and the composition and distribution of the natural RNA virome may influence the biology and immunity of Anopheles malaria vector populations.Entities:
Keywords: Anopheles; Insect specific virus; Malaria vector; RNA virus; Virome; Virus genome assembly
Mesh:
Substances:
Year: 2019 PMID: 31429704 PMCID: PMC6702732 DOI: 10.1186/s12864-019-6034-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Taxonomic profile of Anopheles sample pools. Relative abundance values of Anopheles species were computed by mapping long RNAseq reads to mitochondrial cytochrome C oxidase subunit I gene sequences from the Barcode of Life COI-5P Database. Taxa represented by > 100 sequence reads and 1% frequency in the sample pool were plotted in pie charts. White wedges in pie charts represent the combined proportion of all sequence matches that were individually present at less than 1% frequency in the sample. All data are presented in tabular form in Additional file 1: Table S1
Summary of virus assemblies, Senegal Anopheles sample pools
| Reference virus | 3NCBI classification reference virus | Closest relative (BLASTX) | Assembled sequence | Alignment length (identities, positives, gaps, frame) | Score (E-value) Blastx | Alignment coordinates (start..end Assembled sequence-start..end Closest relative) |
|---|---|---|---|---|---|---|
| DsRNA virus environmental sample clone mill.culi_contig84 | Viruses; dsRNA viruses; environmental samples. | gi|766989332|gb|AJT39580.1| proline-alanine-rich protein [dsRNA virus environmental sample] (975 aa) | PrAlaRichProt_EnvVirDak (1345 nt) | 308 (71,122,6,+ 1) | 77(1e-11) | 4..909–653..954 |
| Homalodisca vitripennis reovirus segment S3 | Viruses; dsRNA viruses; Reoviridae; Sedoreovirinae; Phytoreovirus; unclassified Phytoreovirus | gi|226423326|ref.|YP_002790886.1| major core protein [Homalodisca vitripennis reovirus] (1021 aa) | CP_HVreovirusDak (5674 nt) | 483 (113,206,21,+ 3) | 90(2e-14) | 3774..5159–211..671 |
| Daeseongdong virus 1 strain A12.2708/ROK/2012 | Viruses; unclassified viruses. | gi|959121745|ref.|YP_009182191.1| putative RNA-dependent RNA polymerase [Daeseongdong virus 1]. (2361 aa) | RdRP_DaeseondongVirDak (2530 nt) | 503 (125,225,6,-1) | 135(1e-28) | 875..2365–45..489 |
| Ixodes scapularis associated virus 2 isolate A1, partial genome | Viruses; unclassified viruses. | gi|669132782|gb|AII01812.1| hypothetical protein, partial [Ixodes scapularis associated virus 2] (337 aa) | HP1.1_IxodesVirDak (2820 nt) | 340 (141,210,2,-2) | 267(3e-77) | 105..1118–3..337 |
| HP1.2_IxodesVirDak (2561 nt) | 319 (116,157,11,-1) | 159(4e-39) | 156..1079–3..319 | |||
| Uncultured virus isolate acc_7.4 | Viruses; environmental samples. | gi|545716017|gb|AGW51759.1| RNA-dependent RNA polymerase-like protein [uncultured virus] (473 aa) | RdRP_UncVir1Dak (1488 nt) | 475 (217,295,8,-2) | 377(e-121) | 57..1457–1..471 |
| Uncultured virus isolate acc_1.3 | Viruses; environmental samples. | gi|545716010|gb|AGW51755.1| RNA-dependent RNA polymerase-like protein, partial [uncultured virus] (1445 aa) | RdRP_UncVir2Dak (2011 nt) | 236 (168,200,0,+ 1) 178 (146,157,0,+ 3) 91 (71,80,0,+ 2) | 353 (0.0) 298 (0.0) 152 (0.0) | 25..732–58..293 1020..1553–408..585 752..1024–320..410 |
| American dog tick phlebovirus isolate FI3 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Bunyavirales; Phlebovirus; unclassified Phlebovirus. | gi|734669629|gb|AJA31764.1| nucleocapsid, partial [American dog tick phlebovirus] (245 aa) | NuclCap1.1_ADTphlebovirusDak (1105 nt) | 142 (55,75,11,+ 2) | 75(5e-12) | 191..583–38..175 |
| NuclCap1.2_ADTphlebovirusDak (1148 nt) | 151 (53,71,9,+ 2) | 72(3e-11) | 173..598–27..175 | |||
| Culex tritaeniorhynchus rhabdovirus RNA, complete genome, strain: TY | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | gi|700895640|ref.|YP_009094323.1| large protein [Culex tritaeniorhynchus rhabdovirus] (2123 aa) | LP_CulexRhabdovDak (1526 nt) | 266 (101,149,0,+ 3) 108 (33,45,0,+ 2) 72 (50,59,0,+ 2) | 177(6e-52) 56(6e-52) 115(2e-23) | 498..1295–155..417 149..472–38..145 1307..1522–593..664 |
| Phasi Charoen-like virus | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Bunyaviridae; unclassified Bunyaviridae. | gi|664682120|gb|AIF71032.1| nucleocapsid [Phasi Charoen-like virus] (268 aa) | NuclCap1.2_PCLVDak (1104 nt) | 82 (49,57,1,+ 3) | 84(6e-26) | 699..941–116..197 |
| NuclCap1.3_PCLVDak (1125 nt) | 97 (57,74,0,+ 2) | 115(7e-48) | 119..409–65..161 | |||
| NuclCap1.4_PCLVDak (1144 nt) | 262 (114,167,3,+ 3) | 217(7e-64) | 153..929–6..267 | |||
| gi|870898373|gb|AKP18600.1| glycoprotein [Phasi Charoen-like virus] (1237 aa) | GP_PCLVDak (3887 nt) | 1185 (436,711,15,+ 1) | 894 (0.0) | 91..3600–20..1197 | ||
| gi|664682116|gb|AIF71030.1| RNA-dependent RNA polymerase [Phasi Charoen-like virus] (2217 aa) | RdRP_PCLVDak (6711 nt) | 2213 (1042,1432,52,+ 1) | 1944 (0.0) | 148..6630–8..2196 | ||
| Wellfleet Bay virus isolate 10–280-G segment 4 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Quaranjavirus; unclassified Quaranjavirus. | gi|727361119|ref.|YP_009110683.1| nucleoprotein [Wellfleet Bay virus] (537 aa) | Nuclprot1.1_WBvirDak (1973 nt) | 213 (60,101,7,+ 3) | 68(2e-08) | 705..1322–149..354 |
| Nuclprot1.2_WBvirDak (3252 nt) | 459 (114,197,39,+ 3) | 117(8e-24) | 1413..2672–11..451 | |||
| Wuhan Mosquito Virus 9 strain JX1–13 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455731|gb|AJG39214.1| ORF1 [Wuhan Mosquito Virus 9] (279 aa) | ORF1_Wuhan9virDak (1574 nt) | 204 (57,94,2,+ 3) | 86(3e-15) | 867..1472–13..210 |
| Wuhan Mosquito Virus 1 strain WT3–15 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455880|gb|AJG39296.1| glycoprotein precursor [Wuhan Mosquito Virus 1]. (681 aa) | GP_Wuhan1virDak (3547 nt) | 153 (97,122,11,+ 1) | 219(3e-56) | 2023..2448–243..395 |
| Wuhan Spider Virus strain SYZZ-2 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455826|gb|AJG39269.1| RNA-dependent RNA polymerase [Wuhan Spider Virus] (2251 aa) | RdRP1.1_WSVDak (998 nt) | 354 (87,154,26,+ 3) | 104(1e-20) | 6..989–1239..1589 |
| RdRP1.2_WSVDak (2083 nt) | 730 (209,327,36 + 2) | 263(1e-70) | 2..2083–531..1243 | |||
| RdRP1.3_WSVDak (1070 nt) | 377 (108,172,23 + 2) | 139(9e-32) | 2..1063–579..951 | |||
| Xincheng Mosquito Virus strain XC1–6 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455743|gb|AJG39224.1| ORF1 [Xincheng Mosquito Virus] (462 aa) | ORF1_XinchengVirDak (947 nt) | 299 (73,122,11,+ 2) | 55(5e-05) | 81..944–22..311 |
| gi|752455744|gb|AJG39225.1| ORF2 [Xincheng Mosquito Virus] (437 aa) | ORF2_XinchengVirDak (1726 nt) | 165 (44,88,2,+ 1) | 67(2e-08) | 145..633–260..422 | ||
| gi|752455745|gb|AJG39226.1| glycoprotein [Xincheng Mosquito Virus] (648 aa) | GP_XinchengVirDak (5993 nt) | 592 (352,445,0,+ 3) | 751 (0.0) | 3720..5495–16..607 | ||
| gi|752455746|gb|AJG39227.1| RNA-dependent RNA polymerase [Xincheng Mosquito Virus] (2026 aa) | RdRP1.1_XinchengVirDak (11707 nt) | 2001 (733,1101,50,+ 3) | 1226 (0.0) | 5571..11423–28..1977 | ||
| RdRP1.2_XinchengVirDak (11722 nt) | 2001 (733,1101,50,+ 3) | 1226 (0.0) | 5586..11438–28..1977 | |||
| RdRP1.3_XinchengVirDak (11710 nt) | 2001 (734,1104,50,+ 3) | 1229 (0.0) | 293..6145–28..1977 | |||
| RdRP1.4_XinchengVirDak (11694 nt) | 2001 (734,1104,50,-3) | 1229 (0.0) | 293..6145–28..1977 | |||
| RdRP1.5_XinchengVirDak (11728 nt) | 1955 (733,1093,47,-1) | 1233 (0.0) | 440..6163–28..1924 | |||
| RdRP1.6_XinchengVirDak (11716 nt) | 1955 (733,1093,47,+ 2) | 1233 (0.0) | 5561..11284–28..1924 | |||
| RdRP1.7_XinchengVirDak (6128 nt) | 1962 (1088,1397,9,-3) | 2179 (0.0) | 124..5982–36..1994 | |||
| Xinzhou Mosquito Virus strain XC3–5 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455830|gb|AJG39271.1| RNA-dependent RNA polymerase [Xinzhou Mosquito Virus]. (2022 aa) | RdRP1.1_XinzhouVirDak (7527 nt) | 2017 (1403,1670,3,+ 1) | 2863 (0.0) | 1315..7356–1..2013 |
| RdRP1.2_XinzhouVirDak (7524 nt) | 2020 (1403,1676,6,+ 1) | 2864 (0.0) | 1312..7353–1..2013 | |||
| Sunn-hemp mosaic virus | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Virgaviridae; Tobamovirus. | gi|12643499|sp.|P89202.2|RDRP_SHMV RecName: Full = Replicase large subunit] (1629 aa) | RdRP_SHMVDak (1216 nt) | 365 (105,171,13,-1) | 125(4e-27) | 2..1057–932..1240 |
| Omono River virus | Viruses; dsRNA viruses | gi|307933351|dbj|BAJ21511.1| RNA-dependent RNA polymerase [Omono River virus] (621 aa) | RdRP_OmonoVirDak (613 nt) | 194 (71,110,1,-1) | 131(1e-31) | 5..583–135..323 |
| Beaumont virus strain 6 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | gi|550631504|gb|AGX86091.1| RNA-dependent RNA polymerase, partial [Beaumont virus] (1568 aa) | RdRP_BeaumontVirDak (805 nt) | 153 (76,98,0,+ 3) | 142(9e-34) | 3..461–417..569 |
Summary of virus assemblies, Cambodia Anopheles sample pools
| Reference virus | NCBI classification reference virus | Closest relative (BLASTX) | Assembled sequence | Alignment length (identities, positives, gaps, frame) | Score (E-value) Blastx | Alignment coordinates (start..end Assembled sequence-start..end Closest relative) |
|---|---|---|---|---|---|---|
| uncultured virus | Viruses; environmental samples. | RNA-dependent RNA polymerase-like protein, partial [uncultured virus] (KF298266.1) (1445 aa) | RdRP1.1_UncVir2Camb (857 nt) | 261 (176,212,0,+ 3) | 379(e-117) | 75..857–1..261 |
| RdRP1.2_UncVir2Camb (797 nt) | 194 (135,161,0,+ 2) | 282(8e-83) | 212..793–102..295 | |||
| RdRP1.3_UncVir2Camb (2177 nt) | 486 (350,408,0,+ 1) | 731 (0.0) | 718..2175–1..486 | |||
| RdRP1.4_UncVir2Camb (2574 nt) | 748 (525,619,0,+ 3) | 1063 (0.0) | 6..2249–567..1314 | |||
| RdRP1.6_UncVir2Camb (1716 nt) | 365 (274,312,0,+ 3) 83 (61,72,0,+ 1) | 563 (0.0) 132 (0.0) | 315..1409–85..449 64..312–1..83 | |||
| Culex tritaeniorhynchus rhabdovirus RNA, complete genome; NC_025384 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | gi|700895639|ref.|YP_009094322.1| glycoprotein [Culex tritaeniorhynchus rhabdovirus] (503 aa) | GP_CulexRhabdovCamb (2866 nt) | 360 (85,130,15,+ 1) | 83(6e-13) | 1261..2295–49..397 |
| gi|700895640|ref.|YP_009094323.1| large protein [Culex tritaeniorhynchus rhabdovirus] (2123 aa) | LP1.1_CulexRhabdovCamb (755 nt) | 125 (36,58,0,+ 1) | 64(2e-08) | 64..438–42..166 | ||
| LP1.2_CulexRhabdovCamb (1454 nt) | 211 (87,130,3,+ 3) | 165(1e-39) | 498..1121–214..421 | |||
| Phasi Charoen-like virus | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Bunyavirales; unclassified Bunyavirales | gi|870898373|gb|AKP18600.1| glycoprotein [Phasi Charoen-like virus] (1237 aa) | GP1.1_PCLVCamb (642 nt) | 221 (61,110,8,+ 3) | 103(2e-21) | 3..641–226..444 |
| GP1.2_PCLVCamb (933 nt) | 292 (141,197,0,+ 1) | 297(4e-88) | 4..879–813..1104 | |||
| gi|870898376|gb|AKP18601.1| Nucleocapsid [Phasi Charoen-like virus] (268 aa) | NuclCap1.1_PCLVCamb (1104 nt) | 269 (119,165,3,+ 3) | 214(4e-63) | 207..1004–1..267 | ||
| NuclCap1.2_PCLVCamb (533 nt) | 113 (48,77,0,+ 2) | 97(8e-28) | 47..385–3..115 | |||
| NuclCap1.3_PCLVCamb (535 nt) | 116 (69,90,0,+ 2) | 143(4e-38) | 119..466–64..179 | |||
| Bivens Arm virus isolate UF 10 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; Tibrovirus | gi|751997168|gb|AJG05818.1| nucleoprotein N [Bivens Arm virus] (428 aa) | NProt_BivArmsVirCamb (516 nt) | 162 (47,76,10,+ 1) | 52(6e-05) | 52..507–22..176 |
| Jurona virus | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; Vesiculovirus. | gi|701219310|ref.|YP_009094377.1| polymerase [Jurona virus] > gnl|... 176 2e-43 (2090 aa) | RdRP_JuronaVirCamb (1329 nt) | 303 (110,155,0,-2) | 176(2e-43) | 3..911–1635..1935 |
| Puerto Almendras virus isolate LO-39 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | gi|701219331|ref.|YP_009094394.1| L protein [Puerto Almendras vir... 213 2e-54 (2059 aa) | Lprot1.1_PAvirCamb (1869 nt) | 587 (163,281,28,-3) | 213(2e-54) | 191..1867–1461..2039 |
| Lprot1.2_PAvirCamb (3895 nt) | 1308 (657,904,10,-2) | 1318 (0.0) | 1..3894–161..1467 | |||
| gi|701219327|ref.|YP_009094389.1| N protein [Puerto Almendras virus] (434 aa) | Nprot_PAvirCamb (4449 nt) | 431 (109,194,18,-2) | 123(7e-26) | 2964..4202–8..426 | ||
| Beaumont virus strain 6 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | gi|550631504|gb|AGX86091.1| RNA-dependent RNA polymerase, partial [Beaumont virus] (1568 aa) | RdRP1.1_BeaumontVirCamb (586 nt) | 81 (42,62,0,+ 2) | 100(2e-20) | 344..586–187..267 |
| RdRP1.2_BeaumontVirCamb (633 nt) | 187 (106,137,0,+ 2) | 233(3e-66) | 65..625–1..187 | |||
| RdRP1.3_BeaumontVirCamb (594 nt) | 156 (98,125,0,+ 1) | 198(2e-54) | 40..507–1225..1380 | |||
| RdRP1.4_BeaumontVirCamb (1359 nt) | 452 (288,349,0,+ 3) | 592 (0.0) | 3..1358–228..679 | |||
| RdRP1.5_BeaumontVirCamb (1606 nt) | 536 (365,428,1,+ 1) | 758 (0.0) | 1..1605–26..561 | |||
| RdRP1.6_BeaumontVirCamb (1141 nt) | 380 (236,287,1,+ 1) | 477(e-151) | 1..1137–670..1049 | |||
| RdRP1.7_BeaumontVirCamb (1667 nt) | 324 (169,222,15,+ 3) | 317(5e-91) | 705..1631–303..622 | |||
| Wellfleet Bay virus isolate 10–280-G segment 4 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Quaranjavirus; unclassified Quaranjavirus. | gi|727361119|ref.|YP_009110683.1| nucleoprotein [Wellfleet Bay virus] (537 aa) | Nuclprot1.1_WBvirCamb (1011 nt) | 114 (41,68,2,-3) | 75(1e-11) | 2..337–149..262 |
| Nuclprot1.2_WBvirCamb (1139 nt) | 311 (82,141,9,+ 1) | 98(7e-19) | 127..1032–127..420 | |||
| Nuclprot1.3_WBvirCamb (2942 nt) | 353 (81,143,11,-3) | 79(8e-12) | 1267..2292–103..429 | |||
| Xinzhou Mosquito Virus strain XC3–5 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455830|gb|AJG39271.1| RNA-dependent RNA polymerase [Xinzhou Mosquito Virus] (2022 aa) | RdRP_XinzhouVirCamb (8129 nt) | 1401 (1076,1212,0,+ 1) 180 (118,144,0,+ 2) 392 (193,262,12,+ 3) | 2189 (0.0) 241(4e-60) 389(e-105) | 1300..5502–1..1401 5504..6043–1402..1579 6219..7358–1639..2019 |
| Wuhan Mosquito Virus 1 strain WT3–15 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455822|gb|AJG39267.1| RNA-dependent RNA polymerase [Wuhan Mosquito Virus 1] (2095 aa) | RdRP1.1_Wuhan1virCamb (3576 nt) | 1167 (716,918,1,+ 1) | 1519 (0.0) | 1..351–932..2070 |
| RdRP1.2_Wuhan1virCamb (2929 nt) | 966 (625,749,0,+ 1) | 1287 (0.0) | 31..2928–1..966 | |||
| RdRP1.3_Wuhan1virCamb (3939 nt) | 1300 (907,1072,0,+ 1) | 1889 (0.0) | 19..3918–49..1348 | |||
| RdRP1.5_Wuhan1virCamb (518 nt) | 169 (78,113,0,+ 1) | 166(3e-43) | 10..516–1611..1779 | |||
| RdRP1.6_Wuhan1virCamb (6431 nt) | 2095 (1287,1605,0,+ 3) | 2670 (0.0) | 63..6347–1..2094 | |||
| RdRP1.7_Wuhan1virCamb (6435 nt) | 2096 (1327,1647,0,-2) | 2737 (0.0) | 81..6368–1..2095 | |||
| gi|752455880|gb|AJG39296.1|glycoprotein precursor [Wuhan Mosquito Virus 1] (681 aa) | GP1.1_Wuhan1virCamb (523 nt) | 87 (48,75,0,+ 2) | 116(1e-26) | 92..352–589..675 | ||
| GP1.3_Wuhan1virCamb (1282 nt) | 107 (70,85,0,+ 1) | 160(4e-47) | 781..1101–273..379 | |||
| GP1.5_Wuhan1virCamb (2231 nt) | 652 (326,445,1,-3) | 704 (0.0) | 166..2118–24..675 | |||
| GP1.6_Wuhan1virCamb (2205 nt) | 675 (326,473,1,+ 1) | 717 (0.0) | 22..2043–1..675 | |||
| GP1.7_Wuhan1virCamb (2219 nt) | 640 (344,466,1,+ 1) | 740 (0.0) | 127..2043–36..675 | |||
| gi|752455945|gb|AJG39330.1| nucleopasid protein [Wuhan Mosquito Virus 1] (345 aa) | NuclCap1.1_Wuhan1virCamb (645 nt) | 210 (122,150,0,+ 1) | 234(6e-72) | 16..645–43..249 | ||
| NuclCap1.2_Wuhan1virCamb (735 nt) | 174 (107,131,0,+ 1) | 211(2e-62) | 214..735–23..193 | |||
| NuclCap1.3_Wuhan1virCamb (629 nt) | 205 (122,149,0,+ 2) | 231(7e-71) | 14..628–38..239 | |||
| NuclCap1.4_Wuhan1virCamb (546 nt) | 103 (53,68,0,+ 3) | 110(2e-25) | 3..311–228..330 | |||
| NuclCap1.5_Wuhan1virCamb (549 nt) | 168 (86,109,0,+ 3) | 156(2e-42) | 45..548–2..165 | |||
| NuclCap1.7_Wuhan1virCamb (1259 nt) | 160 (81,110,0,+ 1) | 168(5e-44) | 1..480–174..333 | |||
| NuclCap1.8_Wuhan1virCamb (1015 nt) | 240 (126,162,7,+ 1) | 228(6e-68) | 316..1014–6..241 | |||
| NuclCap1.9_Wuhan1virCamb (3081 nt) | 310 (172,218,0,+ 1) | 332(e-100) | 1213..2142–2..307 | |||
| NuclCap1.10_Wuhan1virCamb (1473 nt) | 342 (178,229,0,+ 3) | 339(e-108) | 366..1391–6..336 | |||
| NuclCap1.11_Wuhan1virCamb (1791 nt) | 332 (178,225,0,-3) | 343(e-109) | 782..1777–6..333 | |||
| NuclCap1.12_Wuhan1virCamb (2147 nt) | 346 (184,230,1,+ 3) | 345(e-108) | 357..1391–6..343 | |||
| Wuhan Mosquito Virus 9 strain JX1–13 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | gi|752455734|gb|AJG39217.1| glycoprotein [Wuhan Mosquito Virus 9] (614 aa) | GP1.1_Wuhan9virCamb (658 nt) | 164 (43,86,1,-1) | 71(6e-11) | 2..490–50..210 |
| GP1.2_Wuhan9virCamb (924 nt) | 130 (31,63,2,+ 1) | 58(5e-06) | 232..615–101..230 | |||
| GP1.3_Wuhan9virCamb (2429 nt) | 89 (20,43,0,+ 2) 353 (82,148,6,+ 1) | 41(1e-23) 99(1e-23) | 590..856–51..139 841..1881–135..441 | |||
| gi|752455731|gb|AJG39214.1| ORF1 [Wuhan Mosquito Virus 9] (279 aa) | ORF1.1_Wuhan9virCamb (1872 nt) | 192 (55,91,2,+ 1) | 85(9e-15) | 1012..1581–13..198 | ||
| ORF1.2_Wuhan9virCamb (1625 nt) | 90 (27,49,2,+ 1) | 55(4e-05) | 1075..1338–13..102 | |||
| ORF1.3_Wuhan9virCamb (1202 nt) | 205 (56,96,2,+ 2) | 85(3e-15) | 515..1123–13..211 | |||
| Xincheng Mosquito Virus strain XC1–6 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses | gi|752455743|gb|AJG39224.1| ORF1 [Xincheng Mosquito Virus] (462 aa) | ORF1_XinchengVirCamb (1329 nt) | 28 (11,15,0,+ 2) 155 (62,92,16,+ 3) 97 (37,57,1,+ 1) | 27(1e-36) 105(1e-36) 70(1e-36) | 227..310–155..182 339..755–194..346 769..1056–351..445 |
| gi|752455745|gb|AJG39226.1| glycoprotein [Xincheng Mosquito Virus] (648 aa) | GP1.1_XinchengVirCamb (509 nt) | 132 (56,80,0,+ 2) | 120(3e-28) | 5..400–421..552 | ||
| GP1.2_XinchengVirCamb (953 nt) | 242 (98,147,1,+ 2) | 203(5e-56) | 50..772–139..379 | |||
| GP1.3_XinchengVirCamb (1635 nt) | 119 (39,67,9,+ 2) 143 (49,74,17,+ 3) | 79(6e-12) 85(4e-14) | 491..820–244..362 639..1016–303..445 | |||
| GP1.5_XinchengVirCamb (1313 nt) | 100 (63,80,0,+ 2) 165 (106,135,0,+ 3) | 150(3e-36) 232(3e-65) | 1010..1309–190..289 546..1040–290..450 | |||
| GP1.6_XinchengVirCamb (3076 nt) | 402 (176,254,14,+ 2) | 362(e-108) | 1814..2977–230..631 | |||
| GP1.7_XinchengVirCamb (1314 nt) | 350 (164,242,0,+ 3) | 371(e-118) | 228..1277–268..617 | |||
| GP1.8_XinchengVirCamb (2660 nt) | 627 (242,370,36,+ 2) | 475(e-152) | 482..2254–6..624 | |||
| GP1.9_XinchengVirCamb (1757 nt) | 65 (32,49,0,+ 2) 394 (246,306,6,+ 1) | 83(2e-13) 523(e-174) | 29..223–173..237 226..1389–236..629 | |||
| gi|752455746|gb|AJG39227.1| RNA-dependent RNA polymerase [Xincheng Mosquito Virus] (2026 aa) | RdRP1.1_XinchengVirCamb (925 nt) | 306 (137,185,5,+ 2) | 240(4e-67) | 23..925–270..574 | ||
RdRP1.2_XinchengVirCamb (904 nt) | 28 (20,23,0,+ 1) 61 (54,56,0,+ 3) 202 (141,161,5,+ 2) | 45(6e-28) 107(6e-28) 288(9e-84) | 127..210–474..501 210..392–503..563 281..871–536..728 | |||
| RdRP1.3_XinchengVirCamb (991 nt) | 241 (105,141,4,+ 3) | 190(2e-49) | 213..923–159..399 | |||
| RdRP1.4_XinchengVirCamb (1065 nt) | 115 (85,100,0,+ 1) 64 (50,56,0,+ 2) 73 (52,64,0,+ 3) 48 (34,43,0,+ 1) 52 (42,48,0,+ 2) | 179(e-133) 103(e-133) 120(e-133) 75(e-133) 88(e-133) | 13..357–835..949 365..556–952..1015 549..767–1012..1084 766..909–1085..1132 908..1063–1166..1217 | |||
| RdRP1.5_XinchengVirCamb (1354 nt) | 431 (204,278,1,+ 1) | 401(e-121) | 64..1353–587..1013 | |||
| RdRP1.6_XinchengVirCamb (2062 nt) | 316 (216,249,0,+ 1) | 447(e-134) | 1114..2061–359..671 | |||
| RdRP1.7_XinchengVirCamb (3974 nt) | 29 (26,27,0,+ 3) 608 (422,501,0,+ 1) 479 (274,351,0,+ 2) 96 (58,76,0,+ 1) 63 (34,43,0,+ 3) | 62(5e-06) 883 (0.0) 568 (0.0) 130 (0.0) 76 (0.0) | 24..110–730..758 100..1923–755..1361 1898..3334–1352..1830 3343..3630–1836..1931 3642..3830–1955..2017 | |||
| Nienokoue virus isolate B51/CI/2004 | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae | gi|655454925|ref.|YP_009041466.1| polyprotein [Nienokoue virus] (3356 aa) | PolProt1.1_FlavivirusCamb (1008 nt) | 265 (81,115,2,+ 2) | 111(6e-23) | 203..991–1322..1578 |
| PolProt1.2_FlavivirusCamb (2193 nt) | 439 (184,264,4,+ 2) 96 (40,58,0,+ 2) 30 (17,23,0,+ 1) | 329(3e-92) 79(6e-19) 45(6e-19) | 32..1336–1675..2108 1385..1672–1529..1623 1738..1827–1638..1667 | |||
| PolProt1.3_FlavivirusCamb (1010 nt) | 137 (67,86,0,+ 1) | 135(1e-30) | 565..975–1855..1990 | |||
| PolProt1.4_FlavivirusCamb (10610 nt) | 3342 (1354,1889,63,-3) | 2383 (0.0) | 514..10350–92..3348 | |||
| Tobacco streak virus isolate pumpkin segment RNA1 | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Bromoviridae; Ilarvirus | gi|254554401|gb|ACT67442.1| replicase [Tobacco streak virus] (1092 aa) | Replicase_TSvirCamb (1565 nt) | 308 (83,136,8,+ 1) | 95(3e-17) | 364..1263–39..309 |
| Oat golden stripe virus RNA1 | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Virgaviridae; Furovirus | gi|9635455|ref.|NP_059511.1| replicase [Oat golden stripe virus] | Replicase_OatGSvirCamb (1661 nt) | 320 (85,143,28,+ 3) | 107(1e-20) | 666..1541–18..327 |
Similarity of Senegal and Cambodia virus assemblies by BLASTX to 24 reference viruses in GenBank. Ten targets are shared, nine are Senegal-specific, and five are Cambodia-specific
| Reference virus | Viral taxonomy | Senegal libraries | Cambodia libraries |
|---|---|---|---|
| Culex tritaeniorhynchus rhabdovirus RNA, complete genome | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | ||
| Phasi Charoen-like virus | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Bunyavirales; unclassified Bunyavirales. | ||
| Uncultured virus isolate acc_1.3 | Viruses; environmental samples. | ||
| Wellfleet Bay virus isolate 10–280-G segment 4 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Quaranjavirus; unclassified Quaranjavirus. | ||
| Wuhan Mosquito Virus 1 strain WT3–15 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | ||
| Wuhan Mosquito Virus 9 strain JX1–13 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | ||
| Xincheng Mosquito Virus strain XC1–6 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | ||
| Xinzhou Mosquito Virus strain XC3–5 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | ||
| Beaumont virus strain 6 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | ||
| Jurona virus | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; Vesiculovirus. | ||
| Omono River virus | Viruses; dsRNA viruses | ||
| American dog tick phlebovirus isolate FI3 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Bunyavirales; Phlebovirus; unclassified Phlebovirus. | ||
| Daeseongdong virus 1 strain A12.2708/ROK/2012 | Viruses; unclassified viruses. | ||
| DsRNA virus environmental sample clone mill.culi_contig84 | Viruses; dsRNA viruses; environmental samples. | ||
| Homalodisca vitripennis reovirus segment S3 | Viruses; dsRNA viruses; Reoviridae; Sedoreovirinae; Phytoreovirus; unclassified Phytoreovirus | ||
| Ixodes scapularis associated virus 2 isolate A1, partial genome | Viruses; unclassified viruses. | ||
| Sunn-hemp mosaic virus | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Virgaviridae; Tobamovirus. | ||
| Uncultured virus isolate acc_7.4 | Viruses; environmental samples. | ||
| Wuhan Spider Virus strain SYZZ-2 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; unclassified ssRNA negative-strand viruses. | ||
| Nienokoue virus isolate B51/CI/2004 | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae | ||
| Oat golden stripe virus RNA1 | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Virgaviridae; Furovirus | ||
| Puerto Almendras virus isolate LO-39 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; unclassified Rhabdoviridae. | ||
| Tobacco streak virus isolate pumpkin segment RNA1 | Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Bromoviridae; Ilarvirus | ||
| Bivens Arm virus isolate UF 10 | Viruses; ssRNA viruses; ssRNA negative-strand viruses; Mononegavirales; Rhabdoviridae; Tibrovirus |
Fig. 2Phylogenetic tree of reference and novel virus assemblies from the Bunyavirales order. Maximum-likelihood phylogeny based on RNA-dependent RNA polymerase (RdRP) predicted peptide sequences of viruses from the Bunyavirales order. Novel viruses characterized in the current study (red name labels) are placed with reference viruses (black name labels) within the Phasmavirus clade and in a basal position of the Phlebovirus-Tenuivirus clade. Node robustness is indicated by bootstrap values (number of replicates supporting the node), indicated by color of the dot at the branch base, see key. Protein lengths and functional status of RdRP peptide sequences from novel viruses in the current study are included to distinguish between complete and partial and/or non-functional pseudogenes (indicated by label “pseudogenized”, functional status also shown in Additional file 2: Table S2 and Additional file 3: Table S3). Average protein size of reference virus RdRP genes is 2496 amino acids
Fig. 3Phylogenetic tree of reference and novel virus assemblies from the Mononegavirales order. a Maximum-likelihood phylogeny based on RNA-dependent RNA polymerase (RdRP) predicted peptide sequences of viruses from Mononegavirales order. Novel virus assemblies characterized from Cambodia and Senegal Anopheles samples (red name labels) are placed with reference viruses (black name labels), predominantly within the Dimarhabdovirus clade and as close relative of the Nyamivirus clade. Node robustness is indicated by bootstrap values (number of replicates supporting the node), indicated by color of the dot at the branch base, see key. Protein lengths and functional status of RdRP peptide sequences from novel viruses in the current study are included to distinguish between complete and partial and/or non-functional pseudogenes (indicated by label “pseudo”, functional status indicated in Additional file 2: Table S2 and Additional file 3: Table S3). Average protein size of reference virus RdRP genes is 2098 amino acids. b Genome comparison of novel and reference Xincheng Mosquito Viruses, which are too diverged to align at the nucleic acid sequence level. Grey blocks represent peptide sequence homology regions between compared sequences. The nucleotide sequences of the entire viral contigs, and not only of the RdRP gene as in (a), were translated and used to search the translated nucleotide database with TBLASTX. The viruses display recognizable relatedness over their genomes, despite geographic distance and nucleotide sequence divergence. Color intensity indicates identity levels from TBLASTX results (values indicated in key)
Fig. 4Virus abundance profiles across mosquito sample pools based on long and small RNA sequence mapping. Heatmap of log2-transformed reads per kilobase per million reads (RPKM) abundance values of novel virus assemblies identified from Cambodia and Senegal sample pools based on long and small RNA sequence libraries. Broadly similar viral abundance profiles are detected in sample pools by the long and small RNA sequence data. Representation of particular viruses is uneven among mosquito sample pools, suggesting inter-individual mosquito differences for virus carriage. X-axis, Anopheles sample pools from Cambodia, Cam, and Senegal, Dak; y-axis, names of 115 assembled virus contigs displaying sequence similarity to known virus families (Additional file 2: Table S2 and Additional file 3: Table S3)
Fig. 5Small RNA size profiles of novel virus assemblies from Cambodia and Senegal sample pools. Hierarchical clustering of 88 novel virus assemblies based on Pearson correlation of small RNA size profiles. The 88 viruses were the members of the 115 novel virus set meeting the threshold of at least 100 small RNA reads mapped to the viral contig, to assure reliable small RNA size profiling. Small RNA reads that mapped to each of the 88 virus assemblies were extracted, and their size distributions were normalized with a z-score transformation. Heatmaps indicate the frequency of small RNA reads of size 15 to 35 nucleotides that map over the positive strand (left panel) and negative strand (right panel) of the reference sequence indicated on the y-axis. The x-axis indicates the size in nucleotides of the small RNAs mapped. Four main clusters were defined (indicated by numbers on the left of each panel) based on these small RNA size profiles. The profile in Cluster 3 is enriched for 21 nucleotide reads mapping over both positive and negative strands, characteristic of the classical small interacting RNA (siRNA) product size profile
Fig. 6O’nyong nyong arbovirus infection influences expression of candidate piRNA genes in Anopheles coluzzii. Anopheles coluzzii mosquitoes were challenged with O’nyong nyong virus (ONNV) by feeding an infectious bloodmeal or an uninfected control bloodmeal, and small RNAs expressed during the primary infection at 3 d post-bloodmeal were sequenced. Analysis using Cuffdiff highlighted two candidate piRNA genes that displayed decreased abundance of mapped small RNAs in ONNV infected samples (see Results, piRNA loci XLOC_012931 and XLOC_012762). Here, the small RNA sequence reads mapping to the two candidate piRNA loci were quantified using the Integrative Genomics Viewer normalized to the library size, and the difference between ONNV infected and uninfected samples tested statistically. X-axis indicates candidate piRNA locus, y-axis indicates percentage of normalized small RNA reads mapping to the piRNA gene. ONNV-infected mosquitoes, red bar; uninfected control mosquitoes, black bar. Experiments were done in two biological replicates, error bars indicate standard deviation. Locus XLOC_012931, Chi-squared = 77.36, df = 1, p-value< 2.2e-16 (ONNV-infected mean mapped reads = 36 ± 141,421,356, mean total reads = 19,193,551 ± 8,555,908.61, ONNV-uninfected mean mapped reads = 160 ± 14,1,421,356, mean total reads = 19,167,336 ± 3,962,902.88052); and locus XLOC_012762, Chi-squared = 75.78, df = 1, p-value< 2.2e-16 (ONNV-infected mean mapped reads = 51 ± 19,09, mean total reads = 19,193,551 ± 8,555,908.61, ONNV-uninfected, mean mapped reads = 184 ± 848,528,137, mean total reads = 19,167,336 ± 3,962,902.88)