| Literature DB >> 29549363 |
Martha Zakrzewski1, Gordana Rašić2, Jonathan Darbro2,3, Lutz Krause1,4, Yee S Poo5, Igor Filipović2, Rhys Parry6, Sassan Asgari6, Greg Devine2, Andreas Suhrbier7.
Abstract
Medically important arboviruses such as dengue, Zika, and chikungunya viruses are primarily transmitted by the globally distributed mosquito Aedes aegypti. Increasing evidence suggests that transmission can be influenced by mosquito viromes. Herein RNA-Seq was used to characterize RNA metaviromes of wild-caught Ae. aegypti from Bangkok (Thailand) and from Cairns (Australia). The two mosquito populations showed a high degree of similarity in their viromes. BLAST searches of assembled contigs suggest up to 27 insect-specific viruses may infect Ae. aegypti, with up to 23 of these currently uncharacterized and up to 16 infecting mosquitoes from both Cairns and Bangkok. Three characterized viruses dominated, Phasi Charoen-like virus, Humaita-Tubiacanga virus and Cell fusing agent virus, and comparisons with other available RNA-Seq datasets suggested infection levels with these viruses may vary in laboratory-reared mosquitoes. As expected, mosquitoes from Bangkok showed higher mitochondrial diversity and carried alleles associated with knock-down resistance to pyrethroids. Blood meal reads primarily mapped to human genes, with a small number also showing homology with rat/mouse and dog genes. These results highlight the wide spectrum of data that can be obtained from such RNA-Seq analyses, and suggests differing viromes may need to be considered in arbovirus vector competence studies.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29549363 PMCID: PMC5856816 DOI: 10.1038/s41598-018-22945-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Bioinformatics work-flow. The bioinformatics work-flow is shown with read numbers and where the final data is presented (blue). Raw sequence reads were quality filtered using Trimmomatic and paired end reads were merged using the software PEAR. Ae. aegypti reads were identified using STAR and BWA mem and used for variant analyses. Non-mosquito reads were subsequently aligned to a custom ribosomal database using BLAST. Reads not matching a ribosomal reference sequence were taxonomically assigned using DIAMOND and the lowest common ancestor (LCA) approach. Non-chordate sequences were analyzed using BLAST to identify blood meal species. Contigs assembled with Trinity from non-mosquito and non-ribosomal reads were taxonomically assigned using the best BLAST hit.
Figure 2Overview of main ISVs, bacteria and fungi. Read based taxonomic classification using DIAMOND (NCBI protein database, 18/03/16) and the Lowest Common Ancestor (LCA) approach, with results visualized by Krona. (a) Viruses. Percentages of all viral reads (Cairns 959,502, Bangkok 2,074,841) aligning to the indicated virus species. For Cairns 697,301 reads, and for Bangkok 1,711,929 reads, aligned to Phasi Charoen-like virus. (b) Bacteria. Percentages of all bacterial reads aligning to the indicated genera or family. For Cairns, 31,587 reads aligned to the genus Wolbachia. (c) Fungi. Percentages of all fungal reads aligning with the indicated genera, family or species; 107,802 reads aligned to the phylum Ascomycota for Cairns and 47,038 reads aligned to Edhazardia aedis for Bangkok.
Figure 3Reads mapping to CFAV, PCLV and HTV. RNA-Seq reads (generated without polyA enrichment) from (i) wild-caught Bangkok and Cairns Ae. aegypti and (ii) Ae. aegypti originally from Cairns that had been reared in the laboratory for ≈2 years, were mapped (using Bowtie allowing a 5% mismatch) to the genomes of CFAV, PCLV and HTV (see Methods for accession numbers). The y-axis (1 to 100,000) represents the number of reads that cover each position in the genome. The x-axis is the genome position, with the entire genome for each virus represented. The percentages represent viral reads as a percent of Ae. aegypti reads.
Reads aligning to PCLV, HTV and CFAV in publicly available RNA-Seq data sets.
| Mosquito population | In lab. | Total reads, millions | Phasi Charoen-like virus | Humaita-Tubiacanga virus | Cell fusing agent virus | |||
|---|---|---|---|---|---|---|---|---|
| Reads mapped | % of total | Reads mapped | % of total | Reads mapped | % of total | |||
| PhetR | 2 gens | 229 m | 1,869 | 0.0008 | 1,194 | 0.0005 | 534 | 0.0002 |
| NakhRc | 2 gens | 254 m | 889 | 0.0003 | 674 | 0.0003 | ND | 0 |
| CaynR | 2 gens | 241 m | 1,653 | 0.0007 | 169 | 0.0001 | ND | 0 |
| StGeR | 2 gens | 238 m | ND | 0 | ND | 0 | ND | 0 |
| LivpS | 1930s | 284 m | ND | 0 | ND | 0 | 393 | 0.0001 |
| DeltaR | 1990s | 197 m | ND | 0 | ND | 0 | 389 | 0.0002 |
| NwOrS | 1980s | 167 m | ND | 0 | ND | 0 | ND | 0 |
| Aag2 cell line | 22 m | 196 | 0.0009 | ND | 0 | 3,563 | 0.0159 | |
RNA-Seq data (generated with a polyA enrichment step) from pools of ≈90 mosquitoes per pool and Aag2 cell line[19] were obtained from NCBI, accession numbers (top to bottom) ERX1788144-6, ERX1788141-3, ERX1788135-7, ERX1788138-40, ERX1788126-8, ERX1788132-4, ERX1788129-31, Aag2 - SRX1367297, SRX1366586. Using BWA mem these were aligned to CFAV, PCLV and HTV genomes. A small number of reads mapped to Tongilchon virus 1[70]; 6 for LivpS and 7 for NwOrS. Colony identifiers: PhetR - Thailand (Phetchaburi); NakhRc - Thailand (Nakonsawan); CaynR - French Guiana (Cayenne); StGeR - French Guiana (St-Georges); LivpS - Liverpool strain (Benin); DeltaR - French Polynesia (Bora-Bora); NwOrS - USA (New Orleans). ND – not detected, no reads mapping. In lab: 2 gens - mosquitoes reared for 2 generations in the laboratory; 1930 s - mosquitoes introduced to the lab in the 1930 s.
Summary of contig-based virus identification.
| Classification | Virus or | Host | Genes | Range of % acid identity amino | |
|---|---|---|---|---|---|
| Known ISVs of | CAIRNS | BANGKOK | |||
| Genus; | Phasi Charoen-like virus |
| Nucleocapsid, Glycoprotein, RdRp | 97–98 | 54–100 |
| Genus; | Cell fusing agent virus |
| Polyprotein | 99 | 98 |
| Unclassified ssRNA(+) | Humaita-Tubiacanga virus |
| Capsid Replicase | 97–98 | 97–98 |
| Unclassified dsRNA (possibly a Totivirus) | Unnamed | PArp-RdRp, partial (Contig 5425) | 95–98 | 91–99 | |
|
|
|
| Non-structural protein 1 & capsid | NI | 98 |
|
| |||||
| Genus; |
|
| RdRp Polyprotein | 60 | 59–67 |
| Genus: |
|
| PB1, PA, PB2 | 76–85 | 76–85 |
| Unclassified. Order; |
|
| Glycoprotein | 37–59 | 34–39 |
| Family; | Glycoprotein precursor | 34–44 | 42 | ||
|
| Glycoprotein precursor | NI | 57 | ||
| Unclassified; | Glycoprotein ORF1 | 28–37 | 30–37 | ||
| Genus; |
|
| RdRp Putative capsid | 27–45 | 28–45 |
| Unclassified |
|
| Putative glycoprotein | 75 | 73 |
| Unclassified |
| Mosquito | Hypothetical protein 1 | 39–46 | Reads map |
|
|
|
| Proline-alanine-rich protein | 37 | NI |
|
|
|
| RdRp AGW51764.1 | 87 | NI |
| Unclassified virus |
|
| Putative glycoprotein | Reads map | 73 |
|
| Mosquito | Hypothetical protein | NI | 68 | |
|
| Mosquito | Hypothetical protein | NI | 84 | |
| Family; |
|
| Putative nucleoprotein | Reads map | 48 |
|
| |||||
| Unclassified |
| Putative polyprotein | 44–48 | 45–47 | |
|
|
| Hypothetical proteins | 51 | NI | |
|
|
| Psyllid | RdRp | 34 | NI |
|
|
|
| RdRp | 32–53 | NI |
| Family |
|
| ORF B, AcOrf-4 & AcOrf-5 peptides | 69 | 56–100 |
|
|
| Pachypsylla psyllid | Orf1 | NI | 38–59 |
|
|
| Hypothetical protein | NI | 31 | |
|
| |||||
|
|
| RdRp | NI | 44 | |
|
| |||||
|
|
|
| RdRp | 45 | NI |
| Genus: |
| Carrot | ORF2 | 36 | NI |
|
|
| Nucleocapsid Polymerase | 35–49 | NI | |
| Genus |
| Cucumber | Unnamed protein | NI | 28 |
Contigs were assembled from reads obtained from sequencing Cairns and Bangkok Ae. aegypti and were used for virus identification using the NCBI protein database. The range of percentage amino acid identities for one or more contigs is shown. Underlining (first column) highlights viruses or nearest relatives (italics), which appear to be unique to Cairns or Bangkok mosquitoes. Grey shading (NI - Not Identified) indicates where no contigs or reads matching the contig for the indicated virus were identified. “Reads map” indicates where no contigs were assembled, but reads from Cairns did map to the contig assembled from the Bangkok data or vice verse. The full data set, including contig information and references, is shown in Tables S2a and S2b.
Figure 4Mitochondrial sequence diversity in Ae. aegypti from Cairns and Bangkok. For Cairns 611,909 and for Bangkok 4,021,144 reads aligned with the Ae. aegypti mitochondrial genome AaegL3. Only nucleotide sites with at least 100 reads in both Cairns and Bangkok data sets were considered. (a) Box plot of the percentage of polymorphic sites (where the minor allele frequency > 0.1) for per site with a read coverage of > 100, using seven mitochondrial genes (ND2, COX1, COX2, COX3, ND4, CYTB, rRNA). (b) The minor allele frequency for nucleotide positions in the mitochondrial genome with a read depth > 100 reads. Only sites with a minor allele frequency of > 0.1 in Cairns and/or Bangkok data sets are shown.
Voltage-gated sodium channel gene SNPs in the Ae. aegypti genomes.
| Exon | bp position in exon | Codon change | Amino acid change | Cairns | Read depth | Bangkok | Read depth |
|---|---|---|---|---|---|---|---|
| 15 | 115,365 | Tcc/Ccc | Y | T 100% | 87 | T 77% C 23% | 26 |
| 24 | 155,477 | aCc/aTc | Y | C 100% | 26 | C 71% T 29% | 14 |
| 24 | 155,519 | tTc/tGc | Y | T 100% | 55 | T 36% C 64% | 22 |
| 15 | 115,283 | aaG/aaA | N | G 44% A 56% | 41 | G 100% | 21 |
| 15 | 115,313 | ttC/ttT | N | C 59% T 41% | 61 | C 100% | 27 |
| 25 | 155,890 | ttT/ttC | N | T 65% C 35% | 23 | T 100% | 61 |
Amino acid change Y; non-synonymous SNPs associated with pyrethroid resistance (knockdown resistance, kdr) (VectorBase: AAEL006019-RD, supercont1.186: 18,685–170,948). Amino acid change N; synonymous SNPs found in Cairns Ae. aegypti.