| Literature DB >> 28676109 |
Umberto Palatini1, Pascal Miesen2, Rebeca Carballar-Lejarazu1, Lino Ometto3, Ettore Rizzo4, Zhijian Tu5, Ronald P van Rij2, Mariangela Bonizzoni6.
Abstract
BACKGROUND: Arthropod-borne viruses (arboviruses) transmitted by mosquito vectors cause many important emerging or resurging infectious diseases in humans including dengue, chikungunya and Zika. Understanding the co-evolutionary processes among viruses and vectors is essential for the development of novel transmission-blocking strategies. Episomal viral DNA fragments are produced from arboviral RNA upon infection of mosquito cells and adults. Additionally, sequences from insect-specific viruses and arboviruses have been found integrated into mosquito genomes.Entities:
Keywords: Arboviruses; Endogenous viral elements; Mosquitoes; Non-retroviral integrated RNA viruses; piRNA
Mesh:
Substances:
Year: 2017 PMID: 28676109 PMCID: PMC5497376 DOI: 10.1186/s12864-017-3903-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Pipeline for NIRVS identification. The currently available 22 mosquito genomes and the genome of Drosophila melanogaster were probed bioinformatically using tblastx and 425 viral species (424 non-retroviral RNA viruses and 1 DNA arbovirus). Tested insect and viral RNA genomes are shown in the context of their phylogeny [2, 38]. Identified blast hits were parsed based on gene ontology and the presence of partial or complete viral ORFs. In Ae. albopictus, bioinformatic analyses was extended to whole-genome sequencing data from 16 individual mosquitoes of the Foshan strain. This stringent pipeline led to the characterization of 242 loci with NIRVS. Viral families for which NIRVS were characterized are shown in red
Number of viral integrations (NIRVS) detected for each of the viral families tested across the 22 mosquito genomes. A total of 424 non-retroviral RNA viruses with complete genomes were analyzed. The genome of African swine fever virus, the only known DNA arbovirus was also included in the analyses, but no NIRVS were found for this virus. Mosquito species and viral families for which NIRVS were detected are in bold
| Families of tested non-retroviral RNA viruses (N. species) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mosquito species | Togaa (24) | Flavia (92) | Bunyaa (59) | Reoa (70) | Orthomyxoa (4) | Rhabdoa (93) | Borna (6) | Filo (8) | Nyami (4) | Paramyxo (64) |
|
| 0 |
|
|
| 0 |
| 0 | 0 | 0 | 0 |
|
| 0 |
| 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 |
| 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Anopheles merus | 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 |
| 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 |
| 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
| Anopheles farauti | 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 |
| 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
aVirus families that contain arboviruses
Fig. 2Different abundance of NIRVS across virus genera, genes and host species. Schematic representation of the genome structures of Rhabdoviridae (a) and the genera Flavivirus (family Flaviviridae) (b), Orbivirus (family Reoviridae) (c) and Hantavirus (Family Bunyaviridae) (d). Numbers within each box represent the number of NIRVS loci spanning the corresponding viral gene per mosquito species. When a NIRVS locus encompassed more than one viral gene, the viral gene with the longest support was considered. Mosquitoes of the Culicinae and Anophelinae subfamilies are in black and blue, respectively. Dotted lines indicate viral integrations were not contiguous in the host genomes
Fig. 3Variability of NIRVS within the Ae. albopictus Foshan strain. Bioinformatic analyses of the Ae. albopictus genome identified 4 NIRVS on scaffold JXUM01S011498: AlbFlavi32, AlbFlavi33, AlbFlavi34 and AlbFlavi36. No read coverage was seen for AlbFlavi32 and AlbFlavi33 in any of the 16 sequenced genomes. AlbFlavi36 had read coverage in 13 of the 16 tested mosquitoes, whereas AlbFlavi34 showed length variability
Fig. 4Phylogenetic analyses of Reoviridae, Bunyaviridae, Flavivirus, and Rhabdoviridae-like integrations. Phylogenetic relationships of NIRVS with similarity to the Reoviridae VP5 (a), Reoviridae VP1 (b), Bunyaviridae G (c), Flavivirus NS3 (d), and Rhabdoviridae N (e) genes. The evolutionary history was inferred using the Maximum Likelihood method. The trees with the highest log likelihood are shown. Support for tree nodes was established after 1000 bootstraps
Fig. 5Enrichment of NIRVS in two regions of the Ae. aegypti genome. One fourth of the identified NIRVS in Ae. aegypti map to two genomic regions. a Region 1 (supercont1.286:1,316,885-1,429,979) includes piRNA cluster 3 [23] and is enriched for the LTR transposons LTR/Pao_Bel and LTR/Ty3_gypsy, which occupy 16.33 and 14.98% of the region, respectively. b Region 2 (supercont1.1:1,160,748-1,472,976) includes piRNA clusters 2 and 30 and is also enriched for LTR transposons. LTR/Ty3_gypsy occupancy in region 2 is 24.18%. NIRVS are color-coded based on their sequence identity (Additional file 5: Table S5)
Clustering of viral integrations (NIRVS) in piRNA loci of the Ae. aegypti and Ae. albopictus genomes
| Host | Genomic region | Length (bp) | % genome | k integrationsa |
|
|---|---|---|---|---|---|
|
| piRNA cluster | 17,000,000 | 1.24 | 54 | < 10-10 |
| Coding genes | 286,538,182 | 20.82 | 24 | 0.66 | |
| Intergenic regions | 1,072,461,818 | 77.94 | 44 | 1 | |
|
| piRNA cluster | 1,926,670 | 0.61 | 9 | < 10-10 |
| Coding genes | 163,407,667 | 8.26 | 14 | 2.08 10-3 | |
| Intergenic regions | 1,803,592,333 | 91.14 | 49 | 1 |
The probability (P) of observing k NIRVS loci in piRNA clusters, coding genes and intergenic regions. P was estimated using cumulative binomial distribution; a value of P < 0.05 indicates a statistically significant enrichment of NIRVS in the corresponding genomic region
aSix integrations in the Ae. aegypti genome were in exons of genes within piRNA clusters; in these analyses they were attributed to piRNA clusters. Statistical significance did not change when these integrations were assigned to coding genes (P changed from 0.66 to 0.180)
Fig. 6NIRVS produce 25–30 nt piRNAs, but not 21-nt siRNAs. Size distribution of small RNAs from published resources mapping to NIRVS in the Ae. aegypti (a) and Ae. albopictus (b) genomes. Black bars represent RNAs that map to the sense strand, gray bars show RNAs that map to the antisense strand. NIRVS-derived piRNAs are biased for sequences that are antisense to viral mRNAs, suggesting potential to target viral RNA. c-d Nucleotide bias at each position of small RNAs mapping to the sense (+) strand (upper panel) and antisense (-) strand (lower panel). NIRVS-derived piRNAs are biased for uridine at position 1 in both Ae. aegypti (c) and Ae. albopictus (d). e Number of all NIRVS-derived piRNAs and secondary NIRVS-derived piRNAs expressed in Ae. aegypti (left charts) and Ae. albopictus (right charts). Ring charts were scaled to reflect normalized piRNA counts of F-NIRVS (red), R-NIRVS (blue), and NIRVS from Reovirus (yellow) (Reovirus-NIRVS have been found only in Ae. aegypti). Numbers reflect piRNAs counts normalized to the corresponding library size. f Left panel, heat map of the relative abundance of NIRVS-derived small RNAs in Aag2 cells in which PIWI expression was silencing using RNAi (dsPiwi4-6, and dsAgo3), compared to control dsRNA treatment. Right panel, heatmap of small RNA enrichment in immunoprecipitations (IP) of the indicated PIWI proteins over control GFP IP. V5 epitope-tagged PIWI transgenes were used for IPs (V5-IP)
NIRVS and transposable elements (TEs)
| TE occupancy (%) | ||||
|---|---|---|---|---|
| TE groupa | AaegL3b | NIRVSc | Region 1d | Region 2e |
| LTR retrotransposons | 12.34 | 23.06 (25.88) | 31.35 | 30.56 |
| LTR/Pao_Bel | 4.42 | 6.9 (6.49) | 16.33 | 4.15 |
| LTR/Ty1_copia | 5.34 | 1.46 (1.90) | 0.04 | 2.22 |
| LTR/Ty3_gypsy | 2.58 | 14.7 (17.50) | 14.98 | 24.18 |
| non-LTR retrotransposons | 12.81 | 3.91 (5.54) | 0 | 3.43 |
| SINEs | 1.14 | 0.16 (0.22) | 0 | 0 |
| DNA transposons | 6.96 | 3.29 (3.49) | 3.38 | 2.72 |
| MITEs | 12.81 | 8.03 (7.34) | 0.26 | 2.34 |
| Helitrons | 1.2 | 2.01 (2.32) | 0 | 2.04 |
| Penelope | 0.42 | 0.2 (0.28) | 0.26 | 0.8 |
TE occupancy in the entire Ae. aegypti genome (AaegL3), in regions harboring NIRVS (NIRVS), in region1 and in region 2
aFor consistency with previous publications and for unambiguous classification, only TEs annotated in TEfam were used. TE occupancy was defined as the number of bases in the genomic sequence that match TEs
bThe genome assembly described in [71] is slightly different from AaegL3 (Aedes-aegypti-Liverpool_SCAFFOLDS_AaegL3.fa), which is used in this analysis. For better comparison with viral integration sites, a new RepeatMasker analysis was performed using the AaegL3 assembly under default parameters
cThe NIRVS sites plus 5 kb or 10 kb (in brackets) of sequence flanking each side were retrieved for the analysis. These results may under-estimate the actual TE occupancy because NIRVS sequences are included in the analyses
dSupercontig 1.286 between positions 1,316,885 bp and 1,429,979 bp (plus 5 kb flanking each side)
eSupercontig 1.1 between positions 1,160,748 bp and 1,472,976 bp (plus 5 kb flanking each side)