| Literature DB >> 31996404 |
Eric Roberto Guimarães Rocha Aguiar1,2, João Paulo Pereira de Almeida1, Lucio Rezende Queiroz1, Liliane Santana Oliveira3, Roenick Proveti Olmo1,4, Isaque João da Silva de Faria1, Jean-Luc Imler4, Arthur Gruber3, Benjamin J Matthews5, João Trindade Marques1,4.
Abstract
Endogenous viral elements (EVEs) are found in many eukaryotic genomes. Despite considerable knowledge about genomic elements such as transposons (TEs) and retroviruses, we still lack information about nonretroviral EVEs. Aedes aegypti mosquitoes have a highly repetitive genome that is covered with EVEs. Here, we identified 129 nonretroviral EVEs in the AaegL5 version of the A. aegypti genome. These EVEs were significantly associated with TEs and preferentially located in repeat-rich clusters within intergenic regions. Genome-wide transcriptome analysis showed that most EVEs generated transcripts although only around 1.4% were sense RNAs. The majority of EVE transcription was antisense and correlated with the generation of EVE-derived small RNAs. A single genomic cluster of EVEs located in a 143 kb repetitive region in chromosome 2 contributed with 42% of antisense transcription and 45% of small RNAs derived from viral elements. This region was enriched for TE-EVE hybrids organized in the same coding strand. These generated a single long antisense transcript that correlated with the generation of phased primary PIWI-interacting RNAs (piRNAs). The putative promoter of this region had a conserved binding site for the transcription factor Cubitus interruptus, a key regulator of the flamenco locus in Drosophila melanogaster Here, we have identified a single unidirectional piRNA cluster in the A. aegypti genome that is the major source of EVE transcription fueling the generation of antisense small RNAs in mosquitoes. We propose that this region is a flamenco-like locus in A. aegypti due to its relatedness to the major unidirectional piRNA cluster in Drosophila melanogaster.Entities:
Keywords: A. aegypti; EVE; RNA interference; endogenous viral elements; flamenco locus; piRNAs
Mesh:
Substances:
Year: 2020 PMID: 31996404 PMCID: PMC7161354 DOI: 10.1261/rna.073965.119
Source DB: PubMed Journal: RNA ISSN: 1355-8382 Impact factor: 4.942
FIGURE 1.Characterization of nonretroviral EVEs in A. aegypti mosquitoes. (A) Scatter plot showing the correlation between the abundance of viral families assigned to EVEs and exogenous viruses circulating in A. aegypti mosquitoes. r (Pearson correlation) and P-values are indicated. Viral families represented among EVEs are highlighted with different colors. Only the most abundant viral families exclusively found among circulating viruses are also named. (B) Genomic origin of EVEs in A. aegypti. (C) Distance of EVEs to the closest feature annotated in the same or opposite strands of the A. aegypti genome. Significance of colocalization was computed using Fisher's exact test for genomic data. A bin of 500 nt was utilized. EVEs annotated as genes were not considered for this analysis. (D) Percentage of EVEs association with TEs in the A. aegypti genome. EVEs found within 500 nt of TEs were considered associated. (E) Number of EVEs per chromosome of the AaegL5 version of the A. aegypti genome. EVEs were grouped by bins of 20 kb separated by strand. The AaegL5 version of the A. aegypti genome was used for EVE identification.
FIGURE 2.EVEs are organized in genomic clusters that generate antisense RNAs. (A) Production of EVE-derived transcripts for each cluster or single EVE in the A. aegypti genome. (B) Scatter plot showing the relation between EVE transcription and the density of viral elements for each cluster or single EVE in the A. aegypti genome. (C) Contribution of each cluster or single EVE to total transcription and abundance of EVEs on reference genome. (D) Abundance of EVE-derived small RNAs for each cluster or single EVE in the A. aegypti genome. (E) Scatter plot showing the relation between EVE-derived small RNAs and the density of viral elements for each cluster or single EVE. (F) Contribution of each cluster or single EVE to total small RNA production and abundance of EVEs on reference genome. Fifty-nine EVE regions are defined, in which regions with more than one element are referred to as clusters. Each EVE region is numbered according to its location on AaegL5 version of the A. aegypti chromosomes, as indicated in Figure 1E. Fisher's exact test was applied. P-values are indicated for each comparison. RNA libraries from whole mosquitoes were used in this analysis.
FIGURE 3.Comparative analysis of the largest EVE cluster in AaegL3 and AaegL5 versions of the A. aegypti genome. (A) Dotplot showing pairwise comparison of the largest EVE cluster in AaegL3 and AaegL5 versions of the mosquito genome. Shaded areas show a divergent region between the two genome versions that is further analyzed in C. (B) Graph showing coverage of the region spanning the largest EVE cluster identified in AaegL3 and AaegL5. (C) Zoom of a divergent region within the largest EVE cluster (shaded area in A) in the AaegL5 version of the mosquito genome showing the coverage of PacBio long reads. The structure of the region shows a repeated 5-kb block composed of a putative LTR retrotransposon containing capsid and a reverse transcriptase/integrase genes. (D) Single long reads containing the full repeated block give support for the assembly in the AaegL5 version of the A. aegypti genome.
FIGURE 4.A TE and EVE rich region in the A. aegypti genome is related to the flamenco locus. (A) Zoomed-in view of the shaded area in chromosome 2 corresponding to cluster 38 indicating EVE, gene and TE content per strand. Middle graphs show the density of transcripts and small RNAs considering single and multiple mapped reads. Bottom graph shows gap occupancy of small RNA coverage over the entire region. The dashed line represents a significance cutoff of P = 0.05 that the gap occurred by chance. Significant gaps are shown. (B) Correlation between production of long and small RNAs for each element within cluster 38. Each dot represents one element with the type (EVEs or TEs) and length indicated by colors and size of the symbol, respectively. (C,D) Size distribution (C) and base preferences (D) for EVEs and TEs within EVE cluster 38. (E) Relative frequency of distances between 3′ and 5′ ends of piRNAs derived from the same genomic strand within EVE cluster 38 in mosquitoes. (F) Nucleotide composition of the genomic neighborhood of small RNA 3′ ends. (G) Distance between 5′ ends of small RNAs in different strands for EVEs and TEs within EVE cluster 38. (H) Genomic context of promoter region of the flamenco locus and EVE cluster 38 in A. aegypti. Small RNAs between 24 and 29 nt were used for the analyses in B–F. RNA libraries from whole mosquitoes and flies were used in this analysis.
FIGURE 5.EVE-derived piRNAs from the flamenco-like locus associate with mosquito PIWI proteins. (A) Size profile of small RNA derived from miRNAs or the flamenco-like locus in control or oxidation-treated libraries. The nt at the 5′ end is indicated by color. (B) Association of piRNAs derived from the top five uni- and dual-strand clusters (highlighted in Supplemental Table S4) compared to microRNAs with different PIWI proteins based on immunoprecipitations from mosquito Aag2 cells. The flamenco-like locus belongs to the group of uni-strand piRNA clusters and is highlighted in red for comparison. Statistics were performed using Wilcoxon rank-sum test. Significant P-values (P < 0.01) are indicated. Small RNA libraries from mosquito Aag2 cells were used in these analyses.