Literature DB >> 30861037

Genome-wide profiling of piRNAs in the whitefly Bemisia tabaci reveals cluster distribution and association with begomovirus transmission.

Md Shamimuzzaman1, Daniel K Hasegawa1, Wenbo Chen2, Alvin M Simmons1, Zhangjun Fei2,3, Kai-Shu Ling1.   

Abstract

The whitefly Bemisia tabaci MEAM1 is a notorious vector capable of transmitting many plant viruses, resulting in serious crop loss and food shortage around the world. To investigate potential sRNA-mediated regulatory mechanisms in whiteflies that are affected by virus acquisition and transmission, we conducted small RNA (sRNA) deep sequencing and performed genome-wide profiling of piwi-interacting RNAs (piRNAs) in whiteflies that were fed on tomato yellow leaf curl virus (TYLCV)-infected or non-infected tomato plants for 24, 48, and 72 h. In the present study, piRNA reads ranging from 564,395 to 1,715,652 per library were identified and shown to distribute unevenly in clusters (57 to 96 per library) on the whitefly (B. tabaci MEAM1) genome. Among them, 53 piRNA clusters were common for all treatments. Comparative analysis between libraries generated from viruliferous and non-viruliferous whiteflies identified five TYLCV-induced and 24 TYLCV-suppressed piRNA clusters. Approximately 62% of piRNAs were derived from non-coding sequences including intergenic regions, introns, and untranslated regions (UTRs). The remaining 38% were derived from coding sequences (CDS) or repeat elements. Interestingly, six protein coding genes were targeted by the TYLCV-induced piRNAs. We identified a large number of piRNAs that were distributed in clusters across the whitefly genome, with 60% being derived from non-coding regions. Comparative analysis revealed that feeding on a virus-infected host caused induction and suppression of only a small number of piRNA clusters in whiteflies. Although piRNAs primarily regulate the activity of transposable elements, our results suggest that they may have additional functions in regulating protein coding genes and in insect-virus interactions.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30861037      PMCID: PMC6413925          DOI: 10.1371/journal.pone.0213149

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Small RNAs (sRNAs) are 19–31 nucleotide (nt) non-coding regulatory elements commonly found in plants, animals, including insects [1-4]. They play important roles in regulating gene expression, transposable elements (TE), or parasite immunity [2,5-7]. Major classes of sRNAs include small interfering RNAs (siRNAs) and microRNAs (miRNAs), which are primarily involved in silencing gene expression [1,2]. Another class of sRNAs known as PIWI-interacting RNAs (piRNAs) has been identified in vertebrate and invertebrate animals [4,5,8,9]. These piRNAs ranging from 26 to 31 nt are considered to regulate the activity of transposable elements as well as to silence the expression of protein coding genes [9-11]. The biogenesis of piRNAs has been studied in fruit fly (Drosophila melanogaster), honey bee (Apis melifera), mosquito (Anopheles gambiae), and other insects [4,11-13]. Three core proteins known as P-element induced wimpy testis (Piwi), Aubergine (Aub) and Argonaute-3 (Ago3) are required for piRNA biogenesis [7,8,12]. piRNAs are generated from distinct chromosomal regions referred to as piRNA clusters [7,8] and can vary in size from 1 kb to 250 kb and may contain as few as 10 to many thousands of piRNAs [13-15]. Two mechanisms of piRNA biogenesis has been described in Drosophila and other insects [7,13,14]. In first mechanism, long single-stranded piRNA precursors in the sense or antisense orientation are transcribed from piRNA clusters, and then processed into primary piRNAs by the cytoplasmic endonuclease, Zucchini (Zuc) [8,16,17]. The primary piRNAs, in association with Piwi proteins, will then form a RNA-induced silencing complex (RISC), which leads to piRNA-directed cleavage of mRNAs [8,14]. In the second mechanism, secondary piRNAs are generated through an amplification loop cycle, referring to as the ping-pong cycle [13,14]. Here, the target transcripts are cleaved by Aub-piRISCs, resulting in 5′- cleaved products being released and degraded in the process, whereas 3′- cleaved products are loaded onto Ago3 and subsequently processed into secondary piRNAs [13,18]. The hallmarks of the ping-pong mediated amplification of piRNAs are evidenced with a strong uridine (U) bias at the 5’ end for Aub-associated piRNAs and an adenosine (A) at position 10 for Ago3-associted piRNAs [7,8,13]. A 10 base-pair overlap is often seen between the complementary primary and secondary piRNAs. The functions of piRNAs are highly conserved among a number of animal and insect species [5,8,9,12,13], which are primarily involved in silencing the activity of transposable elements (TEs) in germline cells but are also active in other cell types [4,7,8,19,20]. TEs represent an important part of repetitive elements and can drive intra-genomic variation and genome diversification. The movement of TEs from one part of a genome to another occurs via an RNA or DNA intermediate, and thus, are classified as retrotransposons (class I) and DNA transposons (class II), respectively [21,22]. The Class I TEs are composed of long terminal repeat (LTR) and non-LTR (NLTR) retrotransposons. Families of LTR retrotransposons, including gypsy, copia and pao, are abundant in diverse insect species [7,11,23]. Similarly, NLTR, including long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs), are also well studied transposable elements in insects [8,13]. On the other hand, the Class II DNA transposons, including maverick, mariner, hAT, harbinger, and P elements, are also abundant in insects [7,13,24]. Thus, piRNAs can regulate the activity of both retrotransposons and DNA transposons. Furthermore, some piRNAs are non-repetitive and non-transposon-related, which can regulate expression of protein-coding genes [8,13]. In addition to the functions of piRNAs in controlling TE activities and silencing the expression of protein-coding genes, the piRNA pathway has also been shown to be involved in antiviral response in several insects (D. melanogaster, A. gambiae and A. aegypti [6,7,25,26]. Altogether, piRNAs play diverse roles in the pathways associated with insect development, reproduction and immune responses. The whitefly Bemisia tabaci Middle-East Asia Minor 1 (MEAM1) is one of the most widely distributed whitefly cryptic species, which is capable of transmitting hundreds of plant viruses to cause devastating losses on important agricultural crops, including beans, cassava, cotton, melon, sweetpotato and tomato [27,28]. Whitefly-transmitted viruses are classified in the genera of Begomovirus, Crinivirus, Ipomovirus, Torradovirus and Carlavirus. One of the most important begomoviruses is tomato yellow leaf curl virus (TYLCV), which has also been the focus for numerous other studies to understand the whitefly-begomovirus interactions [29-31]. In the present study, our preliminary analysis of sRNA sequence datasets revealed an enrichment of sRNAs (ranging from 26–31 nt) in addition to the typical miRNAs, which led us to explore the piRNA profiling. We performed genome-wide identification of piRNA clusters in whitefly (B. tabaci, MEAM1) and analyzed their expression levels in whiteflies feeding on TYLCV-infected or uninfected tomato plants for 24, 48, and 72 h. With ~45% of the whitefly genome sequence comprised of repeat elements, including miniature inverted repeat transposable elements (MITEs), LINEs, SINEs, LTRs and DNA transposons [28], piRNAs, involved in regulation of transposable elements, may play an important role in maintaining whitefly genome integrity, reproduction and antiviral immune responses. The piRNA pathway has been linked to other epidemiologically important phenotypes in mosquitoes [20,26,32]. The role of piRNAs in antiviral immune responses in both Aedes aegypti and A. albopictus has also been reported by multiple research groups [20,25,26]. Knowledge of the mechanism on how the piRNA pathway regulates reproduction and antiviral immune responses in whitefly could be useful for both fundamental and applied research. Our enhanced understanding of the piRNA pathway in whitefly could facilitate the identification of novel targets for vector control using RNAi and genome editing technologies.

Materials and methods

Whiteflies and feeding assays

The same whitefly materials under the identical treatments used in our previous transcriptome analysis [31] were used in the present study. Briefly, a whitefly B. tabaci MEAM1 colony was maintained at the USDA-ARS, U.S. Vegetable Laboratory on broccoli (Brassica oleracea L. var. botrytis) in a greenhouse (26°C ± 5°C). The MEAM1 colony was confirmed using PCR primers to amplify the mitochondrial cytochrome oxidase 1 gene [19]. Approximately 1,500 adult whiteflies were transferred to TYLCV-infected or uninfected tomato (Solanum lycopersicum cv. Moneymaker) plants and were allowed to feed for 24, 48, or 72 h, as described by Hasegawa et al. [31]. At the end of each time point, 200–500 living whiteflies were collected and immediately stored at -80°C until processing. Three biological replicates were performed for each treatment. Those viruliferous whiteflies were demonstrated in our previous study to contain TYLCV in acquisition periods of 24, 48 and 72 h, whereas those whiteflies fed on non-infected tomato plants were negative for TYLCV [31].

RNA isolation and library preparation

Total RNA was isolated with a TRIzol Reagent (Thermo Fisher Scientific, USA) according to the manufacturer’s protocol, and purified with the Direct-zol RNA MiniPrep kit (Zymo, USA). sRNAs were then polyacrylamide gel electrophoresis (PAGE)-purified and used to construct sRNA libraries as described [33,34]. A total of 18 sRNA libraries were constructed and sequenced on an Illumina HiSeq 2500 system.

Processing of sRNA raw reads

Raw sRNA reads were processed to remove adapters and low quality sequences using an in-house perl script included in the VirusDetect package [35]. For piRNA analysis, reads shorter than 25 nt and longer than 40 nt were also removed. The remaining size-selected reads were aligned to the Rfam database (Version 12.1) (http://rfam.sanger.ac.uk/) to filter out any reads that were mapped to rRNAs, tRNAs, and snRNAs. Raw and processed sequencing data are available in the GEO (Gene Expression Omnibus) database of the National Center for Biotechnology Information (NCBI) under accession number GSE111343.

Identification of piRNA clusters in whitefly

The high-quality cleaned sRNA reads were aligned to the whitefly reference genome using Bowtie [36] allowing up to two mismatches. Replicates were combined and genome-mapped reads were used to identify piRNA clusters using the proTRAC (Version 2.3) pipeline [37] with default parameters. proTRAC predicts potential piRNAs and piRNA clusters based on typical characteristics such as the number of loci with uridine (U) at position 1 or adenosine (A) at position 10, the number of loci within the typical piRNA length (26–31 nt), strand bias, and sliding window of 5 kb. Whitefly gene annotation and repeat sequence annotation obtained from the whitefly genome database [28] were utilized while running the proTRAC pipeline. The proTRAC output provides the normalized quantitative expression of piRNA clusters in reads per million mapped reads (RPM). This pipeline also generates a fasta file for individual clusters that contains all piRNA sequences. Differentially expressed piRNA clusters at each time points were identified using edgeR [38], with a cutoff of adjusted p-value < 0.05. In addition, we calculated the overall percentage for sense versus antisense strand derived piRNAs, and single versus multiple loci derived piRNAs.

piRNA cluster associated genes and transposable elements

Genes and repeat elements mapped to the piRNA clusters were retrieved from proTRAC output. We calculated the percentage of CDS-, repeat elements- and non-coding sequence-derived piRNAs utilizing the piRNA fasta files generated by proTRAC. Repeat annotation from the whitefly genome database [28] was utilized to divide piRNA cluster mapped repeat elements into different classes and families. We identified commonly expressed piRNA clusters across all time points using bedtools intersect [39]. Similarly, piRNA clusters that were induced or suppressed in whiteflies fed on uninfected versus TYLCV-infected plants were identified.

Results

Sequencing and initial processing of sRNA reads

To shed light on the whitefly piRNA profiles in response to feeding on TYLCV-infected tomato, deep sequencing of sRNAs was performed using RNAs isolated from whiteflies fed on either TYLCV-infected or uninfected tomato for 24, 48, and 72 h. We performed three biological replicates for each time point. Sequencing of 18 sRNA libraries produced a total of approximately 175 million reads with a range of approximately 5 to 13 million reads per library (S1 Table) and Pearson correlation coefficient (S2 Table) indicates high reproducibility of sequencing libraries. After trimming and removing low quality reads, rRNAs, tRNAs, and snRNAs, we found that reads with 28–31 nt in length were the most abundant in the datasets (Fig 1A), indicating that the libraries were enriched for reads corresponding to piRNAs. Reads that had lengths within the range of expected piRNAs of 26–40 nt were selected for further analysis.
Fig 1

A. Distribution of sRNA read lengths. Size distribution of sequence reads with peaking at 28–31 nt which is usual size range for piRNAs. Horizontal axis provides read length and vertical axis provides total number of reads at specific length. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively. Fig 1B. Distribution of piRNAs in different genomic regions. Percentage of all identified piRNAs mapping to different genomic features. Red color denotes piRNAs derived from coding sequences (CDS), green color denotes piRNAs derived from repeat elements, and blue denotes piRNAs derived from non-coding sequences including intergenic regions, introns and UTRs. C). Distribution of piRNAs based on strand. Green color denotes percentage of piRNAs derived from sense strand whereas brown color denotes percentage of piRNAs derived from antisense strand. D). Distribution of piRNAs based on mapping to number of genomic loci. Blue color denotes piRNAs mapped to multiple genomic loci whereas brown color denotes piRNAs mapped to single unique genomic loci. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively.

A. Distribution of sRNA read lengths. Size distribution of sequence reads with peaking at 28–31 nt which is usual size range for piRNAs. Horizontal axis provides read length and vertical axis provides total number of reads at specific length. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively. Fig 1B. Distribution of piRNAs in different genomic regions. Percentage of all identified piRNAs mapping to different genomic features. Red color denotes piRNAs derived from coding sequences (CDS), green color denotes piRNAs derived from repeat elements, and blue denotes piRNAs derived from non-coding sequences including intergenic regions, introns and UTRs. C). Distribution of piRNAs based on strand. Green color denotes percentage of piRNAs derived from sense strand whereas brown color denotes percentage of piRNAs derived from antisense strand. D). Distribution of piRNAs based on mapping to number of genomic loci. Blue color denotes piRNAs mapped to multiple genomic loci whereas brown color denotes piRNAs mapped to single unique genomic loci. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively.

Expression of core components of piRNA pathway genes in Bemisia tabaci

Core components of piRNA pathway genes, Ago-3 (Bta04637), Aub (Bta08949), and piwi (Bta00007 and Bta00198) are conserved in the B. tabaci genome [28]. We retrieved the expression values of the above genes from a recently published transcriptome dataset [31] that was generated using the same pool of total RNA from which sRNAs were isolated. We found that all core component genes of the piRNA pathway were expressed at all three time points (Table 1). The expression of Ago-3 ranged in16 to 18 RPKM, and Aub in 83 to 93 RPKM. Both piwi genes were also expressed, although at lower levels (Table 1), indicating a functional piRNA pathway is present in the whitefly B. tabaci. P-values (Table 1) obtained from edgeR [38] analysis indicate that expression level of these piRNA pathway genes did not change significantly across all three time points.
Table 1

Expression of genes encoding the core components of piRNA pathways.

GeneAnnotation24 h-H24 h-Vp-val48 h-H48 h-Vp-val72 h-H72 h-V*p-val
Bta04637argonaute-318.3816.28116.9417.42117.1316.091
Bta08949aubergine-like protein90.0588.42190.1897.91193.2183.521
Bta00007Piwi-like protein5.84.620.974.694.4315.155.131
Bta00198Piwi-like protein2.852.5113.472.6813.423.461

*24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours respectively. Expression values are provided in reads per kilobase per million mapped reads (RPKM). Expression values and edgeR [38] calculated p-values were retrieved from our recently published article [31].

*24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours respectively. Expression values are provided in reads per kilobase per million mapped reads (RPKM). Expression values and edgeR [38] calculated p-values were retrieved from our recently published article [31].

Identification of piRNA clusters

Whitefly genome-mapped reads were used to identify piRNA clusters using proTRAC pipeline [37]. We detected 57 to 96 clusters of piRNAs at each of the three different time points and treatments (Table 2 and S2 Table). The total number of piRNAs at each time point ranged from 564,395 to 1,715,652 (Table 2). The results obtained from the proTRAC pipeline also provides a detailed graphical view of each cluster. A representative view of an identified graphical cluster was shown in S1 Fig. On average, each cluster contained 13,998 piRNAs. We found piRNA clusters are derived from either positive or negative strand (S2 Table), suggesting they are mono directionally transcribed. The percentage of sense strand-derived piRNAs gradually decreased (from 60% to 49%) over the three tested time periods with acquisition of TYLCV, whereas the antisense strand-derived piRNAs increased (from 40% to 51%) over the same period time (Fig 1C). Additionally, we identified piRNAs matching to single genomic loci as well as multiple genomic loci. The percentage of piRNAs matching to a single genomic loci ranged from 41% to 47% whereas the percentage of piRNAs matching to multiple genomic loci ranged from 53% to 59% (Fig 1D). When mapped to the whitefly B. tabaci MEAM1 genome, piRNAs were shown to be derived from either coding sequences (CDS), transposable elements, or non-coding sequences. We found that ~ 62% of whitefly piRNAs were derived from non-coding sequences, which included intergenic regions, introns, and UTRs (Fig 1B). The potential roles for non-coding sequence derived piRNAs are still unknown. Approximately 38% of piRNAs were derived from CDS and repeat elements (Fig 1B).
Table 2

Summary of identified piRNA clusters in different libraries.

Library*Number of piRNA clustersTotal piRNAs
24 h-H961,715,652
24 h-V921,547,894
48 h-H68868,509
48 h-V67623,211
72 h-H71993,629
72 h-V57564,395

*24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours respectively.

*24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours respectively.

CDS and repeat elements derived piRNAs

Apart from the non-coding genomic-region derived piRNAs, there was a significant percentage of piRNAs that were derived from gene coding sequences and repeat elements which were of major interests for this study. The percentage of CDS-derived piRNAs in whiteflies fed on uninfected tomato were 14.6%, 21.0%, and 19.6% of all piRNAs for 24 h, 48 h and 72 h, respectively, whereas the percentages were 15.0%, 22.6%, and 26.3% in whiteflies fed on TYLCV-infected tomato (S2 Fig). These piRNAs were derived from genes that were predicted to be involved in diverse growth, developmental and metabolic pathways (S3 Table). piRNAs are well known for their involvement in silencing the activity of transposable elements. Overall 19.4% of all identified piRNAs were derived from different classes of repeat elements (Fig 1B and S4 Table). For whiteflies fed on uninfected and TYLCV-infected tomato for 24 h, approximately 16% of piRNAs were derived from repeat elements, whereas at 48 h, it increased to ~ 22% (S2 Fig). Similarly, we found approximately 21% and 23% of piRNAs derived from repeat elements at 72 h for whiteflies fed on uninfected and TYLCV-infected tomato, respectively (S2 Fig). Further analysis of repeat elements associated with piRNA clusters provided a comprehensive view of silencing of both class I (retrotransposons) and class II (DNA transposons) transposable elements. Two major families of retrotransposons are LTR and non-LTR retrotransposons. We found approximately 10%, 9% and 8% of LTR repeat elements were targeted by piRNA clusters at 24 h, 48 h, and 72 h, respectively, in whiteflies fed on uninfected tomato (Fig 2). Similarly, in whiteflies fed on TYLCV-infected tomato, we found 10%, 10%, and 13% of LTR repeat elements were targeted by piRNA clusters at 24 h, 48 h, and 72 h, respectively (Fig 2). We also identified piRNA clusters associated with non-LTR repeat elements that included LINEs and SINEs for each time point. Overall approximately 6–8% of non-LTR repeat elements were targeted by piRNA clusters across all time points. Our analysis identified a small percentage of DNA transposons targeted by piRNA clusters. They ranged between 4% and 7% of the total repeat elements targeted by piRNA clusters at all time points (Fig 2). Additionally, we found a major class of repeat elements targeted by piRNA clusters, called MITE elements, which ranged from 21% to 23% of the total repeat elements targeted by piRNA clusters across three different time points (Fig 2).
Fig 2

Distribution of repeat elements in piRNA clusters.

Different color denotes different families of transposable elements. It shows the percentage of specific repeat family present at specific time points after whiteflies fed on uninfected/TYLCV-infected tomato. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively.

Distribution of repeat elements in piRNA clusters.

Different color denotes different families of transposable elements. It shows the percentage of specific repeat family present at specific time points after whiteflies fed on uninfected/TYLCV-infected tomato. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively.

Comparison of identified piRNA clusters across three time points

To identify common and treatment-specific piRNA clusters, we performed comparative analysis across all time points using genomic coordinates of the identified piRNA clusters. We found 53 clusters that were commonly expressed across all time points in whiteflies fed on TYLCV-infected or uninfected tomato, and had normalized expression values that ranged from a few hundred to a few thousand RPM (S5 Table). We performed statistical test using edgeR to determine differentially expressed piRNA clusters at three time points separately. P-values (S5 Table and Table 3) obtained from edgeR analysis indicate that one of these common piRNA clusters showed significant differential expression. The top 10 highly expressed piRNA clusters were expressed at 2,606 to 10,708 RPM (Table 3) and the edgeR obtained p-values are greater than 0.05.
Table 3

Top 10 highly expressed piRNA clusters across three time points.

piRNA Cluster#Genomic Coordinate24 h-H24 h-Vp-val48 h-H48 h-Vp-val72 h-H72 h-V*p-val
Cluster 14Scaffold1496: 970038–9880278653.818785.460.99986585.596929.980.55337327.086488.660.1387
Cluster 51Scaffold942: 2258000–229402510483.8210708.490.99984817.165109.440.55338611.145167.090.1387
Cluster 6Scaffold12176: 1–84125176.255281.120.99983769.103932.870.55334396.973952.820.1387
Cluster 34Scaffold2737: 1425031–14399985036.335075.960.99984287.834512.690.55334445.803819.970.4808
Cluster 16Scaffold1558: 1069000–10801444824.274932.090.99983633.063849.600.55334179.383796.120.1387
Cluster 40Scaffold3872: 103000–1280264776.274861.660.99983710.973915.160.55334157.383639.440.1387
Cluster 44Scaffold637: 935001–9510254626.024675.210.99983745.703887.180.55334018.033585.030.1387
Cluster 50Scaffold942: 760005–7809244132.764153.560.99983403.433748.420.55333576.292941.600.1387
Cluster 31Scaffold2605: 3026002–30460173315.153469.880.99983052.213023.400.55333100.272910.320.163
Cluster 25Scaffold2124: 493014–5150274412.404478.170.99982643.392741.010.55333676.772606.070.5941

#Common cluster are named sequentially starts from 1 to 53. P-values were calculated using a Bioconductor package edgeR [38].

*24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours respectively. Expression values are provided in reads per million mapped reads (RPM).

#Common cluster are named sequentially starts from 1 to 53. P-values were calculated using a Bioconductor package edgeR [38]. *24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours respectively. Expression values are provided in reads per million mapped reads (RPM).

Induced piRNA clusters in whitefly acquired TYLCV

To evaluate whether the expression of certain piRNA clusters may be affected upon TYLCV acquisition, we identified five piRNA clusters that were exclusively induced and 24 piRNA clusters that were suppressed in whiteflies fed on TYLCV-infected tomato (Fig 3 and Table 4). Low p-values obtained from edgeR analysis indicate these expression differences are significant. Among those five piRNA clusters that were induced in whiteflies when fed on TYLCV-infected tomato (Table 4), their expression ranged from 592 to 2,387 RPM and targeted six protein coding genes as well as a number of transposable elements (Tables 5 and 6). Three of the protein coding genes (Bta07031, Bta05801, Bta02620) have unknown functions, while the other three genes (Bta07186, Bta07187, Bta07188) were annotated as encoding a COMM domain-containing protein, a sentrin-specific protease, and a mitochondrial carrier protein. In addition, these induced piRNA clusters targeted both class I retrotransposons and class II DNA transposons. The piRNA targets included multiple members of different families of transposable elements (Table 6).
Fig 3

Comparison of identified piRNA clusters across three time points.

Most piRNA clusters are common, and only handful of them are induced in whiteflies fed on TYLCV-infected tomato. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively.

Table 4

Primary piRNA clusters induced or suppressed upon TYLCV acquisition.

piRNA Cluster*Genomic CoordinatesCluster Size (bp)Normalized reads (RPM)P-value
Induced piRNAs in viruliferous whiteflies
24 h-V_Cluster 56Scaffold27: 2717004–2735011180082387.563.35E-62
48 h-V_Cluster 27Scaffold1805: 1007000–10120275028594.658.8E-50
48 h-V_Cluster 42Scaffold27: 2727003–27349587956727.631.34E-47
48 h-V_Cluster 57Scaffold699: 1484000–14890265027592.922.21E-51
72 h-V_Cluster 9Scaffold1335: 74003–810157013764.651.91E-55
Suppressed piRNAs in viruliferous whiteflies
24 h-H_Cluster 32Scaffold16276: 8857001–88640267026865.452.5E-58
24 h-H_Cluster 39Scaffold1805: 1007000–10130116012736.442.47E-54
24 h-H_Cluster 43Scaffold2068: 132008–14001680091007.068.74E-56
24 h-H_Cluster 49Scaffold226: 4092002–4102027100262289.333.35E-62
24 h-H_Cluster 72Scaffold469: 1514001–15209676967941.285.89E-55
48 h-H_Cluster 21Scaffold1580: 896006–9020266021640.282.8E-48
48 h-H_Cluster 36Scaffold226: 4014011–40700205601023289.992.58E-86
48 h-H_Cluster 38Scaffold2418: 1–566556653247.884.64E-63
48 h-H_Cluster 57Scaffold653: 1–79987998981.358.35E-55
72 h-H_Cluster 3Scaffold1144: 774002–7800206019676.321.37E-40
72 h-H_Cluster 9Scaffold130: 3650002–36569616960879.156.31E-49
72 h-H_Cluster 13Scaffold137: 1429000–1439844108452168.714.19E-59
72 h-H_Cluster 17Scaffold147: 9362004–937072887251354.551.45E-52
72 h-H_Cluster 22Scaffold16276: 8857001–88630246024662.553.92E-47
72 h-H_Cluster 30Scaffold199: 5961009–5973027120191953.233.01E-56
72 h-H_Cluster 36Scaffold226: 4014011–40700235601324372.343.01E-56
72 h-H_Cluster 37Scaffold2366: 1306015–1321028150142510.114.19E-59
72 h-H_Cluster 46Scaffold300: 230249–254907246595437.121.42E-64
72 h-H_Cluster 51Scaffold403: 2–60286027655.505.69E-46
72 h-H_Cluster 52Scaffold4455: 156013–172781167695224.843.35E-63
72 h-H_Cluster 60Scaffold707: 83004–9202490211124.671.48E-49
72 h-H_Cluster 63Scaffold811: 1066000–10710145015615.705.94E-42
72 h-H_Cluster 68Scaffold942: 2627013–2644026170145992.445.61E-64
72 h-H_Cluster 70Scaffold988: 12017–190257009629.348.86E-56

*Cluster names are associated with the library name in which the cluster was identified. Induced clusters were expressed only in viruliferous whiteflies, so their expression was considered zero in healthy control whiteflies when edgeR [38] analysis was performed for each time points. Similarly, suppressed clusters were expressed only in healthy control whiteflies, so their expression was considered zero in viruliferous whiteflies.

Table 5

Predicted target genes of induced piRNA clusters.

piRNA Cluster*Target geneAnnotationTranscripts level in RPKM
24 h-V_Cluster 56Bta07031Unknown protein7.52
48 h-V_Cluster 27No target geneN/ANA
48 h-V_Cluster 42Bta05801Unknown protein5.45
48 h-V_Cluster 57Bta07186COMM domain-containing protein 44.76
48 h-V_Cluster 57Bta07187Sentrin-specific protease 10
48 h-V_Cluster 57Bta07188Mitochondrial carrier protein1.5
72 h-V_Cluster 9Bta02620Unknown protein0

*Cluster names are associated with the library name in which the cluster was identified.

N/A indicated that no gene is targeted by a piRNA cluster.

Table 6

Transposable elements targeted by induced piRNA clusters.

piRNA Cluster*DNA TransposonRetrotransposon
24 h-V_Cluster 56MITE, hAT, TcMarLINE/Penelope, LINE/I
48 h-V_Cluster 27LINE/Jockey, LTR/Gypsy
48 h-V_Cluster 42MITE, TcMarLINE/Phenelope
48 h-V_Cluster 57LINE/R1
72 h-V_Cluster 9 LTR/Gypsy, LTR/Copia

*Cluster names are associated with the library name in which the cluster was identified.

Comparison of identified piRNA clusters across three time points.

Most piRNA clusters are common, and only handful of them are induced in whiteflies fed on TYLCV-infected tomato. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours, respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively. *Cluster names are associated with the library name in which the cluster was identified. Induced clusters were expressed only in viruliferous whiteflies, so their expression was considered zero in healthy control whiteflies when edgeR [38] analysis was performed for each time points. Similarly, suppressed clusters were expressed only in healthy control whiteflies, so their expression was considered zero in viruliferous whiteflies. *Cluster names are associated with the library name in which the cluster was identified. N/A indicated that no gene is targeted by a piRNA cluster. *Cluster names are associated with the library name in which the cluster was identified.

Suppressed piRNA clusters when whitefly acquired TYLCV

Among those 24 piRNA clusters that were suppressed in whiteflies when fed on TYLCV (Table 4), expression values of the piRNA clusters ranged from 615 to 24,372 RPM, and targeted a total of 17 protein coding genes with diverse biological functions (S6 Table). Eight of them have unknown functions and the remaining genes are not predicted to be involved in any single pathway. In addition to protein coding genes, these piRNA clusters also target both class I and II transposable elements (S7 Table). Class I retrotransposons targeted by these piRNA clusters include LTR and Non-LTR elements. Moreover, a number of class II DNA TE families were also targeted by these piRNA clusters.

Discussion

piRNAs are widely known to be involved in silencing the activity of transposable elements. Numerous studies on the piRNA pathway have broadened our understanding of the role of piRNA-mediated regulation of TEs in genetic, biological and developmental processes [40-42]. Additionally, piRNAs also regulate the activity of protein coding genes reported for a number of insect species. More recently different groups have reported that piRNAs target the non-coding sequences as well [8,13]. We found that the highest percentage of piRNAs in whitefly is derived from non-coding sequences (Fig 1B), which makes it challenging to infer their potential functions. However, transposable elements and protein coding genes are also targeted by piRNAs. It was quite interesting to understand that piRNA clusters could be derived either from a positive or from a negative stranded DNA in the whitefly genome. Generation of different orientation of piRNAs was likely due to the two different mechanisms of piRNA biogenesis [7,13,14]. First, piRNA precursors are transcribed from piRNA clusters, and then processed into primary piRNAs [8,16,17], which lead to piRNA-directed cleavage of mRNAs [8,14]. On the other hand, secondary piRNAs are generated through the ping-pong cycle [13,14], which lead to an amplification of piRNA regeneration. The percentage of sense strand-derived piRNAs gradually decreased over time, from 60% at 24 h to 49% at 72 h post virus acquisition in the viruliferous whiteflies, whereas the antisense strand-derived piRNAs increased from 40% to 51% during the same period of time. Although such trends are interesting, we are still unsure about their implications. Recent studies have demonstrated the adaptive antiviral immunity in the mosquito A. aegyti [43] as well as many of other arthropods [44] suggesting likely involvement of piRNAs in facilitating TYLCV transmission by the vector whitefly B. tabaci. In this study, we found that piRNAs were derived from both class I retrotransposons and class II DNA transposons. For class I, three classes of LTR retrotransposons, gypsy, copia and pao-Bel are the most prevalent sources of piRNAs (S4 Table). Previous studies in mosquito and other insects showed the same dominant sources for LTR retrotransposon-derived piRNAs as observed in whitefly [7,11,23]. We also found piRNAs derived from non-LTR retrotransposons, LINEs and SINEs in whitefly. In addition to class I retrotransposons, our study identified piRNAs derived from different families of DNA transposons (S4 Table), including MITE, hAT, TcMar, Maverick, Academ and MULE-MuDR. The highest percentage of piRNAs are derived from MITE DNA transposons when compared among individual TE families (Fig 2). Overall, our results demonstrated that TE-piRNA production in whitefly shared the trends of piRNA biogenesis in other insect species. The mechanism of piRNA-mediated silencing is complex and likely depends on both transcriptional and posttranscriptional mechanisms. Perhaps the mechanism of silencing might differ among different families of transposable elements. An increasing number of studies reported that piRNAs also target protein coding genes and silence their expression [8,13]. In addition to CDS, they can also target introns and UTRs. Even though gene-derived piRNAs were predominantly mapped to the sense strand in several mosquito species [7,13], we found piRNAs derived from genes in whiteflies are produced from both sense and antisense strands. We observed that the piRNA targeted genes (S3 Table) were not associated with any particular pathway, but instead, seemed to be involved in diverse biological processes. Our study identified a large number of piRNA clusters. We divided the piRNA clusters into three different categories such as common piRNA clusters, and induced and suppressed piRNA clusters in the presence of TYLCV in whiteflies. We identified 53 common piRNA clusters that are expressed in whiteflies that fed on tomatoes with or without TYLCV infection (S5 Table). These piRNA clusters showed consistent expression across all treatments and did not show any differential expression and targeted numerous classes of transposable elements and protein coding genes of different biological functions. It is likely that these common piRNA clusters are not involved in the acquisition, circulation or transmission of TYLCV in whitefly B. tabaci. In addition to common clusters, it was interesting to observe that five piRNA clusters were induced when whiteflies fed on TYLCV-infected tomato. However, the expression of these specific piRNA clusters was not consistent across three time points, suggesting a potential influence from the virus circulation or trafficking through different orgasm in the viruliferous insect vector as measured post acquisition for 24, 48 and 72 h. These specific piRNA clusters targeted six protein coding genes (Table 5). We found that half of the genes have unknown functions, while the other three were annotated as encoding a COMM domain-containing protein, a sentrin-specific protease, and a mitochondrial carrier protein. Based on our previous published transcriptome data [31], these genes are expressed at very low levels, as they are potentially targeted by piRNAs. However, it is still not clear how these proteins are involved in virus transmission or anti-viral immunity in whitefly. The COMM domain family is a multifunctional protein, which could regulate transcription factor NF-kappa-B and copper metabolism. It has been reported that COMM domain-containing proteins modulate the activity of cullin-RING E3 ubiquitin ligase (CRL) complexes [45], and promote the ubiquitination and proteasomal degradation of cellular proteins. Perhaps silencing the expression of this gene in whitefly by piRNAs facilitates deubiquitination of certain cellular proteins involved in virus transmission. Other induced piRNA clusters target sentrin-specific protease 1 (SENP1), which is involved in the small ubiquitin-like modifier (SUMO) pathway. This protein is associated with deconjugation of SUMO from targeted proteins [46,47]. As SENP1 expression is suppressed by the action of piRNA, it could facilitate the conjugation of SUMO to the target proteins, suggesting that piRNA mediated suppression of SENP1 might be playing a role in anti-viral defense, thus to facilitate circulation or trafficking of the virus particles in a viruliferous insect vector. Another gene that was suppressed by the induced piRNA clusters upon acquisition of TYLCV was the mitochondrial carrier protein gene. Mitochondrial carrier proteins transfer molecules across the membranes in the mitochondria. Suppression of the gene encoding mitochondrial carrier protein by a piRNA cluster may facilitate virus trafficking across the membrane to enhance virus transmission. We also looked into the transposable elements targeted by these piRNA clusters. We found that both classes of TEs were targeted by piRNAs, including Class I: LTR/Gypsy, Copia and LINEs/Penelope, R1, L1, and Jockey and class II DNA transposons: MITE, hAT, and TcMar (Table 6). Future in-depth studies are needed to understand the role of piRNAs during virus transmission and immunity. We also identified 24 piRNA clusters that were induced in whiteflies fed on uninfected tomato (Table 4). There were 15 genes targeted by these piRNA clusters (S6 Table), with one-third of them having unknown functions. The remaining genes were not associated with any particular pathway. Our analysis also found that both class I and II transposable elements were targeted by these piRNA clusters (S7 Table). Class I retrotransposons that were targeted include LTR/Gypsy, Copia, Pao, LINE/DRE, Jockey, Penelope, LOA, CR1, RTE, Dong-4, and SINE, whereas class II DNA transposons include MITE, DNA/Academ, Kolobok-T2, Mariner, MULE-MuDR, Maverick, and hAT. There were multiple members of each of those families of transposable elements. Further studies are needed to pinpoint how the suppression of these piRNAs by TYCLV is associated with anti-viral defense mechanisms. Moreover, a detailed follow up functional study is also required to elucidate the mechanisms on how piRNAs are associated with virus transmission and anti-viral defense in whitefly. This study provides genome-wide profiling of piRNAs in whiteflies. We found that piRNA clusters were distributed across the whitefly genome, with more than 60% of identified piRNA clusters being derived from non-coding regions with unknown function. Comparative analysis revealed that feeding on a virus-infected host caused induction and suppression of only a small number of piRNA clusters in whiteflies. Majority of the identified piRNA clusters were commonly expressed across all time points in whiteflies fed on TYLCV-infected or uninfected tomato. Apart from piRNAs targeting non-coding regions, other piRNAs target protein coding genes and transposable elements. Although piRNAs primarily regulate the activity of TEs, our results suggest that they may have additional functions in regulating protein coding genes and in insect-virus interactions. This report on piRNA profiling of whitefly will serve as a valuable resource to the whitefly research community. Additionally, this study could also open new avenues to target piRNA-associated genes in whitefly using RNAi or genome editing technologies for vector management.

Distribution of piRNAs in different genomic regions.

Representative piRNA clusters identified by the proTRAC program. It provides detailed information about the piRNA clusters including the genomic coordinates, read coverage, genes and repeat elements falling within the cluster. (TIFF) Click here for additional data file. Percentage of piRNAs mapping to different genomic features. Red color denotes piRNAs derived from coding sequences (CDS), green color denotes piRNAs derived from repeat elements, and blue denotes piRNAs derived from non-coding sequences including intergenic regions, introns and UTRs. 24 h-H, 48 h-H, and 72 h-H represents whiteflies fed on uninfected tomato for 24, 48 and 72 hours respectively. 24 h-V, 48 h-V, and 72 h-V represents whiteflies fed on TYLCV-infected tomato for 24, 48 and 72 hours, respectively. (TIFF) Click here for additional data file.

Summary of reads in 18 small RNA libraries.

(DOCX) Click here for additional data file.

Pearson's correlation coefficients for three biological replicates and Clusters of piRNAs at each of the three different time points and treatments.

(XLSX) Click here for additional data file.

Predicted piRNAs target genes involved in diverse growth, developmental and metabolic pathways.

(XLSX) Click here for additional data file.

piRNAs targeting different classes of repeat elements.

(XLSX) Click here for additional data file.

piRNA clusters commonly expressed across all time points in whiteflies fed on TYLCV-infected or uninfected tomato.

(XLSX) Click here for additional data file.

Predicted genes targeted by suppressed piRNA clusters.

(DOCX) Click here for additional data file.

Transposable elements targeted by suppressed piRNA clusters.

(DOCX) Click here for additional data file.
  45 in total

Review 1.  Small RNAs and the control of transposons and viruses in Drosophila.

Authors:  Ronald P van Rij; Eugene Berezikov
Journal:  Trends Microbiol       Date:  2009-03-18       Impact factor: 17.079

2.  Comparative transcriptome analysis reveals networks of genes activated in the whitefly, Bemisia tabaci when fed on tomato plants infected with Tomato yellow leaf curl virus.

Authors:  Daniel K Hasegawa; Wenbo Chen; Yi Zheng; Navneet Kaur; William M Wintermantel; Alvin M Simmons; Zhangjun Fei; Kai-Shu Ling
Journal:  Virology       Date:  2017-10-13       Impact factor: 3.616

3.  Biogenesis pathways of piRNAs loaded onto AGO3 in the Drosophila testis.

Authors:  Akihiro Nagao; Toutai Mituyama; Haidong Huang; Dahua Chen; Mikiko C Siomi; Haruhiko Siomi
Journal:  RNA       Date:  2010-10-27       Impact factor: 4.942

Review 4.  The biogenesis and function of PIWI proteins and piRNAs: progress and prospect.

Authors:  Travis Thomson; Haifan Lin
Journal:  Annu Rev Cell Dev Biol       Date:  2009       Impact factor: 13.827

5.  Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila.

Authors:  Julius Brennecke; Alexei A Aravin; Alexander Stark; Monica Dus; Manolis Kellis; Ravi Sachidanandam; Gregory J Hannon
Journal:  Cell       Date:  2007-03-08       Impact factor: 41.582

6.  The mosquito Aedes aegypti has a large genome size and high transposable element load but contains a low proportion of transposon-specific piRNAs.

Authors:  Peter Arensburger; Robert H Hice; Jennifer A Wright; Nancy L Craig; Peter W Atkinson
Journal:  BMC Genomics       Date:  2011-12-15       Impact factor: 3.969

7.  Production of virus-derived ping-pong-dependent piRNA-like small RNAs in the mosquito soma.

Authors:  Elaine M Morazzani; Michael R Wiley; Marta G Murreddu; Zach N Adelman; Kevin M Myles
Journal:  PLoS Pathog       Date:  2012-01-05       Impact factor: 6.823

8.  Endogenous siRNAs and piRNAs derived from transposable elements and genes in the malaria vector mosquito Anopheles gambiae.

Authors:  Inna Biryukova; Tao Ye
Journal:  BMC Genomics       Date:  2015-04-10       Impact factor: 3.969

Review 9.  Transposable elements in Drosophila.

Authors:  Tabitha J McCullers; Mindy Steiniger
Journal:  Mob Genet Elements       Date:  2017-04-19

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more
  1 in total

1.  Exploiting somatic piRNAs in Bemisia tabaci enables novel gene silencing through RNA feeding.

Authors:  Mosharrof Mondal; Judith K Brown; Alex Flynt
Journal:  Life Sci Alliance       Date:  2020-08-06
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.