| Literature DB >> 25267446 |
Jimin Song1, Jixia Liu2, Sandra L Schnakenberg3, Hongseok Ha4, Jinchuan Xing4, Kevin C Chen5.
Abstract
Transposable elements (TEs) are one of the most important features of genome architecture, so their evolution and relationship with host defense mechanisms have been topics of intense study, especially in model systems such as Drosophila melanogaster. Recently, a novel small RNA-based defense mechanism in animals called the Piwi-interacting RNA (piRNA) pathway was discovered to form an adaptive defense mechanism against TEs. To investigate the relationship between piRNA and TE content between strains of a species, we sequenced piRNAs from 16 inbred lines of D. melanogaster from the Drosophila Genetic Reference Panel. Instead of a global correlation of piRNA expression and TE content, we found evidence for a host response through de novo piRNA production from novel TE insertions. Although approximately 20% of novel TE insertions induced de novo piRNA production, the abundance of de novo piRNAs was low and did not markedly affect the global pool of ovarian piRNAs. Our results provide new insights into the evolution of TEs and the piRNA system in an important model organism.Entities:
Keywords: Drosophila melanogaster; de novo piRNA production; piRNA; transposable elements
Mesh:
Substances:
Year: 2014 PMID: 25267446 PMCID: PMC4224344 DOI: 10.1093/gbe/evu217
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
The Percentage of Novel TE Insertions with the De Novo piRNA Signature As the RPKM and Skewness Cutoffs Are Varied (Materials and Methods)
| RPKM Cutoff | Skewness Cutoff | Median of the Percentages of Novel TE Insertions with De Novo piRNA Signature over | ||
|---|---|---|---|---|
| Cridland (%) | Linheiro (%) | TEMP (%) | ||
| 0.5 | 70 | 19.6 | 21.6 | 5.6 |
| 1 | 70 | 11.2 | 13.7 | 3.6 |
| 3 | 70 | 3.8 | 3.8 | 1.6 |
| 0.5 | 90 | 9.2 | 9.6 | 3.7 |
| 1 | 90 | 6.7 | 5.9 | 2.5 |
| 3 | 90 | 2.4 | 0 | 1.2 |
aRPKM cutoff x means the number of piRNAs is at least RPKM ≥x on each 1-Kb flanking region
bSkewness cutoff y means at least y% of piRNAs in each flanking region are skewed on the minus strand upstream or on the plus strand downstream in the genome.
The Number of Sequencing Reads at Each Step of Our Computational Pipeline for the Pooled Sample of All 16 Strains
| Step | Number of Reads in the Pool (Million) |
|---|---|
| Raw | 558.4 |
| 5 ≤ size ≤ 45 | 546.3 |
| Removed small RNA-derived reads | 334.0 |
| Removed unmapped reads | 269.8 |
| 23 ≤ size ≤ 29 | 226.1 |
FThe distribution of read sizes for all mapped putative piRNA reads in the pooled sample. There is a clear peak in the range 23–29 nt.
FDistribution of overlap sizes between all pairs of overlapping piRNAs for strain 705. There is a clear ping-pong signal with a peak at 10 nt. The peak of the distribution for uniquely mapped reads is not as sharp as for all mapped reads, presumably due to missing reads from bona fide ping-pong pairs.
FDistribution of piRNA-generating loci across different classes of genomic elements. In order not to double-count overlapping regions, TE-derived piRNAs are defined as TE-derived but not within annotated piRNA clusters and genic piRNAs are defined as genic but neither TE-derived nor within annotated clusters.
FThe distribution of CVs of piRNA cluster expression levels between strains computed from all mapped and uniquely mapped piRNA reads. The distribution of CVs of protein-coding gene expressions between strains is plotted as a reference.
Pearson Correlation between the Normalized Number of All Mapped piRNAs and the Number of Novel TE Insertions for Each TE Family
| TE Family | Pearson’s | TE Family | Pearson’s | ||
|---|---|---|---|---|---|
| Stalker2 | −0.70 | 0.02 | roo | 0.49 | 0.13 |
| G2 | −0.69 | 0.02 | pogo | 0.48 | 0.14 |
| Rt1b | −0.59 | 0.05 | Max | 0.48 | 0.14 |
| gypsy12 | −0.30 | 0.37 | mdg1 | 0.46 | 0.15 |
| I | −0.24 | 0.49 | Cr1a | 0.44 | 0.18 |
| transib3 | −0.17 | 0.61 | 412 | 0.35 | 0.29 |
| 3S18 | −0.15 | 0.66 | Quasimodo | 0.32 | 0.34 |
| Doc | −0.14 | 0.67 | FB | 0.29 | 0.39 |
| Transpac | −0.14 | 0.68 | flea | 0.26 | 0.44 |
| INE-1 | −0.08 | 0.80 | Tirant | 0.24 | 0.49 |
| Tabor | −0.05 | 0.87 | Juan | 0.21 | 0.54 |
| Burdock | −0.02 | 0.95 | F | 0.12 | 0.73 |
| S | 0.09 | 0.79 |
FThe overlap of TE predictions between the three novel TE annotation data sets for one strain that was annotated by all three methods.
FNumber of piRNA reads found in the flanking regions (±1 kb) of novel TE insertions in DGRP strain 391. PiRNA reads are mostly found on the minus strand upstream (i.e., to the left) of novel TE insertions and on the plus strand downstream (i.e., to the right) of novel TE insertions. The percentage of piRNAs with a 5′-uridine (1U) confirms that the reads are likely to be piRNAs. The y axis shows the actual number of sequencing reads.
When Considering All Novel TE Insertions with De Novo piRNA Signatures, Most piRNAs in the Left Flanking Region Are on the Minus Strand with High 1U Percentage and Most piRNAs in the Right Flanking Region Are on the Plus Strand with High 1U Percentage
| TE Annotation Data Set | Strain | % of piRNAs on Minus Strand Upstream | % 1U of piRNAs on Minus Strand Upstream | % of piRNAs on Plus Strand Downstream | % 1U of piRNAs on Plus Strand Downstream |
|---|---|---|---|---|---|
| Cridland | 375 | 91.4 | 74.1 | 94.5 | 73.8 |
| 380 | 92.3 | 74.8 | 92.5 | 74.3 | |
| 391 | 95.6 | 79.1 | 95.0 | 77.6 | |
| 399 | 96.8 | 66.0 | 98.7 | 69.5 | |
| 427 | 94.7 | 70.3 | 96.8 | 64.4 | |
| 437 | 95.8 | 75.2 | 96.8 | 76.5 | |
| 555 | 97.4 | 74.3 | 93.6 | 81.8 | |
| 705 | 94.4 | 81.8 | 97.8 | 70.8 | |
| 707 | 95.4 | 74.5 | 96.2 | 73.9 | |
| 714 | 90.5 | 100.0 | 95.4 | 68.6 | |
| Linheiro | 313 | 100.0 | 100.0 | 100.0 | 73.3 |
| 358 | 100.0 | 66.7 | 86.2 | 66.7 | |
| 362 | 94.6 | 63.3 | 83.8 | 75.5 | |
| 375 | 93.5 | 75.0 | 93.2 | 73.1 | |
| 379 | 90.9 | 75.2 | 97.8 | 68.8 | |
| 380 | 90.3 | 76.2 | 95.2 | 79.9 | |
| 391 | 96.6 | 78.5 | 96.5 | 73.6 | |
| 399 | 88.6 | 68.0 | 79.6 | 79.5 | |
| 555 | 94.1 | 79.7 | 89.3 | 76.0 | |
| 705 | 99.0 | 78.3 | 86.8 | 72.3 | |
| 707 | 90.8 | 70.8 | 98.3 | 79.7 | |
| 712 | 94.9 | 77.1 | 87.9 | 79.8 | |
| 714 | 92.0 | 70.4 | 96.9 | 75.3 | |
| 732 | 94.8 | 78.3 | 96.5 | 77.5 | |
| TEMP | 362 | 93.9 | 71.6 | 94.6 | 70.0 |
| 391 | 95.1 | 77.4 | 96.2 | 77.5 | |
| 437 | 92.2 | 74.9 | 95.9 | 72.6 |
Ovary-Expressed Genes Tend to Show a Significant Difference in Gene Expression between Strains with and without Novel TE Insertions in the Gene
| TE Data Set | Number of Genes with Novel TE Insertions | Showing Significant Gene Expression Difference by Nominal | Showing Significant Gene Expression Difference by | ||||
|---|---|---|---|---|---|---|---|
| Number of Genes | Number of Ovary-Expressed Genes | Number of Genes | Number of Ovary-Expressed Genes | ||||
| Linheiro | 248 | 41 | 22 (53.7%) | 2.1e-3 | 9 | 7 (77.8%) | 5.3e-3 |
| Cridland | 344 | 73 | 40 (54.8%) | 2.0e-5 | 20 | 10 (50.0%) | 0.06 |
| TEMP | 3,211 | 98 | 43 (43.9%) | 4.8e-3 | 0 | 0 | N/A |
aP values are from Fisher exact tests.