| Literature DB >> 25335504 |
Julie M Cridland1, Kevin R Thornton2, Anthony D Long2.
Abstract
Transposable elements are a common source of genetic variation that may play a substantial role in contributing to gene expression variation. However, the contribution of transposable elements to expression variation thus far consists of a handful of examples. We used previously published gene expression data from 37 inbred Drosophila melanogaster lines from the Drosophila Genetic Reference Panel to perform a genome-wide assessment of the effects of transposable elements on gene expression. We found thousands of transcripts with transposable element insertions in or near the transcript and that the presence of a transposable element in or near a transcript is significantly associated with reductions in expression. We estimate that within this example population, ∼2.2% of transcripts have a transposable element insertion, which significantly reduces expression in the line containing the transposable element. We also find that transcripts with insertions within 500 bp of the transcript show on average a 0.67 standard deviation decrease in expression level. These large decreases in expression level are most pronounced for transposable element insertions close to transcripts and the effect diminishes for more distant insertions. This work represents the first genome-wide analysis of gene expression variation due to transposable elements and suggests that transposable elements are an important class of mutation underlying expression variation in Drosophila and likely in other systems, given the ubiquity of these mobile elements in eukaryotic genomes.Entities:
Keywords: DGRP; gene expression; rare alleles of large effect; transposable elements
Mesh:
Substances:
Year: 2014 PMID: 25335504 PMCID: PMC4286695 DOI: 10.1534/genetics.114.170837
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
TE insertions within and near transcripts
| Region | No. TE insertions | Expected insertions |
|---|---|---|
| Within exon | 235 | 1096 |
| Introns ≤400 bp | 67 | 130 |
| Within 200 bp of acceptor site | 61 | 90 |
| Within 200 bp of donor site | 63 | 100 |
| Within first intron | 527 | 783 |
| Not within first intron | 754 | 1124 |
| ≤500 bp of TSS | 170 | 247 |
| 501 bp to 2 kb of TSS | 389 | 599 |
| >2 kb of TSS | 1609 | 2126 |
| ≤500 bp of TES | 200 | 234 |
| 501 bp to 2 kb of TES | 333 | 544 |
| >2 kb of TES | 1515 | 1987 |
Figure 1Normalized rank expression of transposable elements. Observed numbers of TE-containing lines per rank bin vs. 10,000 permutations. Red dots indicate the observed number of TE-containing lines; box plots show permutations. Box plot tails indicate the 2.5% and the 97.5% confidence intervals. Open circles above and below the box plots indicate the 0.5% and the 99.5% confidence interval. (A) TE is in an exon, (B) TE is in a 1st intron, (C) TE is in an intron ≤400bp in length, (D) TE is not in 1st Intron, (E) TE ≤ 500bp from TSS, (F) TE ≤ 500bp from TES, (G) TE ≤ 200bp from a donor site, and (H) TE ≤ 200bp from an acceptor site.
Causative mutations
| Category | Observed | Expected | O–E | % functional |
|---|---|---|---|---|
| Within exon | 101 | 7.1 | 93.9 | 93.0 |
| Introns ≤400 bp | 14 | 2.1 | 11.9 | 85.0 |
| Within 200 bp of acceptor site | 5 | 1.8 | 3.2 | 64.0 |
| Within 200 bp of donor site | 12 | 1.8 | 10.2 | 85.0 |
| Within first intron | 38 | 15.3 | 22.7 | 59.7 |
| Not within first intron | 41 | 24.4 | 16.6 | 40.5 |
| ≤500 bp of TSS | 20 | 5.4 | 14.6 | 73.0 |
| 501 bp to 2 kb of TSS | 17 | 11.9 | 5.1 | 30.0 |
| >2 kb of TSS | 55 | 60.4 | −5.4 | −9.8 |
| ≤500 bp of TES | 19 | 6.0 | 13.0 | 68.4 |
| 501 bp to 2 kb of TES | 13 | 10.0 | 3.0 | 23.1 |
| >2 kb of TES | 68 | 56.2 | 11.8 | 17.4 |
| Total | 403 | 202.4 | 200.6 |
% functional = (observed–expected)/observed.
Mean z-scores for transcripts with TEs
| Category | Mean | |
|---|---|---|
| Within exon | −3.44 | 249 |
| Introns ≤400 bp | −1.03 | 72 |
| Within 200 bp of acceptor site | −0.90 | 64 |
| Within 200 bp of donor site | −0.67 | 64 |
| Within first intron | −0.37 | 545 |
| Not within first intron | −0.11 | 852 |
| ≤500 bp of TSS | −0.43 | 186 |
| 501 bp to 2 kb of TSS | −0.01 | 418 |
| >2 kb of TSS | −0.05 | 2121 |
| ≤500 bp of TES | −0.52 | 213 |
| 501 bp to 2 kb of TES | −0.04 | 347 |
| >2 kb of TES | −0.02 | 1976 |
Mean z-scores are calculated from the transcript/TE pairs for all transcripts with an insertion in each location category.
Figure 2Transposable elements as a class of variation. Probability–probability plot of observed and expected P-values from t-tests of all cases where four or more lines show an independent TE insertion in the same location category for the same transcript.