| Literature DB >> 22754749 |
Kevin J Lee, Andrew B Conley, Victoria V Lunyak, I King Jordan.
Abstract
It is currently thought that small RNA (sRNA) based repression mechanisms are primarily employed to mitigate the mutagenic threat posed by the activity of transposable elements (TEs). This can be achieved by the sRNA guided processing of TE transcripts via Dicer-dependent (e.g., siRNA) or Dicer-independent (e.g., piRNA) mechanisms. For example, potentially active human L1 elements are silenced by mRNA cleavage induced by element encoded siRNAs, leading to a negative correlation between element mRNA and siRNA levels. On the other hand, there is emerging evidence that TE derived sRNAs can also be used to regulate the host genome. Here, we evaluated these two hypotheses for human TEs by comparing the levels of TE derived mRNA and TE sRNA across six tissues. The genome defense hypothesis predicts a negative correlation between TE mRNA and TE sRNA levels, whereas the genome regulatory hypothesis predicts a positive correlation. On average, TE mRNA and TE sRNA levels are positively correlated across human tissues. These correlations are higher than seen for human genes or for randomly permuted control data sets. Overall, Alu subfamilies show the highest positive correlations of element mRNA and sRNA levels across tissues, although a few of the youngest, and potentially most active, Alu subfamilies do show negative correlations. Thus, Alu derived sRNAs may be related to both genome regulation and genome defense. These results are inconsistent with a simple model whereby TE derived sRNAs reduce levels of standing TE mRNA via transcript cleavage, and suggest that human cells efficiently process TE transcripts into sRNA based on the available message levels. This may point to a widespread role for processed TE transcripts in genome regulation or to alternative roles of TE-to-sRNA processing including the mitigation of TE transcript cytotoxicity.Entities:
Year: 2012 PMID: 22754749 PMCID: PMC3383446 DOI: 10.4161/mge.19031
Source DB: PubMed Journal: Mob Genet Elements ISSN: 2159-2543
Table 1. Results of the tag-to-genome mapping for mRNA and sRNA sequence libraries for six human tissues
| Reads per tissue | Reads after clipping | Reads that map to hg18 | % reads mapped | Reads that map to TEs | % of mapping reads that map to TEs | Reads that map to genes | % of mapping reads that map to genes | |
|---|---|---|---|---|---|---|---|---|
| | | | | | | | | |
| brain | 34,493,914 | n/a | 28,389,338 | 82.3 | 1,001,006 | 3.5 | 24,194,582 | 85.2 |
| heart | 40,338,602 | n/a | 32,751,816 | 81.2 | 571,069 | 1.7 | 26,665,851 | 81.4 |
| kidney | 83,696,940 | n/a | 42,051,713 | 50.2 | 3,828,411 | 9.1 | 33,587,016 | 79.9 |
| liver | 125,090,140 | n/a | 73,281,292 | 58.6 | 6,769,796 | 9.2 | 64,212,056 | 87.6 |
| lung | 25,862,057 | n/a | 19,808,655 | 76.6 | 3,138,208 | 15.8 | 16,434,340 | 83.0 |
| muscle | 45,280,908 | n/a | 36,984,450 | 81.7 | 919,399 | 2.5 | 32,413,952 | 87.6 |
| | | | | | | | | |
| brain | 5,021,339 | 2,977,817 | 2,939,957 | 98.7 | 33,102 | 1.1 | 2,452,355 | 83.4 |
| heart | 5,901,910 | 4,937,144 | 4,921,992 | 99.7 | 42,284 | 0.9 | 4,701,738 | 95.5 |
| kidney | 2,869,903 | 2,135,001 | 2,108,413 | 98.8 | 23,959 | 1.1 | 1,720,229 | 81.6 |
| liver | 6,312,578 | 3,448,077 | 3,422,122 | 99.2 | 74,695 | 2.2 | 860,191 | 25.1 |
| lung | 7,294,106 | 4,808,564 | 4,709,583 | 97.9 | 62,764 | 1.3 | 3,652,715 | 77.6 |
| muscle | 3,793,410 | 3,537,750 | 3,532,680 | 99.9 | 38,019 | 1.1 | 3,458,249 | 97.9 |

Figure 1. Scheme of the analytical pipeline and tools presented herein. (A) Analytical pipeline overview. (B) Example of the linear regression and correlation analysis used to compare mRNA vs. sRNA levels for individual TE subfamilies and genes across six human tissues. (C) Example of the distribution of the resulting correlation coefficients for all genes.

Figure 2. mRNA vs. sRNA correlation coefficient distributions for human TE subfamilies and genes across six tissues. (A) Observed (blue) and randomized (red) correlation coefficient distributions for TE subfamilies. (B) Observed (blue) and randomized (red) correlation coefficient distributions for genes. (C) Correlation coefficient median ± standard error values for TE subfamily and gene observed (blue) vs. random (red) distributions.

Figure 3. Median ± standard error values for the (A) correlation coefficient and (B) slope distributions for individual TE family (classes).