| Literature DB >> 21985647 |
Svitlana Tyekucheva1, Robert H Yolken, W Richard McCombie, Jennifer Parla, Melissa Kramer, Sarah J Wheelan, Sarven Sabunciyan.
Abstract
BACKGROUND: Although nearly half of the human genome is comprised of repetitive sequences, the expression profile of these elements remains largely uncharacterized. Recently developed high throughput sequencing technologies provide us with a powerful new set of tools to study repeat elements. Hence, we performed whole transcriptome sequencing to investigate the expression of repetitive elements in human frontal cortex using postmortem tissue obtained from the Stanley Medical Research Institute.Entities:
Mesh:
Year: 2011 PMID: 21985647 PMCID: PMC3207997 DOI: 10.1186/1471-2164-12-495
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Detection of HERV-W gag transcripts by Northern blot analysis. The Northern blot demonstrates that some repetitive elements are highly expressed in brain and other tissues. This is consistent with our whole transcriptome sequencing data where we found a large number of reads mapping to repeat element sequences.
Figure 2Distribution of reads by repeat family in the frontal cortex. Distribution of the reads aligning to the repeats by family observed in the frontal cortex samples and in simulated reads drawn from the database of coding sequences (random CCDS) and from the repetitive sequence (random repeats). Height of the bars corresponds to average frequencies of the reads aligned to a particular family, the error bars indicate plus/minus standard deviation. Averages and standard deviations for samples are derived from pooling the samples, and for simulated data over 100 simulation runs.
Significance Tests for Observed Repeat Element Transcription
| Repeat Family | Fold Change | p-value | Adjusted for multiple comparisons p-value |
|---|---|---|---|
| Alu_SINE | 0.8162648 | 0.00E+00 | 0.00E+00 |
| ERV | 1.626032 | 0.00E+00 | 0.00E+00 |
| TcMar | 1.1997436 | 3.18E-92 | 4.23E-92 |
| hAT | 0.8364789 | 2.42E-140 | 3.63E-140 |
| DNA | 0.4241692 | 8.22E-50 | 9.86E-50 |
| L1 | 2.1400934 | 0.00E+00 | 0.00E+00 |
| L2 | 0.769354 | 0.00E+00 | 0.00E+00 |
| LTR | 1.6255619 | 2.05E-06 | 2.05E-06 |
| CR1 | 0.6404078 | 5.19E-201 | 1.04E-200 |
| Satellite | 2.8788933 | 2.78E-167 | 4.77E-167 |
| MIR | 0.4463942 | 0.00E+00 | 0.00E+00 |
| Other | 1.1499157 | 3.49E-35 | 3.81E-35 |
We compared our sequencing data to our simulated reads (simulated reads represent what is expected if repeat element transcription was merely at random) and tested for differential expression. The data was modeled using a negative binomial distribution. The fold changes were derived based on this model. Benjamini-Hochberg method was used to correct for multiple testing. P-values adjusted for multiple testing are shown in the last column.