| Literature DB >> 16698959 |
Anton Buzdin1, Elena Kovalskaya-Alexandrova, Elena Gogvadze, Eugene Sverdlov.
Abstract
We developed a technique called GREM (Genomic Repeat Expression Monitor) that can be applied to genome-wide isolation and quantitative analysis of any kind of transcriptionally active repetitive elements. Briefly, the technique includes three major stages: (i) generation of a transcriptome wide library of cDNA 5' terminal fragments, (ii) selective amplification of repeat-flanking genomic loci and (iii) hybridization of the cDNA library (i) to the amplicon (ii) with subsequent selective amplification and cloning of the cDNA-genome hybrids. The sequences obtained serve as 'tags' for promoter active repetitive elements. The advantage of GREM is an unambiguous mapping of individual promoter active repeats at a genome-wide level. We applied GREM for genome-wide experimental identification of human-specific endogenous retroviruses and their solitary long terminal repeats (LTRs) acting in vivo as promoters. Importantly, GREM tag frequencies linearly correlated with the corresponding LTR-driven transcript levels found using RT-PCR. The GREM technique enabled us to identify 54 new functional human promoters created by retroviral LTRs.Entities:
Mesh:
Year: 2006 PMID: 16698959 PMCID: PMC3303178 DOI: 10.1093/nar/gkl335
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Schematic representation of solitary (left) and proviral (right) LTRs expression. The transcription driven from 5′ proviral LTRs results in mRNAs of viral genes, whereas the expression of either solitary or 3′ proviral LTRs results in the transcription of host genomic sequences, flanking the 3′ ends of the retroelements.
Figure 2Schematic representation of the GREM technique (for details, see text). The procedure includes three major stages: (Stage 1) genome-wide amplification of the genomic DNA flanking the 3′ ends of target repetitive elements (here, HS LTRs). Treatment of the resulting amplicon with ExoIII generates 5′ protruding ends to be used at the third stage. (Stage 2) A double-stranded oligo d(T)-primed cDNA library is synthesized for tissues where expression of repetitive elements is to be studied. At this stage cDNAs are tagged by a linker oligonucleotide (CS) at the RNA transcription start sites using the ‘cap-switch’ effect. cDNAs are then digested with AluI restriction endonuclease that has no recognition sites within HS LTRs. This step precludes amplification of LTR sequences read-through in the sense orientation. (Stage 3) Finally, the genomic DNA amplicon (Stage 1) is hybridized to the 5′ tagged cDNAs (Stage 2). The protruding DNA ends are filled in with DNA polymerase, and the hybrids obtained (ELTs) are nested PCR amplified with primers specific to the flanking genomic DNA adapter and cDNA 5′ terminal tag sequence, respectively.
Genome-wide map of promoter active HS LTRs with relative contents of expressed LTR tags (ELTs) for human testicular parenchyma
| LTR ID | GenBanka | Genomic location | ELT content (%)b | Status of LTR | Human specificity (+/−) |
|---|---|---|---|---|---|
| 1 | AL359965 | 1p32.3 | 0.25 | Solitary | + |
| 2 | AL356379 | 1p34.2 | 0.50 | Solitary | − |
| 3 | AL355480 | 1p34.1 | 0.50 | Solitary | + |
| 4 | AL139421 | 1p22.1 | 0.50 | Solitary | + |
| 5 | AL135927 | 1q22 | 5.57 | Solitary | + |
| 6 | AL353807 | 1q23.1 | 0.25 | 3′ proviral | + |
| 7 | AC011811 | 2q37.1 | 0.50 | Solitary | + |
| 8 | AC074019 | 2q36.3 | 0.50 | Solitary | + |
| 9 | AC069420 | 3q27.2 | 32.66 | 3′ proviral | + |
| 10 | AC025548 | 3p21.31 | 0.25 | Solitary | + |
| 11 | AC024626 | 4 | 0.25 | Solitary | + |
| 12 | AC118278 | 4p16.3 | 1.77 | Solitary | + |
| 13 | AC110373 | 4q26 | 1.27 | Solitary | − |
| 14 | AC010267 | 5q23.1 | 0.25 | Solitary | + |
| 15 | AC116309 | 5p13.3 | 2.53 | 3′ proviral | + |
| 16 | AC026424 | 5q13.3 | 0.25 | Solitary | + |
| 17 | AC008648 | 5q35.1 | 0.25 | Solitary | + |
| 18 | AC016577 | 5q33.3 | 1.52 | 3′ proviral | + |
| 19 | AL139090 | 6q15 | 1.01 | Solitary | + |
| 20 | AL009179 | 6p22.1 | 0.50 | Solitary | − |
| 21 | AC026010 | 6q23.2 | 0.76 | Solitary | + |
| 22 | AL451165 | 6p21.31 | 15.19 | Solitary | + |
| 23 | AL138889 | 6p21.31 | 0.25 | Solitary | + |
| 24 | AL590543 | 6q25.1 | 0.25 | Solitary | − |
| 25 | AL589643 | 6q21 | 0.25 | Solitary | + |
| 26 | AC023201 | 6q25.1 | 0.25 | Solitary | + |
| 27 | AL353588 | 6p21.1 | 0.25 | Solitary | − |
| 28 | AC069335 | 7q34 | 1.01 | Solitary | − |
| 29 | AC021973 | 8q24.3 | 0.25 | 3′ proviral | + |
| 30 | AC120036 | 8q11.21 | 0.50 | Solitary | + |
| 31 | AF235103 | 8q24.3 | 0.50 | Solitary | − |
| 32 | AC015640 | 9p22.2 | 0.50 | Solitary | + |
| 33 | AL162412 | 9q21.12 | 0.50 | Solitary | + |
| 34 | AL353766 | 9q31.2 | 0.50 | Solitary | + |
| 35 | AC068707 | 10q11.21 | 1.27 | Solitary | − |
| 36 | AL392107 | 10q24.2 | 0.25 | 3′ proviral | + |
| 37 | BC001407 | 10q21.3 | 2.79 | Solitary | + |
| 38 | AP002754 | 11q12.2 | 0.25 | Solitary | − |
| 39 | AP002513 | 11q13.4 | 1.01 | Solitary | + |
| 40 | AP002793 | 11q12.13 | 0.50 | Solitary | − |
| 41 | AP003385 | 11q13.2 | 0.25 | Solitary | + |
| 42 | AC002350 | 12q24.11 | 0.25 | 3′ provirus | + |
| 43 | U47924 | 12p13.31 | 0.76 | Solitary | + |
| 44 | AL135901 | 13q14.2 | 0.25 | Solitary | − |
| 45 | AC055861 | 15q26.3 | 0.50 | Solitary | + |
| 46 | AC026817 | 15q22.2 | 0.50 | Solitary | + |
| 47 | AC068213 | 15q22.31 | 0.76 | Solitary | + |
| 48 | AC018768 | 16p13.2 | 0.25 | Solitary | + |
| 49 | AC012175 | 16p13.3 | 0.25 | Solitary | + |
| 50 | AC012146 | 17p13.2 | 1.52 | Solitary | + |
| 51 | AC008996 | 19q12 | 0.25 | 3′ proviral | + |
| 52 | AL109748 | 21q11.2 | 0.50 | Solitary | + |
| 53 | AC007326 | 22q11.21 | 0.25 | 3′ proviral | + |
| 54 | AL109653 | Xq27.3 | 0.25 | Solitary | + |
| 5′ proviral LTRs | 14.94 | 5′ proviral |
aGenBank accession number corresponding to the LTR; not applicable to 5′ proviral LTRs line.
bThe relative ELT content calculated as a ratio of the number of tags for each individual HS LTR to the total number (395) of all ELTs in the sequenced library.
Relative LTR transcript levels and frequency of occurrence of the corresponding ELTs for testicular parenchyma
| LTR ID | Transcript levela (percentage of the beta-actin gene transcript level) | ELT frequency (%) |
|---|---|---|
| 4 | 0.26 ± 0.09 | 0.50 |
| 9 | 2.9 ± 0.2 | 32.66 |
| 12 | 0.16 ± 0.05 | 1.77 |
| 16 | 0.17 ± 0.03 | 0.25 |
| 17 | 0.02 ± 0.005 | 0.25 |
| 18 | 0.24 ± 0.03 | 1.52 |
| 22 | 1.4 ± 0.4 | 15.19 |
| 24 | 0.032 ± 0.013 | 0.25 |
| 27 | 0.13 ± 0.04 | 0.25 |
| 37 | 0.35 ± 0.06 | 2.79 |
| 38 | 0.16 ± 0.04 | 0.25 |
| 43 | 0.059 ± 0.014 | 0.76 |
| 47 | 0.12 ± 0.02 | 0.76 |
| 55 | 0.004 ± 0.001 | 0 |
| 56 | 0 | 0 |
| 57 | 0 | 0 |
| 58 | 0.24 ± 0.03 | 3.61 |
| 59 | 0.013 ± 0.004 | 0 |
| 60 | 0.01 ± 0.003 | 0 |
| 5′ proviral LTRs | 1.08 ± 0.09 | 14.94 |
aRelative transcript levels measured by RT–PCR.
Genomic primer sets used for PCR amplification
| Name | Sequence (5′–3′) | Accessiona |
|---|---|---|
| Oligonucleotides used for GREM procedure | ||
| (1) LTR specific primers | ||
| LTRfor1 | gtcttgtgaccctgacacatcc | — |
| LTRfor2 | cctccatatgctgaacgctg | — |
| LTRfor3 | ggggcaacccacccctac | — |
| (2) Suppression adapter oligonucleotides | ||
| A1A2 | gtaatacgactcactatagggcagtcgacgcgtgcccggtccgac | — |
| A3 | gtcggaccgggc | — |
| A1 | gtaatacgactcactatagggc | — |
| A2 | agtcgacgcgtgcccggtccgac | — |
| A4 | tcgacgcgtgcccggtccgacct | |
| (3) Oligonucleotides used for cap-switch based cDNA amplification | ||
| CDS | aagcagtggtatcaacgcagagtac(t)30 | — |
| riboCS | taacaacgcagagtacgcrgrgrg | — |
| CS | taacaacgcagagtacgcgg | — |
| Primers used for RT–PCR experiments | ||
| (1) LTR specific primers | ||
| LTRfor1 | gtcttgtgaccctgacacatcc | — |
| LTRfor2 | cctccatatgctgaacgctg | — |
| LTRfor3 | ggggcaacccacccctac | — |
| (2) Unique genomic primers specific for LTR 3′ flanking regions | ||
| gLTR55 | taagtggatataattactaagtccagg | AC068381 |
| gLTR38 | ccaacatctgtctcttccctg | AP002754 |
| gLTR22 | gaccatttgcatggacaaatc | AL451165 |
| gLTR9 | ccatcccttccatgccttag | AC069420 |
| gLTR56 | agctttgtggattgtaatttgg | AC072054 |
| gLTR4 | ctcagtaaagatgaaggtatgacaag | AL139421 |
| gLTR57 | gaggcagaggttgcagtgagcc | AC002400 |
| gLTR16 | ataaaggagaaatcttccatgaag | AC026424 |
| gLTR17 | tgtgacggtataatggcctct | AC008648 |
| gLTR58 | ggttatgaataaagttccctcgg | AC027750 |
| gLTR59 | agaatagagcgaacagacacag | AL352982 |
| gLTR24 | aggttattgatacattgcatcgac | AC023201 |
| gLTR12 | caataacagtcattctcactggag | AC118278 |
| gLTR27 | gagttgggatgtggtcttagg | AL353588 |
| gLTR60 | ctcatgctaaactgtctgattatgc | AC105049 |
| gLTR37 | ttgtgcaaactgtctacagcca | BC001407 |
| gLTR18 | aacatacaggttgaggccagg | AC016577 |
| gLTR47 | ttgtagctgaccaacagcctgc | AC068213 |
| gLTR43 | ttaggccagggtctcactgag | U47924 |
| (3) HERV-K (HML-2) proviral gene gag specific primer | ||
| Gag rev | aatggcccaatcattccata | — |
aGenBank accession numbers of corresponding LTRs from non-redundant and high throughout genome sequence databases.