Literature DB >> 23061024

Intronic retroelements: Not just "speed bumps" for RNA polymerase II.

Kristel Kaer1, Mart Speek.   

Abstract

Two well-known retroelements, L1 and Alu, comprise about one third of the human genome and are nearly equally distributed between the intergenic and intragenic regions. They carry different regulatory elements and contribute structurally and functionally to the expression of our genes. Recent data also suggest that hundreds of intronic L1s and Alus interfere with the transcription of human genes by inducing intron retention, forcing exonization and cryptic polyadenylation. These novel features can be explained with the RNA polymerase kinetic model and suggest that intronic L1s and Alus are not just "speed bumps" in regulation of RNA polymerase traffic. Here we discuss the complexity of the regulation of gene transcription imposed by intronic retroelements and predict that in addition to transcriptional activity, transcription factor binding and nucleosomal occupancy play a significant role in the transcriptional interference effects of the host genes.

Entities:  

Year:  2012        PMID: 23061024      PMCID: PMC3463474          DOI: 10.4161/mge.20774

Source DB:  PubMed          Journal:  Mob Genet Elements        ISSN: 2159-2543


Long and short interspersed nuclear elements, classified as LINEs (L1, L2, L3) and SINEs (Alus, SVA and other composite elements) are the two most abundant families of type I mobile genetic elements or retroelements, which have been generated by reverse transcription and transposition into mammalian genomes over the last ~170 million years. L1s and Alus are the representative members of these two classes, comprising about 17% and 10% of the human genome, respectively. L1s are the only autonomous and retrotranspositionally competent elements (~100 active copies per individual genome), encoding functional reverse transcriptase. They are involved in trans-mobilization of other, passive or nonautonomous retroelements, such as Alus, SVAs and generation of processed pseudogenes (for a review see ref. 3). When inserted into the genome, retroelements could influence the transcription of the host genes by various ways. For example, they could provide alternative promoters, involved in alternative splicing and termination of transcription. Depending on the location, insertion into coding region (exon) breaking ORF sequence disrupts the normal gene expression. However, insertion into intron (or intergenic region) is generally less damaging and thus may or may not influence the transcription of the host gene. About 7,000 full-length human L1s are nearly equally distributed between intragenic and intergenic regions. Most of them contain broken ORFs and have accumulated many mutations, as a result of which they are no longer able to retrotranspose. Nevertheless, perhaps thousands of them have retained transcriptional activity- and/or contain binding sites for YY1, SOX family members, RUNX3 and other unknown transcription factors (TFs). Similarly to L1, about half of one million Alus are located in introns of genes. Only a small fraction of intra- or intergenic Alus are transcriptionally active. However, activation of many Alus depends on the cell stress (e.g. viral infection and heat shock). Alus are transcribed by RNA polymerase (RNA pol) III, and contain many functional TF binding sites for RNA pol III as well as RNA pol II.- Despite their small size (~300 nt), compared with full-length L1s (~6.0 kb), Alus of different ages (old-J, intermediate-S and young-Y) have accumulated many mutations which resulted in loss of TF binding in some elements or gain in other, overall expanding the number of TF binding sites. Previous studies performed in Deininger and Boeke laboratories, have shown that intronic L1s could act as molecular “rheostats” by inhibiting elongation of the RNA pol II and causing premature transcriptional termination and cryptic polyadenylation within L1 ORF1 and ORF2 sequences of the host gene transcripts. It was shown that L1 sense strand, containing 19 cryptic polyadenylation sites (vs. two of antisense strand) with respect to gene transcription and having A-rich bias (~40%), had a stronger effect than antisense strand. Earlier, it was suggested that evolutionary pressure acting against premature polyadenylation within the L1 sense strand, has contributed to the selection of L1 antisense strand, as this strand occurred about twice more frequently. Another mechanism, by which intronic L1s could interfere with normal gene expression involves exonization. Many intronic L1s contain functional donor and acceptor splice sites (e.g. positions 97 and 791/979 in sense strand, respectively) which in combination with intronic sequences could provide alternative transcripts. When translated, these transcripts, containing only a partial ORF sequences, could produce truncated proteins and thus may alter the normal gene expression. Similarly to L1s, Alus contain many splice sites, either present in consensus sequences, e.g. seven splice donor and 12 acceptor sites in antisense strand and only a few in sense strand, or created by mutations. Therefore, exonization of Alus could affect alternative splicing of the host gene. Indeed, about 5% of alternatively spliced exons are derived from Alu. Also, in some cases a single point mutation in Alu sequence could shift from alternative to constitutive splicing resulting in human disease. We have recently demonstrated that intronic L1s and possibly other retroelements in tandem orientation with respect to gene transcription could cause transcriptional interference (TI) of the host gene by inducing intron retention and forcing exonization and cryptic polyadenylation (Fig. 1). Differently from previous studies,, these effects were observed upstream to L1, depended on the activity of L1 sense and antisense (ASP) promoters, and/or TF binding, distance between the observed TI effect and L1, and the presence or absence of cryptic splice/polyadenylation signals. It was proposed that intronic L1s could interfere with the elongation of RNA pol II by forcing it to slow down or dissociate from the template giving rise to prematurely terminated transcripts. While the RNA pol II complex bound to L1 sense promoter would act as a “sitting duck,” another RNA pol II complex operating on the L1 antisense strand (ASP), would cause head-on-collision with the host RNA pol II according to the mechanisms described for TI. Consistent with the latter, correlation between the L1 ASP activity and TI effect was found for nine genes in different human tissues (our unpublished results). We predicted that a minimum of 100 L1 copies from about one thousand full-length intronic tandemly oriented L1s may be involved in the TI of the host genes. Our preliminary bioinformatic analysis also revealed that from the total of 55 Alus analyzed in 123 genes, 39 tandemly orientated intronic Alus possibly caused intron retention in 31 cases and exonization in seven cases. These (39) Alus had the following age distribution: AluJ-23%, AluS-67% and AluY-10%. The remaining Alus (15) in convergent orientation showed seven and one events of intron retention and exonization, respectively. This result shows that tandemly arranged Alus, like L1s, are about five times more efficient in causing TI than convergent Alus. The exact reason for this strand bias is not known. We suspect that TF binding and nucleosomal structure may be somehow involved (see below). Interestingly, seven of the 27 single exon protein-coding nested genes in tandem orientation showed effects on TI, while only one of the 23 convergently oriented similar genes was effective, suggesting that orientation bias is characteristic also for protein-coding genes. It remains to be shown how many intronic Alus could potentially induce TI effects in the human genome. Despite the lack of orientational bias, the extremely large number of intronic Alus and their high TF binding potential suggests that hundreds, if not thousands of Alus could participate in the TI of the host genes.

Figure 1. Transcriptional interference induced by human retroelements. L1 and Alu positioned in the nucleosomal structure located in the intron of the hypothetical gene fragment act not only as “speed bumps” for RNA pol II, but also interfere with the host gene transcription by causing intron retention, forcing exonization and cryptic polyadenylation in their upstream region (characteristic structures 2 and 4 are shown at bottom). Also, in some instances, intron retention and exonization could include part of the retroelement (structures 3 and 5, respectively) and prematurely terminated transcripts may or may not contain poly(A) tail (structures 2 and 4). The observed TI effects depend on the L1 SP-ASP and Alu transcriptional activity, TF binding and the presence or absence of cryptic splice sites and polyadenylation signals upstream to or within the retroelements. TI effect may be also influenced by the nucleosome occupation in the L1 or Alu region. For the sake of simplicity, only two nucleosomes aligned to the 5′ region of the L1 and Alu are shown. Exons 1–3 are shown as colored boxes and their preferential location in nucleosomal regions is marked with small nucleosomes. Red arrow refers to the position, where TI effects occur. For each RNA pol II complex direction of transcription is shown with arrow.

Figure 1. Transcriptional interference induced by human retroelements. L1 and Alu positioned in the nucleosomal structure located in the intron of the hypothetical gene fragment act not only as “speed bumps” for RNA pol II, but also interfere with the host gene transcription by causing intron retention, forcing exonization and cryptic polyadenylation in their upstream region (characteristic structures 2 and 4 are shown at bottom). Also, in some instances, intron retention and exonization could include part of the retroelement (structures 3 and 5, respectively) and prematurely terminated transcripts may or may not contain poly(A) tail (structures 2 and 4). The observed TI effects depend on the L1 SP-ASP and Alu transcriptional activity, TF binding and the presence or absence of cryptic splice sites and polyadenylation signals upstream to or within the retroelements. TI effect may be also influenced by the nucleosome occupation in the L1 or Alu region. For the sake of simplicity, only two nucleosomes aligned to the 5′ region of the L1 and Alu are shown. Exons 1–3 are shown as colored boxes and their preferential location in nucleosomal regions is marked with small nucleosomes. Red arrow refers to the position, where TI effects occur. For each RNA pol II complex direction of transcription is shown with arrow. Sela et al. suggested that insertion of the retroelements into introns can lead to activation of certain alternatively spliced exons. According to their definition, also originally proposed by Parma et al., exonization is a creation of new exon as a result of mutation(s) in intronic sequence. In evolutionary terms, it is believed that this important driving force could enhance the divergence and speciation. However, the question here is: Do we only need a point mutation or mutations to get the intronic sequences exonized? We proposed that retroelements could act as roadblocks or “speed bumps” by slowing down the elongating RNA pol II and forcing its action to be completed by transcription-coupled splicing and termination factors/processes (Fig. 1). Therefore, retroelements not only reduce the elongation speed and increase RNA pol II pausing, but also force exonization and polyadenylation at cryptic splice sites and polyadenylation signals. All these data are in accordance with the kinetic model of transcription originally proposed by Eperon et al. and subsequently modified by Kornblihtt. According to this model, the use of alternative splice sites depends on the elongation rate of RNA pol II. Under high elongation speed of RNA pol II and in the absence of internal stops, only strong acceptor splice sites are used and exons with weak splice sites are skipped. However, when elongation speed is reduced or inhibited, e.g., with mutant form of RNA pol II or artificial pause sites, then the weak acceptor splice sites become available and can be used. The kinetics of RNA pol II elongation is influenced by nucleosome occupation, histone code and the action of chromatin remodelling factors. With exons enriched in nucleosome structures because of their matching sizes (both ~150 nt) and higher GC content, compared with surrounding introns, RNA pol II has higher binding level toward exons, determined from chromatin immunoprecipitation experiments. Therefore, nucleosomes could serve as transcription “speed bumps,” regulating RNA pol II traffic. Also, in support of this idea, Alexander et al. have demonstrated transient accumulation of RNA pol II around 3′ ends of introns, which in agreement with their experimental data was required for cotranscriptional splicing. It is not clear to what extent intronic retroelements, their transcriptional activity and TF binding are involved in the formation or positioning of nucleosomes. Interestingly, the high level of nucleosome occupancy, visualized by UCSC Genome Browser, was predicted for L1 promoter region (~0.9 kb) and Alus., Both these elements have high GC content (~60–65%). However, only for Alus, but not for L1s, the tendency to form nucleosome structure has been experimentally proved by using DNase footprinting combined with primer extension and micrococcal digestion combined with sequencing. The potential role of L1 or Alu nucleosomal structure in TI has not been studied. Recently, an interesting connection between TI and nucleosome occupancy was revealed by Thebault et al. They showed that nucleosome deposition initiated by transcription from the upstream non-coding RNA gene (SRG1) mediates repression of the downstream tandemly oriented protein-coding gene (SER3) by a promoter occlusion mechanism. It remains to be shown whether intronic retroelements are involved in the nucleosome occupation or depositioning by similar or other mechanisms.
  34 in total

1.  Antisense promoter of human L1 retrotransposon drives transcription of adjacent cellular genes.

Authors:  M Speek
Journal:  Mol Cell Biol       Date:  2001-03       Impact factor: 4.272

2.  Alu-containing exons are alternatively spliced.

Authors:  Rotem Sorek; Gil Ast; Dan Graur
Journal:  Genome Res       Date:  2002-07       Impact factor: 9.043

3.  Nested genes in the human genome.

Authors:  Peng Yu; Dalong Ma; Mingxu Xu
Journal:  Genomics       Date:  2005-10       Impact factor: 5.736

4.  Chromatin, transcript elongation and alternative splicing.

Authors:  Alberto R Kornblihtt
Journal:  Nat Struct Mol Biol       Date:  2006-01       Impact factor: 15.369

5.  Identification, characterization, and cell specificity of a human LINE-1 promoter.

Authors:  G D Swergold
Journal:  Mol Cell Biol       Date:  1990-12       Impact factor: 4.272

6.  Transcription regulation by the noncoding RNA SRG1 requires Spt2-dependent chromatin deposition in the wake of RNA polymerase II.

Authors:  Philippe Thebault; Geneviève Boutin; Wajid Bhat; Anne Rufiange; Joseph Martens; Amine Nourani
Journal:  Mol Cell Biol       Date:  2011-01-10       Impact factor: 4.272

7.  Nucleosome positioning by human Alu elements in chromatin.

Authors:  E W Englander; B H Howard
Journal:  J Biol Chem       Date:  1995-04-28       Impact factor: 5.157

8.  Unit-length line-1 transcripts in human teratocarcinoma cells.

Authors:  J Skowronski; T G Fanning; M F Singer
Journal:  Mol Cell Biol       Date:  1988-04       Impact factor: 4.272

9.  RNA truncation by premature polyadenylation attenuates human mobile element activity.

Authors:  Victoria Perepelitsa-Belancio; Prescott Deininger
Journal:  Nat Genet       Date:  2003-11-16       Impact factor: 38.330

10.  Many LINE1 elements contribute to the transcriptome of human somatic cells.

Authors:  Sanjida H Rangwala; Lili Zhang; Haig H Kazazian
Journal:  Genome Biol       Date:  2009-09-22       Impact factor: 13.583

View more
  5 in total

Review 1.  Altering Genomic Integrity: Heavy Metal Exposure Promotes Transposable Element-Mediated Damage.

Authors:  Maria E Morales; Geraldine Servant; Catherine Ade; Astrid M Roy-Engel
Journal:  Biol Trace Elem Res       Date:  2015-03-14       Impact factor: 3.738

Review 2.  Alu elements: an intrinsic source of human genome instability.

Authors:  Catherine Ade; Astrid M Roy-Engel; Prescott L Deininger
Journal:  Curr Opin Virol       Date:  2013-09-27       Impact factor: 7.090

3.  Disease onset in X-linked dystonia-parkinsonism correlates with expansion of a hexameric repeat within an SVA retrotransposon in TAF1.

Authors:  D Cristopher Bragg; Kotchaphorn Mangkalaphiban; Christine A Vaine; Nichita J Kulkarni; David Shin; Rachita Yadav; Jyotsna Dhakal; Mai-Linh Ton; Anne Cheng; Christopher T Russo; Mark Ang; Patrick Acuña; Criscely Go; Taylor N Franceour; Trisha Multhaupt-Buell; Naoto Ito; Ulrich Müller; William T Hendriks; Xandra O Breakefield; Nutan Sharma; Laurie J Ozelius
Journal:  Proc Natl Acad Sci U S A       Date:  2017-12-11       Impact factor: 11.205

4.  Dissecting the Causal Mechanism of X-Linked Dystonia-Parkinsonism by Integrating Genome and Transcriptome Assembly.

Authors:  Tatsiana Aneichyk; William T Hendriks; Rachita Yadav; David Shin; Dadi Gao; Christine A Vaine; Ryan L Collins; Aloysius Domingo; Benjamin Currall; Alexei Stortchevoi; Trisha Multhaupt-Buell; Ellen B Penney; Lilian Cruz; Jyotsna Dhakal; Harrison Brand; Carrie Hanscom; Caroline Antolik; Marisela Dy; Ashok Ragavendran; Jason Underwood; Stuart Cantsilieris; Katherine M Munson; Evan E Eichler; Patrick Acuña; Criscely Go; R Dominic G Jamora; Raymond L Rosales; Deanna M Church; Stephen R Williams; Sarah Garcia; Christine Klein; Ulrich Müller; Kirk C Wilhelmsen; H T Marc Timmers; Yechiam Sapir; Brian J Wainger; Daniel Henderson; Naoto Ito; Neil Weisenfeld; David Jaffe; Nutan Sharma; Xandra O Breakefield; Laurie J Ozelius; D Cristopher Bragg; Michael E Talkowski
Journal:  Cell       Date:  2018-02-22       Impact factor: 66.850

5.  Intergenic Alu exonisation facilitates the evolution of tissue-specific transcript ends.

Authors:  Mojca Tajnik; Alessandra Vigilante; Simon Braun; Heike Hänel; Nicholas M Luscombe; Jernej Ule; Kathi Zarnack; Julian König
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.