Literature DB >> 20889727

Primate and rodent specific intron gains and the origin of retrogenes with splice variants.

Michal W Szcześniak, Joanna Ciomborowska, Witold Nowak, Igor B Rogozin, Izabela Makałowska.   

Abstract

Retroposition, a leading mechanism for gene duplication, is an important process shaping the evolution of genomes. Retrogenes are also involved in the gene structure evolution as a major player in the process of intron deletion. Here, we demonstrate the role of retrogenes in intron gain in mammals. We identified one case of "intronization," the transformation of exonic sequences into an intron, in the primate specific retrogene RNF113B and two independent "intronization" events in the retrogene DCAF12L2, one in the common ancestor of primates and rodents and another one in the rodent lineage. Intron gain resulted from the origin of new splice variants, and both genes have two transcript forms, one with retained intron and one with the intron spliced out. Evolution of these genes, especially RNF113B, has been very dynamic and has been accompanied by several additional events including parental gene loss, secondary retroposition, and exaptation of transposable elements.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20889727      PMCID: PMC3002245          DOI: 10.1093/molbev/msq260

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


The majority of protein-coding genes in eukaryotes are interrupted by introns that are removed from the pre-mRNA by a RNA–protein complex called the spliceosome (Cavalier-Smith 1985; Crick 1979). Introns and the splicing machinery have been found in all eukaryotic species with fully sequenced genomes (Chow et al. 1977; Roy and Gilbert 2006). Comparative genomic studies have revealed striking conservation of intron positions in distant eukaryotes such as animals and plants (Fedorov et al. 2002; Rogozin et al. 2003; Carmel et al. 2007). On the other hand, many genome-wide comparisons of eukaryotic species demonstrated multiple intron losses and intron gains (Roy et al. 2003; Cho et al. 2004; Qiu et al. 2004; Coulombe-Huntington and Majewski 2007b; Li et al. 2009). However, it was found that intron gain is a very rare event in vertebrate evolution (Loh et al. 2007) and no intron gains into intact conserved coding regions of mammalian genes are known (Roy et al. 2003; Coulombe-Huntington and Majewski 2007a). Comparative gene structure studies have not revealed any intron gain into existing exons in mammals. The only reported new introns were acquired, by and large, by either a fusion of retrogene with host genes or de novo from the genomic environment as a result of new exon capture (O'Neill et al. 1998; Vinckenbosch et al. 2006; Sela et al. 2007; Baertsch et al. 2008; Fablet et al. 2009). Here, we report two retrogenes, RNF113B and DCAF12, where the exon sequence was split by creation of a new intron as the result of mutations and emergence of new splice sites. The introns discovered by us represent cases of intron creation via recruitment of exonic sequence (intronization) proposed by Irimia et al. (2008) and Lahn and Page (1999).

Evolution of Introns

RNF113A is a retrogene encoding a ring finger protein of unknown function and is present in the genomes of all vertebrates. Interestingly, in mammalian genomes, only intronless copy exist, whereas in all other vertebrates, a ten-exon parental gene is present and no retrogenes were detected. Genomic sequence analysis showed that there are two copies of RNF113 in primates, rodents, carnivores, and even-toed ungulates and only one in the genomes of the other mammals we studied. The first copy of RNF113 was retroposed into the intronic region of NDUFA1 gene in the genome of the mammalian ancestor. Following the retroposition, the parental gene was lost. This likely took place before the divergence of Prototheria (Monotremes) and Theria (Marsupials and Placentals) because in the genomes of all species representing these lineages, the multiexon form of RNF113 is absent. After the mammalian radiation the RNF113A retrogene was duplicated, by retropositions or segmental duplications, in several lineages. Analysis of genomic locations of these copies suggests that the duplication events were independent in each lineage. For example, in rodents, the RNF113 copy (RNF113A2) was inserted into an intron of the 2900006K08Rik gene, whereas the primate specific gene, RNF113B, was copied into an intron of the FARP1 gene. The primate specific duplication happened before Old World Monkeys and New World Monkeys diverged (fig. 1).
F

Schematic tree representing major events during evolution of RNF113 gene in mammalian lineage. Color version of the figure can be found in supplementary Data (Supplementary Material online).

Schematic tree representing major events during evolution of RNF113 gene in mammalian lineage. Color version of the figure can be found in supplementary Data (Supplementary Material online). After the retroposition/duplication, the primate specific RNF113B gene underwent rapid evolution including intron gain. The presence of the intron is surprising, however, it is supported by several GenBank mRNA sequences (accession numbers: AF539427, BC025388, and BC017585). To confirm the existence of the intron and learn about its origin, we compared RNF113B sequences from available primate genomes (human, marmoset, macaque, orangutan, and chimpanzee) with sequences of other mammalian RNF113A genes. Sequence alignment revealed that the intron of RNF113B is not a de novo insertion but rather originated from the exonic sequence (fig. 2). A double point mutation, AG → GT, generated the donor site (fig. 2). The origin of acceptor site is not so clear. One possibility is that a point mutation, GG → AG, created acceptor site. Another option is that the acceptor site was brought during the exonization of L1 element, merged at the 3’ end of RNF113B (fig. 2). The newly generated splice sites together with the branch site and the polypyrimidine tract likely enabled recognition of the new intron by the U2 spliceosome (fig. 2). The 105 bp intron contains 59 nucleotides of previously coding sequence and 46 nucleotides from the 3’ UTR.
F

(a) Alignment of mammalian RNF113A and primate RNF113B genomic sequences at the acceptor and donor sites. (b) Structure of human RNF113A mRNA and two splice variants of RNF113B. Color version of the figure can be found in supplementary Data (Supplementary Material online).

(a) Alignment of mammalian RNF113A and primate RNF113B genomic sequences at the acceptor and donor sites. (b) Structure of human RNF113A mRNA and two splice variants of RNF113B. Color version of the figure can be found in supplementary Data (Supplementary Material online). Generation of splice sites most probably occurred in the primate specific RNF113B copy since neither human RNF113A gene, which gave a rise to primate RNF113B, nor RNF113A genes from other mammals have AG or GT at the donor and acceptor positions. Splicing signals were formed before the Old World Monkeys and New Monkeys split. Interestingly, loss of the splicing boundaries subsequently converted the intron into a “retained intron” in some primates. In rhesus, for example, acceptor was lost due to a point mutation (AG → AA change) (fig. 1). The creation of splicing signals was accompanied not only by exaptation of an L1 element but also by exonization of an Alu element. The L1 element inserted within the 3′ end of the gene could have contributed the acceptor site and provided a new polyA signal used for the new splice variant (fig. 2). The complete AluSx element transposed upstream the gene was exapted at the 5′ end and most probably delivered some regulatory elements. Sequencing of the human RNF113B cDNA using primers flanking the intronic sequence revealed that RNF113B produces two variant transcripts. One variant has two exons, as described above, and the other one is a single exon transcript similar to RNF113A. Consequently, most primates have three transcripts of RNF113: one from the RNF113A retrogene and two from the RNF113B (fig. 2). Rodents, cow, and dog have two transcripts, each coming from different copy of RNF113, and all other mammals have only one RNF113 transcript. The presence of the splice variants in the retrogene is very surprising and has only been reported once before (Lahn and Page 1999). A second case involves DCAF12 (DDB1 and CUL4 associated factor 12), which encodes a WD repeat-containing protein that interacts with the COP9 signalosome (Jin et al. 2006). Although the gene is present in vertebrate and insect genomes, only placental mammals have retrocopies of this gene. One copy, DCAF12L2, has the same location in all placental mammals and therefore most likely was retroposed in the placental mammals ancestor. Another copy, DCAF12L1, is present only in Euarchontoglires (a clade which includes rodents and primates). It likely emerged as a result of tandem duplication of DCAF12L2 as it is located next to the DCAG12L1 gene. There were two events that changed the splicing pattern in DCAF12L2. First, an intronization event occurred in the common ancestor of primates and rodents. Second, an alternative donor site emerged in rodents only (fig. 3). The limited available data and sequence divergence make any conclusions in regard to the exact pattern of splice site evolution infeasible. However, there is convincing experimental evidence confirming both splicing events (fig. 3): splicing at the shared rodent–primate intron, boundaries are confirmed by two expressed sequence tags (ESTs) (AK034343 and AK047360), and usage of the rodent alternative donor site is confirmed by four ESTs (AK038557, BC068319, AK034472, and AK039767).
F

Pattern of DCAF12 duplication and “intronization” events in mammalian genomes. Color version of the figure can be found in supplementary Data (Supplementary Material online).

Pattern of DCAF12 duplication and “intronization” events in mammalian genomes. Color version of the figure can be found in supplementary Data (Supplementary Material online).

Retrogene Expression

Numerous studies revealed a tendency of retrogenes to be expressed exclusively in testis. It was suggested that the hypertranscription present in the meiotic and postmeiotic spermatogenic cells makes possible transcription of DNA that is usually not transcribed. This may facilitate transcription of retrocopies in the testis during their early evolution (reviewed in (Kaessmann et al. 2009). Another hypothesis explains the high expression of retrogenes in testis by the fact that these are, in most cases, retrocopies of spermatogenesis-related genes located on the X chromosome. Because the X chromosome is inactivated during meiosis, retroposition to autosomes enables escape from inactivation and expression during spermatogenesis (Turner 2007). The retroposition of both genes studied here, RNF113 and DCAF12, was in the opposite direction, from autosomes to chromosome X. In the case of RNF113, the parental gene is detectable by sequence similarity as an apparent pseudogene on chromosome 9. The parental multiexon DCAF12 gene is coincidentally also located on chromosome 9. RNF113A and both DCAF12 retrogenes are on chromosome X. We surveyed the expression pattern of all human RNF113 transcripts (one from RNF113A and two from RNF113B) in 16 human tissues (fig. 4) (for methods, see Supplementary Material online). RNF113A was expressed in all studied tissues, including testes. Interestingly, RNF113B exhibited tissue-specific splicing; while the unspliced form of RNF113B was expressed in all tissues but testis, the spliced variant was expressed in testis, prostate, thymus, and lung. Both RNF113B splice variants were present in thymus, prostate, and lung, but in all of these tissues, the form with the intron spliced out had much lower expression level than the single exon primary form. Relatively high expression of the new form of RNF113B, form with the intron spliced out, was observed only in testis.
F

Expression pattern of RNF113A and two forms of RNF113B (195 bp product with intron spliced; 295 bp product-form with intron retained) in 16 human tissues: 1: heart, 2: brain, 3: placenta, 4: lung, 5: liver, 6: skeletal muscle, 7: kidney, 8: pancreas, 9: spleen, 10: thymus, 11: prostate, 12: testis, 13: ovary, 14: small intestine w/o mucosal lining, 15: colon, 16: peripheral leukocytes, P: genomic DNA, and N: water.

Expression pattern of RNF113A and two forms of RNF113B (195 bp product with intron spliced; 295 bp product-form with intron retained) in 16 human tissues: 1: heart, 2: brain, 3: placenta, 4: lung, 5: liver, 6: skeletal muscle, 7: kidney, 8: pancreas, 9: spleen, 10: thymus, 11: prostate, 12: testis, 13: ovary, 14: small intestine w/o mucosal lining, 15: colon, 16: peripheral leukocytes, P: genomic DNA, and N: water. According to the EST data, the human DCAF12 gene is widely expressed. EST sequences present in the dbEST database represent almost 40 libraries and show the highest expression in testis and trachea. The retrogene DCAF12L1 is expressed only in kidney and testis and a second human retrogene, DCAF12L2, is expressed in eye and testis. Therefore, both retrogenes show very different expression patterns than their parental genes, with very limited and low expression level and notable expression in testis.

Conclusions

Retroposition, a major mechanism for gene duplication, is an important process shaping the evolution of genomes (Brosius 1991; Marques et al. 2005). Our study confirms the unusual role of retrogenes in shaping the genomes and underscores the importance of mobile elements in evolution. It also reveals that retrogenes may be responsible for a wealth of species-specific features including species-specfic introns and splice variants. Previous analyses of introns in the vertebrate genomes did not uncover any intron gain in mammals (Roy et al. 2003). Our study clearly shows that creation of introns has occurred during mammalian evolution. The failure of previous studies to find intron gains can be explained by the fact that they were focused on different intron gain mechanisms and did not consider exon intronization. In addition, they looked at conserved among studied species genes, while we focused on young and in many cases lineage-specific retrogenes. Interestingly, the retrogenes studied here exhibit testis-specific expression typically associated with genes escaping from the X chromosome despite their opposite history (retroposition from autosome to X). This biased expression pattern may not be exclusively related to meiotic genes, sex chromosome inactivation, and dosage compensation (Marques et al. 2005; Vinckenbosch et al. 2006; Potrzebowski et al. 2008). The same pattern of high expression level in testis is observed in young, primate-specific splice variant of retrogene RNF113B as well as in both retroposed copies of DCAF12 retroposed on the human X chromosome. The older, unspliced variant of RNF113B, as well as an earlier retrocopy RNF113A, displays more diverse expression patterns. Therefore, testis-specific expression could be a common feature of all newly evolved transcripts regardless of their chromosomal localization and may reflect a transcriptional noise due to “hypertranscription” in testis, facilitating the activation of new transcripts (Kleene et al. 1998). The small number of observed intron gain in retrogenes may reflect that this is a rare event. Alternatively, the low number of observations could reflect the difficulties in identification of such events. One major complication lies in annotation problems and the common expectation that retrogenes do not have introns. Genome-wide comparative studies currently underway have already showed that intron gain in retrogenes could be more frequent than we expected but that annotations remain a major obstacle in uncovering this phenomenon.

Supplementary Material

Supplementary Data are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
  28 in total

1.  A family of diverse Cul4-Ddb1-interacting proteins includes Cdt2, which is required for S phase destruction of the replication factor Cdt1.

Authors:  Jianping Jin; Emily E Arias; Jing Chen; J Wade Harper; Johannes C Walter
Journal:  Mol Cell       Date:  2006-09-01       Impact factor: 17.970

2.  Three distinct modes of intron dynamics in the evolution of eukaryotes.

Authors:  Liran Carmel; Yuri I Wolf; Igor B Rogozin; Eugene V Koonin
Journal:  Genome Res       Date:  2007-05-10       Impact factor: 9.043

3.  Origin of introns by 'intronization' of exonic sequences.

Authors:  Manuel Irimia; Jakob Lewin Rukov; David Penny; Jeppe Vinther; Jordi Garcia-Fernandez; Scott William Roy
Journal:  Trends Genet       Date:  2008-07-01       Impact factor: 11.639

4.  Characterization of intron loss events in mammals.

Authors:  Jasmin Coulombe-Huntington; Jacek Majewski
Journal:  Genome Res       Date:  2006-11-15       Impact factor: 9.043

5.  Intron loss and gain in Drosophila.

Authors:  Jasmin Coulombe-Huntington; Jacek Majewski
Journal:  Mol Biol Evol       Date:  2007-10-27       Impact factor: 16.240

6.  Investigation of loss and gain of introns in the compact genomes of pufferfishes (Fugu and Tetraodon).

Authors:  Yong-Hwee Loh; Sydney Brenner; Byrappa Venkatesh
Journal:  Mol Biol Evol       Date:  2007-12-17       Impact factor: 16.240

Review 7.  Meiotic sex chromosome inactivation.

Authors:  James M A Turner
Journal:  Development       Date:  2007-02-28       Impact factor: 6.868

8.  Retrocopy contributions to the evolution of the human genome.

Authors:  Robert Baertsch; Mark Diekhans; W James Kent; David Haussler; Jürgen Brosius
Journal:  BMC Genomics       Date:  2008-10-08       Impact factor: 3.969

9.  Comparative analysis of transposed element insertion within human and mouse genomes reveals Alu's unique role in shaping the human transcriptome.

Authors:  Noa Sela; Britta Mersch; Nurit Gal-Mark; Galit Lev-Maor; Agnes Hotz-Wagenblatt; Gil Ast
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

10.  Chromosomal gene movements reflect the recent origin and biology of therian sex chromosomes.

Authors:  Lukasz Potrzebowski; Nicolas Vinckenbosch; Ana Claudia Marques; Frédéric Chalmel; Bernard Jégou; Henrik Kaessmann
Journal:  PLoS Biol       Date:  2008-04-01       Impact factor: 8.029

View more
  27 in total

1.  Reverse transcriptase and intron number evolution.

Authors:  Kemin Zhou; Alan Kuo; Igor V Grigoriev
Journal:  Stem Cell Investig       Date:  2014-09-28

2.  Novel mutation and three other sequence variants segregating with phenotype at keratoconus 13q32 susceptibility locus.

Authors:  Marta Czugala; Justyna A Karolak; Dorota M Nowak; Piotr Polakowski; Jose Pitarque; Andrea Molinari; Malgorzata Rydzanicz; Bassem A Bejjani; Beatrice Y J T Yue; Jacek P Szaflik; Marzena Gajecka
Journal:  Eur J Hum Genet       Date:  2011-11-02       Impact factor: 4.246

3.  Emergence and evolution of Zfp36l3.

Authors:  Timothy J Gingerich; Deborah J Stumpo; Wi S Lai; Thomas A Randall; Scott J Steppan; Perry J Blackshear
Journal:  Mol Phylogenet Evol       Date:  2015-10-19       Impact factor: 4.286

Review 4.  Origin and evolution of spliceosomal introns.

Authors:  Igor B Rogozin; Liran Carmel; Miklos Csuros; Eugene V Koonin
Journal:  Biol Direct       Date:  2012-04-16       Impact factor: 4.540

5.  CRL4-DCAF12 Ubiquitin Ligase Controls MOV10 RNA Helicase during Spermatogenesis and T Cell Activation.

Authors:  Tomas Lidak; Nikol Baloghova; Vladimir Korinek; Radislav Sedlacek; Jana Balounova; Petr Kasparek; Lukas Cermak
Journal:  Int J Mol Sci       Date:  2021-05-20       Impact factor: 5.923

Review 6.  Identifying the mechanisms of intron gain: progress and trends.

Authors:  Paul Yenerall; Leming Zhou
Journal:  Biol Direct       Date:  2012-09-10       Impact factor: 4.540

7.  Mechanisms of intron loss and gain in the fission yeast Schizosaccharomyces.

Authors:  Tao Zhu; Deng-Ke Niu
Journal:  PLoS One       Date:  2013-04-17       Impact factor: 3.240

8.  Newly evolved introns in human retrogenes provide novel insights into their evolutionary roles.

Authors:  Li-Fang Kang; Zheng-Lin Zhu; Qian Zhao; Li-Yong Chen; Ze Zhang
Journal:  BMC Evol Biol       Date:  2012-07-28       Impact factor: 3.260

9.  "Orphan" retrogenes in the human genome.

Authors:  Joanna Ciomborowska; Wojciech Rosikiewicz; Damian Szklarczyk; Wojciech Makałowski; Izabela Makałowska
Journal:  Mol Biol Evol       Date:  2012-10-12       Impact factor: 16.240

10.  Surprisingly high number of Twintrons in vertebrates.

Authors:  Jessin Janice; Marcin Jąkalski; Wojciech Makałowski
Journal:  Biol Direct       Date:  2013-01-28       Impact factor: 4.540

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.