| Literature DB >> 25735770 |
Federico Abascal1, Michael L Tress1, Alfonso Valencia1.
Abstract
Transposable elements constitute a large fraction of vertebrate genomes and, during evolution, may be co-opted for new functions. Exonization of transposable elements inserted within or close to host genes is one possible way to generate new genes, and alternative splicing of the new exons may represent an intermediate step in this process. The genes TMPO and ZNF451 are present in all vertebrate lineages. Although they are not evolutionarily related, mammalian TMPO and ZNF451 do have something in common-they both code for splice isoforms that contain LAP2alpha domains. We found that these LAP2alpha domains have sequence similarity to repetitive sequences in non-mammalian genomes, which are in turn related to the first ORF from a DIRS1-like retrotransposon. This retrotransposon domestication happened separately and resulted in proteins that combine retrotransposon and host protein domains. The alternative splicing of the retrotransposed sequence allowed the production of both the new and the untouched original isoforms, which may have contributed to the success of the colonization process. The LAP2alpha-specific isoform of TMPO (LAP2α) has been co-opted for important roles in the cell, whereas the ZNF451 LAP2alpha isoform is evolving under strong purifying selection but remains uncharacterized.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25735770 PMCID: PMC4495291 DOI: 10.1093/bioinformatics/btv132
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Schematic organization of the ZNF451 and TMPO transcripts (A) and the domain organization of the translated protein isoforms (B). The presence of a LAP2alpha domain within ZNF451-L2a is supported by an Hmmscan search against Pfam (e value of 1.5e-08). The LAP2alpha domain of both TMPO and ZNF451 is coded by a large 3-prime exon. The exon includes a polyadenylation signal that prevents the simultaneous incorporation of other exons. In the case of ZNF451, alternatively skipped 3-prime exons code for multiple Zinc fingers. In the case of TMPO, they code for the C-terminal half of the protein, which includes a nuclear membrane attachment trans-membrane helix. The exon coding the LAP2alpha domain translates into amino acid regions of 497 and 506 residues (9114 and 2812 bp) in ZNF451 and TMPO, respectively
Fig. 2.(A) A multiple sequence alignment including the LAP2alpha domains of mammalian LAP2α and ZNF451-L2a and homologous sequences from non-mammalian DIRS1-like elements. The phylogenetic tree in (B) was built with Phyml for the LAP2alpha domains of LAP2α and ZNF451-L2a. Overall dN/dS ratios were calculated with the SLAC method in DataMonkey (see Supplementary Appendix). (C) A schematic structure of the DIRS1-like element from which the LAP2alpha domain originated