| Literature DB >> 19443853 |
Eugene A Gladyshev1, Irina R Arkhipova.
Abstract
In the course of sequencing telomeric chromosomal regions of the bdelloid rotifer Adineta vaga, we encountered an unusual DNA transposon. Unlike other bdelloid and, more generally, eukaryotic transposable elements (TEs), it exhibits similarity to prokaryotic insertion sequences (ISs). Phylogenetic analysis indicates that this transposon, named IS5_Av, is related to the ISL2 group of the IS5 family of bacterial IS elements. Despite the apparent intactness of the single open reading frame coding for a DDE transposase and the perfect identity of its 213-bp terminal inverted repeats (TIRs), the element is present in only one copy per diploid genome. It does not exhibit any detectable levels of transcription, so that its transposase gene appears to be silent in the bdelloid host. Although horizontal transfers of TEs between kingdoms are not known to happen in nature, it appears likely that IS5_Av underwent integration into the A. vaga genome relatively recently, but was not successful in adapting to the new host and failed to increase in copy number. Alternatively, it might be the only known member of a novel eukaryotic DNA TE superfamily which is so rare that its other members, if any, have not yet been identified in eukaryotic genomes sequenced to date.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19443853 PMCID: PMC7187635 DOI: 10.1093/molbev/msp104
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FStructure (A) and genomic environment (B) of IS5_Av. (A) Within the transposase (TPase) ORF, the positions 788; 884; 1,040; and 1,714 denote the TTG, GTG, ATG, and TGA codons, respectively. Also shown are the positions of the DDE catalytic residues, and the primers used for PCR amplification of the internal region between TIRs and for RT-PCR shown in figure 2 (RT-F1, RT-R1) (half arrows). A small gray box denotes a putative 63-aa ORF extension in case of a −1 frameshift. Hypothetical 10-bp secondary TIRs are shown by tiny arrowheads. Primers for the outermost part of TIRs (unnamed half arrows) failed to yield PCR products, apparently due to PCR interference with the DNA secondary structure. Scale bar, 100 bp. (B) A 40-kb region (region 63135…103135 from GenBank accession no. EU643477) near the Adineta vaga telomere K_A (designated by the letter T), with nine TE insertions (table 2) and five protein-coding ORFs, of which histidine-ammonia lyase (HAL, 3′ truncated) is presumably of bacterial origin, acyl-CoA synthetase (3′ truncated) is of metazoan origin, and the rest is of uncertain origin. Coding sequences with transcriptional orientation toward the telomere are shown above the line, whereas those with transcriptional orientation away from the telomere are shown below the line. Two AATAAA signals in the hobo 3′ UTR are denoted by letters A. The intergenic region scanned for TSD–TIR combinations is denoted by a thick line. Scale bar, 1 kb.
BlastP Similarity Scores for the IS5_Av Query Used to Search the Nonredundant GenBank Database (A) and the IS Finder Database (http://www-is.biotoul.fr/) (B)
| Sequences Producing Significant Alignments: | Score (Bits) |
| |||||
|
| |||||||
| ref | YP_594226.1 | transposase, IS4 [Deinococcus geothermalis D… | 81.3 | 1e−13 | |||||
| ref | ZP_02737947.1 | transposase, IS4 [Gemmata obscuriglobus UQ… | 77.8 | 1e−12 | |||||
| ref | ZP_02732909.1 | transposase, IS4 [Gemmata obscuriglobus UQ… | 76.6 | 2e−12 | |||||
| ref | ZP_03176005.1 | putative transposase [ | 75.1 | 7e−12 | |||||
| ref | ZP_03143215.1 | transposase IS4 family protein [Cyanothece… | 70.5 | 1e−10 | |||||
| ref | ZP_03141860.1 | transposase IS4 family protein [Cyanothece… | 69.3 | 3e−10 | |||||
| ref | ZP_03178672.1 | putative transposase [ | 68.6 | 6e−10 | |||||
| ref | ZP_03143523.1 | transposase IS4 family protein [Cyanothece… | 68.6 | 6e−10 | |||||
| ref | ZP_02942299.1 | transposase IS4 family protein [Cyanothece… | 68.6 | 7e−10 | |||||
| ref | ZP_02939447.1 | transposase IS4 family protein [Cyanothece… | 68.2 | 7e−10 | |||||
| ref | ZP_01731099.1 | hypothetical protein CY0110_01550 [Cyanoth… | 68.2 | 9e−10 | |||||
| ref | ZP_02942406.1 | transposase IS4 family protein [Cyanothece… | 67.4 | 1e−19 | |||||
|
| |||||||
| Sequences Producing Significant Alignments | IS Family | Group | Origin | Score (Bits) |
| ||
| ISDge6 | IS5 | ISL2 |
| 77 | 1e−15 | ||
| IS1515 | IS5 | ISL2 |
| 62 | 3e−11 | ||
| IS702 | IS5 | ISL2 |
| 62 | 4e−11 | ||
| ISL2A | IS5 | ISL2 |
| 54 | 1e−08 | ||
| ISL2 | IS5 | ISL2 |
| 54 | 1e−08 | ||
| ISMae4 | IS5 | ISL2 |
| 53 | 2e−08 | ||
| IS493 | IS5 | ISL2 |
| 45 | 5e−06 | ||
| IS1381 | IS5 | ISL2 |
| 42 | 4e−05 | ||
| IS470 | IS5 | ISL2 |
| 42 | 5e−05 | ||
| IS1373 | IS5 | ISL2 |
| 40 | 2e−04 | ||
| IS1381A [V] | IS5 | ISL2 |
| 39 | 3e−04 | ||
| IS112 | IS5 | ISL2 |
| 39 | 4e−04 | ||
NOTE.—Two IS5-derived ORFs from Naegleria gruberi yield the same top hits as IS5_Av. Note that most of the GenBank entries are misannotated as IS4 transposases.
FCopy number (A) and expression (B) of IS5_Av. (A) Ethidium bromide staining (left panels) and Southern blot hybridization with the 32P-labeled IS5_Av probe (right panels) of the Adineta vaga genomic DNA digested with PvuII or XhoI/HpaI, as indicated. (B) RT-PCR with IS5_Av (left) and Ddl_Av (right) forward and reverse primers. Shown are the sizes of the corresponding PCR products, which for the Ddl gene differ by 50 bp due to splicing. RT-, no reverse transcriptase added; RT+, addition of SuperScriptII; DNA, control genomic DNA amplification. M, 1 kb+ ladder (Invitrogen).
Top BlastP Database Hits for Eight TEs from the Adineta vaga Telomere K_A (EU643477) Shown in figure 1
| TE Superfamily (Position) | Top BlastP Hit, Species |
| % Identity (Similarity) | CD Hits | In-Frame Stops/Frameshifts/Indels |
| mariner (66428…68107) | Transposase, | 5e−61 | 36% (54%) | Transposase_1 | 2/1/0 |
| piggyBac (68182…70303) | Transposase, | 9e−89 | 42% (63%) | none | 1/0/0 |
| mariner (71477…71555, 72009…72645) | Avmar1 transposase, | 1e−78 | 78% (83%) | Transposase_1 | 1/1/2 |
| hobo (77307…78830, 80417…81399) | Transposase, | 2e−39 | 24% (44%) | hATC superfamily | 1/1/1 |
| mariner (78833…80416) | Transposase, | 2e−40 | 31% (50%) | none | 0/0/0 |
| IS5 (81508…83527) | Transposase, | 1e−13 | 25% (44%) | Transposase_11 | 0/0/0 |
| LTR retrotransposon ( | Retrotransposon | 1e−06; 0.0; 3e−01 | 21%(44%); 39% (57%) 27% (47%) | Retrotrans_gag; RVP superfamily; RT_LTR (RT-like superfamily); rve superfamily | 0/0/0; 0/2/0; 0/0/0 |
| piggyBac (100845…102137) | Transposase, | 6e−18 | 31% (54%) | none | 6/0/1 |
NOTE.—Also listed are hits to the CD database, and disruptions in ORFs, if any.
FPhylogenetic placement of IS5_Av. Shown is a Neighbor-Joining phylogram including representatives of different groups of the IS5 megafamily, which contains six known groups of the bacterial IS5 family (IS5, IS903, ISH1, IS1031, IS427, and ISL2) and two known eukaryotic IS5-like superfamilies (Harbinger/PIF and IS4EU/ISL2EU). Also shown are the newly designated groups ISL2PR (from various protists) and IS493 (members of which are assigned to ISL2 in the IS Finder database). Bootstrap support values for the major named groups, obtained by Neighbor-joining and minimum evolution methods, respectively, are as follows: ISL2, 48/58; IS493, 90/89; IS5, 100/100; IS903, 100/100; ISH1, 100/100; IS1031, 100/100; IS427, 100/100; ISL2PR, 34/43; IS4EU/ISL2EU, 100/100; and Harbinger/PIF, 51/47. The bacterial IS5, IS903, ISH1, IS1031, and IS427 groups can be joined into a well supported (97/87) supergroup. Support for IS5_Av/Naegleria branch is 51/64. Eukaryotic species are in boldface, and branches leading to them are shown by thick lines. Scale bar, 0.1 amino acid substitutions per site. For multiple sequence alignment, see supplementary data, Supplementary Material online.
List of Putative Donor Species Predicted by EasyGene to Yield a Full-Length Transposase Using an Alternative Start Codon as an Optimal
| Model | Feature | Start | End | Score | Startc | Species | Taxonomy |
| SSW02 | CDS | 788 | 1714 | 8.60e−27 | TTG |
| Cyanobacteria; Chroococcales |
| PMMI02 | CDS | 788 | 1714 | 9.81e−22 | TTG |
| Cyanobacteria; Prochlorales |
| NM02 | CDS | 884 | 1714 | 3.96e−11 | GTG |
| Proteobacteria; Betaproteobacteria; Neisseriales |
| SE02 | CDS | 788 | 1714 | 7.16e−11 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| NO02 | CDS | 788 | 1714 | 1.78e−10 | TTG |
| Cyanobacteria; Nostocales |
| PL01 | CDS | 788 | 1714 | 4.16e−10 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| LP02 | CDS | 788 | 1714 | 5.54e−09 | TTG |
| Firmicutes; Bacilli; Lactobacillales; Lactobacillaceae |
| ECC02 | CDS | 788 | 1714 | 1.66e−08 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| STT02 | CDS | 788 | 1714 | 1.93e−08 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| STY02 | CDS | 788 | 1714 | 4.07e−08 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| ECE03 | CDS | 788 | 1714 | 8.15e−08 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| WDM01 | CDS | 788 | 1714 | 1.12e−07 | TTG |
| Proteobacteria; Alphaproteobacteria; Rickettsiales |
| LM02 | CDSsub | 884 | 1714 | 1.21e−07 | GTG |
| Firmicutes; Bacilli; Bacillales |
| CDI01 | CDS | 788 | 1714 | 1.54e−07 | TTG |
| Actinobacteria; Actinobacteridae; Actinomycetales |
| BS03 | CDS | 788 | 1714 | 4.32e−07 | TTG |
| Firmicutes; Bacilli; Bacillales |
| CT02 | CDS | 788 | 1714 | 4.96e−07 | TTG |
| Chlamydiae/Verrucomicrobia; Chlamydiae; Chlamydiales |
| ECO02 | CDS | 788 | 1714 | 5.17e−07 | TTG |
| Proteobacteria; Gammaproteobacteria; Enterobacteriales |
| XC02 | CDS | 788 | 1714 | 6.53e−07 | TTG |
| Proteobacteria; Gammaproteobacteria; Xanthomonadales |
| BT02 | CDS | 788 | 1714 | 8.86e−07 | TTG |
| Bacteroidetes/Chlorobi group; Bacteroidetes; Bacteroidales |
| VC02 | CDS | 788 | 1714 | 1.41e−06 | TTG |
| Proteobacteria; Gammaproteobacteria; Vibrionales |
| BS03 | CDSsub | 884 | 1714 | 1.49e−06 | GTG |
| Firmicutes; Bacilli; Bacillales |
| CB02 | CDSsub | 788 | 1714 | 2.72e−06 | TTG |
| Proteobacteria; Gammaproteobacteria; Legionellales |
| NE02 | CDS | 788 | 1714 | 3.23e−06 | TTG |
| Proteobacteria; Betaproteobacteria; Nitrosomonadales |
| MBU01 | CDS | 788 | 1714 | 7.20e−06 | TTG |
| Euryarchaeota; Methanomicrobia; Methanosarcinales |
| CB02 | CDS | 884 | 1714 | 7.76e−06 | GTG |
| Proteobacteria; Gammaproteobacteria; Legionellales |
| SO02 | CDS | 788 | 1714 | 8.70e−06 | TTG |
| Proteobacteria; Gammaproteobacteria; Alteromonadales |
| XF02 | CDSsub | 788 | 1714 | 1.83e−05 | TTG |
| Proteobacteria; Gammaproteobacteria; Xanthomonadales |
| BPS01 | CDS | 788 | 1714 | 0.000366 | TTG |
| Proteobacteria; Betaproteobacteria; Burkholderiales |
| PS02 | CDS | 788 | 1714 | 0.000449 | TTG |
| Proteobacteria; Gammaproteobacteria; Pseudomonadales |
| BBA01 | CDS | 884 | 1714 | 0.000589 | GTG |
| Proteobacteria; Deltaproteobacteria; Bdellovibrionales |
| MLO03 | CDS | 788 | 1714 | 0.170785 | TTG |
| Proteobacteria; Alphaproteobacteria; Rhizobiales |
NOTE.—Four species predicted to utilize alternative start codons as suboptimal (CDSsub) are also included.
FCopy number dynamics of DNA TEs (modified after Hartl et al. 1997). The shape of the curve is arbitrary, as is the location of branches giving rise to domesticated or horizontally escaped copies.