| Literature DB >> 23548000 |
Abstract
BACKGROUND: Bacterial insertion sequences (IS) of IS200/IS605 and IS607 family often encode a transposase (TnpA) and a protein of unknown function, TnpB.Entities:
Year: 2013 PMID: 23548000 PMCID: PMC3627910 DOI: 10.1186/1759-8753-4-12
Source DB: PubMed Journal: Mob DNA
Figure 1Motifs and alignments of Fanzor and TnpB proteins. Conserved amino acids and helix-turn-helix (HTH) domain are marked (above); gray regions indicate the variable N-terminal halves. Numbers above the diagram refer to the residue position in SPu-1-1p or TnpB_IS608. Titles of Fanzor1 proteins are shaded in the alignment.
Species harboring Fanzor sequences
| Metazoa | 13 | | ||
| | 1 | | ||
| Choanoflagellida | | 2 | ||
| Fungi | 3 | | ||
| | 4 | | ||
| | 3 | | ||
| | 3 | | ||
| | 10 | | ||
| | | 1 | ||
| | | 1 | ||
| | | 1 | ||
| | | 1 | ||
| Amoebozoa | 4 | | ||
| | 7 | | ||
| | | 2 | ||
| Chlorophyta | 2 | | ||
| | 5 | | ||
| | 1 | | ||
| Rhodophyta | 1 | | ||
| Stramenopiles | 6 | | ||
| | 1 | | ||
| | 4 | 1 | ||
| | 2 | 1 | ||
| | 1 | 1 | ||
| | 2 | | ||
| | 1 | | ||
| dsDNA virus | 2 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | 1 | | ||
| | | 1 | ||
| | | 3 | ||
| | | 3 | ||
| | | 2 | ||
| | | 2 | ||
| | | 2 | ||
| | | 1 | ||
| 1 |
Figure 2Phylogeny of Fanzor/TnpB proteins. Eukaryotes and eukaryotic viruses are colored as follows: dsDNA viruses (blue), metazoa (yellow), fungi (green), chlorophyta (cyan), rhodophyta (red), stramenopiles (dark red), choanoflagellida (orange), and amoebozoa (pink). TnpB proteins are from the ISfinder database (ISXXX) or GenBank (with accession number). The tree is based on the alignment of a longer region including most of the N-terminal and the C-terminal portions (see Additional file 4).
Figure 3Fanzor proteins in elements. Transposable elements (TEs) are indicated by bars flanked by TA target site duplications (TSDs); the undetermined ends are indicated by dash lines (PUl-5, 6; PCa-1). The triangles at the element ends represent the terminal inverted repeats (TIRs) sequences. The inner arrows indicate the protein coding regions (dashed lines indicate the degenerated coding sequences). The alignments of the 5′ and 3′ TIRs sequences are shown on the right.
Figure 4-elements. (A) CRe-1, 2, 3 consensus sequences and the exons (black boxes). Dotted areas indicate the 5′-ends, approximately 200- bp long, which are 98% identical to those of confirmed Helitrons. Asterisks at the 3′-ends indicate the short homologous regions in CRe-1, 2, 3 and Helitron elements (C, D). The corresponding sequences of the15 example loci (1 to 15) are indicated by solid lines below. Dashed lines mark the internal deletion regions. Nine of them are flanked by target site duplications (TSDs) indicated by small diamonds. Note that locus 1 and 7 include short segments of ‘non-Fanzor’ sequences (gray line) at the 5′-ends. The sequences of the 15 loci are shown in Additional file 8B. (B) Examples of the nine TSD sequences (shaded). Note that the 5′-TSDs are immediately downstream of TTTT tetra-nucleotides. (C) Helitron-1_CRe and non-autonomous Helitrons. (D) The alignment of the 3′-ends of Helitrons and CRe-1,2,3. The 3′ asymmetrical terminal inverted repeats (ATIRs) are boxed. (E) Target specificity of Helitron-1N1_CRe elements. They insert specifically between TTTT and T/C and produce no TSDs. Helitron-1N2_CRe elements also insert after TTTT. Three examples of the pre- and post-insertion sites are shown. (F) The illustration of the 5′-truncation or 3′-overabundance in CRe-1 elements: graphical summary of a NCBI online BLASTN search of the Chlamydomonas reinhardtii genome with the consensus of CRe-1.
Figure 5subgroup and its terminal structure. (A) Structural features of Helitron1 and Helitron2 groups. Helitron1 has only 3′-subterminal hairpin structure [2]. The dark arrows in Helitron2 represent the asymmetric terminal inverted repeats (ATIRs). (B) Phylogeny of the RepHel proteins encoded by Heliron1 and Helitron2 groups. The black square indicates Helitrons in which both termini are known. The alignment of the RepHel proteins is shown in Additional file 9. (C) Terminal sequences from the selected examples of Helitron2 elements. The bases in bold font represent the ATIRs. Pairing nucleotides in the hairpins are shaded in gray. The 5′-end sequences of CRe-1, 2, 3 are similar to that of Helitron-1_CRe (not shown). Note that the RepHel protein is encoded in the opposite direction in Helitron-2_CRe and Helitron-1_DR (marked with #). The compensatory mutations in the complementary segments are highlighted by asterisks at the bottom in the alignment of Helitron-1N1_CQu and Helitron-1N2_CQu.
Figure 6Fanzor1 protein in elements.IS4-type elements in Ectocarpus siliculosus and Ectocarpus siliculosus virus-1. The alignment of terminal inverted repeats (TIRs) is shown on the right. The 8-bp perfect TSDs flanking ESvi-1B are indicated by diamonds. Note that the ESv-2 element is named as ISvEsV1_1 in the ISfinder database, where the encoded Fanzor1 protein is annotated as a passenger protein of unknown function.
Figure 7Fanzor1 protein in elements from (A) and (B) . Similar regions are indicated by the same type of patterns. 5′ and 3′ terminal inverted repeats (TIRs) sequences are shown on the right.
Figure 8Other diverse elements. (A) MCi-2 family (left), and alignment of their 3′-ends (right). Copy numbers of subfamilies are indicated in parentheses. Different 5′ regions are indicated by different patterns. Undetermined termini are indicated by dashed lines. (B) Target site duplications (TSDs) sequences of the MCi-2A and MCi-2D subfamilies. (C) MCi-5 elements and their lengths. Undetermined element ends are indicated by dashed lines. The alignments of the 5′ and 3′ ends are shown on the right. The black triangle above locus-1 and locus-2 indicates small insertions ( approximately 520 bp) relative to locus-3. (D) Relationship between CMe-1A and TE-N2_CMe. (E) Serine recombinases (SR) encoded by some Fanzor2 elements. HARB; Harbinger Tpase.