| Literature DB >> 26811739 |
Xiao Chen1, Laura F Landweber2.
Abstract
BACKGROUND: Transposable elements are a major player contributing to genetic variation and shaping genome evolution. Multiple independent transposon domestication events have occurred in ciliates, recruiting transposases to key roles in cellular processes. In the ciliate Oxytricha trifallax, the telomere-bearing elements (TBE), a Tc1/mariner transposon, occupy a significant portion of the germline genome and are involved in programmed genome rearrangements that produce a transcriptionally active somatic nucleus from a copy of the germline nucleus during development.Entities:
Keywords: Ciliates; Genome rearrangement; Purifying selection; Transposable element; Transposon domestication
Year: 2016 PMID: 26811739 PMCID: PMC4724952 DOI: 10.1186/s13100-016-0057-9
Source DB: PubMed Journal: Mob DNA
Fig. 1Phylogeny of sampled Oxytricha TBE genes and orthologs identified in three other stichotrich ciliates. a Schematic map of TBE transposons. Gray arrows represent terminal inverted repeats (TIR). Orange arrows represent ORFs encoded by TBEs. b Phylogeny constructed with TBE 42kD transposases (29 TBE1, 27 TBE2.1, 26 TBE2.2 and 25 TBE3 42kD protein sequences). Clades formed by TBE1, TBE2 and TBE3 are labeled accordingly. TBE2.1 representatives are indicated in red and TBE2.2 in blue. Internal branches supported by posterior probability higher than 0.9 are colored in green. c Phylogeny constructed with TBE 22kD ORFs (32 TBE1, 39 TBE2.1, 30 TBE2.2 and 28 TBE3 22kD protein sequences). Colors are as above. d Phylogeny constructed with TBE 57kD ORFs (27 TBE1, 26 TBE2.1, 23 TBE2.2 and 21 TBE3 57kD protein sequences). Clades formed by TBE1, TBE2.1, TBE2.2 and TBE3 are labeled accordingly; colors as above. The multiple sequence alignment was produced with MAFFT v6.956b and trimmed with trimAl v1.2 to remove excess gaps and poorly aligned regions. The unrooted Bayesian trees were produced with MrBayes v3.2.2 [35]. The three TBE orthologs are 1: Sterkiella histriomuscorum; 2: Tetmemena sp.; 3: Laurentiella sp.. All posterior probability values are above 0.5. The scale below the phylogeny illustrates branch substitutions per site
Genomic distribution of TBEs in the Oxytricha germline genome
| Complete | Partial | Total | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # | Length (Mb) | % of complete TBEs by length | TIR | Near MDSs | # | Length (Mb) | % of partial TBEs by length | Length (Mb) | % of all TBEs by length | ||||
| 2 | 1 | 0 | # | % total copies | |||||||||
| TBE1 | 2502 | 9.9 | 24.75 % | 2005 | 228 | 269 | 521 | 28.2 % | 6216 | 6.5 | 24.8 % | 16.4 | 24.8 % |
| TBE2.1 | 2484 | 10.0 | 25 % | 2166 | 58 | 260 | 354 | 19.2 % | 3129 | 3.9 | |||
| TBE2.2 | 1087 | 4.3 | 10.75 % | 916 | 30 | 141 | 197 | 10.7 % | 1146 | 1.3 | |||
| TBE2 | 3571 | 14.3 | 35.75 % | 3082 | 88 | 401 | 551 | 29.9 % | 9898 (TBE2.1 + TBE2.2 + unclassified partial TBE2) | 10.5 | 40.1 % | 24.8 | 37.5 % |
| TBE3 | 3946 | 15.8 | 39.5 % | 3148 | 358 | 440 | 773 | 41.9 % | 8588 | 9.2 | 35.1 % | 25.0 | 37.7 % |
| Total | 10,019 | 40.0 | 100 % | 8235 | 674 | 1110 | 1845 | 100 % | 24,702 | 26.2 | 100 % | 66.2 | 100 % |
Pairwise percent protein sequence similarity of TBE genes
| 42kD | 22kD | 57kD | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TBE1 | TBE2.1 | TBE2.2 | TBE3 | TBE1 | TBE2.1 | TBE2.2 | TBE3 | TBE1 | TBE2.1 | TBE2.2 | TBE3 | |
| TBE1 | 89.3 ± 4.4 | 74.2 ± 3.6 | 74.9 ± 3.3 | 67.1 ± 3.1 | 90.4 ± 4.2 | 67.1 ± 3.6 | 68.5 ± 3.6 | 65.3 ± 3.4 | 86.4 ± 5.2 | 46.3 ± 2.7 | 49.5 ± 2.8 | 36.4 ± 1.7 |
| TBE2.1 | 89.3 ± 4.3 | 89.3 ± 4.2 | 67.9 ± 3.2 | 89.5 ± 5.1 | 87.6 ± 5.2 | 64.4 ± 3.4 | 87.2 ± 6.0 | 53.5 ± 3.1 | 34.5 ± 1.6 | |||
| TBE2.2 | 90.8 ± 4.5 | 68.7 ± 2.9 | 89.8 ± 4.8 | 65.5 ± 3.1 | 86.1 ± 5.9 | 38.2 ± 1.8 | ||||||
| TBE3 | 90.9 ± 4.8 | 89.9 ± 5.4 | 86.1 ± 7.5 | |||||||||
Features of Oxytricha TBEs and complete TBEs identified in other stichotrich genomes
| Terminal inverted repeat (TIR) (Underline for telomeric repeat) | Target site duplication | Distance (bp) between (mode, % of mode) | |||
|---|---|---|---|---|---|
| TIR/42kD | 57kD/TIR | ||||
|
| TBE1 |
| ANT | 3 | 41 |
| TAAGTGCTTTGATTTGTAGGGAATTTGTTA | 97.7 % | 86.5 % | |||
| GGGGTTGGGGTTATTAAT (78 bp) | |||||
| TBE2.1 |
| ANT | 0 | 23 | |
| TTGAGTTTTTGATTGATAAAAGTAGACTAT | 99.1 % | 82.3 % | |||
| TAGTGCATACTTTATTAGGGTTTTAATAGG | |||||
| GTTTATGTAGGGGTTTAATGTTTAAATATT | |||||
| AGTAATTTAAGTGAGTAT (138 bp) | |||||
| TBE2.2 |
| ANT | 21 | 44 | |
| TTGAGTTTTTGATTGATAAAAGTAGACTAT | 96.4 % | 86 % | |||
| TAGTGCATACTTTATTAGGGTTTTAATAGG | |||||
| GTTTATGTAGGGGTTTAATGTTTAAAT (117 bp) | |||||
|
| ANT | 26 | 49 | ||
| TTGAGTTTTTGATTGATGAAAGTAGACTAT | 97.6 % | 92.1 % | |||
| TAACGCATGCTTTATTAGGGTTTTAATAGG | |||||
| GTTTATGTAGGGGTTTAGGGTT (112 bp) | |||||
| TBE3 |
| ANT | 17 | 54 | |
| TAAGTGCTTTGATTTGTAGGGTATAGTTGG | 93.3 % | 84.6 % | |||
| GGTCTTATTGGGGTTAGTAGAGAAA (85 bp) | |||||
|
|
| ANT | 0 | 28 | |
| TATGAGTTTTTGATTGTGTTGGGATTATTA | |||||
| GTGTTTATTAGGGTTTATTAATAATTGGGG | |||||
| TTAGTACACAAA (102 bp) | |||||
|
|
| ANT | −4 | −6 | |
| AAAGTGAAAATAAGTTGTGTATAATTAATT | |||||
| TCTTTATTAATACTTATAATCATGC (85 bp) | |||||
|
|
| ANT | 1 | 11 | |
| TTAATAATAAGAATTGATATATATTAATTT | |||||
| CATAATTATCAACGTTTTTAGAGTAATTAA | |||||
| ACTGCGATGAGTTATATAAA (110 bp) | |||||
Prevalence of premature stop codons and frameshifts
| 42kD | 22kD | 57kD | ||||
|---|---|---|---|---|---|---|
| Stop codon | Frameshift | Stop codon | Frameshift | Stop codon | Frameshift | |
| TBE1 | 30.4 % | 76.8 % | 14.9 % | 59.0 % | 38.1 % | 87.1 % |
| TBE2.1 | 29.6 % | 79.1 % | 23.1 % | 62.7 % | 45.5 % | 86.2 % |
| TBE2.2 | 29.4 % | 74.8 % | 16.8 % | 60.8 % | 36.7 % | 87.3 % |
| TBE3 | 84.7 % | 75.7 % | 90.7 % | 56.3 % | 35.8 % | 81.3 % |
Fig. 2TBE substitution rate variation. Box plots represent dN/dS values for the three TBE-encoded genes (with or without premature stop codons or frameshifts) among the four TBE families. The numbers of ORFs analyzed are summarized in Additional file 2. *For TBE3 42kD and 22kD genes, since very few sequences lack frameshifts or premature stop codons, we permitted the presence of the most frequent stop codons at residue 70 (42kD protein) and residue 186 (22kD protein) for the “No frameshift/stop codon” group