| Literature DB >> 25062917 |
Lidia Mateo1, Josefa González2.
Abstract
The centromere is a chromatin region that is required for accurate inheritance of eukaryotic chromosomes during cell divisions. Among the different centromere-associated proteins (CENP) identified, CENP-B has been independently domesticated from a pogo-like transposase twice: Once in mammals and once in fission yeast. Recently, a third independent domestication restricted to holocentric lepidoptera has been described. In this work, we take advantage of the high-quality genome sequence and the wealth of functional information available for Drosophila melanogaster to further investigate the possibility of additional independent domestications of pogo-like transposases into host CENP-B related proteins. Our results showed that CENP-B related genes are not restricted to holocentric insects. Furthermore, we showed that at least three independent domestications of pogo-like transposases have occurred in metazoans. Our results highlight the importance of transposable elements as raw material for the recurrent evolution of important cellular functions.Entities:
Keywords: Drosophila; exaptation; functional domain; holocentric chromosomes; pogo
Mesh:
Substances:
Year: 2014 PMID: 25062917 PMCID: PMC4231638 DOI: 10.1093/gbe/evu153
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FDomain structure of pogo transposase and CR proteins. (A) Human CENP-B (hCENP-B), D. melanogaster pogo transposase (pogoR11), D. melanogaster CAG (CAG), yeast CENP-B homologs (Abp1, Cbh1, Cbh2), and Spodoptera frugiperda CENP-B homolog (SfCENP-B). CENP-B_N domain is shown in red, d1iufa1 in yellow, HTH_Tnp_Tc5 in green, DDE_1 in light blue, DIM in pink, and PCNA in dark blue. Domains predicted by hmmscan are shown as black-lined boxes, the other domains were inferred from experimental evidence. The discontinuous line indicates the deleted region. (B) 3D structure prediction of D. melanogaster CAG DBD using human CENP-B as a template. Z-score = −6.66 and −6.7 for CENP-B and CAG, respectively.
FBiological processes overrepresented in the CAG 2-neighbourhood PPI. Hierarchical representation of the 72 biological process GO terms enriched in the neighbourhood-2 of CAG PPI network. Node colors indicate the level of significance. The overrepresented GO terms were categorized into four groups related to the neigbourhood-1 genes of CAG PPI: Cell cycle and spindle organization, response to stimulus and regulation of metabolic process, nucleic acids metabolism, and protein metabolism. GO terms enriched also in the neighborhood-2 of human CENP-B are represented inside gray boxes.
CR Genes Identified in Holocentric and Nonholocentric Insecta
| Class | Order | Species | Protein Identifier | Protein Sequence Identity | Protein Length | Conserved Protein Domains | |
|---|---|---|---|---|---|---|---|
| HTH_CENP-B_N | HTH_Tnp_Tc5 | ||||||
| Insecta | Diptera | CAG (CG12346) | 100 | 225 | X | X | |
| GD15259 | 97.22 | 111 | X | — | |||
| GM20484 | 94.67 | 225 | X | X | |||
| GE13064 | 92.44 | 225 | X | X | |||
| GG22708 | 93.24 | 207 | X | X | |||
| GF13390 | 59.11 | 222 | X | X | |||
| GA11571 | 57.46 | 228 | X | X | |||
| GL17090 | 58.77 | 228 | X | X | |||
| GK19073 | 40.29 | 222 | X | X | |||
| GJ16124 | 39.81 | 227 | X | X | |||
| Insecta | Lepidoptera | BGIBMGA013031 | 32.09 | 278 | X | X | |
| BGIBMGA008012 | 26.35 | 501 | X | X | |||
| BGIBMGA007903 | 25 | 468 | X | X | |||
| BGIBMGA013624 | 29.63 | 722 | X | X | |||
| Insecta | Lepidoptera | HMEL009793 | 35.38 | 255 | X | X | |
| HMEL010729 | 33.8 | 295 | X | X | |||
| HMEL014790 | 31.55 | 533 | X | X | |||
| HMEL007960 | 50 | 533 | X | X | |||
| HMEL011593 | 35.38 | 192 | X | X | |||
| Insecta | Lepidoptera | 94B11_25* | 22.22# | 488 | X | X | |
| Insecta | Lepidoptera | 72F01* | 23.61# | 488 | X | X | |
| Insecta | Coleoptera | TC003750 | 26.39 | 1175 | X | X | |
| TC001653 | 30 | 486 | X | — | |||
| TC005011 | 51.49 | 212 | — | X | |||
aAll sequences can be downloaded from Ensembl Metazoa except those with an “*” that can be downloaded from LepidoDB.
bProtein sequence identity estimated using BLASTp except for those with an “#” estimated using ClustalW (see Materials and Methods).
FPhylogenetic distribution of pogo-related transposases and transposase-derived genes in metazoans. JR and CR indicate that the sequences belong to the JR clade and the CR clades, respectively. Filled-boxes depict pogo-related transposases and empty boxes depict transposase-derived genes. Numbers in the nodes show posterior probabilities (black) and bootstrap values (red). Shaded branches correspond to new CR proteins identified in this work and in d'Alençon et al 2011 (table 1) that have been incorporated to the previously published phylogeny (Casola et al 2008). Dotted lines represent branches not drawn to scale. Trees including nonmetazoans pogo-related transposases and transposase-derived genes are depicted in supplementary figures S2 and S3, Supplementary Material online.