| Literature DB >> 24094345 |
Peter Heger1, Rebecca George, Thomas Wiehe.
Abstract
Alteration of regulatory DNA elements or their binding proteins may have drastic consequences for morphological evolution. Chromatin insulators are one example of such proteins and play a fundamental role in organizing gene expression. While a single insulator protein, CTCF (CCCTC-binding factor), is known in vertebrates, Drosophila melanogaster utilizes six additional factors. We studied the evolution of these proteins and show here that-in contrast to the bilaterian-wide distribution of CTCF-all other D. melanogaster insulators are restricted to arthropods. The full set is present exclusively in the genus Drosophila whereas only two insulators, Su(Hw) and CTCF, existed at the base of the arthropod clade and all additional factors have been acquired successively at later stages. Secondary loss of factors in some lineages further led to the presence of different insulator subsets in arthropods. Thus, the evolution of insulator proteins within arthropods is an ongoing and dynamic process that reshapes and supplements the ancient CTCF-based system common to bilaterians. Expansion of insulator systems may therefore be a general strategy to increase an organism's gene regulatory repertoire and its potential for morphological plasticity.Entities:
Keywords: Adaptive evolution; BEAF-32; CP190; GAGA factor; Mod(mdg4); Su(Hw); Zw5; barrier element; gene loss; lineage-specific genes
Mesh:
Substances:
Year: 2013 PMID: 24094345 PMCID: PMC4208683 DOI: 10.1111/evo.12155
Source DB: PubMed Journal: Evolution ISSN: 0014-3820 Impact factor: 3.694
Number of collected candidate sequences. Using BLAST searches with a specified cutoff (threshold), the number of arthropod candidate sequences retrieved in total, the number of unique sequences, and the number of sequences retained after clustering are shown. Candidates for a vertebrate GAF were collected from Danio rerio, Homo sapiens, and Strongylocentrotus purpuratus
| Insulator | Threshold | Total | Unique | After clustering |
|---|---|---|---|---|
| CTCF, Su(Hw), Zw5 | 10−05 | 8929 | 4245 | 587 |
| CP190, GAF, Mod(mdg4) | 10−14 | 5166 | 1501 | n. d./727 |
| BEAF-32 | ∞ | 130 | 64 | n. d. |
| Vertebrate GAF | 10−05 | 1135 | 161 | n. d. |
n. d., not determined.
Figure 1Domain structure of D. melanogaster insulator proteins. The domain composition of known D. melanogaster insulator proteins is drawn approximately to scale. Upper row: Factors with poly-zinc finger (ZF) domain. Individual ZFs (red) comprise about 29 AA and are regularly spaced, except for Su(Hw)’s first ZF. Lower row: Three insulator proteins with BTB (∼105 AA, green) s+ ZF domain (red) and BEAF-32 (with zf-BED and BESS domain).
Figure 2Successive gain of insulator proteins during arthropod evolution. (A) Schematic representation of arthropod relationships (after Regier et al. (2010); Simon et al. (2009); Wiegmann et al. (2009); Trautwein et al. (2012)). Green dots highlight the position of major arthropod groups. The birth (*) and loss (†) of each insulator protein is indicated in red. Several clades/orders are omitted for clarity. Ephemeroptera (mayflies) and Odonata (dragonflies) are combined in Palaeoptera. (B) For the taxa shown in A, the number of ESTs and nucleotide sequences deposited at NCBI (http://www.ncbi.nlm.nih.gov; reference day is May 7, 2012) is indicated as logarithmic bar graph, along with the number of available genome sequences (right column). Sequence data are biased toward holometabolous insects and particular crustacean and chelicerate lineages. Zygentoma have less than 1000 sequences (empty bar graph). (C) Phylogenetic mapping of insulator proteins. Presence of a particular insulator protein is indicated by ⊕, absence by ⊖. Absence is only indicated in orders with sufficient EST and genomic data. Results refer to the BLAST/phylogeny-based strategy and are identical for the HMM/OrthoMCL-based strategy, except some minor modifications (Table S4).