| Literature DB >> 27809783 |
Thomas Pauli1, Lucia Vedder2, Daniel Dowling3, Malte Petersen4, Karen Meusemann4,5,6, Alexander Donath4, Ralph S Peters7, Lars Podsiadlowski8, Christoph Mayer4, Shanlin Liu9,10, Xin Zhou11,12, Peter Heger13, Thomas Wiehe13, Lars Hering14, Georg Mayer14, Bernhard Misof4, Oliver Niehuis15.
Abstract
BACKGROUND: Body plan development in multi-cellular organisms is largely determined by homeotic genes. Expression of homeotic genes, in turn, is partially regulated by insulator binding proteins (IBPs). While only a few enhancer blocking IBPs have been identified in vertebrates, the common fruit fly Drosophila melanogaster harbors at least twelve different enhancer blocking IBPs. We screened recently compiled insect transcriptomes from the 1KITE project and genomic and transcriptomic data from public databases, aiming to trace the origin of IBPs in insects and other arthropods.Entities:
Keywords: Arthropod evolution; Comparative transcriptomic analyses; Gene evolution; Insulator binding proteins
Mesh:
Substances:
Year: 2016 PMID: 27809783 PMCID: PMC5094011 DOI: 10.1186/s12864-016-3205-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of all currently known insulator binding proteins (IBPs) in Drosophila melanogaster, with information on the Pfam symbol of the conserved protein domain families found in the respective proteins with the corresponding references
| Insulator binding protein | Conserved domains | Reference |
|---|---|---|
| CTCF | zf-C2H2 [11] | [ |
| Su(Hw) | zf-C2H2 [12] | [ |
| Pita | zf-AD [1], zf-C2H2 [10] | [ |
| ZIPIC | zf-C2H2 [7] | [ |
| Zw5 | zf-C2H2 [8] | [ |
| CP190 | BTB [1], zf-C2H2 [4] | [ |
| GAF | BTB [1], GAGA [1] | [ |
| Mod (mdg4) | BTB [1], FLYWCH [1] | [ |
| BEAF-32 | zf-BED [1], BESS [1] | [ |
| Ibf1 | zf-BED [1] | [ |
| Ibf2 | zf-BED [1] | [ |
| Elba-complex (Elba 1,2,3) | BEN [1] (Elba 1,2), none (Elba 3) | [ |
The number of repeats of each conserved domain in the respective protein is given in square brackets
Fig. 1Evolution of enhancer-blocking insulator proteins (IBP) in arthropods. Gains of IBPs are indicated by blue stars, potential losses by red crosses. Blue circles in the table indicate the presence of transcripts of a given IBP in transcriptomes. Multiple blue circles indicate the number of copies found. Transparent circles indicate the putative absence of transcripts of an IBP. The backbone tree topology is adopted from [38]. The phylogenetic relationships among outgroup taxa (i.e., Onychophora, Tardigrada, and Nematoda) are according to [62]
Results from analyzing dN/dS ratios in genes encoding insulator proteins in insects
| Gene | Branch | lnL0 | lnL1 | LRT |
|
|---|---|---|---|---|---|
| CP190 | Crustacea | −5495.527 | −5495.388 | 0.278 | 0.598 |
| CP190 | Holometabola | −5495.284 | −5495.260 | 0.047 | 0.828 |
| CTCF | Onychophora | −1314.281 | −1310.626 | 7.308 | 0.007 |
| CTCF | Holometabola | −1311.997 | −1312.166 | 0.338 | 0.561 |
| GAF | Acerentomon | −2006.810 | −2006.810 | 0.0 | 1.000 |
| GAF | Holometabola | −2006.810 | −2006.810 | 0.0 | 1.000 |
| Mod (mdg4) | Polyneoptera | −15377.060 | −15377.060 | 4.000∙10–6 | 0.998 |
| Mod (mdg4) | Holometabola | −15374.903 | −15373.888 | 2.032 | 0.154 |
| Pita | Hymenoptera | −1054.403 | −1046.840 | 15.13 | <0.001* |
| Su (Hw) | Holometabola | −11052.401 | −11052.210 | 0.383 | 0.536 |
Shown are the gene name and the branch, along which the respective selection model was tested, the log-likelihood for the neutral model (lnL0) and for positive selection (lnL1), the likelihood ratio test statistic (LRT), and the associated p-value. Branches on which the positive selection model fits significantly better than the neutral selection model are indicated by *. Bonferroni corrected significance threshold was α = 0.005. The degree of freedom (df) was 1 for all tests
BUSCO assessment for completeness of the 100 1KITE transcriptomes
| Complete [%] | Fragmented [%] | Missing [%] | |
|---|---|---|---|
| Min | 15.3 | 3.8 | 14.7 |
| 1st Qu. | 49.0 | 9.3 | 22.4 |
| Median | 57.9 | 11.0 | 30.7 |
| Mean | 57.3 | 11.0 | 31.8 |
| 3rd Qu. | 68.6 | 12.5 | 37.9 |
| Max | 81.2 | 19.0 | 72.5 |
Given are the proportions of complete, fragmented and missing BUSCO genes