| Literature DB >> 23873833 |
E Longobardi1, D Penkov, D Mateos, G De Florian, M Torres, Francesco Blasi.
Abstract
TALE (three amino acids loop extension) homeodomain transcription factors are required in various steps of embryo development, in many adult physiological functions, and are involved in important pathologies. This review focuses on the PREP, MEIS, and PBX sub-families of TALE factors and aims at giving information on their biochemical properties, i.e., structure, interactors, and interaction surfaces. Members of the three sets of protein form dimers in which the common partner is PBX but they can also directly interact with other proteins forming higher-order complexes, in particular HOX. Finally, recent advances in determining the genome-wide DNA-binding sites of PREP1, MEIS1, and PBX1, and their partial correspondence with the binding sites of some HOX proteins, are reviewed. These studies have generated a few general rules that can be applied to all members of the three gene families. PREP and MEIS recognize slightly different consensus sequences: PREP prefers to bind to promoters and to have PBX as a DNA-binding partner; MEIS prefers HOX as partner, and both PREP and MEIS drive PBX to their own binding sites. This outlines the clear individuality of the PREP and MEIS proteins, the former mostly devoted to basic cellular functions, the latter more to developmental functions.Entities:
Keywords: PREP, MEIS, and PBX; TALE proteins; vertebrates
Mesh:
Substances:
Year: 2013 PMID: 23873833 PMCID: PMC4232920 DOI: 10.1002/dvdy.24016
Source DB: PubMed Journal: Dev Dyn ISSN: 1058-8388 Impact factor: 3.780
Figure 1Phylogenesis and basic structure of the TALE proteins. A: Cladogram representing the phylogenetic relationship between the TALE-superfamily members of zebrafish, mouse, and human. The alignment of peptide sequences was performed using “ClustalW” software, the amino acid substitutions per site are indicated in every branch. Individual proteins are identified by the species (Hs, Homo sapiens; Mm, Mus musculus; Dr, Danio rerio). B: Basic features of the TALE proteins. The homeodomain (HD) is conserved throughout the whole group. The amino-terminal regions are family specific, since PBC-A and -B domains are found only in the PBX family while MEIS-A and -B in both Prep and Meis subfamilies. The location of the NES and NLS is also shown.
Correspondence of TALE Gene Symbols Between Danio Rerio, Mus musculus and Homo Sapiens
| Vertebrate family | Vertebrate subfamily | ||||
|---|---|---|---|---|---|
| Hth | MEINOX | Prep (pKnox) | Prep1.1 Prep1.2 | Prep1 | PREP1 |
| Hth | MEINOX | Prep (pKnox) | Prep2 | Prep2 | PREP2 |
| Hth | MEINOX | Meis | Meis1 Meis4.1a | Meis1 | MEIS1 |
| Hth | MEINOX | Meis | Meis2.1 Meis2.2 | Meis2 | MEIS2 |
| Hth | MEINOX | Meis | Meis3 | Meis3 | MEIS3 |
| Exd | PBC | Pbx | Pbx1a | Pbx1 | PBX1 |
| Exd | PBC | Pbx | Pbx2 | Pbx2 | PBX2 |
| Exd | PBC | Pbx | Pbx4 (lazarus) | Pbx3 | PBX3 |
| Exd | PBC | Pbx | Pbx3 | Pbx3 | PBX3 |
| Exd | PBC | Pbx | Pbx4 | PBX4 |
No Danio rerio ortholog of mammalian Pbx4 (Wagner et al., 2001) has been identified. In fact lazarus, although called Pbx4 (Pöpperl et al., 2000), is an ortholog of mammalian Pbx3 and its function during embryonic development is correlated to the function of mammalian Pbx1.
Gene Structure, Chromosomal Location, Transcript, and Protein Length1
| Gene | Chromosome | Gene length (kbp) | Transcript length (bp) | Protein length (aa) | 5′UTR (bp) | N of exons | 3′UTR (bp) | N Isoforms |
|---|---|---|---|---|---|---|---|---|
| 21 | 59 | 5,003 | 436 | 188 | 11 | 3,498 | 1 | |
| 17 | 41 | 2,583 | 436 | 145 | 11 | 1,083 | 1 | |
| 9 | 18 | 2,350 | 435 | 22 | 11 | 116 | 3 | |
| 1 | 17 | 1,293 | 431 | NK | 10 | NK | 2 | |
| 11 | 269 | 3,730 | 472 | 314 | 13 | 2,057 | 1 | |
| 9 | 256 | 3,488 | 474 | 223 | 13 | 1,840 | 1 | |
| 10 | 19 | 1,206 | 401 | NK | 8 | NK | 1 | |
| 2 | 137 | 3,180 | 415 | 382 | 13 | 1,550 | 1 | |
| 11 | 138 | 3178 | 390 | 729 | 13 | 1,276 | 2 | |
| 13 | 79 | 2,258 | 391 | 383 | 14 | 1,700 | 2 | |
| 15 | 209 | 3,687 | 381 | 1076 | 13 | 1,465 | 7 | |
| 2 | 203 | 3,718 | 401 | 1026 | 13 | 1,486 | 7 | |
| 20 | 21.3 | 3,104 | 393 | 694 | 14 | 1,184 | 1 | |
| 17 | 99 | 3,169 | 390 | 544 | 13 | 1,511 | 1 | |
| 19 | 16 | 2,149 | 421 | 447 | 13 | 436 | 1 | |
| 7 | 11 | 1,781 | 378 | 177 | 13 | 467 | 2 | |
| 15 | 22 | 1,110 | 369 | NK | 10 | NK | 1 | |
| 1 | 292 | 6,635 | 430 | 188 | 9 | 5,159 | 2 | |
| 1 | 272 | 1,293 | 430 | NK | 9 | NK | 2 | |
| 2 | 32 | 1,029 | 342 | NK | 7 | NK | 3 | |
| 6 | 5.4 | 3,213 | 430 | 271 | 9 | 1,649 | 1 | |
| 17 | 5.1 | 3,068 | 430 | 292 | 9 | 1,477 | 1 | |
| 16 | 24 | 1,212 | 403 | NK | 8 | NK | 1 | |
| 9 | 220 | 2,823 | 434 | 67 | 9 | 1,450 | 6 | |
| 2 | 200 | 2,393 | 434 | 117 | 9 | 972 | 4 | |
| 8 | 137 | 1,053 | 350 | NK | 7 | NK | 3 | |
| 19 | 57 | 1,459 | 374 | 24 | 8 | 290 | 1 | |
| 8 | 40 | 2,251 | 378 | 38 | 8 | 1,076 | 2 | |
| 3 | 50 | 834 | 278 | NK | 5 | NK | 1 |
The length of the transcript refers to the longest transcript present in the data bank, which is also the most abundant form. Details: the data derived from ENSEMBL database (23.11.2007) except where specified. The length of the genes is counted from the starting point of the transcription. The length of the 5′ UTR may not be not accurate because in most genes the exact starting point of transcription has not been experimentally determined. In many genes also the 3′ UTR has not been established. Hs, Homo sapiens; Mm, Mus musculus; Dr, Danio rerio. NK, not known.
Based on direct determination of the transcription start site (Bernardi et al., 2010).
Figure 2Conserved domains in human Meis and Prep proteins. A: Comparison of the main domains in human Meis1, Meis2, Meis3, Prep1, Prep2, and Drosophila hth proteins. On top the aminoacid sequence of Meis-A domain. The list on the right indicates the corresponding protein. In the middle, the Meis-B domain; at the bottom, the HD domain. In Meis-A domain, the binding sequences for Pbx1, P160, and 4EHP are highlighted in green. Accession numbers are: hMeis1: O00470; hMeis2: O14770; hMeis3: Q99687; Hth: NP_476578; hPrep1: NP_004562; hPrep2: CAD01142.
Figure 3Sequence comparison of human Pbx proteins and Drosophila Exd. Alignment of full-length proteins was performed using ClustalW2 software. Stars indicate conserved amino acids. Magenta, the PBC-A domain; blue, the PBC-B domain; green, the HD domain Accession numbers are: Pbx1: AAG30941; Pbx2: P40425; Pbx3: P40426; Pbx4: CAC28212; Exd: AAC37182.1
Sequence Characterization of Human Pbx Proteins1
| Length of protein or protein domain | PBX1a | PBX2 | PBX3a | PBX4 |
|---|---|---|---|---|
| Protein | 430 | 430 | 434 | 330 |
| PBC-A | 75 | 75 | 75 | 51 |
| PBC-B | 88 | 88 | 88 | 88 |
| HD | 60 | 60 | 60 | 60 |
| Pbx2 | 69/75 | 80/88 | 59/60 | |
| Pbx3a | 72/75 | 86/88 | 60/60 | |
| Pbx4 | 66/75 | 75/88 | 55/60 |
The size of the full-length proteins indicates the longest isoform (like PBX1a and PBX3a) or the most frequent of the known isoforms (in the case of PBX2 and PBX4).
Figure 4The a and b isoforms of human and mouse Pbx1. The “a” forms are 430, the “b” forms 358 (H. sapiens) and 357 (M. musculus) residues long.
Amino Acids Sequence Identity in Human, Mouse, and Dr Meinox Protein Domains
| Protein domain | Human /mouse | |
|---|---|---|
| Prep1 MEIS-A | 31/32 | 26/32 |
| Prep1 MEIS-B | 45/45 | 38/45 |
| Prep1 HD | 60/60 | 57/60 |
| Prep2 MEIS-A | 32/32 | 30/32 |
| Prep2 MEIS-B | 45/45 | 44/45 |
| Prep2 HD | 60/60 | 60/60 |
| Meis1 MEIS-A | 40/40 | 39/40 |
| Meis1 MEIS-B | 45/45 | 45/45 |
| Meis1 HD | 60/60 | 59/60 |
| Meis2 MEIS-A | 40/40 | 39/40 |
| Meis2 MEIS-B | 45/45 | 44/45 |
| Meis2 HD | 60/60 | 60/60 |
| Meis3 MEIS-A | 45/46 | 32/51 |
| Meis3 MEIS-B | 44/45 | 44/45 |
| Meis3 HD | 60/60 | 59/60 |
The data refer to D. rerio R Prep1.1.
For this comparison, the sequence of D. rerio Meis2.1 has been used.
PREP1 and MEIS1 Interactors
| PREP1 interactor | Nuclear/cytoplasmic | Interacting site | Reference | |
|---|---|---|---|---|
| PREP1 motif | Interactor motif | |||
| PBX | N | 63LFP | PBCA | Berthelsen et al., 1998 |
| Smads 2,3,4 | N | Bailey et al., | ||
| Pax6 | N | Mikkola et al., | ||
| p160MBP | N | 63LFP | LXXLL domains, N terminus | Diaz et al., |
| 4EHP | N | 58 | Villaescusa et al., | |
| RFX3 | (2HS) | Ravasi et al., | ||
| ZBTB3 | (2HS) | Ravasi et al., | ||
| TLX1 | (2HS) | Ravasi et al., | ||
| β ACTIN | N | Diaz et al., | ||
| NMMHCIIA | N | Diaz et al., | ||
| Oct1 | N | HD | Rave-Harel et al., | |
| p54/Nrb/NonO | N | Ferrai et al., | ||
| PSF | N | Diaz et al., | ||
| PBX | N | aa 64-202 | PBCA | Chang et al., |
| HOX9-13 | N | C terminal | Williams et al., | |
| PKA | C | C terminal | Huang et al., | |
| TLX1 | (2HS) | Ravasi et al., | ||
| TLX2 | (2HS) | Ravasi et al., | ||
| EMX2 | (2HS) | Ravasi et al., | ||
In bold are the residues whose mutation into Ala drastically decrease binding (Diaz et al., 2007a; Villaescusa et al., 2009).
In these proteins it is not known whether they interact with Prep/Meis or Pbx.
Mammalian 2-Hybrid system.
Pbx Interactors
| Interactor | Notes | Reacting site in PBX | Reacting site in the interactor | Reference |
|---|---|---|---|---|
| Clustered homeodomain transcription factors | HD, C terminal | Hexapeptide (usually YPWMX), | Piper et al., | |
| Pbx regulating protein-1 (pKnox1) | PBC-A, N terminal | 63LFPLLALL70 | Berthelsen et al., 1998 | |
| Pbx regulating protein-2 (pKnox2) | PBC-A, N terminal | HR1, N terminal | Fognani et al., | |
| Myeloid Ecotropic Insertion site-1 | N terminal (aa 1–88) | N terminal (aa 30–60) | Chang et al., | |
| Necessary for induction of muscle differentiation. | HD, C terminal | CL-X-W motif | Knoepfler et al., | |
| Forms ternary complex with Pbx and Prep1/Meis1. | HD, C terminal | Hexapeptide (FPWMK), N terminal | Peers et al., | |
| Involved in Pbx1 retention in the cytoplasm | PBC-B | Coiled-coil tail (aa 161–177) | Huang et al., | |
| Same interaction as for clustered Hox | HD, C terminal | Hexapeptide (FPWME), N terminal | Allen et al., | |
| Inhibits the formation of the Pbx-Hox complexes on DNA | HD (aa 160–230) | Abramovich et al., | ||
| Prevents HOXA9-Pbx binding to DNA | aa 215–241 | Laurent et al., | ||
| Forms a DNA binding complex with Pbx3 | HD, C terminal | Rhee et al., | ||
| Phosphorylate-specific serines in the PBC-B domain | PBC-B, N terminal | Kilstrup-Nielsen et al., | ||
| HDACs recruits to Pbx a corepressor complex containing N-CoR/SMRT and mSIN3B | N terminal (aa 89–172) | Saleh et al., 2000 | ||
| Coactivator of Pbx1b | C terminal | Rave-Harel et al., | ||
| Transcription factor important for neural crest–derived tissue and mesenchymal mesoderm cells. Inhibits Pbx1a transactivation. | Berry et al., | |||
| Accessory splicing factor playing a role in actin-dependent transcription-regulating role of Hox | N terminal | Ferrai et al., | ||
| 2HS | Ravasi et al., | |||
| 2HS | Mikkola et al., 2000 | |||
| 2HS. HNF-4a coactivator; myosin head domain containing 1; thyroid hormone receptor interactor 3 | Ravasi et al., | |||
| 2HS. GABA-A receptor-associated membrane protein 2; palmitoyltransferase | Ravasi et al., | |||
| Intracellular mediators of TGFbeta signaling. | Bailey et al., |
Hox A9 and Hox 10 do not have a classical YPWMX hexapeptide.
2HS: Mammalian 2 hybrid system.
General Rules Drawn From the Prep1, Meis1+2 and Pbx1 ChIPseq in E10.5 Embryo Trunks1
| Prep1 | Meis | Pbx1 | |
|---|---|---|---|
| Promoter | Intra and intergenic promoters and enhancers | Depends on Prep or Meis | |
| Pbx1 | Pbx1 and Hox | Prep1, Meis1+2 and Hox | |
| 5.58 | NA | 12.6 | |
| NA | 7.17 | 19 | |
| Decanucleotide | Octanucleotide and hexanucleotide | Undefined when alone. Decanucleotide (Prep1–Pbx1) octanucleotide and hexanucleotide (Meis1+2-Pbx1). | |
| 2.9 | 11 | 4.4 | |
| 3.9 | 15.3 | 6.0 | |
| 98 | NA | 43 |
Based on Penkov et al. (2013).
Based on a P value of 10<10−6.
The data on HoxC9 and HoxA2have been obtained from specific tissues whereas those of Prep1, Meis 1 and Pbx1 have been obtained in the whole embryo. Therefore, the overlaps must not be considered absolute as they may exist only in certain tissues and not in others. However, the basic rule derived, i.e., that many Meis-binding sites are in fact Hox-binding sites, remains very likely.
Numbers represent the degree of overlap of thymocytes Prep1 and Pbx2 binding sites with those of Prep1 and Pbx2 in total E10.5 embryo trunk. The data are based on a ChIP-on-ChIP analysis with anti-Prep1 and anti-Pbx2 antibodies of mouse thymocytes in which Prep1 and Pbx2 represent the largely predominant form of TALE protein expressed.