| Literature DB >> 15186484 |
Abstract
The basic helix-loop-helix proteins are dimeric transcription factors that are found in almost all eukaryotes. In animals, they are important regulators of embryonic development, particularly in neurogenesis, myogenesis, heart development and hematopoiesis.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15186484 PMCID: PMC463060 DOI: 10.1186/gb-2004-5-6-226
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Classification of bHLH proteins by sequence
| Phylogenetic group | Description | Classification according to Murre | Examples of classified proteins (family names) |
| A | Bind to CAGCTG or CACCTG | I, II | MyoD, Twist, Net |
| B | Bind to CACGTG or CATGTTG | III, IV | Mad, Max, Myc |
| C | Bind to ACGTG or GCGTG. Contain a PAS domain | Single-minded, aryl hydrocarbon receptor nuclear translocator (Arnt), hypoxia-inducible factor (HIF), Clock | |
| D | Lack a basic domain and hence do not bind DNA but form protein-protein dimers that function as antagonists of group A proteins | V | ID |
| E | Bind preferentially to N-box sequences CACGCG or CACGAG. Contain an orange domain and a WRPW peptide | VI | Hairy |
| F | Contain an additional COE domain, involved in dimerization and DNA binding | Coe (Col/Olf-1/EBF) |
This classification of bHLH proteins is based on sequence comparisons, E-box binding, conservation of residues in parts of the protein other than the bHLH region and the presence or absence of additional domains. It was first adopted by Atchley and Fitch [13] and extended by Ledent and coworkers [6]. The older classification based on tissue distributions, DNA-binding specificities and dimerization, proposed by Murre and coworkers for a much smaller set of sequences [12], is shown for comparison.
The bHLH protein structures available in the Protein Data Bank (PDB)
| PDB code | Protein Chains | Protein name | Species | Group | SCOP superfamily | CATH homologous superfamily | CATH sequence family |
| 1mdy* | ABCD | MyoD bHLH | Mouse | A | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.1 |
| 1an4 | AB | USF bHLH | Human | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.2 |
| 1an2 | AC | Max bHLHZ | Mouse | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.2 |
| 1hlo | AB | Max bHLHZ | Human | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.2 |
| 1nlw* | BE | Max bHLHZ | Human | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.2 |
| 1nkp* | BE | Max bHLHZ | Human | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.2 |
| 1nkp* | AD | Myc prot-oncogene bHLHZ | Human | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.2 |
| 1am9* | ABCD | SREBP-1a bHLHZ | Human | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.3 |
| 1ukl | CDEF | SREBP-2 HLHZ | Human | B | NC | NC | NC |
| 1a0a* | AB | Pho4 bHLH | B | Helix-loop-helix DNA-binding domain | MyoD basic-helix-loop-helix domain, subunit B | 4.10.280.10.4 | |
| 1nlw* | AD | Mad bHLHZ | Human | B | Helix-loop-helix DNA-binding domain | NC | NC |
The PDB codes and protein names for nine bHLH proteins deposited in the PDB are shown with their superfamily names from the CATH [23] and SCOP [24] classifications and their sequence family numbers from CATH. Max has more than one structure solved, including two complexes (1nkp, Max-Myc and 1nlw, Max-Mad). *Structures shown in Figure 1; NC, the protein is not yet included in the CATH or SCOP protein structure classifications. SREBP, steroid response element binding protein. 'Protein chains' indicates the chain identification letter assigned to individual subunits in the PDB files.
Figure 1Representative structures of bHLH proteins from the Protein Data Bank [22]. In each diagram, the protein is shown as a secondary-structure cartoon and the DNA double helix is shown in stick representation. (a) MyoD bHLH-domain homodimer (PDB code 1mdy). (b) Pho4 bHLH-domain homodimer (1am9). (c) SREBP-1a bHLH-domain homodimer (1aoaC). (d) Max-Mad heterodimer (1nlw). (e) Max-Myc heterodimer (1nkp). (f) Max-Myc heterotetramer (1nkp). In (d-f) the Max HLH monomer is shown in dark gray. The scales are not comparable between different structures.
Functional classes of bHLH proteins
| Phylogenetic class | bHLH family | Example mammalian protein | Function |
| A | MyoD | Myf4 | Myogenic: initiates myogenic programme in many cell types |
| NeuroD | Neurogenic differentiation factor NDF2 | Neurogenic: involved in terminal neurone differentiation | |
| SCL | Tal1 | Hematopoietic: essential for primitive hematopoiesis and invokes enhanced proliferation and differentiation during erythroid development | |
| Hand | dHand | Cardiogenic: regulates the morphogenetic events of asymmetric heart development | |
| B | Myc | c-Myc | Cell proliferation and differentiation; oncogenic |
| Mad | Mad1 | Regulation of cell proliferation | |
| SREBP | SREBP-2 | Cholesterol metabolism | |
| C | Sim | Single-minded 1 (Sim1) | Neurogenic: regulation of midline cell lineage in the central nervous system |
| D | Emc | Id1 | Myogenic and neurogenic: negative inhibition of DNA binding |
| E | Hairy | Hes1 | Neurogenic: restricts differentiation of neurons from neural precursor cells |
| F | Coe | Early B-cell factor (EBF1) | Hematopoietic: essential for B-cell development |
Examples of mammalian proteins and their diverse functions are shown for each phylogenetic group of bHLH proteins. The phylogenetic groups are those indicated in Table 1 and discussed in the text.