| Literature DB >> 12093377 |
Valérie Ledent1, Odier Paquet, Michel Vervoort.
Abstract
BACKGROUND: The basic helix-loop-helix (bHLH) proteins are a large and complex multigene family of transcription factors with important roles in animal development, including that of fruitflies, nematodes and vertebrates. The identification of orthologous relationships among the bHLH genes from these widely divergent taxa allows reconstruction of the putative complement of bHLH genes present in the genome of their last common ancestor.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12093377 PMCID: PMC116727 DOI: 10.1186/gb-2002-3-6-research0030
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
The 44 families of animal bHLH defined by our phylogenetic analyses
| Family name | Number of worm genes | Number of fly genes | Number of mouse genes | Number of human genes | Number of sea squirt genes | Number of pufferfish genes | Group |
|---|---|---|---|---|---|---|---|
| Achaete-Scute a | 4 | 4 | 2 | 2 | 0 | 3 | A |
| Achaete-Scute b | 1 | 0 | 1 | 3 | 1 | 3 | A |
| MyoD | 1 | 1 | 4 | 4 | 1 | 1 | A |
| E12/E47 | 1 | 1 | 4 | 6 | 1 | 1 | A |
| Neurogenin | 1 | 1 | 3 | 3 | 1 | 2 | A |
| NeuroD | 1 | 0 | 4 | 4 | 0 | 5 | A |
| Atonal | 1 | 3 | 2 | 2 | 1 | 4 | A |
| Mist | 0 | 1 | 1 | 1 | 1 | 0 | A |
| Beta3 | 1 or 2* | 1 | 2 | 2 | 0 | 2 | A |
| Oligo | 0 or 1* | 0 | 3 | 3 | 0 | 3 | A |
| Net | 1 | 1 | 1 | 1 | 1 | 0 | A |
| Mesp | 0 | 1 | 3 | 4 | 1 | 2 | A |
| Twist | 1 | 1 | 2 | 1 | 0 | 3 | A |
| Paraxis | 0 | 1 | 2 | 1 | 0 | 2 | A |
| MyoR | 1 | 1 | 2 | 4 | 0 | 3 | A |
| Hand | 1 | 1 | 2 | 2 | 0 | 1 | A |
| PTFa | 0 | 1 | 1 | 1 | 0 | 1 | A |
| PTFb | 1 | 2 | 0 | 1 | 0 | 2 | A |
| SCL | 0 | 1 | 4 | 3 | 0 | 1 | A |
| NSCL | 1 | 1 | 2 | 2 | 0 | 1 | A |
| SRC | 0 | 1 | 3 | 3 | 0 | 2 | B |
| Figa | 0 | 0 | 1 | 1 | 1 | 0 | B |
| Myc | 0 | 1 | 4 | 5 | 0 | 3 | B |
| Mad | 1 | 0 | 4 | 5 | 1 or 2† | 0 | B |
| Mnt | 0 | 1 | 1 | 1 | 0 or 1† | 0 | B |
| Max | 2 | 1 | 1 | 1 | 0 | 1 | B |
| USF | 1 | 1 | 2 | 3 | 0 | 1 | B |
| MITF‡ | 1 | 0 | 4 | 5 | 0 | 2 | B |
| SREBP‡ | 1 | 1 | 2 | 2 | 0 | 2 | B |
| AP4 | 1 | 1 | 1 | 1 | 0 | 0 | B |
| MLX | 0 or 1§ | 1 | 1 | 2 | 0 | 3 | B |
| TF4 | 0 or 1§ | 1 | 1 | 2 | 1 | 1 | B |
| Clock | 1 | 3 | 2 | 2 | 0 | 1 | C |
| ARNT | 1 | 1 | 2 | 2 | 1 | 2 | C |
| Bmal | 0 | 1 | 1 | 2 | 0 | 2 | C |
| AHR | 1 | 2 | 1 | 4 | 0 | 1 | C |
| Sim | 0 to 1¶ | 1 | 2 or 3¶ | 2 or 3¶ | 1 or 2¶ | 4 or 5¶ | C |
| Trh | 0 to 1¶ | 1 | 1 or 2¶ | 1 or 2¶ | 0 or 1¶ | 0 or 1¶ | C |
| HIF | 1 to 2¶ | 1 | 3 or 4¶ | 3 or 4¶ | 0 or 1¶ | 2 or 3¶ | C |
| Emc | 0 | 1 | 4 | 5 | 0 | 4 | D |
| Hey# | 0 | 2 | 2 | 5 | 1 | 2 | E |
| Hairy# | 0 | 3 | 1 | 2 | 1 | 4 | E |
| E (spl) # | 1 | 8 | 8 | 8 | 1 | 8 | E |
| COE | 1 | 1 | 4 | 4 | 0 | 0 | F |
| Orphans | 6 | 1 | 0 | 3 | 0 | 0 | No |
Families have been named according to the name (or its common abbreviation) of the first discovered or best-known member of the family. The number of members per family in worm, fly and human (complete genomes) as well as in mouse, sea squirt, and pufferfish (uncompleted genomes) is reported. Each family has been tentatively assigned to a high-order group using the classification of Atchley and Fitch [6] and Ledent and Vervoort [8]. Genes that cannot be assigned to any families are categorized as 'orphan' genes. *Beta3 and Oligo are closely related families, one C. elegans gene (F38C2.2) belongs to the Beta3 family while another (DY3.3) is equally related to both Beta3 and Oligo families. †Mad and Mnt are closely related families, one Ciona gene (Not7) belongs to the Mad family while another (LQW20007) is equally related to both Mad and Mnt families. ‡These two families also include yeast genes. §TF4 and Mlx are closely related families, one C. elegans gene (T20B12.6) is equally related to both families. ¶The Hif, Sim, and Trh families form a strongly supported monophyletic group (bootstrap value, 95%). A few genes that are included in this group cannot be clearly related to one of the three families (see Additional data for details). #The Hey, Hairy and Enhancer of split families genes form a well-supported monophyletic group (group E; see Figure 1). Two clear families (Hairy and Hey families) with high bootstrap support emerge from this group. All the remaining sequences have been grouped in a single family (named Enhancer of split), which has no real phylogenetic support. A phylogenetic tree of the group can be found in the Additional data.
Figure 1Phylogenetic relationships and high-order grouping of the bHLH families. A neighbor-joining (NJ) tree showing the evolutionary relationships of the 44 animal bHLH families listed in Table 1 is shown. We used one gene (usually from mouse) per family to construct this tree. Although there are strong theoretical reasons for preferring the unrooted tree, we show a rooted tree because it is easier to display compactly and more clearly represents the relationships at the tip of the branches. This tree is just a representation of an unrooted tree with rooting that should be considered arbitrary. We used the plant bHLH family (R family) as outgroup. For simplicity, we show a tree in which branch lengths are not proportional to distances between sequences. High-order groups [6,8] are shown. Some of these groups (A and E) are monophyletic groups, others (D and F) correspond to only one family, and yet others (B and C) are paraphyletic (the last common ancestor of the different families that constitute the group is also that of bHLHs that do not belong to that group). A subgroup of group A families (the Atonal 'superfamily' [8]) is also highlighted and is displayed in more detail in Figure 2.
Figure 2Some examples of phylogenetic relationships among human and mouse bHLH. Rooted NJ trees are shown. Numbers above branches indicate per cent support in bootstrap analyses (1,000 replicates). As in Figure 1, the rooting should be considered arbitrary. Branch lengths are proportional to distance between sequences. Mouse genes are shown in red, human genes in blue, and other species in black. Species abreviations are as followed: Br, Brachydanio rerio; Ce, Caenorhabditis elegans; Ci, Ciona intestinalis; Dm, Drosophila melanogaster; Gg, Gallus gallus; Tr, Takifugu rubripes; Xl, Xenopus laevis. (a) Evolutionary relationships among Atonal 'superfamily' members (see Figure 1). The different constituting families are pointed out. For sake of simplicity, only mouse, human and fly genes are shown. This tree is rooted using the closely related twist gene from mouse (see Figure 1) as outgroup. In all cases, a human and a mouse sequence cluster together with high bootstrap values, indicating orthology relationships. (b) Evolutionary relationships among Mesp family members. This tree is rooted using the closely related MATH1 gene from mouse (see Figure 1) as outgroup. Whereas one human and one mouse bHLH (N015926 and pMeso1, respectively) are clearly orthologs, there is no one-to-one relationship between two mouse bHLH (Mesp1 and Mesp2) and three human bHLH (N010356a, b, c), although these bHLH cluster together with a high bootstrap value. (c) Evolutionary relationships among TF4 and MLX family members. This tree is rooted using the closely related MITF gene from mouse (see Figure 1) as outgroup. Two human genes have clear mouse orthologs but two others (Q9HAP2 and N005106) have no such orthologs. (d) Evolutionary relationships among SCL family members. This tree is rooted using the closely related Hen1 gene (NSCL family) from mouse (see Figure 1) as outgroup. The Lyl1 and Lyl2 mouse genes are collectively orthologs to one human gene (P12980), indicating a probable gene duplication specific to mouse.
The complete list of bHLH genes from Homo sapiens
| Sequence identification | Gene name | Family | Mouse ortholog(s) | Contigs | Chromosome localization |
|---|---|---|---|---|---|
| Achaete-Scute a | NT_009439.3 | 12q22-q23 | |||
| Achaete-Scute a | NT_009368.3 | 11p15.5 | |||
| Achaete-Scute b | NT_024228.3 | 11p15.3 | |||
| Achaete-Scute b | NT_004680.3 | 1q31-q32 | |||
| Achaete-Scute b | NT_009720.3 | 12q23-q24 | |||
| MyoD | NT_004662.3 | 1q31-41 | |||
| MyoD | NT_024473.2 | 12q21 | |||
| MyoD | NT_009307.3 | 11p15.4 | |||
| MyoD | NT_024473.2 | 12q21 | |||
| E12/E47 | NT_011269.3 | 19p13.3 | |||
| E12/E47 | NT_010289.3 | 15q21 | |||
| E12/E47 | NT_011059.5 | 18q21.1 | |||
| E12/E47 | NT_029427.1 | 12 | |||
| E12/E47 | NT_011269.3 | 19p13.3 | |||
| E12/E47 | NT_008413.3 | 9p22-q22 | |||
| Atonal | ? | 4q22 | |||
| Atonal | NT_029388.3 | 10q21-q26 | |||
| Mist | NT_007816.3 | 7q21-q31 | |||
| Oligo | NT_011512.3 | 21q21-q22 | |||
| Oligo | ? | ? | |||
| Oligo | NT_025741.3 | 6q22-q24 | |||
| Beta3 | NT_030199.1 | 8q21 | |||
| Beta3 | NT_011333.4 | 20p11-q13 | |||
| Neurogenin | NT_022859.3 | 4 | |||
| Neurogenin | NT_024089.3 | 10q21.3 | |||
| Neurogenin | NT_007091.3 | 5q23-Q31 | |||
| NeuroD | NT_009563.3 | 12q13-q14 | |||
| NeuroD | NT_007819.6 | 7p14-p15 | |||
| NeuroD | NT_010685.3 | 17q12 | |||
| NeuroD | NT_005272.3 | 2q32 | |||
| Mesp | NT_010356.6 | 15q25-q26 | |||
| Mesp | NT_010356.6 | 15q25-q26 | |||
| Mesp | NT_010356.6 | 15q25-q26 | |||
| Mesp | NT_015926.3 | 2p24 | |||
| Net | NT_015805.6 | 2p11-q24 | |||
| MyoR | NT_005204.6 | 2p21-p25 | |||
| MyoR | NT_008253.3 | 8q13 | |||
| MyoR | NT_005204.6 | 2p21-p25 | |||
| MyoR | NT_008166.3 | 8q13-q22 | |||
| PTFa | NT_008895.6 | 10p12-q22 | |||
| PTFb | NT_007918.6 | 7p15-p21 | |||
| Hand | NT_026280.4 | 5q33 | |||
| Hand | NT_006257.3 | 4q31-q33 | |||
| Twist | NT_007918.3 | 7p21 | |||
| Paraxis | NT_011493.3 | 20 | |||
| NSCL | NT_021883.3 | 1p11-p12 | |||
| NSCL | NT_004406.3 | 1q22 | |||
| SCL | NT_008470.3 | 9q31 | |||
| SCL | NT_004701.3 | 1p32 | |||
| SCL | NT_011247.3 | 19p13.2 | |||
| AP4 | NT_015360.3 | 19p13 | |||
| Mnt | NT_010692.3 | 17p13.3 | |||
| Mad | NT_022865.3 | 4p16.3 | |||
| Mad | NT_022865.3 | 4p16.3 | |||
| Mad | NT_024048.3 | 10q25 | |||
| Mad | NT_005420.3 | 2p12-p13 | |||
| Mad | ? | ? | |||
| Max | NT_025892.3 | 14q23 | |||
| Myc | NT_026240.1 | 2p24.1 | |||
| Myc | NT_008012.3 | 8q24 | |||
| Myc | NT_004893.3 | 1p34 | |||
| Myc | NT_011762.3 | Xq22-q23 | |||
| Myc | NT_011572.3 | Xq27 | |||
| SRC | NT_005204.3 | 2p22-p25 | |||
| SRC | NT_023676.3 | 8p22-q21 | |||
| SRC | NT_011371.3 | 20q12 | |||
| MITF | NT_005510.3 | 3p12-p14 | |||
| MITF | NT_011611.3 | Xp11 | |||
| MITF | NT_023409.3 | 6p21 | |||
| MITF | NT_026338.1 | 7 | |||
| MITF | NT_009714.3 | 12p11-q14 | |||
| SREBP | NT_010657.3 | 17p11.2 | |||
| SREBP | NT_011520.3 | 22q13 | |||
| USF | NT_026219.3 | 1q22-q23 | |||
| USF | NT_011294.3 | 19q13 | |||
| USF | NT_009711.6 | 12 | |||
| MLX | NT_023557.3 | 7q11 | |||
| MLX | ? | 12q21 | |||
| TF4 | NT_010771.3 | 17q21.1 | |||
| TF4 | NT_005106.6 | 2p24-q36 | |||
| Figa | NT_005420.3 | 2p13-p24 | |||
| Bmal | NT_017854.3 | 11p15 | |||
| Bmal | NT_009622.3 | 12p11-p12 | |||
| ARNT | NT_004811.3 | 1q21 | |||
| ARNT | NT_004811.6 | 1q21 | |||
| Clock | NT_029271.2 | 4q12 | |||
| Clock | NT_022171.6 | 2q13 | |||
| HIF | ? | 14q21-q24 | |||
| HIF | NT_029237.2 | 2p21-p16 | |||
| HIF | NT_011166.6 | 19q13 | |||
| HIF/Sim/Trh | NT_011166.3 | 19q13 | |||
| Sim | NT_019424.3 | 6q16-q21 | |||
| Sim | NT_011512.3 | 21q22.13 | |||
| Trh | NT_010164.3 | 14q12-13 | |||
| AHR | NT_007755.4 | 7p15 | |||
| AHR | NT_016866.3 | 5p15 | |||
| AHR | NT_016866.3 | 5p15 | |||
| N030106 | AHR | NT_030106.1 | 11q12-q13 | ||
| Hey | NT_023700.5 | 8q21 | |||
| Hey | ? | ? | |||
| Hey | NT_004893.5 | 1p34.3 | |||
| Hey | NT_004893.5 | 1p34.3 | |||
| Hey | NT_029966.1 | 4 | |||
| E(spl) | NT_005927.3 | 3p24-p26 | |||
| E(spl) | NT_009471.3 | 12p11-p12 | |||
| E(spl) | NT_029854.1 | 1p36 | |||
| E(spl) | NT_010841.2 | 17p12-p13 | |||
| E(spl) | NT_019265.6 | 1p36 | |||
| E(spl) | NT_005139.6 | 2q36-q37 | |||
| E(spl) | NT_019265.6 | 1p36 | |||
| E(spl) | NT_011333.4 | 20q11-q13 | |||
| Hairy | NT_005571.3 | 3q28-q29 | |||
| Hairy | NT_025635.5 | 1p36 | |||
| Emc | NT_028392.4 | 20q11 | |||
| Emc | NT_005999.3 | 3p21-q13 | |||
| Emc | NT_022194.3 | 2p25 | |||
| Emc | NT_004359.6 | 1p36 | |||
| Emc | NT_027049.3 | 6p21-p22 | |||
| Coe | NT_007006.4 | 5q34 | |||
| Coe | NT_023666.3 | 8p21-p22 | |||
| Coe | NT_008818.6 | 10q25-q26 | |||
| Coe | ? | 20p11-p13 | |||
| Orphan | NT_010809.6 | 17p13.3 | |||
| Orphan | NT_030131.1 | 13q12-q14 | |||
| Orphan | NT_010194.6 | 15q14-q22 |
Human sequences are identified using their accession number from Swissprot, Trembl, Smart or NCBI genome project sequences. In the latest case, the accession number is that of the contig which includes the bHLH gene. This accession number NT_XXXXXX.Y (XXXXXX identifies the contig and Y the version of the draft) has been abbreviated as NXXXXXXX. Gene names are those reported in protein databases or have been assigned by us on the basis of the orthology relationships with mouse genes (these names are marked by an asterisk). The identification of the contig in which each of the bHLH gene is included is also given. In a few cases (marked with a question mark), we were unable to retrieve, in the genome sequence, previously cloned genes. This may be due to the fact that these genes lie in still unsequenced regions of the genome, or to some limitations of the current version of BLAST (see text for details). Chromosomal localizations are given as reported in the NCBI human genome sequence database (LocusLink and/or OMIM [77,78]).
The complete list of bHLH genes from Drosophila melanogaster
| Full gene name | Symbol | ID | Family | Localization | Accession number |
|---|---|---|---|---|---|
| E12/E47 | 31D11-E1 | pir||A31641 | |||
| MyoD | 95B3-5 | SW:P22816 | |||
| Achaete-Scute a | 1B1 | gb|AAF45498.1 | |||
| Achaete-Scute a | 1B1 | gb|AAA28313.1 | |||
| Achaete-Scute a | 1B1 | gb|AAF45500.1 | |||
| Achaete-Scute a | 1B1 | gb|AAF45502.1 | |||
| Neurogenin | 74B1-2 | emb|CAA65103.1 | |||
| Mist | 39D3 | gb|AAF53991.1 | |||
| Beta3 | 36C6-7 | gb|AAF53631.1 | |||
| Atonal | 53A1-2 | gb|AAF58026.1 | |||
| Atonal | 84F6 | gb|AAF54209.1 | |||
| Atonal | 37A1-2 | gb|AAF53678.1 | |||
| Net | 21A5-B1 | gb|AAF51562.1 | |||
| MyoR | 54E7-9 | gb|AAF57795.1 | |||
| Mesp | 85D7-10 | gb|AAF54351.1 | |||
| Paraxis | 9A4 | sp|Q9W2Z5 | |||
| Twist | 59C2-3 | emb|CAA32707.1 | |||
| PTFa | 84C3-4 | gb|AAF54058.1 | |||
| PTFb | 89B9-12 | gb|AAF55280.1 | |||
| PTFb | 86F1-2 | gb|AAF54684.1 | |||
| Hand | 31D1-6 | gb| AAF52900.1 | |||
| SCL | 3B3-4 | gb|AAF45802.1 | |||
| NSCL | 4C6-7 | gb|AAF45967.1 | |||
| Mnt | 3F2-3 | sp|O46042 | |||
| Max | 76A3 | gb|AAF49179.1 | |||
| Myc | 3D3-4 | gb|AAB39842.1 | |||
| USF | 4C4 | gb|AAF45953.1 | |||
| AP4 | 35F6-7 | gb|AAF53510.1 | |||
| TF4 | 97F5-6 | gb|AAF56696.1 | |||
| MLX | 39D1-2 | gb|AAF53989.1 | |||
| SREBP | 76D1-3 | gb|AAF49115.1 | |||
| SRC | 30A7-8 | sp|Q9VLD9 | |||
| Clock | 66A11-B1 | gb|AAD10630.1 | |||
| Clock | 10C6-8 | gb|AAC14350.1 | |||
| Clock | 13C1 | gb|AAF48439.1 | |||
| AHR | 96F14-97A1 | gb|AAF56569.1 | |||
| AHR | 89C1-2 | gb|AAD09205.1 | |||
| Sim | 87D12-13 | gb|AAF54902.1 | |||
| Trh | 61C1 | gb|AAA96754.1 | |||
| HIF | 99D5-F1 | gb|AAC47303.1 | |||
| ARNT | 85C5-7 | gb|AAF54329.1 | |||
| BMAL | 76D2-3 | gb|AAF49107.1 | |||
| Emc | 61D1-2 | gb|AAF47413.1 | |||
| Hey | 43F9-44A1 | gb|AAF59152.1 | |||
| Hey | 86A5-6 | gb|AAF24476.1 | |||
| Hairy | 66D11-12 | emb|CAA34018.1 | |||
| Hairy | 44B3-4 | gb|AAB24149.1 | |||
| Hairy | 37B9-11 | gb|AAF53741.1 | |||
| E (spl) | 96F10-12 | gb|AAF56550.1 | |||
| E (spl) | 96F10-12 | emb|CAA34552.1 | |||
| E (spl) | 96F10-12 | sp|P13098 | |||
| E (spl) | 96F10-12 | emb|CAA34553.1 | |||
| E (spl) | 96F10-12 | gb|AAA28910.1 | |||
| E (spl) | 96F10-12 | gb|AAA28911.1 | |||
| E (spl) | 96F10-12 | gb|AAA28909.1 | |||
| E (spl) | 17A3 | gb|AAF48810.1 | |||
| COE | 51C2-5 | gb|AAF58204.1 | |||
| Orphan | 97B1-2 | gb|AAF56590.1 |
Gene names (with commonly used synonyms in some cases) and their usual abbreviation are as reported in FlyBase [79,80], except those marked by an asterisk. In these cases, we propose names based on the orthology relationships with well-characterized vertebrate genes. Identification numbers are those from the Berkeley Drosophila Genome Project [81]. Sequences are listed with the family in which there are included (or stated as orphan genes), their chromosomal localization (position on the polytene chromosomes map as found in FlyBase), and their accession number. The 'orphan' gene Delilah clearly belongs to the high-order group A and is most probably a highly divergent NeuroD family gene (see [8] for discussion).
The complete list of bHLH genes from Caenorhabditis elegans
| Sequence name | Family | Localization | Accession number |
|---|---|---|---|
| E12/E47 | I: 1,82 | TR:Q17588 | |
| MyoD | II: -4,51 | SW:P22980 | |
| Achaete-Scute a | II: 1 | gb|AAB38323.1 | |
| Achaete-Scute a | II:1,11 | TR:Q09961 | |
| Achaete-Scute a | X: -19,47 | TR:Q20941 | |
| Achaete-Scute a | IV: 3,86 | TR:Q18277 | |
| Achaete-Scute b | II: 1,51 | emb|CAA87416.1 | |
| NeuroD | III: -2,01 | sp|P46581 | |
| Neurogenin | **** | **** | |
| Beta3 | IV: 24,06 | TR:O45489 | |
| Beta3/Oligo | I: 3,04 | TR:O45320 | |
| Atonal | X: -15,13 | TR:10574 | |
| Net | III:0,92 | sp|P34555|YNP2 | |
| MyoR | V: 1,87 | TR:Q23579 | |
| Twist | X: -0,63 | gb|AAC26105.1 | |
| Hand | X: 5,8 | TR:Q18612 | |
| PTFb | X: -8,42 | TR:Q20561 | |
| NSCL | X: -14 | TR:Q18590 | |
| Max | X: 12,32 | TR/Q18711 | |
| Max | V: 3,05 | TR:P90982 | |
| Mad | X:-2,63 | sp|Q21663 | |
| USF | III: -28,29 | gb|AAC68792 | |
| MITF | IV: -1, 14 | TR:P91527 | |
| AP4 | III: 0,63 | SW:P34474 | |
| SREBP | III: 8,9 | TR:Q9XX00 | |
| TF4/Mlx | III:-0,71 | gb|AAA19059.1 | |
| Clock | V: 4,63 | emb|CAA99775.1 | |
| AHR | I: 3,75 | emb|CAB51463.1 | |
| HIF | V: 27,08 | pir||T21944 | |
| HIF/Sim/Trh | V: 5,39 | TR:P90953 | |
| ARNT | X: 0,43 | TR:O02219 | |
| E (spl) | IV: 6,9 | gb|AAB68848.1 | |
| COE | X: 19,39 | gb|AAC06226.1 | |
| Orphan | III: -19,16 | gb|AAF605231 | |
| Orphan | II: 2,22 | emb|CAA88744.1 | |
| Orphan | II: -1,28 | gb|AAB52693.1 | |
| Orphan | X: 24,06 | TR:Q19917 | |
| Orphan | II: -1,28 | gb|AAK31421 | |
| Orphan | II:1,28 | TR:Q18053 |
Gene identifications are those of the C. elegans genome project. The localization of the genes referred to the worm recombination genetic map as found in Wormbase [82]. Sequences marked with an asterisk form a well supported monophyletic group and encode proteins with two bHLH (see text for details). The Y69A2AR gene was not found in the databases. Its sequence comes from [45].
The complete list of bHLH genes from Mus musculus
| Gene name | Family | Human ortholog(s) | Accession number |
|---|---|---|---|
| Achaete-Scute a | gb|AAB28830.1 | ||
| Achaete-Scute a | gb|AAD33794.1 | ||
| Achaete-Scute b | sp|CAC37689 | ||
| MyoD | P15173 | sp|P12979 | |
| MyoD | P23409 | ref|NP_032683.1 | |
| MyoD | P15172 | sp|P10085 | |
| MyoD | P13349 | ref|NP_032682.1 | |
| E12/E47 | sp|15806 | ||
| E12/E47 | ref|NP_035674.1 | ||
| E12/E47 | ref|NP_038713.1 | ||
| E12/E47 | ? | dbj|BAA06218.1 | |
| Atonal | dbj|BAA07791.1 | ||
| Atonal | gb|AAC68868.1 | ||
| Mist | gb|AAF17706.1 | ||
| Oligo | ref|NP_058664.1 | ||
| Oligo | sptrembl|Q9EQW6 | ||
| Oligo | sptrembl|Q9EQW5 | ||
| Beta3 | gb|AAF32324.1 | ||
| Beta3 | sptrembl|Q9H494 | ||
| Neurogenin | gb|AAC53028.1 | ||
| Neurogenin | emb|CAA70366.1 | ||
| Neurogenin | sp|P70660 | ||
| NeuroD | dbj|BAA07923.1 | ||
| NeuroD | gb|AAC15969.1 | ||
| NeuroD | sp|Q62414 | ||
| NeuroD | gb|AAC52203.1 | ||
| Net | spnew|BAB39468 | ||
| Mesp | ? | gb|AAF70375.1 | |
| Mesp | ? | gb|AAB51199.1 | |
| Mesp | ref|NP_062417.1 | ||
| MyoR | gb|AAC62513.1 | ||
| MyoR | gb|AAD10053.1 | ||
| PTFa | emb|CAB65273.1 | ||
| Hand | gb|AAB35104.1 | ||
| Hand | gb|AAC52338.1 | ||
| Twist | gb|AAA40514.1 | ||
| Twist | emb|CAA69333.1 | ||
| Paraxis | gb|AAA86825.1 | ||
| Paraxis | ? | gb|AAB34266.1 | |
| NSCL | gb|AAA39840.1 | ||
| NSCL | gb|AAB22580.1 | ||
| SCL | emb|CAB72256.1 | ||
| SCL | gb|AAA40162.1 | ||
| SCL | emb|CAA40870.1 | ||
| SCL | pir||B43814 | ||
| Figa | sptrembl|O55208 | ||
| AP4 | Q01664 | gb|AAF80448.1 | |
| Mnt | swissprot|O08789 | ||
| Mad | swissprot|P50540 | ||
| Mad | swissprot|P50538 | ||
| Mad | sptrembl|Q60947 | ||
| Mad | pir||S60006 | ||
| Max | sp|P28574 | ||
| Myc | gb|AAA39833.1 | ||
| Myc | emb|CAA25508.1 | ||
| Myc | emb|CAA32128.1 | ||
| Myc | ? | ref|NP_034980.1 | |
| SRC | gb|AAB01228.1 | ||
| SRC | gb|AAB06177.1 | ||
| SRC | sp|O09000 | ||
| MITF | gb|AAF81266.1 | ||
| MITF | gb|AAB21130.1 | ||
| MITF | gb|AAD20979.1 | ||
| MITF | gb|AAD24426.1 | ||
| SREBP | dbj|BAA74795.1 | ||
| SREBP | gb|AAG01859.1 | ||
| USF | emb|CAA64627.1 | ||
| USF | pir||A56522 | ||
| MLX | gb|AAK20940.1 | ||
| TF4 | gb|AAB51368.1 | ||
| BMAL | dbj|BAA76414.1 | ||
| ARNT | gb|AAA56717.1 | ||
| ARNT | dbj|BAA09799.1 | ||
| Clock | swissnew|O08785 | ||
| Clock | gb|AAB47249.1 | ||
| HIF | emb|CAA70306.1 | ||
| HIF | gb|AAC12871.1 | ||
| HIF | gb|AAC72734.1 | ||
| HIF/Sim/Trh | gb|AAB47247.1 | ||
| Sim | gb|AAC05481.1 | ||
| Sim | gb|AAB84099.1 | ||
| Trh | gb|AAF14283.1 | ||
| AHR | dbj|BAA07469.1 | ||
| Emc | sp|P20067 | ||
| Emc | gb|AAA79771.1 | ||
| Emc | Q02535 | sp|P41133 | |
| Emc | emb|CAA05120.1 | ||
| Hey | emb|CAB51321.1 | ||
| Hey | gb|AAF37298.1 | ||
| Hairy | dbj|BAA03931.1 | ||
| E(spl) | sptrembl|O14503 | ||
| E(spl) | spnew|BAB21503 | ||
| E(spl) | dbj|BAA24091.1 | ||
| E(spl) | dbj|BAA19799.1 | ||
| E(spl) | dbj|BAA06858.1 | ||
| E(spl) | gb|AAF63757.1 | ||
| E(spl) | spnew|BAB39526 | ||
| E(spl) | spnew|BAA9469 | ||
| Coe | swissprot|Q07802 | ||
| Coe | ? | gb|AAB58423.1 | |
| Coe | ? | swissprot|O08792 | |
| Coe | ? | swissprot|O08791 |
Mouse genes are listed with the family in which they are included, the identification of their human ortholog(s) (? indicates that no clear ortholog was found, see text for details), and their accession number. In most cases, several names exist for each gene. We report here only one name; synonyms can be found in the protein databases using the reported accession numbers.
The complete list of bHLH genes from Saccharomyces cerevisiae
| Gene name | Accession number | Family |
|---|---|---|
| gb|AAA86842.1 | RTG3P | |
| sp| P32607 | MITF | |
| sp|P33122 | SREBP | |
| sp|Q12398 | SREBP | |
| ref|NP_011227.1 | Pho4 | |
| gb|AAA34490.1 | CBP | |
| sp| P26798 | Orphan | |
| sp|P13902 | Orphan |
Yeast genes are listed with the family in which they are included and their accession number.
Figure 3Worm proteins with two bHLH domains. A rooted NJ tree is shown that depicts the phylogenetic relationships of the five worm proteins with two bHLH domains. Mouse genes representative of some of the animal families have been included in this analysis. Rooting is as in Figure 1. Numbers above branches indicate per cent support in bootstrap analyses (1,000 replicates). As in Figure 1, the rooting should be considered arbitrary. Branch lengths are proportional to distance between sequences. Mm, Mus musculus; Ce, Caenorhabditis elegans. The sequences of the first bHLH of each worm proteins are shown in blue, the second in red. Both form monophyletic groups with high bootstrap values, indicating that these proteins originate from an ancestral protein that already had two bHLH domains. There is, furthermore, a weaker support (40% bootstraps) for an association of the two bHLH domains into a monophyletic group (not shown in the figure, as only nodes with 50% or more support are shown), suggesting that the ancestral protein may have acquired its two bHLH domains through tandem duplication rather than by association of unrelated bHLH domains.