| Literature DB >> 24312906 |
Abstract
The previous survey identified 70 basic helix-loop-helix (bHLH) proteins, but it was proved to be incomplete, and the functional information and regulatory networks of frog bHLH transcription factors were not fully known. Therefore, we conducted an updated genome-wide survey in the Xenopus tropicalis genome project databases and identified 105 bHLH sequences. Among the retrieved 105 sequences, phylogenetic analyses revealed that 103 bHLH proteins belonged to 43 families or subfamilies with 46, 26, 11, 3, 15, and 4 members in the corresponding supergroups. Next, gene ontology (GO) enrichment analyses showed 65 significant GO annotations of biological processes and molecular functions and KEGG pathways counted in frequency. To explore the functional pathways, regulatory gene networks, and/or related gene groups coding for Xenopus tropicalis bHLH proteins, the identified bHLH genes were put into the databases KOBAS and STRING to get the signaling information of pathways and protein interaction networks according to available public databases and known protein interactions. From the genome annotation and pathway analysis using KOBAS, we identified 16 pathways in the Xenopus tropicalis genome. From the STRING interaction analysis, 68 hub proteins were identified, and many hub proteins created a tight network or a functional module within the protein families.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24312906 PMCID: PMC3842043 DOI: 10.1155/2013/145037
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Significant pathways identified by KOBAS with 93 Xenopus tropicalis bHLH transcription factors.
| Term | Pathway database | Database ID | Sample gene number | Background number |
| Corrected | Genes |
|---|---|---|---|---|---|---|---|
| Circadian rhythm: mammal | KEGG | xtr04710 | 3 | 21 | 1.10 | 0.0001219 | XSBmal2; XSDec2; XSClock |
| TGF-beta signaling pathway | KEGG | xtr04350 | 4 | 73 | 1.52 | 0.0001219 | XSId3; XSId2; XSId4; XSnMyc |
| Notch signaling pathway | PANTHER | P00045 | 2 | 5 | 0.0004516 | 0.0024084 | XSHes1a; XSHes1b; XSHerp1 |
| Notch signaling pathway | KEGG | xtr04330 | 2 | 43 | 0.0037667 | 0.0150668 | XSHes1a; XSHes1b; XSHes5a |
| Developmental biology | Reactome | None | 2 | 106 | 0.0145401 | 0.0398896 | XSHes1a; XSHes1b; XSNDF1; XSNDF2 |
| Circadian clock | Reactome | None | 1 | 8 | 0.0149586 | 0.0398896 | XSClock |
| Herpes simplex infection | KEGG | xtr05168 | 2 | 128 | 0.0304156 | 0.0695215 | XSBmal2; XSClock |
| MAPK signaling pathway | KEGG | xtr04010 | 2 | 220 | 0.0802242 | 0.1604484 | XSMAX; XSnMyc |
| Fanconi anemia pathway | KEGG | xtr03460 | 1 | 51 | 0.1044307 | 0.1856547 | XSHes1a; XSHes1b |
| ErbB signaling pathway | KEGG | xtr04012 | 1 | 70 | 0.1406256 | 0.2250009 | XSnMyc |
| Melanogenesis | KEGG | xtr04916 | 1 | 86 | 0.1700261 | 0.2386888 | XSMITF |
| Jak-STAT signaling pathway | KEGG | xtr04630 | 1 | 91 | 0.1790166 | 0.2386888 | XSnMyc |
| Metabolism | Reactome | None | 2 | 458 | 0.206835 | 0.2544026 | XSSRC2; XSSRC1 |
| Cell cycle | KEGG | xtr04110 | 1 | 116 | 0.2226023 | 0.2544026 | XSnMyc |
| Wnt signaling pathway | KEGG | xtr04310 | 1 | 131 | 0.2476916 | 0.2642044 | XSnMyc |
| Wnt signaling pathway | PANTHER | P00057 | 1 | 37 | 0.2756772 | 0.2756772 | XSvMyc |
Information of the Xenopus tropicalis 105 bHLH transcription factors.
| bHLH | Gene name |
| Protein accessionc | Genome contigd | ||
|---|---|---|---|---|---|---|
| Name | MLE bootstrap value (%)a | BI posterior probability (%)b | ||||
| ASCa |
|
| 89 | 99 | XP_002944648.1 | NW_003169609.1 |
| ASCa |
|
| n/m* | 99 | XP_002940290.1 | NW_003163913.1 |
| ASCb |
|
| 90 | 100 | XP_002940370.1 | NW_003163927.1 |
| MyoD |
|
| 96 | 94 | NP_988972.1 | NW_003166075.1 |
| MyoD |
|
| 94 | 100 | NP_001016725.1 | NW_003163495.1 |
| MyoD |
|
| n/m | 76 | NP_988932.1 | NW_003163331.1 |
| MyoD |
|
| 82 | 95 | NP_001017160.1 | NW_003163331.1 |
| E12/E47 |
|
| 99 | 53 | NP_001093743.1 | NW_003163736.1 |
| E12/E47 |
|
| 76 | 88 | XP_002940299.1 | NW_003163915.1 |
| E12/E47 |
|
| 76 | n/m* | NP_001096226.2 | NW_003163423.1 |
| Ngn |
|
| 83 | 78 | NP_001116895.1 | NW_003163503.1 |
| NeuroD |
|
| n/m | n/m* | NP_001090868.1 | NW_003163341.1 |
| NeuroD |
|
| 65 | 63 | NP_001072486.1 | NW_003163936.1 |
| NeuroD |
|
| 79 | 80 | NP_001072273.1 | NW_003163914.1 |
| NeuroD |
|
| 97 | 99 | NP_001124513.1 | NW_003163487.1 |
| Mist1 |
|
| 99 | 100 | XP_002931994.1 | NW_003163340.1 |
| Beta3 |
|
| 70 | 53 | XP_002944506.1 | NW_003167409.1 |
| Beta3 |
|
| 77 | 94 | NP_001072933.1 | NW_003163515.1 |
| Oligo |
|
| 97 | 100 | XP_002938497.1 | NW_003163700.1 |
| Oligo |
|
| 76 | 73 | XP_002938491.1 | NW_003163700.1 |
| Oligo |
|
| 83 | 90 | NP_001008191.1 | NW_003163713.1 |
| Oligo |
|
| n/m | n/m | NP_001039180.1 | NW_003163795.1 |
| Net |
|
| 100 | 100 | XP_002937330.1 | NW_003163606.1 |
| Mesp |
|
| n/m | n/m | NP_001039184.1 | NW_003163348.1 |
| Mesp |
|
| n/m | n/m | NP_001016653.1 | NW_003163348.1 |
| Mesp |
|
| 99 | 100 | NP_001039104.1 | NW_003163426.1 |
| Twist |
|
| 91 | 83 | NP_989415.1 | NW_003163378.1 |
| Twist |
|
| 98 | 100 | NP_001096679.1 | NW_003163487.1 |
| Paraxis |
|
| 62 | 83 | NP_001016506.1 | NW_003165117.1 |
| Paraxis |
|
| 96 | 99 | XP_002942929.1 | NW_003164455.1 |
| Paraxis |
|
| 74 | 59 | XP_002937913.1 | NW_003163647.1 |
| MyoRa |
|
| 63 | 60 | NP_001096235.1 | NW_003163586.1 |
| MyoRa |
|
| n/m | 62 | NP_001103518.1 | NW_003163498.1 |
| MyoRb |
|
| 78 | 94 | GNOMON∣93674.pe
| NW_003164157.1 |
| MyoRb |
|
| 55 | 95 | GNOMON∣522504.pe
| NW_003163470.1 |
| Hand |
|
| 94 | 100 | NP_001016743.1 | NW_003163350.1 |
| Hand |
|
| 99 | 55 | NP_001093695.1 | NW_003163380.1 |
| PTFa |
|
| 99 | 100 | NP_001095279.1 | NW_003163378.1 |
| PTFb |
|
| 91 | 100 | XP_002933181.1 | NW_003163373.1 |
| SCL |
|
| 77 | 62 | NP_001135468.1 | NW_003163327.1 |
| SCL |
|
| 72 | 76 | XP_002934026.1 | NW_003163404.1 |
| SCL |
|
| 86 | 97 | XP_002939165.1 | NW_003163774.1 |
| NSCL |
|
| 99 | 100 | XP_002937307.1 | NW_003163605.1 |
| SRC |
|
| 82 | 97 | NP_001106383.1 | NW_003163796.1 |
| SRC |
|
| 97 | 100 | NP_001135631.1 | NW_003163586.1 |
| SRC |
|
| 80 | 97 | XP_002933204.1 | NW_003163374.1 |
| Fig |
|
| 92 | 100 | NP_001016342.1 | NW_003163469.1 |
| MYC |
|
| 71 | 65 | NP_001011144.1 | NW_003164143.1 |
| MYC |
|
| n/m | 98 | NP_989390.1 | NW_003163721.1 |
| MYC |
|
| 91 | 99 | NP_001006874.1 | NW_003163866.1 |
| Mad |
|
| 85 | 97 | NP_001008129 | NW_003180496.1 |
| Mad |
|
| n/m | 88 | NP_001072228.1 | NW_003163469.1 |
| Mad |
|
| 99 | 100 | NP_001017299.1 | NW_003163577.1 |
| Mad |
|
| 89 | 100 | NP_001096239.1 | NW_003164437.1 |
| Mnt |
|
| n/m | 97 | NP_001135494.1 | NW_003163468.1 |
| MAX |
|
| 90 | 100 | NP_001008208.1 | NW_003163599.1 |
| USF |
|
| 92 | 99 | NP_001096236.1 | NW_003168160.1 |
| USF |
|
| n/m | 60 | NP_001007857.1 | NW_003163677.1 |
| USF |
|
| 85 | 99 | NP_001120597.1 | NW_003164188.1 |
| MITF |
|
| n/m | n/m | NP_001093747.1 | NW_003163951.1 |
| MITF |
|
| 84 | 100 | NP_001072648.1 | NW_003163367.1 |
| MITF |
|
| 66 | 99 | XP_002935013.1 | NW_003163447.1 |
| MITF |
|
| 85 | 78 | XP_002944430.1 | NW_003166883.1 |
| SREBP |
|
| 88 | 99 | XP_002935886.1 | NW_003163500.1 |
| SREBP |
|
| 88 | 99 | XP_002935887.1 | NW_003163500.1 |
| SREBP |
|
| 88 | 99 | XP_002944649.1 | NW_003169615.1 |
| SREBP |
|
| n/m | 67 | NP_001116910.1 | NW_003163395.1 |
| AP4 |
|
| 71 | 98 | NP_001123841.1 | NW_003163353.1 |
| Mlx |
|
| 89 | 100 | NP_001090682.1 | NW_003163637.1 |
| TF4 |
|
| 88 | 100 | GNOMON:712044.pe
| NW_003164277.1, |
| Clock |
|
| 99 | 100 | NP_001122127.1 | NW_003163433.1 |
| ARNT |
|
| n/m | n/m | NP_001116925.1 | NW_003163477.1 |
| ARNT |
|
| 100 | n/m | NP_001093686.1 | NW_003163348.1 |
| Bmal |
|
| 63 | 100 | NP_001096298.1 | NW_003164805.1 |
| AHR |
|
| 92 | 99 | XP_002933348.1 | NW_003163378.1 |
| AHR |
|
| 91 | 100 | XP_002935182.1 | NW_003163457.1 |
| Sim |
|
| n/m* | 98 | XP_002932187.1 | NW_003163345.1 |
| Sim |
|
| 89 | 99 | XP_002941575.1 | NW_003164120.1 |
| Trh |
|
| n/m | 70 | NP_001072647.1 | NW_003163363.1 |
| HIF |
|
| 99 | n/m | NP_001011165.1 | NW_003163817.1 |
| HIF |
|
| 79 | 94 | NP_001005647.1 | NW_003163351.1 |
| Emc |
|
| 78 | 90 | NP_988885.1 | NW_003163451.1 |
| Emc |
|
| 79 | 98 | NP_001016271.1 | NW_003163432.1 |
| Emc |
|
| 86 | 54 | NP_001004839.1 | NW_003163385.1 |
| Hey |
|
| 83 | 97 | NP_001007911.1 | NW_003163551.1 |
| Hey |
|
| 86 | 92 | XP_002936042.1 | NW_003163507.1 |
| Hey |
|
| 98 | 100 | XP_002934312.1 | NW_003163416.1 |
| H/E(spl) |
|
| 99 | n/m | NP_001027504.1 | NW_003163993.1 |
| H/E(spl) |
|
| n/m | 81 | NP_001011194.1 | NW_003163571.1 |
| H/E(spl) |
|
| n/m | 81 | NP_988870.1 | NW_003163533.1 |
| H/E(spl) |
|
| n/m* | 61 | NP_001037880.1 | NW_003163546.1 |
| H/E(spl) |
|
| n/m* | 61 | NP_001037974.1 | NW_003163546.1 |
| H/E(spl) |
|
| n/m* | 100 | NP_001039178.1 | NW_003163399.1 |
| H/E(spl) |
|
| n/m* | 100 | NP_001037951.1 | NW_003163399.1 |
| H/E(spl) |
|
| n/m* | 82 | NP_001107462.1 | No finding |
| H/E(spl) |
|
| n/m* | 100 | NP_001037989.1 | NW_003163399.1 |
| H/E(spl) |
|
| n/m | n/m | NP_001072210.1 | NW_003163381.1 |
| H/E(spl) |
|
| 73 | 97 | NP_001039166.1 | NW_003164377.1 |
| H/E(spl) |
|
| 86 | 100 | NP_001107508.1 | NW_003164377.1 |
| Coe |
|
| n/m | 51 | XP_002939654.1 | NW_003163834.1 |
| Coe |
|
| 91 | 97 | NP_989200.1 | NW_003163356.1 |
| Coe |
|
| 91 | 66 | XP_002932694.1 | NW_003163358.1 |
| Coe |
|
| 91 | 66 | XP_002932695.1 | NW_003163358.1 |
| Orphan |
|
| 86 | 100 | XP_002938975.1 | NW_003163749.1 |
| Orphan |
|
| 94 | 100 | XP_002943245.1 | NW_003164609.1 |
Xenopus tropicalis bHLH genes were named according to their human orthologous genes' names (or common abbreviations) and the referred nomenclature was mainly from the tables and additional tables provided by Ledent et al. [4] and Simionato et al. [6]. Bootstrap values were converted from phylogenetic analyses with human bHLH sequences using BI and MLE algorithm, respectively. MLE bootstrap valuea refers to the result from maximum likelihood estimate in phylogenetic analysis, and BI posterior probabilityb refers to the result from BI in phylogenetic analysis. The numbers in the phylogenetic trees are converted into percentages. cThe accession numbers were retrieved from the following resources; this sequence was verified by many EST TBLASTN search hits, such as EG651417.1 and CX503003.2 (EST accession number). These numbered as “NP” were from the RefSeq protein database and those numbered as “XP” were from the Build protein database. Notes in the brackets are also gene symbols according to records in NCBI and Xenbase. All of the bHLH genes are organized in the order of bHLH families manifested in Table 1 of Ledent et al. [4]. The question mark means no matching; mark n/m* means no monophyletic group with single particular orthologous gene sequences, but formed a monophyletic group with two or more orthologous gene sequences of the family; mark n/m denotes the case of lower bootstrap value estimated less than 50%.eThe accession numbers were retrieved from the ab initio protein database.
Figure 1Alignment of 105 Xenopus tropicalis bHLH domains. Designation of basic, helix 1, loop, and helix 2 follows Ferre-D'Amare et al. [39–43] and bHLH domains were shaded using GeneDoc. Family and bHLH protein names and high-order groups were organized according to Table 1 in the paper of Ledent et al. [4]. Highly conserved sites are shaded in black and indicated with asterisks on the top.
GO enrichment by categories of super-groups by DAVID bioinformatics bases with 105 Xenopus tropicalis bHLH transcription factors.
| Group | Enriched genes | GO term ID | GO category | GO definition | Coherence (%)a |
|
|---|---|---|---|---|---|---|
| A | 43 | GO:0030528 | MF | Transcription regulator activity | 100 | 2.50 |
| GO:0045449 | BP | Regulation of transcription | 100 | 7.60 | ||
| GO:0007517 | BP | Muscle organ development | 15.4 | 5.50 | ||
| GO:0007519 | BP | Skeletal muscle tissue development | 7.7 | 1.30 | ||
| GO:0055123 | BP | Digestive system development | 7.7 | 1.30 | ||
| GO:0014706 | BP | Striated muscle tissue development | 7.7 | 1.30 | ||
| GO:0043282 | BP | Pharyngeal muscle development | 7.7 | 1.30 | ||
| GO:0002074 | BP | Extraocular skeletal muscle development | 7.7 | 1.30 | ||
| GO:0048741 | BP | Skeletal muscle fiber development | 7.7 | 1.30 | ||
| GO:0048747 | BP | Muscle fiber development | 7.7 | 1.30 | ||
| GO:0060538 | BP | Skeletal muscle organ development | 7.7 | 1.30 | ||
| GO:0060465 | BP | Pharynx development | 7.7 | 1.30 | ||
| GO:0007423 | BP | Sensory organ development | 11.5 | 1.70 | ||
| GO:0042692 | BP | Muscle cell differentiation | 7.7 | 2.00 | ||
| GO:0060537 | BP | Muscle tissue development | 7.7 | 2.00 | ||
| GO:0051146 | BP | Striated muscle cell differentiation | 7.7 | 2.00 | ||
| GO:0055002 | BP | Striated muscle cell development | 7.7 | 2.00 | ||
| GO:0055001 | BP | Muscle cell development | 7.7 | 2.00 | ||
| GO:0003677 | MF | DNA binding | 26.9 | 8.70 | ||
|
| ||||||
| B | 267 | GO:0030528 | MF | Transcription regulator activity | 100 | 8.10 |
| GO:0045449 | BP | Regulation of transcription | 100 | 6.90 | ||
| GO:0035257 | MF | Nuclear hormone receptor binding | 10.5 | 1.20 | ||
| GO:0051427 | MF | Hormone receptor binding | 10.5 | 1.60 | ||
| GO:0003713 | MF | Transcription coactivator activity | 10.5 | 2.00 | ||
| GO:0003712 | MF | Transcription cofactor activity | 10.5 | 3.10 | ||
| GO:0008134 | MF | Transcription factor binding | 10.5 | 6.90 | ||
| GO:0006355 | BP | Regulation of transcription, DNA-dependent | 26.3 | 9.70 | ||
|
| ||||||
| C | 11 | GO:0006350 | BP | Transcription | 100 | 1.40 |
| GO:0030528 | MF | Transcription regulator activity | 100 | 4.70 | ||
| GO:0006355 | BP | Regulation of transcription, DNA-dependent | 100 | 9.50 | ||
| GO:0051252 | BP | Regulation of RNA metabolic process | 100 | 1.00 | ||
| GO:0003677 | MF | DNA binding | 100 | 4.10 | ||
| GO:0045449 | BP | Regulation of transcription | 100 | 9.30 | ||
| GO:0003700 | MF | Transcription factor activity | 71.4 | 1.70 | ||
| KEGG_Id:480089074 | KEGG pathway | Circadian rhythm | 28.6 | 3.90 | ||
|
| ||||||
| D | 3 | None | None | None | None | None |
|
| ||||||
| E | 15 | GO:0030528 | MF | Transcription regulator activity | 100 | 1.30 |
| GO:0045449 | BP | Regulation of transcription | 100 | 2.40 | ||
| GO:0006350 | BP | Transcription | 68.8 | 7.90 | ||
| GO:0003677 | MF | DNA binding | 75 | 1.10 | ||
| GO:0006355 | BP | Regulation of transcription, DNA-dependent | 68.8 | 1.70 | ||
| GO:0051252 | BP | Regulation of RNA metabolic process | 68.8 | 1.90 | ||
| GO:0016564 | MF | Transcription repressor activity | 31.2 | 2.90 | ||
| GO:0000122 | BP | Negative regulation of transcription from RNA polymerase II promoter | 25 | 1.80 | ||
| GO:0009792 | BP | Embryonic development ending in birth or egg hatching | 25 | 7.50 | ||
| GO:0043009 | BP | Chordate embryonic development | 25 | 7.50 | ||
| GO:0045892 | BP | Negative regulation of transcription, DNA-dependent | 25 | 1.60 | ||
| GO:0051253 | BP | Negative regulation of RNA metabolic process | 25 | 1.80 | ||
| GO:0046982 | MF | Protein heterodimerization activity | 18.8 | 2.10 | ||
| GO:0021915 | BP | Neural tube development | 18.8 | 2.20 | ||
| GO:0016481 | BP | Negative regulation of transcription | 25 | 2.80 | ||
| GO:0007219 | BP | Notch signaling pathway | 18.8 | 3.10 | ||
| GO:0051172 | BP | Negative regulation of nitrogen compound metabolic process | 25 | 3.70 | ||
| GO:0045934 | BP | Negative regulation of nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | 25 | 3.70 | ||
| GO:0031327 | BP | Negative regulation of cellular biosynthetic process | 25 | 4.60 | ||
| GO:0010558 | BP | Negative regulation of macromolecule biosynthetic process | 25 | 4.60 | ||
| GO:0009890 | BP | Negative regulation of biosynthetic process | 25 | 5.00 | ||
| GO:0010629 | BP | Negative regulation of gene expression | 25 | 5.70 | ||
| GO:0006357 | BP | Regulation of transcription from RNA polymerase II promoter | 25 | 8.90 | ||
| GO:0010605 | BP | Negative regulation of macromolecule metabolic process | 25 | 9.50 | ||
| KEGG_Id:480089058 | KEGG pathway | TGF-beta signaling pathway | 18.8 | 2.40 | ||
| GO:0033504 | BP | Floor plate development | 12.5 | 7.90 | ||
| GO:0046983 | MF | Protein dimerization activity | 18.8 | 9.20 | ||
| GO:0048635 | BP | Negative regulation of muscle development | 12.5 | 1.20 | ||
| GO:0048634 | BP | Regulation of muscle development | 12.5 | 1.60 | ||
| KEGG_Id:480089056 | KEGG pathway | Notch signaling pathway | 12.5 | 4.60 | ||
|
| ||||||
| F | 4 | None | None | None | None | None |
|
| ||||||
| Orphan | 2 | None | None | None | None | None |
All GO annotations terms in the table were from gene ontology database (http://www.geneontology.org/). GO annotations included every layer of biological process, molecular function, cellular component category, and KEGG pathway. When a GO term and its sublayer GO are both enriched in a group significantly, only deeper layer GO annotation is shown in the table. BP: biological process; MF: molecular function. The above table showed the GO annotations enriched significantly (P < 0.05) in each group. aGO coherence of each group, measured as the percentage of genes in group covered by the GO category.
Figure 2STRING mapping profiles of protein interaction network (PIN) representing bHLH transcription factor protein interactions. Panel (a) showed the main figure of PIN profile and connectivity of hub proteins and the others. The protein interacting gene products are marked in blue and green lines. There are totally 68 hub proteins identified and many hub proteins created a tight network or a functional module within their protein families. Panel (b) magnified the implication of different connective lines with different data sources in the main figure.