| Literature DB >> 17352807 |
Juan-Ramon Martinez-Morales1, Thorsten Henrich, Mirana Ramialison, Joachim Wittbrodt.
Abstract
BACKGROUND: Development of the vertebrate head depends on the multipotency and migratory behavior of neural crest derivatives. This cell population is considered a vertebrate innovation and, accordingly, chordate ancestors lacked neural crest counterparts. The identification of neural crest specification genes expressed in the neural plate of basal chordates, in addition to the discovery of pigmented migratory cells in ascidians, has challenged this hypothesis. These new findings revive the debate on what is new and what is ancient in the genetic program that controls neural crest formation.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17352807 PMCID: PMC1868935 DOI: 10.1186/gb-2007-8-3-r36
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Gene phylogeny was explored using a sequential blast pipeline. (a) All known mouse proteins were sequentially blasted (cutoff value E = 10-4) against available databases and then classified according to their appearance into seven different categories: prokaryota (pro), eukaryota (euk), metazoa (met), deuterostomia (deu), chordata (cor), vertebrata (ver), and mammalia (mam). (b) The table shows the number of mouse genes assigned to each category compared with their estimated age in millions of years. (c) Graphical representation of the global gene phylogeny.
Frequency of GO terms for each group of 'new genes'
| GO ID | GO term | Count sample | Count total | |
|---|---|---|---|---|
| Prokaryota | ||||
| GO:0050875 | Cellular physiological process | 3,219 | 8,198 | 0 |
| GO:0008152 | Metabolism | 2,576 | 5,906 | 0 |
| GO:0044237 | Cellular metabolism | 2,369 | 5,566 | 0 |
| GO:0044238 | Primary metabolism | 2,192 | 5,312 | 0 |
| GO:0043170 | Macromolecule metabolism | 1,569 | 3,298 | 0 |
| GO:0044260 | Cellular macromolecule metabolism | 1,158 | 2,500 | 0 |
| GO:0019538 | Protein metabolism | 1,149 | 2,486 | 0 |
| GO:0044267 | Cellular protein metabolism | 1,138 | 2,469 | 0 |
| GO:0000166 | Nucleotide binding | 1,070 | 1,577 | 0 |
| GO:0016787 | Hydrolase activity | 1,037 | 1,876 | 0 |
| Eukaryota | ||||
| GO:0005622 | Intracellular | 1,820 | 6,664 | 0 |
| GO:0043226 | Organelle | 1,587 | 5,789 | 0 |
| GO:0043229 | Intracellular organelle | 1,586 | 5,785 | 0 |
| GO:0043227 | Membrane-bound organelle | 1,419 | 5,097 | 0 |
| GO:0043231 | Intracellular membrane-bound organelle | 1,417 | 5,092 | 0 |
| GO:0005634 | Nucleus | 1,054 | 3,267 | 0 |
| GO:0046914 | Transition metal ion binding | 644 | 1,791 | 0 |
| GO:0008270 | Zinc ion binding | 619 | 1,416 | 0 |
| GO:0004888 | Transmembrane receptor activity | 23 | 2,007 | 0 |
| GO:0043169 | Cation binding | 799 | 2,589 | 3.45 × e-85 |
| Metazoa | ||||
| GO:0016020 | Membrane | 1,768 | 6,163 | 0 |
| GO:0031224 | Intrinsic to membrane | 1,524 | 4,932 | 0 |
| GO:0016021 | Integral to membrane | 1,523 | 4,930 | 0 |
| GO:0007154 | Cell communication | 1,234 | 3,201 | 0 |
| GO:0007165 | Signal transduction | 1,211 | 3,059 | 0 |
| GO:0004872 | Receptor activity | 1,143 | 2,793 | 0 |
| GO:0007166 | Cell surface receptor linked signal transduction | 1,061 | 2,253 | 0 |
| GO:0004888 | Transmembrane receptor activity | 926 | 2,007 | 0 |
| GO:0007186 | G-protein coupled receptor protein signaling pathway | 906 | 1,763 | 0 |
| GO:0004930 | G-protein coupled receptor activity | 870 | 1,693 | 0 |
| Deuterostomia | ||||
| GO:0004931 | ATP-gated cation channel activity | 5 | 6 | 4.74 × e-05 |
| GO:0009607 | Response to biotic stimulus | 45 | 979 | 0.00100739 |
| GO:0006952 | Defense response | 44 | 950 | 0.00100739 |
| GO:0004800 | Thyroxine 5'-deiodinase activity | 3 | 3 | 0.002093473 |
| GO:0030106 | MHC class I receptor activity | 5 | 15 | 0.002209497 |
| GO:0006955 | Immune response | 35 | 736 | 0.002495027 |
| GO:0030178 | Negative regulation of Wnt receptor signaling pathway | 4 | 9 | 0.003585659 |
| GO:0042981 | Regulation of apoptosis | 16 | 246 | 0.003971402 |
| GO:0008430 | Selenium binding | 6 | 29 | 0.004113225 |
| GO:0008517 | Folic acid transporter activity | 3 | 4 | 0.004113225 |
| Chordata | ||||
| GO:0005911 | Intercellular junction | 38 | 131 | 5.96 × e-33 |
| GO:0005921 | Gap junction | 20 | 24 | 1.97 × e-29 |
| GO:0030054 | Cell junction | 38 | 164 | 2.28 × e-29 |
| GO:0005922 | Connexon complex | 17 | 18 | 2.57 × e-27 |
| GO:0005243 | Gap-junction forming channel activity | 17 | 18 | 2.57 × e-27 |
| GO:0015285 | Connexon channel activity | 17 | 18 | 2.57 × e-27 |
| GO:0005923 | Tight junction | 17 | 60 | 2.44 × e-14 |
| GO:0016327 | Apicolateral plasma membrane | 17 | 76 | 1.45 × e-12 |
| GO:0043296 | Apical junction complex | 17 | 76 | 1.45 × e-12 |
| GO:0005615 | Extracellular space | 74 | 2,021 | 7.43 × e-10 |
| Vertebrata | ||||
| GO:0005102 | Receptor binding | 130 | 507 | 0 |
| GO:0016503 | Pheromone receptor activity | 59 | 111 | 0 |
| GO:0005179 | Hormone activity | 53 | 115 | 0 |
| GO:0042221 | Response to chemical stimulus | 90 | 329 | 9.81 × e-79 |
| GO:0009628 | Response to abiotic stimulus | 92 | 414 | 2.94 × e-59 |
| GO:0005615 | Extracellular space | 230 | 2,021 | 1.24 × e-45 |
| GO:0005550 | Pheromone binding | 50 | 94 | 1.49 × e-38 |
| GO:0005125 | Cytokine activity | 52 | 212 | 5.02 × e-38 |
| GO:0005549 | Odorant binding | 50 | 99 | 3.45 × e-37 |
| GO:0001664 | G-protein-coupled receptor binding | 36 | 47 | 3.23 × e-36 |
| Mammalia | ||||
| GO:0005615 | Extracellular space | 198 | 2,021 | 6.14 × e-53 |
| GO:0005102 | Receptor binding | 80 | 507 | 1.79 × e-46 |
| GO:0005125 | Cytokine activity | 48 | 212 | 1.79 × e-46 |
| GO:0009607 | Response to biotic stimulus | 104 | 979 | 1.03 × e-30 |
| GO:0006952 | Defense response | 102 | 950 | 1.03 × e-30 |
| GO:0042742 | Defense response to bacteria | 34 | 70 | 2.51 × e-28 |
| GO:0009617 | Response to bacteria | 34 | 78 | 2.22 × e-26 |
| GO:0005126 | Hematopoietin/interferon-class (D200-domain) cytokine receptor binding | 20 | 33 | 6.10 × e-19 |
| GO:0008083 | Growth factor activity | 26 | 141 | 2.98 × e-18 |
| GO:0051707 | Response to other organism | 60 | 594 | 1.67 × e-15 |
The table summarizes the 10 most statistically overrepresented Gene Ontology (GO) annotations for genes belonging to each of the seven categories. We only considered GO terms for which P > 0.001 and count sample was above 15.
Neural crest genes compiled using Phenotype Ontology annotations (phenotypic information derived from mutant mice studies)
| Group | Gene |
|---|---|
| Deuterostomia | Brain derived neurotrophic factor |
| Fanconi anemia, complementation group A | |
| Fos-like antigen 2 | |
| Neurotropin 3 | |
| Noggin | |
| Purinergic receptor P2X, ligand-gated ion channel, 7 | |
| Rod outer segment membrane protein 1 | |
| Vertebrata | BCL2-like 11 (apoptosis facilitator) |
| Calcitonin/calcitonin-related polypeptide, alpha | |
| Cocaine and amphetamine regulated transcript | |
| Endothelin 1 | |
| Endothelin 3 | |
| Formin 1 | |
| Glial cell line derived neurotrophic factor | |
| Gonadotropin releasing hormone 1 | |
| Hermansky-Pudlak syndrome 6 | |
| Integrin, alpha 10 | |
| Islet amyloid polypeptide | |
| Leukocyte cell derived chemotaxin 1 | |
| Matrix Gla protein | |
| Melanoma inhibitory activity 1 | |
| Myelin protein zero | |
| Natriuretic peptide precursor type C | |
| Neuregulin 1 | |
| Neurturin | |
| Parathyroid hormone | |
| Parathyroid hormone-like peptide | |
| Phosphodiesterase 6G, cGMP-specific, rod, gamma | |
| Pro-opiomelanocortin-alpha | |
| Silver | |
| Tenomodulin | |
| Treacher Collins Franceschetti syndrome 1, homolog | |
| Chordata | Activating transcription factor 4 |
| Cbp/p300-interacting transactivator, with Glu/Asp-rich carboxy-terminal domain, 2 | |
| Claudin 14 | |
| Epilepsy, progressive myoclonic epilepsy, type 2 gene alpha | |
| Fos-like antigen 1 | |
| Gap junction membrane channel protein beta 6 | |
| Hyaluronan and proteoglycan link protein 1 | |
| Transforming growth factor, beta receptor III | |
| Mammalia | Adrenocortical dysplasia |
| Ameloblastin | |
| Amelogenin X chromosome | |
| BH3 interacting domain death agonist | |
| Colony stimulating factor 2 (granulocyte-macrophage) | |
| Harakiri, BCL2 interacting protein (contains only BH3 domain) | |
| Kit ligand | |
| Leptin | |
| Matrix extracellular phosphoglycoprotein with ASARM motif (bone) | |
| MyoD family inhibitor | |
| Nonagouti | |
| Oncostatin M | |
| Programmed cell death 1 | |
| TYRO protein tyrosine kinase binding protein | |
The first appearance of neural crest genes was then determined using the sequential blast pipeline (Figure 1). The table contains the complete name of neural crest genes emerging in deuterostomia, chordata, vertebrata and mammalia.
Figure 2Tissue-specific profiles of gene emergence. The accumulative number of emerging genes (y-axis) in the deuterostomia-mammalia evolutionary window (x-axis) is represented for different tissue-specific genetic programs. We termed these representations gene emergence plots. At the chordate-vertebrate transition the rate of gene emergence (ger) was estimated for the different genetic programs. (a) Using mouse phenotypic annotations we calculated ger values between chordata and vertebrata for each main phenotype structure in the database. Structures are highlighted from blue to yellow, according to decreasing values of ger. Neural crest derivative structures are present within the highest ger values (red box). (b) Plots of representative structures of each class of ger value: class I = ger > 3; class II = 3 > ger > 1.5; and class III: ger < 1.5.
Figure 3Gene emergence plots of neural crest derivatives. Graphs and gene emergence rate (ger) values associated both with (a) the total collection of neural crest genes and (b) the different bone, nervous system, and pigmentation derivatives.
Figure 4Emerging ligands control the specification of neural crest precursors. The progressive determination of neural crest (NC) precursors into different cell lineages is represented in the scheme with a code of colors. Superimposed on this, the collection of new growth factors appearing first in vertebrates is depicted. The role of each ligand in controlling the specification/survival of each particular neural crest derivative is indicated with a corresponding code of colors. alpha-MSH, alpha-melanocyte-stimulating hormone; End, endothelin; GDNF, glial-derived neurotropic factor; NT, neurotropin; Nppc, natriuretic peptide precursor.
Neural crest associated Pfam domains emerging in vertebrates
| Group | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Symbol | Gene | blast | pro | euk | met | deu | chr | ver | mam |
| Slc12a6 | Solute carrier family 12, member 6 | pro | AA_permease | - | - | - | - | KCl_Cotrans_1 | - |
| Apc | Adenomatosis polyposis coli | pro | - | Arm | APC_crr APC_15aa | - | - | EB1_binding APC_basic SAMP | - |
| Asph | Aspartate-beta-hydroxylase | pro | Asp_Arg_Hydrox | - | - | - | - | Asp-B-Hydro_N | - |
| Top2b | Topoisomerase (DNA) II beta | pro | DNA_topoisoIV DNA_gyraseB HATPase_c | - | - | - | - | DTHCT | - |
| Nef3 | Neurofilament 3, medium | pro | - | - | Filament | - | - | Filament_head | - |
| Nefl | Neurofilament, light polypeptide | pro | - | - | Filament | - | - | Filament_head | - |
| Cryab | Crystallin, alpha B | pro | HSP20 | - | - | - | - | Crystallin | - |
| Rabggta | Rab geranylgeranyl transferase, a subunit | pro | LRR_1 | LRR_2 PPTA | - | - | - | RabGGT_insert | - |
| Otx1 | Orthodenticle homolog 1 | euk | - | Homeobox | - | - | - | TF_Otx | - |
| Otx2 | Orthodenticle homolog 2 | euk | - | Homeobox | - | - | - | TF_Otx | - |
| Zfp98 | Zinc finger protein 98 | euk | zf-C2H2 | - | - | - | - | SCAN | - |
| Prph1 | Peripherin 1 | met | - | - | Filament | - | - | Filament_head | - |
| Gfra1 | Glial cell line derived neurotrophic factor family receptor alpha 1 | met | - | - | - | - | - | GDNF | - |
| Cdx1 | Caudal type homeo box 1 | met | - | Homeobox | - | - | - | Caudal_act | - |
| Cdx2 | Caudal type homeo box 2 | met | - | Homeobox | - | - | - | Caudal_act | - |
| Hoxb9 | Homeo box B9 | met | - | Homeobox | - | - | - | Hox9_act | - |
| Hoxa9 | Homeo box A9 | met | - | Homeobox | - | - | - | Hox9_act | - |
| Nr3c1 | Nuclear receptor subfamily 3, group C, member 1 | met | - | - | Hormone_recep zf-C4 | - | - | GCR | - |
| Pdgfa | Platelet derived growth factor, alpha | met | - | - | PDGF | - | - | PDGF_N | - |
| Bdnf | Brain derived neurotrophic factor | deu | - | - | - | - | - | NGF | - |
| Ntf3 | Neurotropin 3 | deu | - | - | - | - | - | NGF | - |
| P2rx7 | Purinergic receptor P2X, ligand-gated ion channel, 7 | deu | - | - | - | - | - | P2X_receptor | - |
| Hapln1 | Hyaluronan and proteoglycan link protein 1 | cor | - | - | V-set | - | - | Xlink | - |
| Nppc | Natriuretic peptide precursor type C | ver | - | - | - | - | - | ANP | - |
| Calca | Calcitonin/calcitonin-related polypeptide, alpha | ver | - | - | - | - | - | Calc_CGRP_IAPP | - |
| Iapp | Islet amyloid polypeptide | ver | - | - | - | - | - | Calc_CGRP_IAPP | - |
| Cart | Cocaine and amphetamine regulated transcript | ver | - | - | - | - | - | CART | - |
| Edn1 | Endothelin 1 | ver | - | - | - | - | - | Endothelin | - |
| Edn3 | Endothelin 3 | ver | - | - | - | - | - | Endothelin | - |
| Nrg1 | Neuregulin 1 | ver | I-set | EGF | ig V-set | - | - | Neuregulin | - |
| Pomc1 | Pro-opiomelanocortin-alpha | ver | - | - | - | - | - | Op_neuropeptide ACTH_domain | - |
| Pthlh | Parathyroid hormone-like peptide | ver | - | - | - | - | - | Parathyroid | - |
| Pth | Parathyroid hormone | ver | - | - | - | - | - | Parathyroid | - |
| Pde6g | Phosphodiesterase 6G, cGMP-specific, rod, gamma | ver | - | - | - | - | - | PDE6_gamma | - |
| a | Nonagouti | mam | - | - | - | - | - | Agouti | - |
| Osm | Oncostatin M | mam | - | - | - | - | - | LIF_OSM | - |
| Kitl | Kit ligand | mam | - | - | - | - | - | SCF | - |
The table summarizes a list of those neural crest genes having at least one Pfam domain appearing first in vertebrates. All the corresponding Pfam domains of these genes, when these domains have appeared, and the classification of the genes according to our previous sequential blast analysis (blast) are indicated. cor, chordata; deu, deuterostomia; euk, eukaryota; mam, mammalia; met, metazoa; pro, prokaryota; ver, vertebrata.
Temporal categories of downloaded genomes
| Group | Name | Genomes | Sequences | Source |
|---|---|---|---|---|
| Prokaryota | Archaea | 21 archaeal | 48625 pep | Cogent241 |
| Bacteria | 191 bacterial | 568028 pep | Cogent241 | |
| Eukaryota | Eukaryota | 3 (2 yeast, plasmodium) | 16597 pep | Cogent241 |
| Metazoa | Metazoa | 3 (2 insect, nematode) | 19957 pep | Cogent241 |
| Deuterostomia | Deuterostoma | 1 (sea urchin) | 527735 nuc | NCBI |
| Chordata | Urochordata | 1 (ciona) | 21574 pep | EnsEMBL_v31 |
| Cephalochordata | 1 (branchiostoma) | 321472 ests | NCBI | |
| Vertebrata | Vertebrata | 3 (fish genomes) | 93151 pep | EnsEMBL_v32 |
| Mammalia | Mouse | 1 (mouse) | 23658 pep | EnsEMBL_v31 |
est, expressed sequence tag; nuc, nucleotide; pep, peptide.