Literature DB >> 22208168

Ancient Pbx-Hox signatures define hundreds of vertebrate developmental enhancers.

Hugo J Parker1, Paul Piccinelli, Tatjana Sauka-Spengler, Marianne Bronner, Greg Elgar.   

Abstract

BACKGROUND: Gene regulation through cis-regulatory elements plays a crucial role in development and disease. A major aim of the post-genomic era is to be able to read the function of cis-regulatory elements through scrutiny of their DNA sequence. Whilst comparative genomics approaches have identified thousands of putative regulatory elements, our knowledge of their mechanism of action is poor and very little progress has been made in systematically de-coding them.
RESULTS: Here, we identify ancient functional signatures within vertebrate conserved non-coding elements (CNEs) through a combination of phylogenetic footprinting and functional assay, using genomic sequence from the sea lamprey as a reference. We uncover a striking enrichment within vertebrate CNEs for conserved binding-site motifs of the Pbx-Hox hetero-dimer. We further show that these predict reporter gene expression in a segment specific manner in the hindbrain and pharyngeal arches during zebrafish development.
CONCLUSIONS: These findings evoke an evolutionary scenario in which many CNEs evolved early in the vertebrate lineage to co-ordinate Hox-dependent gene-regulatory interactions that pattern the vertebrate head. In a broader context, our evolutionary analyses reveal that CNEs are composed of tightly linked transcription-factor binding-sites (TFBSs), which can be systematically identified through phylogenetic footprinting approaches. By placing a large number of ancient vertebrate CNEs into a developmental context, our findings promise to have a significant impact on efforts toward de-coding gene-regulatory elements that underlie vertebrate development, and will facilitate building general models of regulatory element evolution.

Entities:  

Mesh:

Year:  2011        PMID: 22208168      PMCID: PMC3261376          DOI: 10.1186/1471-2164-12-637

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Cis-regulatory elements play an essential role in the precise co-ordination of vertebrate development as illustrated by the increasing number of examples where mutations in such sequences lead to developmental malformations [1-3]. One of the major challenges in modern biology is the deciphering of the regulatory language, syntax and grammar, encoded in the genome, that directs spatio-temporally restricted gene expression. To achieve this requires the identification and functional characterisation of cis-regulatory elements, followed by the deconvolution of the TFBSs therein. Cis-regulatory elements can be predicted by sequence conservation analysis, as tight clusters of functional TFBSs can be under strong evolutionary constraint [4-7]. Alternatively, targeted approaches involving chromatin immunoprecipitation (ChIP) can be used to identify binding-events between specific transcription factors and DNA [8-10]. These are complementary approaches, as ChIP analyses are restricted to identifying regulatory regions that are targets of selected TFs at the particular time-points chosen for the analysis, whilst sequence conservation can identify elements irrespective of the TFs that bind to them or the developmental time-points at which they act. Sequence conservation can also provide evidence for ancient gene regulatory network (GRN) interactions that are shared between species. Comparative approaches applied to vertebrate genomes have identified a set of putative regulatory elements showing extreme conservation across mammals (Ultra-conserved elements [11]), as well as Conserved Non-coding Elements (CNEs) shared between mammals and fishes [6,12,13]. These elements are clustered around developmental genes [6,12] and a large proportion of CNEs that have been tested in transgenic assays drive spatially restricted reporter gene expression in mouse or zebrafish embryos [6,7,14]. Furthermore, a number of CNEs have been shown to have roles in developmental diseases [1,15]. Despite their high sequence conservation between vertebrates, only a minute fraction of CNEs can be traced back to invertebrate chordates [16]. Thus, CNEs represent a set of cis-regulatory elements that are likely to be fundamental during development of the vertebrate body plan and comprise a valuable resource for deciphering the genomic regulatory code for vertebrate development. Phylogenetic footprinting has been successfully implemented to identify TFBSs that play key roles in the action of individual CNEs [17-19]. However, there have been very few studies seeking to place large numbers of deeply conserved CNEs into a developmental context through using this approach [20,21]. Furthermore, despite progress having been made identifying key sequence motifs within vertebrate promoter elements [22], ancient CNEs have remained somewhat recalcitrant to systematic motif-identification algorithms, despite some elegant targeted approaches. Within non-coding elements conserved amongst mammals, a large number of long motifs [12-22 nucleotides) [23] and some shorter motifs [24] have been identified as overrepresented. However, the majority of these were not matched to any known factors, nor linked to any patterns of enhancer activity, so the biological significance of these motifs is hard to interpret. Recently, an elegant study used a classifier algorithm to identify sequence motifs predictive of heart enhancer activity in mammalian CNEs [25]. However, mammalian CNEs represent a set of sequences that only partially overlap with the more ancient mammal-fish CNEs and it is not clear to what extent they are functionally and mechanistically alike. Studies seeking to identify motifs that contribute to tissue-specific expression of deeply conserved vertebrate CNEs have discovered novel motifs associated with forebrain enhancer activity [7,21]. As part of a large-scale project to characterise the in-vivo enhancer activity of CNEs, Pennacchio et al. [7] used four human-fugu CNEs that drove forebrain reporter expression in mouse embryos to identify 6 enriched 5 bp-long sequence motifs. 23 elements enriched for these motifs were tested for enhancer activity, of which 4 were found to drive forebrain expression - an enrichment for this expression domain compared to the original enhancer set. Li et al. [21] characterised 13 CNEs driving forebrain reporter expression in zebrafish embryos, identifying 5 enriched motifs of 6 bp and demonstrating that these sequences contributed to forebrain enhancer activity. These investigations go some way towards providing a developmental context for the CNEs with those motifs, but this is somewhat limited by the factors that bind to them remaining uncharacterised. Whilst it is unclear to what extent ancient vertebrate CNEs are composed of 'conventional', previously characterised TFBSs, candidate motif search approaches have provided evidence that mammalian UCEs are enriched for known TFBS motifs [26] and that ancient vertebrate CNEs associated with genes involved in CNS development show enrichment for Oct and Sox motifs [20]. The success of these isolated studies hints that it may be possible to systematically identify functional TFBSs within CNEs by phylogenetic footprinting. The sea lamprey (Petromyzon marinus) hails from an anciently diverging jawless vertebrate lineage, the agnathans, which split from the jawed vertebrate lineage 550-650 million years ago [27]. We have previously found a significant number of CNEs that are conserved between lamprey and jawed vertebrates [28]. We predicted that the relatively low sequence identity between the lamprey and jawed-vertebrate homologous elements would facilitate the identification of conserved TFBS motifs within them. In addition, characterisation of these motifs could illuminate ancient GRN interactions common to all vertebrates. Thus, we sought to identify TFBS motifs in CNEs by performing phylogenetic footprinting, using the lamprey elements as a guide. Here we identify deeply conserved TFBS motifs for the Pbx-Hox heterodimer within a cluster of CNEs associated with the meis2 gene. We use in-silico analyses to demonstrate that jawed vertebrate CNEs and other sets of conserved vertebrate enhancers are highly enriched in Pbx-Hox motifs. Using reporter assays in zebrafish and lamprey embryos, we show that these motifs correlate with enhancer function in the hindbrain and pharyngeal arches. These results represent a further step toward de-coding vertebrate CNEs, allowing a large proportion of them to be more firmly placed into a developmental context and revealing ancient gene regulatory network interactions for hindbrain patterning that are shared across vertebrates. Finally, our findings enable us to hypothesise an evolutionary scenario regarding the role of many CNEs in the evolution of the vertebrate hindbrain and the branchial region of the head.

Results

A set of meis2 CNEs drive expression in the hindbrain and cranial ganglia in zebrafish and lamprey embryos

We previously identified a genomic region, downstream of the developmental gene meis2, containing a number of CNEs that are conserved between jawed vertebrates and lamprey [28] (Additional File 1). We grouped these CNEs into five separate elements (Additional File 2) for functional testing in a zebrafish tol2 reporter assay [29]. Four of these elements drive discreet and complementary patterns of reporter expression in the hindbrain of zebrafish embryos, with homologous zebrafish and lamprey elements driving highly similar expression patterns (Figure 1). These patterns of reporter expression are consistent with the endogenous expression of meis2 in the hindbrain [30,31], where Meis proteins play a crucial patterning role by interacting with Hox and Pbx transcription factors [32,33]. Lamprey and zebrafish CNE 3285 elements both drive GFP expression in the cranial ganglia and primary neurons of the hindbrain and spinal cord. CNE 3288 elements of zebrafish and lamprey drive GFP in neurons of the hindbrain posterior to rhombomere 4 (r4), as shown by comparison to RFP expression in r3 and r5 in a transgenic line containing RFP under the control of a krox20 regulatory element [34]. CNE 3299 elements up-regulate GFP in the anterior hindbrain - r2-4 for the lamprey homolog and r3-4 plus neural crest migrating into the hyoid pharyngeal arch for the zebrafish homolog. CNE 329X of lamprey and zebrafish both drive GFP expression in the anterior hindbrain and neurons of the midbrain.
Figure 1

Meis2 CNEs from zebrafish and lamprey drive equivalent expression patterns in zebrafish and lamprey embryos. A, multiple alignment of orthologous genomic regions containing the gene c15orf41 (blue peak), downstream of meis2, revealing CNEs (red peaks). Human, zebrafish and lamprey sequences are aligned with the fugu sequence as a baseline. Zebrafish CNE 329X is translocated in the current zebrafish genome assembly so does not appear in this alignment. B-M, orthologous elements from lamprey (B-G) and zebrafish (H-M) drive similar GFP expression patterns in the nervous system of zebrafish embryos at 54hpf: element 3285 in the cranial ganglia (arrows) and primary neurons of the hindbrain and spinal cord (arrowhead) (B, H); 3288 in neurons of the hindbrain posterior to rhombomere (r) 4 (C, I), as determined by comparison with r3r5 RFP expression (D, J); 3299 in the anterior hindbrain - r2-4 for the lamprey homolog (E, F) and r3-4 plus the corresponding neural crest for the zebrafish homolog (K, L); 329X in the hindbrain and neurons of the midbrain (G, M). N-O , embryonic day 14-15 lamprey embryos transgenic for lamprey elements 3285 (N) and 3299 (O) show GFP expression in the cranial ganglia (arrowheads) and anterior hindbrain respectively, consistent with their expression in zebrafish (3285: B, 3299: E). P-Q, dorsal views of the head of lamprey (P) and zebrafish (Q) embryos transgenic for lamprey element 3299. cg: cranial ganglia; hb: hindbrain; mb: midbrain; nc: neural crest; sc: spinal cord.

Meis2 CNEs from zebrafish and lamprey drive equivalent expression patterns in zebrafish and lamprey embryos. A, multiple alignment of orthologous genomic regions containing the gene c15orf41 (blue peak), downstream of meis2, revealing CNEs (red peaks). Human, zebrafish and lamprey sequences are aligned with the fugu sequence as a baseline. Zebrafish CNE 329X is translocated in the current zebrafish genome assembly so does not appear in this alignment. B-M, orthologous elements from lamprey (B-G) and zebrafish (H-M) drive similar GFP expression patterns in the nervous system of zebrafish embryos at 54hpf: element 3285 in the cranial ganglia (arrows) and primary neurons of the hindbrain and spinal cord (arrowhead) (B, H); 3288 in neurons of the hindbrain posterior to rhombomere (r) 4 (C, I), as determined by comparison with r3r5 RFP expression (D, J); 3299 in the anterior hindbrain - r2-4 for the lamprey homolog (E, F) and r3-4 plus the corresponding neural crest for the zebrafish homolog (K, L); 329X in the hindbrain and neurons of the midbrain (G, M). N-O , embryonic day 14-15 lamprey embryos transgenic for lamprey elements 3285 (N) and 3299 (O) show GFP expression in the cranial ganglia (arrowheads) and anterior hindbrain respectively, consistent with their expression in zebrafish (3285: B, 3299: E). P-Q, dorsal views of the head of lamprey (P) and zebrafish (Q) embryos transgenic for lamprey element 3299. cg: cranial ganglia; hb: hindbrain; mb: midbrain; nc: neural crest; sc: spinal cord. We have developed a parallel reporter assay in lamprey embryos (in submission) to assess the functional conservation of CNEs across vertebrates. Using this assay, we have tested lamprey CNEs 3285 and 3299 for enhancer activity during lamprey embryogenesis. In lamprey embryos, CNE 3285 drives GFP expression in the cranial ganglia and CNE 3299 in the anterior hindbrain (Figure 1). Thus, for both of these elements, the pattern of reporter expression driven in lamprey embryos is almost identical to the pattern driven in zebrafish embryos (Figure 1). This provides compelling evidence that these CNEs are part of a gene-regulatory network for hindbrain patterning that is conserved across all vertebrates.

Some meis2 CNEs contain deeply conserved Pbx-Hox TFBS motifs

Because of its clear and specific expression pattern in the hindbrain of zebrafish and lamprey embryos, we chose element 3299 as a starting point for the identification of putative transcription-factor binding-sites by phylogenetic footprinting. A number of studies have documented a role for the anterior Hox proteins in regulating rhombomere-specific gene expression by binding as hetero-dimers and -trimers with the TALE-class homeodomain proteins Pbx and Meis [17,18,35,36]. These complexes bind to characteristic binding-sites composed of partially overlapping Pbx-Hox half-sites, frequently in conjunction with a distal Meis/Prep site [17,18,35,36]. In some cases it has been shown that the pbx-hox motif is both necessary and sufficient for highly specific patterns of reporter expression, for instance for activity of a mouse hoxb1 enhancer in r4 in the mouse hindbrain [17] and for r4 and pharyngeal arch activity of a mouse hoxb2 enhancer [36]. We identified two Pbx-Hox motifs within CNE 3299, conforming to the TGATNNAT consensus [37,38], that are conserved across all sequenced vertebrate genomes, each closely associated with conserved Meis motifs (TGACAG/A) [39] (Figure 2). In the zebrafish sequence, the first pair of Pbx-Hox and Meis motifs is also preceded by a Pbx-Meis motif (TGATTGACAG/A) [39]. We verified the essential nature of these motifs for rhombomere-specific activity of the enhancer through mutagenesis of the zebrafish element followed by reporter assay (Methods). Mutating the first cluster of motifs (sub1) resulted in a loss of the neural crest expression of the wild type enhancer and less anteriorly restricted expression in the hindbrain compared to the wild type element (Figure 2b, c). Mutation of the second Pbx-Hox and Meis motif cluster abrogated reporter expression by this enhancer altogether, whilst a construct in which both motif clusters were mutated (sub12) also drove no GFP expression. Interestingly, CNEs 3285, 3288 and 329X were also found to harbour conserved Pbx-Hox and Meis motifs. Together, the expression patterns of these elements in the hindbrain (3285, 3288, 3299, 329X), as well as in the pharyngeal arch neural crest (3299), suggests that these motifs may represent a common feature of CNEs that drive segment-specific expression patterns in the vertebrate hindbrain and pharyngeal arches.
Figure 2

CNE 3299 harbours essential conserved Pbx-Hox and Meis binding-sites. A, multiple sequence alignment of a region of CNE 3299 from human, zebrafish and lamprey highlighting conserved Pbx-Hox (blue box) and Meis (green box) binding-site motifs. The specific sites mutated in elements sub1 and sub2 are indicated below the alignment. B-C, compared to wild-type 3299 expression (B), mutating the first Pbx-Hox and Meis motif cluster (sub1) results in the loss of reporter expression in the neural crest (arrow) and broader expression in the hindbrain (arrowhead) (C). nc: neural crest.

CNE 3299 harbours essential conserved Pbx-Hox and Meis binding-sites. A, multiple sequence alignment of a region of CNE 3299 from human, zebrafish and lamprey highlighting conserved Pbx-Hox (blue box) and Meis (green box) binding-site motifs. The specific sites mutated in elements sub1 and sub2 are indicated below the alignment. B-C, compared to wild-type 3299 expression (B), mutating the first Pbx-Hox and Meis motif cluster (sub1) results in the loss of reporter expression in the neural crest (arrow) and broader expression in the hindbrain (arrowhead) (C). nc: neural crest.

Pbx-Hox motifs are enriched in CNEs and in other sets of conserved vertebrate enhancers

In order to address how widespread Pbx-Hox motifs are across conserved vertebrate enhancers, we performed a systematic scan for these motifs in vertebrate CNEs. We searched for instances of the canonical Pbx-Hox motif, TGATNNAT, that are completely conserved across CNE multiple sequence alignments. In a set of 246 alignments of CNEs between human, zebrafish, fugu and lamprey (Methods), we identified 61 conserved motifs, representing 22 fold enrichment over shuffled alignments (Methods and Additional File 3). Furthermore, in a set of 4259 gnathostome CNE alignments (of human, fugu and zebrafish sequences), 712 conserved motifs were identified; a 9 fold enrichment compared to shuffled alignment controls. Further analysis of Pbx-Hox motifs in the gnathostome set reveals a paucity of cytosines at variable positions 5 and 6 (Figure 3). This is a feature of characterised Pbx-Hox binding-sites, where T, A or G at these positions contribute to determining the Hox specificity of the binding site [38,40,41]. Furthermore, positions 9 and 10, immediately 3' to the canonical Pbx-Hox motif, show strong bias towards G/T and A/G respectively, thereby defining a more stringent TGATNNATKR (KR) consensus motif that is also consistent with previously characterised Pbx-Hox binding-sites [17,18,37,38] (Figure 3). Further analysis of the lamprey and gnathostome CNE alignment sets results in even stronger enrichment for this 'KR' motif (Additional File 3).
Figure 3

Pbx-hox motifs in CNEs strongly resemble verified PBX-HOX binding-sites. Position frequency logos generated from Gnathostome alignments (based on 712 conserved human TGATNNAT motifs in 4529 CNE alignments), Human CNEs (generated from the CONDOR CNE set using Cis-finder [41]) and from previous studies [36] (Literature). The relative base frequencies at positions 5 and 6, and 9 and 10, in CNEs, are in good agreement with known functional Pbx-Hox binding sites, supporting a strong KR consensus.

Pbx-hox motifs in CNEs strongly resemble verified PBX-HOX binding-sites. Position frequency logos generated from Gnathostome alignments (based on 712 conserved human TGATNNAT motifs in 4529 CNE alignments), Human CNEs (generated from the CONDOR CNE set using Cis-finder [41]) and from previous studies [36] (Literature). The relative base frequencies at positions 5 and 6, and 9 and 10, in CNEs, are in good agreement with known functional Pbx-Hox binding sites, supporting a strong KR consensus. We complemented our 'bottom-up' search for Pbx-Hox motifs in CNEs with a 'top-down' de novo motif search using the tool Cis-Finder [42]. Strikingly, one of the top-scoring predicted motifs identified by Cis-Finder matches our consensus KR motif for a set of 6, 693 human sequences from the CONDOR CNE database [43] ('CONDOR CNEs') (Figure 3, Methods). The KR motif occurs 562 times in this CNE set, representing a highly significant enrichment over shuffled versions of the motif (p = 5.7 × 10-5), and when compared to control genomic regions and the entire human genome (Table 1, Methods and Additional File 4). Interestingly, the Meis motif is also significantly enriched in the CONDOR CNE set (p = 1.0 × 10-4)(Additional File 5). We then examined the distribution of KR motifs in other sets of evolutionarily conserved non-coding sequences. The VISTA Enhancer Browser (EB) [44] contains over 1300 human sequences, around half of which drive reporter gene expression in mouse embryos at day 11.5. There is a significant enrichment for the KR motif (p = 0.0033) across the entire dataset compared with shuffled versions despite the fact that some of the sequences in EB are not deeply conserved (Table 1). Finally, we analysed a large set of deeply conserved human CNEs identified through comparison with the cartilaginous chimera, Callorhinchus milii [13], and once again found significant enrichment for the KR motif (p = 6.2 × 10-5) (Table 1 - 'Shark CNEs').
Table 1

Frequency of KR motifs, compared to shuffled versions, in different test sets

MotifCONDOR CNEs (from [57])Human:Shark CNEs (from 13]VISTA EB (all) [43]VISTA EB set HB/BA/CN+ve (from [43])VISTA EB set HB +ve (from [43])Zebrafish CBset all [21]Zebrafish CB set HB +ve (from [21])
TGATNNATKR562666609161131177
TGTANNATKR1711883886552123
GTATNNATKR150168279543992
GTTANNATKR150178325796582
TTGANNATKR200245447806491
TTAGNNATKR167238398745570
ATGTNNATKR2592974528672201
ATTGNNATKR2332974367461202
AGTTNNATKR215254431856890
TAGTNNATKR1471542975442104
TATGNNATKR1761983657460103
GATTNNATKR2743154199774110
TGATNNTAKR106143314655061
TGTANNTAKR1421514218260141
GTATNNTAKR5973195413410
GTTANNTAKR105108253503350
TTGANNTAKR1622053857262100
TTAGNNTAKR7397235413100
ATGTNNTAKR103124376645530
ATTGNNTAKR136158305574261
AGTTNNTAKR85121320645051
TAGTNNTAKR6669198372710
TATGNNTAKR8494345806251
GATTNNTAKR144177292584220
mean165.38196.58353.5470.5855.468.331.25
S.D102.75122.0693.7324.7420.925.521.67
z-score for pbxhox3.863.842.723.653.611.573.43
p-value5.68E-056.16E-053.30E-031.00E-042.00E-04N/S3.00E-04

Enrichment analysis for Pbx-Hox KR motifs, relative to shuffled versions (retaining G+C content for each binding site), within different sets of CNEs. CNEs from the VISTA enhancer browser (EB) and zebrafish cneBrowser (CB) sets have also been grouped according to annotated expression in the hindbrain (HB), branchial arches (BA) or cranial nerves (CN). All sequences are human except the Zebrafish cneBrowser set. N/S = not significant

Frequency of KR motifs, compared to shuffled versions, in different test sets Enrichment analysis for Pbx-Hox KR motifs, relative to shuffled versions (retaining G+C content for each binding site), within different sets of CNEs. CNEs from the VISTA enhancer browser (EB) and zebrafish cneBrowser (CB) sets have also been grouped according to annotated expression in the hindbrain (HB), branchial arches (BA) or cranial nerves (CN). All sequences are human except the Zebrafish cneBrowser set. N/S = not significant

Pbx-Hox motifs are associated with hindbrain and pharyngeal arch CNE enhancer function

Next, we tested whether Pbx-Hox motifs within CNEs associate with segment-specific reporter expression in the hindbrain and pharyngeal arches. To do this, we assayed 21 zebrafish CNEs containing conserved Pbx-Hox motifs for reporter expression in zebrafish. All of these CNEs are conserved across gnathostomes, with 11 also identifiable in lamprey (Additional File 2). Elements were chosen to represent a range of different genes from the lamprey and gnathostome CNE sets. 12 of these 21 elements consistently up-regulate patterns of reporter expression, comprised of 8 from the lamprey set and 4 from the gnathostome set. It should be noted that some of the elements from which no consistent expression patterns were obtained may act as enhancers in-vivo, but not in our transient transgenic reporter assay, possibly due to being taken out of their genomic context. Remarkably, 11 of the 12 GFP-expressing elements (91.6%) drive expression either in the hindbrain, pharyngeal arches or both, with one element expressing in the trunk musculature (Figure 4). In support of the hypothesis that these elements are directly regulated by specific Hox proteins, which have segmentally-restricted expression patterns, the majority of the elements expressing in the hindbrain do so in particular rhombomeres, as shown by comparison with r3r5 RFP expression (Figure 4). Hindbrain reporter expression driven by these elements is often further restricted dorso-ventrally (e.g. Nkx6-1_4281), medio-laterally (e.g. Pax2_217) and temporally (e.g. Tshz3_43509).
Figure 4

Pbx-Hox motifs correlate with segment-specific hindbrain and pharyngeal arch reporter expression. A-R, zebrafish elements from the lamprey (A-J, M, O, Q) and jawed vertebrate (K, L, N, P, R) CNE sets drive GFP expression in the hindbrain and pharyngeal arches. Elements: Evi1_40224 (A, B), Tshz3_43509 (C, D), NR2F2_27254 (E, F), Pax2_217 (G, dorsal view: H), ZNF503_32799 (I, J), Nkx6-1_4281 (K, L), Tshz3_24804 (M), Pax9_2099 (N), TshZ3_24805-6 (O), FoxP1_886 (P), Tshz3_24807 (Q), BCL11A_2554 (R). Expression in the hindbrain is often restricted to certain rhombomeres, as shown by comparison with r3r5 RFP expression (B, D, F, H, J, L). Tshz3_24807 drives expression in the trunk musculature (Q). Elements show temporal variation in reporter expression, expressing most strongly at 24-30hpf (C, D), 48-54hpf (A, B, E, F, I, J, Q) or 72-78hpf (G, H, K, L, M, N, O, P, R). hb: hindbrain; pa: pharyngeal arches; m: muscle.

Pbx-Hox motifs correlate with segment-specific hindbrain and pharyngeal arch reporter expression. A-R, zebrafish elements from the lamprey (A-J, M, O, Q) and jawed vertebrate (K, L, N, P, R) CNE sets drive GFP expression in the hindbrain and pharyngeal arches. Elements: Evi1_40224 (A, B), Tshz3_43509 (C, D), NR2F2_27254 (E, F), Pax2_217 (G, dorsal view: H), ZNF503_32799 (I, J), Nkx6-1_4281 (K, L), Tshz3_24804 (M), Pax9_2099 (N), TshZ3_24805-6 (O), FoxP1_886 (P), Tshz3_24807 (Q), BCL11A_2554 (R). Expression in the hindbrain is often restricted to certain rhombomeres, as shown by comparison with r3r5 RFP expression (B, D, F, H, J, L). Tshz3_24807 drives expression in the trunk musculature (Q). Elements show temporal variation in reporter expression, expressing most strongly at 24-30hpf (C, D), 48-54hpf (A, B, E, F, I, J, Q) or 72-78hpf (G, H, K, L, M, N, O, P, R). hb: hindbrain; pa: pharyngeal arches; m: muscle. We next examined functional data from the VISTA Enhancer Browser (EB). Compared to shuffled motifs, the KR motif was found to be significantly enriched in those elements annotated as hindbrain positive as well as those positive for either hindbrain, branchial arch or cranial nerve expression (Table 1). Investigating those EB elements that overlap CNEs from the CONDOR set, we found significant enrichment for the KR motif in those with hindbrain expression (HB+, 64 motifs in 112 kb) compared with those with no hindbrain annotation (HB-, 85 in 238 kb) (chi-square p = 0.0042). We then focused upon those sub-regions within EB enhancers that align directly with CNEs. Within these deeply conserved regions, there was more than two-fold enrichment for the stringent Pbx-Hox motif (30 occurrences in 24990 bp of HB+ elements compared with 32 occurrences in 60341 bp of HB- elements; p = 0.001). Importantly, this enrichment demonstrates that Pbx-Hox motifs in ancient CNEs show a correlation with hindbrain reporter expression. We also analysed a smaller dataset from the cneBrowser [21] that contains evolutionarily conserved enhancers associated with genes expressed in forebrain and hindbrain during zebrafish development. Although only 18 of 146 enhancers are annotated as hindbrain positive, 7 out of a total of 17 identified KR motifs reside in hindbrain positive enhancers (p = 3 × 10-4) (Table 1).

CNEs containing Pbx-Hox motifs are associated with genes that have roles in A-P patterning of the hindbrain and head

We have examined the distribution of Pbx-Hox motifs across CNEs of different genes, to ask whether genes with the highest enrichment of Pbx-Hox motifs in their CNEs have roles in hindbrain or pharyngeal arch patterning (Table 2 and Additional file 6). In keeping with the common use of auto-regulation in gene-regulatory networks [45], we find the CNEs of the HOXD cluster and the Hox co-factors, PBX3 and MEIS2, to be amongst those with the highest number of these motifs. Many of the other genes with the highest density of Pbx-Hox motifs in their CNEs have characterised roles in anterior-posterior (A-P) head patterning and show segment specific patterns of expression during development. For instance, the ZNF503/703 (Nlz1 and Nlz2) zinc-finger proteins are essential for specification of rhombomere 4 in zebrafish [46,47]. The orphan nuclear receptor genes NR2F1/2 (COUP-TF1/2) are negative transcriptional regulators involved in the retinoic acid signalling pathway, which has a key role in A-P patterning of the hindbrain and pharyngeal arches [48]. The members of the teashirt protein family (TSHZ1, 2 and 3) show segment-specific hindbrain expression [49], Tshz1 being essential for segmentally restricted gene expression in the hindbrain and pharyngeal arches of frog and mouse [50,51].
Table 2

Frequency of KR motifs in CNEs at different gene loci

GENE# KR motifs in test setLength of CNE seq for locus (kb)#hits per kb# hits in control set (mean)standard deviationz-scorep-value
ZNF5033627.7811.303.181.7618.620.00E+00
TSHZ33023.3231.293.091.7715.230.00E+00
IRX52737.0590.735.392.339.290.00E+00
IRX22123.9810.883.101.809.950.00E+00
TSHZ11610.3511.551.631.3210.930.00E+00
PBX31617.8860.891.891.3510.440.00E+00
HOXD91617.770.902.191.449.590.00E+00
NR2F21618.990.842.521.598.490.00E+00
NR2F11625.6550.623.721.846.672.53E-11
MEIS21624.5530.653.421.916.594.49E-11
ZFHX1B1323.2750.563.131.725.739.86E-09
SALL31211.4051.051.431.218.760.00E+00
FOXP11215.8570.761.731.248.252.22E-16
MAF117.3341.501.151.108.950.00E+00
NKX6-1106.8531.460.820.929.940.00E+00

Details are shown for the 15 gene loci from the CONDOR CNE set with the highest number of Pbx-Hox KR motifs in their CNEs, showing enrichment relative to shuffled CNE sets (Methods and Additional File 6). For each gene locus, the number of Pbx-Hox KR motifs in the associated CNEs is given. The number of Pbx-Hox KR motifs per kb of CNE sequence for each locus (column 4) is calculated by dividing the number of Pbx-Hox KR motifs in the CNEs of that locus (column 2) by the total combined length of the CNEs in that locus (column 2). Control sets were generated by zero order Markov shuffling of CNEs at each locus in 1000 randomisations (Methods). Some gene loci also contain other genes besides the one after which they are named, for instance the IRX5 locus contains Irx3, Irx5 and Irx6.

Frequency of KR motifs in CNEs at different gene loci Details are shown for the 15 gene loci from the CONDOR CNE set with the highest number of Pbx-Hox KR motifs in their CNEs, showing enrichment relative to shuffled CNE sets (Methods and Additional File 6). For each gene locus, the number of Pbx-Hox KR motifs in the associated CNEs is given. The number of Pbx-Hox KR motifs per kb of CNE sequence for each locus (column 4) is calculated by dividing the number of Pbx-Hox KR motifs in the CNEs of that locus (column 2) by the total combined length of the CNEs in that locus (column 2). Control sets were generated by zero order Markov shuffling of CNEs at each locus in 1000 randomisations (Methods). Some gene loci also contain other genes besides the one after which they are named, for instance the IRX5 locus contains Irx3, Irx5 and Irx6. There is good agreement between the genes highlighted by our in-silico binding-site search and by microarray screens for downstream targets of hoxb1 in rhombomere 4 of zebrafish [52] and mouse [53]. Specifically, the expression levels of znf503, tshz2, evi1, zic4, shox, and meis2.1 are decreased upon knock-down of HoxB1 in zebrafish, with Znf503, Nkx6-1, Atbf1 and Mab21l2 down-regulated in HoxB1-/- mouse embryos. Accordingly, the CNEs around each of these genes are enriched in Pbx-Hox motifs (Table 2 and Additional File 6). Thus, both microarray datasets are consistent with our prediction that Pbx-Hox motifs in CNEs represent direct regulatory links between Hox genes and their targets during development.

Discussion

Discovery of Pbx-Hox motif enrichment is a further step toward de-coding CNEs

Despite a pervasive assumption that CNEs bind transcription factors in order to elicit gene activation, there is, perhaps surprisingly, very little direct evidence to confirm this. We sought to identify TFBS motifs in CNEs through phylogenetic footprinting, reasoning that the relatively high divergence of lamprey CNEs would highlight important motifs. The utility of this approach is confirmed by our identification of conserved Pbx-Hox and Meis TFBS motifs in CNEs. The enrichment of the Pbx-Hox TFBS motif in the jawed vertebrate CNE set reveals this motif to be a regulatory signature that is utilised by a large proportion of highly conserved cis-regulatory elements (the 6, 693 CONDOR CNEs contain 562 KR motifs and 1416 TGATNNAT motifs). Whilst enriched motifs identified in mammalian conserved elements include a few that show partial overlap with variants of the Pbx-Hox consensus motif [23,24], the link between those enriched motifs and Hox factors had not been made, and their strong enrichment in more ancient CNEs had not been characterised. This enrichment agrees with the crucial, conserved roles of Hox factors in development of the vertebrate body plan. Indeed, the association of these motifs with hindbrain and pharyngeal arch enhancer function is in keeping with the characterised roles of Pbx, Hox and Meis factors in patterning these domains. Despite the crucial roles of Hox factors in patterning the vertebrate embryo, relatively few downstream target genes, other than the hox genes themselves, have been identified. Our data suggests that Pbx-Hox motifs in CNEs can identify such targets. The striking manner in which Pbx-Hox and Meis TFBS motifs are highlighted as conserved sequence blocks in multiple alignments, especially when lamprey sequences are included, leads us to predict that this footprinting approach will be useful for further deciphering the regulatory code within vertebrate enhancers. In combination, our in-silico and functional analyses form an important link between well characterised cis-regulatory motifs and a large proportion of relatively uncharacterised ancient CNEs, helping to better place these elements within a developmental context. This represents a significant further step in systematically de-coding the enhancers responsible for development of the vertebrate body plan and highlights the utility of the lamprey as a model system for investigating vertebrate gene regulation. The diversity of expression patterns driven by our tested elements suggests that Pbx-Hox TFBSs are just one component of a complex cis-regulatory logic encoded within these enhancers. Whilst responding to A-P patterning cues by interacting with particular Hox factors through Pbx-Hox TFBSs, these elements concomitantly determine the tissues in which they are active (e.g. hindbrain vs pharyngeal arch) and limit the expression patterns dorso-ventrally, medio-laterally and temporally. An example of this is the CNE Pax2_403, which drives reporter expression that is restricted to a ventro-lateral population of neurons in r2-3 of the hindbrain (Figure 4). Furthermore, whilst some of our functionally characterised CNEs drive reporter expression in domains with sharp boundaries that are co-incident with rhombomere boundaries -similar to that of previously characterised Pbx-Hox regulated elements - this is not the case for all of them. This could be due to the Pbx-Hox input establishing a competence for the enhancer to drive expression within particular rhombomeres, which is further restricted to specific sub-domains within the rhombomeres by the influence of other regulatory inputs to the enhancer. This would result in expression domains that do not encompass the whole area of expression of the regulating Hox factor. It is likely that the reason why many previously characterised Pbx-Hox regulated elements show expression domains across whole rhombomeres and with tight boundaries co-incident with rhombomere boundaries is that the majority of these elements are regulating Hox factors, and thus setting up or maintaining the rhombomere-specific Hox expression patterns. Many of the elements described in this study may be acting downstream of this Hox network, utilising these AP patterning cues along with other cues to further pattern the hindbrain. The tissue specificity of these enhancers, as well as the restriction of expression to specific domains and time points, is presumably due to other factors acting as specifiers by binding to nearby TFBSs. Identifying these specifiers and characterising their TFBSs, as well as the nature of their interactions with Hox factors, are key tasks toward understanding the cis-regulatory logic underlying vertebrate development. The set of putative Hox-responsive cis-regulatory elements identified in this study provides a powerful resource that will facilitate efforts toward this end. Our expression data from the mutated versions of zebrafish CNE 3299 suggests that the multiple Pbx-Hox and Meis sites predicted in this enhancer may interact with each other, to co-operatively modulate and restrict reporter expression. The two clusters of Pbx-Hox and Meis motifs do not contribute equally to the expression driven by this enhancer in the hindbrain and pharyngeal arch neural crest. The second Pbx-Hox and Meis motif cluster appears to be necessary for the general function of this enhancer, as its mutation results in the loss of reporter expression in both hindbrain and neural crest. In contrast, the first Pbx-Hox and Meis motif cluster appears to be necessary (but not sufficient) for neural crest expression, but not for hindbrain expression. Conversely, it appears to restrict the hindbrain expression, as reporter expression is seen more anteriorly when this cluster is mutated. This is reminiscent of interactions between Pbx-Hox and Meis/Prep binding sites within a Hoxb1 enhancer, which direct expression of this gene to r4 the hindbrain in mouse and chick [54]. In that case, it was found that the formation of a Pbx-Hox-Meis/Prep ternary complex on Pbx-Hox and Meis sites within this enhancer could be restricted by the binding of a Pbx1-Prep1 heterodimer to a nearby site, thus limiting the expression driven by this enhancer to r4. This highlights the complexity of the regulatory interactions between transcription factors that are likely to bind to CNEs, a complexity that could well underlie their high sequence constraint.

A potential role for CNEs in the evolution of vertebrate head patterning

A strength of identifying conserved cis-regulatory elements is that they can provide compelling evidence for conserved GRNs. Our reporter assay data from zebrafish and lamprey embryos clearly demonstrate functional conservation of enhancers shared between the most distantly related extant vertebrate lineages. We deduce that all vertebrates share aspects of a GRN for hindbrain patterning, downstream of nested Hox expression. As the sea lamprey is from a vertebrate lineage that diverged prior to the evolution of many jawed vertebrate innovations, such as paired appendages and jaws [55], we predict that the lamprey reporter assay will be a crucial tool for investigating the gene regulatory changes involved in vertebrate evolution. Without detailed knowledge of the function or mechanism of action of CNEs it has been difficult to derive scenarios of how they evolved and became fixed in vertebrate genomes. The findings from our in silico and functional analyses, coupled with previous characterisation of Pbx-Hox and Meis transcription-factor complexes, enable us to propose a hypothesis regarding the role of a large number of CNEs in vertebrate evolution. Recognising the same TFBS motifs in worms, flies and vertebrates, the Pbx, Hox and Meis factors are part of an ancient regulatory language shared across bilaterians [17,36,37,56,57]. Nevertheless, none of the CNEs containing these motifs are identifiable in invertebrate genomes, leading us to speculate that many of these elements may have arisen in the vertebrate lineage. Accordingly, our functional data suggest that many of these CNEs have roles in patterning an elaborate head and brain - key vertebrate innovations [54]. We hypothesise that the fundamental role of head patterning in vertebrates led to the functional conservation of these elements and that their reliance upon the precise organisation of TFBSs necessitated their strict sequence conservation. The mechanisms through which new cis-regulatory elements arise in the genome are still largely unresolved. In this case, the finding that simple Pbx-Hox sites are sufficient to drive robust and specific, but modifiable, expression [17] hints that these particular TFBSs may pioneer new cis-regulatory elements, functioning as one of the fundamental seeds from which many CNEs were able to grow.

Conclusions

The finding that vertebrate CNEs are highly enriched for Pbx-Hox binding-site motifs represents a further step toward de-coding ancient vertebrate enhancers. Coupled with our experimental data, this enables a large proportion of these elements to be more firmly placed into a developmental context and reveals ancient gene regulatory network interactions for hindbrain and head patterning that were present in ancestral vertebrates. Finally, our findings lead us to hypothesise that the evolution of many of these CNEs contributed to the elaboration of the vertebrate hindbrain and the branchial region of the head.

Methods

Identification of CNEs

6, 693 non-redundant human CNEs (average length 116 bp) were retrieved from the CONDOR database [43] at http://condor.nimr.mrc.ac.uk (Additional File 7). We used these to search lamprey sequence reads available from the NCBI trace server at http://www.ncbi.nlm.nih.gov/Traces/trace.cgi with sensitive parameters (-W 7 -q -1 -e 5e-4) as described previously [28]. The lamprey trace sequences were searched because they represent a greater coverage of the lamprey genome than the publicly available draft genome assembly, which consists of many short contigs and thus provides little advantage with regard to identification of conserved syntenic regions. Lamprey sequences satisfying the initial parametric threshold were further analysed for contamination, and those with > 90% homology to human or chicken across the whole read (i.e. extending outside the evolutionarily conserved region in other vertebrates) were removed.

Alignments

The sequences of human, fugu and most zebrafish CNEs were retrieved from the CONDOR database [57]. Additional zebrafish CNEs that were not previously included in the CONDOR database due to absence from earlier assemblies were identified using BLAST against a more recent zebrafish genome assembly (Zv8 release 58). Sequences in each alignment were clipped to the same size to prevent unaligned edges. To align the sequences we used ClustalW version 1.83. These alignments formed the lamprey CNE set (comprised of alignments of CNEs from human, fugu, zebrafish and lamprey) and the gnathostome CNE set (containing alignments of CNEs from human, fugu and zebrafish). As a control, for each CNE we also generated 1000 multiple alignments by randomly shuffling the columns of each alignment using the seqboot implementation in Phylip version 3.67. The sequences of lamprey and zebrafish CNEs in these datasets are given in Additional File 8 and Additional File 9 respectively. The sequences of CNEs from the EB and cneBrowser datasets are given in Additional File 10 and Additional File 11 respectively.

Scanning CNEs and control sets for Pbx-Hox motifs

We searched for Pbx-Hox motifs in two different types of datasets. Firstly, in multiple sequence alignments of CNEs from the CONDOR CNE database (the lamprey and gnathostome CNE sets). Secondly, in datasets consisting of sequences from just one species (the EB CNEs, shark CNEs, human elements from the CONDOR CNE set, cneBrowser CNEs). The two different types of datasets required different types of control to test for Pbx-Hox motif enrichment. For the alignment sets, we generated control sets of shuffled alignments. To find evolutionarily conserved Pbx-hox motifs (TGATNNAT and TGATNNATKR) we employed the software Cis-Finder [42] on our two alignment sets and their respective shuffled alignment controls. A motif match was only considered if it matched all aligned species and occurred at the exact same aligned position. For the single species sequence sets we generated shuffled motifs, based upon the KR motif, as a control to search across the same sets. In parallel we also employed a de-novo motif finding strategy implemented in Cis-Finder on the CONDOR CNE set. It scans a set of DNA sequences for over-represented position frequency matrices (PFMs), clusters these and then estimates significance using the false discovery rate [42]. The TGATNNAT and TGATNNATKR motif occurances in the CONDOR CNE set are detailed in Additional File 12 and Additional File 13. To characterise the frequency of Pbx-Hox KR motifs in different gene loci, we used the CONDOR CNE set, in which CNEs are grouped according to gene locus as specified in the CONDOR database [57]. For each gene locus, we counted the frequency of Pbx-Hox KR motifs in the associated CNEs and compared this to the average frequency in 1000 sets of randomised versions of CNEs from that locus. A markov chain model of order zero was used to generate shuffled sequences. To model DNA sequences, 4 states (A, C, G, T) and 4 transitions were used. Transition probabilities were retrieved from the CNE set by calculating the relative frequencies of the bases.

Measuring relative enrichment of Pbx-Hox motifs

To measure the enrichment, we compared the occurrence of Pbx-Hox motifs in a test set against shuffled versions (Table 1), calculated mean and standard deviation and generated z-scores. The z-scores were then transformed into p-values under a normal distribution model. We also counted Pbx-Hox occurrence and shuffled versions in a number of control regions and across the whole human genome (Additional File 4).

Overlap with other evolutionarily conserved 'enhancer' sets

There is inevitably some overlap between the different sets of evolutionarily conserved sequences. 482/1307 EB human sequences overlap 994 CONDOR CNEs, covering a total of 146226 bases (7.4% of the EB sequence; 18.8% of CNE sequence). 1632 human sequences identified through comparison with Callorhinchus milii [13] overlap 2172 CONDOR CNEs, covering a total of 271260 bases (26.5% of the Callorhinchus dataset; 34.9% of CNEs). Finally, 69/146 zebrafish cneBrowser sequences overlap 83 CONDOR CNEs, covering a total of 11496 bases (20.5% of the cneBrowser sequence; 1.5% of CNE sequence).

Zebrafish transgenesis

CNEs were amplified from zebrafish and lamprey genomic DNA by PCR, sub-cloned into the Pcr8/GW/TOPO vector (Invitrogen) and then into a Tol2 construct (pGW_cfosEGFP) [29,58,59], using the Gateway LR Clonase II enzyme (Invitrogen). The Tol2 reporter assay was performed as described previously [29]. Transient transgenic zebrafish embryos were screened for GFP expression at 24-30hpf, 48-54hpf and 72-78hpf using a Leica M165FC microscope and photographs taken with a Leica DFC310FX camera. Expression patterns were deemed consistent when found in > 20% of founders, consistent with previous studies [25,60].

CNE Mutagenesis

Mutations in zebrafish CNE 3299 were introduced by PCR from genomic DNA with primers containing the desired mutations either through conventional PCR (for sub1) or megaprimer PCR (for sub2) [61]. Mutated CNE PCR products were then cloned for zebrafish transgenesis as described above.

Lamprey transgenesis

The transgenesis protocol was based upon that developed in Xenopus [62]. Lamprey CNEs were amplified from genomic DNA and cloned into the cFos-I-sceI-EGFP plasmid, which contains the mouse cFos minimal promoter and EGFP coding sequence flanked by I-sceI restriction sites. Plasmids were extracted using the EndoFree Plasmid Maxi Kit (Qiagen) and eluted with water through QIAQuick columns (Qiagen). Fresh restriction digests (20 μl containing 400 ng plasmid, 15 units I-SceI enzyme (NEB), 1 × I-SceI buffer + BSA, digested for 40 minutes at 37°C) were micro-injected into 5-6hpf lamprey embryos using a Pico-Spritzer with drop volume of 2-3 nl. Lamprey husbandry was performed as described previously [63]. Embryos were screened for GFP expression between embryonic days 7-16. Typical survival rates ranged from 20-50% of injected embryos. The promoter alone drives highly mosaic background expression in the ectoderm in roughly 50% of surviving embryos. Enhancer-specific expression was seen in approximately 10% of surviving embryos.

Abbreviations

CNE: conserved non-coding element; TFBS: transcription factor binding site; GRN: gene regulatory network; EB: enhancer browser (dataset); A-P: anterior-posterior.

Authors' contributions

Conceived and designed the experiments: HJP and GE. Performed the experiments: HJP. Conceived and designed bio-informatic analyses: PP, HJP and GE. Performed bio-informatic analyses: PP and GE. Analysed the data: HJP, PP and GE. Supplied materials, reagents and lamprey expertise: MB and TS-S. Wrote the paper: HJP, PP and GE. All authors read and approved the final manuscript.

Additional file 1

Genomic sequences used to generate MLAGAN alignment. The genomic sequences of human, fugu, zebrafish and lamprey used to generate the MLAGAN alignment in this study. Click here for file

Additional file 2

CNEs functionally tested for this study. The sequences of the CNEs of zebrafish (dr) and lamprey (pm) that were tested by reporter assay in zebrafish embryos for this study. Click here for file

Additional file 3

The frequency of Pbx-Hox motifs in different test sets. A table describing the frequency of Pbx-Hox motifs in different test sets, compared with 1000 randomised controls. Click here for file

Additional file 4

The frequency of KR motifs in different control sets. A table listing the frequency of KR motifs, compared to shuffled versions, in different control sets. Click here for file

Additional file 5

The frequency of Meis motifs in Human CNEs. A table listing the frequency of Meis motifs, compared to shuffled versions in human CONDOR CNEs. Click here for file

Additional file 6

The frequency of KR motifs in CNEs at different gene loci. A table listing the frequency of KR motifs in CNEs at different gene loci. Click here for file

Additional file 7

The CONDOR CNE set. 6693 non-redundant human CNE sequences, shared between human and fugu, retrieved from the CONDOR database. Click here for file

Additional file 8

Lamprey CNEs. Lamprey sequences of 246 CNEs shared between jawed vertebrates and lamprey (from the lamprey CNE alignment set). Click here for file

Additional file 9

Zebrafish CNEs. Zebrafish sequences of 4259 CNEs shared between human, fugu and zebrafish (from the gnathostome CNE alignment set). Click here for file

Additional file 10

CNEs from the VISTA Enhancer Browser. 1307 human sequences from the VISTA Enhancer Browser http://enhancer.lbl.gov/. Click here for file

Additional file 11

CNEs from the cneBrowser. 146 functionally tested zebrafish sequences from the cneBrowser http://bioinformatics.bc.edu/chuanglab/cneBrowser/#home. Click here for file

Additional file 12

TGATNNAT motif hits on the human CONDOR CNEs. Hits of the TGATNNAT motif on the human CONDOR CNEs listed by CNE with details of the start and finish of the motif within the CNE. Click here for file

Additional file 13

TGATNNATKR motif hits on the human CONDOR CNEs. Hits of the TGATNNATKR motif on the human CONDOR CNEs listed by CNE with details of the start and finish of the motif within the CNE. Click here for file
  62 in total

1.  A transposon-mediated gene trap approach identifies developmentally regulated genes in zebrafish.

Authors:  Koichi Kawakami; Hisashi Takeda; Noriko Kawakami; Makoto Kobayashi; Naoto Matsuda; Masayoshi Mishina
Journal:  Dev Cell       Date:  2004-07       Impact factor: 12.270

2.  Molecular phylogeny and divergence times of deuterostome animals.

Authors:  Jaime E Blair; S Blair Hedges
Journal:  Mol Biol Evol       Date:  2005-07-27       Impact factor: 16.240

3.  Whole-genome ChIP-chip analysis of Dorsal, Twist, and Snail suggests integration of diverse patterning processes in the Drosophila embryo.

Authors:  Julia Zeitlinger; Robert P Zinzen; Alexander Stark; Manolis Kellis; Hailan Zhang; Richard A Young; Michael Levine
Journal:  Genes Dev       Date:  2007-02-15       Impact factor: 11.361

4.  The amphioxus genome illuminates vertebrate origins and cephalochordate biology.

Authors:  Linda Z Holland; Ricard Albalat; Kaoru Azumi; Elia Benito-Gutiérrez; Matthew J Blow; Marianne Bronner-Fraser; Frederic Brunet; Thomas Butts; Simona Candiani; Larry J Dishaw; David E K Ferrier; Jordi Garcia-Fernàndez; Jeremy J Gibson-Brown; Carmela Gissi; Adam Godzik; Finn Hallböök; Dan Hirose; Kazuyoshi Hosomichi; Tetsuro Ikuta; Hidetoshi Inoko; Masanori Kasahara; Jun Kasamatsu; Takeshi Kawashima; Ayuko Kimura; Masaaki Kobayashi; Zbynek Kozmik; Kaoru Kubokawa; Vincent Laudet; Gary W Litman; Alice C McHardy; Daniel Meulemans; Masaru Nonaka; Robert P Olinski; Zeev Pancer; Len A Pennacchio; Mario Pestarino; Jonathan P Rast; Isidore Rigoutsos; Marc Robinson-Rechavi; Graeme Roch; Hidetoshi Saiga; Yasunori Sasakura; Masanobu Satake; Yutaka Satou; Michael Schubert; Nancy Sherwood; Takashi Shiina; Naohito Takatori; Javier Tello; Pavel Vopalensky; Shuichi Wada; Anlong Xu; Yuzhen Ye; Keita Yoshida; Fumiko Yoshizaki; Jr-Kai Yu; Qing Zhang; Christian M Zmasek; Pieter J de Jong; Kazutoyo Osoegawa; Nicholas H Putnam; Daniel S Rokhsar; Noriyuki Satoh; Peter W H Holland
Journal:  Genome Res       Date:  2008-06-18       Impact factor: 9.043

5.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.

Authors:  Adam Siepel; Gill Bejerano; Jakob S Pedersen; Angie S Hinrichs; Minmei Hou; Kate Rosenbloom; Hiram Clawson; John Spieth; Ladeana W Hillier; Stephen Richards; George M Weinstock; Richard K Wilson; Richard A Gibbs; W James Kent; Webb Miller; David Haussler
Journal:  Genome Res       Date:  2005-07-15       Impact factor: 9.043

6.  A structural model for a homeotic protein-extradenticle-DNA complex accounts for the choice of HOX protein in the heterodimer.

Authors:  S K Chan; R S Mann
Journal:  Proc Natl Acad Sci U S A       Date:  1996-05-28       Impact factor: 11.205

7.  High-throughput transgenesis in Xenopus using I-SceI meganuclease.

Authors:  Hajime Ogino; William B McConnell; Robert M Grainger
Journal:  Nat Protoc       Date:  2006       Impact factor: 13.491

8.  Systematic discovery of regulatory motifs in conserved regions of the human genome, including thousands of CTCF insulator sites.

Authors:  Xiaohui Xie; Tarjei S Mikkelsen; Andreas Gnirke; Kerstin Lindblad-Toh; Manolis Kellis; Eric S Lander
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-18       Impact factor: 11.205

9.  Phylogeny of the teashirt-related zinc finger (tshz) gene family and analysis of the developmental expression of tshz2 and tshz3b in the zebrafish.

Authors:  Joana S Santos; Nuno A Fonseca; Cristina P Vieira; Jorge Vieira; Fernando Casares
Journal:  Dev Dyn       Date:  2010-03       Impact factor: 3.780

10.  A global genomic transcriptional code associated with CNS-expressed genes.

Authors:  Peter J Bailey; Joanna M Klos; Elisabet Andersson; Mattias Karlén; Magdalena Källström; Jasmina Ponjavic; Jonas Muhr; Boris Lenhard; Albin Sandelin; Johan Ericson
Journal:  Exp Cell Res       Date:  2006-06-21       Impact factor: 3.905

View more
  14 in total

1.  A unique stylopod patterning mechanism by Shox2-controlled osteogenesis.

Authors:  Wenduo Ye; Yingnan Song; Zhen Huang; Marco Osterwalder; Anja Ljubojevic; Jue Xu; Brent Bobick; Samuel Abassah-Oppong; Ningsheng Ruan; Ross Shamby; Diankun Yu; Lu Zhang; Chen-Leng Cai; Axel Visel; Yanding Zhang; John Cobb; YiPing Chen
Journal:  Development       Date:  2016-06-10       Impact factor: 6.868

2.  Lampreys, the jawless vertebrates, contain only two ParaHox gene clusters.

Authors:  Huixian Zhang; Vydianathan Ravi; Boon-Hui Tay; Sumanty Tohari; Nisha E Pillai; Aravind Prasad; Qiang Lin; Sydney Brenner; Byrappa Venkatesh
Journal:  Proc Natl Acad Sci U S A       Date:  2017-08-07       Impact factor: 11.205

3.  I-SceI Meganuclease-mediated transgenesis in the acorn worm, Saccoglossus kowalevskii.

Authors:  Paul J Minor; D Nathaniel Clarke; José M Andrade López; Jens H Fritzenwanker; Jessica Gray; Christopher J Lowe
Journal:  Dev Biol       Date:  2018-11-06       Impact factor: 3.582

4.  An atlas of anterior hox gene expression in the embryonic sea lamprey head: Hox-code evolution in vertebrates.

Authors:  Hugo J Parker; Marianne E Bronner; Robb Krumlauf
Journal:  Dev Biol       Date:  2019-05-06       Impact factor: 3.582

5.  TALE factors use two distinct functional modes to control an essential zebrafish gene expression program.

Authors:  Franck Ladam; William Stanney; Ian J Donaldson; Ozge Yildiz; Nicoletta Bobola; Charles G Sagerström
Journal:  Elife       Date:  2018-06-18       Impact factor: 8.140

Review 6.  Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk.

Authors:  Siddhartha P Kar; Jonathan P Tyrer; Qiyuan Li; Kate Lawrenson; Katja K H Aben; Hoda Anton-Culver; Natalia Antonenkova; Georgia Chenevix-Trench; Helen Baker; Elisa V Bandera; Yukie T Bean; Matthias W Beckmann; Andrew Berchuck; Maria Bisogna; Line Bjørge; Natalia Bogdanova; Louise Brinton; Angela Brooks-Wilson; Ralf Butzow; Ian Campbell; Karen Carty; Jenny Chang-Claude; Yian Ann Chen; Zhihua Chen; Linda S Cook; Daniel Cramer; Julie M Cunningham; Cezary Cybulski; Agnieszka Dansonka-Mieszkowska; Joe Dennis; Ed Dicks; Jennifer A Doherty; Thilo Dörk; Andreas du Bois; Matthias Dürst; Diana Eccles; Douglas F Easton; Robert P Edwards; Arif B Ekici; Peter A Fasching; Brooke L Fridley; Yu-Tang Gao; Aleksandra Gentry-Maharaj; Graham G Giles; Rosalind Glasspool; Ellen L Goode; Marc T Goodman; Jacek Grownwald; Patricia Harrington; Philipp Harter; Alexander Hein; Florian Heitz; Michelle A T Hildebrandt; Peter Hillemanns; Estrid Hogdall; Claus K Hogdall; Satoyo Hosono; Edwin S Iversen; Anna Jakubowska; James Paul; Allan Jensen; Bu-Tian Ji; Beth Y Karlan; Susanne K Kjaer; Linda E Kelemen; Melissa Kellar; Joseph Kelley; Lambertus A Kiemeney; Camilla Krakstad; Jolanta Kupryjanczyk; Diether Lambrechts; Sandrina Lambrechts; Nhu D Le; Alice W Lee; Shashi Lele; Arto Leminen; Jenny Lester; Douglas A Levine; Dong Liang; Jolanta Lissowska; Karen Lu; Jan Lubinski; Lene Lundvall; Leon Massuger; Keitaro Matsuo; Valerie McGuire; John R McLaughlin; Iain A McNeish; Usha Menon; Francesmary Modugno; Kirsten B Moysich; Steven A Narod; Lotte Nedergaard; Roberta B Ness; Heli Nevanlinna; Kunle Odunsi; Sara H Olson; Irene Orlow; Sandra Orsulic; Rachel Palmieri Weber; Celeste Leigh Pearce; Tanja Pejovic; Liisa M Pelttari; Jennifer Permuth-Wey; Catherine M Phelan; Malcolm C Pike; Elizabeth M Poole; Susan J Ramus; Harvey A Risch; Barry Rosen; Mary Anne Rossing; Joseph H Rothstein; Anja Rudolph; Ingo B Runnebaum; Iwona K Rzepecka; Helga B Salvesen; Joellen M Schildkraut; Ira Schwaab; Xiao-Ou Shu; Yurii B Shvetsov; Nadeem Siddiqui; Weiva Sieh; Honglin Song; Melissa C Southey; Lara E Sucheston-Campbell; Ingvild L Tangen; Soo-Hwang Teo; Kathryn L Terry; Pamela J Thompson; Agnieszka Timorek; Ya-Yu Tsai; Shelley S Tworoger; Anne M van Altena; Els Van Nieuwenhuysen; Ignace Vergote; Robert A Vierkant; Shan Wang-Gohrke; Christine Walsh; Nicolas Wentzensen; Alice S Whittemore; Kristine G Wicklund; Lynne R Wilkens; Yin-Ling Woo; Xifeng Wu; Anna Wu; Hannah Yang; Wei Zheng; Argyrios Ziogas; Thomas A Sellers; Alvaro N A Monteiro; Matthew L Freedman; Simon A Gayther; Paul D P Pharoah
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2015-07-24       Impact factor: 4.254

7.  A Simple Predictive Enhancer Syntax for Hindbrain Patterning Is Conserved in Vertebrate Genomes.

Authors:  Joseph Grice; Boris Noyvert; Laura Doglio; Greg Elgar
Journal:  PLoS One       Date:  2015-07-01       Impact factor: 3.240

8.  A reporter assay in lamprey embryos reveals both functional conservation and elaboration of vertebrate enhancers.

Authors:  Hugo J Parker; Tatjana Sauka-Spengler; Marianne Bronner; Greg Elgar
Journal:  PLoS One       Date:  2014-01-09       Impact factor: 3.240

9.  Comparative genomic analysis of human GLI2 locus using slowly evolving fish revealed the ancestral gnathostome set of early developmental enhancers.

Authors:  Shahid Ali; Irum Arif; Ayesha Iqbal; Irfan Hussain; Muhammad Abrar; Muhammad Ramzan Khan; Neil Shubin; Amir Ali Abbasi
Journal:  Dev Dyn       Date:  2021-01-07       Impact factor: 2.842

10.  Parallel evolution of chordate cis-regulatory code for development.

Authors:  Laura Doglio; Debbie K Goode; Maria C Pelleri; Stefan Pauls; Flavia Frabetti; Sebastian M Shimeld; Tanya Vavouri; Greg Elgar
Journal:  PLoS Genet       Date:  2013-11-21       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.