| Literature DB >> 27192554 |
Bernard Fongang1, Andrzej Kudlicki1.
Abstract
Metameric segmentation of the vertebrate body is established during somitogenesis, when a cyclic spatial pattern of gene expression is created within the mesoderm of the developing embryo. The process involves transcriptional regulation of genes associated with the Wnt, Notch, and Fgf signaling pathways, each gene is expressed at a specific time during the somite cycle. Comparative genomics, including analysis of expression timelines may reveal the underlying regulatory modules and their causal relations, explaining the nature and origin of the segmentation mechanism. Using a deconvolution approach, we computationally reconstruct and compare the precise timelines of expression during somitogenesis in chicken and zebrafish. The result constitutes a resource that may be used for inferring possible causal relations between genes and subsequent pathways. While the sets of regulated genes and expression profiles vary between different species, notable similarities exist between the temporal organization of the pathways involved in the somite clock in chick and mouse, with certain aspects (as the phase of expression of Notch genes) conserved also in the zebrafish. The regulated genes have sequence motifs that are conserved in mouse and chicken but not zebrafish. Promoter sequence analysis suggests involvement of several transcription factors that may bind these regulatory elements, including E2F, EGR and PLAG, as well as a possible role of G-quadruplex DNA structure in regulation of the cyclic genes. Our research lays the groundwork for further studies that will probe the evolution of the regulatory mechanism of segmentation across all vertebrates.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27192554 PMCID: PMC4871587 DOI: 10.1371/journal.pone.0155802
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Top 20 genes with one peak of expression during chicken somitogenesis.
Times in minutes, assuming a 90mn periodicity, are provided for each peak. Accuracy has been estimated using a Monte Carlo method. All experimentally validated cyclic genes were detected using our algorithm (see [15]).
| Probe set ID | Gene | Time(min) | LS p-value |
|---|---|---|---|
| Gga.3180.1.S2_a_at | 78±5 | 0.00345 | |
| Gga.3754.2.S1_at | 72±4 | 0.00442 | |
| Gga.11242.1.S1_at | 78±6 | 0.00477 | |
| GgaAffx.23401.4.S1_s_at | 87±6 | 0.00531 | |
| Gga.14954.1.S1_at | 75±8 | 0.00615 | |
| GgaAffx.7254.1.S1_at | 77±4 | 0.00704 | |
| GgaAffx.22378.1.S1_s_at | 75±6 | 0.00712 | |
| Gga.3772.1.S1_a_at | 49±2 | 0.00737 | |
| Gga.6311.1.S1_at | 70±3 | 0.02147 | |
| Gga.329.1.S1_at | 40±3 | 0.0198 | |
| Gga.13220.1.S1_at | 9±3 | 0.02493 | |
| Gga.2701.1.S2_at | 76±7 | 0.02595 | |
| Gga.12366.1.S1_at | 75±5 | 0.02417 | |
| GgaAffx.23741.1.S1_at | 81±6 | 0.0218 | |
| GgaAffx.25111.1.S1_s_at | 78±1 | 0.02073 | |
| Gga.4283.1.S1_at | 51±3 | 0.02377 | |
| Gga.892.1.S1_at | 67±1 | 0.03083 | |
| Gga.8363.1.S2_at | 21±8 | 0.03148 | |
| Gga.8082.1.A1_at | 22±7 | 0.03127 | |
| Gga.2170.1.S1_at | 51±8 | 0.04218 |
Fig 1Phase conservation between mouse and chicken.
Axin2, Wnt-related cyclic gene oscillates opposite phase to Hes1 (Notch-related gene) suggesting a phase conservation between mouse and chicken. Raf1, an Fgf-related gene previously shown to have two peaks of expression during mouse somitogenesis showed the same pattern of expression in chicken. Phase of gene expression varies from 0 to 1 and corresponds to 120 and 90 minutes periodicity for mouse and chicken respectively.
Top 20 genes with one peak of expression during zebrafish somitogenesis.
Times in minutes, assuming a 30mn periodicity, are provided for each peak. Accuracy has been estimated using a Monte Carlo method. All experimentally validated cyclic genes were detected using our algorithm (see [15]).
| Probe set ID | Gene | Time(min) | LS p-value |
|---|---|---|---|
| Dr.8086.1.S1_s_at | 17±1 | 0.0009 | |
| Dr.5372.1.S1_x_at | 25±1 | 0.0014 | |
| Dr.3696.1.S1_at | 24±1 | 0.0023 | |
| Dr.24815.2.S1_at | 14±4 | 0.0024 | |
| Dr.11157.1.S1_at | 25±2 | 0.0039 | |
| Dr.1462.1.S1_at | 16±1 | 0.0044 | |
| Dr.5759.1.A1_at | 6±2 | 0.0053 | |
| Dr.1899.1.S1_at | 24±1 | 0.0058 | |
| Dr.4733.1.A1_at | 2±1 | 0.0063 | |
| Dr.7852.1.S1_at | 28±4 | 0.0072 | |
| Dr.8835.2.S1_at | 13±4 | 0.008 | |
| Dr.1410.1.S1_at | 29±5 | 0.0083 | |
| Dr.4295.1.S1_at | 10±3 | 0.0087 | |
| Dr.17281.1.A1_at | 18±4 | 0.0097 | |
| Dr.1460.1.S1_at | 27±2 | 0.0106 | |
| Dr.12851.1.S1_at | 24±9 | 0.0109 | |
| Dr.14998.2.S1_at | 28±6 | 0.0111 | |
| Dr.14781.1.S1_at | 14±3 | 0.012 | |
| Dr.17394.1.A1_at | 18±7 | 0.0129 | |
| Dr.13969.1.A1_at | 14±3 | 0.0129 |
Fig 2Hes5 cyclic expression is conserved between mouse, chicken and zebrafish.
The expression pattern of Hes5 is shown as periodic during mouse, chicken and zebrafish somitogenesis. Moreover, the oscillations are in phase in all three species as depicted by the expression profile. Phase of gene expression varies from 0 to 1 and corresponds to 120, 90, and 30 minutes periodicity of mouse, chicken, and zebrafish respectively.
Fig 3The timing of the significant genes involved in the chicken and zebrafish somitogenesis.
Position of a gene symbol on the plots reflects time of peak timing (angle; clockwise) and the mean expression level (genes with high expression level are closer to the center). The timing is based on 90 and 30 minutes periodicity for the chicken and zebrafish somite cycle respectively. Genes are color-coded according to their known pathway association with green for Notch, brown for Fgf, and purple for Wnt. Genes in blue have never been reported as cyclic but are regulated during chicken and zebrafish somitogenesis. Also, a comparison with Fig 3 of [15] suggests that notable similarities exist between the temporal organization of the pathways involved in the somite clock in chick and mouse, with certain aspects (as the phase of expression of Notch genes) conserved also in the zebrafish.
Fig 4Gene Ontology analysis of the candidate cyclic genes with one peak of expression during mouse, chicken, and zebrafish somitogenesis.
Shown are the top three GO terms for each species. The analysis was done using DAVID, an online set of tools for functional analysis of co-regulated genes. Only GO terms containing at least 5 of the input genes and a q-value (Benjamini corrected p-value) < 0.05 were selected.
Published cyclic genes of mouse, chicken and zebrafish somitogenesis used for motifs enrichment.
We considered cyclic genes in the same pathway as co-regulated and used their sequences for cis-regulatory elements finding. Moreover, as somitogenesis is better described in mouse that any other species, mouse sequences are used for de novo motif finding and the subsequent motifs were used for enrichment analysis in chicken and zebrafish.
| Mouse | Chicken | Zebrafish | |
|---|---|---|---|
Fig 5De novo motif discovery led to 20 statistically significant motifs overrepresented in the promoter regions of mouse cyclic genes.
We found 10 motifs (named N1-N10) in the promoter region of Notch cyclic genes, 5 motifs for Wnt cyclic genes (W1-W5) and Fgf cyclic genes (F1-F5). The motifs were ruled significant after passing all steps described in the Methods section.
Fig 6Several TFs may cooperatively regulate the expression of cyclic genes during mouse somitogenesis.
A GC rich regulatory motif denoted as A1 in the text was found to be overrepresented in the promoter of nearly ¾ of reported cyclic genes during chicken somitogenesis. The average position of the motif is around 300 bps upstream of the transcription start site for mouse genes. When tested for enrichment in the database of experimentally validated TFBSs, it appears this regulatory motif may contain the binding sites of several couple of TFs acting cooperatively to regulate the expression of corresponding genes. The first and second possibilities (panels A and B) is the TF E2F1 binding upstream of EGR1 and ZNF263 respectively and the others possibilities (panels C and D) involve the TF PLAG1 binding upstream of RREB1 and EGR2.
De novo identified motifs and their conservation between pathways and species.
We tested the reference motif list for pathway conservation and between species conservation. For each motif, we computed the enrichment in known TFBSs databases and reported the corresponding significant TFs. It appears that several TFs may be acting cooperatively to regulate the expression of cyclic genes. We also noticed motif conservation between mouse and chicken as opposed to zebrafish where any of the discovered motifs could be found in the promoters of known cyclic genes. The conservation between species is given in terms of percentage of reported cyclic genes of the species with the corresponding motif in their promoter region.
| Motifs | Possible TFs | Best possible match (5’-3’) | Related GO terms (Biological Process) | Wnt- Notch- Fgf | chicken | Zebrafish |
|---|---|---|---|---|---|---|
| (E2F1/PLAG1)—(EGR1/ RREB1); E2F1—ZNF263; PLAG1—EGR2 | Multicellular organismal development; Developmental process; System development | W2-N1-F1 | Yes (74%) | No | ||
| - | Cellular developmental process; Anatomical structure development; Multicellular organismal development | W1-N8-** | No | No | ||
| SP2, PREB1, ZNF263, EGR1, E2F3, EGR2 | Anatomical structure morphogenesis; Chordate embryonic development; Organ morphogenesis | W3-N2-F4 | Yes (63%) | No | ||
| - | Primary metabolic process; Metabolic process; Cellular metabolic process | W4-N4-F3 | No | No | ||
| SP2, EGR1, PREB1, SP1, EGR2 | Developmental process; Multicellular organismal development; Anatomical structure development | W2-N3-F2 | Yes (58%) | No | ||
| - | Cellular protein metabolic process; Protein metabolic process; Cellular macromolecule metabolic process | **-N5-** | No | No | ||
| - | RNA processing; Metabolic process; Cellular metabolic process | W5-N10-F1 | No | No | ||
| (ZNFF263/E2F6)—EGR1 | Neuron projection development; Cell development; Neuron projection morphogenesis | W2-N7-F5 | Yes (50%) | No | ||
| - | Cellular metabolic process; RNA metabolic process; Primary metabolic process | W5-**-** | Yes (50%) | No | ||
| SP2, PREB1, EGR1 | Forebrain development; Central nervous system development; Brain development | W2-N3-F2 | No | No | ||
| - | Anatomical structure development; Negative regulation of RNA metabolic process; Cellular macromolecule metabolic process | **-N6-** | No | No | ||
| Tcf3, NHLH1, Ets1 | System development; Negative regulation of transcription from RNA polymerase II promoter; Negative regulation of transcription, DNA-dependent | **-N9-** | No | No |
Distribution of G-quadruplex structures in the promoter regions of mouse, chicken, zebrafish, and human genes.
We computed the distribution of G4 in the promoter (2 kbps upstream of the TSS) of mouse, chicken, zebrafish, and human genes and compared to that of corresponding cyclic genes. As cyclic genes in human are not well defined, we used homologs of mouse cyclic genes for comparison purposes. G4 is significantly represented in the promoter of mouse cyclic genes. Indeed 85% of known mouse cyclic genes contain the G4 structures, suggesting a possible role during mouse somitogenesis. The p-value is computed using Fisher exact test.
| Species | Mouse | Chicken | Zebrafish | Human | ||||
|---|---|---|---|---|---|---|---|---|
| Genome | Cyclic | Genome | Cyclic | Genome | Cyclic | Genome | Homologs of mouse cyclic genes | |
| 15293 | 39 | 13487 | 19 | 10347 | 9 | 25064 | 39 | |
| 6761 | 33 | 6317 | 11 | 493 | 1 | 10999 | 29 | |
| 44% | 85% | 47% | 58% | 5% | 11% | 44% | 74% | |
| Yes | No | No | Yes | |||||
| 0.006966 | - | - | 0.034693 | |||||
Motif enrichment in the lists of candidate cyclic genes from mouse, chicken, and zebrafish datasets.
We have compiled the lists of candidate cyclic genes with one and two peak of expression during mouse, chicken, and zebrafish somitogenesis and tested them against the reference of regulatory motifs. For each species, we reported the number of genes containing the corresponding motif in their promoter as well as the genes with the highest E-values. These genes represent leading cyclic candidate genes. As expected, none of the motifs discovered in mouse was found to be overrepresented in the promoter of zebrafish candidate cyclic genes.
| Motifs | Mouse unimodal [ | Mouse bimodal [ | Chicken unimodal ( | Chicken bimodal ( | Zebrafish unimodal ( | Zebrafish bimodal ( |
|---|---|---|---|---|---|---|
| (87 genes): Cmtm6, Ptpn11, Fgfrl1, Pole, Ccnd1, Skil, Llf3 | (52 genes): Fgf13, Wsb2, Reep5 Pik3ca, Arfip2, Ndfip1 | (90 genes): PSMB7, PCSK5, PTRH2, MAP2K6 | (50 genes): WBSCR22, SAMD15 | 0 | 0 | |
| (52 genes): Start3, Psmd7, Timm44, Lrig3, Mark1, Ephb4 | (41 genes): Ddah2, Cdkn1a, Rnpep, Scand1, Bola2, Vps37a | (5 genes): FBLN2, MNAT1, CDC42BPA | (1 gene): ESD | 0 | 0 | |
| (60 genes): Cmtm6, Safb2, Fgfrl1, Ankhd1, Msi2, Ccnd1 | (31 genes): Reep5, Rcor1, Creb3, Ube3b, Tcap, Isca2 | (38 genes): MAP2K6, EPC2, URM1, OLA1 | (1 gene): RNF126 | 0 | 0 | |
| (85 genes): Ankhd1, Fam13c, Akap1, Mtm1, Nova1, Ogt, Thra | (72 genes): Oxa1l, Pik3ca, Rnpep, Csnk2a1, Antxr1, Surf4 | (10 genes): SHPK, MAD2L1, OLA1, WBP4 | (7 genes): RARB, FADD | 0 | 0 | |
| (43 genes): Safb2, Cmtm6, Ccnd1, Fgfrl1, Qser1, Akirin1 | (35 genes): Fgf13, Vps37a, Zfp219, Tcap, Kap, Gnb1 | (54 genes): PSMB7, MAP2K6, CEP76, XPO7 | 0 | 0 | 0 | |
| (62 genes): Tnpo3, Ube2l3, Urm1, Efemp2, Slc16a11, Trub2 | (45 genes): Ndfip1, Grn, Reep5, Rad17, Tor1b, Cdkn1a | 0 | 0 | 0 | 0 | |
| (61 genes): Rps9, Rbm14, Paf1, Tnpo3, Ddx46, Slc16a11, Zfp7 | (46 genes): Phf20, Shmt1, Rab22a, Rad17, Tor1b, Vcp | 0 | 0 | 0 | 0 | |
| (55 genes): Coro2b, Hsd17b7, Dnajc3, Cmtm6, Ugp2, Ephb4 | (41 genes): Spred2, 9030624J02Rik, Tcap, Copg2, Dgat2 | (35 genes): FAR1, MRPL32, PTRH2, CTNND1 | (2 genes): MEF2BNB, CCNK | 0 | 0 | |
| (46 genes): Wdr33, Akap1, Timm44, Tnpo3, Ddx46, Trub2 | (39 genes): Tor1b, Zc3h11a, Med1, Gnpda2, Ddah2, Suz12 | (3 genes): SEC14L1, PEX16, PIGW | 0 | 0 | 0 | |
| (41 genes): Wdr33, Fgfrl1, Ercc8, Safb2, Hdlbp, Thra, Exosc4, Coro2b | (28 genes): Add3, Tcap, Reep5, Creb3, Kap | 0 | 0 | 0 | 0 |
Parameters describing the wave deceleration and the geometry of the system for collected data in mouse, chicken and zebrafish somitogenesis.
The parameters α and d are computed assuming that the cycle phase φ is random at the time when the embryos are sacrificed [15].
| Mouse [ | Chick | Zebrafish | |
|---|---|---|---|
| 0.022 | 0.110 | 0.140 | |
| 1.900 | 4.040 | 1.850 |
Fig 7Relationship between the phase of gene expression and position along the PSM of the chicken, mouse and zebrafish embryos.
The waves start as moving fast for small values of x (most posterior part of the PSM), and slow down as the wave progresses toward larger values of x. The four snapshots depict a sequence of time-points, corresponding to different phases of the somite cycle.