| Literature DB >> 26519295 |
Song Yang1, Nir Oksenberg2,3, Sachiko Takayama4, Seok-Jin Heo5, Alexander Poliakov6, Nadav Ahituv7,8, Inna Dubchak9,10, Dario Boffelli11.
Abstract
BACKGROUND: To examine the contributions of sequence and function conservation in the evolution of enhancers, we systematically identified enhancers whose sequences are not conserved among distant groups of vertebrate species, but have homologous function and are likely to be derived from a common ancestral sequence. Our approach combined comparative genomics and epigenomics to identify potential enhancer sequences in the genomes of three groups of distantly related vertebrate species.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26519295 PMCID: PMC4628251 DOI: 10.1186/s12864-015-2070-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Scheme of the analysis of orthologous introns bracketed by the orthologous exons in the rodent/rabbit and fish evolutionary groups of genomes. Conserved sequences (A1 and B, and A2 and C in multiple alignments of the groups 1 and 2 respectively) are compared, and highly similar A1 and A2 removed from the analysis. Sequences B and C are selected for experimental validation
Genome assemblies used in this study
| Species | Genome assembly | |
|---|---|---|
| Rodent + Rabbit | Mus musculus (Mouse) | Jul. 2007 (mm9) |
| Rattus norvegicus (Rat) | Nov. 2004 (rn4) | |
| Cavia porcellus (Guinea pig) | Feb. 2008 (cavPor3) | |
| Oryctolagus cuniculus (Rabbit) | Apr. 2009 (oryCun2) | |
| Bird | Gallus gallus (Chicken) | May 2006 (galGal3) |
| Taeniopygia guttata (Zebra finch) | Jul. 2008 (taeGut1) | |
| Meleagris gallopavo (Turkey) | Dec. 2009 (melGal1) | |
| Fish | Danio rerio (Zebrafish) | Jul. 2010 (danRer7) |
| Takifugu rubripes (Fugu) | Oct. 2004 (fr2) | |
| Tetraodon nigroviridis (Tetraodon) | Mar. 2010 (tetNig2) | |
| Gasterosteus aculeatus (Stickleback) | Feb. 2006 (gasAcu1) | |
| Oryzias latipes (Medaka) | Oct. 2005 (oryLat2) |
Reference genomes for each of the four groups are listed first
Number of orthologous introns between pairs of species and groups of three species
| Number of annotated orthologous introns | Projected introns | Total number | |
|---|---|---|---|
| Human-Mouse | 127,176 | - | 127,176 |
| Human-Chicken | 22,815 | 59,465 | 82,280 |
| Human-Zebrafish | 27,337 | 24,999 | 52,336 |
| Mouse-Chicken-Zebrafish | 17,921 |
Rodent-bird-fish comparison
| Rodent | Bird | Fish | |
|---|---|---|---|
| Intra-group conserved regions genome-wide | 350,003 | 178,982 | 394,456 |
| Conserved noncoding regions (ECR) in orthologous introns | 6,390 | 4,079 | 5,044 |
| Intra-group intronic ECRs not conserved in other groups | 4,814 | 2,904 | 3,625 |
| ECRs located in introns with ECRs in all groups of species | 1,415 | 1,220 | 1,447 |
| overlapping with H3K4me1 but not H3K4Me3 | 728 |
| 273 |
Zebrafish and mouse predicted enhancers subjected to experimental validation
| ECR | Zebrafish (Zv9 July 2010) | Mouse (mm9) July 2007) | ||
|---|---|---|---|---|
| Expression | Coordinates | Expression | Coordinates | |
| 1 | Spinal cord (48) | chr6:20,797,116-20,798,106 | negative | chr5:107,545,775–107,546,750 |
| 2 | Epidermis (24,48) | chr6:43,538,446–43,539,094 | negative | chr6:98,881,656–98,882,625 |
| negative | chr6:98,883,956–98,884,889 | |||
|
|
|
|
| |
| 4 | Notochord (24,48) | chr19:22,713,465–22,714,026 | negative | chr18:80,876,694–80,877,675 |
| negative | chr18:80,876,778–80,877,739 | |||
|
|
|
|
| |
|
|
| chr2:39,790,777–39,791,782 |
| chr1:77,387,974–77,389,004 |
|
|
| chr2:39,792,312–39,793,241 | Eye (24,48), | chr1:77,390,034–77,391,031 |
| 8 | Somitic muscle (24,48), Olfactory epithelium (48), Hindbrain (48) | chr2:39,837,096–39,838,009 | negative | chr1:77,503,689–77,504,676 |
| negative | chr1:77,506,344–77,507,445 | |||
| 9 | Spinal cord (24,48) | chr4:16,470,975–16,471,819 | Midbrain (24), Yolk (24) | chr10:87,004,979–87,005,964 |
| Somitic Muscle (24,48), Skin under yolk (24,48), Phar. Arches (48), Pectoral fin (48) | chr10:87,008,830–87,009,792 | |||
|
|
| chr9:22,365,926–22,366,799 |
| chr14:58,234,894–58,235,874 |
|
| Hindbrain (24), | chr9:22,368,460–22,369,405 | ||
|
|
| chr14:41,645,428–41,646,386 |
| chrX:130,162,955–130,163,923 |
| 13 | Hindbrain (24), Olfactory epithelium (24,48), Line under eye (24,48), Spinal cord (48) | chr20:21,970,078–21,970,911 | Spinal cord (24) | chr12:72,999,373–73,000,279 |
Tissues in bold (in ECR 6, 7, 10, 11, 12) show consistent enhancer activity between the mouse and zebrafish syntenic enhancers at the indicated time point (24 or 48 hpf). Each construct was injected in at least 100 embryos; the proportion of fish alive and expressing in a given tissue is shown in Additional file 2: Table S1
Fig. 2Expression data in for positive zebrafish (blue) and mouse (brown) ECR constructs in transgenic zebrafish. The x-axis shows tissues with positive expression in at least one construct; the y-axis shows the number of fish expressing a construct in a given tissue (minus the number of fish with ectopic expression – see Additional file 2: Table S1 for details), reported as the proportion of fish alive at 24 hpf (light color) and 48 hpf (dark color). The identity of each construct is shown in the top right corner of each chart. The expression of the mouse ECR syntenic to zebrafish ECRs 10 and 11 is reported in both charts
Fig. 3Expression patterns driven by zebrafish and mouse syntenic ECRs with homologous activity in a zebrafish transgenic enhancer assay. ECRs are identified by the number at the bottom left of each panel, their genomic coordinates are in Table 4. Expression patterns were recorded at 24 hpf (ECR 7) or 48 hpf (ECRs 6, 10, 11, and 12) and detected by GFP expression (green). Labels indicate tissues in which homologous mouse and zebrafish sequences show consistent patterns of GFP expression (see Experimental Procedures for the definition of consistent pattern of expression). ECR 6: constructs from both species drive strong expression in several epidermal cells; the zebrafish construct also drives expression in the otic vesicle , weak staining at the periphery of the vesicle is visible with the mouse construct (see also Additional file 1: Figure S1). ECR 7: the mouse construct drives expression in the olfactory bulb, but expression by the zebrafish construct is restricted to the olfactory epithelium (the sensory component of the olfactory bulb – see also Additional file 1: Figure S1); constructs from both species drive expression in large neurons in the hindbrain of. ECR 10 and 11: one ECR is present in the mouse, and two ECRs are present in the orthologous zebrafish intron; constructs from both species drive clear expression in somitic muscle cells. ECR12: constructs from both species drive clear expression in several epidermal cells. Abbreviations – e: epidermis; ov: otic vesicle; oe: olfactory epithelium; ob: olfactory bulb; hb: hindbrain; sm: somitic muscle
Fig. 4Transcription factor binding sites detected in the five syntenic ECRs. a-e: each panel shows the five motifs with most significant E-value in the given ECR, and the sequence of the n-mer, identical in the zebrafish, mouse, and human sequence of the ECR, from which the motif was identified. The name of the transcription factor binding to a motif is shown above the motif, and the sequence of the motif is shown for the strand on which the n-mer was found. The arrangement of the motifs along the zebrafish (zf), mouse (mm), and human (hs) sequences is shown at the bottom of each panel (the scale, in base pairs, is the same for all panels and shown only for panel E). Motifs are color-coded according the legend at the bottom of the figure; motifs shown above and below the reference line are identified on the forward and reverse strands, respectively