| Literature DB >> 19698106 |
Xianjun Dong1, David Fredman, Boris Lenhard.
Abstract
Genomic regulatory blocks are chromosomal regions spanned by long clusters of highly conserved noncoding elements devoted to long-range regulation of developmental genes, often immobilizing other, unrelated genes into long-lasting syntenic arrangements. Synorth http://synorth.genereg.net/ is a web resource for exploring and categorizing the syntenic relationships in genomic regulatory blocks across multiple genomes, tracing their evolutionary fate after teleost whole genome duplication at the level of genomic regulatory block loci, individual genes, and their phylogenetic context.Entities:
Mesh:
Year: 2009 PMID: 19698106 PMCID: PMC2745767 DOI: 10.1186/gb-2009-10-8-r86
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Species available for comparison in Synorth and their associated species tree. (a) Phylogenetic tree based on data from [14,50,70-72]. The red dots indicate the second-round (2R) and third-round (3R) WGD events [72]. The blue dot indicates the genome compaction in the pufferfish lineage beginning 20 to 30 Myr ago [70]. The species shown in the tree are: bichir (Polypterus senegalus), zebrafish (Danio rerio), fugu (Takifugu rubripes), Tetraodon (Tetraodon nigroviridis), stickleback (Gasterosteus aculeatus), medaka (Oryzias latipes), frog (Xenopus tropicalis), chicken (Gallus gallus), mouse (Mus musculus), and human (Homo sapiens). Sources of fish images: Byrappa Venkatesh (fugu), Manfred Schartl (medaka), Wikipedia (zebrafish, tetraodon), Kraft CE et al. [73] (stickleback). (b) Reference and comparison species available in Synorth. Shaded boxes correspond to the reference genomes in Synorth. Connecting lines indicate genome pairs between which GRBs are available to check in the browser. Dashed lines indicate the genome comparison to be offered in the near future. The following genome assemblies underlie the current data sets: human NCBI 36, zebrafish Zv7 (The Wellcome Trust Sanger Institute), fugu v4.0 [74], tetraodon V7 [75], stickleback v1.0 (The Broad Institute), medaka v1.0 [51].
Figure 2Synorth pipeline and three data views. (a) The Synorth data/analysis pipeline. (b) The Synorth genome locus browser: (A) navigation bar; (B) search field; (C) external link to other browsers (Ancora, UCSC Genome Browser, Ensembl and Vista) for the same region; (D) GRB genomic coordinates - the coordinates are the union of all synteny regions overlapping with the query gene - if no overlapping synteny region is found, no region will be shown; (E) reference species track; (F) track(s) of compared species - in each track, the species name and the sub-tracks for all synteny regions are shown; (G) browser legend; (H) chromosome color key for the compared species. For more details, see the description in the Help page of the Synorth website [20]. (c) Synorth table browser (d) Synorth tree browser.
Ortholog gene counts in Synorth
| One2one | 7,790 | 8,429 | 7,784 | 8,903 | 8,639 |
| One2many | 6,144 | 6,285 | 7,718 | 5,868 | 5,307 |
| Many2many | 2,711 | 1,410 | 1,541 | 1,592 | 1,311 |
| Apparent_one2one (out-paralogs) | 197 | 192 | 181 | 175 | 238 |
| Total | 16,842 | 16,316 | 17,224 | 16,538 | 15,495 |
| 26,695 | 20,529 | 20,460 | 23,070 | 21,269 | |
| 16,645 | 16,124 | 17,043 | 16,363 | 15,257 | |
| 20,036 | 18,427 | 18,094 | 19,449 | 18,236 | |
Counts for the different ortholog classes in the Ensembl ortholog set, the Synorth exon predictions, and out-paralogs that can be excluded via options on the Synorth preference page (Additional data file 2). Option 1 represents the inclusion of additional ortholog predictions from our exonAlign pipeline. Option 2 represents the exclusion of out-paralogs. The exonAlign ortholog prediction pipeline and the method used to exclude out-paralogs are described in Additional data file 1. The Ensembl ortholog set was extracted from Ensembl Compara version 49.
Figure 3Scenario code illustration and statistics. (a) Definition of scenario code. (b) Descriptive statistics of scenario codes for all bystander-target pairs in 215 curated GRBs.
Figure 4Using Synorth to show GRB duplication, maintenance and breakage. (a) FOXD3 locus, (b) TBX2 locus and (c) LHX1 locus. See the text for description.
Figure 5PAX2 example. Using Synorth to trace the GRB changes between teleost fish and to detect possible errors in genome assembly or missed annotation. See detailed interpretation in the text.