| Literature DB >> 23180783 |
Mikhail Pachkov1, Piotr J Balwierz, Phil Arnold, Evgeniy Ozonov, Erik van Nimwegen.
Abstract
Identification of genomic regulatory elements is essential for understanding the dynamics of cellular processes. This task has been substantially facilitated by the availability of genome sequences for many species and high-throughput data of transcripts and transcription factor (TF) binding. However, rigorous computational methods are necessary to derive accurate genome-wide annotations of regulatory sites from such data. SwissRegulon (http://swissregulon.unibas.ch) is a database containing genome-wide annotations of regulatory motifs, promoters and TF binding sites (TFBSs) in promoter regions across model organisms. Its binding site predictions were obtained with rigorous Bayesian probabilistic methods that operate on orthologous regions from related genomes, and use explicit evolutionary models to assess the evidence of purifying selection on each site. New in the current version of SwissRegulon is a curated collection of 190 mammalian regulatory motifs associated with ∼340 TFs, and TFBS annotations across a curated set of ∼35 000 promoters in both human and mouse. Predictions of TFBSs for Saccharomyces cerevisiae have also been significantly extended and now cover 158 of yeast's ∼180 TFs. All data are accessible through both an easily navigable genome browser with search functions, and as flat files that can be downloaded for further analysis.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23180783 PMCID: PMC3531101 DOI: 10.1093/nar/gks1145
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Information provided for each predicted site. Left panel: listed are the name of the regulatory motif (NHLH1,2), the identification number of the site (NHLH1,2-1252160), the source of the prediction (MotEvo), the length of the site (12 bp), its posterior probability (0.971), the sequence of the site and the orthologous sites from other organisms aligned to it. In this case, orthologous sites of this site occur in mouse (mm9), dog (canFam2), cow (bosTau6) and opossum (monDom5). A sequence logo of the WM is also shown. Right panel: position-dependent TFBS density for this motif. The figure shows the probability to find a TFBS for the TF NHLH motif as a function of position relative to TSS in both high-CpG (green) and low-CpG (red) promoters. Listed are also the TFs associated with the motif (NHLH1 and NHLH2), the promoters that are putatively driven by this site and finally the sequence of the site.
Figure 2.The main browser panel showing a region around the promoter of the PARD3B gene in human. In this example, five tracks are shown: transcripts, promoters, TSC, TSS and TFBSs. The figure demonstrates selection of a region with the mouse and the associated drop-down menu with options. It also illustrates the pop-up windows with information about the promoter and one of the TFBSs, which will appear when placing the mouse pointer on these features.