| Literature DB >> 17584794 |
Katherine A Romer1, Guy-Richard Kayombya, Ernest Fraenkel.
Abstract
WebMOTIFS provides a web interface that facilitates the discovery and analysis of DNA-sequence motifs. Several studies have shown that the accuracy of motif discovery can be significantly improved by using multiple de novo motif discovery programs and using randomized control calculations to identify the most significant motifs or by using Bayesian approaches. WebMOTIFS makes it easy to apply these strategies. Using a single submission form, users can run several motif discovery programs and score, cluster and visualize the results. In addition, the Bayesian motif discovery program THEME can be used to determine the class of transcription factors that is most likely to regulate a set of sequences. Input can be provided as a list of gene or probe identifiers. Used with the default settings, WebMOTIFS accurately identifies biologically relevant motifs from diverse data in several species. WebMOTIFS is freely available at http://fraenkel.mit.edu/webmotifs.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17584794 PMCID: PMC1933171 DOI: 10.1093/nar/gkm376
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of the WebMOTIFS analysis package. The user provides a set of gene or probe identifiers that WebMOTIFS converts to sequences. In the default mode, the sequences are analyzed by four motif discovery programs. The outputs of each program are tested for statistical significance and clustered to reveal a small set of likely motifs. In the advanced mode, the Bayesian motif discovery program THEME is also used to find motifs consistent with particular DNA-binding domain families. THEME includes its own principled scoring algorithms, eliminating the need for post-processing. The motifs discovered using ChIP-chip data for Fkh2 in high-H2O2 conditions are shown. The motifs discovered using the default settings match the known specificity of Fkh2 and of the interacting protein Mcm1. THEME also finds the Fkh2 and Mcm1 motifs, and in both cases correctly identifies the DNA-binding domain family. Note that many motif discovery algorithms are non-deterministic. Therefore, results may very among repeated runs of WebMOTIFS using the same input.
Figure 2.Sample output from WebMOTIFS applied to sequences from human ChIP-chip experiments. The expected motif for the Hnf4α protein is discovered.