The value of prior knowledge in discovering motifs with MEME.

T L Bailey, C Elkan.

Abstract

MEME is a tool for discovering motifs in sets of protein or DNA sequences. This paper describes several extensions to MEME which increase its ability to find motifs in a totally unsupervised fashion, but which also allow it to benefit when prior knowledge is available. When no background knowledge is asserted, MEME obtains increased robustness from a method for determining motif widths automatically, and from probabilistic models that allow motifs to be absent in some input sequences. On the other hand, MEME can exploit prior knowledge about a motif being present in all input sequences, about the length of a motif and whether it is a palindrome, and (using Dirichlet mixtures) about expected patterns in individual motif positions. Extensive experiments are reported which support the claim that MEME benefits from, but does not require, background knowledge. The experiments use seven previously studied DNA and protein sequence families and 75 of the protein families documented in the Prosite database of sites and patterns, Release 11.1.
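
As an illustration of how prior knowledge that a DNA motif is a palindrome might be used, the sketch below averages each column of a position frequency matrix with the reverse complement of its mirror column, so the result reads the same on both strands. This is a minimal sketch under stated assumptions, not MEME's actual code: the function name `palindromize`, the A, C, G, T row order, and the averaging rule are chosen here for illustration only.

```python
# Hypothetical illustration; names and data layout are assumptions,
# not taken from the MEME source code.

# Bases are indexed in the order A, C, G, T, so the complement of
# index b is 3 - b (A<->T, C<->G).
COMPLEMENT_INDEX = [3, 2, 1, 0]


def palindromize(freqs):
    """Return a palindromic copy of a DNA position frequency matrix.

    freqs is a list of motif columns; each column is a list of four
    frequencies in the order [A, C, G, T]. Column i is averaged with
    the complemented column at the mirrored position (width - 1 - i).
    """
    width = len(freqs)
    result = []
    for i in range(width):
        mirror = freqs[width - 1 - i]
        column = [
            (freqs[i][b] + mirror[COMPLEMENT_INDEX[b]]) / 2.0
            for b in range(4)
        ]
        result.append(column)
    return result


# A width-2 motif that strongly prefers "AC" on the forward strand.
motif = [
    [1.0, 0.0, 0.0, 0.0],  # position 0: always A
    [0.0, 1.0, 0.0, 0.0],  # position 1: always C
]
constrained = palindromize(motif)
```

Because each output column is the mean of two columns that each sum to one, the constrained columns still sum to one, and the constraint roughly halves the number of free parameters in the motif model.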

Year:  1995        PMID: 7584439

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833

