Literature DB >> 15252200

MotifPrototyper: a Bayesian profile model for motif families.

Eric P Xing1, Richard M Karp.   

Abstract

In this article, we address the problem of modeling generic features of structurally but not textually related DNA motifs, that is, motifs whose consensus sequences are entirely different but nevertheless share "metasequence features" reflecting similarities in the DNA-binding domains of their associated protein recognizers. We present MotifPrototyper, a profile Bayesian model that can capture structural properties typical of particular families of motifs. Each family corresponds to transcription regulatory proteins with similar types of structural signatures in their DNA-binding domains. We show how to train MotifPrototypers from biologically identified motifs categorized according to the TRANSFAC categorization of transcription factors and present empirical results of motif classification, motif parameter estimation, and de novo motif detection by using the learned profile models.

Mesh:

Substances:

Year:  2004        PMID: 15252200      PMCID: PMC489970          DOI: 10.1073/pnas.0403564101

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  7 in total

1.  TRANSFAC: an integrated system for gene expression regulation.

Authors:  E Wingender; X Chen; R Hehl; H Karas; I Liebich; V Matys; T Meinhardt; M Prüss; I Reuter; F Schacherer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Classifying G-protein coupled receptors with support vector machines.

Authors:  Rachel Karchin; Kevin Karplus; David Haussler
Journal:  Bioinformatics       Date:  2002-01       Impact factor: 6.937

3.  Logos: a modular bayesian model for de novo motif detection.

Authors:  Eric P Xing; Wei Wu; Michael I Jordan; Richard M Karp
Journal:  J Bioinform Comput Biol       Date:  2004-03       Impact factor: 1.122

4.  An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences.

Authors:  C E Lawrence; A A Reilly
Journal:  Proteins       Date:  1990

5.  Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology.

Authors:  K Sjölander; K Karplus; M Brown; R Hughey; A Krogh; I S Mian; D Haussler
Journal:  Comput Appl Biosci       Date:  1996-08

Review 6.  Specificity, free energy and information content in protein-DNA interactions.

Authors:  G D Stormo; D S Fields
Journal:  Trends Biochem Sci       Date:  1998-03       Impact factor: 13.807

7.  Hidden Markov models in computational biology. Applications to protein modeling.

Authors:  A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal:  J Mol Biol       Date:  1994-02-04       Impact factor: 5.469

  7 in total
  16 in total

1.  Connecting protein structure with predictions of regulatory sites.

Authors:  Alexandre V Morozov; Eric D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-16       Impact factor: 11.205

Review 2.  Identifying regulatory elements in eukaryotic genomes.

Authors:  Leelavati Narlikar; Ivan Ovcharenko
Journal:  Brief Funct Genomic Proteomic       Date:  2009-06-04

3.  Improved similarity scores for comparing motifs.

Authors:  Emi Tanaka; Timothy Bailey; Charles E Grant; William Stafford Noble; Uri Keich
Journal:  Bioinformatics       Date:  2011-05-04       Impact factor: 6.937

4.  Improving MEME via a two-tiered significance analysis.

Authors:  Emi Tanaka; Timothy L Bailey; Uri Keich
Journal:  Bioinformatics       Date:  2014-03-24       Impact factor: 6.937

5.  The construction and use of log-odds substitution scores for multiple sequence alignment.

Authors:  Stephen F Altschul; John C Wootton; Elena Zaslavsky; Yi-Kuo Yu
Journal:  PLoS Comput Biol       Date:  2010-07-15       Impact factor: 4.475

6.  Metamotifs--a generative model for building families of nucleotide position weight matrices.

Authors:  Matias Piipari; Thomas A Down; Tim Jp Hubbard
Journal:  BMC Bioinformatics       Date:  2010-06-25       Impact factor: 3.169

7.  DBD2BS: connecting a DNA-binding protein with its binding sites.

Authors:  Ting-Ying Chien; Chih-Kang Lin; Chih-Wei Lin; Yi-Zhong Weng; Chien-Yu Chen; Darby Tien-Hao Chang
Journal:  Nucleic Acids Res       Date:  2012-06-11       Impact factor: 16.971

8.  Predicting target DNA sequences of DNA-binding proteins based on unbound structures.

Authors:  Chien-Yu Chen; Ting-Ying Chien; Chih-Kang Lin; Chih-Wei Lin; Yi-Zhong Weng; Darby Tien-Hao Chang
Journal:  PLoS One       Date:  2012-02-01       Impact factor: 3.240

9.  Practical strategies for discovering regulatory DNA sequence motifs.

Authors:  Kenzie D MacIsaac; Ernest Fraenkel
Journal:  PLoS Comput Biol       Date:  2006-04       Impact factor: 4.475

10.  Sampling realistic protein conformations using local structural bias.

Authors:  Thomas Hamelryck; John T Kent; Anders Krogh
Journal:  PLoS Comput Biol       Date:  2006-08-21       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.