Literature DB >> 9322037

Enumerating and ranking discrete motifs.

C G Nevill-Manning1, K S Sethi, T D Wu, D L Brutlag.   

Abstract

Discrete motifs that discriminate functional classes of proteins are useful for classifying new sequences, capturing structural constraints, and identifying protein subclasses. Despite the fact that the space of such motifs can grow exponentially with sequence length and number, we show that in practice it usually does not, and we describe a technique that infers motifs from aligned protein sequences by exhaustively searching this space. Our method generates sequence motifs over a wide range of recall and precision, and chooses a representative motif based on a score that we derive from both statistical and information-theoretic frameworks. Finally, we show that the selected motifs perform well in practice, classifying unseen sequences with extremely high precision, and infer protein subclasses that correspond to known biochemical classes.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9322037

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  6 in total

1.  The EMOTIF database.

Authors:  J Y Huang; D L Brutlag
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Predicting deleterious amino acid substitutions.

Authors:  P C Ng; S Henikoff
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

3.  3MATRIX and 3MOTIF: a protein structure visualization system for conserved sequence motifs.

Authors:  Steven P Bennett; Lin Lu; Douglas L Brutlag
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  Highly specific protein sequence motifs for genome analysis.

Authors:  C G Nevill-Manning; T D Wu; D L Brutlag
Journal:  Proc Natl Acad Sci U S A       Date:  1998-05-26       Impact factor: 11.205

5.  eBLOCKs: enumerating conserved protein blocks to achieve maximal sensitivity and specificity.

Authors:  Qiaojuan Jane Su; Lin Lu; Serge Saxonov; Douglas L Brutlag
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

6.  Evaluating deterministic motif significance measures in protein databases.

Authors:  Pedro Gabriel Ferreira; Paulo J Azevedo
Journal:  Algorithms Mol Biol       Date:  2007-12-24       Impact factor: 1.405

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.