Literature DB >> 8520485

Finding flexible patterns in unaligned protein sequences.

I Jonassen1, J F Collins, D G Higgins.   

Abstract

We present a new method for the identification of conserved patterns in a set of unaligned related protein sequences. It is able to discover patterns of a quite general form, allowing for both ambiguous positions and for variable length wildcard regions. It allows the user to define a class of patterns (e.g., the degree of ambiguity allowed and the length and number of gaps), and the method is then guaranteed to find the conserved patterns in this class scoring highest according to a significance measure defined. Identified patterns may be refined using one of two new algorithms. We present a new (nonstatistical) significance measure for flexible patterns. The method is shown to recover known motifs for PROSITE families and is also applied to some recently described families from the literature.

Mesh:

Substances:

Year:  1995        PMID: 8520485      PMCID: PMC2143188          DOI: 10.1002/pro.5560040817

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  21 in total

1.  A search for common patterns in many sequences.

Authors:  M A Roytberg
Journal:  Comput Appl Biosci       Date:  1992-02

2.  The SWISS-PROT protein sequence data bank.

Authors:  A Bairoch; B Boeckmann
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

Review 3.  SH3--an abundant protein domain in search of a function.

Authors:  A Musacchio; T Gibson; V P Lehto; M Saraste
Journal:  FEBS Lett       Date:  1992-07-27       Impact factor: 4.124

4.  Finding sequence motifs in groups of functionally related proteins.

Authors:  H O Smith; T M Annau; S Chandrasegaran
Journal:  Proc Natl Acad Sci U S A       Date:  1990-01       Impact factor: 11.205

5.  Automated assembly of protein blocks for database searching.

Authors:  S Henikoff; J G Henikoff
Journal:  Nucleic Acids Res       Date:  1991-12-11       Impact factor: 16.971

6.  Improved detection of helix-turn-helix DNA-binding motifs in protein sequences.

Authors:  I B Dodd; J B Egan
Journal:  Nucleic Acids Res       Date:  1990-09-11       Impact factor: 16.971

7.  Automatic generation of primary sequence patterns from sets of related protein sequences.

Authors:  R F Smith; T F Smith
Journal:  Proc Natl Acad Sci U S A       Date:  1990-01       Impact factor: 11.205

8.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

Review 9.  The PHD finger: implications for chromatin-mediated transcriptional regulation.

Authors:  R Aasland; T J Gibson; A F Stewart
Journal:  Trends Biochem Sci       Date:  1995-02       Impact factor: 13.807

10.  A comprehensive set of sequence analysis programs for the VAX.

Authors:  J Devereux; P Haeberli; O Smithies
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

View more
  73 in total

1.  The EMBL nucleotide sequence database.

Authors:  W Baker; A van den Broek; E Camon; P Hingamp; P Sterk; G Stoesser; M A Tuli
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The InterPro database, an integrated documentation resource for protein families, domains and functional sites.

Authors:  R Apweiler; T K Attwood; A Bairoch; A Bateman; E Birney; M Biswas; P Bucher; L Cerutti; F Corpet; M D Croning; R Durbin; L Falquet; W Fleischmann; J Gouzy; H Hermjakob; N Hulo; I Jonassen; D Kahn; A Kanapin; Y Karavidopoulou; R Lopez; B Marx; N J Mulder; T M Oinn; M Pagni; F Servant; C J Sigrist; E M Zdobnov
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  The EMBL nucleotide sequence database.

Authors:  G Stoesser; W Baker; A van den Broek; E Camon; M Garcia-Pastor; C Kanz; T Kulikova; V Lombard; R Lopez; H Parkinson; N Redaschi; P Sterk; P Stoehr; M A Tuli
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

4.  The EMBL Nucleotide Sequence Database.

Authors:  Guenter Stoesser; Wendy Baker; Alexandra van den Broek; Evelyn Camon; Maria Garcia-Pastor; Carola Kanz; Tamara Kulikova; Rasko Leinonen; Quan Lin; Vincent Lombard; Rodrigo Lopez; Nicole Redaschi; Peter Stoehr; Mary Ann Tuli; Katerina Tzouvara; Robert Vaughan
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

5.  Finding important sites in protein sequences.

Authors:  Peter J Bickel; Katherina J Kechris; Philip C Spector; Gary J Wedemayer; Alexander N Glazer
Journal:  Proc Natl Acad Sci U S A       Date:  2002-11-04       Impact factor: 11.205

6.  The EMBL Nucleotide Sequence Database: major new developments.

Authors:  Guenter Stoesser; Wendy Baker; Alexandra van den Broek; Maria Garcia-Pastor; Carola Kanz; Tamara Kulikova; Rasko Leinonen; Quan Lin; Vincent Lombard; Rodrigo Lopez; Renato Mancuso; Francesco Nardone; Peter Stoehr; Mary Ann Tuli; Katerina Tzouvara; Robert Vaughan
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

7.  GPRM: A genetic programming approach to finding common RNA secondary structure elements.

Authors:  Yuh-Jyh Hu
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

8.  Comparative homology agreement search: an effective combination of homology-search methods.

Authors:  Intikhab Alam; Andreas Dress; Marc Rehmsmeier; Georg Fuellen
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-14       Impact factor: 11.205

9.  Isolation of poly-3-hydroxybutyrate metabolism genes from complex microbial communities by phenotypic complementation of bacterial mutants.

Authors:  Chunxia Wang; David J Meek; Priya Panchal; Natalie Boruvka; Frederick S Archibald; Brian T Driscoll; Trevor C Charles
Journal:  Appl Environ Microbiol       Date:  2006-01       Impact factor: 4.792

10.  Support-vector-machine classification of linear functional motifs in proteins.

Authors:  Dariusz Plewczynski; Adrian Tkacz; Lucjan Stanisław Wyrwicz; Adam Godzik; Andrzej Kloczkowski; Leszek Rychlewski
Journal:  J Mol Model       Date:  2005-12-10       Impact factor: 1.810

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.