Literature DB >> 14691242

FoldMiner: structural motif discovery using an improved superposition algorithm.

Jessica Shapiro1, Douglas Brutlag.   

Abstract

We report an unsupervised structural motif discovery algorithm, FoldMiner, which is able to detect global and local motifs in a database of proteins without the need for multiple structure or sequence alignments and without relying on prior classification of proteins into families. Motifs, which are discovered from pairwise superpositions of a query structure to a database of targets, are described probabilistically in terms of the conservation of each secondary structure element's position and are used to improve detection of distant structural relationships. During each iteration of the algorithm, the motif is defined from the current set of homologs and is used both to recruit additional homologous structures and to discard false positives. FoldMiner thus achieves high specificity and sensitivity by distinguishing between homologous and nonhomologous structures by the regions of the query to which they align. We find that when two proteins of the same fold are aligned, highly conserved secondary structure elements in one protein tend to align to highly conserved elements in the second protein, suggesting that FoldMiner consistently identifies the same motif in members of a fold. Structural alignments are performed by an improved superposition algorithm, LOCK 2, which detects distant structural relationships by placing increased emphasis on the alignment of secondary structure elements. LOCK 2 obeys several properties essential in automated analysis of protein structure: It is symmetric, its alignments of secondary structure elements are transitive, its alignments of residues display a high degree of transitivity, and its scoring system is empirically found to behave as a metric.

Mesh:

Substances:

Year:  2004        PMID: 14691242      PMCID: PMC2286532          DOI: 10.1110/ps.03239404

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  64 in total

1.  Increased coverage of protein families with the blocks database servers.

Authors:  J G Henikoff; E A Greene; S Pietrokovski; S Henikoff
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations.

Authors:  S Henikoff; J G Henikoff; S Pietrokovski
Journal:  Bioinformatics       Date:  1999-06       Impact factor: 6.937

4.  Three-dimensional structure analysis of PROSITE patterns.

Authors:  A Kasuya; J M Thornton
Journal:  J Mol Biol       Date:  1999-03-12       Impact factor: 5.469

5.  An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance.

Authors:  A S Yang; B Honig
Journal:  J Mol Biol       Date:  2000-08-18       Impact factor: 5.469

6.  PROSITE: a documented database using patterns and profiles as motif descriptors.

Authors:  Christian J A Sigrist; Lorenzo Cerutti; Nicolas Hulo; Alexandre Gattiker; Laurent Falquet; Marco Pagni; Amos Bairoch; Philipp Bucher
Journal:  Brief Bioinform       Date:  2002-09       Impact factor: 11.622

7.  The structural alignment between two proteins: is there a unique answer?

Authors:  A Godzik
Journal:  Protein Sci       Date:  1996-07       Impact factor: 6.725

8.  Protein folds and functions.

Authors:  A C Martin; C A Orengo; E G Hutchinson; S Jones; M Karmirantzou; R A Laskowski; J B Mitchell; C Taroni; J M Thornton
Journal:  Structure       Date:  1998-07-15       Impact factor: 5.006

9.  Local alignment statistics.

Authors:  S F Altschul; W Gish
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

10.  Threading a database of protein cores.

Authors:  T Madej; J F Gibrat; S H Bryant
Journal:  Proteins       Date:  1995-11
View more
  10 in total

1.  FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web.

Authors:  Jessica Shapiro; Douglas Brutlag
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  A novel approach to structural alignment using realistic structural and environmental information.

Authors:  Yu Chen; Gordon M Crippen
Journal:  Protein Sci       Date:  2005-10-31       Impact factor: 6.725

3.  Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures.

Authors:  Rachel Kolodny; Patrice Koehl; Michael Levitt
Journal:  J Mol Biol       Date:  2005-01-16       Impact factor: 5.469

4.  DOM-fold: a structure with crossing loops found in DmpA, ornithine acetyltransferase, and molybdenum cofactor-binding domain.

Authors:  Hua Cheng; Nick V Grishin
Journal:  Protein Sci       Date:  2005-06-03       Impact factor: 6.725

5.  On the difference in quality between current heuristic and optimal solutions to the protein structure alignment problem.

Authors:  Mauricio Arriagada; Aleksandar Poleksic
Journal:  Biomed Res Int       Date:  2012-12-23       Impact factor: 3.411

6.  Fast and accurate protein substructure searching with simulated annealing and GPUs.

Authors:  Alex D Stivala; Peter J Stuckey; Anthony I Wirth
Journal:  BMC Bioinformatics       Date:  2010-09-03       Impact factor: 3.169

7.  ROC and confusion analysis of structure comparison methods identify the main causes of divergence from manual protein classification.

Authors:  Vichetra Sam; Chin-Hsien Tai; Jean Garnier; Jean-Francois Gibrat; Byungkook Lee; Peter J Munson
Journal:  BMC Bioinformatics       Date:  2006-04-13       Impact factor: 3.169

8.  A fast indexing approach for protein structure comparison.

Authors:  Lei Zhang; James Bailey; Arun S Konagurthu; Kotagiri Ramamohanarao
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

9.  Tableau-based protein substructure search using quadratic programming.

Authors:  Alex Stivala; Anthony Wirth; Peter J Stuckey
Journal:  BMC Bioinformatics       Date:  2009-05-19       Impact factor: 3.169

10.  Identification of structurally conserved residues of proteins in absence of structural homologs using neural network ensemble.

Authors:  Ganesan Pugalenthi; Ke Tang; P N Suganthan; Saikat Chakrabarti
Journal:  Bioinformatics       Date:  2008-11-27       Impact factor: 6.937

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.