
More for less in structural genomics.

A Heger, L Holm.

Abstract

Structural genomics is the idea of covering protein space so that every protein sequence comes within model-building distance of a protein of known structure. Unfortunately, reproducing the structural alignment of distantly related proteins remains a difficult challenge for existing sequence alignment and motif search software. We have developed a new transitive alignment algorithm (MaxFlow), which generates accurate alignments between proteins deep in the twilight zone of sequence similarity, below 20% sequence identity. In particular, MaxFlow reliably identifies conserved core motifs between proteins that are only indirect PSI-Blast neighbours. Based on MaxFlow alignments, useful 3D models can be generated for all members of a superfamily from as few as a single structural template, despite hundreds of representatives at the 40% sequence identity level and patchy detection of homology by PSI-Blast. We propose novel strategies for target prioritization that use MaxFlow scores to predict the optimal templates in a superfamily. Our results support an increase in the granularity of covering protein space, which has potentially enormous economic implications for planning the transition to the full production phase of structural genomics.

Year: 2003    PMID: 14649289    DOI: 10.1023/a:1026145703834

Source DB: PubMed    Journal: J Struct Funct Genomics    ISSN: 1345-711X


