Literature DB >> 14691221

Detection of homologous proteins by an intermediate sequence search.

Bino John1, Andrej Sali.   

Abstract

We developed a variant of the intermediate sequence search method (ISS(new)) for detection and alignment of weakly similar pairs of protein sequences. ISS(new) relates two query sequences by an intermediate sequence that is potentially homologous to both queries. The improvement was achieved by a more robust overlap score for a match between the queries through an intermediate. The approach was benchmarked on a data set of 2369 sequences of known structure with insignificant sequence similarity to each other (BLAST E-value larger than 0.001); 2050 of these sequences had a related structure in the set. ISS(new) performed significantly better than both PSI-BLAST and a previously described intermediate sequence search method. PSI-BLAST could not detect correct homologs for 1619 of the 2369 sequences. In contrast, ISS(new) assigned a correct homolog as the top hit for 121 of these 1619 sequences, while incorrectly assigning homologs for only nine targets; it did not assign homologs for the remainder of the sequences. By estimate, ISS(new) may be able to assign the folds of domains in approximately 29,000 of the approximately 500,000 sequences unassigned by PSI-BLAST, with 90% specificity (1 - false positives fraction). In addition, we show that the 15 alignments with the most significant BLAST E-values include the nearly best alignments constructed by ISS(new).

Mesh:

Substances:

Year:  2004        PMID: 14691221      PMCID: PMC2286512          DOI: 10.1110/ps.03335004

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  38 in total

1.  The ASTRAL compendium for protein structure and sequence analysis.

Authors:  S E Brenner; P Koehl; M Levitt
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Saturated BLAST: an automated multiple intermediate sequence search used to detect distant homology.

Authors:  W Li; F Pio; K Pawłowski; A Godzik
Journal:  Bioinformatics       Date:  2000-12       Impact factor: 6.937

3.  Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL.

Authors:  S A Teichmann; C Chothia; G M Church; J Park
Journal:  Bioinformatics       Date:  2000-02       Impact factor: 6.937

4.  Combining sensitive database searches with multiple intermediates to detect distant homologues.

Authors:  A A Salamov; M Suwa; C A Orengo; M B Swindells
Journal:  Protein Eng       Date:  1999-02

5.  Optimization of a new score function for the detection of remote homologs.

Authors:  M Kann; B Qian; R A Goldstein
Journal:  Proteins       Date:  2000-12-01

6.  Comparative protein structure modeling. Introduction and practical examples with modeller.

Authors:  R Sánchez; A Sali
Journal:  Methods Mol Biol       Date:  2000

7.  Structural genomics in North America.

Authors:  T C Terwilliger
Journal:  Nat Struct Biol       Date:  2000-11

Review 8.  100,000 protein structures for the biologist.

Authors:  A Sali
Journal:  Nat Struct Biol       Date:  1998-12

Review 9.  Structural genomics: beyond the human genome project.

Authors:  S K Burley; S C Almo; J B Bonanno; M Capel; M R Chance; T Gaasterland; D Lin; A Sali; F W Studier; S Swaminathan
Journal:  Nat Genet       Date:  1999-10       Impact factor: 38.330

10.  Pairwise sequence alignment below the twilight zone.

Authors:  J D Blake; F E Cohen
Journal:  J Mol Biol       Date:  2001-03-23       Impact factor: 5.469

View more
  9 in total

1.  Structure- and sequence-based function prediction for non-homologous proteins.

Authors:  Lee Sael; Meghana Chitale; Daisuke Kihara
Journal:  J Struct Funct Genomics       Date:  2012-01-22

Review 2.  The limits of protein sequence comparison?

Authors:  William R Pearson; Michael L Sierk
Journal:  Curr Opin Struct Biol       Date:  2005-06       Impact factor: 6.809

3.  Detecting remotely related proteins by their interactions and sequence similarity.

Authors:  Jordi Espadaler; Ramón Aragüés; Narayanan Eswar; Marc A Marti-Renom; Enrique Querol; Francesc X Avilés; Andrej Sali; Baldomero Oliva
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-09       Impact factor: 11.205

4.  ESG: extended similarity group method for automated protein function prediction.

Authors:  Meghana Chitale; Troy Hawkins; Changsoon Park; Daisuke Kihara
Journal:  Bioinformatics       Date:  2009-05-12       Impact factor: 6.937

5.  Graph pyramids for protein function prediction.

Authors:  Tushar Sandhan; Youngjun Yoo; Jin Choi; Sun Kim
Journal:  BMC Med Genomics       Date:  2015-05-29       Impact factor: 3.063

Review 6.  Template-based protein structure modeling.

Authors:  Andras Fiser
Journal:  Methods Mol Biol       Date:  2010

7.  Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

Authors:  Gayatri Kumar; Narayanaswamy Srinivasan; Sankaran Sandhya
Journal:  Methods Mol Biol       Date:  2022

8.  Functional classification of immune regulatory proteins.

Authors:  Rotem Rubinstein; Udupi A Ramagopal; Stanley G Nathenson; Steven C Almo; Andras Fiser
Journal:  Structure       Date:  2013-04-11       Impact factor: 5.006

9.  Comparative modelling of protein structure and its impact on microbial cell factories.

Authors:  Nuria B Centeno; Joan Planas-Iglesias; Baldomero Oliva
Journal:  Microb Cell Fact       Date:  2005-06-30       Impact factor: 5.328

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.