
IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices.

A A Schäffer, Y I Wolf, C P Ponting, E V Koonin, L Aravind, S F Altschul.

Abstract

MOTIVATION: Many studies have shown that database searches using position-specific score matrices (PSSMs) or profiles as queries are more effective at identifying distant protein relationships than are searches that use simple sequences as queries. One popular program for constructing a PSSM and comparing it with a database of sequences is Position-Specific Iterated BLAST (PSI-BLAST).
RESULTS: This paper describes a new software package, IMPALA, designed for the complementary procedure of comparing a single query sequence with a database of PSI-BLAST-generated PSSMs. We illustrate the use of IMPALA to search a database of PSSMs for protein folds, and one for protein domains involved in signal transduction. IMPALA's sensitivity to distant biological relationships is very similar to that of PSI-BLAST. However, IMPALA employs a more refined analysis of statistical significance and, unlike PSI-BLAST, guarantees the output of the optimal local alignment by using the rigorous Smith-Waterman algorithm. Also, it is considerably faster when run with a large database of PSSMs than is BLAST or PSI-BLAST when run against the complete non-redundant protein database.
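As an illustration of the sequence-versus-PSSM comparison described above, the following is a minimal sketch of Smith-Waterman local alignment of a query sequence against a single PSSM. It is not IMPALA's implementation: the function name `smith_waterman_pssm`, the linear gap penalty `gap`, the default score of -1 for residues missing from a column, and the toy matrix are all assumptions made for the example, and the statistical significance analysis mentioned in the abstract is omitted entirely.

```python
from typing import Dict, List


def smith_waterman_pssm(query: str, pssm: List[Dict[str, int]], gap: int = 4) -> int:
    """Return the best local alignment score of `query` against `pssm`.

    `pssm` is a list of columns; each column maps a residue to its score at
    that position. A linear gap penalty is assumed here for brevity.
    """
    m = len(query)
    prev = [0] * (m + 1)  # dynamic-programming row for the previous PSSM position
    best = 0
    for column in pssm:
        cur = [0] * (m + 1)
        for j in range(1, m + 1):
            # Residues absent from the column get a hypothetical default of -1.
            match = prev[j - 1] + column.get(query[j - 1], -1)
            # Local alignment: a cell never drops below zero.
            cur[j] = max(0, match, prev[j] - gap, cur[j - 1] - gap)
            best = max(best, cur[j])
        prev = cur
    return best


# Toy three-position PSSM (hypothetical scores, not derived from PSI-BLAST).
toy_pssm = [
    {"A": 5, "C": -2},
    {"C": 6, "A": -3},
    {"D": 4},
]

print(smith_waterman_pssm("GGACDGG", toy_pssm))  # 15
```

Because the full dynamic-programming table is evaluated, the returned score is the optimum under this scoring scheme, which is the property the abstract attributes to the rigorous Smith-Waterman approach. Searching a database of PSSMs would amount to repeating this comparison for each matrix and ranking the results.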

Year: 1999   PMID: 10745990   DOI: 10.1093/bioinformatics/15.12.1000

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  104 in total

1.  A rapid classification protocol for the CATH Domain Database to support structural genomics.

Authors:  F M Pearl; N Martin; J E Bray; D W Buchan; A P Harrison; D Lee; G A Reeves; A J Shepherd; I Sillitoe; A E Todd; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  The estimation of statistical parameters for local alignment score distributions.

Authors:  S F Altschul; R Bundschuh; R Olsen; T Hwa
Journal:  Nucleic Acids Res       Date:  2001-01-15       Impact factor: 16.971

3.  The CATH extended protein-family database: providing structural annotations for genome sequences.

Authors:  Frances M G Pearl; David Lee; James E Bray; Daniel W A Buchan; Adrian J Shepherd; Christine A Orengo
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

Review 4.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

5.  Evolutionary relationships among G protein-coupled receptors using a clustered database approach.

Authors:  R C Graul; W Sadée
Journal:  AAPS PharmSci       Date:  2001

6.  MODBASE, a database of annotated comparative protein structure models.

Authors:  Ursula Pieper; Narayanan Eswar; Ashley C Stuart; Valentin A Ilyin; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes.

Authors:  Shashi B Pandit; Dilip Gosar; S Abhiman; S Sujatha; Sayali S Dixit; Natasha S Mhatre; R Sowdhamini; N Srinivasan
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

8.  Comparative genomics and evolution of proteins involved in RNA metabolism.

Authors:  Vivek Anantharaman; Eugene V Koonin; L Aravind
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

Review 9.  Structural genomics: a pipeline for providing structures for the biologist.

Authors:  Mark R Chance; Anne R Bresnick; Stephen K Burley; Jian-Sheng Jiang; Christopher D Lima; Andrej Sali; Steven C Almo; Jeffrey B Bonanno; John A Buglino; Simon Boulton; Hua Chen; Narayanan Eswar; Guoshun He; Raymond Huang; Valentin Ilyin; Linda McMahan; Ursula Pieper; Soumya Ray; Marc Vidal; Li Kai Wang
Journal:  Protein Sci       Date:  2002-04       Impact factor: 6.725

10.  Gene3D: structural assignment for whole genes and genomes using the CATH domain structure database.

Authors:  Daniel W A Buchan; Adrian J Shepherd; David Lee; Frances M G Pearl; Stuart C G Rison; Janet M Thornton; Christine A Orengo
Journal:  Genome Res       Date:  2002-03       Impact factor: 9.043
