Literature DB >> 10222208

Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches.

L Aravind1, E V Koonin.   

Abstract

Using a number of diverse protein families as test cases, we investigate the ability of the recently developed iterative sequence database search method, PSI-BLAST, to identify subtle relationships between proteins that originally have been deemed detectable only at the level of structure-structure comparison. We show that PSI-BLAST can detect many, though not all, of such relationships, but the success critically depends on the optimal choice of the query sequence used to initiate the search. Generally, there is a correlation between the diversity of the sequences detected in the first pass of database screening and the ability of a given query to detect subtle relationships in subsequent iterations. Accordingly, a thorough analysis of protein superfamilies at the sequence level is necessary in order to maximize the chances of gleaning non-trivial structural and functional inferences, as opposed to a single search, initiated, for example, with the sequence of a protein whose structure is available. This strategy is illustrated by several findings, each of which involves an unexpected structural prediction: (i) a number of previously undetected proteins with the HSP70-actin fold are identified, including a highly conserved and nearly ubiquitous family of metal-dependent proteases (typified by bacterial O-sialoglycoprotease) that represent an adaptation of this fold to a new type of enzymatic activity; (ii) we show that, contrary to the previous conclusions, ATP-dependent and NAD-dependent DNA ligases are confidently predicted to possess the same fold; (iii) the C-terminal domain of 3-phosphoglycerate dehydrogenase, which binds serine and is involved in allosteric regulation of the enzyme activity, is shown to typify a new superfamily of ligand-binding, regulatory domains found primarily in enzymes and regulators of amino acid and purine metabolism; (iv) the immunoglobulin-like DNA-binding domain previously identified in the structures of transcription factors NFkappaB and NFAT is shown to be a member of a distinct superfamily of intracellular and extracellular domains with the immunoglobulin fold; and (v) the Rag-2 subunit of the V-D-J recombinase is shown to contain a kelch-type beta-propeller domain which rules out its evolutionary relationship with bacterial transposases. Copyright 1999 Academic Press.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10222208     DOI: 10.1006/jmbi.1999.2653

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  172 in total

1.  Crystal structure of NAD(+)-dependent DNA ligase: modular architecture and functional implications.

Authors:  J Y Lee; C Chang; H K Song; J Moon; J K Yang; H K Kim; S T Kwon; S W Suh
Journal:  EMBO J       Date:  2000-03-01       Impact factor: 11.598

Review 2.  Microbial relatives of the seed storage proteins of higher plants: conservation of structure and diversification of function during evolution of the cupin superfamily.

Authors:  J M Dunwell; S Khuri; P J Gane
Journal:  Microbiol Mol Biol Rev       Date:  2000-03       Impact factor: 11.056

3.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

4.  Comparative genome analysis of the pathogenic spirochetes Borrelia burgdorferi and Treponema pallidum.

Authors:  G Subramanian; E V Koonin; L Aravind
Journal:  Infect Immun       Date:  2000-03       Impact factor: 3.441

Review 5.  Structural and mechanistic conservation in DNA ligases.

Authors:  A J Doherty; S W Suh
Journal:  Nucleic Acids Res       Date:  2000-11-01       Impact factor: 16.971

6.  Bacterial-type DNA holliday junction resolvases in eukaryotic viruses.

Authors:  A D Garcia; L Aravind; E V Koonin; B Moss
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-01       Impact factor: 11.205

7.  Common fold in helix-hairpin-helix proteins.

Authors:  X Shao; N V Grishin
Journal:  Nucleic Acids Res       Date:  2000-07-15       Impact factor: 16.971

8.  A DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis.

Authors:  Kira S Makarova; L Aravind; Nick V Grishin; Igor B Rogozin; Eugene V Koonin
Journal:  Nucleic Acids Res       Date:  2002-01-15       Impact factor: 16.971

9.  A comparison of position-specific score matrices based on sequence and structure alignments.

Authors:  Anna R Panchenko; Stephen H Bryant
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

Review 10.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.