Literature DB >> 12975528

Using protein design for homology detection and active site searches.

Jimin Pei1, Nikolay V Dokholyan, Eugene I Shakhnovich, Nick V Grishin.   

Abstract

We describe a method of designing artificial sequences that resemble naturally occurring sequences in terms of their compatibility with a template structure and its functional constraints. The design procedure is a Monte Carlo simulation of amino acid substitution process. The selective fixation of substitutions is dictated by a simple scoring function derived from the template structure and a multiple alignment of its homologs. Designed sequences represent an enlargement of sequence space around native sequences. We show that the use of designed sequences improves the performance of profile-based homology detection. The difference in position-specific conservation between designed sequences and native sequences is helpful for prediction of functionally important residues. Our sequence selection criteria in evolutionary simulations introduce amino acid substitution rate variation among sites in a natural way, providing a better model to test phylogenetic methods.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12975528      PMCID: PMC208762          DOI: 10.1073/pnas.2034878100

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  36 in total

1.  Prediction of amino acid sequence from structure.

Authors:  K Raha; A M Wollacott; M J Italia; J R Desjarlais
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

2.  Understanding hierarchical protein evolution from first principles.

Authors:  N V Dokholyan; E I Shakhnovich
Journal:  J Mol Biol       Date:  2001-09-07       Impact factor: 5.469

3.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

4.  Native protein sequences are close to optimal for their structures.

Authors:  B Kuhlman; D Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2000-09-12       Impact factor: 11.205

5.  Estimation of the number of amino acid substitutions per site when the substitution rate varies among sites.

Authors:  N V Grishin
Journal:  J Mol Evol       Date:  1995-11       Impact factor: 2.395

6.  SEALS: a system for easy analysis of lots of sequences.

Authors:  D R Walker; E V Koonin
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1997

7.  De novo protein design: fully automated sequence selection.

Authors:  B I Dahiyat; S L Mayo
Journal:  Science       Date:  1997-10-03       Impact factor: 47.728

8.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

9.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction.

Authors:  R L Dunbrack; M Karplus
Journal:  J Mol Biol       Date:  1993-03-20       Impact factor: 5.469

10.  Structure prediction and active site analysis of the metal binding determinants in gamma -glutamylcysteine synthetase.

Authors:  J J Abbott; J Pei; J L Ford; Y Qi; V N Grishin; L A Pitcher; M A Phillips; N V Grishin
Journal:  J Biol Chem       Date:  2001-08-29       Impact factor: 5.157

View more
  10 in total

1.  Improving computational protein design by using structure-derived sequence profile.

Authors:  Liang Dai; Yuedong Yang; Hyung Rae Kim; Yaoqi Zhou
Journal:  Proteins       Date:  2010-08-01

2.  A score of the ability of a three-dimensional protein model to retrieve its own sequence as a quantitative measure of its quality and appropriateness.

Authors:  León P Martínez-Castilla; Rogelio Rodríguez-Sotres
Journal:  PLoS One       Date:  2010-09-07       Impact factor: 3.240

3.  Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design.

Authors:  Gong Cheng; Bin Qian; Ram Samudrala; David Baker
Journal:  Nucleic Acids Res       Date:  2005-10-13       Impact factor: 16.971

4.  Incorporating background frequency improves entropy-based residue conservation measures.

Authors:  Kai Wang; Ram Samudrala
Journal:  BMC Bioinformatics       Date:  2006-08-17       Impact factor: 3.169

5.  RosettaDesign server for protein design.

Authors:  Yi Liu; Brian Kuhlman
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

6.  The structurally constrained protein evolution model accounts for sequence patterns of the LbetaH superfamily.

Authors:  Gustavo Parisi; Julián Echave
Journal:  BMC Evol Biol       Date:  2004-10-22       Impact factor: 3.260

7.  NrichD database: sequence databases enriched with computationally designed protein-like sequences aid in remote homology detection.

Authors:  Richa Mudgal; Sankaran Sandhya; Gayatri Kumar; Ramanathan Sowdhamini; Nagasuma R Chandra; Narayanaswamy Srinivasan
Journal:  Nucleic Acids Res       Date:  2014-09-27       Impact factor: 16.971

8.  Identification of two conserved aspartic acid residues required for DNA digestion by a novel thermophilic Exonuclease VII in Thermotoga maritima.

Authors:  Andres A Larrea; Ilene M Pedroso; Arun Malhotra; Richard S Myers
Journal:  Nucleic Acids Res       Date:  2008-09-23       Impact factor: 16.971

9.  Reconstruction of ancestral protein sequences and its applications.

Authors:  Wei Cai; Jimin Pei; Nick V Grishin
Journal:  BMC Evol Biol       Date:  2004-09-17       Impact factor: 3.260

10.  Protein meta-functional signatures from combining sequence, structure, evolution, and amino acid property information.

Authors:  Kai Wang; Jeremy A Horst; Gong Cheng; David C Nickle; Ram Samudrala
Journal:  PLoS Comput Biol       Date:  2008-09-26       Impact factor: 4.475

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.