Literature DB >> 15321723

Estimating the prevalence of protein sequences adopting functional enzyme folds.

Douglas D Axe1.   

Abstract

Proteins employ a wide variety of folds to perform their biological functions. How are these folds first acquired? An important step toward answering this is to obtain an estimate of the overall prevalence of sequences adopting functional folds. Since tertiary structure is needed for a typical enzyme active site to form, one way to obtain this estimate is to measure the prevalence of sequences supporting a working active site. Although the immense number of sequence combinations makes wholly random sampling unfeasible, two key simplifications may provide a solution. First, given the importance of hydrophobic interactions to protein folding, it seems likely that the sample space can be restricted to sequences carrying the hydropathic signature of a known fold. Second, because folds are stabilized by the cooperative action of many local interactions distributed throughout the structure, the overall problem of fold stabilization may be viewed reasonably as a collection of coupled local problems. This enables the difficulty of the whole problem to be assessed by assessing the difficulty of several smaller problems. Using these simplifications, the difficulty of specifying a working beta-lactamase domain is assessed here. An alignment of homologous domain sequences is used to deduce the pattern of hydropathic constraints along chains that form the domain fold. Starting with a weakly functional sequence carrying this signature, clusters of ten side-chains within the fold are replaced randomly, within the boundaries of the signature, and tested for function. The prevalence of low-level function in four such experiments indicates that roughly one in 10(64) signature-consistent sequences forms a working domain. Combined with the estimated prevalence of plausible hydropathic patterns (for any fold) and of relevant folds for particular functions, this implies the overall prevalence of sequences performing a specific function by any domain-sized fold may be as low as 1 in 10(77), adding to the body of evidence that functional folds require highly extraordinary sequences.

Mesh:

Substances:

Year:  2004        PMID: 15321723     DOI: 10.1016/j.jmb.2004.06.058

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  14 in total

1.  Thermodynamic prediction of protein neutrality.

Authors:  Jesse D Bloom; Jonathan J Silberg; Claus O Wilke; D Allan Drummond; Christoph Adami; Frances H Arnold
Journal:  Proc Natl Acad Sci U S A       Date:  2005-01-11       Impact factor: 11.205

2.  Cofactor binding and enzymatic activity in an unevolved superfamily of de novo designed 4-helix bundle proteins.

Authors:  Shona C Patel; Luke H Bradley; Sayuri P Jinadasa; Michael H Hecht
Journal:  Protein Sci       Date:  2009-07       Impact factor: 6.725

Review 3.  The outlook for protein engineering in crop improvement.

Authors:  A Gururaj Rao
Journal:  Plant Physiol       Date:  2008-05       Impact factor: 8.340

4.  Navigating the protein fitness landscape with Gaussian processes.

Authors:  Philip A Romero; Andreas Krause; Frances H Arnold
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-31       Impact factor: 11.205

5.  Evolutionary innovations and the organization of protein functions in genotype space.

Authors:  Evandro Ferrada; Andreas Wagner
Journal:  PLoS One       Date:  2010-11-30       Impact factor: 3.240

Review 6.  Exploring protein fitness landscapes by directed evolution.

Authors:  Philip A Romero; Frances H Arnold
Journal:  Nat Rev Mol Cell Biol       Date:  2009-12       Impact factor: 94.444

7.  Random field model reveals structure of the protein recombinational landscape.

Authors:  Philip A Romero; Frances H Arnold
Journal:  PLoS Comput Biol       Date:  2012-10-04       Impact factor: 4.475

8.  Structure-guided recombination creates an artificial family of cytochromes P450.

Authors:  Christopher R Otey; Marco Landwehr; Jeffrey B Endelman; Kaori Hiraga; Jesse D Bloom; Frances H Arnold
Journal:  PLoS Biol       Date:  2006-04-11       Impact factor: 8.029

9.  The Universal Plausibility Metric (UPM) & Principle (UPP).

Authors:  David L Abel
Journal:  Theor Biol Med Model       Date:  2009-12-03       Impact factor: 2.432

10.  Stylus: a system for evolutionary experimentation based on a protein/proteome model with non-arbitrary functional constraints.

Authors:  Douglas D Axe; Brendan W Dixon; Philip Lu
Journal:  PLoS One       Date:  2008-06-04       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.