Literature DB >> 12441379

Thoroughly sampling sequence space: large-scale protein design of structural ensembles.

Stefan M Larson1, Jeremy L England, John R Desjarlais, Vijay S Pande.   

Abstract

Modeling the inherent flexibility of the protein backbone as part of computational protein design is necessary to capture the behavior of real proteins and is a prerequisite for the accurate exploration of protein sequence space. We present the results of a broad exploration of sequence space, with backbone flexibility, through a novel approach: large-scale protein design to structural ensembles. A distributed computing architecture has allowed us to generate hundreds of thousands of diverse sequences for a set of 253 naturally occurring proteins, allowing exciting insights into the nature of protein sequence space. Designing to a structural ensemble produces a much greater diversity of sequences than previous studies have reported, and homology searches using profiles derived from the designed sequences against the Protein Data Bank show that the relevance and quality of the sequences is not diminished. The designed sequences have greater overall diversity than corresponding natural sequence alignments, and no direct correlations are seen between the diversity of natural sequence alignments and the diversity of the corresponding designed sequences. For structures in the same fold, the sequence entropies of the designed sequences cluster together tightly. This tight clustering of sequence entropies within a fold and the separation of sequence entropy distributions for different folds suggest that the diversity of designed sequences is primarily determined by a structure's overall fold, and that the designability principle postulated from studies of simple models holds in real proteins. This has important implications for experimental protein design and engineering, as well as providing insight into protein evolution.

Mesh:

Substances:

Year:  2002        PMID: 12441379      PMCID: PMC2373757          DOI: 10.1110/ps.0203902

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  52 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Improved biocatalysts by directed evolution and rational protein design.

Authors:  U T Bornscheuer; M Pohl
Journal:  Curr Opin Chem Biol       Date:  2001-04       Impact factor: 8.822

3.  Analysis of covariation in an SH3 domain sequence alignment: applications in tertiary contact prediction and the design of compensating hydrophobic core substitutions.

Authors:  S M Larson; A A Di Nardo; A R Davidson
Journal:  J Mol Biol       Date:  2000-10-27       Impact factor: 5.469

4.  Prediction of amino acid sequence from structure.

Authors:  K Raha; A M Wollacott; M J Italia; J R Desjarlais
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

5.  Native protein sequences are close to optimal for their structures.

Authors:  B Kuhlman; D Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2000-09-12       Impact factor: 11.205

Review 6.  Tailoring new enzyme functions by rational redesign.

Authors:  F Cedrone; A Ménez; E Quéméneur
Journal:  Curr Opin Struct Biol       Date:  2000-08       Impact factor: 6.809

7.  Threading a database of protein cores.

Authors:  T Madej; J F Gibrat; S H Bryant
Journal:  Proteins       Date:  1995-11

8.  Contact order, transition state placement and the refolding rates of single domain proteins.

Authors:  K W Plaxco; K T Simons; D Baker
Journal:  J Mol Biol       Date:  1998-04-10       Impact factor: 5.469

9.  Are protein folds atypical?

Authors:  H Li; C Tang; N S Wingreen
Journal:  Proc Natl Acad Sci U S A       Date:  1998-04-28       Impact factor: 11.205

10.  Protein family classification based on searching a database of blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Genomics       Date:  1994-01-01       Impact factor: 5.736

View more
  38 in total

1.  Coarse-grained sequences for protein folding and design.

Authors:  Scott Brown; Nicolas J Fawzi; Teresa Head-Gordon
Journal:  Proc Natl Acad Sci U S A       Date:  2003-09-08       Impact factor: 11.205

2.  Experimental library screening demonstrates the successful application of computational protein design to large structural ensembles.

Authors:  Benjamin D Allen; Alex Nisthal; Stephen L Mayo
Journal:  Proc Natl Acad Sci U S A       Date:  2010-11-02       Impact factor: 11.205

3.  Protein backbone ensemble generation explores the local structural space of unseen natural homologs.

Authors:  Christian D Schenkelberg; Christopher Bystroff
Journal:  Bioinformatics       Date:  2016-01-18       Impact factor: 6.937

4.  Improvement of comparative model accuracy by free-energy optimization along principal components of natural structural variation.

Authors:  Bin Qian; Angel R Ortiz; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-18       Impact factor: 11.205

5.  Protein sequence entropy is closely related to packing density and hydrophobicity.

Authors:  H Liao; W Yeh; D Chiang; R L Jernigan; B Lustig
Journal:  Protein Eng Des Sel       Date:  2005-03-23       Impact factor: 1.650

6.  Sequence optimization and designability of enzyme active sites.

Authors:  Raj Chakrabarti; Alexander M Klibanov; Richard A Friesner
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-15       Impact factor: 11.205

7.  Computationally designed libraries of fluorescent proteins evaluated by preservation and diversity of function.

Authors:  Thomas P Treynor; Christina L Vizcarra; Daniel Nedelcu; Stephen L Mayo
Journal:  Proc Natl Acad Sci U S A       Date:  2006-12-19       Impact factor: 11.205

8.  Packing regularities in biological structures relate to their dynamics.

Authors:  Robert L Jernigan; Andrzej Kloczkowski
Journal:  Methods Mol Biol       Date:  2007

9.  Computationally mapping sequence space to understand evolutionary protein engineering.

Authors:  Kathryn A Armstrong; Bruce Tidor
Journal:  Biotechnol Prog       Date:  2007-11-17

10.  Toward full-sequence de novo protein design with flexible templates for human beta-defensin-2.

Authors:  Ho Ki Fung; Christodoulos A Floudas; Martin S Taylor; Li Zhang; Dimitrios Morikis
Journal:  Biophys J       Date:  2007-09-07       Impact factor: 4.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.