Literature DB >> 12409479

Designing gene libraries from protein profiles for combinatorial protein experiments.

Wei Wang1, Jeffery G Saven.   

Abstract

Protein combinatorial libraries provide new ways to probe the determinants of folding and to discover novel proteins. Such libraries are often constructed by expressing an ensemble of partially random gene sequences. Given the intractably large number of possible sequences, some limitation on diversity must be imposed. A non-uniform distribution of nucleotides can be used to reduce the number of possible sequences and encode peptide sequences having a predetermined set of amino acid probabilities at each residue position, i.e., the amino acid sequence profile. Such profiles can be determined by inspection, multiple sequence alignment or physically-based computational methods. Here we present a computational method that takes as input a desired sequence profile and calculates the individual nucleotide probabilities among partially random genes. The calculated gene library can be readily used in the context of standard DNA synthesis to generate a protein library with essentially the desired profile. The fidelity between the desired profile and the calculated one coded by these partially random genes is quantitatively evaluated using the linear correlation coefficient and a relative entropy, each of which provides a measure of profile agreement at each position of the sequence. On average, this method of identifying such codon frequencies performs as well or better than other methods with regard to fidelity to the original profile. Importantly, the method presented here provides much better yields of complete sequences that do not contain stop codons, a feature that is particularly important when all or large fractions of a gene are subject to combinatorial mutation.

Mesh:

Substances:

Year:  2002        PMID: 12409479      PMCID: PMC135844          DOI: 10.1093/nar/gnf119

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  20 in total

1.  Optimizing doped libraries by using genetic algorithms.

Authors:  D Tomandl; A Schober; A Schwienhorst
Journal:  J Comput Aided Mol Des       Date:  1997-01       Impact factor: 3.686

2.  Assessment of protein models with three-dimensional profiles.

Authors:  R Lüthy; J U Bowie; D Eisenberg
Journal:  Nature       Date:  1992-03-05       Impact factor: 49.962

3.  Database of homology-derived protein structures and the structural meaning of sequence alignment.

Authors:  C Sander; R Schneider
Journal:  Proteins       Date:  1991

4.  A phage display system for studying the sequence determinants of protein folding.

Authors:  H Gu; Q Yi; S T Bray; D S Riddle; A K Shiau; D Baker
Journal:  Protein Sci       Date:  1995-06       Impact factor: 6.725

5.  Scoring functions for computational algorithms applicable to the design of spiked oligonucleotides.

Authors:  L J Jensen; K V Andersen; A Svendsen; T Kretzschmar
Journal:  Nucleic Acids Res       Date:  1998-02-01       Impact factor: 16.971

6.  Yeast surface display for screening combinatorial polypeptide libraries.

Authors:  E T Boder; K D Wittrup
Journal:  Nat Biotechnol       Date:  1997-06       Impact factor: 54.908

7.  Design of synthetic gene libraries encoding random sequence proteins with desired ensemble characteristics.

Authors:  T H LaBean; S A Kauffman
Journal:  Protein Sci       Date:  1993-08       Impact factor: 6.725

8.  Protein design by binary patterning of polar and nonpolar amino acids.

Authors:  S Kamtekar; J M Schiffer; H Xiong; J M Babik; M H Hecht
Journal:  Science       Date:  1993-12-10       Impact factor: 47.728

9.  Antibody engineering by parsimonious mutagenesis.

Authors:  R F Balint; J W Larrick
Journal:  Gene       Date:  1993-12-27       Impact factor: 3.688

Review 10.  Effects of rare codon clusters on high-level expression of heterologous proteins in Escherichia coli.

Authors:  J F Kane
Journal:  Curr Opin Biotechnol       Date:  1995-10       Impact factor: 9.740

View more
  6 in total

1.  Oligonucleotide-directed site-specific integration of high complexity libraries into ssDNA templates.

Authors:  M B Hale; G P Nolan; R Wolkowicz
Journal:  Nucleic Acids Res       Date:  2004-01-29       Impact factor: 16.971

Review 2.  Designing specific protein-protein interactions using computation, experimental library screening, or integrated methods.

Authors:  T Scott Chen; Amy E Keating
Journal:  Protein Sci       Date:  2012-06-08       Impact factor: 6.725

3.  SwiftLib: rapid degenerate-codon-library optimization through dynamic programming.

Authors:  Timothy M Jacobs; Hayretin Yumerefendi; Brian Kuhlman; Andrew Leaver-Fay
Journal:  Nucleic Acids Res       Date:  2014-12-24       Impact factor: 16.971

4.  Structure-based redesign of the binding specificity of anti-apoptotic Bcl-x(L).

Authors:  T Scott Chen; Hector Palacios; Amy E Keating
Journal:  J Mol Biol       Date:  2012-11-12       Impact factor: 5.469

5.  A focused antibody library for selecting scFvs expressed at high levels in the cytoplasm.

Authors:  Pascal Philibert; Audrey Stoessel; Wei Wang; Annie-Paule Sibler; Nicole Bec; Christian Larroque; Jeffery G Saven; Jérôme Courtête; Etienne Weiss; Pierre Martineau
Journal:  BMC Biotechnol       Date:  2007-11-22       Impact factor: 2.563

6.  Optimizing nucleotide sequence ensembles for combinatorial protein libraries using a genetic algorithm.

Authors:  Roger A Craig; Jin Lu; Jinquan Luo; Lei Shi; Li Liao
Journal:  Nucleic Acids Res       Date:  2009-11-04       Impact factor: 16.971

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.