Literature DB >> 32176271

DeCoDe: degenerate codon design for complete protein-coding DNA libraries.

Tyler C Shimko1, Polly M Fordyce1,2,3,4, Yaron Orenstein5.   

Abstract

MOTIVATION: High-throughput protein screening is a critical technique for dissecting and designing protein function. Libraries for these assays can be created through a number of means, including targeted or random mutagenesis of a template protein sequence or direct DNA synthesis. However, mutagenic library construction methods often yield vastly more nonfunctional than functional variants and, despite advances in large-scale DNA synthesis, individual synthesis of each desired DNA template is often prohibitively expensive. Consequently, many protein-screening libraries rely on the use of degenerate codons (DCs), mixtures of DNA bases incorporated at specific positions during DNA synthesis, to generate highly diverse protein-variant pools from only a few low-cost synthesis reactions. However, selecting DCs for sets of sequences that covary at multiple positions dramatically increases the difficulty of designing a DC library and leads to the creation of many undesired variants that can quickly outstrip screening capacity.
RESULTS: We introduce a novel algorithm for total DC library optimization, degenerate codon design (DeCoDe), based on integer linear programming. DeCoDe significantly outperforms state-of-the-art DC optimization algorithms and scales well to more than a hundred proteins sharing complex patterns of covariation (e.g. the lab-derived avGFP lineage). Moreover, DeCoDe is, to our knowledge, the first DC design algorithm with the capability to encode mixed-length protein libraries. We anticipate DeCoDe to be broadly useful for a variety of library generation problems, ranging from protein engineering attempts that leverage mutual information to the reconstruction of ancestral protein states.
AVAILABILITY AND IMPLEMENTATION: github.com/OrensteinLab/DeCoDe. CONTACT: yaronore@bgu.ac.il. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32176271      PMCID: PMC7267834          DOI: 10.1093/bioinformatics/btaa162

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  43 in total

1.  An in vitro DNA virus for in vitro protein evolution.

Authors:  I Tabuchi; S Soramoto; N Nemoto; Y Husimi
Journal:  FEBS Lett       Date:  2001-11-23       Impact factor: 4.124

2.  Design of a novel globular protein fold with atomic-level accuracy.

Authors:  Brian Kuhlman; Gautam Dantas; Gregory C Ireton; Gabriele Varani; Barry L Stoddard; David Baker
Journal:  Science       Date:  2003-11-21       Impact factor: 47.728

3.  Optimization of combinatorial mutagenesis.

Authors:  Andrew S Parker; Karl E Griswold; Chris Bailey-Kellogg
Journal:  J Comput Biol       Date:  2011-09-16       Impact factor: 1.479

4.  RNA-peptide fusions for the in vitro selection of peptides and proteins.

Authors:  R W Roberts; J W Szostak
Journal:  Proc Natl Acad Sci U S A       Date:  1997-11-11       Impact factor: 11.205

5.  FPbase: a community-editable fluorescent protein database.

Authors:  Talley J Lambert
Journal:  Nat Methods       Date:  2019-04       Impact factor: 28.547

6.  Evolutionary trend toward kinetic stability in the folding trajectory of RNases H.

Authors:  Shion A Lim; Kathryn M Hart; Michael J Harms; Susan Marqusee
Journal:  Proc Natl Acad Sci U S A       Date:  2016-10-31       Impact factor: 11.205

7.  Yeast surface display for screening combinatorial polypeptide libraries.

Authors:  E T Boder; K D Wittrup
Journal:  Nat Biotechnol       Date:  1997-06       Impact factor: 54.908

8.  Resurrecting the ancestral steroid receptor: ancient origin of estrogen signaling.

Authors:  Joseph W Thornton; Eleanor Need; David Crews
Journal:  Science       Date:  2003-09-19       Impact factor: 47.728

9.  Epitope mapping of antibodies using bacterial surface display.

Authors:  Johan Rockberg; John Löfblom; Barbara Hjelm; Mathias Uhlén; Stefan Ståhl
Journal:  Nat Methods       Date:  2008-11-23       Impact factor: 28.547

10.  Local fitness landscape of the green fluorescent protein.

Authors:  Karen S Sarkisyan; Dmitry A Bolotin; Margarita V Meer; Dinara R Usmanova; Alexander S Mishin; George V Sharonov; Dmitry N Ivankov; Nina G Bozhanova; Mikhail S Baranov; Onuralp Soylemez; Natalya S Bogatyreva; Peter K Vlasov; Evgeny S Egorov; Maria D Logacheva; Alexey S Kondrashov; Dmitry M Chudakov; Ekaterina V Putintseva; Ilgar Z Mamedov; Dan S Tawfik; Konstantin A Lukyanov; Fyodor A Kondrashov
Journal:  Nature       Date:  2016-05-11       Impact factor: 49.962

View more
  2 in total

1.  CoLiDe: Combinatorial Library Design tool for probing protein sequence space.

Authors:  Vyacheslav Tretyachenko; Václav Voráček; Radko Souček; Kosuke Fujishima; Klára Hlouchová
Journal:  Bioinformatics       Date:  2021-05-01       Impact factor: 6.937

2.  uPIC-M: Efficient and Scalable Preparation of Clonal Single Mutant Libraries for High-Throughput Protein Biochemistry.

Authors:  Mason J Appel; Scott A Longwell; Maurizio Morri; Norma Neff; Daniel Herschlag; Polly M Fordyce
Journal:  ACS Omega       Date:  2021-11-02
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.