Derek J Pitman1, Christian D Schenkelberg1, Yao-Ming Huang2, Frank D Teets1, Daniel DiTursi1, Christopher Bystroff3. 1. Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, Department of Computer Science and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA. 2. Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, Department of Computer Science and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, Department of Computer Science and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA. 3. Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, Department of Computer Science and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, Department of Computer Science and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA Department of Biology, Rensselaer Polytechnic Institute, Troy, NY 12180, Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, Department of Computer Science and Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
Abstract
MOTIVATION: Accuracy in protein design requires a fine-grained rotamer search, multiple backbone conformations, and a detailed energy function, creating a burden in runtime and memory requirements. A design task may be split into manageable pieces in both three-dimensional space and in the rotamer search space to produce small, fast jobs that are easily distributed. However, these jobs must overlap, presenting a problem in resolving conflicting solutions in the overlap regions. RESULTS: Piecemeal design, in which the design space is split into overlapping regions and rotamer search spaces, accelerates the design process whether jobs are run in series or in parallel. Large jobs that cannot fit in memory were made possible by splitting. Accepting the consensus amino acid selection in conflict regions led to non-optimal choices. Instead, conflicts were resolved using a second pass, in which the split regions were re-combined and designed as one, producing results that were closer to optimal with a minimal increase in runtime over the consensus strategy. Splitting the search space at the rotamer level instead of at the amino acid level further improved the efficiency by reducing the search space in the second pass. AVAILABILITY AND IMPLEMENTATION: Programs for splitting protein design expressions are available at www.bioinfo.rpi.edu/tools/piecemeal.html CONTACT: bystrc@rpi.edu Supplementary information: Supplementary data are available at Bioinformatics online.
MOTIVATION: Accuracy in protein design requires a fine-grained rotamer search, multiple backbone conformations, and a detailed energy function, creating a burden in runtime and memory requirements. A design task may be split into manageable pieces in both three-dimensional space and in the rotamer search space to produce small, fast jobs that are easily distributed. However, these jobs must overlap, presenting a problem in resolving conflicting solutions in the overlap regions. RESULTS: Piecemeal design, in which the design space is split into overlapping regions and rotamer search spaces, accelerates the design process whether jobs are run in series or in parallel. Large jobs that cannot fit in memory were made possible by splitting. Accepting the consensus amino acid selection in conflict regions led to non-optimal choices. Instead, conflicts were resolved using a second pass, in which the split regions were re-combined and designed as one, producing results that were closer to optimal with a minimal increase in runtime over the consensus strategy. Splitting the search space at the rotamer level instead of at the amino acid level further improved the efficiency by reducing the search space in the second pass. AVAILABILITY AND IMPLEMENTATION: Programs for splitting protein design expressions are available at www.bioinfo.rpi.edu/tools/piecemeal.html CONTACT: bystrc@rpi.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Ilan Samish; Christopher M MacDermaid; Jose Manuel Perez-Aguilar; Jeffery G Saven Journal: Annu Rev Phys Chem Date: 2011 Impact factor: 12.703
Authors: John Karanicolas; Jacob E Corn; Irwin Chen; Lukasz A Joachimiak; Orly Dym; Sun H Peck; Shira Albeck; Tamar Unger; Wenxin Hu; Gaohua Liu; Scott Delbecq; Gaetano T Montelione; Clint P Spiegel; David R Liu; David Baker Journal: Mol Cell Date: 2011-03-31 Impact factor: 17.970
Authors: Frank V Cochran; Sophia P Wu; Wei Wang; Vikas Nanda; Jeffery G Saven; Michael J Therien; William F DeGrado Journal: J Am Chem Soc Date: 2005-02-09 Impact factor: 15.419
Authors: Yao-Ming Huang; Shounak Banerjee; Donna E Crone; Christian D Schenkelberg; Derek J Pitman; Patrick M Buck; Christopher Bystroff Journal: Biochemistry Date: 2015-09-30 Impact factor: 3.162