Justin R Klesmith1, Benjamin J Hackel1. 1. Department of Chemical Engineering and Materials Science, University of Minnesota, Twin Cities, Minneapolis, MN, USA.
Abstract
MOTIVATION: Deep mutational scanning experiments have enabled the measurement of the sequence-function relationship for thousands of mutations in a single experiment. The Protein Analysis and Classifier Toolkit (PACT) is a Python software package that marries the fitness metric of a given mutation within these experiments to sequence and structural features enabling downstream analyses. PACT enables the easy development of user sharable protocols for custom deep mutational scanning experiments as all code is modular and reusable between protocols. Protocols for mutational libraries with single or multiple mutations are included. To exemplify its utility, PACT assessed two deep mutational scanning datasets that measured the tradeoff of enzyme activity and enzyme stability. RESULTS: PACT efficiently evaluated classifiers that predict protein mutant function tested on deep mutational scanning screens. We found that the classifiers with the lowest false positive and highest true positive rate assesses sequence homology, contact number and if mutation involves proline. AVAILABILITY AND IMPLEMENTATION: PACT and the processed datasets are distributed freely under the terms of the GPL-3 license. The source code is available at GitHub (https://github.com/JKlesmith/PACT). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Deep mutational scanning experiments have enabled the measurement of the sequence-function relationship for thousands of mutations in a single experiment. The Protein Analysis and Classifier Toolkit (PACT) is a Python software package that marries the fitness metric of a given mutation within these experiments to sequence and structural features enabling downstream analyses. PACT enables the easy development of user sharable protocols for custom deep mutational scanning experiments as all code is modular and reusable between protocols. Protocols for mutational libraries with single or multiple mutations are included. To exemplify its utility, PACT assessed two deep mutational scanning datasets that measured the tradeoff of enzyme activity and enzyme stability. RESULTS:PACT efficiently evaluated classifiers that predict protein mutant function tested on deep mutational scanning screens. We found that the classifiers with the lowest false positive and highest true positive rate assesses sequence homology, contact number and if mutation involves proline. AVAILABILITY AND IMPLEMENTATION:PACT and the processed datasets are distributed freely under the terms of the GPL-3 license. The source code is available at GitHub (https://github.com/JKlesmith/PACT). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Jennifer R Cochran; Yong-Sung Kim; Shaun M Lippow; Balaji Rao; K Dane Wittrup Journal: Protein Eng Des Sel Date: 2006-03-30 Impact factor: 1.650
Authors: Timothy A Whitehead; Aaron Chevalier; Yifan Song; Cyrille Dreyfus; Sarel J Fleishman; Cecilia De Mattos; Chris A Myers; Hetunandan Kamisetty; Patrick Blair; Ian A Wilson; David Baker Journal: Nat Biotechnol Date: 2012-05-27 Impact factor: 54.908
Authors: Douglas M Fowler; Carlos L Araya; Sarel J Fleishman; Elizabeth H Kellogg; Jason J Stephany; David Baker; Stanley Fields Journal: Nat Methods Date: 2010-08-15 Impact factor: 28.547
Authors: Angélica V Medina-Cucurella; Paul J Steiner; Matthew S Faber; Jesús Beltrán; Alexandra N Borelli; Monica B Kirby; Sean R Cutler; Timothy A Whitehead Journal: Protein Eng Des Sel Date: 2019-09-10 Impact factor: 1.650
Authors: Matthew S Faber; James T Van Leuven; Martina M Ederer; Yesol Sapozhnikov; Zoë L Wilson; Holly A Wichman; Timothy A Whitehead; Craig R Miller Journal: ACS Synth Biol Date: 2020-01-07 Impact factor: 5.110
Authors: Alison C Leonard; Jonathan J Weinstein; Paul J Steiner; Annette H Erbse; Sarel J Fleishman; Timothy A Whitehead Journal: Protein Eng Des Sel Date: 2022-02-17 Impact factor: 1.952
Authors: Alison C Leonard; Jonathan J Weinstein; Paul J Steiner; Annette H Erbse; Sarel J Fleishman; Timothy A Whitehead Journal: bioRxiv Date: 2021-11-24