Literature DB >> 35039677

Learning protein fitness models from evolutionary and assay-labeled data.

Chloe Hsu1, Hunter Nisonoff2, Clara Fannjiang3, Jennifer Listgarten4,5.   

Abstract

Machine learning-based models of protein fitness typically learn from either unlabeled, evolutionarily related sequences or variant sequences with experimentally measured labels. For regimes where only limited experimental data are available, recent work has suggested methods for combining both sources of information. Toward that goal, we propose a simple combination approach that is competitive with, and on average outperforms more sophisticated methods. Our approach uses ridge regression on site-specific amino acid features combined with one probability density feature from modeling the evolutionary data. Within this approach, we find that a variational autoencoder-based probability density model showed the best overall performance, although any evolutionary density model can be used. Moreover, our analysis highlights the importance of systematic evaluations and sufficient baselines.
© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35039677     DOI: 10.1038/s41587-021-01146-5

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   68.164


  40 in total

1.  Laboratory evolution of peroxide-mediated cytochrome P450 hydroxylation.

Authors:  H Joo; Z Lin; F H Arnold
Journal:  Nature       Date:  1999-06-17       Impact factor: 49.962

2.  Molecular dynamics and protein function.

Authors:  M Karplus; J Kuriyan
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-03       Impact factor: 11.205

Review 3.  Protein therapeutics: a summary and pharmacological classification.

Authors:  Benjamin Leader; Quentin J Baca; David E Golan
Journal:  Nat Rev Drug Discov       Date:  2008-01       Impact factor: 84.694

4.  Engineering green fluorescent protein for improved brightness, longer wavelengths and fluorescence resonance energy transfer.

Authors:  R Heim; R Y Tsien
Journal:  Curr Biol       Date:  1996-02-01       Impact factor: 10.834

Review 5.  Development and applications of CRISPR-Cas9 for genome engineering.

Authors:  Patrick D Hsu; Eric S Lander; Feng Zhang
Journal:  Cell       Date:  2014-06-05       Impact factor: 41.582

Review 6.  Molecular basis of glyphosate resistance-different approaches through protein engineering.

Authors:  Loredano Pollegioni; Ernst Schonbrunn; Daniel Siehl
Journal:  FEBS J       Date:  2011-06-28       Impact factor: 5.542

7.  Green fluorescent protein as a marker for gene expression.

Authors:  M Chalfie; Y Tu; G Euskirchen; W W Ward; D C Prasher
Journal:  Science       Date:  1994-02-11       Impact factor: 47.728

8.  Global analysis of protein folding using massively parallel design, synthesis, and testing.

Authors:  Gabriel J Rocklin; Tamuka M Chidyausiku; Inna Goreshnik; Alex Ford; Scott Houliston; Alexander Lemak; Lauren Carter; Rashmi Ravichandran; Vikram K Mulligan; Aaron Chevalier; Cheryl H Arrowsmith; David Baker
Journal:  Science       Date:  2017-07-14       Impact factor: 47.728

9.  The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design.

Authors:  Rebecca F Alford; Andrew Leaver-Fay; Jeliazko R Jeliazkov; Matthew J O'Meara; Frank P DiMaio; Hahnbeom Park; Maxim V Shapovalov; P Douglas Renfrew; Vikram K Mulligan; Kalli Kappel; Jason W Labonte; Michael S Pacella; Richard Bonneau; Philip Bradley; Roland L Dunbrack; Rhiju Das; David Baker; Brian Kuhlman; Tanja Kortemme; Jeffrey J Gray
Journal:  J Chem Theory Comput       Date:  2017-05-12       Impact factor: 6.006

View more
  2 in total

1.  Mutational analysis of SARS-CoV-2 variants of concern reveals key tradeoffs between receptor affinity and antibody escape.

Authors:  Emily K Makowski; John S Schardt; Matthew D Smith; Peter M Tessier
Journal:  PLoS Comput Biol       Date:  2022-05-31       Impact factor: 4.779

Review 2.  Machine learning to navigate fitness landscapes for protein engineering.

Authors:  Chase R Freschlin; Sarah A Fahlberg; Philip A Romero
Journal:  Curr Opin Biotechnol       Date:  2022-04-09       Impact factor: 10.279

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.