Literature DB >> 32979254

Learning peptide recognition rules for a low-specificity protein.

Lucas C Wheeler1,2,3, Arden Perkins1,2, Caitlyn E Wong1,2, Michael J Harms1,2.   

Abstract

Many proteins interact with short linear regions of target proteins. For some proteins, however, it is difficult to identify a well-defined sequence motif that defines its target peptides. To overcome this difficulty, we used supervised machine learning to train a model that treats each peptide as a collection of easily-calculated biochemical features rather than as an amino acid sequence. As a test case, we dissected the peptide-recognition rules for human S100A5 (hA5), a low-specificity calcium binding protein. We trained a Random Forest model against a recently released, high-throughput phage display dataset collected for hA5. The model identifies hydrophobicity and shape complementarity, rather than polar contacts, as the primary determinants of peptide binding specificity in hA5. We tested this hypothesis by solving a crystal structure of hA5 and through computational docking studies of diverse peptides onto hA5. These structural studies revealed that peptides exhibit multiple binding modes at the hA5 peptide interface-all of which have few polar contacts with hA5. Finally, we used our trained model to predict new, plausible binding targets in the human proteome. This revealed a fragment of the protein α-1-syntrophin that binds to hA5. Our work helps better understand the biochemistry and biology of hA5, as well as demonstrating how high-throughput experiments coupled with machine learning of biochemical features can reveal the determinants of binding specificity in low-specificity proteins.
© 2020 The Protein Society.

Entities:  

Keywords:  S100 proteins; X-ray crystallography; binding specificity; hydrophobicity; machine learning; peptides

Mesh:

Substances:

Year:  2020        PMID: 32979254      PMCID: PMC7586891          DOI: 10.1002/pro.3958

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  48 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  pytc: Open-Source Python Software for Global Analyses of Isothermal Titration Calorimetry Data.

Authors:  Hiranmayi Duvvuri; Lucas C Wheeler; Michael J Harms
Journal:  Biochemistry       Date:  2018-04-18       Impact factor: 3.162

Review 3.  Affinity and specificity of motif-based protein-protein interactions.

Authors:  Ylva Ivarsson; Per Jemth
Journal:  Curr Opin Struct Biol       Date:  2018-10-24       Impact factor: 6.809

4.  Syntrophin-dependent expression and localization of Aquaporin-4 water channel protein.

Authors:  J D Neely; M Amiry-Moghaddam; O P Ottersen; S C Froehner; P Agre; M E Adams
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-20       Impact factor: 11.205

5.  Data-driven supervised learning of a viral protease specificity landscape from deep sequencing and molecular simulations.

Authors:  Manasi A Pethe; Aliza B Rubenstein; Sagar D Khare
Journal:  Proc Natl Acad Sci U S A       Date:  2018-12-26       Impact factor: 11.205

6.  Learning peptide recognition rules for a low-specificity protein.

Authors:  Lucas C Wheeler; Arden Perkins; Caitlyn E Wong; Michael J Harms
Journal:  Protein Sci       Date:  2020-10-05       Impact factor: 6.725

7.  Brain S100A5 is a novel calcium-, zinc-, and copper ion-binding protein of the EF-hand superfamily.

Authors:  B W Schäfer; J M Fritschy; P Murmann; H Troxler; I Durussel; C W Heizmann; J A Cox
Journal:  J Biol Chem       Date:  2000-09-29       Impact factor: 5.157

8.  Restricted expression of calcium-binding protein S100A5 in human kidney.

Authors:  Takumi Teratani; Takumi Watanabe; Kaori Yamahara; Hiromichi Kumagai; Akira Ishikawa; Kazumori Arai; Ryushi Nozawa
Journal:  Biochem Biophys Res Commun       Date:  2002-03-01       Impact factor: 3.575

Review 9.  S100A6 - new facts and features.

Authors:  Wiesława Leśniak; Łukasz P Słomnicki; Anna Filipek
Journal:  Biochem Biophys Res Commun       Date:  2009-11-03       Impact factor: 3.575

10.  Short Linear Motifs recognized by SH2, SH3 and Ser/Thr Kinase domains are conserved in disordered protein regions.

Authors:  Siyuan Ren; Vladimir N Uversky; Zhengjun Chen; A Keith Dunker; Zoran Obradovic
Journal:  BMC Genomics       Date:  2008-09-16       Impact factor: 3.969

View more
  1 in total

1.  Learning peptide recognition rules for a low-specificity protein.

Authors:  Lucas C Wheeler; Arden Perkins; Caitlyn E Wong; Michael J Harms
Journal:  Protein Sci       Date:  2020-10-05       Impact factor: 6.725

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.