Literature DB >> 35129957

End-to-End Deep Learning Model to Predict and Design Secondary Structure Content of Structural Proteins.

Chi-Hua Yu1,2, Wei Chen2, Yu-Hsuan Chiang3, Kai Guo1, Zaira Martin Moldes4, David L Kaplan4, Markus J Buehler1,5,6.   

Abstract

Structural proteins are the basis of many biomaterials and key construction and functional components of all life. Further, it is well-known that the diversity of proteins' function relies on their local structures derived from their primary amino acid sequences. Here, we report a deep learning model to predict the secondary structure content of proteins directly from primary sequences, with high computational efficiency. Understanding the secondary structure content of proteins is crucial to designing proteins with targeted material functions, especially mechanical properties. Using convolutional and recurrent architectures and natural language models, our deep learning model predicts the content of two essential types of secondary structures, the α-helix and the β-sheet. The training data are collected from the Protein Data Bank and contain many existing protein geometries. We find that our model can learn the hidden features as patterns of input sequences that can then be directly related to secondary structure content. The α-helix and β-sheet content predictions show excellent agreement with training data and newly deposited protein structures that were recently identified and that were not included in the original training set. We further demonstrate the features of the model by a search for de novo protein sequences that optimize max/min α-helix/β-sheet content and compare the predictions with folded models of these sequences based on AlphaFold2. Excellent agreement is found, underscoring that our model has predictive potential for rapidly designing proteins with specific secondary structures and could be widely applied to biomedical industries, including protein biomaterial designs and regenerative medicine applications.

Entities:  

Keywords:  artificial intelligence; deep learning; materiomics; protein structure; secondary structure; structural proteins

Mesh:

Substances:

Year:  2022        PMID: 35129957      PMCID: PMC9347213          DOI: 10.1021/acsbiomaterials.1c01343

Source DB:  PubMed          Journal:  ACS Biomater Sci Eng        ISSN: 2373-9878


  48 in total

1.  Unfolding proteins by external forces and temperature: the importance of topology and energetics.

Authors:  E Paci; M Karplus
Journal:  Proc Natl Acad Sci U S A       Date:  2000-06-06       Impact factor: 11.205

2.  Hypotheses that correlate the sequence, structure, and mechanical properties of spider silk proteins.

Authors:  C Y Hayashi; N H Shipley; R V Lewis
Journal:  Int J Biol Macromol       Date:  1999 Mar-Apr       Impact factor: 6.953

3.  Comparative analysis of nanomechanics of protein filaments under lateral loading.

Authors:  Max Solar; Markus J Buehler
Journal:  Nanoscale       Date:  2011-12-22       Impact factor: 7.790

4.  Nanostructure and molecular mechanics of spider dragline silk protein assemblies.

Authors:  Sinan Keten; Markus J Buehler
Journal:  J R Soc Interface       Date:  2010-06-02       Impact factor: 4.118

5.  Secondary structure and rigidity in model proteins.

Authors:  Stefania Perticaroli; Jonathan D Nickels; Georg Ehlers; Hugh O'Neill; Qui Zhang; Alexei P Sokolov
Journal:  Soft Matter       Date:  2013-10-28       Impact factor: 3.679

6.  Dissecting the structural determinants for the difference in mechanical stability of silk and amyloid beta-sheet stacks.

Authors:  Senbo Xiao; Shijun Xiao; Frauke Gräter
Journal:  Phys Chem Chem Phys       Date:  2013-04-30       Impact factor: 3.676

7.  ColabFold: making protein folding accessible to all.

Authors:  Milot Mirdita; Sergey Ovchinnikov; Martin Steinegger; Konstantin Schütze; Yoshitaka Moriwaki; Lim Heo
Journal:  Nat Methods       Date:  2022-05-30       Impact factor: 47.990

Review 8.  Deep learning methods in protein structure prediction.

Authors:  Mirko Torrisi; Gianluca Pollastri; Quan Le
Journal:  Comput Struct Biotechnol J       Date:  2020-01-22       Impact factor: 7.271

9.  Structures and polymorphic interactions of two heptad-repeat regions of the SARS virus S2 protein.

Authors:  Yiqun Deng; Jie Liu; Qi Zheng; Wei Yong; Min Lu
Journal:  Structure       Date:  2006-05       Impact factor: 5.006

10.  Expanding Canonical Spider Silk Properties through a DNA Combinatorial Approach.

Authors:  Zaroug Jaleel; Shun Zhou; Zaira Martín-Moldes; Lauren M Baugh; Jonathan Yeh; Nina Dinjaski; Laura T Brown; Jessica E Garb; David L Kaplan
Journal:  Materials (Basel)       Date:  2020-08-14       Impact factor: 3.748

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.