Literature DB >> 34213059

SidechainNet: An all-atom protein structure dataset for machine learning.

Jonathan Edward King1, David Ryan Koes2.   

Abstract

Despite recent advancements in deep learning methods for protein structure prediction and representation, little focus has been directed at the simultaneous inclusion and prediction of protein backbone and sidechain structure information. We present SidechainNet, a new dataset that directly extends the ProteinNet dataset. SidechainNet includes angle and atomic coordinate information capable of describing all heavy atoms of each protein structure and can be extended by users to include new protein structures as they are released. In this article, we provide background information on the availability of protein structure data and the significance of ProteinNet. Thereafter, we argue for the potentially beneficial inclusion of sidechain information through SidechainNet, describe the process by which we organize SidechainNet, and provide a software package (https://github.com/jonathanking/sidechainnet) for data manipulation and training with machine learning models.
© 2021 Wiley Periodicals LLC.

Entities:  

Keywords:  dataset; deep learning; machine learning; protein structure; proteins; software

Mesh:

Substances:

Year:  2021        PMID: 34213059      PMCID: PMC8492522          DOI: 10.1002/prot.26169

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  24 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Amber biomolecular simulation programs.

Authors:  David A Case; Thomas E Cheatham; Tom Darden; Holger Gohlke; Ray Luo; Kenneth M Merz; Alexey Onufriev; Carlos Simmerling; Bing Wang; Robert J Woods
Journal:  J Comput Chem       Date:  2005-12       Impact factor: 3.376

3.  Structure prediction for CASP8 with all-atom refinement using Rosetta.

Authors:  Srivatsan Raman; Robert Vernon; James Thompson; Michael Tyka; Ruslan Sadreyev; Jimin Pei; David Kim; Elizabeth Kellogg; Frank DiMaio; Oliver Lange; Lisa Kinch; Will Sheffler; Bong-Hyun Kim; Rhiju Das; Nick V Grishin; David Baker
Journal:  Proteins       Date:  2009

4.  Distance-based protein folding powered by deep learning.

Authors:  Jinbo Xu
Journal:  Proc Natl Acad Sci U S A       Date:  2019-08-09       Impact factor: 11.205

5.  ff19SB: Amino-Acid-Specific Protein Backbone Parameters Trained against Quantum Mechanics Energy Surfaces in Solution.

Authors:  Chuan Tian; Koushik Kasavajhala; Kellon A A Belfon; Lauren Raguette; He Huang; Angela N Migues; John Bickel; Yuzhang Wang; Jorge Pincay; Qin Wu; Carlos Simmerling
Journal:  J Chem Theory Comput       Date:  2019-12-03       Impact factor: 6.006

6.  A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions.

Authors:  Maxim V Shapovalov; Roland L Dunbrack
Journal:  Structure       Date:  2011-06-08       Impact factor: 5.006

7.  End-to-End Differentiable Learning of Protein Structure.

Authors:  Mohammed AlQuraishi
Journal:  Cell Syst       Date:  2019-04-17       Impact factor: 10.304

8.  Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.

Authors:  Michael M Mysinger; Michael Carchia; John J Irwin; Brian K Shoichet
Journal:  J Med Chem       Date:  2012-07-05       Impact factor: 7.446

9.  UniProt: a worldwide hub of protein knowledge.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  ProteinNet: a standardized data set for machine learning of protein structure.

Authors:  Mohammed AlQuraishi
Journal:  BMC Bioinformatics       Date:  2019-06-11       Impact factor: 3.169

View more
  1 in total

Review 1.  Protein Function Analysis through Machine Learning.

Authors:  Chris Avery; John Patterson; Tyler Grear; Theodore Frater; Donald J Jacobs
Journal:  Biomolecules       Date:  2022-09-06
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.