Literature DB >> 22072531

SIDEpro: a novel machine learning approach for the fast and accurate prediction of side-chain conformations.

Ken Nagata1, Arlo Randall, Pierre Baldi.   

Abstract

Accurate protein side-chain conformation prediction is crucial for protein modeling and existing methods for the task are widely used; however, faster and more accurate methods are still required. Here we present a new machine learning approach to the problem where an energy function for each rotamer in a structure is computed additively over pairs of contacting atoms. A family of 156 neural networks indexed by amino acid and contacting atom types is used to compute these rotamer energies as a function of atomic contact distances. Although direct energy targets are not available for training, the neural networks can still be optimized by converting the energies to probabilities and optimizing these probabilities using Markov Chain Monte Carlo methods. The resulting predictor SIDEpro makes predictions by initially setting the rotamer probabilities for each residue from a backbone-dependent rotamer library, then iteratively updating these probabilities using the trained neural networks. After convergences of the probabilities, the side-chains are set to the highest probability rotamer. Finally, a post processing clash reduction step is applied to the models. SIDEpro represents a significant improvement in speed and a modest, but statistically significant, improvement in accuracy when compared with the state-of-the-art for rapid side-chain prediction method SCWRL4 on the following datasets: (1) 379 protein test set of SCWRL4; (2) 94 proteins from CASP9; (3) a set of seven large protein-only complexes; and (4) a ribosome with and without the RNA. Using the SCWRL4 test set, SIDEpro's accuracy (χ(1) 86.14%, χ(1+2) 74.15%) is slightly better than SCWRL4-FRM (χ(1) 85.43%, χ(1+2) 73.47%) and it is 7.0 times faster. On the same test set SIDEpro is clearly more accurate than SCWRL4-rigid rotamer model (RRM) (χ(1) 84.15%, χ(1+2) 71.24%) and 2.4 times faster. Evaluation on the additional test sets yield similar accuracy results with SIDEpro being slightly more accurate than SCWRL4-flexible rotamer model (FRM) and clearly more accurate than SCWRL4-RRM; however, the gap in CPU time is much more significant when the methods are applied to large protein complexes. SIDEpro is part of the SCRATCH suite of predictors and available from: http://scratch.proteomics.ics.uci.edu/.
Copyright © 2011 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22072531      PMCID: PMC3240718          DOI: 10.1002/prot.23170

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  23 in total

1.  Improved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins.

Authors:  K T Simons; I Ruczinski; C Kooperberg; B A Fox; C Bystroff; D Baker
Journal:  Proteins       Date:  1999-01-01

2.  TOUCHSTONE II: a new approach to ab initio protein structure prediction.

Authors:  Yang Zhang; Andrzej Kolinski; Jeffrey Skolnick
Journal:  Biophys J       Date:  2003-08       Impact factor: 4.033

3.  A graph-theory algorithm for rapid protein side-chain prediction.

Authors:  Adrian A Canutescu; Andrew A Shelenkov; Roland L Dunbrack
Journal:  Protein Sci       Date:  2003-09       Impact factor: 6.725

4.  Bayesian statistical analysis of protein side-chain rotamer preferences.

Authors:  R L Dunbrack; F E Cohen
Journal:  Protein Sci       Date:  1997-08       Impact factor: 6.725

5.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool.

Authors:  M J Bower; F E Cohen; R L Dunbrack
Journal:  J Mol Biol       Date:  1997-04-18       Impact factor: 5.469

6.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction.

Authors:  R L Dunbrack; M Karplus
Journal:  J Mol Biol       Date:  1993-03-20       Impact factor: 5.469

7.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy.

Authors:  P Koehl; M Delarue
Journal:  J Mol Biol       Date:  1994-06-03       Impact factor: 5.469

8.  Conformational analysis of the backbone-dependent rotamer preferences of protein sidechains.

Authors:  R L Dunbrack; M Karplus
Journal:  Nat Struct Biol       Date:  1994-05

9.  Efficient rotamer elimination applied to protein side-chains and related spin glasses.

Authors:  R F Goldstein
Journal:  Biophys J       Date:  1994-05       Impact factor: 4.033

10.  Modeling structurally variable regions in homologous proteins with rosetta.

Authors:  Carol A Rohl; Charlie E M Strauss; Dylan Chivian; David Baker
Journal:  Proteins       Date:  2004-05-15
View more
  12 in total

1.  Biallelic Variants in the Nuclear Pore Complex Protein NUP93 Are Associated with Non-progressive Congenital Ataxia.

Authors:  Ginevra Zanni; P De Magistris; M Nardella; E Bellacchio; S Barresi; A Sferra; A Ciolfi; M Motta; H Lue; D Moreno-Andres; M Tartaglia; E Bertini; Wolfram Antonin
Journal:  Cerebellum       Date:  2019-06       Impact factor: 3.847

2.  Deep architectures for protein contact map prediction.

Authors:  Pietro Di Lena; Ken Nagata; Pierre Baldi
Journal:  Bioinformatics       Date:  2012-07-30       Impact factor: 6.937

3.  Incorporating post-translational modifications and unnatural amino acids into high-throughput modeling of protein structures.

Authors:  Ken Nagata; Arlo Randall; Pierre Baldi
Journal:  Bioinformatics       Date:  2014-02-25       Impact factor: 6.937

4.  Assessment of protein side-chain conformation prediction methods in different residue environments.

Authors:  Lenna X Peterson; Xuejiao Kang; Daisuke Kihara
Journal:  Proteins       Date:  2014-03-31

5.  OPUS-Rota4: a gradient-based protein side-chain modeling framework assisted by deep learning-based predictors.

Authors:  Gang Xu; Qinghua Wang; Jianpeng Ma
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

6.  Side-chain Packing Using SE(3)-Transformer.

Authors:  Akhil Jindal; Sergei Kotelnikov; Dzmitry Padhorny; Dima Kozakov; Yimin Zhu; Rezaul Chowdhury; Sandor Vajda
Journal:  Pac Symp Biocomput       Date:  2022

7.  Novel Homozygous KCNJ10 Mutation in a Patient with Non-syndromic Early-Onset Cerebellar Ataxia.

Authors:  Francesco Nicita; Giorgio Tasca; Marta Nardella; Emanuele Bellacchio; Ilaria Camponeschi; Gessica Vasco; Tommaso Schirinzi; Enrico Bertini; Ginevra Zanni
Journal:  Cerebellum       Date:  2018-08       Impact factor: 3.847

8.  Enhancing human spermine synthase activity by engineered mutations.

Authors:  Zhe Zhang; Yueli Zheng; Margo Petukh; Anthony Pegg; Yoshihiko Ikeguchi; Emil Alexov
Journal:  PLoS Comput Biol       Date:  2013-02-28       Impact factor: 4.475

9.  PON-SC - program for identifying steric clashes caused by amino acid substitutions.

Authors:  Jelena Čalyševa; Mauno Vihinen
Journal:  BMC Bioinformatics       Date:  2017-11-29       Impact factor: 3.169

10.  Expanding the clinical phenotype of IARS2-related mitochondrial disease.

Authors:  Barbara Vona; Reza Maroofian; Emanuele Bellacchio; Maryam Najafi; Kyle Thompson; Ahmad Alahmad; Langping He; Najmeh Ahangari; Abolfazl Rad; Sima Shahrokhzadeh; Paulina Bahena; Falk Mittag; Frank Traub; Jebrail Movaffagh; Nafise Amiri; Mohammad Doosti; Reza Boostani; Ebrahim Shirzadeh; Thomas Haaf; Daria Diodato; Miriam Schmidts; Robert W Taylor; Ehsan Ghayoor Karimiani
Journal:  BMC Med Genet       Date:  2018-11-12       Impact factor: 2.103

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.