Literature DB >> 24264942

A global machine learning based scoring function for protein structure prediction.

Eshel Faraggi1, Andrzej Kloczkowski.   

Abstract

We present a knowledge-based function to score protein decoys based on their similarity to native structure. A set of features is constructed to describe the structure and sequence of the entire protein chain. Furthermore, a qualitative relationship is established between the calculated features and the underlying electromagnetic interaction that dominates this scale. The features we use are associated with residue-residue distances, residue-solvent distances, pairwise knowledge-based potentials and a four-body potential. In addition, we introduce a new target to be predicted, the fitness score, which measures the similarity of a model to the native structure. This new approach enables us to obtain information both from decoys and from native structures. It is also devoid of previous problems associated with knowledge-based potentials. These features were obtained for a large set of native and decoy structures and a back-propagating neural network was trained to predict the fitness score. Overall this new scoring potential proved to be superior to the knowledge-based scoring functions used as its inputs. In particular, in the latest CASP (CASP10) experiment our method was ranked third for all targets, and second for freely modeled hard targets among about 200 groups for top model prediction. Ours was the only method ranked in the top three for all targets and for hard targets. This shows that initial results from the novel approach are able to capture details that were missed by a broad spectrum of protein structure prediction approaches. Source codes and executable from this work are freely available at http://mathmed.org/#Software and http://mamiris.com/.
Copyright © 2013 Wiley Periodicals, Inc.

Entities:  

Keywords:  global features; neural network; protein knowledge potentials; protein potential energy; protein scoring functions; tertiary protein structure

Mesh:

Substances:

Year:  2013        PMID: 24264942     DOI: 10.1002/prot.24454

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  9 in total

1.  GENN: a GEneral Neural Network for learning tabulated data with examples from protein structure prediction.

Authors:  Eshel Faraggi; Andrzej Kloczkowski
Journal:  Methods Mol Biol       Date:  2015

2.  A Hybrid Levenberg-Marquardt Algorithm on a Recursive Neural Network for Scoring Protein Models.

Authors:  Eshel Faraggi; Robert L Jernigan; Andrzej Kloczkowski
Journal:  Methods Mol Biol       Date:  2021

3.  Accurate single-sequence prediction of solvent accessible surface area using local and global features.

Authors:  Eshel Faraggi; Yaoqi Zhou; Andrzej Kloczkowski
Journal:  Proteins       Date:  2014-09-25

4.  Classifying kinase conformations using a machine learning approach.

Authors:  Daniel Ian McSkimming; Khaled Rasheed; Natarajan Kannan
Journal:  BMC Bioinformatics       Date:  2017-02-02       Impact factor: 3.169

5.  From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-Free Protein Structure Prediction.

Authors:  Nasrin Akhter; Amarda Shehu
Journal:  Molecules       Date:  2018-01-19       Impact factor: 4.411

6.  Reoptimized UNRES Potential for Protein Model Quality Assessment.

Authors:  Eshel Faraggi; Pawel Krupa; Magdalena A Mozolewska; Adam Liwo; Andrzej Kloczkowski
Journal:  Genes (Basel)       Date:  2018-12-03       Impact factor: 4.096

7.  Uncertainty reduction in biochemical kinetic models: Enforcing desired model properties.

Authors:  Ljubisa Miskovic; Jonas Béal; Michael Moret; Vassily Hatzimanikatis
Journal:  PLoS Comput Biol       Date:  2019-08-20       Impact factor: 4.475

8.  Estimation of model accuracy by a unique set of features and tree-based regressor.

Authors:  Mor Bitton; Chen Keasar
Journal:  Sci Rep       Date:  2022-08-18       Impact factor: 4.996

9.  An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12.

Authors:  Chen Keasar; Liam J McGuffin; Björn Wallner; Gaurav Chopra; Badri Adhikari; Debswapna Bhattacharya; Lauren Blake; Leandro Oliveira Bortot; Renzhi Cao; B K Dhanasekaran; Itzhel Dimas; Rodrigo Antonio Faccioli; Eshel Faraggi; Robert Ganzynkowicz; Sambit Ghosh; Soma Ghosh; Artur Giełdoń; Lukasz Golon; Yi He; Lim Heo; Jie Hou; Main Khan; Firas Khatib; George A Khoury; Chris Kieslich; David E Kim; Pawel Krupa; Gyu Rie Lee; Hongbo Li; Jilong Li; Agnieszka Lipska; Adam Liwo; Ali Hassan A Maghrabi; Milot Mirdita; Shokoufeh Mirzaei; Magdalena A Mozolewska; Melis Onel; Sergey Ovchinnikov; Anand Shah; Utkarsh Shah; Tomer Sidi; Adam K Sieradzan; Magdalena Ślusarz; Rafal Ślusarz; James Smadbeck; Phanourios Tamamis; Nicholas Trieber; Tomasz Wirecki; Yanping Yin; Yang Zhang; Jaume Bacardit; Maciej Baranowski; Nicholas Chapman; Seth Cooper; Alexandre Defelicibus; Jeff Flatten; Brian Koepnick; Zoran Popović; Bartlomiej Zaborowski; David Baker; Jianlin Cheng; Cezary Czaplewski; Alexandre Cláudio Botazzo Delbem; Christodoulos Floudas; Andrzej Kloczkowski; Stanislaw Ołdziej; Michael Levitt; Harold Scheraga; Chaok Seok; Johannes Söding; Saraswathi Vishveshwara; Dong Xu; Silvia N Crivelli
Journal:  Sci Rep       Date:  2018-07-02       Impact factor: 4.379

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.