Literature DB >> 17177203

Achieving 80% ten-fold cross-validated accuracy for secondary structure prediction by large-scale training.

Ofer Dor1, Yaoqi Zhou.   

Abstract

An integrated system of neural networks, called SPINE, is established and optimized for predicting structural properties of proteins. SPINE is applied to three-state secondary-structure and residue-solvent-accessibility (RSA) prediction in this paper. The integrated neural networks are carefully trained with a large dataset of 2640 chains, sequence profiles generated from multiple sequence alignment, representative amino acid properties, a slow learning rate, overfitting protection, and an optimized sliding-widow size. More than 200,000 weights in SPINE are optimized by maximizing the accuracy measured by Q(3) (the percentage of correctly classified residues). SPINE yields a 10-fold cross-validated accuracy of 79.5% (80.0% for chains of length between 50 and 300) in secondary-structure prediction after one-month (CPU time) training on 22 processors. An accuracy of 87.5% is achieved for exposed residues (RSA >95%). The latter approaches the theoretical upper limit of 88-90% accuracy in assigning secondary structures. An accuracy of 73% for three-state solvent-accessibility prediction (25%/75% cutoff) and 79.3% for two-state prediction (25% cutoff) is also obtained. (c) 2006 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2007        PMID: 17177203     DOI: 10.1002/prot.21298

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  40 in total

1.  Improving computational protein design by using structure-derived sequence profile.

Authors:  Liang Dai; Yuedong Yang; Hyung Rae Kim; Yaoqi Zhou
Journal:  Proteins       Date:  2010-08-01

2.  Mimicking the folding pathway to improve homology-free protein structure prediction.

Authors:  Joe DeBartolo; Andrés Colubri; Abhishek K Jha; James E Fitzgerald; Karl F Freed; Tobin R Sosnick
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-23       Impact factor: 11.205

3.  Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by guided-learning through a two-layer neural network.

Authors:  Eshel Faraggi; Bin Xue; Yaoqi Zhou
Journal:  Proteins       Date:  2009-03

4.  Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates.

Authors:  Yuedong Yang; Eshel Faraggi; Huiying Zhao; Yaoqi Zhou
Journal:  Bioinformatics       Date:  2011-06-11       Impact factor: 6.937

5.  A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction.

Authors:  Matt Spencer; Jesse Eickholt
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2014-08-07       Impact factor: 3.710

6.  SPINE-D: accurate prediction of short and long disordered regions by a single neural-network based method.

Authors:  Tuo Zhang; Eshel Faraggi; Bin Xue; A Keith Dunker; Vladimir N Uversky; Yaoqi Zhou
Journal:  J Biomol Struct Dyn       Date:  2012

7.  Fluctuations of backbone torsion angles obtained from NMR-determined structures and their prediction.

Authors:  Tuo Zhang; Eshel Faraggi; Yaoqi Zhou
Journal:  Proteins       Date:  2010-12

8.  Fragment-free approach to protein folding using conditional neural fields.

Authors:  Feng Zhao; Jian Peng; Jinbo Xu
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

9.  SPOT-Seq-RNA: predicting protein-RNA complex structure and RNA-binding function by fold recognition and binding affinity prediction.

Authors:  Yuedong Yang; Huiying Zhao; Jihua Wang; Yaoqi Zhou
Journal:  Methods Mol Biol       Date:  2014

10.  A generic method for assignment of reliability scores applied to solvent accessibility predictions.

Authors:  Bent Petersen; Thomas Nordahl Petersen; Pernille Andersen; Morten Nielsen; Claus Lundegaard
Journal:  BMC Struct Biol       Date:  2009-07-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.