Literature DB >> 22434533

Sann: solvent accessibility prediction of proteins by nearest neighbor method.

Keehyoung Joo1, Sung Jong Lee, Jooyoung Lee.   

Abstract

We present a method to predict the solvent accessibility of proteins which is based on a nearest neighbor method applied to the sequence profiles. Using the method, continuous real-value prediction as well as two-state and three-state discrete predictions can be obtained. The method utilizes the z-score value of the distance measure in the feature vector space to estimate the relative contribution among the k-nearest neighbors for prediction of the discrete and continuous solvent accessibility. The Solvent accessibility database is constructed from 5717 proteins extracted from PISCES culling server with the cutoff of 25% sequence identities. Using optimal parameters, the prediction accuracies (for discrete predictions) of 78.38% (two-state prediction with the threshold of 25%), 65.1% (three-state prediction with the thresholds of 9 and 36%), and the Pearson correlation coefficient (between the predicted and true RSA's for continuous prediction) of 0.676 are achieved An independent benchmark test was performed with the CASP8 targets where we find that the proposed method outperforms existing methods. The prediction accuracies are 80.89% (for two state prediction with the threshold of 25%), 67.58% (three-state prediction), and the Pearson correlation coefficient of 0.727 (for continuous prediction) with mean absolute error of 0.148. We have also investigated the effect of increasing database sizes on the prediction accuracy, where additional improvement in the accuracy is observed as the database size increases. The SANN web server is available at http://lee.kias.re.kr/~newton/sann/.
Copyright © 2012 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22434533     DOI: 10.1002/prot.24074

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  24 in total

1.  Prediction of Protein-Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures.

Authors:  Guang-Hui Liu; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-11-12       Impact factor: 1.843

2.  Inadequate Reference Datasets Biased toward Short Non-epitopes Confound B-cell Epitope Prediction.

Authors:  Kh Shamsur Rahman; Erfan Ullah Chowdhury; Konrad Sachse; Bernhard Kaltenboeck
Journal:  J Biol Chem       Date:  2016-05-09       Impact factor: 5.157

3.  ATPbind: Accurate Protein-ATP Binding Site Prediction by Combining Sequence-Profiling and Structure-Based Comparisons.

Authors:  Jun Hu; Yang Li; Yang Zhang; Dong-Jun Yu
Journal:  J Chem Inf Model       Date:  2018-02-08       Impact factor: 4.956

4.  CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields.

Authors:  Sung Jong Lee; Keehyoung Joo; Sangjin Sim; Juyong Lee; In-Ho Lee; Jooyoung Lee
Journal:  Molecules       Date:  2022-06-09       Impact factor: 4.927

5.  Semirational Directed Evolution of Loop Regions in Aspergillus japonicus β-Fructofuranosidase for Improved Fructooligosaccharide Production.

Authors:  K M Trollope; J F Görgens; H Volschenk
Journal:  Appl Environ Microbiol       Date:  2015-08-07       Impact factor: 4.792

Review 6.  Current advances in biopharmaceutical informatics: guidelines, impact and challenges in the computational developability assessment of antibody therapeutics.

Authors:  Rahul Khetan; Robin Curtis; Charlotte M Deane; Johannes Thorling Hadsund; Uddipan Kar; Konrad Krawczyk; Daisuke Kuroda; Sarah A Robinson; Pietro Sormanni; Kouhei Tsumoto; Jim Warwicker; Andrew C R Martin
Journal:  MAbs       Date:  2022 Jan-Dec       Impact factor: 5.857

7.  Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.

Authors:  Fang Ge; Ying Zhang; Jian Xu; Arif Muhammad; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

8.  Sigma-RF: prediction of the variability of spatial restraints in template-based modeling by random forest.

Authors:  Juyong Lee; Kiho Lee; InSuk Joung; Keehyoung Joo; Bernard R Brooks; Jooyoung Lee
Journal:  BMC Bioinformatics       Date:  2015-03-21       Impact factor: 3.169

9.  AcconPred: Predicting Solvent Accessibility and Contact Number Simultaneously by a Multitask Learning Framework under the Conditional Neural Fields Model.

Authors:  Jianzhu Ma; Sheng Wang
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411

10.  Logistic regression models to predict solvent accessible residues using sequence- and homology-based qualitative and quantitative descriptors applied to a domain-complete X-ray structure learning set.

Authors:  Reecha Nepal; Joanna Spencer; Guneet Bhogal; Amulya Nedunuri; Thomas Poelman; Thejas Kamath; Edwin Chung; Katherine Kantardjieff; Andrea Gottlieb; Brooke Lustig
Journal:  J Appl Crystallogr       Date:  2015-11-10       Impact factor: 3.304

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.