Literature DB >> 19091011

Real value prediction of protein solvent accessibility using enhanced PSSM features.

Darby Tien-Hao Chang1, Hsuan-Yu Huang, Yu-Tang Syu, Chih-Peng Wu.   

Abstract

BACKGROUND: Prediction of protein solvent accessibility, also called accessible surface area (ASA) prediction, is an important step for tertiary structure prediction directly from one-dimensional sequences. Traditionally, predicting solvent accessibility is regarded as either a two- (exposed or buried) or three-state (exposed, intermediate or buried) classification problem. However, the states of solvent accessibility are not well-defined in real protein structures. Thus, a number of methods have been developed to directly predict the real value ASA based on evolutionary information such as position specific scoring matrix (PSSM).
RESULTS: This study enhances the PSSM-based features for real value ASA prediction by considering the physicochemical properties and solvent propensities of amino acid types. We propose a systematic method for identifying residue groups with respect to protein solvent accessibility. The amino acid columns in the PSSM profile that belong to a certain residue group are merged to generate novel features. Finally, support vector regression (SVR) is adopted to construct a real value ASA predictor. Experimental results demonstrate that the features produced by the proposed selection process are informative for ASA prediction.
CONCLUSION: Experimental results based on a widely used benchmark reveal that the proposed method performs best among several of existing packages for performing ASA prediction. Furthermore, the feature selection mechanism incorporated in this study can be applied to other regression problems using the PSSM. The program and data are available from the authors upon request.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19091011      PMCID: PMC2638152          DOI: 10.1186/1471-2105-9-S12-S12

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  32 in total

1.  Application of multiple sequence alignment profiles to improve protein secondary structure prediction.

Authors:  J A Cuff; G J Barton
Journal:  Proteins       Date:  2000-08-15

2.  Prediction of protein surface accessibility with information theory.

Authors:  H Naderi-Manesh; M Sadeghi; S Arab; A A Moosavi Movahedi
Journal:  Proteins       Date:  2001-03-01

3.  RCNPRED: prediction of the residue co-ordination numbers in proteins.

Authors:  P Fariselli; R Casadio
Journal:  Bioinformatics       Date:  2001-02       Impact factor: 6.937

4.  PredAcc: prediction of solvent accessibility.

Authors:  M H Mucchielli-Giorgi; S Hazout; P Tufféry
Journal:  Bioinformatics       Date:  1999-02       Impact factor: 6.937

5.  New method for accurate prediction of solvent accessibility from protein sequence.

Authors:  X Li; X M Pan
Journal:  Proteins       Date:  2001-01-01

6.  Predicting residue solvent accessibility from protein sequence by considering the sequence environment.

Authors:  O Carugo
Journal:  Protein Eng       Date:  2000-09

7.  Prediction of coordination number and relative solvent accessibility in proteins.

Authors:  Gianluca Pollastri; Pierre Baldi; Pietro Fariselli; Rita Casadio
Journal:  Proteins       Date:  2002-05-01

Review 8.  Getting the most from PSI-BLAST.

Authors:  David T Jones; Mark B Swindells
Journal:  Trends Biochem Sci       Date:  2002-03       Impact factor: 13.807

9.  NETASA: neural network based prediction of solvent accessibility.

Authors:  Shandar Ahmad; M Michael Gromiha
Journal:  Bioinformatics       Date:  2002-06       Impact factor: 6.937

10.  Protein disorder prediction by condensed PSSM considering propensity for order or disorder.

Authors:  Chung-Tsai Su; Chien-Yu Chen; Yu-Yen Ou
Journal:  BMC Bioinformatics       Date:  2006-06-23       Impact factor: 3.169

View more
  15 in total

1.  FEPS: A Tool for Feature Extraction from Protein Sequence.

Authors:  Hamid Ismail; Clarence White; Hussam Al-Barakati; Robert H Newman; Dukka B Kc
Journal:  Methods Mol Biol       Date:  2022

2.  Accurate Prediction of Contact Numbers for Multi-Spanning Helical Membrane Proteins.

Authors:  Bian Li; Jeffrey Mendenhall; Elizabeth Dong Nguyen; Brian E Weiner; Axel W Fischer; Jens Meiler
Journal:  J Chem Inf Model       Date:  2016-02-05       Impact factor: 4.956

3.  Predicting the protein-protein interactions using primary structures with predicted protein surface.

Authors:  Darby Tien-Hao Chang; Yu-Tang Syu; Po-Chang Lin
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

4.  In-silico prediction of disorder content using hybrid sequence representation.

Authors:  Marcin J Mizianty; Tuo Zhang; Bin Xue; Yaoqi Zhou; A Keith Dunker; Vladimir N Uversky; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2011-06-17       Impact factor: 3.169

5.  Multi-level machine learning prediction of protein-protein interactions in Saccharomyces cerevisiae.

Authors:  Julian Zubek; Marcin Tatjewski; Adam Boniecki; Maciej Mnich; Subhadip Basu; Dariusz Plewczynski
Journal:  PeerJ       Date:  2015-07-02       Impact factor: 2.984

6.  Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme.

Authors:  Jian Zhang; Wenhan Chen; Pingping Sun; Xiaowei Zhao; Zhiqiang Ma
Journal:  BioData Min       Date:  2015-01-31       Impact factor: 2.522

7.  3PFDB--a database of best representative PSSM profiles (BRPs) of protein families generated using a novel data mining approach.

Authors:  Khader Shameer; Paramasivam Nagarajan; Kumar Gaurav; Ramanathan Sowdhamini
Journal:  BioData Min       Date:  2009-12-04       Impact factor: 2.522

8.  Predicting secretory proteins of malaria parasite by incorporating sequence evolution information into pseudo amino acid composition via grey system model.

Authors:  Wei-Zhong Lin; Jian-An Fang; Xuan Xiao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-11-26       Impact factor: 3.240

9.  Emerging strengths in Asia Pacific bioinformatics.

Authors:  Shoba Ranganathan; Wen-Lian Hsu; Ueng-Cheng Yang; Tin Wee Tan
Journal:  BMC Bioinformatics       Date:  2008-12-12       Impact factor: 3.169

10.  PredRSA: a gradient boosted regression trees approach for predicting protein solvent accessibility.

Authors:  Chao Fan; Diwei Liu; Rui Huang; Zhigang Chen; Lei Deng
Journal:  BMC Bioinformatics       Date:  2016-01-11       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.