Literature DB >> 16106377

Real value prediction of solvent accessibility in proteins using multiple sequence alignment and secondary structure.

Aarti Garg1, Harpreet Kaur, G P S Raghava.   

Abstract

The present study is an attempt to develop a neural network-based method for predicting the real value of solvent accessibility from the sequence using evolutionary information in the form of multiple sequence alignment. In this method, two feed-forward networks with a single hidden layer have been trained with standard back-propagation as a learning algorithm. The Pearson's correlation coefficient increases from 0.53 to 0.63, and mean absolute error decreases from 18.2 to 16% when multiple-sequence alignment obtained from PSI-BLAST is used as input instead of a single sequence. The performance of the method further improves from a correlation coefficient of 0.63 to 0.67 when secondary structure information predicted by PSIPRED is incorporated in the prediction. The final network yields a mean absolute error value of 15.2% between the experimental and predicted values, when tested on two different nonhomologous and nonredundant datasets of varying sizes. The method consists of two steps: (1) in the first step, a sequence-to-structure network is trained with the multiple alignment profiles in the form of PSI-BLAST-generated position-specific scoring matrices, and (2) in the second step, the output obtained from the first network and PSIPRED-predicted secondary structure information is used as an input to the second structure-to-structure network. Based on the present study, a server SARpred (http://www.imtech.res.in/raghava/sarpred/) has been developed that predicts the real value of solvent accessibility of residues for a given protein sequence. We have also evaluated the performance of SARpred on 47 proteins used in CASP6 and achieved a correlation coefficient of 0.68 and a MAE of 15.9% between predicted and observed values. Copyright 2005 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16106377     DOI: 10.1002/prot.20630

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  31 in total

1.  Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by guided-learning through a two-layer neural network.

Authors:  Eshel Faraggi; Bin Xue; Yaoqi Zhou
Journal:  Proteins       Date:  2009-03

2.  Combining sequence and structural profiles for protein solvent accessibility prediction.

Authors:  Rajkumar Bondugula; Dong Xu
Journal:  Comput Syst Bioinformatics Conf       Date:  2008

3.  Accessible surface area from NMR chemical shifts.

Authors:  Noor E Hafsa; David Arndt; David S Wishart
Journal:  J Biomol NMR       Date:  2015-06-16       Impact factor: 2.835

4.  Epitope Mapping of Rhi o 1 and Generation of a Hypoallergenic Variant: A CANDIDATE MOLECULE FOR FUNGAL ALLERGY VACCINES.

Authors:  Gaurab Sircar; Kuladip Jana; Angira Dasgupta; Sudipto Saha; Swati Gupta Bhattacharya
Journal:  J Biol Chem       Date:  2016-06-28       Impact factor: 5.157

5.  Cytotoxic T lymphocyte antigen 4 (CTLA4) gene polymorphism with bladder cancer risk in North Indian population.

Authors:  Praveen Kumar Jaiswal; Vibha Singh; Rama Devi Mittal
Journal:  Mol Biol Rep       Date:  2014-01-04       Impact factor: 2.316

6.  Identification of NAD interacting residues in proteins.

Authors:  Hifzur R Ansari; Gajendra P S Raghava
Journal:  BMC Bioinformatics       Date:  2010-03-30       Impact factor: 3.169

7.  Real value prediction of protein solvent accessibility using enhanced PSSM features.

Authors:  Darby Tien-Hao Chang; Hsuan-Yu Huang; Yu-Tang Syu; Chih-Peng Wu
Journal:  BMC Bioinformatics       Date:  2008-12-12       Impact factor: 3.169

8.  ESLpred2: improved method for predicting subcellular localization of eukaryotic proteins.

Authors:  Aarti Garg; Gajendra P S Raghava
Journal:  BMC Bioinformatics       Date:  2008-11-28       Impact factor: 3.169

9.  Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.

Authors:  Ce Zheng; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2008-10-10       Impact factor: 3.169

10.  A generic method for assignment of reliability scores applied to solvent accessibility predictions.

Authors:  Bent Petersen; Thomas Nordahl Petersen; Pernille Andersen; Morten Nielsen; Claus Lundegaard
Journal:  BMC Struct Biol       Date:  2009-07-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.