Literature DB >> 11933061

Prediction of coordination number and relative solvent accessibility in proteins.

Gianluca Pollastri1, Pierre Baldi, Pietro Fariselli, Rita Casadio.   

Abstract

Knowing the coordination number and relative solvent accessibility of all the residues in a protein is crucial for deriving constraints useful in modeling protein folding and protein structure and in scoring remote homology searches. We develop ensembles of bidirectional recurrent neural network architectures to improve the state of the art in both contact and accessibility prediction, leveraging a large corpus of curated data together with evolutionary information. The ensembles are used to discriminate between two different states of residue contacts or relative solvent accessibility, higher or lower than a threshold determined by the average value of the residue distribution or the accessibility cutoff. For coordination numbers, the ensemble achieves performances ranging within 70.6-73.9% depending on the radius adopted to discriminate contacts (6A-12A). These performances represent gains of 16-20% over the baseline statistical predictor, always assigning an amino acid to the largest class, and are 4-7% better than any previous method. A combination of different radius predictors further improves performance. For accessibility thresholds in the relevant 15-30% range, the ensemble consistently achieves a performance above 77%, which is 10-16% above the baseline prediction and better than other existing predictors, by up to several percentage points. For both problems, we quantify the improvement due to evolutionary information in the form of PSI-BLAST-generated profiles over BLAST profiles. The prediction programs are implemented in the form of two web servers, CONpro and ACCpro, available at http://promoter.ics.uci.edu/BRNN-PRED/. Copyright 2002 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2002        PMID: 11933061     DOI: 10.1002/prot.10069

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  75 in total

1.  MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts.

Authors:  Xin Deng; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2011-12-14       Impact factor: 3.169

2.  COBEpro: a novel system for predicting continuous B-cell epitopes.

Authors:  Michael J Sweredoski; Pierre Baldi
Journal:  Protein Eng Des Sel       Date:  2008-12-10       Impact factor: 1.650

3.  Accessible surface area from NMR chemical shifts.

Authors:  Noor E Hafsa; David Arndt; David S Wishart
Journal:  J Biomol NMR       Date:  2015-06-16       Impact factor: 2.835

4.  svmPRAT: SVM-based protein residue annotation toolkit.

Authors:  Huzefa Rangwala; Christopher Kauffman; George Karypis
Journal:  BMC Bioinformatics       Date:  2009-12-22       Impact factor: 3.169

5.  Analysis and prediction of the metabolic stability of proteins based on their sequential features, subcellular locations and interaction networks.

Authors:  Tao Huang; Xiao-He Shi; Ping Wang; Zhisong He; Kai-Yan Feng; Lele Hu; Xiangyin Kong; Yi-Xue Li; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2010-06-04       Impact factor: 3.240

6.  SELECTpro: effective protein model selection using a structure-based energy function resistant to BLUNDERs.

Authors:  Arlo Randall; Pierre Baldi
Journal:  BMC Struct Biol       Date:  2008-12-03

7.  A generic method for assignment of reliability scores applied to solvent accessibility predictions.

Authors:  Bent Petersen; Thomas Nordahl Petersen; Pernille Andersen; Morten Nielsen; Claus Lundegaard
Journal:  BMC Struct Biol       Date:  2009-07-31

8.  Characterization of non-trivial neighborhood fold constraints from protein sequences using generalized topohydrophobicity.

Authors:  Guillaume Fourty; Isabelle Callebaut; Jean-Paul Mornon
Journal:  Bioinform Biol Insights       Date:  2008-01-31

9.  A modular kernel approach for integrative analysis of protein domain boundaries.

Authors:  Paul D Yoo; Bing Bing Zhou; Albert Y Zomaya
Journal:  BMC Genomics       Date:  2009-12-03       Impact factor: 3.969

10.  Ab initio and homology based prediction of protein domains by recursive neural networks.

Authors:  Ian Walsh; Alberto J M Martin; Catherine Mooney; Enrico Rubagotti; Alessandro Vullo; Gianluca Pollastri
Journal:  BMC Bioinformatics       Date:  2009-06-26       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.