Literature DB >> 18174181

Prediction of protein functional residues from sequence by probability density estimation.

J D Fischer1, C E Mayer, J Söding.   

Abstract

MOTIVATION: The prediction of ligand-binding residues or catalytically active residues of a protein may give important hints that can guide further genetic or biochemical studies. Existing sequence-based prediction methods mostly rank residue positions by evolutionary conservation calculated from a multiple sequence alignment of homologs. A problem hampering more wide-spread application of these methods is the low per-residue precision, which at 20% sensitivity is around 35% for ligand-binding residues and 20% for catalytic residues.
RESULTS: We combine information from the conservation at each site, its amino acid distribution, as well as its predicted secondary structure (ss) and relative solvent accessibility (rsa). First, we measure conservation by how much the amino acid distribution at each site differs from the distribution expected for the predicted ss and rsa states. Second, we include the conservation of neighboring residues in a weighted linear score by analytically optimizing the signal-to-noise ratio of the total score. Third, we use conditional probability density estimation to calculate the probability of each site to be functional given its conservation, the observed amino acid distribution, and the predicted ss and rsa states. We have constructed two large data sets, one based on the Catalytic Site Atlas and the other on PDB SITE records, to benchmark methods for predicting functional residues. The new method FRcons predicts ligand-binding and catalytic residues with higher precision than alternative methods over the entire sensitivity range, reaching 50% and 40% precision at 20% sensitivity, respectively. AVAILABILITY: Server: http://frpred.tuebingen.mpg.de. Data sets: ftp://ftp.tuebingen.mpg.de/pub/protevo/FRpred/.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18174181     DOI: 10.1093/bioinformatics/btm626

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  62 in total

1.  Initial Cluster Analysis.

Authors:  Stephen F Altschul; Andrew F Neuwald
Journal:  J Comput Biol       Date:  2017-08-03       Impact factor: 1.479

2.  Assessment of ligand binding site predictions in CASP10.

Authors:  Tiziano Gallo Cassarino; Lorenza Bordoli; Torsten Schwede
Journal:  Proteins       Date:  2014-02

3.  LIBRUS: combined machine learning and homology information for sequence-based ligand-binding residue prediction.

Authors:  Chris Kauffman; George Karypis
Journal:  Bioinformatics       Date:  2009-09-28       Impact factor: 6.937

4.  HHsvm: fast and accurate classification of profile-profile matches identified by HHsearch.

Authors:  Mensur Dlakić
Journal:  Bioinformatics       Date:  2009-09-22       Impact factor: 6.937

5.  Evolutionary approach to predicting the binding site residues of a protein from its primary sequence.

Authors:  Yan Yuan Tseng; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2011-03-14       Impact factor: 11.205

6.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

7.  Bridge helix and trigger loop perturbations generate superactive RNA polymerases.

Authors:  Lin Tan; Simone Wiesler; Dominika Trzaska; Hannah C Carney; Robert O J Weinzierl
Journal:  J Biol       Date:  2008-12-02

8.  Predicting conserved protein motifs with Sub-HMMs.

Authors:  Kevin Horan; Christian R Shelton; Thomas Girke
Journal:  BMC Bioinformatics       Date:  2010-04-26       Impact factor: 3.169

9.  Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

Authors:  John A Capra; Roman A Laskowski; Janet M Thornton; Mona Singh; Thomas A Funkhouser
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

10.  SitesIdentify: a protein functional site prediction tool.

Authors:  Tracey Bray; Pedro Chan; Salim Bougouffa; Richard Greaves; Andrew J Doig; Jim Warwicker
Journal:  BMC Bioinformatics       Date:  2009-11-18       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.