Literature DB >> 18024975

Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting.

Kai Ye1, K Anton Feenstra, Jaap Heringa, Adriaan P Ijzerman, Elena Marchiori.   

Abstract

MOTIVATION: Identification of residues that account for protein function specificity is crucial, not only for understanding the nature of functional specificity, but also for protein engineering experiments aimed at switching the specificity of an enzyme, regulator or transporter. Available algorithms generally use multiple sequence alignments to identify residue positions conserved within subfamilies but divergent in between. However, many biological examples show a much subtler picture than simple intra-group conservation versus inter-group divergence.
RESULTS: We present multi-RELIEF, a novel approach for identifying specificity residues that is based on RELIEF, a state-of-the-art Machine-Learning technique for feature weighting. It estimates the expected 'local' functional specificity of residues from an alignment divided in multiple classes. Optionally, 3D structure information is exploited by increasing the weight of residues that have high-weight neighbors. Using ROC curves over a large body of experimental reference data, we show that (a) multi-RELIEF identifies specificity residues for the seven test sets used, (b) incorporating structural information improves prediction for specificity of interaction with small molecules and (c) comparison of multi-RELIEF with four other state-of-the-art algorithms indicates its robustness and best overall performance. AVAILABILITY: A web-server implementation of multi-RELIEF is available at www.ibi.vu.nl/programs/multirelief. Matlab source code of the algorithm and data sets are available on request for academic use.

Mesh:

Substances:

Year:  2007        PMID: 18024975     DOI: 10.1093/bioinformatics/btm537

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  41 in total

1.  Surveying the manifold divergence of an entire protein class for statistical clues to underlying biochemical mechanisms.

Authors:  Andrew F Neuwald
Journal:  Stat Appl Genet Mol Biol       Date:  2011-08-04

2.  EALab (Eye Activity Lab): a MATLAB Toolbox for Variable Extraction, Multivariate Analysis and Classification of Eye-Movement Data.

Authors:  Javier Andreu-Perez; Celine Solnais; Kumuthan Sriskandarajah
Journal:  Neuroinformatics       Date:  2016-01

Review 3.  Engineering the acyltransferase substrate specificity of assembly line polyketide synthases.

Authors:  Briana J Dunn; Chaitan Khosla
Journal:  J R Soc Interface       Date:  2013-05-29       Impact factor: 4.118

Review 4.  Exploring the structure and function paradigm.

Authors:  Oliver C Redfern; Benoit Dessailly; Christine A Orengo
Journal:  Curr Opin Struct Biol       Date:  2008-06       Impact factor: 6.809

Review 5.  Relief-based feature selection: Introduction and review.

Authors:  Ryan J Urbanowicz; Melissa Meeker; William La Cava; Randal S Olson; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-18       Impact factor: 6.317

6.  Multi-Harmony: detecting functional specificity from sequence alignment.

Authors:  Bernd W Brandt; K Anton Feenstra; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2010-06-04       Impact factor: 16.971

7.  An automated stochastic approach to the identification of the protein specificity determinants and functional subfamilies.

Authors:  Pavel V Mazin; Mikhail S Gelfand; Andrey A Mironov; Aleksandra B Rakhmaninova; Anatoly R Rubinov; Robert B Russell; Olga V Kalinina
Journal:  Algorithms Mol Biol       Date:  2010-07-15       Impact factor: 1.405

8.  Clustering of protein domains for functional and evolutionary studies.

Authors:  Pavle Goldstein; Jurica Zucko; Dusica Vujaklija; Anita Krisko; Daslav Hranueli; Paul F Long; Catherine Etchebest; Bojan Basrak; John Cullum
Journal:  BMC Bioinformatics       Date:  2009-10-15       Impact factor: 3.169

9.  Ensemble approach to predict specificity determinants: benchmarking and validation.

Authors:  Saikat Chakrabarti; Anna R Panchenko
Journal:  BMC Bioinformatics       Date:  2009-07-02       Impact factor: 3.169

10.  SDR: a database of predicted specificity-determining residues in proteins.

Authors:  Jason E Donald; Eugene I Shakhnovich
Journal:  Nucleic Acids Res       Date:  2008-10-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.