Literature DB >> 14764567

Fold-specific substitution matrices for protein classification.

R B Vilim1, R M Cunningham, B Lu, P Kheradpour, F J Stevens.   

Abstract

MOTIVATION: Methods that focus on secondary structures, such as Position Specific Scoring Matrices and Hidden Markov Models, have proved useful for assigning proteins to families. However, for assigning proteins to an attribute class within a family these methods may introduce more free parameters than are needed. There are fewer members and there is less variability among sequences within a family. We describe a method for organizing proteins in a family that exhibits up to an order of magnitude reduction in the number of parameters. The basis is the log odds ratio commonly used to measure similarity. We adapt this to characterize the sequence dissimilarities that give rise to attribute differentiation. This leads to the definition of Class Attribute Substitution Matrices (CLASSUM), a dual of the BLOSUM.
RESULTS: The method was applied to classify sequences hierarchically in the lambda and kappa subgroups of the immunoglobulin superfamily. Positions conferring class were identified based on the degree of amino acid variability at a position. The CLASSUM computed for these positions classified better than 90% of test data correctly compared with 35-50% for BLOSUM-62. The expected value for a random matrix is 14%. The results suggest that family-specific data-derived substitution matrices can improve the resolution of automated methods that use generic substitution matrices for searching for and classifying proteins.

Mesh:

Substances:

Year:  2004        PMID: 14764567     DOI: 10.1093/bioinformatics/btg492

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  Aligning protein sequence and analysing substitution pattern using a class-specific matrix.

Authors:  Hai Song Xu; Wen Ke Ren; Xiao Hui Liu; Xiao Qin Li
Journal:  J Biosci       Date:  2010-06       Impact factor: 1.826

Review 2.  Substitution scoring matrices for proteins - An overview.

Authors:  Rakesh Trivedi; Hampapathalu Adimurthy Nagarajaram
Journal:  Protein Sci       Date:  2020-10-12       Impact factor: 6.725

3.  RaacFold: a webserver for 3D visualization and analysis of protein structure by using reduced amino acid alphabets.

Authors:  Lei Zheng; Dongyang Liu; Yuan Alex Li; Siqi Yang; Yuchao Liang; Yongqiang Xing; Yongchun Zuo
Journal:  Nucleic Acids Res       Date:  2022-05-25       Impact factor: 19.160

4.  Protein sequence alignment with family-specific amino acid similarity matrices.

Authors:  Igor B Kuznetsov
Journal:  BMC Res Notes       Date:  2011-08-16

5.  Incorporating background frequency improves entropy-based residue conservation measures.

Authors:  Kai Wang; Ram Samudrala
Journal:  BMC Bioinformatics       Date:  2006-08-17       Impact factor: 3.169

6.  An interactive visualization tool to explore the biophysical properties of amino acids and their contribution to substitution matrices.

Authors:  Blazej Bulka; Marie desJardins; Stephen J Freeland
Journal:  BMC Bioinformatics       Date:  2006-07-03       Impact factor: 3.169

7.  EvDTree: structure-dependent substitution profiles based on decision tree classification of 3D environments.

Authors:  Jean-Christophe Gelly; Laurent Chiche; Jérôme Gracy
Journal:  BMC Bioinformatics       Date:  2005-01-10       Impact factor: 3.169

8.  PR2ALIGN: a stand-alone software program and a web-server for protein sequence alignment using weighted biochemical properties of amino acids.

Authors:  Igor B Kuznetsov; Michael McDuffie
Journal:  BMC Res Notes       Date:  2015-05-07

9.  On the evolution of the standard amino-acid alphabet.

Authors:  Yi Lu; Stephen Freeland
Journal:  Genome Biol       Date:  2006-02-01       Impact factor: 13.583

10.  Fold-specific sequence scoring improves protein sequence matching.

Authors:  Sumudu P Leelananda; Andrzej Kloczkowski; Robert L Jernigan
Journal:  BMC Bioinformatics       Date:  2016-08-30       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.