Literature DB >> 16184599

Grouping of amino acid types and extraction of amino acid properties from multiple sequence alignments using variance maximization.

James O Wrabl1, Nick V Grishin.   

Abstract

Understanding of amino acid type co-occurrence in trusted multiple sequence alignments is a prerequisite for improved sequence alignment and remote homology detection algorithms. Two objective approaches were used to investigate co-occurrence, both based on variance maximization of the weighted residue frequencies in columns taken from a large alignment database. The first approach discretely grouped amino acid types, and the second approach extracted orthogonal properties of amino acids using principal components analysis. The grouping results corresponded to amino acid physical properties such as side chain hydrophobicity, size, or backbone flexibility, and an optimal arrangement of approximately eight groups was observed. However, interpretation of the orthogonal properties was more complex. Although the principal components accounting for the largest variances exhibited modest correlations with hydrophobicity and conservation of glycine, in general principal components did not correspond to physical properties of amino acids. Although not intuitive, these amino acid mathematical properties were demonstrated to be robust and to improve local pairwise alignment accuracy, relative to 20 amino acid frequencies alone, for a simple test case. (c) 2005 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16184599     DOI: 10.1002/prot.20648

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  6 in total

1.  A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.

Authors:  C Etchebest; C Benros; A Bornot; A-C Camproux; A G de Brevern
Journal:  Eur Biophys J       Date:  2007-06-13       Impact factor: 1.733

2.  IHEC_RAAC: a online platform for identifying human enzyme classes via reduced amino acid cluster strategy.

Authors:  Hao Wang; Qilemuge Xi; Pengfei Liang; Lei Zheng; Yan Hong; Yongchun Zuo
Journal:  Amino Acids       Date:  2021-01-23       Impact factor: 3.520

3.  Using inferred residue contacts to distinguish between correct and incorrect protein models.

Authors:  Christopher S Miller; David Eisenberg
Journal:  Bioinformatics       Date:  2008-05-29       Impact factor: 6.937

Review 4.  Research progress of reduced amino acid alphabets in protein analysis and prediction.

Authors:  Yuchao Liang; Siqi Yang; Lei Zheng; Hao Wang; Jian Zhou; Shenghui Huang; Lei Yang; Yongchun Zuo
Journal:  Comput Struct Biotechnol J       Date:  2022-07-04       Impact factor: 6.155

5.  Automated alphabet reduction for protein datasets.

Authors:  Jaume Bacardit; Michael Stout; Jonathan D Hirst; Alfonso Valencia; Robert E Smith; Natalio Krasnogor
Journal:  BMC Bioinformatics       Date:  2009-01-06       Impact factor: 3.169

6.  Nature of protein family signatures: insights from singular value analysis of position-specific scoring matrices.

Authors:  Akira R Kinjo; Haruki Nakamura
Journal:  PLoS One       Date:  2008-04-09       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.