Literature DB >> 9053899

Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins.

K Tomii1, M Kanehisa.   

Abstract

An amino acid index is a set of 20 numerical values representing any of the different physicochemical and biochemical properties of amino acids. As a follow-up to the previous study, we have increased the size of the database, which currently contains 402 published indices, and re-performed the single-linkage cluster analysis. The results basically confirmed the previous findings. Another important feature of amino acids that can be represented numerically is the similarity between them. Thus, a similarity matrix, also called a mutation matrix, is a set of 20 x 20 numerical values used for protein sequence alignments and similarity searches. We have collected 42 published matrices, performed hierarchical cluster analyses and identified several clusters corresponding to the nature of the data set and the method used for constructing the mutation matrix. Further, we have tried to reproduce each mutation matrix by the combination of amino acid indices in order to understand which properties of amino acids are reflected most. There was a relationship between the PAM units of Dayhoff's mutation matrix and the volume and hydrophobicity of amino acids. The database of 402 amino acid indices and 42 amino acid mutation matrices is made publicly available on the Internet.

Mesh:

Substances:

Year:  1996        PMID: 9053899     DOI: 10.1093/protein/9.1.27

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


  86 in total

1.  Detection of protein fold similarity based on correlation of amino acid properties.

Authors:  I V Grigoriev; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-07       Impact factor: 11.205

2.  AAindex: amino acid index database.

Authors:  S Kawashima; M Kanehisa
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species.

Authors:  P J Haney; J H Badger; G L Buldak; C I Reich; C R Woese; G J Olsen
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-30       Impact factor: 11.205

Review 4.  The case for an error minimizing standard genetic code.

Authors:  Stephen J Freeland; Tao Wu; Nick Keulmann
Journal:  Orig Life Evol Biosph       Date:  2003-10       Impact factor: 1.950

5.  An optimal structure-discriminative amino acid index for protein fold recognition.

Authors:  R H Leary; J B Rosen; P Jambeck
Journal:  Biophys J       Date:  2004-01       Impact factor: 4.033

6.  In search for more accurate alignments in the twilight zone.

Authors:  Lukasz Jaroszewski; Weizhong Li; Adam Godzik
Journal:  Protein Sci       Date:  2002-07       Impact factor: 6.725

7.  CRASP: a program for analysis of coordinated substitutions in multiple alignments of protein sequences.

Authors:  Dmitry A Afonnikov; Nikolay A Kolchanov
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

8.  On the classes of aminoacyl-tRNA synthetases, amino acids and the genetic code.

Authors:  Andre R O Cavalcanti; Elisa Soares Leite; Benício B Neto; Ricardo Ferreira
Journal:  Orig Life Evol Biosph       Date:  2004-08       Impact factor: 1.950

Review 9.  Designing antimicrobial peptides: form follows function.

Authors:  Christopher D Fjell; Jan A Hiss; Robert E W Hancock; Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2011-12-16       Impact factor: 84.694

10.  Real value prediction of protein folding rate change upon point mutation.

Authors:  Liang-Tsung Huang; M Michael Gromiha
Journal:  J Comput Aided Mol Des       Date:  2012-03-18       Impact factor: 3.686

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.