Literature DB >> 8977877

A new approach to clustering the amino acids.

L E Stanfel1.   

Abstract

Each amino acid is represented by a vector of numerical measurements for the attributes of volume, area, hydrophilicity, polarity, hydrogen bonding, shape, and charge. Inter-residue distances are then calculated according to common metrics, and we introduce a new clustering objective function derived from information-theoretic considerations. The arguments of the function are the inter-object distances of the things to be clustered: in this case the amino acids. By means of approximating the solution of an integer programming problem, then, the residues are partitioned into clusters. The clusters obtained are compared with groups obtained in substitution/mutation studies and found to be similar. Thus, probably the strongest and most objective evidence to date is supplied for believing that physico-chemical properties account for the viability of substitutions and that the important similarities/differences are explained by a relatively small and simple set of properties.

Entities:  

Mesh:

Substances:

Year:  1996        PMID: 8977877     DOI: 10.1006/jtbi.1996.0213

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  16 in total

1.  Analysis of sequence periodicity in E. coli proteins: empirical investigation of the "duplication and divergence" theory of protein evolution.

Authors:  Derek Gatherer; Neil R McEwan
Journal:  J Mol Evol       Date:  2003-08       Impact factor: 2.395

2.  LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.

Authors:  Michael Brudno; Chuong B Do; Gregory M Cooper; Michael F Kim; Eugene Davydov; Eric D Green; Arend Sidow; Serafim Batzoglou
Journal:  Genome Res       Date:  2003-03-12       Impact factor: 9.043

3.  Phylogenetic differences in content and intensity of periodic proteins.

Authors:  Derek Gatherer; Neil R McEwan
Journal:  J Mol Evol       Date:  2005-04       Impact factor: 2.395

4.  Structure prediction of a multi-domain EF-hand Ca2+ binding protein by PROPAINOR.

Authors:  Subramanian Jyothi; Sourajit M Mustafi; Kandala V R Chary; Rajani R Joshi
Journal:  J Mol Model       Date:  2005-08-11       Impact factor: 1.810

5.  Fast prediction of protein domain boundaries using conserved local patterns.

Authors:  Rajani R Joshi; Vivekanand V Samant
Journal:  J Mol Model       Date:  2006-04-29       Impact factor: 1.810

6.  Bayesian data mining of protein domains gives an efficient predictive algorithm and new insight.

Authors:  Rajani R Joshi; Vivekanand V Samant
Journal:  J Mol Model       Date:  2006-10-07       Impact factor: 1.810

7.  CodonTest: modeling amino acid substitution preferences in coding sequences.

Authors:  Wayne Delport; Konrad Scheffler; Gordon Botha; Mike B Gravenor; Spencer V Muse; Sergei L Kosakovsky Pond
Journal:  PLoS Comput Biol       Date:  2010-08-19       Impact factor: 4.475

8.  A maximum likelihood method for detecting directional evolution in protein sequences and its application to influenza A virus.

Authors:  Sergei L Kosakovsky Pond; Art F Y Poon; Andrew J Leigh Brown; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2008-05-29       Impact factor: 16.240

9.  Benchmarking multi-rate codon models.

Authors:  Wayne Delport; Konrad Scheffler; Mike B Gravenor; Spencer V Muse; Sergei Kosakovsky Pond
Journal:  PLoS One       Date:  2010-07-21       Impact factor: 3.240

10.  Non-negative matrix factorization for learning alignment-specific models of protein evolution.

Authors:  Ben Murrell; Thomas Weighill; Jan Buys; Robert Ketteringham; Sasha Moola; Gerdus Benade; Lise du Buisson; Daniel Kaliski; Tristan Hands; Konrad Scheffler
Journal:  PLoS One       Date:  2011-12-22       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.