Literature DB >> 2051488

Amino acid substitution matrices from an information theoretic perspective.

S F Altschul1.   

Abstract

Protein sequence alignments have become an important tool for molecular biologists. Local alignments are frequently constructed with the aid of a "substitution score matrix" that specifies a score for aligning each pair of amino acid residues. Over the years, many different substitution matrices have been proposed, based on a wide variety of rationales. Statistical results, however, demonstrate that any such matrix is implicitly a "log-odds" matrix, with a specific target distribution for aligned pairs of amino acid residues. In the light of information theory, it is possible to express the scores of a substitution matrix in bits and to see that different matrices are better adapted to different purposes. The most widely used matrix for protein sequence comparison has been the PAM-250 matrix. It is argued that for database searches the PAM-120 matrix generally is more appropriate, while for comparing two specific proteins with suspected homology the PAM-200 matrix is indicated. Examples discussed include the lipocalins, human alpha 1 B-glycoprotein, the cystic fibrosis transmembrane conductance regulator and the globins.

Entities:  

Mesh:

Substances:

Year:  1991        PMID: 2051488      PMCID: PMC7130686          DOI: 10.1016/0022-2836(91)90193-a

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  49 in total

1.  Protein database searches for multiple alignments.

Authors:  S F Altschul; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1990-07       Impact factor: 11.205

2.  A sensitive procedure to compare amino acid sequences.

Authors:  P Argos
Journal:  J Mol Biol       Date:  1987-01-20       Impact factor: 5.469

3.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

4.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

5.  A family of related ATP-binding subunits coupled to many distinct biological processes in bacteria.

Authors:  C F Higgins; I D Hiles; G P Salmond; D R Gill; J A Downie; I J Evans; I B Holland; L Gray; S D Buckel; A W Bell
Journal:  Nature       Date:  1986 Oct 2-8       Impact factor: 49.962

6.  A nonlinear measure of subalignment similarity and its significance levels.

Authors:  S F Altschul; B W Erickson
Journal:  Bull Math Biol       Date:  1986       Impact factor: 1.758

7.  New scoring matrix for amino acid residue exchanges based on residue characteristic physical parameters.

Authors:  J K Mohana Rao
Journal:  Int J Pept Protein Res       Date:  1987-02

8.  fhuC and fhuD genes for iron (III)-ferrichrome transport into Escherichia coli K-12.

Authors:  J W Coulton; P Mason; D D Allatt
Journal:  J Bacteriol       Date:  1987-08       Impact factor: 3.490

9.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

10.  Amino acid sequence of human plasma alpha 1B-glycoprotein: homology to the immunoglobulin supergene family.

Authors:  N Ishioka; N Takahashi; F W Putnam
Journal:  Proc Natl Acad Sci U S A       Date:  1986-04       Impact factor: 11.205

View more
  154 in total

1.  The estimation of statistical parameters for local alignment score distributions.

Authors:  S F Altschul; R Bundschuh; R Olsen; T Hwa
Journal:  Nucleic Acids Res       Date:  2001-01-15       Impact factor: 16.971

Review 2.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

3.  Predicting deleterious amino acid substitutions.

Authors:  P C Ng; S Henikoff
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

4.  Use of residue pairs in protein sequence-sequence and sequence-structure alignments.

Authors:  J Jung; B Lee
Journal:  Protein Sci       Date:  2000-08       Impact factor: 6.725

5.  The variable and conserved interfaces of modeled olfactory receptor proteins.

Authors:  Y Pilpel; D Lancet
Journal:  Protein Sci       Date:  1999-05       Impact factor: 6.725

6.  A strategy to retrieve the whole set of protein modules in microbial proteomes.

Authors:  Stéphanie Le Bouder-Langevin; Isabelle Capron-Montaland; Renaud De Rosa; Bernard Labedan
Journal:  Genome Res       Date:  2002-12       Impact factor: 9.043

7.  MTRAP: pairwise sequence alignment algorithm by a new measure based on transition probability between two consecutive pairs of residues.

Authors:  Toshihide Hara; Keiko Sato; Masanori Ohya
Journal:  BMC Bioinformatics       Date:  2010-05-08       Impact factor: 3.169

8.  More for less in structural genomics.

Authors:  A Heger; L Holm
Journal:  J Struct Funct Genomics       Date:  2003

9.  The compositional adjustment of amino acid substitution matrices.

Authors:  Yi-Kuo Yu; John C Wootton; Stephen F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  2003-12-08       Impact factor: 11.205

10.  Transcriptomic and proteomic analyses of pericycle cells of the maize primary root.

Authors:  Diana Dembinsky; Katrin Woll; Muhammad Saleem; Yan Liu; Yan Fu; Lisa A Borsuk; Tobias Lamkemeyer; Claudia Fladerer; Johannes Madlung; Brad Barbazuk; Alfred Nordheim; Dan Nettleton; Patrick S Schnable; Frank Hochholdinger
Journal:  Plant Physiol       Date:  2007-08-31       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.