Literature DB >> 1633570

The rapid generation of mutation data matrices from protein sequences.

D T Jones1, W R Taylor, J M Thornton.   

Abstract

An efficient means for generating mutation data matrices from large numbers of protein sequences is presented here. By means of an approximate peptide-based sequence comparison algorithm, the set sequences are clustered at the 85% identity level. The closest relating pairs of sequences are aligned, and observed amino acid exchanges tallied in a matrix. The raw mutation frequency matrix is processed in a similar way to that described by Dayhoff et al. (1978), and so the resulting matrices may be easily used in current sequence analysis applications, in place of the standard mutation data matrices, which have not been updated for 13 years. The method is fast enough to process the entire SWISS-PROT databank in 20 h on a Sun SPARCstation 1, and is fast enough to generate a matrix from a specific family or class of proteins in minutes. Differences observed between our 250 PAM mutation data matrix and the matrix calculated by Dayhoff et al. are briefly discussed.

Mesh:

Substances:

Year:  1992        PMID: 1633570     DOI: 10.1093/bioinformatics/8.3.275

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  2000 in total

1.  Detection of protein fold similarity based on correlation of amino acid properties.

Authors:  I V Grigoriev; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-07       Impact factor: 11.205

2.  Toward a comprehensive phylogeny for mammalian and avian herpesviruses.

Authors:  D J McGeoch; A Dolan; A C Ralph
Journal:  J Virol       Date:  2000-11       Impact factor: 5.103

3.  Phylogenetic analysis of arthropods using two nuclear protein-encoding genes supports a crustacean + hexapod clade.

Authors:  J W Shultz; J C Regier
Journal:  Proc Biol Sci       Date:  2000-05-22       Impact factor: 5.349

4.  Ehrlichia ruminantium major antigenic protein gene (map1) variants are not geographically constrained and show no evidence of having evolved under positive selection pressure.

Authors:  M T Allsopp; C M Dorfling; J C Maillard; A Bensaid; D T Haydon; H van Heerden; B A Allsopp
Journal:  J Clin Microbiol       Date:  2001-11       Impact factor: 5.948

5.  A likelihood ratio test for evolutionary rate shifts and functional divergence among proteins.

Authors:  B Knudsen; M M Miyamoto
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-04       Impact factor: 11.205

6.  Inference of functional regions in proteins by quantification of evolutionary constraints.

Authors:  Alexander L Simon; Eric A Stone; Arend Sidow
Journal:  Proc Natl Acad Sci U S A       Date:  2002-03-05       Impact factor: 11.205

7.  Diversity and evolution of the green fluorescent protein family.

Authors:  Y A Labas; N G Gurskaya; Y G Yanushevich; A F Fradkov; K A Lukyanov; S A Lukyanov; M V Matz
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-02       Impact factor: 11.205

8.  Complementary advantageous substitutions in the evolution of an antiviral RNase of higher primates.

Authors:  Jianzhi Zhang; Helene F Rosenberg
Journal:  Proc Natl Acad Sci U S A       Date:  2002-03-26       Impact factor: 11.205

9.  Exploration of novel motifs derived from mouse cDNA sequences.

Authors:  Hideya Kawaji; Christian Schönbach; Yo Matsuo; Jun Kawai; Yasushi Okazaki; Yoshihide Hayashizaki; Hideo Matsuda
Journal:  Genome Res       Date:  2002-03       Impact factor: 9.043

10.  Pattern and timing of gene duplication in animal genomes.

Authors:  R Friedman; A L Hughes
Journal:  Genome Res       Date:  2001-11       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.