Improved tools for biological sequence comparison.

W R Pearson, D J Lipman.

Abstract

We have developed three computer programs for comparisons of protein and DNA sequences. They can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity. The FASTA program is a more sensitive derivative of the FASTP program, which can be used to search protein or DNA sequence data bases and can compare a protein sequence to a DNA sequence data base by translating the DNA data base as it is searched. FASTA includes an additional step in the calculation of the initial pairwise similarity score that allows multiple regions of similarity to be joined to increase the score of related sequences. The RDF2 program can be used to evaluate the significance of similarity scores using a shuffling method that preserves local sequence composition. The LFASTA program can display all the regions of local similarity between two sequences with scores greater than a threshold, using the same scoring parameters and a similar alignment algorithm; these local similarities can be displayed as a "graphic matrix" plot or as individual alignments. In addition, these programs have been generalized to allow comparison of DNA or protein sequences based on a variety of alternative scoring matrices.
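The shuffling idea behind RDF2 can be illustrated with a minimal sketch. This is not the RDF2 implementation: the abstract states only that the shuffle preserves local sequence composition, so the window size, the scoring values, the use of a plain Smith-Waterman score as a stand-in for the FASTA score, and all function names below (`local_shuffle`, `best_local_score`, `shuffled_z_value`) are assumptions chosen for illustration.

```python
import random
import statistics


def local_shuffle(seq, window=10, rng=random):
    """Shuffle residues only inside consecutive windows.

    Each window keeps its own residue composition, so compositionally
    biased regions stay biased after shuffling (window size is an assumption).
    """
    out = []
    for start in range(0, len(seq), window):
        chunk = list(seq[start:start + window])
        rng.shuffle(chunk)
        out.extend(chunk)
    return "".join(out)


def best_local_score(a, b, match=2, mismatch=-1, gap=-2):
    """Stand-in similarity score: Smith-Waterman with a linear gap penalty.

    The real programs use their own scoring matrices and heuristics;
    these values are hypothetical.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            score = max(0, diag, prev[j] + gap, cur[j - 1] + gap)
            cur.append(score)
            best = max(best, score)
        prev = cur
    return best


def shuffled_z_value(query, target, n_shuffles=100, window=10, seed=0):
    """Compare the observed score with scores against locally shuffled targets.

    Returns (observed, mean, sd, z); z is None when the shuffled scores
    show no spread and a z-value cannot be computed.
    """
    rng = random.Random(seed)
    observed = best_local_score(query, target)
    shuffled = [
        best_local_score(query, local_shuffle(target, window, rng))
        for _ in range(n_shuffles)
    ]
    mean = statistics.mean(shuffled)
    sd = statistics.stdev(shuffled)
    z = (observed - mean) / sd if sd > 0 else None
    return observed, mean, sd, z
```

A call such as `shuffled_z_value(query, target)` gives a rough sense of how far the observed score lies above the scores expected from sequences of the same local composition. Shuffling within windows rather than across the whole sequence is the design choice that keeps regions of biased composition from inflating the apparent significance.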

Year:  1988        PMID: 3162770      PMCID: PMC280013          DOI: 10.1073/pnas.85.8.2444

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  10 in total

1.  Pattern recognition in genetic sequences.

Authors:  P H Sellers
Journal:  Proc Natl Acad Sci U S A       Date:  1979-07       Impact factor: 11.205

2.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

3.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

4.  Efficient algorithms for folding and comparing nucleic acid sequences.

Authors:  J P Dumas; J Ninio
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

5.  On the statistical significance of nucleic acid similarities.

Authors:  D J Lipman; W J Wilbur; T F Smith; M S Waterman
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

6.  Enhanced graphic matrix analysis of nucleic acid and protein sequences.

Authors:  J V Maizel; R P Lenk
Journal:  Proc Natl Acad Sci U S A       Date:  1981-12       Impact factor: 11.205

7.  Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries.

Authors:  W B Goad; M I Kanehisa
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

8.  Similar amino acid sequences: chance or common ancestry?

Authors:  R F Doolittle
Journal:  Science       Date:  1981-10-09       Impact factor: 47.728

9.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

10.  Rapid similarity searches of nucleic acid and protein data banks.

Authors:  W J Wilbur; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1983-02       Impact factor: 11.205

  2000 in total

1.  Purification and characterization of caffeine synthase from tea leaves.

Authors:  M Kato; K Mizuno; T Fujimura; M Iwama; M Irie; A Crozier; H Ashihara
Journal:  Plant Physiol       Date:  1999-06       Impact factor: 8.340

2.  Protein kinase associated with ribosomes of streptomycetes. (Review)

Authors:  K Mikulík; E Zhoulanova; Q K Hoang; J Janecek; S Bezousková
Journal:  Folia Microbiol (Praha)       Date:  1999       Impact factor: 2.099

3.  Detection of protein fold similarity based on correlation of amino acid properties.

Authors:  I V Grigoriev; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-07       Impact factor: 11.205

4.  Cloning and characterization of a cDNA encoding topoisomerase II in pea and analysis of its expression in relation to cell proliferation.

Authors:  M K Reddy; S Nair; K K Tewari; Y Mudgil; B S Yadav; S K Sopory
Journal:  Plant Mol Biol       Date:  1999-09       Impact factor: 4.076

5.  HUGE: a database for human large proteins identified in the Kazusa cDNA sequencing project.

Authors:  R Kikuno; T Nagase; M Suyama; M Waki; M Hirosawa; O Ohara
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

6.  ProClass protein family database.

Authors:  H Huang; C Xiao; C H Wu
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

7.  PLMItRNA, a database for tRNAs and tRNA genes in plant mitochondria: enlargement and updating.

Authors:  V Volpetti; R Gallerani; C De Benedetto; S Liuni; F Licciulli; L R Ceci
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

8.  MEROPS: the peptidase database.

Authors:  N D Rawlings; A J Barrett
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

9.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

10.  ProtoMap: automatic classification of protein sequences and hierarchy of protein families.

Authors:  G Yona; N Linial; M Linial
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971
