Literature DB >> 1584052

Dynamic programming algorithms for biological sequence comparison.

W R Pearson1, W Miller.   

Abstract

Efficient dynamic programming algorithms are available for a broad class of protein and DNA sequence comparison problems. These algorithms require computer time proportional to the product of the lengths of the two sequences being compared [O(N2)] but require memory space proportional only to the sum of these lengths [O(N)]. Although the requirement for O(N2) time limits use of the algorithms to the largest computers when searching protein and DNA sequence databases, many other applications of these algorithms, such as calculation of distances for evolutionary trees and comparison of a new sequence to a library of sequence profiles, are well within the capabilities of desktop computers. In particular, the results of library searches with rapid searching programs, such as FASTA or BLAST, should be confirmed by performing a rigorous optimal alignment. Whereas rapid methods do not overlook significant sequence similarities, FASTA limits the number of gaps that can be inserted into an alignment, so that a rigorous alignment may extend the alignment substantially in some cases. BLAST does not allow gaps in the local regions that it reports; a calculation that allows gaps is very likely to extend the alignment substantially. Although a Monte Carlo evaluation of the statistical significance of a similarity score with a rigorous algorithm is much slower than the heuristic approach used by the RDF2 program, the dynamic programming approach should take less than 1 hr on a 386-based PC or desktop Unix workstation. For descriptive purposes, we have limited our discussion to methods for calculating similarity scores and distances that use gap penalties of the form g = rk. Nevertheless, programs for the more general case (g = q+rk) are readily available. Versions of these programs that run either on Unix workstations, IBM-PC class computers, or the Macintosh can be obtained from either of the authors.

Mesh:

Substances:

Year:  1992        PMID: 1584052     DOI: 10.1016/0076-6879(92)10029-d

Source DB:  PubMed          Journal:  Methods Enzymol        ISSN: 0076-6879            Impact factor:   1.600


  14 in total

1.  Non-Euclidean properties of spike train metric spaces.

Authors:  Dmitriy Aronov; Jonathan D Victor
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-06-02

2.  Comparison of methods for searching protein sequence databases.

Authors:  W R Pearson
Journal:  Protein Sci       Date:  1995-06       Impact factor: 6.725

3.  Intron position as an evolutionary marker of thioredoxins and thioredoxin domains.

Authors:  M Sahrawy; V Hecht; J Lopez-Jaramillo; A Chueca; Y Chartier; Y Meyer
Journal:  J Mol Evol       Date:  1996-04       Impact factor: 2.395

4.  PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames.

Authors:  E Birney; J D Thompson; T J Gibson
Journal:  Nucleic Acids Res       Date:  1996-07-15       Impact factor: 16.971

5.  Searching databases of conserved sequence regions by aligning protein multiple-alignments.

Authors:  S Pietrokovski
Journal:  Nucleic Acids Res       Date:  1996-10-01       Impact factor: 16.971

6.  The gene G13 in the class III region of the human MHC encodes a potential DNA-binding protein.

Authors:  A Khanna; R D Campbell
Journal:  Biochem J       Date:  1996-10-01       Impact factor: 3.857

7.  Transmembrane helices predicted at 95% accuracy.

Authors:  B Rost; R Casadio; P Fariselli; C Sander
Journal:  Protein Sci       Date:  1995-03       Impact factor: 6.725

8.  Reconstructing evolutionary trees from DNA and protein sequences: paralinear distances.

Authors:  J A Lake
Journal:  Proc Natl Acad Sci U S A       Date:  1994-02-15       Impact factor: 11.205

9.  Defensin-like ZmES4 mediates pollen tube burst in maize via opening of the potassium channel KZM1.

Authors:  Suseno Amien; Irina Kliwer; Mihaela L Márton; Thomas Debener; Dietmar Geiger; Dirk Becker; Thomas Dresselhaus
Journal:  PLoS Biol       Date:  2010-06-01       Impact factor: 8.029

10.  Arabidopsis UVH6, a homolog of human XPD and yeast RAD3 DNA repair genes, functions in DNA repair and is essential for plant growth.

Authors:  Zongrang Liu; Suk-Whan Hong; Mindy Escobar; Elizabeth Vierling; David L Mitchell; David W Mount; Jennifer D Hall
Journal:  Plant Physiol       Date:  2003-07       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.