| Literature DB >> 12594080 |
Douglas J Cork1, Andre Toguem.
Abstract
The W-curve is a numerical mapping algorithm that provides tertiary information content of long and short genomic sequences. The most popular genomic pattern recognition algorithms depend on string matching of the primary information content of short genomic sequences. Herein, we describe a way to define the fuzzy properties of the W-curve. This approach improves a distance (dissimilarity) between two or more homologous long genomic sequences. Fourier analysis of W-curves delivers a smoother function for gap-stripped regions. Calculation of respective Fourier energies may improve the accuracy of the distance metric used to generate a phylogenetic tree of analyzed genomic sequences. This is especially the case for long genomic sequences that have been gap-stripped and aligned with the aid of previously published heuristic methods. These previous methods involved W-curve alignments used in concert with such programs as Clustal that use linear dynamic programming to align multiple gap-stripped W-curves.Mesh:
Substances:
Year: 2002 PMID: 12594080 DOI: 10.1111/j.1749-6632.2002.tb04887.x
Source DB: PubMed Journal: Ann N Y Acad Sci ISSN: 0077-8923 Impact factor: 5.691