Literature DB >> 12594080

Using fuzzy logic to confirm the integrity of a pattern recognition algorithm for long genomic sequences: the W-curve.

Douglas J Cork1, Andre Toguem.   

Abstract

The W-curve is a numerical mapping algorithm that provides tertiary information content of long and short genomic sequences. The most popular genomic pattern recognition algorithms depend on string matching of the primary information content of short genomic sequences. Herein, we describe a way to define the fuzzy properties of the W-curve. This approach improves a distance (dissimilarity) between two or more homologous long genomic sequences. Fourier analysis of W-curves delivers a smoother function for gap-stripped regions. Calculation of respective Fourier energies may improve the accuracy of the distance metric used to generate a phylogenetic tree of analyzed genomic sequences. This is especially the case for long genomic sequences that have been gap-stripped and aligned with the aid of previously published heuristic methods. These previous methods involved W-curve alignments used in concert with such programs as Clustal that use linear dynamic programming to align multiple gap-stripped W-curves.

Mesh:

Substances:

Year:  2002        PMID: 12594080     DOI: 10.1111/j.1749-6632.2002.tb04887.x

Source DB:  PubMed          Journal:  Ann N Y Acad Sci        ISSN: 0077-8923            Impact factor:   5.691


  2 in total

1.  W-curve alignments for HIV-1 genomic comparisons.

Authors:  Douglas J Cork; Steven Lembark; Sodsai Tovanabutra; Merlin L Robb; Jerome H Kim
Journal:  PLoS One       Date:  2010-06-01       Impact factor: 3.240

2.  Coronavirus phylogeny based on a geometric approach.

Authors:  Wen-Xin Zheng; Ling-Ling Chen; Hong-Yu Ou; Feng Gao; Chun-Ting Zhang
Journal:  Mol Phylogenet Evol       Date:  2005-08       Impact factor: 4.286

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.