The efficient computation of position-specific match scores with the fast Fourier transform.

Abstract

Historically, in computational biology the fast Fourier transform (FFT) has been used almost exclusively to count the number of exact letter matches between two biosequences. This paper presents an FFT algorithm that can compute the match score of a sequence against a position-specific scoring matrix (PSSM). Our algorithm finds the PSSM score simultaneously over all offsets of the PSSM with the sequence, although like all previous FFT algorithms, it still disallows gaps. Although our algorithm is presented in the context of global matching, it can be adapted to local matching without gaps. As a benchmark, our PSSM-modified FFT algorithm computed pairwise match scores. In timing experiments, our most efficient FFT implementation for pairwise scoring appeared to be 10 to 26 times faster than a traditional FFT implementation, with only a factor of 2 in the acceleration attributable to a previously known compression scheme. Many important algorithms for detecting biosequence similarities, e.g., gapped BLAST or PSIBLAST, have a heuristic screening phase that disallows gaps. This paper demonstrates that FFT algorithms merit reconsideration in these screening applications.
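The core idea in the abstract, scoring a PSSM against a sequence at every ungapped offset at once, can be illustrated with a textbook FFT cross-correlation. The sketch below is not the paper's optimized algorithm and does not include its compression scheme. It assumes one indicator vector per alphabet letter, a PSSM stored as a list of per-column dictionaries, and a simple recursive radix-2 FFT. The names `pssm_scores_fft` and `pssm_scores_direct` are hypothetical and chosen only for illustration.

```python
import cmath


def _fft(values, sign):
    """Recursive radix-2 FFT; len(values) must be a power of two.
    sign=-1 gives the forward transform, sign=+1 the unscaled inverse."""
    n = len(values)
    if n == 1:
        return list(values)
    even = _fft(values[0::2], sign)
    odd = _fft(values[1::2], sign)
    half = n // 2
    out = [0j] * n
    for k in range(half):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + half] = even[k] - twiddle
    return out


def pssm_scores_fft(sequence, pssm, alphabet):
    """Ungapped PSSM score at every offset d = -(m-1) .. n-1.
    pssm[j][letter] is the score of `letter` at PSSM column j (assumed layout).
    The score for offset d is stored at index d + m - 1."""
    n, m = len(sequence), len(pssm)
    size = 1
    while size < n + m - 1:  # pad so circular convolution equals linear
        size *= 2
    spectrum = [0j] * size
    for letter in alphabet:
        # 1 where the sequence carries this letter, 0 elsewhere.
        indicator = [1.0 if c == letter else 0.0 for c in sequence]
        indicator += [0.0] * (size - n)
        # Reversed PSSM row for this letter turns convolution into correlation.
        weights = [pssm[m - 1 - k][letter] for k in range(m)]
        weights += [0.0] * (size - m)
        f_ind = _fft(indicator, -1)
        f_wts = _fft(weights, -1)
        # By linearity, sum the per-letter products and invert only once.
        for k in range(size):
            spectrum[k] += f_ind[k] * f_wts[k]
    total = _fft(spectrum, +1)
    return [total[t].real / size for t in range(n + m - 1)]


def pssm_scores_direct(sequence, pssm):
    """Reference O(n*m) computation of the same scores, for checking."""
    n, m = len(sequence), len(pssm)
    return [
        sum(pssm[j][sequence[d + j]] for j in range(m) if 0 <= d + j < n)
        for d in range(-(m - 1), n)
    ]
```

Offsets with partial overlap are scored over the overlapping columns only; a caller interested in full overlaps would keep indices `m - 1` through `n - 1`. Setting each PSSM entry to 1 for a single letter and 0 otherwise reduces this to the exact-match counting that the abstract describes as the historical use of the FFT.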

Substances:
Peptides

Year: 2002 PMID: 11911793 DOI: 10.1089/10665270252833172

Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479

Cited by (6 in total):

1. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

2. Sequence alignment by cross-correlation.

3. COSINE: non-seeding method for mapping long noisy sequences.

4. PSimScan: algorithm and utility for fast protein similarity search.

5. Fast index based algorithms and software for matching position specific scoring matrices.

6. Fast sequence analysis based on diamond sampling.