Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Statistical distance between texts and filtration methods in sequence comparison.

Literature DB >> 1591607

Statistical distance between texts and filtration methods in sequence comparison.

Abstract

Upon searching local similarities in long sequences, the necessity of a 'rapid' similarity search becomes acute. Quadratic complexity of dynamic programming algorithms forces the employment of filtration methods that allow elimination of the sequences with a low similarity level. The paper is devoted to the theoretical substantiations of the filtration method based on the statistical distance between texts. The notion of the filtration efficiency is introduced and the efficiency of several filters is estimated. It is shown that the efficiency of the statistical l-tuple filtration upon DNA database search is associated with a potential extension of the original four-letter alphabet and grows exponentially with increasing l. The formula that allows one to estimate the filtration parameters is presented.

Mesh：

Substances：
Proteins
DNA

Year: 1992 PMID： 1591607 DOI： 10.1093/bioinformatics/8.2.121

Source DB: PubMed Journal: Comput Appl Biosci ISSN： 0266-7061

Keyword Cloud
Cited

2 in total

1. A hybrid distance measure for clustering expressed sequence tags originating from the same gene family.

Authors: Keng-Hoong Ng; Chin-Kuan Ho; Somnuk Phon-Amnuaisuk
Journal: PLoS One Date: 2012-10-11 Impact factor: 3.240

2. Pervasive properties of the genomic signature.

Authors: Robert W Jernigan; Robert H Baran
Journal: BMC Genomics Date: 2002-08-09 Impact factor: 3.969

2 in total