Optimizing Spaced k-mer Neighbors for Efficient Filtration in Protein Similarity Search.

Abstract

Large-scale comparison and similarity search of genomic DNA and protein sequences are of fundamental importance in modern molecular biology. To search DNA and protein sequences efficiently, seeding (or filtration) methods are widely used: only sequences that share a common pattern, or "seed", are subjected to detailed comparison. These methods therefore trade search sensitivity for search speed. In this paper, we introduce a new seeding method, called spaced k-mer neighbors, which provides a better tradeoff between sensitivity and speed in protein sequence similarity search. In this method, a set of spaced k-mers is selected as the neighbors of each spaced k-mer. These pre-selected neighbors are then used to detect hits between the query sequence and the database sequences. We propose an efficient heuristic algorithm for spaced neighbor selection. Our computational experiments demonstrate that spaced k-mer neighbors can improve the overall tradeoff efficiency over existing seeding methods.
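
The hit-detection idea described in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the mask notation ('1' keeps a residue, '0' skips it), the function names, the toy sequences, and the hand-written neighbor table. The paper's heuristic for selecting neighbors is not reproduced; the sketch simply takes a pre-selected neighbor table as input and, as a further assumption, treats every spaced k-mer as matching itself.

```python
from collections import defaultdict


def spaced_kmers(seq, mask):
    """Yield (position, spaced k-mer) pairs for every window of len(mask).

    Assumed notation: '1' in the mask keeps the residue, '0' skips it.
    """
    keep = [i for i, c in enumerate(mask) if c == "1"]
    for start in range(len(seq) - len(mask) + 1):
        yield start, "".join(seq[start + i] for i in keep)


def build_index(db_seq, mask):
    """Map each spaced k-mer of the database sequence to its start positions."""
    index = defaultdict(list)
    for pos, kmer in spaced_kmers(db_seq, mask):
        index[kmer].append(pos)
    return index


def find_hits(query, db_seq, mask, neighbors):
    """Return sorted (query_pos, db_pos) pairs where seeds match.

    `neighbors` is a hypothetical pre-selected table: spaced k-mer -> list of
    neighbor spaced k-mers. How that table is chosen is outside this sketch.
    """
    index = build_index(db_seq, mask)
    hits = []
    for qpos, kmer in spaced_kmers(query, mask):
        # Assumption: a k-mer always matches itself; neighbors add extra hits.
        for candidate in {kmer} | set(neighbors.get(kmer, ())):
            for dpos in index.get(candidate, ()):
                hits.append((qpos, dpos))
    return sorted(hits)


# Toy data, invented for illustration only.
toy_neighbors = {"AD": ["AE"]}
print(find_hits("AKD", "ARDGAKE", "101", toy_neighbors))
print(find_hits("AKD", "ARDGAKE", "101", {}))
```

With the toy neighbor table, the query window "AKD" hits the database both at the exact spaced match and at the window whose spaced k-mer is the listed neighbor; with an empty table only the exact match remains. A larger neighbor table raises sensitivity at the cost of more candidate hits to verify, which is the tradeoff the abstract refers to.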

Substances: Proteins

Year: 2014 PMID: 26355786 DOI: 10.1109/TCBB.2014.2306831

Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN: 1545-5963 Impact factor: 3.710

Cited by

2 in total

1. A Graphic Encoding Method for Quantitative Classification of Protein Structure and Representation of Conformational Changes.

Authors: Hector Carrillo-Cabada; Jeremy Benson; Asghar M Razavi; Brianna Mulligan; Michel A Cuendet; Harel Weinstein; Michela Taufer; Trilce Estrada
Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2021-08-06 Impact factor: 3.702

2. Best hits of 11110110111: model-free selection and parameter-free sensitivity calculation of spaced seeds.

Authors: Laurent Noé
Journal: Algorithms Mol Biol Date: 2017-02-14 Impact factor: 1.405
