Literature DB >> 16761921

A deterministic finite automaton for faster protein hit detection in BLAST.

Michael Cameron1, Hugh E Williams, Adam Cannane.   

Abstract

BLAST is the most popular bioinformatics tool and is used to run millions of queries each day. However, evaluating such queries is slow, taking typically minutes on modern workstations. Therefore, continuing evolution of BLAST--by improving its algorithms and optimizations--is essential to improve search times in the face of exponentially increasing collection sizes. We present an optimization to the first stage of the BLAST algorithm specifically designed for protein search. It produces the same results as NCBI-BLAST but in around 59% of the time on Intel-based platforms; we also present results for other popular architectures. Overall, this is a saving of around 15% of the total typical BLAST search time. Our approach uses a deterministic finite automaton (DFA), inspired by the original scheme used in the 1990 BLAST algorithm. The techniques are optimized for modern hardware, making careful use of cache-conscious approaches to improve speed. Our optimized DFA approach has been integrated into a new version of BLAST that is freely available for download at http://www.fsa-blast.org/.

Entities:  

Mesh:

Year:  2006        PMID: 16761921     DOI: 10.1089/cmb.2006.13.965

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  BLAST+: architecture and applications.

Authors:  Christiam Camacho; George Coulouris; Vahram Avagyan; Ning Ma; Jason Papadopoulos; Kevin Bealer; Thomas L Madden
Journal:  BMC Bioinformatics       Date:  2009-12-15       Impact factor: 3.169

2.  PSimScan: algorithm and utility for fast protein similarity search.

Authors:  Anna Kaznadzey; Natalia Alexandrova; Vladimir Novichkov; Denis Kaznadzey
Journal:  PLoS One       Date:  2013-03-07       Impact factor: 3.240

3.  Virome diversity analysis reveals novel enteroviruses and a human picobirnavirus in stool samples from African green monkeys with diarrhea.

Authors:  Wenjuan Li; Xin Qiang; Si Qin; Yong Huang; Yan Hu; Bingke Bai; Jun Hou; Rong Gao; Xianglilan Zhang; Zhiqiang Mi; Hang Fan; Huahu Ye; Yigang Tong; Panyong Mao
Journal:  Infect Genet Evol       Date:  2020-03-10       Impact factor: 3.342

4.  The Genome Reverse Compiler: an explorative annotation tool.

Authors:  Andrew S Warren; João Carlos Setubal
Journal:  BMC Bioinformatics       Date:  2009-01-27       Impact factor: 3.169

Review 5.  Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges.

Authors:  Zekun Yin; Haidong Lan; Guangming Tan; Mian Lu; Athanasios V Vasilakos; Weiguo Liu
Journal:  Comput Struct Biotechnol J       Date:  2017-08-14       Impact factor: 7.271

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.