Literature DB >> 11266569

ParAlign: a parallel sequence alignment algorithm for rapid and sensitive database searches.

T Rognes1.   

Abstract

There is a need for faster and more sensitive algorithms for sequence similarity searching in view of the rapidly increasing amounts of genomic sequence data available. Parallel processing capabilities in the form of the single instruction, multiple data (SIMD) technology are now available in common microprocessors and enable a single microprocessor to perform many operations in parallel. The ParAlign algorithm has been specifically designed to take advantage of this technology. The new algorithm initially exploits parallelism to perform a very rapid computation of the exact optimal ungapped alignment score for all diagonals in the alignment matrix. Then, a novel heuristic is employed to compute an approximate score of a gapped alignment by combining the scores of several diagonals. This approximate score is used to select the most interesting database sequences for a subsequent Smith-Waterman alignment, which is also parallelised. The resulting method represents a substantial improvement compared to existing heuristics. The sensitivity and specificity of ParAlign was found to be as good as Smith-Waterman implementations when the same method for computing the statistical significance of the matches was used. In terms of speed, only the significantly less sensitive NCBI BLAST 2 program was found to outperform the new approach. Online searches are available at http://dna.uio.no/search/

Entities:  

Mesh:

Year:  2001        PMID: 11266569      PMCID: PMC31274          DOI: 10.1093/nar/29.7.1647

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  21 in total

1.  Predicting protein structure using only sequence information.

Authors:  K Karplus; C Barrett; M Cline; M Diekhans; L Grate; R Hughey
Journal:  Proteins       Date:  1999

2.  Six-fold speed-up of Smith-Waterman sequence database searches using parallel processing on common microprocessors.

Authors:  T Rognes; E Seeberg
Journal:  Bioinformatics       Date:  2000-08       Impact factor: 6.937

3.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

4.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

5.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

6.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

7.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

8.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

9.  An improved algorithm for matching biological sequences.

Authors:  O Gotoh
Journal:  J Mol Biol       Date:  1982-12-15       Impact factor: 5.469

10.  Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms.

Authors:  W R Pearson
Journal:  Genomics       Date:  1991-11       Impact factor: 5.736

View more
  16 in total

1.  Human DNA glycosylases of the bacterial Fpg/MutM superfamily: an alternative pathway for the repair of 8-oxoguanine and other oxidation products in DNA.

Authors:  Ingrid Morland; Veslemøy Rolseth; Luisa Luna; Torbjørn Rognes; Magnar Bjørås; Erling Seeberg
Journal:  Nucleic Acids Res       Date:  2002-11-15       Impact factor: 16.971

2.  Testing statistical significance scores of sequence comparison methods with structure similarity.

Authors:  Tim Hulsen; Jacob de Vlieg; Jack A M Leunissen; Peter M A Groenen
Journal:  BMC Bioinformatics       Date:  2006-10-12       Impact factor: 3.169

3.  A quick guide for developing effective bioinformatics programming skills.

Authors:  Joel T Dudley; Atul J Butte
Journal:  PLoS Comput Biol       Date:  2009-12-24       Impact factor: 4.475

4.  PSimScan: algorithm and utility for fast protein similarity search.

Authors:  Anna Kaznadzey; Natalia Alexandrova; Vladimir Novichkov; Denis Kaznadzey
Journal:  PLoS One       Date:  2013-03-07       Impact factor: 3.240

5.  Accelerated Profile HMM Searches.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

6.  The human ortholog of the rodent testis-specific ABC transporter Abca17 is a ubiquitously expressed pseudogene (ABCA17P) and shares a common 5' end with ABCA3.

Authors:  Armin P Piehler; Jürgen J Wenzel; Ole K Olstad; Kari Bente Foss Haug; Peter Kierulf; Wolfgang E Kaminski
Journal:  BMC Mol Biol       Date:  2006-09-12       Impact factor: 2.946

7.  Cohesive versus flexible evolution of functional modules in eukaryotes.

Authors:  Like Fokkens; Berend Snel
Journal:  PLoS Comput Biol       Date:  2009-01-30       Impact factor: 4.475

8.  Overexpression of the LexA-regulated tisAB RNA in E. coli inhibits SOS functions; implications for regulation of the SOS response.

Authors:  Ragnhild Weel-Sneve; Magnar Bjørås; Knut Ivan Kristiansen
Journal:  Nucleic Acids Res       Date:  2008-10-01       Impact factor: 16.971

9.  A new protein superfamily includes two novel 3-methyladenine DNA glycosylases from Bacillus cereus, AlkC and AlkD.

Authors:  Ingrun Alseth; Torbjørn Rognes; Toril Lindbäck; Inger Solberg; Kristin Robertsen; Knut Ivan Kristiansen; Davide Mainieri; Lucy Lillehagen; Anne-Brit Kolstø; Magnar Bjørås
Journal:  Mol Microbiol       Date:  2006-03       Impact factor: 3.501

10.  RNAmmer: consistent and rapid annotation of ribosomal RNA genes.

Authors:  Karin Lagesen; Peter Hallin; Einar Andreas Rødland; Hans-Henrik Staerfeldt; Torbjørn Rognes; David W Ussery
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.