Literature DB >> 9088708

Reduced space sequence alignment.

J A Grice1, R Hughey, D Speck.   

Abstract

MOTIVATION: Sequence alignment is the problem of finding the optimal character-by-character correspondence between two sequences. It can be readily solved in O(n2) time and O(n2) space on a serial machine, or in O(n) time with O(n) space per O(n) processing elements on a parallel machine. Hirschberg's divide-and-conquer approach for finding the single best path reduces space use by a factor of n while inducing only a small constant slowdown to the serial version.
RESULTS: This paper presents a family of methods for computing sequence alignments with reduced memory that are well suited to serial or parallel implementation. Unlike the divide-and-conquer approach, they can be used in the forward-backward (Baum-Welch) training of linear hidden Markov models, and they avoid data-dependent repartitioning, making them easier to parallelize. The algorithms feature, for an arbitrary integer L, a factor proportional to L slowdown in exchange for reducing space requirement from O(n2) to O(n1 square root of n). A single best path member of this algorithm family matches the quadratic time and linear space of the divide-and-conquer algorithm. Experimentally, the O(n1.5)-space member of the family is 15-40% faster than the O(n)-space divide-and-conquer algorithm.

Mesh:

Year:  1997        PMID: 9088708     DOI: 10.1093/bioinformatics/13.1.45

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  9 in total

1.  Efficient algorithms for training the parameters of hidden Markov models using stochastic expectation maximization (EM) training and Viterbi training.

Authors:  Tin Y Lam; Irmtraud M Meyer
Journal:  Algorithms Mol Biol       Date:  2010-12-09       Impact factor: 1.405

2.  Genotype Imputation with Millions of Reference Samples.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2016-01-07       Impact factor: 11.025

3.  Dynamic programming algorithms for comparing multineuronal spike trains via cost-based metrics and alignments.

Authors:  Jonathan D Victor; David H Goldberg; Daniel Gardner
Journal:  J Neurosci Methods       Date:  2006-12-15       Impact factor: 2.390

4.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2009-02-05       Impact factor: 11.025

5.  Clustering ionic flow blockade toggles with a mixture of HMMs.

Authors:  Alexander Churbanov; Stephen Winters-Hilt
Journal:  BMC Bioinformatics       Date:  2008-08-12       Impact factor: 3.169

6.  A linear memory algorithm for Baum-Welch training.

Authors:  István Miklós; Irmtraud M Meyer
Journal:  BMC Bioinformatics       Date:  2005-09-19       Impact factor: 3.169

7.  Searching for evolutionary distant RNA homologs within genomic sequences using partition function posterior probabilities.

Authors:  Usman Roshan; Satish Chikkagoudar; Dennis R Livesay
Journal:  BMC Bioinformatics       Date:  2008-01-28       Impact factor: 3.169

8.  Memory-efficient dynamic programming backtrace and pairwise local sequence alignment.

Authors:  Lee A Newberg
Journal:  Bioinformatics       Date:  2008-06-16       Impact factor: 6.937

9.  Implementing EM and Viterbi algorithms for Hidden Markov Model in linear memory.

Authors:  Alexander Churbanov; Stephen Winters-Hilt
Journal:  BMC Bioinformatics       Date:  2008-04-30       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.