Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Randomized and parallel algorithms for distance matrix calculations in multiple sequence alignment.

Literature DB >> 16328949

Randomized and parallel algorithms for distance matrix calculations in multiple sequence alignment.

Sanguthevar Rajasekaran¹, Vishal Thapar, Hardik Dave, Chun-Hsi Huang.

Abstract

Multiple sequence alignment (MSA) is a vital problem in biology. Optimal alignment of multiple sequences becomes impractical even for a modest number of sequences since the general version of the problem is NP-hard. Because of the high time complexity of traditional MSA algorithms, even today's fast computers are not able to solve the problem for large number of sequences. In this paper we present a randomized algorithm to calculate distance matrices, which is a major step in many multiple sequence alignment algorithms. The basic idea employed is sampling (along the lines of). We also illustrate how to parallelize this algorithm. In Section we introduce the problem of multiple sequence alignments. In Section we provide a discussion on various methods that have been employed in the literature for Multiple Sequence Alignment. In this section we also introduce our new sampling approach. We extend our randomized algorithm to the case of non-uniform length sequences as well. We show that our algorithms are amenable to parallelism in Section. In Section we back up our claim of speedup and accuracy with empirical data and examples. In Section we provide some concluding remarks.

Entities: Chemical

Mesh：

Year: 2005 PMID： 16328949 DOI： 10.1007/s10877-005-0680-3

Source DB: PubMed Journal: J Clin Monit Comput ISSN： 1387-1307 Impact factor: 1.977

Keyword Cloud
References

10 in total

10. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors: J D Thompson; D G Higgins; T J Gibson
Journal: Nucleic Acids Res Date: 1994-11-11 Impact factor: 16.971

10 in total

Randomized and parallel algorithms for distance matrix calculations in multiple sequence alignment.

Review 1. Recent progress in multiple sequence alignment: a survey.

2. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

3. A novel randomized iterative strategy for aligning multiple protein sequences.

4. Structure-based assignment of the biochemical function of a hypothetical protein: a test case of structural genomics.

5. Multiple sequence alignment with hierarchical clustering.

6. A general method applicable to the search for similarities in the amino acid sequence of two proteins.

7. Progressive sequence alignment as a prerequisite to correct phylogenetic trees.

8. Further improvement in methods of group-to-group sequence alignment with generalized profile operations.

9. Optimal alignment between groups of sequences and its application to multiple sequence alignment.

10. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.