| Literature DB >> 15130540 |
David Hernandez1, Robin Gras, Ron Appel.
Abstract
We introduce a method for ungapped local multiple alignment (ULMA) in a given set of amino acid or nucleotide sequences. This method explores two search spaces using a linked optimization strategy. The first search space M consists of all possible words of a given length W, defined on the residue alphabet. An evolutionary algorithm searches this space globally. The second search space P consists of all possible ULMAs in the sequence set, each ULMA being represented by a position vector defining exactly one subsequence of length W per sequence. This search space is sampled with hill-climbing processes. The search of both spaces are coupled by projecting high scoring results from the global evolutionary search of M onto P. The hill-climbing processes then refine the optimization by local search, using the relative entropy between the ULMA and background residue frequencies as an objective function. We demonstrate some advantages of our strategy by analyzing difficult natural amino acid sequences and artificial datasets. A web interface is available atMesh:
Year: 2004 PMID: 15130540 DOI: 10.1016/j.compbiolchem.2004.01.001
Source DB: PubMed Journal: Comput Biol Chem ISSN: 1476-9271 Impact factor: 2.877