Literature DB >> 1747779

A novel randomized iterative strategy for aligning multiple protein sequences.

M P Berger1, P J Munson.   

Abstract

The rigorous alignment of multiple protein sequences becomes impractical even with a modest number of sequences, since computer memory and time requirements increase as the product of the lengths of the sequences. We have devised a strategy to approach such an optimal alignment, which modifies the intensive computer storage and time requirements of dynamic programming. Our algorithm randomly divides a group of unaligned sequences into two subgroups, between which an optimal alignment is then obtained by a Needleman-Wunsch style of algorithm. Our algorithm uses a matrix with dimensions corresponding to the lengths of the two aligned sequence subgroups. The pairwise alignment process is repeated using different random divisions of the whole group into two subgroups. Compared with the rigorous approach of solving the n-dimensional lattice by dynamic programming, our iterative algorithm results in alignments that match or are close to the optimal solution, on a limited set of test problems. We have implemented this algorithm in a computer program that runs on the IBM PC class of machines, together with a user-friendly environment for interactively selecting sequences or groups of sequences to be aligned either simultaneously or progressively.

Mesh:

Substances:

Year:  1991        PMID: 1747779     DOI: 10.1093/bioinformatics/7.4.479

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  23 in total

1.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

2.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

3.  ProbCons: Probabilistic consistency-based multiple sequence alignment.

Authors:  Chuong B Do; Mahathi S P Mahabhashyam; Michael Brudno; Serafim Batzoglou
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

4.  An estimate of divergence time of Parazoa and Eumetazoa and that of Cephalochordata and Vertebrata by aldolase and triose phosphate isomerase clocks.

Authors:  N Nikoh; N Iwabe; K Kuma; M Ohno; T Sugiyama; Y Watanabe; K Yasui; Z Shi-cui; K Hori; Y Shimura; T Miyata
Journal:  J Mol Evol       Date:  1997-07       Impact factor: 2.395

5.  Phylogenetic position of Dictyostelium inferred from multiple protein data sets.

Authors:  K Kuma; N Nikoh; N Iwabe; T Miyata
Journal:  J Mol Evol       Date:  1995-08       Impact factor: 2.395

6.  Comprehensive comparison of graph based multiple protein sequence alignment strategies.

Authors:  Ilya Plyusnin; Liisa Holm
Journal:  BMC Bioinformatics       Date:  2012-04-29       Impact factor: 3.169

7.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

8.  Parallelization of the MAFFT multiple sequence alignment program.

Authors:  Kazutaka Katoh; Hiroyuki Toh
Journal:  Bioinformatics       Date:  2010-04-28       Impact factor: 6.937

9.  Randomized and parallel algorithms for distance matrix calculations in multiple sequence alignment.

Authors:  Sanguthevar Rajasekaran; Vishal Thapar; Hardik Dave; Chun-Hsi Huang
Journal:  J Clin Monit Comput       Date:  2005-10       Impact factor: 1.977

10.  Refining multiple sequence alignments with conserved core regions.

Authors:  Saikat Chakrabarti; Christopher J Lanczycki; Anna R Panchenko; Teresa M Przytycka; Paul A Thiessen; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2006-05-17       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.