Literature DB >> 8324637

Optimal alignment between groups of sequences and its application to multiple sequence alignment.

O Gotoh1.   

Abstract

Four algorithms, A-D, were developed to align two groups of biological sequences. Algorithm A is equivalent to the conventional dynamic programming method widely used for aligning ordinary sequences, whereas algorithms B-D are designed to evaluate the cost for a deletion/insertion more accurately when internal gaps are present in either or both groups of sequences. Rigorous optimization of the 'sum of pairs' (SP) score is achieved by algorithm D, whose average performance is close to O(MNL2), where M and N are numbers of sequences included in the two groups and L is the mean length of the sequences. Algorithm B uses some approximations to cope with profile-based operations, whereas algorithm C is a simpler variant of algorithm D. These group-to-group alignment algorithms were applied to multiple sequence alignment with two iterative strategies: a progressive method based on a given binary tree and a randomized grouping--realignment method. The advantages and disadvantages of the four algorithms are discussed on the basis of the results of examinations of several protein families.

Mesh:

Year:  1993        PMID: 8324637     DOI: 10.1093/bioinformatics/9.3.361

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  34 in total

1.  Endosymbiotic microbiota of the bamboo pseudococcid Antonina crawii (Insecta, Homoptera).

Authors:  T Fukatsu; N Nikoh
Journal:  Appl Environ Microbiol       Date:  2000-02       Impact factor: 4.792

2.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

3.  Finding weak similarities between proteins by sequence profile comparison.

Authors:  Anna R Panchenko
Journal:  Nucleic Acids Res       Date:  2003-01-15       Impact factor: 16.971

4.  Contact-based sequence alignment.

Authors:  Jens Kleinjung; John Romein; Kuang Lin; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2004-04-30       Impact factor: 16.971

5.  MAVID: constrained ancestral alignment of multiple sequences.

Authors:  Nicolas Bray; Lior Pachter
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

6.  MHC-BPS: MHC-binder prediction server for identifying peptides of flexible lengths from sequence-derived physicochemical properties.

Authors:  Juan Cui; Lian Yi Han; Hong Huang Lin; Zhi Qun Tang; Li Jiang; Zhi Wei Cao; Yu Zong Chen
Journal:  Immunogenetics       Date:  2006-07-11       Impact factor: 2.846

7.  cDNA cloning and tissue-specific expression of a novel basic helix-loop-helix/PAS factor (Arnt2) with close sequence similarity to the aryl hydrocarbon receptor nuclear translocator (Arnt).

Authors:  K Hirose; M Morita; M Ema; J Mimura; H Hamada; H Fujii; Y Saijo; O Gotoh; K Sogawa; Y Fujii-Kuriyama
Journal:  Mol Cell Biol       Date:  1996-04       Impact factor: 4.272

8.  Two intracellular symbiotic bacteria from the mulberry psyllid Anomoneura mori (Insecta, Homoptera).

Authors:  T Fukatsu; N Nikoh
Journal:  Appl Environ Microbiol       Date:  1998-10       Impact factor: 4.792

9.  RAGA: RNA sequence alignment by genetic algorithm.

Authors:  C Notredame; E A O'Brien; D G Higgins
Journal:  Nucleic Acids Res       Date:  1997-11-15       Impact factor: 16.971

10.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.