Literature DB >> 8590178

A weighting system and algorithm for aligning many phylogenetically related sequences.

O Gotoh1.   

Abstract

Most multiple sequence alignment programs explicitly or implicity try to optimize some score associated with the resulting alignment. Although the sum-of-pairs score is currently most widely used, it is inappropriate when the phylogenetic relationships among the sequences to be aligned are not evenly distributed, since the contributions of densely populated groups dominate those of minor members. This paper proposes an iterative multiple sequence alignment method which optimizes a weighted sum-of-pairs score, in which the weights given to individual sequence pairs are adjusted to compensate for the biased contributions. A simple method that rapidly calculates such a set of weights for a given phylogenetic tree is presented. The multiple sequence alignment is refined through partitioning and realignment restricted to the edges of the tree. Under this restriction, profile-based fast and rigorous group-to-group alignment is achieved at each iteration, rendering the overall computational cost virtually identical to that using an unweighted score. Consistency of nearly 90% was attained between structural and sequence alignments of multiple divergent globins, confirming the effectiveness of this strategy in improving the quality of multiple sequence alignment.

Mesh:

Substances:

Year:  1995        PMID: 8590178     DOI: 10.1093/bioinformatics/11.5.543

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  22 in total

Review 1.  Immune chemokines and their receptors: the key elements in the genesis, homeostasis and function of the immune system.

Authors:  O Yoshie
Journal:  Springer Semin Immunopathol       Date:  2000

Review 2.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

3.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

4.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  Two new members of the murine Sim gene family are transcriptional repressors and show different expression patterns during mouse embryogenesis.

Authors:  M Ema; M Morita; S Ikawa; M Tanaka; Y Matsuda; O Gotoh; Y Saijoh; H Fujii; H Hamada; Y Kikuchi; Y Fujii-Kuriyama
Journal:  Mol Cell Biol       Date:  1996-10       Impact factor: 4.272

7.  TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.

Authors:  Zhen Tan; Yinghan Fu; Gaurav Sharma; David H Mathews
Journal:  Nucleic Acids Res       Date:  2017-11-16       Impact factor: 16.971

8.  The identification and functional characterization of WxL proteins from Enterococcus faecium reveal surface proteins involved in extracellular matrix interactions.

Authors:  Jessica R Galloway-Peña; Xiaowen Liang; Kavindra V Singh; Puja Yadav; Chungyu Chang; Sabina Leanti La Rosa; Samuel Shelburne; Hung Ton-That; Magnus Höök; Barbara E Murray
Journal:  J Bacteriol       Date:  2014-12-15       Impact factor: 3.490

9.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

10.  The construction and use of log-odds substitution scores for multiple sequence alignment.

Authors:  Stephen F Altschul; John C Wootton; Elena Zaslavsky; Yi-Kuo Yu
Journal:  PLoS Comput Biol       Date:  2010-07-15       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.