Literature DB >> 8415606

Weighting in sequence space: a comparison of methods in terms of generalized sequences.

M Vingron1, P R Sibbald.   

Abstract

Four methods for weighting aligned biological sequences have recently appeared that differ mathematically, philosophically, and in their results. Thus, while there is consensus about the need to weight sequences, the method to use is contentious. A geometric analysis based on a continuous sequence space is presented that provides a common framework in which to compare the methods. It is concluded that there are two "best" methods. When the sequences are known to be phylogenetically related and a tree can be generated without introducing excessive stress into the data, the method of Altschul et al. [Altschul, S. F., Carroll, R. J. & Lipman, D. J. (1989) J. Mol. Biol. 207, 647-653] is appropriate. When the sequences are not known to be phylogenetically related or a tree cannot be produced without unduly distorting the distances between the sequences, a modification of the method of Sibbald and Argos [Sibbald, P. R. & Argos, P. (1990) J. Mol. Biol. 216, 813-818] is preferable.

Mesh:

Substances:

Year:  1993        PMID: 8415606      PMCID: PMC47443          DOI: 10.1073/pnas.90.19.8777

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  20 in total

1.  Sequence ordinations: a multivariate analysis approach to analysing large sequence data sets.

Authors:  D G Higgins
Journal:  Comput Appl Biosci       Date:  1992-02

2.  Multiple sequence alignment with hierarchical clustering.

Authors:  F Corpet
Journal:  Nucleic Acids Res       Date:  1988-11-25       Impact factor: 16.971

3.  Weak hierarchies associated with similarity measures--an additive clustering technique.

Authors:  H J Bandelt; A W Dress
Journal:  Bull Math Biol       Date:  1989       Impact factor: 1.758

4.  Statistical geometry in sequence space: a method of quantitative comparative sequence analysis.

Authors:  M Eigen; R Winkler-Oswatitsch; A Dress
Journal:  Proc Natl Acad Sci U S A       Date:  1988-08       Impact factor: 11.205

5.  A strategy for the rapid multiple alignment of protein sequences. Confidence levels from tertiary structure comparisons.

Authors:  G J Barton; M J Sternberg
Journal:  J Mol Biol       Date:  1987-11-20       Impact factor: 5.469

6.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

7.  Prediction of protein secondary structure and active sites using the alignment of homologous sequences.

Authors:  M J Zvelebil; G J Barton; W R Taylor; M J Sternberg
Journal:  J Mol Biol       Date:  1987-06-20       Impact factor: 5.469

8.  Maximum-likelihood estimation of evolutionary trees from continuous characters.

Authors:  J Felsenstein
Journal:  Am J Hum Genet       Date:  1973-09       Impact factor: 11.025

9.  A flexible method to align large numbers of biological sequences.

Authors:  W R Taylor
Journal:  J Mol Evol       Date:  1988 Dec-1989 Feb       Impact factor: 2.395

10.  The alignment of sets of sequences and the construction of phyletic trees: an integrated method.

Authors:  P Hogeweg; B Hesper
Journal:  J Mol Evol       Date:  1984       Impact factor: 2.395

View more
  20 in total

Review 1.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

2.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

3.  Evolutionary dynamics of spliceosomal intron revealed by in silico analyses of the P-Type ATPase superfamily genes.

Authors:  Toshiyuki Oda; Ryosuke L Ohniwa; Yuki Suzuki; Masatsugu Denawa; Masahiro Kumeta; Hideyuki Okamura; Kunio Takeyasu
Journal:  Mol Biol Rep       Date:  2010-11-03       Impact factor: 2.316

4.  The relative inefficiency of sequence weights approaches in determining a nucleotide position weight matrix.

Authors:  Lee A Newberg; Lee Ann McCue; Charles E Lawrence
Journal:  Stat Appl Genet Mol Biol       Date:  2005-06-01

Review 5.  Statistical and Bayesian approaches to RNA secondary structure prediction.

Authors:  Ye Ding
Journal:  RNA       Date:  2006-03       Impact factor: 4.942

6.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools.

Authors:  J D Thompson; T J Gibson; F Plewniak; F Jeanmougin; D G Higgins
Journal:  Nucleic Acids Res       Date:  1997-12-15       Impact factor: 16.971

7.  A structural census of the current population of protein sequences.

Authors:  M Gerstein; M Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  1997-10-28       Impact factor: 11.205

8.  Multiple DNA and protein sequence alignment based on segment-to-segment comparison.

Authors:  B Morgenstern; A Dress; T Werner
Journal:  Proc Natl Acad Sci U S A       Date:  1996-10-29       Impact factor: 11.205

9.  A phylogenetic approach for weighting genetic sequences.

Authors:  Nicola De Maio; Alexander V Alekseyenko; William J Coleman-Smith; Fabio Pardi; Marc A Suchard; Asif U Tamuri; Jakub Truszkowski; Nick Goldman
Journal:  BMC Bioinformatics       Date:  2021-05-28       Impact factor: 3.169

10.  PhyloPars: estimation of missing parameter values using phylogeny.

Authors:  Jorn Bruggeman; Jaap Heringa; Bernd W Brandt
Journal:  Nucleic Acids Res       Date:  2009-05-13       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.