Literature DB >> 2515299

Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences.

B E Blaisdell1.   

Abstract

Various measures of sequence dissimilarity have been evaluated by how well the additive least squares estimation of edges (branch lengths) of an unrooted evolutionary tree fit the observed pairwise dissimilarity measures and by how consistent the trees are for different data sets derived from the same set of sequences. This evaluation provided sensitive discrimination among dissimilarity measures and among possible trees. Dissimilarity measures not requiring prior sequence alignment did about as well as did the traditional mismatch counts requiring prior sequence alignment. Application of Jukes-Cantor correction to singlet mismatch counts worsened the results. Measures not requiring alignment had the advantage of being applicable to sequences too different to be critically alignable. Two different measures of pairwise dissimilarity not requiring alignment have been used: (1) multiplet distribution distance (MDD), the square of the Euclidean distance between vectors of the fractions of base signlets (or doublets, or triplets, or ...) in the respective sequences, and (2) complements of long words (CLW), the count of bases not occurring in significantly long common words. MDD was applicable to sequences more different than was CLW (noncoding), but the latter often gave better results where both measures were available (coding). MDD results were improved by using longer mutliplets and, if the sequences were coding, by using the larger amino acid and codon alphabets rather than the nucleotide alphabet. The additive least squares method could be used to provide a reasonable consensus of different trees for the same set of species (or related genes).

Entities:  

Mesh:

Substances:

Year:  1989        PMID: 2515299     DOI: 10.1007/bf02602924

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  13 in total

1.  PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES.

Authors:  L L Cavalli-Sforza; A W F Edwards
Journal:  Evolution       Date:  1967-09       Impact factor: 3.694

2.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

3.  A measure of the similarity of sets of sequences not requiring sequence alignment.

Authors:  B E Blaisdell
Journal:  Proc Natl Acad Sci U S A       Date:  1986-07       Impact factor: 11.205

4.  How reliably do amino acid composition comparisons predict sequence similarities between proteins?

Authors:  A Cornish-Bowden
Journal:  J Theor Biol       Date:  1979-02-21       Impact factor: 2.691

5.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

6.  The evolution and sequence comparison of two recently diverged mouse chromosomal beta--globin genes.

Authors:  D A Konkel; J V Maizel; P Leder
Journal:  Cell       Date:  1979-11       Impact factor: 41.582

7.  Algorithms for identifying local molecular sequence features.

Authors:  S Karlin; M Morris; G Ghandour; M Y Leung
Journal:  Comput Appl Biosci       Date:  1988-03

8.  Efficient algorithms for molecular sequence analysis.

Authors:  S Karlin; M Morris; G Ghandour; M Y Leung
Journal:  Proc Natl Acad Sci U S A       Date:  1988-02       Impact factor: 11.205

9.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

10.  A prevalent persistent global nonrandomness that distinguishes coding and non-coding eucaryotic nuclear DNA sequences.

Authors:  B E Blaisdell
Journal:  J Mol Evol       Date:  1983       Impact factor: 2.395

View more
  16 in total

1.  Genetic distance and species formation in evolving populations.

Authors:  P G Higgs; B Derrida
Journal:  J Mol Evol       Date:  1992-11       Impact factor: 2.395

2.  Metagenomic Classification Using an Abstraction Augmented Markov Model.

Authors:  Xiujun Sylvia Zhu; Monnie McGee
Journal:  J Comput Biol       Date:  2015-11-30       Impact factor: 1.479

3.  Phylogenetic continuum indicates "galaxies" in the protein universe: preliminary results on the natural group structures of proteins.

Authors:  I Ladunga
Journal:  J Mol Evol       Date:  1992-04       Impact factor: 2.395

4.  The perils of plenty: what are we going to do with all these genes?

Authors:  Allen Rodrigo; Frederic Bertels; Joseph Heled; Raphael Noder; Helen Shearman; Peter Tsai
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

5.  Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.

Authors:  B E Blaisdell
Journal:  J Mol Evol       Date:  1991-06       Impact factor: 2.395

6.  Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.

Authors:  B E Blaisdell
Journal:  J Mol Evol       Date:  1989-12       Impact factor: 2.395

Review 7.  Multimodal Long Noncoding RNA Interaction Networks: Control Panels for Cell Fate Specification.

Authors:  Keriayn N Smith; Sarah C Miller; Gabriele Varani; J Mauro Calabrese; Terry Magnuson
Journal:  Genetics       Date:  2019-12       Impact factor: 4.562

8.  Protein sequence randomness and sequence/structure correlations.

Authors:  R S Rahman; S Rackovsky
Journal:  Biophys J       Date:  1995-04       Impact factor: 4.033

9.  An accurate alignment-free protein sequence comparator based on physicochemical properties of amino acids.

Authors:  Saeedeh Akbari Rokn Abadi; Azam Sadat Abdosalehi; Faezeh Pouyamehr; Somayyeh Koohi
Journal:  Sci Rep       Date:  2022-07-01       Impact factor: 4.996

10.  There appear to be conserved constraints on the distribution of nucleotide sequences in cellular genomes.

Authors:  A C Rogerson
Journal:  J Mol Evol       Date:  1991-01       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.