Literature DB >> 1908023

Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.

B E Blaisdell1.   

Abstract

A measure of sequence similarity, dt, not requiring prior sequence alignment gave correct results for a variety of computer-generated model sequences without and with gaps for all degrees of substitution, s. Measure d was the squared Euclidean distance between vectors of counts of t-tuplets of characters in the two sequences. In models without gaps and without Needleman-Wunsch alignment, average d was very closely equal to twice average conventional mismatch counts, m. In these models one of each of the conditions on the Jukes-Cantor model was violated in turn: (1) both descendant lineages receive the same number of substitutions, (2) all sites are equally likely to be substituted, (3) all different replacement characters are equally likely to be chosen, and (4) all original characters are equally likely to be substituted. In Jukes-Cantor models with gaps Needleman-Wunsch alignment was necessarily performed, a procedure that generally produced incorrect values of m. For these models average d was found to be very closely equal to twice the average m estimated from the known value of s using the inverted Jukes-Cantor formula.

Entities:  

Mesh:

Year:  1991        PMID: 1908023     DOI: 10.1007/bf02102654

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  4 in total

1.  Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.

Authors:  B E Blaisdell
Journal:  J Mol Evol       Date:  1989-12       Impact factor: 2.395

2.  Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences.

Authors:  B E Blaisdell
Journal:  J Mol Evol       Date:  1989-12       Impact factor: 2.395

3.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

4.  Sequence analysis of a cDNA clone encoding the liver cell adhesion molecule, L-CAM.

Authors:  W J Gallin; B C Sorkin; G M Edelman; B A Cunningham
Journal:  Proc Natl Acad Sci U S A       Date:  1987-05       Impact factor: 11.205

  4 in total
  3 in total

1.  Similar cases retrieval from the database of laboratory test results.

Authors:  Zhenjun Yang; Yasushi Matsumura; Shigeki Kuwata; Hideo Kusuoka; Hiroshi Takeda
Journal:  J Med Syst       Date:  2003-06       Impact factor: 4.460

2.  Protein sequence randomness and sequence/structure correlations.

Authors:  R S Rahman; S Rackovsky
Journal:  Biophys J       Date:  1995-04       Impact factor: 4.033

3.  Alignment-free method for DNA sequence clustering using Fuzzy integral similarity.

Authors:  Ajay Kumar Saw; Garima Raj; Manashi Das; Narayan Chandra Talukdar; Binod Chandra Tripathy; Soumyadeep Nandi
Journal:  Sci Rep       Date:  2019-03-06       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.