Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.

Literature DB >> 1908023

Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.

Abstract

A measure of sequence similarity, dt, not requiring prior sequence alignment gave correct results for a variety of computer-generated model sequences without and with gaps for all degrees of substitution, s. Measure d was the squared Euclidean distance between vectors of counts of t-tuplets of characters in the two sequences. In models without gaps and without Needleman-Wunsch alignment, average d was very closely equal to twice average conventional mismatch counts, m. In these models one of each of the conditions on the Jukes-Cantor model was violated in turn: (1) both descendant lineages receive the same number of substitutions, (2) all sites are equally likely to be substituted, (3) all different replacement characters are equally likely to be chosen, and (4) all original characters are equally likely to be substituted. In Jukes-Cantor models with gaps Needleman-Wunsch alignment was necessarily performed, a procedure that generally produced incorrect values of m. For these models average d was found to be very closely equal to twice the average m estimated from the known value of s using the inverted Jukes-Cantor formula.

Entities: Disease

Mesh：

Year: 1991 PMID： 1908023 DOI： 10.1007/bf02102654

Source DB: PubMed Journal: J Mol Evol ISSN： 0022-2844 Impact factor: 2.395

4 in total

1. Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.

Authors: B E Blaisdell
Journal: J Mol Evol Date: 1989-12 Impact factor: 2.395

2. Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences.

Authors: B E Blaisdell
Journal: J Mol Evol Date: 1989-12 Impact factor: 2.395

3. A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors: S B Needleman; C D Wunsch
Journal: J Mol Biol Date: 1970-03 Impact factor: 5.469

4. Sequence analysis of a cDNA clone encoding the liver cell adhesion molecule, L-CAM.

Authors: W J Gallin; B C Sorkin; G M Edelman; B A Cunningham
Journal: Proc Natl Acad Sci U S A Date: 1987-05 Impact factor: 11.205

4 in total

3 in total

Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a variety of computer-generated model systems.

1. Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system.

2. Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences.

3. A general method applicable to the search for similarities in the amino acid sequence of two proteins.

4. Sequence analysis of a cDNA clone encoding the liver cell adhesion molecule, L-CAM.

1. Similar cases retrieval from the database of laboratory test results.

2. Protein sequence randomness and sequence/structure correlations.

3. Alignment-free method for DNA sequence clustering using Fuzzy integral similarity.