Literature DB >> 3460087

A measure of the similarity of sets of sequences not requiring sequence alignment.

B E Blaisdell.   

Abstract

Determination of first- and second-order Markov chain homogeneity of sets of nuclear eukaryotic DNA sequences, both coding and noncoding, finds similarities imperceptible to the standard Needleman-Wunsch base matching or dot-matrix algorithms. These measures of the similarities of the distributions of adjacent pairs or triplets are in agreement with accepted evolutionary-tree topologies. Hierarchical clustering of the distributions of doublets of 30 miscellaneous coding sequences gives clusters in reasonable agreement with accepted biological classifications. In addition to similarity by homology, there is also observed similarity of disparate genes in the same organism--for example, all three disparate yeast genes (two enzymes and actin) form a well-distinguished cluster.

Entities:  

Mesh:

Substances:

Year:  1986        PMID: 3460087      PMCID: PMC323909          DOI: 10.1073/pnas.83.14.5155

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  44 in total

1.  Sequences at the somatic recombination sites of immunoglobulin light-chain genes.

Authors:  H Sakano; K Hüppi; G Heinrich; S Tonegawa
Journal:  Nature       Date:  1979-07-26       Impact factor: 49.962

2.  Enzymatic synthesis of deoxyribonucleic acid. VIII. Frequencies of nearest neighbor base sequences in deoxyribonucleic acid.

Authors:  J JOSSE; A D KAISER; A KORNBERG
Journal:  J Biol Chem       Date:  1961-03       Impact factor: 5.157

3.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

4.  Molecular cloning and sequence analysis of adult chicken betal globin cDNA.

Authors:  R I Richards; J Shine; A Ullrich; J R Wells; H M Goodman
Journal:  Nucleic Acids Res       Date:  1979-11-10       Impact factor: 16.971

5.  Sequence of three introns in the chick ovalbumin gene.

Authors:  M A Robertson; R Staden; Y Tanaka; J F Catterall; B W O'Malley; G G Brownlee
Journal:  Nature       Date:  1979-03-22       Impact factor: 49.962

6.  The DNA sequence of sea urchin (S. purpuratus) H2A, H2B and H3 histone coding and spacer regions.

Authors:  I Sures; J Lowry; L H Kedes
Journal:  Cell       Date:  1978-11       Impact factor: 41.582

7.  Comparison of total sequence of a cloned rabbit beta-globin gene and its flanking regions with a homologous mouse sequence.

Authors:  A van Ooyen; J van den Berg; N Mantei; C Weissmann
Journal:  Science       Date:  1979-10-19       Impact factor: 47.728

8.  The DNA sequence of Bombyx mori fibroin gene including the 5' flanking, mRNA coding, entire intervening and fibroin protein coding regions.

Authors:  Y Tsujimoto; Y Suzuki
Journal:  Cell       Date:  1979-10       Impact factor: 41.582

9.  The structure and evolution of the two nonallelic rat preproinsulin genes.

Authors:  P Lomedico; N Rosenthal; A Efstratidadis; W Gilbert; R Kolodner; R Tizard
Journal:  Cell       Date:  1979-10       Impact factor: 41.582

10.  The primary structure of a glyceraldehyde-3-phosphate dehydrogenase gene from Saccharomyces cerevisiae.

Authors:  J P Holland; M J Holland
Journal:  J Biol Chem       Date:  1979-10-10       Impact factor: 5.157

View more
  86 in total

1.  Distributional regimes for the number of k-word matches between two random sequences.

Authors:  Ross A Lippert; Haiyan Huang; Michael S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-10-08       Impact factor: 11.205

2.  Metagenomic Classification Using an Abstraction Augmented Markov Model.

Authors:  Xiujun Sylvia Zhu; Monnie McGee
Journal:  J Comput Biol       Date:  2015-11-30       Impact factor: 1.479

3.  Phylogenetic continuum indicates "galaxies" in the protein universe: preliminary results on the natural group structures of proteins.

Authors:  I Ladunga
Journal:  J Mol Evol       Date:  1992-04       Impact factor: 2.395

4.  Measuring the coding potential of genomic sequences through a combination of triplet occurrence patterns and RNY preference.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  J Mol Evol       Date:  2004-09       Impact factor: 2.395

5.  WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences.

Authors:  G Pesole; N Prunella; S Liuni; M Attimonelli; C Saccone
Journal:  Nucleic Acids Res       Date:  1992-06-11       Impact factor: 16.971

6.  "Word" preference in the genomic text and genome evolution: different modes of n-tuplet usage in coding and noncoding sequences.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  J Mol Evol       Date:  2005-07-19       Impact factor: 2.395

7.  A simple method for global sequence comparison.

Authors:  E Pizzi; M Attimonelli; S Liuni; C Frontali; C Saccone
Journal:  Nucleic Acids Res       Date:  1992-01-11       Impact factor: 16.971

8.  Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions.

Authors:  Gregory E Sims; Se-Ran Jun; Guohong A Wu; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-02       Impact factor: 11.205

9.  The perils of plenty: what are we going to do with all these genes?

Authors:  Allen Rodrigo; Frederic Bertels; Joseph Heled; Raphael Noder; Helen Shearman; Peter Tsai
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

10.  Multiple alignment-free sequence comparison.

Authors:  Jie Ren; Kai Song; Fengzhu Sun; Minghua Deng; Gesine Reinert
Journal:  Bioinformatics       Date:  2013-08-29       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.