
A large-scale comparison of genomic sequences: one promising approach.

Valery Kirzhner, Eviatar Nevo, Abraham Korol, Alexander Bolshoy.

Abstract

We introduce a novel, linguistic-like method of genome analysis. We propose characterizing genomic sequences by the occurrences of fixed-length words from a predefined, sufficiently large set of words (strings over the alphabet {A, C, G, T}). The resulting measure, called the compositional spectrum, is a histogram of imperfect word occurrences. Our results indicate that the compositional spectrum is an overall characteristic of a long sequence, i.e., a complete genome or an uninterrupted part of a chromosome. This is manifested in the similarity of spectra obtained from different stretches of the same genome and, simultaneously, in a broad range of dissimilarities between the spectra of different genomes. Imperfect matching makes the approach highly flexible, and as a result sets of relatively long words can be considered. The proposed approach may have various applications in intra- and intergenomic sequence comparisons.
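
The idea of a histogram of imperfect word occurrences can be illustrated with a minimal Python sketch. The abstract gives no parameters, so everything specific here is an assumption made for illustration: the function names, the use of Hamming distance with a `max_mismatches` threshold as the notion of "imperfect" matching, the normalization of counts to frequencies, and the L1 distance used to compare two spectra. It is not the authors' implementation.

```python
from typing import List, Sequence


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def compositional_spectrum(
    seq: str, words: Sequence[str], max_mismatches: int = 1
) -> List[float]:
    """Histogram of imperfect occurrences of each word along seq.

    Assumption: an occurrence is a window of seq within max_mismatches
    substitutions (Hamming distance) of the word. Counts are normalized
    to frequencies so that sequences of different lengths are comparable.
    """
    if not words:
        return []
    word_len = len(words[0])
    if any(len(w) != word_len for w in words):
        raise ValueError("all words must have the same fixed length")

    counts = [0] * len(words)
    # Slide a window of the word length over the sequence.
    for start in range(len(seq) - word_len + 1):
        window = seq[start:start + word_len]
        for j, word in enumerate(words):
            if hamming(window, word) <= max_mismatches:
                counts[j] += 1

    total = sum(counts)
    if total == 0:
        return [0.0] * len(words)
    return [c / total for c in counts]


def spectrum_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """L1 distance between two spectra (an illustrative choice of metric)."""
    return sum(abs(a - b) for a, b in zip(p, q))


# Hypothetical toy inputs; real use would involve long genomic stretches
# and a large predefined word set.
toy_words = ["AAT", "CCC"]
spectrum_a = compositional_spectrum("AAAA", toy_words, max_mismatches=1)
spectrum_c = compositional_spectrum("CCCC", toy_words, max_mismatches=1)
gap = spectrum_distance(spectrum_a, spectrum_c)
```

In this sketch, comparing spectra computed on two stretches of the same genome against spectra from different genomes would be the way to probe the similarity and dissimilarity described above. The brute-force double loop is chosen for clarity, not speed.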

Year:  2003        PMID: 12870770     DOI: 10.1023/a:1024553109779

Source DB:  PubMed          Journal:  Acta Biotheor        ISSN: 0001-5342            Impact factor:   1.774


  6 in total

1.  Molecular signatures for the main phyla of photosynthetic bacteria and their subgroups. (Review)

Authors:  Radhey S Gupta
Journal:  Photosynth Res       Date:  2010-04-23       Impact factor: 3.573

2.  Different clustering of genomes across life using the A-T-C-G and degenerate R-Y alphabets: early and late signaling on genome evolution?

Authors:  V Kirzhner; A Paz; Z Volkovich; E Nevo; A Korol
Journal:  J Mol Evol       Date:  2007-03-19       Impact factor: 2.395

3.  Microbial lifestyle and genome signatures.

Authors:  Chitra Dutta; Sandip Paul
Journal:  Curr Genomics       Date:  2012-04       Impact factor: 2.236

4.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

5.  n-Gram characterization of genomic islands in bacterial genomes.

Authors:  Gordana M Pavlović-Lazetić; Nenad S Mitić; Milos V Beljanski
Journal:  Comput Methods Programs Biomed       Date:  2008-12-19       Impact factor: 5.428

6.  Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.

Authors:  Valery Kirzhner; Dvora Toledano-Kitai; Zeev Volkovich
Journal:  PLoS One       Date:  2020-11-06       Impact factor: 3.240
