Literature DB >> 15984937

Four basic symmetry types in the universal 7-cluster structure of microbial genomic sequences.

Alexander N Gorban1, Tatyana G Popova, Andrei Y Zinovyev.   

Abstract

Coding information is the main source of heterogeneity (non-randomness) in the sequences of microbial genomes. The heterogeneity corresponds to a cluster structure in triplet distributions of relatively short genomic fragments (200-400 bp). We found a universal 7-cluster structure in microbial genomic sequences and explained its properties. We show that codon usage of bacterial genomes is a multi-linear function of their genomic G+C-content with high accuracy. Based on the analysis of 143 completely sequenced bacterial genomes available in Genbank in August 2004, we show that there are four "pure" types of the 7-cluster structure observed. All 143 cluster animated 3D-scatters are collected in a database which is made available on our web-site (http://www.ihes.fr/~zinovyev/7clusters). The findings can be readily introduced into software for gene prediction, sequence alignment or microbial genomes classification.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15984937

Source DB:  PubMed          Journal:  In Silico Biol        ISSN: 1386-6338


  2 in total

1.  Differentiation of regions with atypical oligonucleotide composition in bacterial genomes.

Authors:  Oleg N Reva; Burkhard Tümmler
Journal:  BMC Bioinformatics       Date:  2005-10-14       Impact factor: 3.169

2.  Amazing symmetrical clustering in chloroplast genomes.

Authors:  Michael G Sadovsky; Maria Yu Senashova; Andrew V Malyshev
Journal:  BMC Bioinformatics       Date:  2020-03-11       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.