
A nucleotide composition constraint of genome sequences.

Chun-Ting Zhang, Ren Zhang.

Abstract

Let a, c, g and t denote the occurrence frequencies of A, C, G and T, respectively, in a genome. We calculated the statistical quantity S = a^2 + c^2 + g^2 + t^2 for each of 809 genomes (11 archaea, 42 bacteria, 3 eukaryota, 90 phages, 36 viroids and 627 viruses) and 236 plasmids. We found that the strict inequality S < 1/3 holds for almost all of these genomes and plasmids. As a direct deduction from this observation, it is shown that (i) the statistical quantity S is a kind of genome order index, which is negatively correlated with the Shannon H function; (ii) S < 1/3 suggests that a minimal value of the Shannon H function is required for each genome; (iii) S as defined above would be a new biological statistical quantity, useful for describing the compositional features of genomes; (iv) by jointly considering Chargaff Parity Rule 2, it is shown that the genomic G + C content should lie between 0.211 and 0.789.
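The quantities in the abstract can be illustrated with a short Python sketch. It is not the authors' code: the function names are hypothetical, the Shannon H function is assumed to use log base 2 (the abstract does not state the base), and Chargaff Parity Rule 2 is taken to mean a = t and c = g. Under that assumption, writing x for the G + C content gives S = (x^2 + (1 - x)^2) / 2, so S < 1/3 reduces to x^2 - x + 1/6 < 0, whose roots are (1 ± sqrt(1/3)) / 2, which is approximately 0.211 and 0.789.

```python
import math
from collections import Counter


def base_frequencies(seq):
    """Return the frequencies (a, c, g, t) of a sequence, ignoring non-ACGT symbols."""
    counts = Counter(ch for ch in seq.upper() if ch in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return tuple(counts[b] / total for b in "ACGT")


def order_index(freqs):
    """S = a^2 + c^2 + g^2 + t^2, as defined in the abstract."""
    return sum(p * p for p in freqs)


def shannon_h(freqs):
    """Shannon H function; log base 2 is an assumption made for this sketch."""
    return -sum(p * math.log2(p) for p in freqs if p > 0)


def gc_bounds(s_max=1 / 3):
    """G + C range implied by S < s_max, assuming a = t and c = g.

    With x = G + C, S = (x^2 + (1 - x)^2) / 2, so S < s_max is equivalent to
    x^2 - x + (1 - 2 * s_max) / 2 < 0, with roots (1 +/- sqrt(4 * s_max - 1)) / 2.
    """
    if s_max < 0.25:
        raise ValueError("S cannot be smaller than 1/4 for four symbols")
    half_width = math.sqrt(4 * s_max - 1) / 2
    return 0.5 - half_width, 0.5 + half_width


if __name__ == "__main__":
    freqs = base_frequencies("ACGTACGTGGCCAATT")  # toy sequence, not real data
    print("S =", order_index(freqs))
    print("H =", shannon_h(freqs))
    print("G + C bounds:", gc_bounds())
```

A uniform composition (a = c = g = t = 1/4) gives the smallest possible S, 1/4, and the largest H; more skewed compositions push S up and H down, which is the negative correlation noted in point (i).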

Year:  2004        PMID: 15130543     DOI: 10.1016/j.compbiolchem.2004.02.002

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  8 in total

1.  'Genome order index' should not be used for defining compositional constraints in nucleotide sequences--a case study of the Z-curve.

Authors:  Eran Elhaik; Dan Graur; Kresimir Josić
Journal:  Biol Direct       Date:  2010-02-17       Impact factor: 4.540

2.  A rebuttal to the comments on the genome order index and the Z-curve.

Authors:  Ren Zhang
Journal:  Biol Direct       Date:  2011-02-16       Impact factor: 4.540

3.  GC-Profile: a web-based tool for visualizing and analyzing the variation of GC content in genomic sequences.

Authors:  Feng Gao; Chun-Ting Zhang
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

4.  Analysis of the relationship between genomic GC Content and patterns of base usage, codon usage and amino acid usage in prokaryotes: similar GC content adopts similar compositional frequencies regardless of the phylogenetic lineages.

Authors:  Hui-Qi Zhou; Lu-Wen Ning; Hui-Xiong Zhang; Feng-Biao Guo
Journal:  PLoS One       Date:  2014-09-25       Impact factor: 3.240

5.  Classifying genomic sequences by sequence feature analysis.

Authors:  Zhi Hua Liu; Dian Jiao; Xiao Sun
Journal:  Genomics Proteomics Bioinformatics       Date:  2005-11       Impact factor: 7.691

6.  Human Pol II promoter recognition based on primary sequences and free energy of dinucleotides.

Authors:  Jian-Yi Yang; Yu Zhou; Zu-Guo Yu; Vo Anh; Li-Qian Zhou
Journal:  BMC Bioinformatics       Date:  2008-02-24       Impact factor: 3.169

7.  A Brief Review: The Z-curve Theory and its Application in Genome Analysis.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  Curr Genomics       Date:  2014-04       Impact factor: 2.236

8.  Identification of Horizontally-transferred Genomic Islands and Genome Segmentation Points by Using the GC Profile Method.

Authors:  Ren Zhang; Hong-Yu Ou; Feng Gao; Hao Luo
Journal:  Curr Genomics       Date:  2014-04       Impact factor: 2.236

