Literature DB >> 1956790

Analysis of distribution of bases in the coding sequences by a diagrammatic technique.

C T Zhang1, R Zhang.   

Abstract

The frequencies of occurrence of four bases in the first, second and third codon positions and in the total coding sequences have been calculated by the codon usage table published in 1990 by Ikemura et al. The distribution of frequencies are further analysed in detail by a graphic technique presented recently by us. Formulas expressing the frequencies of four bases in the first and second codon positions in terms of frequencies of amino acids have been given. It is shown by the graphic analysis that for 90 species, in the first codon position the purine bases are dominant and in most cases G is the most dominant base. In the second codon position A is the most dominant base, while G is the least dominant base. In the third codon position the G + C content varies from 0.1 to 0.9, keeping the A + C content equal to 1/2 and G content equal to that of C, approximately. If the frequencies for bases A, C, G and U in the total coding sequences are denoted by a, c, g and u, respectively, it is found that the unequal formula: a2 + c2 + g2 + u2 less than 1/3, is valid for each of the 90 species including the human and E.coli etc.

Entities:  

Mesh:

Substances:

Year:  1991        PMID: 1956790      PMCID: PMC329145          DOI: 10.1093/nar/19.22.6313

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  8 in total

1.  Diagrammatic representation of the distribution of DNA bases and its applications.

Authors:  C T Zhang; R Zhang
Journal:  Int J Biol Macromol       Date:  1991-02       Impact factor: 6.953

2.  Codon usage in plant genes.

Authors:  E E Murray; J Lotzer; M Eberle
Journal:  Nucleic Acids Res       Date:  1989-01-25       Impact factor: 16.971

3.  Codon usage tabulated from the GenBank Genetic Sequence Data.

Authors:  S Aota; T Gojobori; F Ishibashi; T Maruyama; T Ikemura
Journal:  Nucleic Acids Res       Date:  1988       Impact factor: 16.971

4.  Translation framing code and frame-monitoring mechanism as suggested by the analysis of mRNA and 16 S rRNA nucleotide sequences.

Authors:  E N Trifonov
Journal:  J Mol Biol       Date:  1987-04-20       Impact factor: 5.469

5.  Codon usage tabulated from the GenBank genetic sequence data.

Authors:  T Maruyama; T Gojobori; S Aota; T Ikemura
Journal:  Nucleic Acids Res       Date:  1986       Impact factor: 16.971

Review 6.  Codon catalog usage and the genome hypothesis.

Authors:  R Grantham; C Gautier; M Gouy; R Mercier; A Pavé
Journal:  Nucleic Acids Res       Date:  1980-01-11       Impact factor: 16.971

7.  Codon catalog usage is a genome strategy modulated for gene expressivity.

Authors:  R Grantham; C Gautier; M Gouy; M Jacobzone; R Mercier
Journal:  Nucleic Acids Res       Date:  1981-01-10       Impact factor: 16.971

Review 8.  Codon usage and tRNA content in unicellular and multicellular organisms.

Authors:  T Ikemura
Journal:  Mol Biol Evol       Date:  1985-01       Impact factor: 16.240

  8 in total
  40 in total

1.  Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve.

Authors:  C T Zhang; J Wang
Journal:  Nucleic Acids Res       Date:  2000-07-15       Impact factor: 16.971

2.  ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes.

Authors:  Feng-Biao Guo; Hong-Yu Ou; Chun-Ting Zhang
Journal:  Nucleic Acids Res       Date:  2003-03-15       Impact factor: 16.971

3.  Classifier assessment and feature selection for recognizing short coding sequences of human genes.

Authors:  Kai Song; Ze Zhang; Tuo-Peng Tong; Fang Wu
Journal:  J Comput Biol       Date:  2012-03       Impact factor: 1.479

Review 4.  Identification of replication origins in archaeal genomes based on the Z-curve method.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  Archaea       Date:  2005-05       Impact factor: 3.273

5.  An analysis of base frequencies in the anti-sense strands corresponding to the 180 human protein coding sequences.

Authors:  C T Zhang; K C Chou
Journal:  Amino Acids       Date:  1996-09       Impact factor: 3.520

6.  Do "antisense proteins" exist?

Authors:  K C Chou; C T Zhang; D W Elrod
Journal:  J Protein Chem       Date:  1996-01

7.  Maximally efficient modeling of DNA sequence motifs at all levels of complexity.

Authors:  Gary D Stormo
Journal:  Genetics       Date:  2011-02-07       Impact factor: 4.562

8.  Isochore structures in the genome of the plant Arabidopsis thaliana.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  J Mol Evol       Date:  2004-08       Impact factor: 2.395

9.  'Genome order index' should not be used for defining compositional constraints in nucleotide sequences--a case study of the Z-curve.

Authors:  Eran Elhaik; Dan Graur; Kresimir Josić
Journal:  Biol Direct       Date:  2010-02-17       Impact factor: 4.540

10.  DIGA--a database of improved gene annotation for phytopathogens.

Authors:  Na Gao; Ling-Ling Chen; Hong-Fang Ji; Wei Wang; Ji-Wei Chang; Bei Gao; Lin Zhang; Shi-Cui Zhang; Hong-Yu Zhang
Journal:  BMC Genomics       Date:  2010-01-21       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.