Literature DB >> 15553086

Measuring the coding potential of genomic sequences through a combination of triplet occurrence patterns and RNY preference.

Christoforos Nikolaou1, Yannis Almirantis.   

Abstract

The distribution of n-tuplet frequencies is shown to strongly correlate with functionality when examining a genomic sequence in a reading-frame specific manner. The approach described herein applies a coarse-graining procedure, which is able to reveal aspects of triplet usage that are related to protein coding, while at the same time remaining species independent, based on a simple summation of suitable triplet occurrences measures. These quantities are ratios of simple frequencies to suitable mononucleotide-frequency products promoting the incidence of the RNY motif, preferred in the most widely used codons. A significant distinction of coding and noncoding sequences is achieved.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15553086     DOI: 10.1007/s00239-004-2626-7

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  39 in total

1.  Mutually symmetric and complementary triplets: differences in their use distinguish systematically between coding and non-coding genomic sequences.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  J Theor Biol       Date:  2003-08-21       Impact factor: 2.691

2.  Probability of occurrence of specific oligomers.

Authors:  E E Stückle; P J Nielsen; U Grob
Journal:  J Theor Biol       Date:  1992-12-07       Impact factor: 2.691

Review 3.  Translational selection and molecular evolution.

Authors:  H Akashi; A Eyre-Walker
Journal:  Curr Opin Genet Dev       Date:  1998-12       Impact factor: 5.578

4.  Ancient patterns in nucleic acid sequences.

Authors:  J C Shepherd
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

Review 5.  The isochore organization of the human genome.

Authors:  G Bernardi
Journal:  Annu Rev Genet       Date:  1989       Impact factor: 16.830

6.  On the origin of the periodicity of three in protein coding DNA sequences.

Authors:  G Gutiérrez; J L Oliver; A Marín
Journal:  J Theor Biol       Date:  1994-04-21       Impact factor: 2.691

7.  Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justification.

Authors:  J C Shepherd
Journal:  Proc Natl Acad Sci U S A       Date:  1981-03       Impact factor: 11.205

8.  The hypercycle. A principle of natural self-organization. Part A: Emergence of the hypercycle.

Authors:  M Eigen; P Schuster
Journal:  Naturwissenschaften       Date:  1977-11

9.  An analysis of gene-finding programs for Neurospora crassa.

Authors:  E Kraemer; J Wang; J Guo; S Hopkins; J Arnold
Journal:  Bioinformatics       Date:  2001-10       Impact factor: 6.937

10.  Dinucleotides and G+C content in human genes: opposite behavior of GpG, GpC, and TpC at II-III codon positions and in introns.

Authors:  G Gutiérrez; J L Oliver; A Marín
Journal:  J Mol Evol       Date:  1993-08       Impact factor: 2.395

View more
  4 in total

1.  "Word" preference in the genomic text and genome evolution: different modes of n-tuplet usage in coding and noncoding sequences.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  J Mol Evol       Date:  2005-07-19       Impact factor: 2.395

2.  Classifying coding DNA with nucleotide statistics.

Authors:  Nicolas Carels; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2009-10-28

3.  Universal Features for the Classification of Coding and Non-coding DNA Sequences.

Authors:  Nicolas Carels; Ramon Vidal; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2009-06-03

4.  WSE, a new sequence distance measure based on word frequencies.

Authors:  Jun Wang; Xiaoqi Zheng
Journal:  Math Biosci       Date:  2008-06-12       Impact factor: 2.144

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.