| Literature DB >> 15553086 |
Christoforos Nikolaou1, Yannis Almirantis.
Abstract
The distribution of n-tuplet frequencies is shown to strongly correlate with functionality when examining a genomic sequence in a reading-frame specific manner. The approach described herein applies a coarse-graining procedure, which is able to reveal aspects of triplet usage that are related to protein coding, while at the same time remaining species independent, based on a simple summation of suitable triplet occurrences measures. These quantities are ratios of simple frequencies to suitable mononucleotide-frequency products promoting the incidence of the RNY motif, preferred in the most widely used codons. A significant distinction of coding and noncoding sequences is achieved.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15553086 DOI: 10.1007/s00239-004-2626-7
Source DB: PubMed Journal: J Mol Evol ISSN: 0022-2844 Impact factor: 2.395