| Literature DB >> 10991547 |
P Bernaola-Galván1, I Grosse, P Carpena, J L Oliver, R Román-Roldán, H E Stanley.
Abstract
We present a new computational approach to finding borders between coding and noncoding DNA. This approach has two features: (i) DNA sequences are described by a 12-letter alphabet that captures the differential base composition at each codon position, and (ii) the search for the borders is carried out by means of an entropic segmentation method which uses only the general statistical properties of coding DNA. We find that this method is highly accurate in finding borders between coding and noncoding regions and requires no "prior training" on known data sets. Our results appear to be more accurate than those obtained with moving windows in the discrimination of coding from noncoding DNA.Mesh:
Substances:
Year: 2000 PMID: 10991547 DOI: 10.1103/PhysRevLett.85.1342
Source DB: PubMed Journal: Phys Rev Lett ISSN: 0031-9007 Impact factor: 9.161