Literature DB >> 10991547

Finding borders between coding and noncoding DNA regions by an entropic segmentation method.

P Bernaola-Galván1, I Grosse, P Carpena, J L Oliver, R Román-Roldán, H E Stanley.   

Abstract

We present a new computational approach to finding borders between coding and noncoding DNA. This approach has two features: (i) DNA sequences are described by a 12-letter alphabet that captures the differential base composition at each codon position, and (ii) the search for the borders is carried out by means of an entropic segmentation method which uses only the general statistical properties of coding DNA. We find that this method is highly accurate in finding borders between coding and noncoding regions and requires no "prior training" on known data sets. Our results appear to be more accurate than those obtained with moving windows in the discrimination of coding from noncoding DNA.

Mesh:

Substances:

Year:  2000        PMID: 10991547     DOI: 10.1103/PhysRevLett.85.1342

Source DB:  PubMed          Journal:  Phys Rev Lett        ISSN: 0031-9007            Impact factor:   9.161


  10 in total

1.  In search of coding and non-coding regions of DNA sequences based on balanced estimation of diffusion entropy.

Authors:  Jin Zhang; Wenqing Zhang; Huijie Yang
Journal:  J Biol Phys       Date:  2015-08-29       Impact factor: 1.365

2.  Splice site prediction with quadratic discriminant analysis using diversity measure.

Authors:  Lirong Zhang; Liaofu Luo
Journal:  Nucleic Acids Res       Date:  2003-11-01       Impact factor: 16.971

3.  Identification of exonic regions in DNA sequences using cross-correlation and noise suppression by discrete wavelet transform.

Authors:  Omid Abbasi; Ali Rostami; Ghader Karimian
Journal:  BMC Bioinformatics       Date:  2011-11-03       Impact factor: 3.169

4.  MIA: Mutual Information Analyzer, a graphic user interface program that calculates entropy, vertical and horizontal mutual information of molecular sequence sets.

Authors:  Flavio Lichtenstein; Fernando Antoneli; Marcelo R S Briones
Journal:  BMC Bioinformatics       Date:  2015-12-10       Impact factor: 3.169

Review 5.  Well-characterized sequence features of eukaryote genomes and implications for ab initio gene prediction.

Authors:  Ying Huang; Shi-Yi Chen; Feilong Deng
Journal:  Comput Struct Biotechnol J       Date:  2016-07-27       Impact factor: 7.271

6.  Identification of protein-coding regions in DNA sequences using a time-frequency filtering approach.

Authors:  Sitanshu Sekhar Sahu; Ganapati Panda
Journal:  Genomics Proteomics Bioinformatics       Date:  2011-04       Impact factor: 7.691

7.  Detection of genomic islands via segmental genome heterogeneity.

Authors:  Aaron J Arvey; Rajeev K Azad; Alpan Raval; Jeffrey G Lawrence
Journal:  Nucleic Acids Res       Date:  2009-07-09       Impact factor: 16.971

8.  Detecting the borders between coding and non-coding DNA regions in prokaryotes based on recursive segmentation and nucleotide doublets statistics.

Authors:  Suping Deng; Yixiang Shi; Liyun Yuan; Yixue Li; Guohui Ding
Journal:  BMC Genomics       Date:  2012-12-17       Impact factor: 3.969

9.  Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples.

Authors:  Zhandong Liu; Santosh S Venkatesh; Carlo C Maley
Journal:  BMC Genomics       Date:  2008-10-30       Impact factor: 3.969

10.  Comparing segmentations by applying randomization techniques.

Authors:  Niina Haiminen; Heikki Mannila; Evimaria Terzi
Journal:  BMC Bioinformatics       Date:  2007-05-23       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.