Literature DB >> 19040362

Coding region prediction based on a universal DNA sequence representation method.

Xianyang Jiang1, Dominique Lavenier, Stephen S-T Yau.   

Abstract

Graphical representation of DNA sequences provides a simple and intuitive way of viewing, anchoring, and comparing various gene structures, so a simple and non-degenerate method is attractive to both biologists and computational biologists. In this study, a universal graphical representation method for DNA sequences based on S.S.-T. Yau's method is presented. The method adopts a trigonometric function to represent the four nucleotides A, G, C, and T. Some interesting characteristics of the universal representation are introduced. We exploit frequency analysis with our representation method on DNA sequences, demonstrating possible applications in coding region prediction, and sequence analysis. Based on the statistically experimental results from this frequency analysis, a simple coding region predictor and an optimized one are presented. An experiment on the broadly accepted ROSETTA data set demonstrates that the performance of the optimized predictor is comparable to that of other popular methods.

Mesh:

Year:  2008        PMID: 19040362     DOI: 10.1089/cmb.2008.0041

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  2 in total

1.  Visualization of the protein-coding regions with a self adaptive spectral rotation approach.

Authors:  Bo Chen; Ping Ji
Journal:  Nucleic Acids Res       Date:  2010-10-14       Impact factor: 16.971

2.  Effective gene prediction by high resolution frequency estimator based on least-norm solution technique.

Authors:  Manidipa Roy; Soma Barman
Journal:  EURASIP J Bioinform Syst Biol       Date:  2014-01-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.