
The mutual information theory for the certification of rice coding sequences.

Nicolas Carels, Ramon Vidal, Ricardo Mansilla, Diego Frías.

Abstract

We report here the use of mutual information theory for the certification of annotated rice coding sequences from both the GenBank and TIGR databases. Considering coding sequences longer than 600 bp, we successfully screened out genes with aberrant compositional features. We found that they represent about 10% of both datasets after cleaning for gene redundancy. Most of the rejected accessions showed a different trend in the GC3% vs. GC2% plot compared to the set of accessions that have been published in international journals. This suggests the existence of a bias in the pattern recognition algorithms used by gene prediction programs.
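The abstract does not give the formulas used, so the following is only a minimal sketch of the two quantities it mentions, under stated assumptions: GC content at a codon position (GC2%, GC3%) computed on an in-frame coding sequence, and the standard Shannon mutual information between nucleotides separated by k positions, I(k) = Σ p_ab(k) log2[p_ab(k) / (p_a p_b)]. The function names `gc_at_codon_position` and `mutual_information` are hypothetical, and this is not claimed to be the authors' certification procedure.

```python
from collections import Counter
from math import log2


def gc_at_codon_position(cds: str, pos: int) -> float:
    """Percent of G or C at codon position pos (1, 2 or 3).

    Assumes cds is an in-frame coding sequence starting at codon position 1.
    """
    bases = cds.upper()[pos - 1::3]
    if not bases:
        return 0.0
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def mutual_information(seq: str, k: int) -> float:
    """Shannon mutual information (in bits) between nucleotides k positions apart.

    Illustrative standard definition; the paper's exact estimator is not
    stated in the abstract.
    """
    seq = seq.upper()
    pairs = [(seq[i], seq[i + k]) for i in range(len(seq) - k)]
    if not pairs:
        return 0.0
    n = len(pairs)
    joint = Counter(pairs)                 # counts of (a, b) at distance k
    left = Counter(a for a, _ in pairs)    # marginal counts of the first base
    right = Counter(b for _, b in pairs)   # marginal counts of the second base
    return sum(
        (c / n) * log2((c / n) / ((left[a] / n) * (right[b] / n)))
        for (a, b), c in joint.items()
    )


if __name__ == "__main__":
    cds = "ATGGCC"  # toy in-frame sequence, for illustration only
    print(gc_at_codon_position(cds, 2), gc_at_codon_position(cds, 3))
    print(mutual_information("ACACACAC", 1))
```

In a coding sequence, the codon structure tends to produce a period-3 signal in I(k), which is the kind of compositional feature such a screen could rely on; how the authors set their acceptance threshold is not described here.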

Year:  2004        PMID: 15196938     DOI: 10.1016/j.febslet.2004.05.026

Source DB:  PubMed          Journal:  FEBS Lett        ISSN: 0014-5793            Impact factor:   4.124


  4 in total

1.  Classifying coding DNA with nucleotide statistics.

Authors:  Nicolas Carels; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2009-10-28

2.  Universal Features for the Classification of Coding and Non-coding DNA Sequences.

Authors:  Nicolas Carels; Ramon Vidal; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2009-06-03

3.  Information theory applications for biological sequence analysis. (Review)

Authors:  Susana Vinga
Journal:  Brief Bioinform       Date:  2013-09-20       Impact factor: 11.622

4.  A Statistical Method without Training Step for the Classification of Coding Frame in Transcriptome Sequences.

Authors:  Nicolas Carels; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2013-01-23