Literature DB >> 18048129

Effective statistical features for coding and non-coding DNA sequence classification for yeast, C. elegans and human.

Alan Wee-Chung Liew, Yonghui Wu, Hong Yan, Mengsu Yang.   

Abstract

This study performs a quantitative evaluation of the different coding features in terms of their information content for the classification of coding and non-coding regions for three species. Our study indicated that coding features that are effective for yeast or C. elegans are generally not very effective for human, which has a short average exon length. By performing a correlation analysis, we identified a subset of human coding features with high discriminative power, but complementary in their information content. For this subset, a classification accuracy of up to 90% was obtained using a simple kNN classifier.

Entities:  

Mesh:

Year:  2005        PMID: 18048129     DOI: 10.1504/IJBRA.2005.007577

Source DB:  PubMed          Journal:  Int J Bioinform Res Appl        ISSN: 1744-5485


  3 in total

1.  LPS-induced galectin-3 oligomerization results in enhancement of neutrophil activation.

Authors:  Marise Lopes Fermino; Claudia Danella Polli; Karina Alves Toledo; Fu-Tong Liu; Dan K Hsu; Maria Cristina Roque-Barreira; Gabriela Pereira-da-Silva; Emerson Soares Bernardes; Lise Halbwachs-Mecarelli
Journal:  PLoS One       Date:  2011-10-21       Impact factor: 3.240

2.  On relationship of Z-curve and Fourier approaches for DNA coding sequence classification.

Authors:  Ngai-Fong Law; Kin-On Cheng; Wan-Chi Siu
Journal:  Bioinformation       Date:  2006-11-14

3.  Multi-scale parametric spectral analysis for exon detection in DNA sequences based on forward-backward linear prediction and singular value decomposition of the double-base curves.

Authors:  Miew Keen Choong; Hong Yan
Journal:  Bioinformation       Date:  2008-02-12
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.