Literature DB >> 16241270

Classification of short human exons and introns based on statistical features.

Yonghui Wu1, Alan Wee-Chung Liew, Hong Yan, Mengsu Yang.   

Abstract

The classification of human gene sequences into exons and introns is a difficult problem in DNA sequence analysis. In this paper, we define a set of features, called the simple Z (SZ) features, which is derived from the Z-curve features for the recognition of human exons and introns. The classification results show that SZ features, while fewer in numbers (three in total), can preserve the high recognition rate of the original nine Z-curve features. Since the size of SZ features is one-third of the Z-curve features, the dimensionality of the feature space is much smaller, and better recognition efficiency is achieved. If the stop codon feature is used together with the three SZ features, a recognition rate of up to 92% for short sequences of length <140 bp can be obtained.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 16241270     DOI: 10.1103/PhysRevE.67.061916

Source DB:  PubMed          Journal:  Phys Rev E Stat Nonlin Soft Matter Phys        ISSN: 1539-3755


  5 in total

1.  Integrating overlapping structures and background information of words significantly improves biological sequence comparison.

Authors:  Qi Dai; Lihua Li; Xiaoqing Liu; Yuhua Yao; Fukun Zhao; Michael Zhang
Journal:  PLoS One       Date:  2011-11-10       Impact factor: 3.240

2.  On relationship of Z-curve and Fourier approaches for DNA coding sequence classification.

Authors:  Ngai-Fong Law; Kin-On Cheng; Wan-Chi Siu
Journal:  Bioinformation       Date:  2006-11-14

3.  Short Exon Detection via Wavelet Transform Modulus Maxima.

Authors:  Xiaolei Zhang; Zhiwei Shen; Guishan Zhang; Yuanyu Shen; Miaomiao Chen; Jiaxiang Zhao; Renhua Wu
Journal:  PLoS One       Date:  2016-09-16       Impact factor: 3.240

4.  Exon prediction based on multiscale products of a genomic-inspired multiscale bilateral filtering.

Authors:  Xiaolei Zhang; Weijun Pan
Journal:  PLoS One       Date:  2019-03-21       Impact factor: 3.240

5.  A Brief Review: The Z-curve Theory and its Application in Genome Analysis.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  Curr Genomics       Date:  2014-04       Impact factor: 2.236

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.