| Literature DB >> 17597898 |
Ngai-Fong Law1, Kin-On Cheng, Wan-Chi Siu.
Abstract
Z-curve features are one of the popular features used in exon/intron classification. We showed that although both Z-curve and Fourier approaches are based on detecting 3-periodicity in coding regions, there are significant differences in their spectral formulation. From the spectral formulation of the Z-curve, we obtained three modified sequences that characterize different biological properties. Spectral analysis on the modified sequences showed a much more prominent 3-periodicity peak in coding regions than the Fourier approach. For long sequences, prominent peaks at 2Pi/3 are observed at coding regions, whereas for short sequences, clearly discernible peaks are still visible. Better classification can be obtained using spectral features derived from the modified sequences.Entities:
Year: 2006 PMID: 17597898 PMCID: PMC1891701 DOI: 10.6026/97320630001242
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Classification results of coding and non-coding sequences
| Yeast | Human | |
|---|---|---|
| FFT approach | ||
| Sensitivity | 0.8580 | 0.8627 |
| Specificity | 0.8922 | 0.2873 |
| Average | 0.8751 | 0.5750 |
| Proposed approach | ||
| Sensitivity | 0.8607 | 0.7607 |
| Specificity | 0.9558 | 0.8413 |
| Average | 0.9083 | 0.8010 |