A Fourier characteristic of coding sequences: origins and a non-Fourier approximation.

Changchuan Yin, Stephen S-T Yau.

Abstract

The 3-base periodicity, identified as a pronounced peak at the frequency N/3 (where N is the length of the DNA sequence) in the Fourier power spectrum of protein coding regions, is used as a marker in gene-finding algorithms to distinguish protein coding regions (exons) from noncoding regions (introns) of genomes. In this paper, we explain this phenomenon, which results from a nonuniform distribution of nucleotides in the three codon positions. There is a linear correlation between the nucleotide distributions in the three codon positions and the power spectrum at the frequency N/3. Furthermore, this study relates the length of a DNA sequence and the variance of its nucleotide distributions to the average Fourier power spectrum, which is the noise signal in gene-finding methods. The results presented in this paper provide an efficient way to compute the Fourier power spectrum at N/3 and the noise signal in gene-finding methods by calculating the nucleotide distributions in the three codon positions.
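
The link between codon-position counts and the spectrum at N/3 can be illustrated with a minimal Python sketch. It assumes the common binary indicator mapping (one 0/1 sequence per nucleotide), an unnormalized discrete Fourier transform, and a sequence length divisible by 3; the function names are hypothetical, and the closed form is the elementary identity |a + bω + cω²|² = a² + b² + c² - ab - ac - bc for a cube root of unity ω, not necessarily the notation used in the paper.

```python
import cmath

NUCLEOTIDES = "ACGT"


def position_counts(seq):
    """Count each nucleotide at the three positions n mod 3 (0-based n)."""
    counts = {x: [0, 0, 0] for x in NUCLEOTIDES}
    for n, base in enumerate(seq.upper()):
        if base in counts:
            counts[base][n % 3] += 1
    return counts


def power_at_n_over_3_from_counts(seq):
    """Total power at k = N/3 computed only from position counts (no Fourier transform)."""
    total = 0
    for f1, f2, f3 in position_counts(seq).values():
        total += f1 * f1 + f2 * f2 + f3 * f3 - (f1 * f2 + f1 * f3 + f2 * f3)
    return total


def power_at_n_over_3_direct(seq):
    """Same quantity from the DFT of the four indicator sequences, for comparison."""
    n_len = len(seq)
    if n_len == 0 or n_len % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    k = n_len // 3
    total = 0.0
    for x in NUCLEOTIDES:
        # DFT coefficient of the indicator sequence of nucleotide x at frequency k
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k * n / n_len)
            for n, base in enumerate(seq.upper())
            if base == x
        )
        total += abs(coeff) ** 2
    return total


if __name__ == "__main__":
    example = "ATGGCCAAGCTGACCTAA"  # made-up 18-base sequence, for illustration only
    print(power_at_n_over_3_from_counts(example))
    print(power_at_n_over_3_direct(example))
```

When every nucleotide is spread evenly over the three positions, the count-based expression is zero, which matches the abstract's point that the peak arises from a nonuniform distribution across codon positions.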

Year: 2005   PMID: 16305326   DOI: 10.1089/cmb.2005.12.1153

Source DB: PubMed   Journal: J Comput Biol   ISSN: 1066-5277   Impact factor: 1.479

