Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis.

S V Buldyrev, A L Goldberger, S Havlin, R N Mantegna, M E Matsa, C K Peng, M Simons, H E Stanley.

Abstract

An open question in computational molecular biology is whether long-range correlations are present in both coding and noncoding DNA or only in the latter. To answer this question, we consider all 33301 coding and all 29453 noncoding eukaryotic sequences, each longer than 512 base pairs (bp), in the present release of GenBank to determine whether there is any statistically significant distinction in their long-range correlation properties. Standard fast Fourier transform (FFT) analysis indicates that coding sequences have practically no correlations in the range from 10 bp to 100 bp (spectral exponent beta = 0.00 ± 0.04, where the uncertainty is two standard deviations). In contrast, for noncoding sequences, the average value of the spectral exponent beta is positive (0.16 ± 0.05), which unambiguously shows the presence of long-range correlations. We also separately analyze the 874 coding and the 1157 noncoding sequences that have more than 4096 bp and find a larger region of power-law behavior. We calculate the probability that these two data sets (coding and noncoding) were drawn from the same distribution and find that it is less than 10^-10. We obtain independent confirmation of these findings using the method of detrended fluctuation analysis (DFA), which is designed to treat sequences with statistical heterogeneity, such as DNA's known mosaic structure ("patchiness") arising from the nonstationarity of nucleotide concentration. The near-perfect agreement between the two independent analysis methods, FFT and DFA, increases confidence in the reliability of our conclusion.
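The two analyses named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code, and several details are assumptions made only for illustration: the purine/pyrimidine mapping to +1/-1 (the abstract does not state which numeric mapping is used), the function names, the DFA box sizes, and the use of a single unaveraged periodogram. The spectral exponent beta is taken as the negative log-log slope of the power spectrum over periods of 10 bp to 100 bp, and the DFA exponent alpha as the log-log slope of the detrended fluctuation against box size. For an ideal power-law spectrum the two are commonly related by beta = 2*alpha - 1, so uncorrelated sequences should give beta near 0 and alpha near 0.5.

```python
import numpy as np

def to_numeric(seq):
    """Map purines (A, G) to +1 and pyrimidines (C, T) to -1.

    Assumed mapping for illustration; other characters (e.g. N) raise KeyError.
    """
    table = {"A": 1.0, "G": 1.0, "C": -1.0, "T": -1.0}
    return np.array([table[base] for base in seq.upper()])

def spectral_exponent(u, min_len=10, max_len=100):
    """Estimate beta in S(f) ~ f^(-beta) for periods from min_len to max_len bp."""
    u = u - u.mean()
    power = np.abs(np.fft.rfft(u)) ** 2          # raw periodogram
    freqs = np.fft.rfftfreq(len(u), d=1.0)       # cycles per base pair
    mask = (freqs >= 1.0 / max_len) & (freqs <= 1.0 / min_len)
    slope, _ = np.polyfit(np.log(freqs[mask]), np.log(power[mask]), 1)
    return -slope

def dfa_exponent(u, box_sizes=(8, 16, 32, 64, 128)):
    """Estimate alpha in F(n) ~ n^alpha using linear detrending in each box."""
    walk = np.cumsum(u - u.mean())               # integrated "DNA walk"
    fluctuations = []
    for n in box_sizes:
        n_boxes = len(walk) // n
        boxes = walk[: n_boxes * n].reshape(n_boxes, n).T   # shape (n, n_boxes)
        x = np.arange(n)
        coeffs = np.polyfit(x, boxes, 1)         # one line fit per box
        trend = np.outer(x, coeffs[0]) + coeffs[1]
        fluctuations.append(np.sqrt(np.mean((boxes - trend) ** 2)))
    alpha, _ = np.polyfit(np.log(box_sizes), np.log(fluctuations), 1)
    return alpha
```

Estimates from a single sequence are noisy; the abstract reports exponents averaged over tens of thousands of sequences, which this sketch does not attempt to reproduce.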

Keywords:  NASA Discipline Cardiopulmonary; NASA Discipline Number 14-10; Non-NASA Center

Year: 1995    PMID: 9963221    DOI: 10.1103/physreve.51.5084

Source DB: PubMed    Journal: Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics    ISSN: 1063-651X

