Literature DB >> 11690062

Statistical analysis of the DNA sequence of human chromosome 22.

D Holste1, I Grosse, H Herzel.   

Abstract

We study statistical patterns in the DNA sequence of human chromosome 22, the first completely sequenced human chromosome. We find that (i). the 33.4 x 10(6) nucleotide long human chromosome exhibits long-range power-law correlations over more than four orders of magnitude, (ii). the entropies H(n) of the frequency distribution of oligonucleotides of length n (n-mers) grow sublinearly with increasing n, indicating the presence of higher-order correlations for all of the studied lengths 1<or=n<or=10, and (iii). the generalized entropies H(n)(q) of n-mers decrease monotonically with increasing q and the decay of H(n)(q) with q becomes steeper with increasing n<or=10, indicating that the frequency distribution of oligonucleotides becomes increasingly nonuniform as the length n increases. We investigate to what degree known biological features may explain the observed statistical patterns. We find that (iv). the presence of interspersed repeats may cause the sublinear increase of H(n) with n, and that (v). the presence of monomeric tandem repeats as well as the suppression of CG dinucleotides may cause the observed decay of H(n)(q) with q.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11690062     DOI: 10.1103/PhysRevE.64.041917

Source DB:  PubMed          Journal:  Phys Rev E Stat Nonlin Soft Matter Phys        ISSN: 1539-3755


  5 in total

1.  Beyond the consensus: dissecting within-host viral population diversity of foot-and-mouth disease virus by using next-generation genome sequencing.

Authors:  Caroline F Wright; Marco J Morelli; Gaël Thébaud; Nick J Knowles; Pawel Herzyk; David J Paton; Daniel T Haydon; Donald P King
Journal:  J Virol       Date:  2010-12-15       Impact factor: 5.103

2.  Informational laws of genome structures.

Authors:  Vincenzo Bonnici; Vincenzo Manca
Journal:  Sci Rep       Date:  2016-06-29       Impact factor: 4.379

3.  Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats.

Authors:  Vladimir Paar; Nenad Pavin; Ivan Basar; Marija Rosandić; Matko Gluncić; Nils Paar
Journal:  BMC Bioinformatics       Date:  2008-11-03       Impact factor: 3.169

4.  Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples.

Authors:  Zhandong Liu; Santosh S Venkatesh; Carlo C Maley
Journal:  BMC Genomics       Date:  2008-10-30       Impact factor: 3.969

5.  Local Renyi entropic profiles of DNA sequences.

Authors:  Susana Vinga; Jonas S Almeida
Journal:  BMC Bioinformatics       Date:  2007-10-16       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.