| Literature DB >> 26857665 |
Giulia Paci1, Giampaolo Cristadoro2, Barbara Monti3, Marco Lenci4, Mirko Degli Esposti2, Gastone C Castellani5, Daniel Remondini6.
Abstract
We perform a statistical study of the distances between successive occurrences of a given dinucleotide in the DNA sequence for a number of organisms of different complexity. Our analysis highlights peculiar features of the CG dinucleotide distribution in mammalian DNA, pointing towards a connection with the role of such dinucleotide in DNA methylation. While the CG distributions of mammals exhibit exponential tails with comparable parameters, the picture for the other organisms studied (e.g. fish, insects, bacteria and viruses) is more heterogeneous, possibly because in these organisms DNA methylation has different functional roles. Our analysis suggests that the distribution of the distances between CG dinucleotides provides useful insights into characterizing and classifying organisms in terms of methylation functionalities.Entities:
Keywords: DNA sequence; first return times; stochastic processes
Mesh:
Substances:
Year: 2016 PMID: 26857665 DOI: 10.1098/rsta.2015.0227
Source DB: PubMed Journal: Philos Trans A Math Phys Eng Sci ISSN: 1364-503X Impact factor: 4.226