Literature DB >> 21926435

Inter-dinucleotide distances in the human genome: an analysis of the whole-genome and protein-coding distributions.

Carlos A C Bastos1, Vera Afreixo, Armando J Pinho, Sara P Garcia, João M O S Rodrigues, Paulo J S G Ferreira.   

Abstract

We study the inter-dinucleotide distance distributions in the human genome, both in the whole-genome and protein-coding regions. The inter-dinucleotide distance is defined as the distance to the next occurrence of the same dinucleotide. We consider the 16 sequences of inter-dinucleotide distances and two reading frames. Our results show a period-3 oscillation in the protein-coding inter-dinucleotide distance distributions that is absent from the whole-genome distributions. We also compare the distance distribution of each dinucleotide to a reference distribution, that of a random sequence generated with the same dinucleotide abundances, revealing the CG dinucleotide as the one with the highest cumulative relative error for the first 60 distances. Moreover, the distance distribution of each dinucleotide is compared to the distance distribution of all other dinucleotides using the Kullback-Leibler divergence. We find that the distance distribution of a dinucleotide and that of its reversed complement are very similar, hence, the divergence between them is very small. This is an interesting finding that may give evidence of a stronger parity rule than Chargaff's second parity rule. Copyright 2011 The Author(s). Published by Journal of Integrative Bioinformatics.

Entities:  

Mesh:

Year:  2011        PMID: 21926435     DOI: 10.2390/biecoll-jib-2011-172

Source DB:  PubMed          Journal:  J Integr Bioinform        ISSN: 1613-4516


  2 in total

1.  Statistical modelling of CG interdistance across multiple organisms.

Authors:  Merlotti A; Faria do Valle I; Castellani G; Remondini D
Journal:  BMC Bioinformatics       Date:  2018-10-15       Impact factor: 3.169

2.  A Markov chain-based feature extraction method for classification and identification of cancerous DNA sequences.

Authors:  Amin Khodaei; Mohammad-Reza Feizi-Derakhshi; Behzad Mozaffari-Tazehkand
Journal:  Bioimpacts       Date:  2020-03-24
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.