Literature DB >> 15501469

Rényi continuous entropy of DNA sequences.

Susana Vinga1, Jonas S Almeida.   

Abstract

Entropy measures of DNA sequences estimate their randomness or, inversely, their repeatability. L-block Shannon discrete entropy accounts for the empirical distribution of all length-L words and has convergence problems for finite sequences. A new entropy measure that extends Shannon's formalism is proposed. Renyi's quadratic entropy calculated with Parzen window density estimation method applied to CGR/USM continuous maps of DNA sequences constitute a novel technique to evaluate sequence global randomness without some of the former method drawbacks. The asymptotic behaviour of this new measure was analytically deduced and the calculation of entropies for several synthetic and experimental biological sequences was performed. The results obtained were compared with the distributions of the null model of randomness obtained by simulation. The biological sequences have shown a different p-value according to the kernel resolution of Parzen's method, which might indicate an unknown level of organization of their patterns. This new technique can be very useful in the study of DNA sequence complexity and provide additional tools for DNA entropy estimation. The main MATLAB applications developed and additional material are available at the webpage . Specialized functions can be obtained from the authors.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15501469     DOI: 10.1016/j.jtbi.2004.06.030

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  15 in total

1.  Investigating Focal Adhesion Substructures by Localization Microscopy.

Authors:  Hendrik Deschout; Ilia Platzman; Daniel Sage; Lely Feletti; Joachim P Spatz; Aleksandra Radenovic
Journal:  Biophys J       Date:  2017-12-05       Impact factor: 4.033

Review 2.  Entropy Perspectives of Molecular and Evolutionary Biology.

Authors:  Bartolomé Sabater
Journal:  Int J Mol Sci       Date:  2022-04-07       Impact factor: 6.208

3.  Learning vector quantization as an interpretable classifier for the detection of SARS-CoV-2 types based on their RNA sequences.

Authors:  Marika Kaden; Katrin Sophie Bohnsack; Mirko Weber; Mateusz Kudła; Kaja Gutowska; Jacek Blazewicz; Thomas Villmann
Journal:  Neural Comput Appl       Date:  2021-04-27       Impact factor: 5.606

4.  Pattern matching through Chaos Game Representation: bridging numerical and discrete data structures for biological sequence analysis.

Authors:  Susana Vinga; Alexandra M Carvalho; Alexandre P Francisco; Luís Ms Russo; Jonas S Almeida
Journal:  Algorithms Mol Biol       Date:  2012-05-02       Impact factor: 1.405

5.  Biological sequences as pictures: a generic two dimensional solution for iterated maps.

Authors:  Jonas S Almeida; Susana Vinga
Journal:  BMC Bioinformatics       Date:  2009-03-31       Impact factor: 3.169

6.  Computing distribution of scale independent motifs in biological sequences.

Authors:  Jonas S Almeida; Susana Vinga
Journal:  Algorithms Mol Biol       Date:  2006-10-18       Impact factor: 1.405

7.  Entropic Profiler - detection of conservation in genomes using information theory.

Authors:  Francisco Fernandes; Ana T Freitas; Jonas S Almeida; Susana Vinga
Journal:  BMC Res Notes       Date:  2009-05-05

8.  Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples.

Authors:  Zhandong Liu; Santosh S Venkatesh; Carlo C Maley
Journal:  BMC Genomics       Date:  2008-10-30       Impact factor: 3.969

9.  Local Renyi entropic profiles of DNA sequences.

Authors:  Susana Vinga; Jonas S Almeida
Journal:  BMC Bioinformatics       Date:  2007-10-16       Impact factor: 3.169

10.  A generalized topological entropy for analyzing the complexity of DNA sequences.

Authors:  Shuilin Jin; Renjie Tan; Qinghua Jiang; Li Xu; Jiajie Peng; Yong Wang; Yadong Wang
Journal:  PLoS One       Date:  2014-02-12       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.