Literature DB >> 28113780

Efficient Algorithms for Sequence Analysis with Entropic Profiles.

Cinzia Pizzi, Mattia Ornamenti, Simone Spangaro, Simona E Rombo, Laxmi Parida.   

Abstract

Entropy, being closely related to repetitiveness and compressibility, is a widely used information-related measure to assess the degree of predictability of a sequence. Entropic profiles are based on information theory principles, and can be used to study the under-/over-representation of subwords, by also providing information about the scale of conserved DNA regions. Here, we focus on the algorithmic aspects related to entropic profiles. In particular, we propose linear time algorithms for their computation that rely on suffix-based data structures, more specifically on the truncated suffix tree (TST) and on the enhanced suffix array (ESA). We performed an extensive experimental campaign showing that our algorithms, beside being faster, make it possible the analysis of longer sequences, even for high degrees of resolution, than state of the art algorithms.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 28113780     DOI: 10.1109/TCBB.2016.2620143

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  Efficient computation of spaced seed hashing with block indexing.

Authors:  Samuele Girotto; Matteo Comin; Cinzia Pizzi
Journal:  BMC Bioinformatics       Date:  2018-11-30       Impact factor: 3.169

2.  FEDRO: a software tool for the automatic discovery of candidate ORFs in plants with c →u RNA editing.

Authors:  Fabio Fassetti; Claudia Giallombardo; Ofelia Leone; Luigi Palopoli; Simona E Rombo; Adolfo Saiardi
Journal:  BMC Bioinformatics       Date:  2019-04-18       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.