Literature DB >> 26356018

Fast Entropic Profiler: An Information Theoretic Approach for the Discovery of Patterns in Genomes.

Matteo Comin, Morris Antonello.   

Abstract

Information theory has been used for quite some time in the area of computational biology. In this paper we present a pattern discovery method, named Fast Entropic Profiler, that is based on a local entropy function that captures the importance of a region with respect to the whole genome. The local entropy function has been introduced by Vinga and Almeida in , here we discuss and improve the original formulation. We provide a linear time and linear space algorithm called Fast Entropic Profiler ( FastEP), as opposed to the original quadratic implementation. Moreover we propose an alternative normalization that can be also efficiently implemented. We show that FastEP is suitable for large genomes and for the discovery of patterns with unbounded length. FastEP is available at http://www.dei.unipd.it/~ciompin/main/FastEP.html.

Mesh:

Year:  2014        PMID: 26356018     DOI: 10.1109/TCBB.2013.2297924

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  8 in total

1.  Assembly-free genome comparison based on next-generation sequencing reads and variable length patterns.

Authors:  Matteo Comin; Michele Schimd
Journal:  BMC Bioinformatics       Date:  2014-09-10       Impact factor: 3.169

2.  Clustering of reads with alignment-free measures and quality values.

Authors:  Matteo Comin; Andrea Leoni; Michele Schimd
Journal:  Algorithms Mol Biol       Date:  2015-01-28       Impact factor: 1.405

3.  On the comparison of regulatory sequences with multiple resolution Entropic Profiles.

Authors:  Matteo Comin; Morris Antonello
Journal:  BMC Bioinformatics       Date:  2016-03-18       Impact factor: 3.169

4.  Fast comparison of genomic and meta-genomic reads with alignment-free measures based on quality values.

Authors:  Matteo Comin; Michele Schimd
Journal:  BMC Med Genomics       Date:  2016-08-12       Impact factor: 3.063

5.  Efficient computation of spaced seed hashing with block indexing.

Authors:  Samuele Girotto; Matteo Comin; Cinzia Pizzi
Journal:  BMC Bioinformatics       Date:  2018-11-30       Impact factor: 3.169

6.  Benchmarking of alignment-free sequence comparison methods.

Authors:  Andrzej Zielezinski; Hani Z Girgis; Guillaume Bernard; Chris-Andre Leimeister; Kujin Tang; Thomas Dencker; Anna Katharina Lau; Sophie Röhling; Jae Jin Choi; Michael S Waterman; Matteo Comin; Sung-Hou Kim; Susana Vinga; Jonas S Almeida; Cheong Xin Chan; Benjamin T James; Fengzhu Sun; Burkhard Morgenstern; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2019-07-25       Impact factor: 13.583

7.  FSH: fast spaced seed hashing exploiting adjacent hashes.

Authors:  Samuele Girotto; Matteo Comin; Cinzia Pizzi
Journal:  Algorithms Mol Biol       Date:  2018-03-22       Impact factor: 1.405

8.  MetaCon: unsupervised clustering of metagenomic contigs with probabilistic k-mers statistics and coverage.

Authors:  Jia Qian; Matteo Comin
Journal:  BMC Bioinformatics       Date:  2019-11-22       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.