Literature DB >> 27138275

ALFRED: A Practical Method for Alignment-Free Distance Computation.

Sharma V Thankachan1, Sriram P Chockalingam2, Yongchao Liu1, Alberto Apostolico1, Srinivas Aluru1.   

Abstract

Alignment-free approaches are gaining persistent interest in many sequence analysis applications such as phylogenetic inference and metagenomic classification/clustering, especially for large-scale sequence datasets. Besides the widely used k-mer methods, the average common substring (ACS) approach has emerged to be one of the well-known alignment-free approaches. Two recent works further generalize this ACS approach by allowing a bounded number k of mismatches in the common substrings, relying on approximation (linear time) and exact computation, respectively. Albeit having a good worst-case time complexity [Formula: see text], the exact approach is complex and unlikely to be efficient in practice. Herein, we present ALFRED, an alignment-free distance computation method, which solves the generalized common substring search problem via exact computation. Compared to the theoretical approach, our algorithm is easier to implement and more practical to use, while still providing highly competitive theoretical performances with an expected run-time of [Formula: see text]. By applying our program to phylogenetic inference as a case study, we find that our program facilitates to exactly reconstruct the topology of the reference phylogenetic tree for a set of 27 primate mitochondrial genomes, at reasonably acceptable speed. ALFRED is implemented in C++ programming language and the source code is freely available online.

Entities:  

Keywords:  approximate string matching; phylogenetic tree; suffix trees

Mesh:

Year:  2016        PMID: 27138275     DOI: 10.1089/cmb.2015.0217

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  10 in total

1.  Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences.

Authors:  Chris-Andre Leimeister; Jendrik Schellhorn; Svenja Dörrer; Michael Gerth; Christoph Bleidorn; Burkhard Morgenstern
Journal:  Gigascience       Date:  2019-03-01       Impact factor: 6.524

2.  De novo genome assembly and in natura epigenomics reveal salinity-induced DNA methylation in the mangrove tree Bruguiera gymnorhiza.

Authors:  Matin Miryeganeh; Ferdinand Marlétaz; Daria Gavriouchkina; Hidetoshi Saze
Journal:  New Phytol       Date:  2021-10-16       Impact factor: 10.323

3.  rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison.

Authors:  Lars Hahn; Chris-André Leimeister; Rachid Ounit; Stefano Lonardi; Burkhard Morgenstern
Journal:  PLoS Comput Biol       Date:  2016-10-19       Impact factor: 4.475

4.  A Novel Method for Alignment-free DNA Sequence Similarity Analysis Based on the Characterization of Complex Networks.

Authors:  Jie Zhou; Pianyu Zhong; Tinghui Zhang
Journal:  Evol Bioinform Online       Date:  2016-10-06       Impact factor: 1.625

5.  Phylogeny reconstruction based on the length distribution of k-mismatch common substrings.

Authors:  Burkhard Morgenstern; Svenja Schöbel; Chris-André Leimeister
Journal:  Algorithms Mol Biol       Date:  2017-12-11       Impact factor: 1.405

6.  Fast and accurate phylogeny reconstruction using filtered spaced-word matches.

Authors:  Chris-André Leimeister; Salma Sohrabi-Jahromi; Burkhard Morgenstern
Journal:  Bioinformatics       Date:  2017-04-01       Impact factor: 6.937

7.  A greedy alignment-free distance estimator for phylogenetic inference.

Authors:  Sharma V Thankachan; Sriram P Chockalingam; Yongchao Liu; Ambujam Krishnan; Srinivas Aluru
Journal:  BMC Bioinformatics       Date:  2017-06-07       Impact factor: 3.169

8.  Alignment-free method for DNA sequence clustering using Fuzzy integral similarity.

Authors:  Ajay Kumar Saw; Garima Raj; Manashi Das; Narayan Chandra Talukdar; Binod Chandra Tripathy; Soumyadeep Nandi
Journal:  Sci Rep       Date:  2019-03-06       Impact factor: 4.379

Review 9.  Alignment-free sequence comparison: benefits, applications, and tools.

Authors:  Andrzej Zielezinski; Susana Vinga; Jonas Almeida; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2017-10-03       Impact factor: 13.583

10.  An alignment-free heuristic for fast sequence comparisons with applications to phylogeny reconstruction.

Authors:  Sriram P Chockalingam; Jodh Pannu; Sahar Hooshmand; Sharma V Thankachan; Srinivas Aluru
Journal:  BMC Bioinformatics       Date:  2020-11-18       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.