Bernhard Haubold1, Linda Krause, Thomas Horn, Peter Pfaffelhuber. 1. Department of Evolutionary Genetics, Max-Planck-Institute for Evolutionary Biology, 24306 Plön, Institute for Neuro- and Bioinformatics, Lübeck University, 23562 Lübeck and Mathematical Stochastics, Mathematical Institute, Freiburg University, 79104 Freiburg, Germany.
Abstract
MOTIVATION: Why recombination? is one of the central questions in biology. This has led to a host of methods for quantifying recombination from sequence data. These methods are usually based on aligned DNA sequences. Here, we propose an efficient alignment-free alternative. RESULTS: Our method is based on the distribution of match lengths, which we look up using enhanced suffix arrays. By eliminating the alignment step, the test becomes fast enough for application to whole bacterial genomes. Using simulations we show that our test has similar power as established tests when applied to long pairs of sequences. When applied to 58 genomes of Escherichia coli, we pick up the strongest recombination signal from a 125 kb horizontal gene transfer engineered 20 years ago. AVAILABILITY AND IMPLEMENTATION: We have implemented our method in the command-line program rush. Its C sources and documentation are available under the GNU General Public License from http://guanine.evolbio.mpg.de/rush/.
MOTIVATION: Why recombination? is one of the central questions in biology. This has led to a host of methods for quantifying recombination from sequence data. These methods are usually based on aligned DNA sequences. Here, we propose an efficient alignment-free alternative. RESULTS: Our method is based on the distribution of match lengths, which we look up using enhanced suffix arrays. By eliminating the alignment step, the test becomes fast enough for application to whole bacterial genomes. Using simulations we show that our test has similar power as established tests when applied to long pairs of sequences. When applied to 58 genomes of Escherichia coli, we pick up the strongest recombination signal from a 125 kb horizontal gene transfer engineered 20 years ago. AVAILABILITY AND IMPLEMENTATION: We have implemented our method in the command-line program rush. Its C sources and documentation are available under the GNU General Public License from http://guanine.evolbio.mpg.de/rush/.
Authors: Peter C Turner; Lorraine P Yomano; Laura R Jarboe; Sean W York; Christy L Baggett; Brélan E Moritz; Emily B Zentz; K T Shanmugam; Lonnie O Ingram Journal: J Ind Microbiol Biotechnol Date: 2011-11-11 Impact factor: 3.346
Authors: Guillaume Bernard; Cheong Xin Chan; Yao-Ban Chan; Xin-Yi Chua; Yingnan Cong; James M Hogan; Stefan R Maetschke; Mark A Ragan Journal: Brief Bioinform Date: 2019-03-22 Impact factor: 11.622