Literature DB >> 19825795

Efficient estimation of pairwise distances between genomes.

Mirjana Domazet-Loso1, Bernhard Haubold.   

Abstract

MOTIVATION: Genome comparison is central to contemporary genomics and typically relies on sequence alignment. However, genome-wide alignments are difficult to compute. We have, therefore, recently developed an accurate alignment-free estimator of the number of substitutions per site based on the lengths of exact matches between pairs of sequences. The previous implementation of this measure requires n(n-1) suffix tree constructions and traversals, where n is the number of sequences analyzed. This does not scale well for large n.
RESULTS: We present an algorithm to extract pairwise distances in a single traversal of a single suffix tree containing n sequences. As a result, the run time of the suffix tree construction phase of our algorithm is reduced from O(n(2)L) to O(nL), where L is the length of each sequence. We implement this algorithm in the program kr version 2 and apply it to 825 HIV genomes, 13 genomes of enterobacteria and the complete genomes of 12 Drosophila species. We show that, depending on the input dataset, the new program is at least 10 times faster than its predecessor. AVAILABILITY: Version 2 of kr can be tested via a web interface at http://guanine.evolbio.mpg.de/kr2/. It is written in standard C and its source code is available under the GNU General Public License from the same web site. CONTACT: haubold@evolbio.mpg.de Supplementary informations: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2009        PMID: 19825795     DOI: 10.1093/bioinformatics/btp590

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  Alignment-free detection of horizontal gene transfer between closely related bacterial genomes.

Authors:  Mirjana Domazet-Lošo; Bernhard Haubold
Journal:  Mob Genet Elements       Date:  2011-09-01

2.  An alignment-free test for recombination.

Authors:  Bernhard Haubold; Linda Krause; Thomas Horn; Peter Pfaffelhuber
Journal:  Bioinformatics       Date:  2013-09-23       Impact factor: 6.937

3.  Gene duplications in prokaryotes can be associated with environmental adaptation.

Authors:  Marit S Bratlie; Jostein Johansen; Brad T Sherman; Da Wei Huang; Richard A Lempicki; Finn Drabløs
Journal:  BMC Genomics       Date:  2010-10-20       Impact factor: 3.969

4.  Identification of antimicrobial peptides from the human gut microbiome using deep learning.

Authors:  Yue Ma; Zhengyan Guo; Binbin Xia; Yuwei Zhang; Xiaolin Liu; Ying Yu; Na Tang; Xiaomei Tong; Min Wang; Xin Ye; Jie Feng; Yihua Chen; Jun Wang
Journal:  Nat Biotechnol       Date:  2022-03-03       Impact factor: 68.164

5.  A novel hierarchical clustering algorithm for gene sequences.

Authors:  Dan Wei; Qingshan Jiang; Yanjie Wei; Shengrui Wang
Journal:  BMC Bioinformatics       Date:  2012-07-23       Impact factor: 3.169

6.  Sequence comparison alignment-free approach based on suffix tree and L-words frequency.

Authors:  Inês Soares; Ana Goios; António Amorim
Journal:  ScientificWorldJournal       Date:  2012-09-10

7.  A phylogenetic analysis of the brassicales clade based on an alignment-free sequence comparison method.

Authors:  Klas Hatje; Martin Kollmar
Journal:  Front Plant Sci       Date:  2012-08-29       Impact factor: 5.753

8.  Alignment-free population genomics: an efficient estimator of sequence diversity.

Authors:  Bernhard Haubold; Peter Pfaffelhuber
Journal:  G3 (Bethesda)       Date:  2012-08-01       Impact factor: 3.154

9.  Co-phylog: an assembly-free phylogenomic approach for closely related organisms.

Authors:  Huiguang Yi; Li Jin
Journal:  Nucleic Acids Res       Date:  2013-01-18       Impact factor: 16.971

10.  Kmacs: the k-mismatch average common substring approach to alignment-free sequence comparison.

Authors:  Chris-Andre Leimeister; Burkhard Morgenstern
Journal:  Bioinformatics       Date:  2014-05-13       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.