Literature DB >> 19407343

A novel heuristic for local multiple alignment of interspersed DNA repeats.

Todd J Treangen1, Aaron E Darling, Guillaume Achaz, Mark A Ragan, Xavier Messeguer, Eduardo P C Rocha.   

Abstract

Pairwise local sequence alignment methods have been the prevailing technique to identify homologous nucleotides between related species. However, existing methods that identify and align all homologous nucleotides in one or more genomes have suffered from poor scalability and limited accuracy. We propose a novel method that couples a gapped extension heuristic with an efficient filtration method for identifying interspersed repeats in genome sequences. During gapped extension, we use the MUSCLE implementation of progressive global multiple alignment with iterative refinement. The resulting gapped extensions potentially contain alignments of unrelated sequence. We detect and remove such undesirable alignments using a hidden Markov model (HMM) to predict the posterior probability of homology. The HMM emission frequencies for nucleotide substitutions can be derived from any time-reversible nucleotide substitution matrix. We evaluate the performance of our method and previous approaches on a hybrid data set of real genomic DNA with simulated interspersed repeats. Our method outperforms a related method in terms of sensitivity, positive predictive value, and localizing boundaries of homology. The described methods have been implemented in freely available software, Repeatoire, available from: http://wwwabi.snv.jussieu.fr/public/Repeatoire.

Mesh:

Substances:

Year:  2009        PMID: 19407343     DOI: 10.1109/TCBB.2009.9

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  10 in total

1.  MetAMOS: a modular and open source metagenomic assembly and analysis pipeline.

Authors:  Todd J Treangen; Sergey Koren; Daniel D Sommer; Bo Liu; Irina Astrovskaya; Brian Ondov; Aaron E Darling; Adam M Phillippy; Mihai Pop
Journal:  Genome Biol       Date:  2013-01-15       Impact factor: 13.583

2.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

3.  Progressive genome-wide introgression in agricultural Campylobacter coli.

Authors:  Samuel K Sheppard; Xavier Didelot; Keith A Jolley; Aaron E Darling; Ben Pascoe; Guillaume Meric; David J Kelly; Alison Cody; Frances M Colles; Norval J C Strachan; Iain D Ogden; Ken Forbes; Nigel P French; Philip Carter; William G Miller; Noel D McCarthy; Robert Owen; Eva Litrup; Michael Egholm; Jason P Affourtit; Stephen D Bentley; Julian Parkhill; Martin C J Maiden; Daniel Falush
Journal:  Mol Ecol       Date:  2012-12-20       Impact factor: 6.185

4.  Horizontal transfer, not duplication, drives the expansion of protein families in prokaryotes.

Authors:  Todd J Treangen; Eduardo P C Rocha
Journal:  PLoS Genet       Date:  2011-01-27       Impact factor: 5.917

5.  Split-alignment of genomes finds orthologies more accurately.

Authors:  Martin C Frith; Risa Kawaguchi
Journal:  Genome Biol       Date:  2015-05-21       Impact factor: 13.583

6.  Insights from the metagenome of an acid salt lake: the role of biology in an extreme depositional environment.

Authors:  Sarah Stewart Johnson; Marc Gerard Chevrette; Bethany L Ehlmann; Kathleen Counter Benison
Journal:  PLoS One       Date:  2015-04-29       Impact factor: 3.240

Review 7.  Integrative workflows for metagenomic analysis.

Authors:  Efthymios Ladoukakis; Fragiskos N Kolisis; Aristotelis A Chatziioannou
Journal:  Front Cell Dev Biol       Date:  2014-11-19

8.  Scaffolding of long read assemblies using long range contact information.

Authors:  Jay Ghurye; Mihai Pop; Sergey Koren; Derek Bickhart; Chen-Shan Chin
Journal:  BMC Genomics       Date:  2017-07-12       Impact factor: 3.969

9.  Graph-based modeling of tandem repeats improves global multiple sequence alignment.

Authors:  Adam M Szalkowski; Maria Anisimova
Journal:  Nucleic Acids Res       Date:  2013-07-22       Impact factor: 16.971

10.  Genomic repeats, misassembly and reannotation: a case study with long-read resequencing of Porphyromonas gingivalis reference strains.

Authors:  Luis Acuña-Amador; Aline Primot; Edouard Cadieu; Alain Roulet; Frédérique Barloy-Hubler
Journal:  BMC Genomics       Date:  2018-01-16       Impact factor: 3.969

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.