Literature DB >> 20047659

Phylogenetic comparative assembly.

Peter Husemann1, Jens Stoye.   

Abstract

BACKGROUND: Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several contigs remain which cannot be assembled any further. It is still costly and time consuming to close all the gaps in order to acquire the whole genomic sequence.
RESULTS: Here we propose an algorithm that takes several related genomes and their phylogenetic relationships into account to create a graph that contains the likelihood for each pair of contigs to be adjacent. Subsequently, this graph can be used to compute a layout graph that shows the most promising contig adjacencies in order to aid biologists in finishing the complete genomic sequence. The layout graph shows unique contig orderings where possible, and the best alternatives where necessary.
CONCLUSIONS: Our new algorithm for contig ordering uses sequence similarity as well as phylogenetic information to estimate adjacencies of contigs. An evaluation of our implementation shows that it performs better than recent approaches while being much faster at the same time.

Entities:  

Year:  2010        PMID: 20047659      PMCID: PMC2826331          DOI: 10.1186/1748-7188-5-3

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  21 in total

1.  GenBank.

Authors:  D A Benson; I Karsch-Mizrachi; D J Lipman; J Ostell; B A Rapp; D L Wheeler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

3.  Database resources of the National Center for Biotechnology Information.

Authors:  D L Wheeler; C Chappey; A E Lash; D D Leipe; T L Madden; G D Schuler; T A Tatusova; B A Rapp
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  Efficient q-gram filters for finding all epsilon-matches over a given length.

Authors:  Kim R Rasmussen; Jens Stoye; Eugene W Myers
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

5.  Consed: a graphical tool for sequence finishing.

Authors:  D Gordon; C Abajian; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

6.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

7.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

8.  DNA sequencing with chain-terminating inhibitors.

Authors:  F Sanger; S Nicklen; A R Coulson
Journal:  Proc Natl Acad Sci U S A       Date:  1977-12       Impact factor: 11.205

9.  PHY.FI: fast and easy online creation and manipulation of phylogeny color figures.

Authors:  Jakob Fredslund
Journal:  BMC Bioinformatics       Date:  2006-06-22       Impact factor: 3.169

10.  Projector 2: contig mapping for efficient gap-closure of prokaryotic genome sequence assemblies.

Authors:  Sacha A F T van Hijum; Aldert L Zomer; Oscar P Kuipers; Jan Kok
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

View more
  8 in total

1.  Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.

Authors:  Song Gao; Wing-Kin Sung; Niranjan Nagarajan
Journal:  J Comput Biol       Date:  2011-09-19       Impact factor: 1.479

2.  Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

Authors:  Christophe Dessimoz; Stefan Zoller; Tereza Manousaki; Huan Qiu; Axel Meyer; Shigehiro Kuraku
Journal:  Brief Bioinform       Date:  2011-06-28       Impact factor: 11.622

3.  r2cat: synteny plots and comparative assembly.

Authors:  Peter Husemann; Jens Stoye
Journal:  Bioinformatics       Date:  2009-12-16       Impact factor: 6.937

4.  Linearization of ancestral multichromosomal genomes.

Authors:  Ján Maňuch; Murray Patterson; Roland Wittler; Cedric Chauve; Eric Tannier
Journal:  BMC Bioinformatics       Date:  2012-12-19       Impact factor: 3.169

5.  Genome reassembly with high-throughput sequencing data.

Authors:  Nathaniel Parrish; Benjamin Sudakov; Eleazar Eskin
Journal:  BMC Genomics       Date:  2013-01-21       Impact factor: 3.969

Review 6.  The inference of gene trees with species trees.

Authors:  Gergely J Szöllősi; Eric Tannier; Vincent Daubin; Bastien Boussau
Journal:  Syst Biol       Date:  2014-07-28       Impact factor: 15.683

7.  Ancestral gene synteny reconstruction improves extant species scaffolding.

Authors:  Yoann Anselmetti; Vincent Berry; Cedric Chauve; Annie Chateau; Eric Tannier; Sèverine Bérard
Journal:  BMC Genomics       Date:  2015-10-02       Impact factor: 3.969

8.  Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes.

Authors:  Yoann Anselmetti; Wandrille Duchemin; Eric Tannier; Cedric Chauve; Sèverine Bérard
Journal:  BMC Genomics       Date:  2018-05-09       Impact factor: 3.969

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.