Literature DB >> 22492645

PhyLAT: a phylogenetic local alignment tool.

Hongtao Sun1, Jeremy D Buhler.   

Abstract

MOTIVATION: The expansion of DNA sequencing capacity has enabled the sequencing of whole genomes from a number of related species. These genomes can be combined in a multiple alignment that provides useful information about the evolutionary history at each genomic locus. One area in which evolutionary information can productively be exploited is in aligning a new sequence to a database of existing, aligned genomes. However, existing high-throughput alignment tools are not designed to work effectively with multiple genome alignments.
RESULTS: We introduce PhyLAT, the phylogenetic local alignment tool, to compute local alignments of a query sequence against a fixed multiple-genome alignment of closely related species. PhyLAT uses a known phylogenetic tree on the species in the multiple alignment to improve the quality of its computed alignments while also estimating the placement of the query on this tree. It combines a probabilistic approach to alignment with seeding and expansion heuristics to accelerate discovery of significant alignments. We provide evidence, using alignments of human chromosome 22 against a five-species alignment from the UCSC Genome Browser database, that PhyLAT's alignments are more accurate than those of other commonly used programs, including BLAST, POY, MAFFT, MUSCLE and CLUSTAL. PhyLAT also identifies more alignments in coding DNA than does pairwise alignment alone. Finally, our tool determines the evolutionary relationship of query sequences to the database more accurately than do POY, RAxML, EPA or pplacer.

Entities:  

Mesh:

Year:  2012        PMID: 22492645      PMCID: PMC3465089          DOI: 10.1093/bioinformatics/bts158

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  48 in total

1.  Ultraconserved elements in the human genome.

Authors:  Gill Bejerano; Michael Pheasant; Igor Makunin; Stuart Stephen; W James Kent; John S Mattick; David Haussler
Journal:  Science       Date:  2004-05-06       Impact factor: 47.728

2.  Aligning short reads to reference alignments and trees.

Authors:  Simon A Berger; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2011-06-02       Impact factor: 6.937

3.  A new generation of homology search tools based on probabilistic inference.

Authors:  Sean R Eddy
Journal:  Genome Inform       Date:  2009-10

4.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

5.  BLAST+: architecture and applications.

Authors:  Christiam Camacho; George Coulouris; Vahram Avagyan; Ning Ma; Jason Papadopoulos; Kevin Bealer; Thomas L Madden
Journal:  BMC Bioinformatics       Date:  2009-12-15       Impact factor: 3.169

6.  Performance, accuracy, and Web server for evolutionary placement of short sequence reads under maximum likelihood.

Authors:  Simon A Berger; Denis Krompass; Alexandros Stamatakis
Journal:  Syst Biol       Date:  2011-03-23       Impact factor: 15.683

7.  pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree.

Authors:  Frederick A Matsen; Robin B Kodner; E Virginia Armbrust
Journal:  BMC Bioinformatics       Date:  2010-10-30       Impact factor: 3.169

8.  Island method for estimating the statistical significance of profile-profile alignment scores.

Authors:  Aleksandar Poleksic
Journal:  BMC Bioinformatics       Date:  2009-04-20       Impact factor: 3.169

9.  The UCSC Genome Browser database: update 2010.

Authors:  Brooke Rhead; Donna Karolchik; Robert M Kuhn; Angie S Hinrichs; Ann S Zweig; Pauline A Fujita; Mark Diekhans; Kayla E Smith; Kate R Rosenbloom; Brian J Raney; Andy Pohl; Michael Pheasant; Laurence R Meyer; Katrina Learned; Fan Hsu; Jennifer Hillman-Jackson; Rachel A Harte; Belinda Giardine; Timothy R Dreszer; Hiram Clawson; Galt P Barber; David Haussler; W James Kent
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

10.  A probabilistic model of local sequence alignment that simplifies statistical significance estimation.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2008-05-30       Impact factor: 4.475

View more
  2 in total

1.  Adding unaligned sequences into an existing alignment using MAFFT and LAST.

Authors:  Kazutaka Katoh; Martin C Frith
Journal:  Bioinformatics       Date:  2012-09-27       Impact factor: 6.937

2.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.