Literature DB >> 14962922

Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems.

Catherine Grasso1, Christopher Lee.   

Abstract

MOTIVATION: Partial order alignment (POA) has been proposed as a new approach to multiple sequence alignment (MSA), which can be combined with existing methods such as progressive alignment. This is important for addressing problems both in the original version of POA (such as order sensitivity) and in standard progressive alignment programs (such as information loss in complex alignments, especially surrounding gap regions).
RESULTS: We have developed a new Partial Order-Partial Order alignment algorithm that optimally aligns a pair of MSAs and which therefore can be applied directly to progressive alignment methods such as CLUSTAL. Using this algorithm, we show the combined Progressive POA alignment method yields results comparable with the best available MSA programs (CLUSTALW, DIALIGN2, T-COFFEE) but is far faster. For example, depending on the level of sequence similarity, aligning 1000 sequences, each 500 amino acids long, took 15 min (at 90% average identity) to 44 min (at 30% identity) on a standard PC. For large alignments, Progressive POA was 10-30 times faster than the fastest of the three previous methods (CLUSTALW). These data suggest that POA-based methods can scale to much larger alignment problems than possible for previous methods. AVAILABILITY: The POA source code is available at http://www.bioinformatics.ucla.edu/poa

Mesh:

Year:  2004        PMID: 14962922     DOI: 10.1093/bioinformatics/bth126

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  39 in total

1.  A novel method for multiple alignment of sequences with repeated and shuffled elements.

Authors:  Benjamin Raphael; Degui Zhi; Haixu Tang; Pavel Pevzner
Journal:  Genome Res       Date:  2004-11       Impact factor: 9.043

2.  Conspicuous accumulation of transcription elongation repressor hrp130/CA150 on the intron-rich Balbiani ring 3 gene.

Authors:  Xin Sun; Jian Zhao; Karin Kylberg; Teresa Soop; Kevin Palka; Erik Sonnhammer; Neus Visa; Alla T Alzhanova-Ericsson; Bertil Daneholt
Journal:  Chromosoma       Date:  2004-10-06       Impact factor: 4.316

3.  Optimal gene trees from sequences and species trees using a soft interpretation of parsimony.

Authors:  Ann-Charlotte Berglund-Sonnhammer; Pär Steffansson; Matthew J Betts; David A Liberles
Journal:  J Mol Evol       Date:  2006-07-07       Impact factor: 2.395

4.  Rapid likelihood analysis on large phylogenies using partial sampling of substitution histories.

Authors:  A P Jason de Koning; Wanjun Gu; David D Pollock
Journal:  Mol Biol Evol       Date:  2009-09-25       Impact factor: 16.240

5.  Generic Repeat Finder: A High-Sensitivity Tool for Genome-Wide De Novo Repeat Detection.

Authors:  Jieming Shi; Chun Liang
Journal:  Plant Physiol       Date:  2019-05-31       Impact factor: 8.340

Review 6.  Pangenome Graphs.

Authors:  Jordan M Eizenga; Adam M Novak; Jonas A Sibbesen; Simon Heumos; Ali Ghaffaari; Glenn Hickey; Xian Chang; Josiah D Seaman; Robin Rounthwaite; Jana Ebler; Mikko Rautiainen; Shilpa Garg; Benedict Paten; Tobias Marschall; Jouni Sirén; Erik Garrison
Journal:  Annu Rev Genomics Hum Genet       Date:  2020-05-26       Impact factor: 8.929

7.  Reconstructing the history of maize streak virus strain a dispersal to reveal diversification hot spots and its origin in southern Africa.

Authors:  Adérito L Monjane; Gordon W Harkins; Darren P Martin; Philippe Lemey; Pierre Lefeuvre; Dionne N Shepherd; Sunday Oluwafemi; Michelo Simuyandi; Innocent Zinga; Ephrem K Komba; Didier P Lakoutene; Noella Mandakombo; Joseph Mboukoulida; Silla Semballa; Appolinaire Tagne; Fidèle Tiendrébéogo; Julia B Erdmann; Tania van Antwerpen; Betty E Owor; Bradley Flett; Moses Ramusi; Oliver P Windram; Rizwan Syed; Jean-Michel Lett; Rob W Briddon; Peter G Markham; Edward P Rybicki; Arvind Varsani
Journal:  J Virol       Date:  2011-06-29       Impact factor: 5.103

8.  The spread of tomato yellow leaf curl virus from the Middle East to the world.

Authors:  Pierre Lefeuvre; Darren P Martin; Gordon Harkins; Philippe Lemey; Alistair J A Gray; Sandra Meredith; Francisco Lakay; Adérito Monjane; Jean-Michel Lett; Arvind Varsani; Jahangir Heydarnejad
Journal:  PLoS Pathog       Date:  2010-10-28       Impact factor: 6.823

9.  Dating the origins of the maize-adapted strain of maize streak virus, MSV-A.

Authors:  Gordon W Harkins; Darren P Martin; Siobain Duffy; Aderito L Monjane; Dionne N Shepherd; Oliver P Windram; Betty E Owor; Lara Donaldson; Tania van Antwerpen; Rizwan A Sayed; Bradley Flett; Moses Ramusi; Edward P Rybicki; Michel Peterschmitt; Arvind Varsani
Journal:  J Gen Virol       Date:  2009-08-19       Impact factor: 3.891

10.  Comparative analysis of Panicum streak virus and Maize streak virus diversity, recombination patterns and phylogeography.

Authors:  Arvind Varsani; Aderito L Monjane; Lara Donaldson; Sunday Oluwafemi; Innocent Zinga; Ephrem K Komba; Didier Plakoutene; Noella Mandakombo; Joseph Mboukoulida; Silla Semballa; Rob W Briddon; Peter G Markham; Jean-Michel Lett; Pierre Lefeuvre; Edward P Rybicki; Darren P Martin
Journal:  Virol J       Date:  2009-11-10       Impact factor: 4.099

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.