Catherine Grasso1, Christopher Lee. 1. Department of Chemistry and Biochemistry, Molecular Biology Institute, Center for Genomics and Proteomics, University of California, Los Angeles, CA 90095-1570, USA.
Abstract
MOTIVATION: Partial order alignment (POA) has been proposed as a new approach to multiple sequence alignment (MSA), which can be combined with existing methods such as progressive alignment. This is important for addressing problems both in the original version of POA (such as order sensitivity) and in standard progressive alignment programs (such as information loss in complex alignments, especially surrounding gap regions). RESULTS: We have developed a new Partial Order-Partial Order alignment algorithm that optimally aligns a pair of MSAs and which therefore can be applied directly to progressive alignment methods such as CLUSTAL. Using this algorithm, we show the combined Progressive POA alignment method yields results comparable with the best available MSA programs (CLUSTALW, DIALIGN2, T-COFFEE) but is far faster. For example, depending on the level of sequence similarity, aligning 1000 sequences, each 500 amino acids long, took 15 min (at 90% average identity) to 44 min (at 30% identity) on a standard PC. For large alignments, Progressive POA was 10-30 times faster than the fastest of the three previous methods (CLUSTALW). These data suggest that POA-based methods can scale to much larger alignment problems than possible for previous methods. AVAILABILITY: The POA source code is available at http://www.bioinformatics.ucla.edu/poa
MOTIVATION: Partial order alignment (POA) has been proposed as a new approach to multiple sequence alignment (MSA), which can be combined with existing methods such as progressive alignment. This is important for addressing problems both in the original version of POA (such as order sensitivity) and in standard progressive alignment programs (such as information loss in complex alignments, especially surrounding gap regions). RESULTS: We have developed a new Partial Order-Partial Order alignment algorithm that optimally aligns a pair of MSAs and which therefore can be applied directly to progressive alignment methods such as CLUSTAL. Using this algorithm, we show the combined Progressive POA alignment method yields results comparable with the best available MSA programs (CLUSTALW, DIALIGN2, T-COFFEE) but is far faster. For example, depending on the level of sequence similarity, aligning 1000 sequences, each 500 amino acids long, took 15 min (at 90% average identity) to 44 min (at 30% identity) on a standard PC. For large alignments, Progressive POA was 10-30 times faster than the fastest of the three previous methods (CLUSTALW). These data suggest that POA-based methods can scale to much larger alignment problems than possible for previous methods. AVAILABILITY: The POA source code is available at http://www.bioinformatics.ucla.edu/poa
Authors: Jordan M Eizenga; Adam M Novak; Jonas A Sibbesen; Simon Heumos; Ali Ghaffaari; Glenn Hickey; Xian Chang; Josiah D Seaman; Robin Rounthwaite; Jana Ebler; Mikko Rautiainen; Shilpa Garg; Benedict Paten; Tobias Marschall; Jouni Sirén; Erik Garrison Journal: Annu Rev Genomics Hum Genet Date: 2020-05-26 Impact factor: 8.929
Authors: Adérito L Monjane; Gordon W Harkins; Darren P Martin; Philippe Lemey; Pierre Lefeuvre; Dionne N Shepherd; Sunday Oluwafemi; Michelo Simuyandi; Innocent Zinga; Ephrem K Komba; Didier P Lakoutene; Noella Mandakombo; Joseph Mboukoulida; Silla Semballa; Appolinaire Tagne; Fidèle Tiendrébéogo; Julia B Erdmann; Tania van Antwerpen; Betty E Owor; Bradley Flett; Moses Ramusi; Oliver P Windram; Rizwan Syed; Jean-Michel Lett; Rob W Briddon; Peter G Markham; Edward P Rybicki; Arvind Varsani Journal: J Virol Date: 2011-06-29 Impact factor: 5.103
Authors: Pierre Lefeuvre; Darren P Martin; Gordon Harkins; Philippe Lemey; Alistair J A Gray; Sandra Meredith; Francisco Lakay; Adérito Monjane; Jean-Michel Lett; Arvind Varsani; Jahangir Heydarnejad Journal: PLoS Pathog Date: 2010-10-28 Impact factor: 6.823
Authors: Gordon W Harkins; Darren P Martin; Siobain Duffy; Aderito L Monjane; Dionne N Shepherd; Oliver P Windram; Betty E Owor; Lara Donaldson; Tania van Antwerpen; Rizwan A Sayed; Bradley Flett; Moses Ramusi; Edward P Rybicki; Michel Peterschmitt; Arvind Varsani Journal: J Gen Virol Date: 2009-08-19 Impact factor: 3.891
Authors: Arvind Varsani; Aderito L Monjane; Lara Donaldson; Sunday Oluwafemi; Innocent Zinga; Ephrem K Komba; Didier Plakoutene; Noella Mandakombo; Joseph Mboukoulida; Silla Semballa; Rob W Briddon; Peter G Markham; Jean-Michel Lett; Pierre Lefeuvre; Edward P Rybicki; Darren P Martin Journal: Virol J Date: 2009-11-10 Impact factor: 4.099