Literature DB >> 22407710

Alignment methods: strategies, challenges, benchmarking, and comparative overview.

Ari Löytynoja1.   

Abstract

Comparative evolutionary analyses of molecular sequences are solely based on the identities and differences detected between homologous characters. Errors in this homology statement, that is errors in the alignment of the sequences, are likely to lead to errors in the downstream analyses. Sequence alignment and phylogenetic inference are tightly connected and many popular alignment programs use the phylogeny to divide the alignment problem into smaller tasks. They then neglect the phylogenetic tree, however, and produce alignments that are not evolutionarily meaningful. The use of phylogeny-aware methods reduces the error but the resulting alignments, with evolutionarily correct representation of homology, can challenge the existing practices and methods for viewing and visualising the sequences. The inter-dependency of alignment and phylogeny can be resolved by joint estimation of the two; methods based on statistical models allow for inferring the alignment parameters from the data and correctly take into account the uncertainty of the solution but remain computationally challenging. Widely used alignment methods are based on heuristic algorithms and unlikely to find globally optimal solutions. The whole concept of one correct alignment for the sequences is questionable, however, as there typically exist vast numbers of alternative, roughly equally good alignments that should also be considered. This uncertainty is hidden by many popular alignment programs and is rarely correctly taken into account in the downstream analyses. The quest for finding and improving the alignment solution is complicated by the lack of suitable measures of alignment goodness. The difficulty of comparing alternative solutions also affects benchmarks of alignment methods and the results strongly depend on the measure used. As the effects of alignment error cannot be predicted, comparing the alignments' performance in downstream analyses is recommended.

Mesh:

Year:  2012        PMID: 22407710     DOI: 10.1007/978-1-61779-582-4_7

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  5 in total

1.  Inferring Phylogenomic Relationship of Microbes Using Scalable Alignment-Free Methods.

Authors:  Guillaume Bernard; Timothy G Stephens; Raúl A González-Pech; Cheong Xin Chan
Journal:  Methods Mol Biol       Date:  2021

2.  Multiple Sequence Alignment Averaging Improves Phylogeny Reconstruction.

Authors:  Haim Ashkenazy; Itamar Sela; Eli Levy Karin; Giddy Landan; Tal Pupko
Journal:  Syst Biol       Date:  2019-01-01       Impact factor: 15.683

3.  The number of reduced alignments between two DNA sequences.

Authors:  Helena Andrade; Iván Area; Juan J Nieto; Angela Torres
Journal:  BMC Bioinformatics       Date:  2014-04-01       Impact factor: 3.169

4.  COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification.

Authors:  Daniel Struck; Glenn Lawyer; Anne-Marie Ternes; Jean-Claude Schmit; Danielle Perez Bercoff
Journal:  Nucleic Acids Res       Date:  2014-08-12       Impact factor: 16.971

5.  Partitional Classification: A Complement to Phylogeny.

Authors:  Marc Salomon; Bruno Dassy
Journal:  Evol Bioinform Online       Date:  2016-06-16       Impact factor: 1.625

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.