
Class of multiple sequence alignment algorithm affects genomic analysis.

Benjamin P Blackburne, Simon Whelan.

Abstract

Multiple sequence alignment (MSA) is the heart of comparative sequence analysis. Recent studies demonstrate that MSA algorithms can produce different outcomes when analyzing genomes, including phylogenetic tree inference and the detection of adaptive evolution. These studies also suggest that the difference between MSA algorithms is of a similar order to the uncertainty within an algorithm and suggest integrating across this uncertainty. In this study, we examine further the problem of disagreements between MSA algorithms and how they affect downstream analyses. We also investigate whether integrating across alignment uncertainty affects downstream analyses. We address these questions by analyzing 200 chordate gene families, with properties reflecting those used in large-scale genomic analyses. We find that newly developed distance metrics reveal two significantly different classes of MSA methods (MSAMs). The similarity-based class includes progressive aligners and consistency aligners, representing many methodological innovations for sequence alignment, whereas the evolution-based class includes phylogenetically aware alignment and statistical alignment. We proceed to show that the class of an MSAM has a substantial impact on downstream analyses. For phylogenetic inference, tree estimates and their branch lengths appear highly dependent on the class of aligner used. The number of families, and the sites within those families, inferred to have undergone adaptive evolution depend on the class of aligner used. Similarity-based aligners tend to identify more adaptive evolution. We also develop and test methods for incorporating MSA uncertainty when detecting adaptive evolution but find that although accounting for MSA uncertainty does affect downstream analyses, it appears less important than the class of aligner chosen. 
Our results demonstrate the critical role that MSA methodology has on downstream analysis, highlighting that the class of aligner chosen in an analysis has a demonstrable effect on its outcome.
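As an illustration of what a distance between two alignments of the same sequences can look like, the sketch below compares the sets of residue pairs that each alignment places in the same column. This is a simplified pairwise-homology distance written for illustration only; it is not claimed to be the metric used in the study, and the function names `homology_pairs` and `pair_distance` are hypothetical.

```python
from itertools import combinations


def homology_pairs(alignment):
    """Return the set of residue pairs that an alignment places in one column.

    `alignment` maps sequence name -> gapped row; '-' marks a gap.
    Each residue is identified by (name, position in the ungapped sequence).
    """
    names = sorted(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[n]) != length for n in names):
        raise ValueError("alignment rows must have equal length")

    next_index = {n: 0 for n in names}  # residues seen so far per sequence
    pairs = set()
    for col in range(length):
        residues = []
        for n in names:
            if alignment[n][col] != "-":
                residues.append((n, next_index[n]))
                next_index[n] += 1
        # Every two residues sharing a column are asserted to be homologous.
        pairs.update(combinations(residues, 2))
    return pairs


def pair_distance(aln_a, aln_b):
    """Fraction of aligned residue pairs on which two alignments disagree.

    Illustrative assumption: symmetric difference of the pair sets divided
    by their union, giving 0.0 for identical alignments and 1.0 when no
    aligned pair is shared.
    """
    pairs_a = homology_pairs(aln_a)
    pairs_b = homology_pairs(aln_b)
    union = pairs_a | pairs_b
    if not union:
        return 0.0
    return len(pairs_a ^ pairs_b) / len(union)


# Two toy alignments of the same sequences that differ in gap placement.
aln_1 = {"s1": "AC-G", "s2": "ACTG"}
aln_2 = {"s1": "A-CG", "s2": "ACTG"}
print(pair_distance(aln_1, aln_2))
```

Computing such a distance for every pair of aligners on a gene family gives a distance matrix that could then be clustered, which is one way groupings of methods like the two classes described above might be explored.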

Year:  2012        PMID: 23144040     DOI: 10.1093/molbev/mss256

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  29 in total

1.  Patterns of molecular evolution of the germ line specification gene oskar suggest that a novel domain may contribute to functional divergence in Drosophila.

Authors:  Abha Ahuja; Cassandra G Extavour
Journal:  Dev Genes Evol       Date:  2014-01-10       Impact factor: 0.900

2.  Evaluating Statistical Multiple Sequence Alignment in Comparison to Other Alignment Methods on Protein Data Sets.

Authors:  Michael Nute; Ehsan Saleh; Tandy Warnow
Journal:  Syst Biol       Date:  2019-05-01       Impact factor: 15.683

3.  Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure.

Authors:  Joseph L Herman; Christopher J Challis; Ádám Novák; Jotun Hein; Scott C Schmidler
Journal:  Mol Biol Evol       Date:  2014-06-04       Impact factor: 16.240

4.  Erasing errors due to alignment ambiguity when estimating positive selection.

Authors:  Benjamin Redelings
Journal:  Mol Biol Evol       Date:  2014-05-27       Impact factor: 16.240

5.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

6.  Multiple evolution of flavonoid 3',5'-hydroxylase.

Authors:  Christian Seitz; Stefanie Ameres; Karin Schlangen; Gert Forkmann; Heidi Halbwirth
Journal:  Planta       Date:  2015-04-28       Impact factor: 4.116

7.  Incorporating alignment uncertainty into Felsenstein's phylogenetic bootstrap to improve its reliability.

Authors:  Jia-Ming Chang; Evan W Floden; Javier Herrero; Olivier Gascuel; Paolo Di Tommaso; Cedric Notredame
Journal:  Bioinformatics       Date:  2019-02-06       Impact factor: 6.937

8.  ITS2 Secondary Structure Improves Discrimination between Medicinal "Mu Tong" Species when Using DNA Barcoding.

Authors:  Wei Zhang; Yuan Yuan; Shuo Yang; Jianjun Huang; Luqi Huang
Journal:  PLoS One       Date:  2015-07-01       Impact factor: 3.240

9.  Coelacanth SERINC2 Inhibits HIV-1 Infectivity and Is Counteracted by Envelope Glycoprotein from Foamy Virus.

Authors:  Pavitra Ramdas; Vipin Bhardwaj; Aman Singh; Nagarjun Vijay; Ajit Chande
Journal:  J Virol       Date:  2021-06-10       Impact factor: 5.103

10.  Efficacy of computational predictions of the functional effect of idiosyncratic pharmacogenetic variants.

Authors:  Hannah McConnell; T Daniel Andrews; Matt A Field
Journal:  PeerJ       Date:  2021-07-15       Impact factor: 2.984
