Literature DB >> 17387100

Heads or tails: a simple reliability check for multiple sequence alignments.

Giddy Landan1, Dan Graur.   

Abstract

The question of multiple sequence alignment quality has received much attention from developers of alignment methods. Less forthcoming, however, are practical measures for addressing alignment quality issues in real life settings. Here, we present a simple methodology to help identify and quantify the uncertainties in multiple sequence alignments and their effects on subsequent analyses. The proposed methodology is based upon the a priori expectation that sequence alignment results should be independent of the orientation of the input sequences. Thus, for totally unambiguous cases, reversing residue order prior to alignment should yield an exact reversed alignment of that obtained by using the unreversed sequences. Such "ideal" alignments, however, are the exception in real life settings, and the two alignments, which we term the heads and tails alignments, are usually different to a greater or lesser degree. The degree of agreement or discrepancy between these two alignments may be used to assess the reliability of the sequence alignment. Furthermore, any alignment dependent sequence analysis protocol can be carried out separately for each of the two alignments, and the two sets of results may be compared with each other, providing us with valuable information regarding the robustness of the whole analytical process. The heads-or-tails (HoT) methodology can be easily implemented for any choice of alignment method and for any subsequent analytical protocol. We demonstrate the utility of HoT for phylogenetic reconstruction for the case of 130 sequences belonging to the chemoreceptor superfamily in Drosophila melanogaster, and by analysis of the BaliBASE alignment database. Surprisingly, Neighbor-Joining methods of phylogenetic reconstruction turned out to be less affected by alignment errors than maximum likelihood and Bayesian methods.

Entities:  

Mesh:

Year:  2007        PMID: 17387100     DOI: 10.1093/molbev/msm060

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  85 in total

1.  Reconstructing the fungal tree of life using phylogenomics and a preliminary investigation of the distribution of yeast prion-like proteins in the fungal kingdom.

Authors:  Edgar M Medina; Gary W Jones; David A Fitzpatrick
Journal:  J Mol Evol       Date:  2011-09-22       Impact factor: 2.395

2.  Endosymbiotic origin and differential loss of eukaryotic genes.

Authors:  Chuan Ku; Shijulal Nelson-Sathi; Mayo Roettger; Filipa L Sousa; Peter J Lockhart; David Bryant; Einat Hazkani-Covo; James O McInerney; Giddy Landan; William F Martin
Journal:  Nature       Date:  2015-08-19       Impact factor: 49.962

3.  Old fossils-young species: evolutionary history of an endemic gastropod assemblage in Lake Malawi.

Authors:  Roland Schultheiss; Bert Van Bocxlaer; Thomas Wilke; Christian Albrecht
Journal:  Proc Biol Sci       Date:  2009-05-13       Impact factor: 5.349

Review 4.  Getting a better picture of microbial evolution en route to a network of genomes.

Authors:  Tal Dagan; William Martin
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2009-08-12       Impact factor: 6.237

5.  Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes.

Authors:  Romain A Studer; Simon Penel; Laurent Duret; Marc Robinson-Rechavi
Journal:  Genome Res       Date:  2008-06-18       Impact factor: 9.043

6.  AtnMat2, a nuclear-encoded maturase required for splicing of group-II introns in Arabidopsis mitochondria.

Authors:  Ido Keren; Ayenachew Bezawork-Geleta; Max Kolton; Inbar Maayan; Eduard Belausov; Maggie Levy; Anahit Mett; David Gidoni; Felix Shaya; Oren Ostersetzer-Biran
Journal:  RNA       Date:  2009-12       Impact factor: 4.942

7.  An alignment confidence score capturing robustness to guide tree uncertainty.

Authors:  Osnat Penn; Eyal Privman; Giddy Landan; Dan Graur; Tal Pupko
Journal:  Mol Biol Evol       Date:  2010-03-05       Impact factor: 16.240

8.  The physiology and habitat of the last universal common ancestor.

Authors:  Madeline C Weiss; Filipa L Sousa; Natalia Mrnjavac; Sinje Neukirchen; Mayo Roettger; Shijulal Nelson-Sathi; William F Martin
Journal:  Nat Microbiol       Date:  2016-07-25       Impact factor: 17.745

9.  Estimates of positive Darwinian selection are inflated by errors in sequencing, annotation, and alignment.

Authors:  Adrian Schneider; Alexander Souvorov; Niv Sabath; Giddy Landan; Gaston H Gonnet; Dan Graur
Journal:  Genome Biol Evol       Date:  2009-06-05       Impact factor: 3.416

10.  Reproducing the manual annotation of multiple sequence alignments using a SVM classifier.

Authors:  Christian Blouin; Scott Perry; Allan Lavell; Edward Susko; Andrew J Roger
Journal:  Bioinformatics       Date:  2009-09-21       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.