Literature DB >> 31209473

Identifying Clusters of High Confidence Homologies in Multiple Sequence Alignments.

Raja Hashim Ali1,2, Marcin Bogusz1, Simon Whelan1.   

Abstract

Multiple sequence alignment (MSA) is ubiquitous in evolution and bioinformatics. MSAs are usually taken to be a known and fixed quantity on which to perform downstream analysis despite extensive evidence that MSA accuracy and uncertainty affect results. These errors are known to cause a wide range of problems for downstream evolutionary inference, ranging from false inference of positive selection to long branch attraction artifacts. The most popular approach to dealing with this problem is to remove (filter) specific columns in the MSA that are thought to be prone to error. Although popular, this approach has had mixed success and several studies have even suggested that filtering might be detrimental to phylogenetic studies. We present a graph-based clustering method to address MSA uncertainty and error in the software Divvier (available at https://github.com/simonwhelan/Divvier), which uses a probabilistic model to identify clusters of characters that have strong statistical evidence of shared homology. These clusters can then be used to either filter characters from the MSA (partial filtering) or represent each of the clusters in a new column (divvying). We validate Divvier through its performance on real and simulated benchmarks, finding Divvier substantially outperforms existing filtering software by retaining more true pairwise homologies calls and removing more false positive pairwise homologies. We also find that Divvier, in contrast to other filtering tools, can alleviate long branch attraction artifacts induced by MSA and reduces the variation in tree estimates caused by MSA uncertainty.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  filtering; homology; multiple sequence alignment; phylogenetic inference

Year:  2019        PMID: 31209473     DOI: 10.1093/molbev/msz142

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  15 in total

1.  A kleptoplastidic dinoflagellate and the tipping point between transient and fully integrated plastid endosymbiosis.

Authors:  Elisabeth Hehenberger; Rebecca J Gast; Patrick J Keeling
Journal:  Proc Natl Acad Sci U S A       Date:  2019-08-19       Impact factor: 11.205

2.  Site-and-branch-heterogeneous analyses of an expanded dataset favour mitochondria as sister to known Alphaproteobacteria.

Authors:  Sergio A Muñoz-Gómez; Edward Susko; Kelsey Williamson; Laura Eme; Claudio H Slamovits; David Moreira; Purificación López-García; Andrew J Roger
Journal:  Nat Ecol Evol       Date:  2022-01-13       Impact factor: 19.100

3.  A standardized archaeal taxonomy for the Genome Taxonomy Database.

Authors:  Christian Rinke; Maria Chuvochina; Aaron J Mussig; Pierre-Alain Chaumeil; Adrián A Davín; David W Waite; William B Whitman; Donovan H Parks; Philip Hugenholtz
Journal:  Nat Microbiol       Date:  2021-06-21       Impact factor: 17.745

4.  A molecular timescale for eukaryote evolution with implications for the origin of red algal-derived plastids.

Authors:  Jürgen F H Strassert; Iker Irisarri; Tom A Williams; Fabien Burki
Journal:  Nat Commun       Date:  2021-03-25       Impact factor: 14.919

5.  Evolutionary Models for the Diversification of Placental Mammals Across the KPg Boundary.

Authors:  Mark S Springer; Nicole M Foley; Peggy L Brady; John Gatesy; William J Murphy
Journal:  Front Genet       Date:  2019-11-29       Impact factor: 4.599

6.  Markers for genetic change.

Authors:  Giovanni Forcina; Miguel Camacho-Sanchez; Fred Y Y Tuh; Sacramento Moreno; Jennifer A Leonard
Journal:  Heliyon       Date:  2021-01-02

7.  Gregarine single-cell transcriptomics reveals differential mitochondrial remodeling and adaptation in apicomplexans.

Authors:  Eric D Salomaki; Kristina X Terpis; Sonja Rueckert; Michael Kotyk; Zuzana Kotyková Varadínová; Ivan Čepička; Christopher E Lane; Martin Kolisko
Journal:  BMC Biol       Date:  2021-04-16       Impact factor: 7.431

Review 8.  Ancestral sequence reconstruction - An underused approach to understand the evolution of gene function in plants?

Authors:  Federico Scossa; Alisdair R Fernie
Journal:  Comput Struct Biotechnol J       Date:  2021-03-16       Impact factor: 7.271

9.  Vestiges of the Bacterial Signal Recognition Particle-Based Protein Targeting in Mitochondria.

Authors:  Jan Pyrih; Tomáš Pánek; Ignacio Miguel Durante; Vendula Rašková; Kristýna Cimrhanzlová; Eva Kriegová; Anastasios D Tsaousis; Marek Eliáš; Julius Lukeš
Journal:  Mol Biol Evol       Date:  2021-07-29       Impact factor: 16.240

10.  Phylogenetic Diversity of Lhr Proteins and Biochemical Activities of the Thermococcales aLhr2 DNA/RNA Helicase.

Authors:  Mirna Hajj; Petra Langendijk-Genevaux; Manon Batista; Yves Quentin; Sébastien Laurent; Régine Capeyrou; Ziad Abdel-Razzak; Didier Flament; Hala Chamieh; Gwennaele Fichant; Béatrice Clouet-d'Orval; Marie Bouvier
Journal:  Biomolecules       Date:  2021-06-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.