Literature DB >> 24899668

Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure.

Joseph L Herman1, Christopher J Challis2, Ádám Novák3, Jotun Hein3, Scott C Schmidler4.   

Abstract

For sequences that are highly divergent, there is often insufficient information to infer accurate alignments, and phylogenetic uncertainty may be high. One way to address this issue is to make use of protein structural information, since structures generally diverge more slowly than sequences. In this work, we extend a recently developed stochastic model of pairwise structural evolution to multiple structures on a tree, analytically integrating over ancestral structures to permit efficient likelihood computations under the resulting joint sequence-structure model. We observe that the inclusion of structural information significantly reduces alignment and topology uncertainty, and reduces the number of topology and alignment errors in cases where the true trees and alignments are known. In some cases, the inclusion of structure results in changes to the consensus topology, indicating that structure may contain additional information beyond that which can be obtained from sequences. We use the model to investigate the order of divergence of cytoglobins, myoglobins, and hemoglobins and observe a stabilization of phylogenetic inference: although a sequence-based inference assigns significant posterior probability to several different topologies, the structural model strongly favors one of these over the others and is more robust to the choice of data set.
© The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Bayesian phylogenetics; globin evolution; statistical alignment; stochastic processes; structural alignment

Mesh:

Substances:

Year:  2014        PMID: 24899668      PMCID: PMC4137710          DOI: 10.1093/molbev/msu184

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  59 in total

1.  Structure comparison and structure patterns.

Authors:  I Eidhammer; I Jonassen; W R Taylor
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

2.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

3.  A vertebrate globin expressed in the brain.

Authors:  T Burmester; B Weich; S Reinhardt; T Hankeln
Journal:  Nature       Date:  2000-09-28       Impact factor: 49.962

4.  Effects of models of rate evolution on estimation of divergence dates with special reference to the metazoan 18S ribosomal RNA phylogeny.

Authors:  Stéphane Aris-Brosou; Ziheng Yang
Journal:  Syst Biol       Date:  2002-10       Impact factor: 15.683

5.  Protein evolution with dependence among codons due to tertiary structure.

Authors:  Douglas M Robinson; David T Jones; Hirohisa Kishino; Nick Goldman; Jeffrey L Thorne
Journal:  Mol Biol Evol       Date:  2003-07-28       Impact factor: 16.240

6.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

7.  Heterogeneity and inaccuracy in protein structures solved by X-ray crystallography.

Authors:  Mark A DePristo; Paul I W de Bakker; Tom L Blundell
Journal:  Structure       Date:  2004-05       Impact factor: 5.006

8.  Structure is three to ten times more conserved than sequence--a study of structural response in protein cores.

Authors:  Kristoffer Illergård; David H Ardell; Arne Elofsson
Journal:  Proteins       Date:  2009-11-15

9.  Cytoglobin: a novel globin type ubiquitously expressed in vertebrate tissues.

Authors:  Thorsten Burmester; Bettina Ebner; Bettina Weich; Thomas Hankeln
Journal:  Mol Biol Evol       Date:  2002-04       Impact factor: 16.240

10.  Accurate reconstruction of insertion-deletion histories by statistical phylogenetics.

Authors:  Oscar Westesson; Gerton Lunter; Benedict Paten; Ian Holmes
Journal:  PLoS One       Date:  2012-04-20       Impact factor: 3.240

View more
  14 in total

1.  Historian: accurate reconstruction of ancestral sequences and evolutionary rates.

Authors:  Ian H Holmes
Journal:  Bioinformatics       Date:  2017-04-15       Impact factor: 6.937

2.  Multiple Sequence Alignment Averaging Improves Phylogeny Reconstruction.

Authors:  Haim Ashkenazy; Itamar Sela; Eli Levy Karin; Giddy Landan; Tal Pupko
Journal:  Syst Biol       Date:  2019-01-01       Impact factor: 15.683

3.  Incorporating Nearest-Neighbor Site Dependence into Protein Evolution Models.

Authors:  Gary Larson; Jeffrey L Thorne; Scott Schmidler
Journal:  J Comput Biol       Date:  2020-02-13       Impact factor: 1.479

4.  Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs.

Authors:  Joseph L Herman; Ádám Novák; Rune Lyngsø; Adrienn Szabó; István Miklós; Jotun Hein
Journal:  BMC Bioinformatics       Date:  2015-04-01       Impact factor: 3.169

5.  Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations.

Authors:  Andrew F Neuwald; Stephen F Altschul
Journal:  PLoS Comput Biol       Date:  2016-12-21       Impact factor: 4.475

6.  A Generative Angular Model of Protein Structure Evolution.

Authors:  Michael Golden; Eduardo García-Portugués; Michael Sørensen; Kanti V Mardia; Thomas Hamelryck; Jotun Hein
Journal:  Mol Biol Evol       Date:  2017-08-01       Impact factor: 16.240

7.  The Enigmatic Origin of Papillomavirus Protein Domains.

Authors:  Mikk Puustusmaa; Heleri Kirsip; Kevin Gaston; Aare Abroi
Journal:  Viruses       Date:  2017-08-23       Impact factor: 5.048

8.  Protein Structure-Guided Hidden Markov Models (HMMs) as A Powerful Method in the Detection of Ancestral Endogenous Viral Elements.

Authors:  Heleri Kirsip; Aare Abroi
Journal:  Viruses       Date:  2019-04-02       Impact factor: 5.048

9.  Phylogeny of Echinoderm Hemoglobins.

Authors:  Ana B Christensen; Joseph L Herman; Maurice R Elphick; Kord M Kober; Daniel Janies; Gregorio Linchangco; Dean C Semmens; Xavier Bailly; Serge N Vinogradov; David Hoogewijs
Journal:  PLoS One       Date:  2015-08-06       Impact factor: 3.240

10.  Inferring Indel Parameters using a Simulation-based Approach.

Authors:  Eli Levy Karin; Avigayel Rabin; Haim Ashkenazy; Dafna Shkedy; Oren Avram; Reed A Cartwright; Tal Pupko
Journal:  Genome Biol Evol       Date:  2015-11-03       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.