Literature DB >> 23275296

Evolutionary inference via the Poisson Indel Process.

Alexandre Bouchard-Côté1, Michael I Jordan.   

Abstract

We address the problem of the joint statistical inference of phylogenetic trees and multiple sequence alignments from unaligned molecular sequences. This problem is generally formulated in terms of string-valued evolutionary processes along the branches of a phylogenetic tree. The classic evolutionary process, the TKF91 model [Thorne JL, Kishino H, Felsenstein J (1991) J Mol Evol 33(2):114-124] is a continuous-time Markov chain model composed of insertion, deletion, and substitution events. Unfortunately, this model gives rise to an intractable computational problem: The computation of the marginal likelihood under the TKF91 model is exponential in the number of taxa. In this work, we present a stochastic process, the Poisson Indel Process (PIP), in which the complexity of this computation is reduced to linear. The Poisson Indel Process is closely related to the TKF91 model, differing only in its treatment of insertions, but it has a global characterization as a Poisson process on the phylogeny. Standard results for Poisson processes allow key computations to be decoupled, which yields the favorable computational profile of inference under the PIP model. We present illustrative experiments in which Bayesian inference under the PIP model is compared with separate inference of phylogenies and alignments.

Entities:  

Mesh:

Year:  2012        PMID: 23275296      PMCID: PMC3557041          DOI: 10.1073/pnas.1220450110

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  42 in total

1.  An algorithm for statistical alignment of sequences related by a binary tree.

Authors:  J Hein
Journal:  Pac Symp Biocomput       Date:  2001

2.  Large-scale comparison of protein sequence alignment algorithms with structure alignments.

Authors:  J M Sauder; J W Arthur; R L Dunbrack
Journal:  Proteins       Date:  2000-07-01

3.  Protein-length distributions for the three domains of life.

Authors:  J Zhang
Journal:  Trends Genet       Date:  2000-03       Impact factor: 11.639

4.  Assessing variability by joint sampling of alignments and mutation rates.

Authors:  D Metzler; R Fleissner; A Wakolbinger; A von Haeseler
Journal:  J Mol Evol       Date:  2001-12       Impact factor: 2.395

5.  Evolutionary HMMs: a Bayesian approach to multiple alignment.

Authors:  I Holmes; W J Bruno
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

6.  Sequence alignments and pair hidden Markov models using evolutionary history.

Authors:  Bjarne Knudsen; Michael M Miyamoto
Journal:  J Mol Biol       Date:  2003-10-17       Impact factor: 5.469

7.  Effect of nonindependent substitution on phylogenetic accuracy.

Authors:  J P Huelsenbeck; R Nielsen
Journal:  Syst Biol       Date:  1999-06       Impact factor: 15.683

8.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

Authors:  Stéphane Guindon; Olivier Gascuel
Journal:  Syst Biol       Date:  2003-10       Impact factor: 15.683

9.  Finite-state models in the alignment of macromolecules.

Authors:  L Allison; C S Wallace; C N Yee
Journal:  J Mol Evol       Date:  1992-07       Impact factor: 2.395

Review 10.  Inching toward reality: an improved likelihood model of sequence evolution.

Authors:  J L Thorne; H Kishino; J Felsenstein
Journal:  J Mol Evol       Date:  1992-01       Impact factor: 2.395

View more
  12 in total

1.  Profile of Michael I. Jordan.

Authors:  Nicholette Zeliadt
Journal:  Proc Natl Acad Sci U S A       Date:  2013-01-22       Impact factor: 11.205

2.  Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure.

Authors:  Joseph L Herman; Christopher J Challis; Ádám Novák; Jotun Hein; Scott C Schmidler
Journal:  Mol Biol Evol       Date:  2014-06-04       Impact factor: 16.240

3.  Measuring Phylogenetic Information of Incomplete Sequence Data.

Authors:  Tae-Kun Seo; Olivier Gascuel; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2022-04-19       Impact factor: 9.160

4.  A Model of Indel Evolution by Finite-State, Continuous-Time Machines.

Authors:  Ian Holmes
Journal:  Genetics       Date:  2020-10-05       Impact factor: 4.562

5.  Parameterizing sequence alignment with an explicit evolutionary model.

Authors:  Elena Rivas; Sean R Eddy
Journal:  BMC Bioinformatics       Date:  2015-12-10       Impact factor: 3.169

6.  Ancestral Reconstruction.

Authors:  Jeffrey B Joy; Richard H Liang; Rosemary M McCloskey; T Nguyen; Art F Y Poon
Journal:  PLoS Comput Biol       Date:  2016-07-12       Impact factor: 4.475

7.  Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps.

Authors:  Jakub Truszkowski; Nick Goldman
Journal:  Syst Biol       Date:  2015-11-28       Impact factor: 15.683

8.  Solving the master equation for Indels.

Authors:  Ian H Holmes
Journal:  BMC Bioinformatics       Date:  2017-05-12       Impact factor: 3.169

9.  Inferring Indel Parameters using a Simulation-based Approach.

Authors:  Eli Levy Karin; Avigayel Rabin; Haim Ashkenazy; Dafna Shkedy; Oren Avram; Reed A Cartwright; Tal Pupko
Journal:  Genome Biol Evol       Date:  2015-11-03       Impact factor: 3.416

10.  Progressive multiple sequence alignment with indel evolution.

Authors:  Massimo Maiolo; Xiaolei Zhang; Manuel Gil; Maria Anisimova
Journal:  BMC Bioinformatics       Date:  2018-09-21       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.