Literature DB >> 19661240

Improving phylogenetic analyses by incorporating additional information from genetic sequence databases.

Li-Jung Liang1, Robert E Weiss, Benjamin Redelings, Marc A Suchard.   

Abstract

MOTIVATION: Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest.
RESULTS: We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.

Mesh:

Year:  2009        PMID: 19661240      PMCID: PMC2800350          DOI: 10.1093/bioinformatics/btp473

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs.

Authors:  J D Thompson; F Plewniak; O Poch
Journal:  Bioinformatics       Date:  1999-01       Impact factor: 6.937

2.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

3.  The potential value of indels as phylogenetic markers: position of trichomonads as a case study.

Authors:  Eric Bapteste; Hervé Philippe
Journal:  Mol Biol Evol       Date:  2002-06       Impact factor: 16.240

4.  Identifiability of parameters in MCMC Bayesian inference of phylogeny.

Authors:  Bruce Rannala
Journal:  Syst Biol       Date:  2002-10       Impact factor: 15.683

5.  The order of sequence alignment can bias the selection of tree topology.

Authors:  J A Lake
Journal:  Mol Biol Evol       Date:  1991-05       Impact factor: 16.240

6.  Joint Bayesian estimation of alignment and phylogeny.

Authors:  Benjamin D Redelings; Marc A Suchard
Journal:  Syst Biol       Date:  2005-06       Impact factor: 15.683

7.  Branch-length prior influences Bayesian posterior probability of phylogeny.

Authors:  Ziheng Yang; Bruce Rannala
Journal:  Syst Biol       Date:  2005-06       Impact factor: 15.683

8.  BAli-Phy: simultaneous Bayesian inference of alignment and phylogeny.

Authors:  Marc A Suchard; Benjamin D Redelings
Journal:  Bioinformatics       Date:  2006-05-05       Impact factor: 6.937

9.  A hierarchical semiparametric regression model for combining HIV-1 phylogenetic analyses using iterative reweighting algorithms.

Authors:  Li-Jung Liang; Robert E Weiss
Journal:  Biometrics       Date:  2007-09       Impact factor: 2.571

10.  Incorporating indel information into phylogeny estimation for rapidly emerging pathogens.

Authors:  Benjamin D Redelings; Marc A Suchard
Journal:  BMC Evol Biol       Date:  2007-03-14       Impact factor: 3.260

View more
  3 in total

1.  Does history repeat itself? Wavelets and the phylodynamics of influenza A.

Authors:  Jennifer A Tom; Janet S Sinsheimer; Marc A Suchard
Journal:  Mol Biol Evol       Date:  2011-12-08       Impact factor: 16.240

2.  Reuse, Recycle, Reweigh: Combating Influenza through Efficient Sequential Bayesian Computation for Massive Data.

Authors:  Jennifer A Tom; Janet S Sinsheimer; Marc A Suchard
Journal:  Ann Appl Stat       Date:  2010       Impact factor: 2.083

3.  EmpPrior: using outside empirical data to inform branch-length priors for Bayesian phylogenetics.

Authors:  John J Andersen; Bradley J Nelson; Jeremy M Brown
Journal:  BMC Bioinformatics       Date:  2016-06-24       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.