Literature DB >> 12446820

Combining multiple data sets in a likelihood analysis: which models are the best?

Tal Pupko1, Dorothée Huchon, Ying Cao, Norihiro Okada, Masami Hasegawa.   

Abstract

Until recently, phylogenetic analyses have been routinely based on homologous sequences of a single gene. Given the vast number of gene sequences now available, phylogenetic studies are now based on the analysis of multiple genes. Thus, it has become necessary to devise statistical methods to combine multiple molecular data sets. Here, we compare several models for combining different genes for the purpose of evaluating the likelihood of tree topologies. Three methods of branch length estimation were studied: assuming all genes have the same branch lengths (concatenate model), assuming that branch lengths are proportional among genes (proportional model), or assuming that each gene has a separate set of branch lengths (separate model). We also compared three models of among-site rate variation: the homogenous model, a model that assumes one gamma parameter for all genes, and a model that assumes one gamma parameter for each gene. On the basis of two nuclear and one mitochondrial amino acid data sets, our results suggest that, depending on the data set chosen, either the separate model or the proportional model represents the most appropriate method for branch length analysis. For all the data sets examined, one gamma parameter for each gene represents the best model for among-site rate variation. Using these models we analyzed alternative mammalian tree topologies, and we describe the effect of the assumed model on the maximum likelihood tree. We show that the choice of the model has an impact on the best phylogeny obtained.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 12446820     DOI: 10.1093/oxfordjournals.molbev.a004053

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  13 in total

1.  Incorporating gene-specific variation when inferring and evaluating optimal evolutionary tree topologies from multilocus sequence data.

Authors:  Tae-Kun Seo; Hirohisa Kishino; Jeffrey L Thorne
Journal:  Proc Natl Acad Sci U S A       Date:  2005-03-11       Impact factor: 11.205

Review 2.  New methods for inferring population dynamics from microbial sequences.

Authors:  Marcos Pérez-Losada; Megan L Porter; Loubna Tazi; Keith A Crandall
Journal:  Infect Genet Evol       Date:  2006-04-19       Impact factor: 3.342

Review 3.  The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation.

Authors:  Andrew J Roger; Laura A Hug
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2006-06-29       Impact factor: 6.237

4.  Information Criteria for Comparing Partition Schemes.

Authors:  Tae-Kun Seo; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2018-07-01       Impact factor: 15.683

5.  The convergence of carbohydrate active gene repertoires in human gut microbes.

Authors:  Catherine A Lozupone; Micah Hamady; Brandi L Cantarel; Pedro M Coutinho; Bernard Henrissat; Jeffrey I Gordon; Rob Knight
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-19       Impact factor: 11.205

6.  Molecular phylogeny of the higher and lower taxonomy of the Fusarium genus and differences in the evolutionary histories of multiple genes.

Authors:  Maiko Watanabe; Takahiro Yonezawa; Ken-ichi Lee; Susumu Kumagai; Yoshiko Sugita-Konishi; Keiichi Goto; Yukiko Hara-Kudo
Journal:  BMC Evol Biol       Date:  2011-11-03       Impact factor: 3.260

7.  A genomic approach to examine the complex evolution of laurasiatherian mammals.

Authors:  Björn M Hallström; Adrian Schneider; Stefan Zoller; Axel Janke
Journal:  PLoS One       Date:  2011-12-02       Impact factor: 3.240

8.  Phylogeny of the Centrohelida inferred from SSU rRNA, tubulins, and actin genes.

Authors:  Miako Sakaguchi; Takeshi Nakayama; Tetsuo Hashimoto; Isao Inouye
Journal:  J Mol Evol       Date:  2005-10-06       Impact factor: 3.973

9.  Phylogenetic relationships of typical antbirds (Thamnophilidae) and test of incongruence based on Bayes factors.

Authors:  Martin Irestedt; Jon Fjeldså; Johan A A Nylander; Per G P Ericson
Journal:  BMC Evol Biol       Date:  2004-07-30       Impact factor: 3.260

10.  Fast and accurate branch lengths estimation for phylogenomic trees.

Authors:  Manuel Binet; Olivier Gascuel; Celine Scornavacca; Emmanuel J P Douzery; Fabio Pardi
Journal:  BMC Bioinformatics       Date:  2016-01-07       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.