Literature DB >> 17456014

Phylogeny of mixture models: robustness of maximum likelihood and non-identifiable distributions.

Daniel Stefankovic1, Eric Vigoda.   

Abstract

We address phylogenetic reconstruction when the data is generated from a mixture distribution. Such topics have gained considerable attention in the biological community with the clear evidence of heterogeneity of mutation rates. In our work we consider data coming from a mixture of trees which share a common topology, but differ in their edge weights (i.e., branch lengths). We first show the pitfalls of popular methods, including maximum likelihood and Markov chain Monte Carlo algorithms. We then determine in which evolutionary models, reconstructing the tree topology, under a mixture distribution, is (im)possible. We prove that every model whose transition matrices can be parameterized by an open set of multilinear polynomials, either has non-identifiable mixture distributions, in which case reconstruction is impossible in general, or there exist linear tests which identify the topology. This duality theorem, relies on our notion of linear tests and uses ideas from convex programming duality. Linear tests are closely related to linear invariants, which were first introduced by Lake, and are natural from an algebraic geometry perspective.

Mesh:

Year:  2007        PMID: 17456014     DOI: 10.1089/cmb.2006.0126

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  6 in total

1.  Evolutionary medicine: A meaningful connection between omics, disease, and treatment.

Authors:  Mones Abu-Asab; Mohamed Chaouchi; Hakima Amri
Journal:  Proteomics Clin Appl       Date:  2008-02       Impact factor: 3.494

2.  Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies.

Authors:  Elchanan Mossel; Sebastien Roch
Journal:  J Math Biol       Date:  2012-08-09       Impact factor: 2.259

3.  Phylogenetic mixtures and linear invariants for equal input models.

Authors:  Marta Casanellas; Mike Steel
Journal:  J Math Biol       Date:  2016-09-07       Impact factor: 2.259

4.  Biomarkers in the age of omics: time for a systems biology approach.

Authors:  Mones S Abu-Asab; Mohamed Chaouchi; Salvatore Alesci; Susana Galli; Majid Laassri; Amrita K Cheema; Fouad Atouf; John VanMeter; Hakima Amri
Journal:  OMICS       Date:  2011-02-14

5.  Phylogenetic modeling of heterogeneous gene-expression microarray data from cancerous specimens.

Authors:  Mones S Abu-Asab; Mohamed Chaouchi; Hakima Amri
Journal:  OMICS       Date:  2008-09

6.  The space of phylogenetic mixtures for equivariant models.

Authors:  Marta Casanellas; Jesús Fernández-Sánchez; Anna M Kedzierska
Journal:  Algorithms Mol Biol       Date:  2012-11-28       Impact factor: 1.405

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.