Literature DB >> 15059836

A weighted least-squares approach for inferring phylogenies from incomplete distance matrices.

Vladimir Makarenkov1, François-Joseph Lapointe.   

Abstract

MOTIVATION: The problem of phylogenetic inference from datasets including incomplete or uncertain entries is among the most relevant issues in systematic biology. In this paper, we propose a new method for reconstructing phylogenetic trees from partial distance matrices. The new method combines the usage of the four-point condition and the ultrametric inequality with a weighted least-squares approximation to solve the problem of missing entries. It can be applied to infer phylogenies from evolutionary data including some missing or uncertain information, for instance, when observed nucleotide or protein sequences contain gaps or missing entries.
RESULTS: In a number of simulations involving incomplete datasets, the proposed method outperformed the well-known Ultrametric and Additive procedures. Generally, the new method also outperformed all the other competing approaches including Triangle and Fitch which is the most popular least-squares method for reconstructing phylogenies. We illustrate the usefulness of the introduced method by analyzing two well-known phylogenies derived from complete mammalian mtDNA sequences. Some interesting theoretical results concerning the NP-hardness of the ordinary and weighted least-squares fitting of a phylogenetic tree to a partial distance matrix are also established. AVAILABILITY: The T-Rex package including this method is freely available for download at http://www.info.uqam.ca/~makarenv/trex.html

Entities:  

Mesh:

Year:  2004        PMID: 15059836     DOI: 10.1093/bioinformatics/bth211

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  A hierarchical model for incomplete alignments in phylogenetic inference.

Authors:  Fuxia Cheng; Stefanie Hartmann; Mayetri Gupta; Joseph G Ibrahim; Todd J Vision
Journal:  Bioinformatics       Date:  2009-01-15       Impact factor: 6.937

2.  T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks.

Authors:  Alix Boc; Alpha Boubacar Diallo; Vladimir Makarenkov
Journal:  Nucleic Acids Res       Date:  2012-06-06       Impact factor: 16.971

3.  Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human.

Authors:  Leo Goodstadt; Chris P Ponting
Journal:  PLoS Comput Biol       Date:  2006-09-29       Impact factor: 4.475

4.  Using ESTs for phylogenomics: can one accurately infer a phylogenetic tree from a gappy alignment?

Authors:  Stefanie Hartmann; Todd J Vision
Journal:  BMC Evol Biol       Date:  2008-03-26       Impact factor: 3.260

5.  ANItools web: a web tool for fast genome comparison within multiple bacterial strains.

Authors:  Na Han; Yujun Qiang; Wen Zhang
Journal:  Database (Oxford)       Date:  2016-06-05       Impact factor: 3.451

6.  Machine learning based imputation techniques for estimating phylogenetic trees from incomplete distance matrices.

Authors:  Ananya Bhattacharjee; Md Shamsuzzoha Bayzid
Journal:  BMC Genomics       Date:  2020-07-20       Impact factor: 3.969

7.  A new effective method for estimating missing values in the sequence data prior to phylogenetic analysis.

Authors:  Abdoulaye Baniré Diallo; François-Joseph Lapointe; Vladimir Makarenkov
Journal:  Evol Bioinform Online       Date:  2007-02-01       Impact factor: 1.625

8.  Fast NJ-like algorithms to deal with incomplete distance matrices.

Authors:  Alexis Criscuolo; Olivier Gascuel
Journal:  BMC Bioinformatics       Date:  2008-03-26       Impact factor: 3.169

9.  DV-curve representation of protein sequences and its application.

Authors:  Wei Deng; Yihui Luan
Journal:  Comput Math Methods Med       Date:  2014-05-08       Impact factor: 2.238

10.  Native American gene flow into Polynesia predating Easter Island settlement.

Authors:  Alexander G Ioannidis; Javier Blanco-Portillo; Karla Sandoval; Erika Hagelberg; Juan Francisco Miquel-Poblete; J Víctor Moreno-Mayar; Juan Esteban Rodríguez-Rodríguez; Consuelo D Quinto-Cortés; Kathryn Auckland; Tom Parks; Kathryn Robson; Adrian V S Hill; María C Avila-Arcos; Alexandra Sockell; Julian R Homburger; Genevieve L Wojcik; Kathleen C Barnes; Luisa Herrera; Soledad Berríos; Mónica Acuña; Elena Llop; Celeste Eng; Scott Huntsman; Esteban G Burchard; Christopher R Gignoux; Lucía Cifuentes; Ricardo A Verdugo; Mauricio Moraga; Alexander J Mentzer; Carlos D Bustamante; Andrés Moreno-Estrada
Journal:  Nature       Date:  2020-07-08       Impact factor: 49.962

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.