
Statistically based postprocessing of phylogenetic analysis by clustering.

Cara Stockham, Li-San Wang, Tandy Warnow.

Abstract

MOTIVATION: Phylogenetic analyses often produce thousands of candidate trees. Biologists typically resolve the conflict among them by computing a consensus of these trees. As postprocessing methods, single-tree consensus approaches can be unsatisfactory because of their inherent limitations.
RESULTS: In this paper we present an alternative approach that applies clustering algorithms to the set of candidate trees. We propose bicriterion problems, in particular using the concept of information loss, and new consensus trees, called characteristic trees, that minimize the information loss. Our empirical study using four biological datasets shows that our approach provides a significant improvement in information content while adding only a small amount of complexity. Furthermore, the consensus trees we obtain for each of our large clusters are more resolved than the single-tree consensus trees. We also report some initial progress on theoretical questions that arise in this context.
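
The idea of clustering candidate trees before taking a consensus can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the clustering algorithms, the information-loss measure, or how characteristic trees are built. The sketch assumes rooted trees encoded as sets of nontrivial clades, uses the Robinson-Foulds distance, and applies a plain average-linkage agglomerative procedure; the function names and the toy trees are hypothetical.

```python
from itertools import combinations


def clade(*taxa):
    """Hypothetical helper: a clade is the set of taxa below one internal node."""
    return frozenset(taxa)


def rf_distance(t1, t2):
    """Robinson-Foulds distance: number of clades present in exactly one tree."""
    return len(t1 ^ t2)


def strict_consensus(trees):
    """Strict consensus: the clades shared by every tree in the collection."""
    return frozenset.intersection(*trees)


def agglomerative(trees, k):
    """Merge the two closest clusters (average linkage) until k clusters remain.

    Assumption: average linkage is chosen for illustration only.
    """
    clusters = [[i] for i in range(len(trees))]

    def linkage(a, b):
        total = sum(rf_distance(trees[i], trees[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > k:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


# Toy candidate trees on taxa A-E (nontrivial clades only; made-up data).
trees = [
    frozenset({clade("A", "B"), clade("A", "B", "C"), clade("D", "E")}),
    frozenset({clade("A", "B"), clade("A", "B", "C"), clade("D", "E")}),
    frozenset({clade("A", "B"), clade("A", "B", "C"), clade("A", "B", "C", "D")}),
    frozenset({clade("C", "D"), clade("C", "D", "E"), clade("A", "B")}),
    frozenset({clade("C", "D"), clade("C", "D", "E"), clade("A", "B")}),
]

clusters = agglomerative(trees, k=2)
overall = strict_consensus(trees)
per_cluster = [strict_consensus([trees[i] for i in c]) for c in clusters]

print("clusters:", clusters)
print("clades in single consensus:", len(overall))
print("clades in per-cluster consensus:", [len(c) for c in per_cluster])
```

In this toy input the single strict consensus keeps fewer clades than either per-cluster consensus, which mirrors the abstract's claim that per-cluster consensus trees are more resolved.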


Year:  2002        PMID: 12169558     DOI: 10.1093/bioinformatics/18.suppl_1.s285

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


