Literature DB >> 15878123

Identifying optimal incomplete phylogenetic data sets from sequence databases.

Changhui Yan1, J Gordon Burleigh, Oliver Eulenstein.   

Abstract

We introduce a new method for identifying optimal incomplete data sets from large sequence databases based on the graph theoretic concept of alpha-quasi-bicliques. The quasi-biclique method searches large sequence databases to identify useful phylogenetic data sets with a specified amount of missing data while maintaining the necessary amount of overlap among genes and taxa. The utility of the quasi-biclique method is demonstrated on large simulated sequence databases and on a data set of green plant sequences from GenBank. The quasi-biclique method greatly increases the taxon and gene sampling in the data sets while adding only a limited amount of missing data. Furthermore, under the conditions of the simulation, data sets with a limited amount of missing data often produce topologies nearly as accurate as those built from complete data sets. The quasi-biclique method will be an effective tool for exploiting sequence databases for phylogenetic information and also may help identify critical sequences needed to build large phylogenetic data sets.

Mesh:

Year:  2005        PMID: 15878123     DOI: 10.1016/j.ympev.2005.02.008

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  6 in total

1.  Exploring biological interaction networks with tailored weighted quasi-bicliques.

Authors:  Wen-Chieh Chang; Sudheer Vakati; Roland Krause; Oliver Eulenstein
Journal:  BMC Bioinformatics       Date:  2012-06-25       Impact factor: 3.169

2.  Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms.

Authors:  J Gordon Burleigh; Khidir W Hilu; Douglas E Soltis
Journal:  BMC Evol Biol       Date:  2009-03-17       Impact factor: 3.260

3.  OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.

Authors:  Fabian Schreiber; Kerstin Pick; Dirk Erpenbeck; Gert Wörheide; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2009-07-16       Impact factor: 3.169

4.  Mega-phylogeny approach for comparative biology: an alternative to supertree and supermatrix approaches.

Authors:  Stephen A Smith; Jeremy M Beaulieu; Michael J Donoghue
Journal:  BMC Evol Biol       Date:  2009-02-11       Impact factor: 3.260

5.  SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics.

Authors:  Béatrice Roure; Naiara Rodriguez-Ezpeleta; Hervé Philippe
Journal:  BMC Evol Biol       Date:  2007-02-08       Impact factor: 3.260

6.  Selecting informative subsets of sparse supermatrices increases the chance to find correct trees.

Authors:  Bernhard Misof; Benjamin Meyer; Björn Marcus von Reumont; Patrick Kück; Katharina Misof; Karen Meusemann
Journal:  BMC Bioinformatics       Date:  2013-12-03       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.