Literature DB >> 12801879

The small-world dynamics of tree networks and data mining in phyloinformatics.

William H Piel1, Michael J Sanderson, Michael J Donoghue.   

Abstract

MOTIVATION: A noble and ultimate objective of phyloinformatic research is to assemble, synthesize, and explore the evolutionary history of life on earth. Data mining methods for performing these tasks are not yet well developed, but one avenue of research suggests that network connectivity dynamics will play an important role in future methods. Analysis of disordered networks, such as small-world networks, has applications as diverse as disease propagation, collaborative networks, and power grids. Here we apply similar analyses to networks of phylogenetic trees in order to understand how synthetic information can emerge from a database of phylogenies.
RESULTS: Analyses of tree network connectivity in TreeBASE show that a collection of phylogenetic trees behaves as a small-world network-while on the one hand the trees are clustered, like a non-random lattice, on the other hand they have short characteristic path lengths, like a random graph. Tree connectivities follow a dual-scale power-law distribution (first power-law exponent approximately 1.87; second approximately 4.82). This unusual pattern is due, in part, to the presence of alternative tree topologies that enter the database with each published study. As expected, small collections of trees decrease connectivity as new trees are added, while large collections of trees increase connectivity. However, the inflection point is surprisingly low: after about 600 trees the network suddenly jumps to a higher level of coherence. More stringent definitions of 'neighbour' greatly delay the threshold whence a database achieves sufficient maturity for a coherent network to emerge. However, more stringent definitions of 'neighbour' would also likely show improved focus in data mining. AVAILABILITY: http://treebase.org

Entities:  

Mesh:

Year:  2003        PMID: 12801879     DOI: 10.1093/bioinformatics/btg131

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  The net of life: reconstructing the microbial phylogenetic network.

Authors:  Victor Kunin; Leon Goldovsky; Nikos Darzentas; Christos A Ouzounis
Journal:  Genome Res       Date:  2005-06-17       Impact factor: 9.043

Review 2.  Taking the first steps towards a standard for reporting on phylogenies: Minimum Information About a Phylogenetic Analysis (MIAPA).

Authors:  Jim Leebens-Mack; Todd Vision; Eric Brenner; John E Bowers; Steven Cannon; Mark J Clement; Clifford W Cunningham; Claude dePamphilis; Rob deSalle; Jeff J Doyle; Jonathan A Eisen; Xun Gu; John Harshman; Robert K Jansen; Elizabeth A Kellogg; Eugene V Koonin; Brent D Mishler; Hervé Philippe; J Chris Pires; Yin-Long Qiu; Seung Y Rhee; Kimmen Sjölander; Douglas E Soltis; Pamela S Soltis; Dennis W Stevenson; Kerr Wall; Tandy Warnow; Christian Zmasek
Journal:  OMICS       Date:  2006

3.  maxAlike: maximum likelihood-based sequence reconstruction with application to improved primer design for unknown sequences.

Authors:  Peter Menzel; Peter F Stadler; Jan Gorodkin
Journal:  Bioinformatics       Date:  2010-12-01       Impact factor: 6.937

4.  Advancing data reuse in phyloinformatics using an ontology-driven Semantic Web approach.

Authors:  Maryam Panahiazar; Amit P Sheth; Ajith Ranabahu; Rutger A Vos; Jim Leebens-Mack
Journal:  BMC Med Genomics       Date:  2013-11-11       Impact factor: 3.063

5.  TBMap: a taxonomic perspective on the phylogenetic database TreeBASE.

Authors:  Roderic D M Page
Journal:  BMC Bioinformatics       Date:  2007-05-18       Impact factor: 3.169

6.  Fast structural search in phylogenetic databases.

Authors:  Jason T L Wang; Huiyuan Shan; Dennis Shasha; William H Piel
Journal:  Evol Bioinform Online       Date:  2007-02-20       Impact factor: 1.625

7.  PhyloFinder: an intelligent search engine for phylogenetic tree databases.

Authors:  Duhong Chen; J Gordon Burleigh; Mukul S Bansal; David Fernández-Baca
Journal:  BMC Evol Biol       Date:  2008-03-21       Impact factor: 3.260

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.