BACKGROUND: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. RESULTS: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other. CONCLUSION: PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However any workflow that relies on phylogenetic tree analysis, could be automated with PhyloPattern.
BACKGROUND: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. RESULTS: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other. CONCLUSION:PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However any workflow that relies on phylogenetic tree analysis, could be automated with PhyloPattern.
Authors: Anthony Levasseur; Ludovic Orlando; Xavier Bailly; Michel C Milinkovitch; Etienne G J Danchin; Pierre Pontarotti Journal: Biol Rev Camb Philos Soc Date: 2007-11
Authors: Jue Ruan; Heng Li; Zhongzhong Chen; Avril Coghlan; Lachlan James M Coin; Yiran Guo; Jean-Karim Hériché; Yafeng Hu; Karsten Kristiansen; Ruiqiang Li; Tao Liu; Alan Moses; Junjie Qin; Søren Vang; Albert J Vilella; Abel Ureta-Vidal; Lars Bolund; Jun Wang; Richard Durbin Journal: Nucleic Acids Res Date: 2007-12-01 Impact factor: 16.971
Authors: Anthony Levasseur; Julien Paganini; Jacques Dainat; Julie D Thompson; Olivier Poch; Pierre Pontarotti; Philippe Gouret Journal: Evol Bioinform Online Date: 2012-08-01 Impact factor: 1.625
Authors: Julien Paganini; Amandine Campan-Fournier; Martine Da Rocha; Philippe Gouret; Pierre Pontarotti; Eric Wajnberg; Pierre Abad; Etienne G J Danchin Journal: PLoS One Date: 2012-11-30 Impact factor: 3.240
Authors: Caroline Cayrou; Bernard Henrissat; Philippe Gouret; Pierre Pontarotti; Michel Drancourt Journal: BMC Microbiol Date: 2012-12-18 Impact factor: 3.605
Authors: Mohamed Sassi; Philippe Gouret; Olivier Chabrol; Pierre Pontarotti; Michel Drancourt Journal: Biol Direct Date: 2014-09-15 Impact factor: 4.540