| Literature DB >> 18424797 |
A Dereeper1, V Guignon, G Blanc, S Audic, S Buffet, F Chevenet, J-F Dufayard, S Guindon, V Lefort, M Lescot, J-M Claverie, O Gascuel.
Abstract
Phylogenetic analyses are central to many research areas in biology and typically involve the identification of homologous sequences, their multiple alignment, the phylogenetic reconstruction and the graphical representation of the inferred tree. The Phylogeny.fr platform transparently chains programs to automatically perform these tasks. It is primarily designed for biologists with no experience in phylogeny, but can also meet the needs of specialists; the first ones will find up-to-date tools chained in a phylogeny pipeline to analyze their data in a simple and robust way, while the specialists will be able to easily build and run sophisticated analyses. Phylogeny.fr offers three main modes. The 'One Click' mode targets non-specialists and provides a ready-to-use pipeline chaining programs with recognized accuracy and speed: MUSCLE for multiple alignment, PhyML for tree building, and TreeDyn for tree rendering. All parameters are set up to suit most studies, and users only have to provide their input sequences to obtain a ready-to-print tree. The 'Advanced' mode uses the same pipeline but allows the parameters of each program to be customized by users. The 'A la Carte' mode offers more flexibility and sophistication, as users can build their own pipeline by selecting and setting up the required steps from a large choice of tools to suit their specific needs. Prior to phylogenetic analysis, users can also collect neighbors of a query sequence by running BLAST on general or specialized databases. A guide tree then helps to select neighbor sequences to be used as input for the phylogeny pipeline. Phylogeny.fr is available at: http://www.phylogeny.fr/Entities:
Mesh:
Year: 2008 PMID: 18424797 PMCID: PMC2447785 DOI: 10.1093/nar/gkn180
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Main tools available on Phylogeny.fr
| Program | Function | Input | Output | Speed | Current limitations | Use |
|---|---|---|---|---|---|---|
| Blastall 2.2.17 | Sequence searching | Raw, FASTA | FASTA | Fast | None | Advanced and A la Carte |
| MUSCLE 3.6 | Multiple alignment | FASTA, EMBL/Uniprot, GenBank | FASTA, Clustal, PHYLIP | Fast | <200 sequences | All modes Large dataset |
| T-Coffee 4.97 | Multiple alignment | FASTA, EMBL/Uniprot, GenBank | FASTA, Clustal, PHYLIP, others | Very slow | <50 sequences < 2000 sites | A la Carte Small dataset |
| 3DCoffee 4.97 | Multiple alignment using structural information | FASTA, EMBL/Uniprot, GenBank | FASTA, Clustal, PHYLIP, others | Very slow | <50 sequences <2000 sites | A la Carte Small dataset |
| ClustalW 1.83 | Multiple alignment | FASTA, EMBL/Uniprot, GenBank | FASTA, Clustal, PHYLIP | Fast | <200 sequences | A la Carte Large dataset |
| Gblocks 0.91b | Alignment refinement | FASTA | FASTA, Clustal, PHYLIP | Fast | None | All modes Large dataset |
| PhyML 3.0 | Phylogeny using maximum likelihood | FASTA, Clustal, PHYLIP, EMBL, PAUP*/Nexus | Newick | Fast to slow | (sequence size) × (number of taxa)2 < 10 000 000 | All modes Medium to large dataset |
| NONA 2.0 | DNA phylogeny using parsimony | FASTA, Clustal, PHYLIP, EMBL, PAUP*/Nexus | Newick | Fast to slow | (sequence size) × (number of taxa)2 < 10 000 000 | A la Carte Medium to large datasets |
| Protpars 3.66 | Protein phylogeny using parsimony | FASTA, Clustal, PHYLIP, EMBL, PAUP*/Nexus | Newick | Very slow | (sequence size) × (number of taxa)2 < 10 000 000 | A la Carte Small to medium dataset |
| Dnadist Protdist BioNJ NJ PHYLYP 3.66 | Phylogeny using distances | FASTA, Clustal, PHYLIP, EMBL, PAUP*/Nexus | Newick | Fast | None | A la Carte Large dataset |
| Bootstrap with PhyML, NONA, Protpars | Estimations of clade supports | FASTA, Clustal, PHYLIP, EMBL, PAUP*/Nexus | Newick | Very slow | <100 replicates | Advanced and A la Carte Small to medium datasets |
| Bootstrap with distance methods | Estimations of clade supports | FASTA, Clustal, PHYLIP, EMBL, PAUP*/Nexus | Newick | Slow | <500 replicates | A la Carte Medium to large datasets |
| TreeDyn 198 | Tree rendering | Newick | PNG, PDF, TGF | Fast | None | All modes |
| Drawgram 3.66 | Various tree shapes rendering | Newick | PNG, PDF | Fast | None | A la Carte |
| Drawtree 3.66 | Unrooted tree rendering | Newick | PNG, PDF | Fast | None | A la Carte |
With fast programs Phylogeny.fr displays the results within a few minutes or less. Slow programs require about one hour or so, but their results are still displayed on-line, while the results of very slow programs are sent by email.