| Literature DB >> 31028399 |
Frédéric Lemoine1,2, Damien Correia1,3,4, Vincent Lefort3, Olivia Doppelt-Azeroual2, Fabien Mareuil2, Sarah Cohen-Boulakia4, Olivier Gascuel1,3.
Abstract
Phylogeny.fr, created in 2008, has been designed to facilitate the execution of phylogenetic workflows, and is nowadays widely used. However, since its development, user needs have evolved, new tools and workflows have been published, and the number of jobs has increased dramatically, thus promoting new practices, which motivated its refactoring. We developed NGPhylogeny.fr to be more flexible in terms of tools and workflows, easily installable, and more scalable. It integrates numerous tools in their latest version (e.g. TNT, FastME, MrBayes, etc.) as well as new ones designed in the last ten years (e.g. PhyML, SMS, FastTree, trimAl, BOOSTER, etc.). These tools cover a large range of usage (sequence searching, multiple sequence alignment, model selection, tree inference and tree drawing) and a large panel of standard methods (distance, parsimony, maximum likelihood and Bayesian). They are integrated in workflows, which have been already configured ('One click'), can be customized ('Advanced'), or are built from scratch ('A la carte'). Workflows are managed and run by an underlying Galaxy workflow system, which makes workflows more scalable in terms of number of jobs and size of data. NGPhylogeny.fr is deployable on any server or personal computer, and is freely accessible at https://ngphylogeny.fr.Entities:
Mesh:
Year: 2019 PMID: 31028399 PMCID: PMC6602494 DOI: 10.1093/nar/gkz303
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of tools currently integrated in NGPhylogeny.fr
| Step | Tool name | New | Version | Dataset size ability | ‘ | ‘ | ‘ | ‘ |
|---|---|---|---|---|---|---|---|---|
| MSA | Clustal Ω ( | Yes | 1.2.4.1 | Very large | ✓ | ✓ | ||
| MSA | MAFFT ( | Yes | 7.407 | Large | ✓ | ✓ | ✓ | ✓ |
| MSA | MUSCLE ( | Up | 3.8.37 | Medium | ✓ | ✓ | ||
| AC | Gblocks ( | - | 0.91b | Very large | ✓ | ✓ | ||
| AC | trimAl ( | Yes | 1.4.1 | Very large | ✓ | ✓ | ||
| AC | BMGE ( | Yes | 1.12 | Medium | ✓ | ✓ | ✓ | ✓ |
| AC | Noisy ( | Yes | 1.5.12.1 | Small | ✓ | ✓ | ||
| TI (Fast max-likelihood) | FastTree ( | Yes | 2.1.10 | Very large | ✓ | ✓ | ✓ | ✓ |
| TI (Distance) | FastME ( | Yes | 2.1.6.1 | Large | ✓ | ✓ | ✓ | ✓ |
| TI (Parsimony) | TNT ( | Yes | 1.5.0a | Large | ✓ | ✓ | ||
| TI (Max-likelihood) | PhyML ( | Up | 3.1 | Medium | ✓ | ✓ | ✓ | ✓ |
| TI (PhyML+MS) | PhyML ( | Yes | 1.8.1 | Medium | ✓ | ✓ | ✓ | ✓ |
| TI (Bayesian) | MrBayes ( | Yes | 3.2.6 | Small | ✓ | ✓ | ||
| TV | Newick Utilities ( | Yes | 1.6 | Large | ✓ | ✓ | ✓ | ✓ |
| BS | BOOSTER ( | Yes | 0.2.4 | Large | ✓ | ✓ | ✓ |
Step: MSA for multiple sequence alignment, AC for alignment curation, TI for tree inference, TV for tree visualization, BS for branch support, and MS for model selection. New: Yes for new tools, Up for updated tools and - for tools already present in Phylogeny.fr. Dataset size ability: dataset dimension able to be analyzed by each tool, very large (typically >10 000 sequences), large (>5000), medium (>1000), small (≤1000). ‘One click’, ‘Advanced’, ‘A la carte’ and ‘Stand-alone’: tools that are available in each run mode.
Figure 1.Analysis of human TRIM5α protein with Blast-Search and PhyML+SMS ‘One click’ workflow. (A) NGPhylogeny.fr workflow monitoring page. All workflow steps are listed and their status indicated as pending, running, finished successfully (green check mark) or with error. For each result, a link to a dedicated viewer is provided. (B) Multiple sequence alignment visualization. The multiple sequence alignment resulting of the cleaning step can be visualized dynamically in the browser. (C) Phylogenetic tree visualization. Human, Chimpanzee, Gorilla, Orangutan and Gibbon sequences are branched as expected. The blue arrow indicates the user input sequence.