| Literature DB >> 31015974 |
Fangluan Gao1,2, Chengjie Chen3, Daej A Arab2, Zhenguo Du1, Yehua He3, Simon Y W Ho2.
Abstract
The genomic signatures of positive selection and evolutionary constraints can be detected by analyses of nucleotide sequences. One of the most widely used programs for this purpose is CodeML, part of the PAML package. Although a number of bioinformatics tools have been developed to facilitate the use of CodeML, these have various limitations. Here, we present a wrapper tool named EasyCodeML that provides a user-friendly graphical interface for using CodeML. EasyCodeML has a custom running mode in which parameters can be adjusted to meet different requirements. It also offers a preset running mode in which an evolutionary analysis pipeline and publication-quality tables can be exported by a single click. EasyCodeML allows visualized, interactive tree labelling, which greatly simplifies the use of the branch, branch-site, and clade models of selection. The program allows comparison of major codon-based models for analyses of selection. EasyCodeML is a stand-alone package that is supported in Windows, Mac, and Linux operating systems, and is freely available at https://github.com/BioEasy/EasyCodeML.Entities:
Keywords: CodeML; codon‐based models; likelihood‐ratio test; molecular evolution; positive selection
Year: 2019 PMID: 31015974 PMCID: PMC6467853 DOI: 10.1002/ece3.5015
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Comparison of features in EasyCodeML and other tools
| Key features | IDEA | pamlX | IMPACT_S | LMAP | BlastPhyMe | EasyCodeML |
|---|---|---|---|---|---|---|
| Supported codon models | ||||||
| Branch model | × | ✓ | × | ✓ | ✓ | ✓ |
| Branch‐site model | × | ✓ | × | ✓ | ✓ | ✓ |
| Site model | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Clade model | × | ✓ | × | ✓ | ✓ | ✓ |
| LRT automatically performed | × | × | ✓ | ✓ | ✓ | ✓ |
| Visual labelling of tree by one click | × | × | × | × | × | ✓ |
| Customizing control files | × | ✓ | ✓ | ✓ | × | ✓ |
| Exporting preformatted table | × | × | × | ✓ | ✓ | ✓ |
| Multithreading | × | × | × | ✓ | ✓ | ✓ |
| Drag‐and‐drop functionality | × | ✓ | × | × | × | ✓ |
aOnly a few codon‐based models available. bMaldonado et al, 2016, https://doi.org/10.1186/s12859-016-1204-5. cSchott et al, 2016, http://dx.doi.org/10.1101/059881.
Figure 1Screenshot of the main interface of EasyCodeML under the (a) preset and (b) custom running modes. In the preset mode, all key parameters of the nested models are built‐in and there is a pipeline from data input to the output of results. In the custom mode, the parameters of any codon‐based model can be modified to meet the requirements of the user
Codon‐based models available in EasyCodeML
| Codon‐based models | Running mode | Nested models (null vs. alternative) | |
|---|---|---|---|
| Preset | Custom | ||
| Site models | |||
| M0 (one‐ratio) | ✓ | ✓ | M3 versus M0 |
| M1 (nearly neutral) | ✓ | ✓ | M1a versus M2a |
| M2a (positive selection) | ✓ | ✓ | M7 versus M8 |
| M3 (discrete) | ✓ | ✓ | M8a versus M8 |
| M7 (beta) | ✓ | ✓ | |
| M8 (beta and | ✓ | ✓ | |
| M8a (beta and | ✓ | ✓ | |
| Branch model | |||
| One‐ratio model (M0) | ✓ | ✓ | M0 versus BM |
| Two‐ratio model (BM) | ✓ | ✓ | M0 versus FM |
| Free‐ratio model (FM) | × | ✓ | |
| Branch‐site models | |||
| Model Anull | ✓ | ✓ | Model Anull versus Model A |
| Model A | ✓ | ✓ | |
| Clade models | |||
| M2a_rel | ✓ | ✓ | M2a_rel versus CmC |
| CmC | ✓ | ✓ | |
The M0–M3 comparison does not allow detection of positive selection.
Figure 2Labelling branches in a tree for the branch‐related models can be done in a simple and intuitive way for the (a) clade models and (b) branch and branch‐site models
Figure 3Two utilities available in EasyCodeML: (a) the LRT calculator, and Seqformat convertor in (b) a user‐friendly GUI or (c) command line. Seqformat convertor can convert between diverse types of sequence data formats
Example of a publication‐quality table created by the export module in EasyCodeML, based on a comparison of site models for the ECP‐EDN gene family from primates
| Site model | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Model | np | Ln L | Estimates of parameters | Models compared | LRT | Positively selected sites | |||
| M3 | 15 | −1,876.512700 |
| 0.94718 | 0.04809 | 0.00473 | M0 versus M3 | 0.00E+00 | [] |
|
| 0.07365 | 8.33208 | 87.60345 | ||||||
| M0 | 11 | −1,915.094842 |
| 0.30409 | Not allowed | ||||
| M2a | 14 | −1,877.941799 |
| 0.81874 | 0.15483 | 0.02642 | M1a versus M2a | 4.68E−07 | [] |
|
| 0.00000 | 1.00000 | 25.64346 | ||||||
| M1a | 12 | −1,892.515966 |
| 0.82898 | 0.17102 | Not allowed | |||
|
| 0.00000 | 1.00000 | |||||||
| M8 | 14 | −1,878.735192 |
|
|
| M7 versus M8 | 9.00E−09 | 14 L 0.603, 133 G 0.961*, 228 S 0.972*, 230 F 0.996**, 231 V 0.993**, 232 S 0.567, 233 K 0.881, 235 D 0.953*, 236 G 0.996**, 237 G 0.973*, 238 R 0.999**, 239 Y 0.924, 279 N 0.877, 354 K 0.997** | |
| ( |
| ||||||||
| M7 | 12 | −1,897.250798 |
|
| Not allowed | ||||
[], no data available.