| Literature DB >> 26579193 |
Abstract
Substitution models of evolution describe the process of genetic variation through fixed mutations and constitute the basis of the evolutionary analysis at the molecular level. Almost 40 years after the development of first substitution models, highly sophisticated, and data-specific substitution models continue emerging with the aim of better mimicking real evolutionary processes. Here I describe current trends in substitution models of DNA, codon and amino acid sequence evolution, including advantages and pitfalls of the most popular models. The perspective concludes that despite the large number of currently available substitution models, further research is required for more realistic modeling, especially for DNA coding and amino acid data. Additionally, the development of more accurate complex models should be coupled with new implementations and improvements of methods and frameworks for substitution model selection and downstream evolutionary analysis.Entities:
Keywords: model selection; molecular adaptation; molecular evolution; phylogenetics; phylogenomics; substitution model
Year: 2015 PMID: 26579193 PMCID: PMC4620419 DOI: 10.3389/fgene.2015.00319
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Substitution models implemented in relevant software for phylogenetic inference.
| PhyML | Inference | All | – | Blosum62, CpRev, Dayhoff, FLU, HIVb, HIVw, JTT, LG, Mtart, Mtmam, Mtrev, RtRev, VT, WAG +F | – | +I +G | Guindon et al., |
| CodonPhyML | Inference | – | GY94 | – | – | +G | Gil et al., |
| RAxML | Inference | JC, K80, HKY, GTR | Nt | Blosum62, CpRev, Dayhoff, DUMMY, FLU, HIVb, HIVw, JTT, JonesDCMUT, LG, Mtart, Mtmam, Mtrev, Mtzoa, PMB, RtRev, STMREV, VT, WAG +F | Partitioned models can be specified | +I +G | Stamatakis, |
| MEGA | Inference | JC, K2P, HKY, TN93, GTR | NG86 | CpRev, Dayhoff, JTT, LG, MtRev, RtRev, WAG | – | +I +G | Tamura et al., |
| Hyphy | Inference | All | GY94 | Dayhoff, HIVb, HIVw, JTT, Mtmam, Mtrev, RtRev, WAG +F | Partitioned models can be specified | +I +G | Pond et al., |
| PAML | Inference | All | GY94 | CpRev, Dayhoff, DayhoffDCMUT, Grantham, JTT, JonesDCMUT, LG, Miyata, Mtart, Mtmam, Mtrev24, Mtzoa, WAG +F | – | +I +G | Yang, |
| MrBayes and BEST | Inference | All | GY94 | Blosum62, CpRev, Dayhoff, Mtmam, Mtrev, RtRev, VT, WAG +F | Partitioned models can be specified | +I +G | Ronquist et al., |
| BEAST | Inference | JC, HKY, TN93, GTR | Nt | Blosum62, CpRev, Dayhoff, FLU, JTT, LG, Mtrev, WAG | – | +I +G | Bouckaert et al., |
| OmegaMap | Inference | – | NY98 | – | – | – | Wilson and McVean, |
| Lamarc | Inference | JC, K2P, F84, GTR | – | – | – | +G | Kuhner, |
| CodABC | Inference | – | GY94 | – | – | +I +G | Arenas et al., |
| MySSP | Simulation | All | – | – | – | +G | Rosenberg, |
| Seq-Gen | Simulation | All | Nt | Blosum62, CpRev, JTT, mtREV, PAM, and WAG +F | – | +I +G | Rambaut and Grassly, |
| indel-Seq-Gen | Simulation | All | Nt | Blosum62, CpRev, JTT, mtREV, PAM, WAG +F | – | +I +G | Strope et al., |
| INDELible | Simulation | All | GY94 | Blosum62, CpRev, Dayhoff, DayhoffDCMUT, HIVb, HIVw, JTT, JonesDCMUT, LG, Mtart, Mtmam, Mtrev24, RtRev, VT, WAG +F | – | +I +G | Fletcher and Yang, |
| EVOLVER | Simulation | All | GY94 | CpRev, Dayhoff, DayhoffDCMUT, Grantham, JTT, JonesDCMUT, LG, Miyata, Mtart, Mtmam, Mtrev24, Mtzoa, WAG +F | – | +I +G | Yang, |
| Recodon and NetRecodon | Simulation | All | GY94 | – | – | +I +G | Arenas and Posada, |
| ProteinEvolver | Simulation | All | – | 3D structural constraints, Blosum62, CpRev, Dayhoff, DayhoffDCMUT, HIVb, HIVw, JTT, JonesDCMUT, LG, Mtart, Mtmam, Mtrev24, RtRev, VT, WAG +F | – | +I +G | Arenas et al., |
| CoalEvol | Simulation | All | GY94 | Blosum62, CpRev, Dayhoff, DayhoffDCMUT, HIVb, HIVw, JTT, JonesDCMUT, LG, Mtart, Mtmam, Mtrev24, RtRev, VT, WAG +F | – | +I +G | Arenas and Posada, |
| SIMPROT | Simulation | – | – | PAM, JTT, PMB | – | +G | Pang et al., |
| PhyloSim | Simulation | All | GY94 | CpRev, JTT, JonesDCMUT, LG, Mtart, Mtmam, Mtrev24, Mtzoa, WAG +F | – | +I +G | Sipos et al., |
| π BUSS | Simulation | HKY, TN93, GTR | GY94, MG94 | BLOSUM, Dayhoff, LG, JTT, WAG +F | – | +I +G | Bielejec et al., |
| SGWE | Simulation | All | GY94 | Blosum62, CpRev, Dayhoff, DayhoffDCMUT, HIVb, HIVw, JTT, JonesDCMUT, LG, Mtart, Mtmam, Mtrev24, RtRev, VT, WAG +F | User-specified regions | +I +G | Arenas and Posada, |
| ALF | Simulation | F84, HKY, TN93, GTR | GY94 | Gonnet, JTT, LG, PAM, WAG +F | User-specified regions | +I +G | Dalquen et al., |
“Task” indicates if the program is oriented to perform evolutionary inference or simulation of molecular evolution. “Substitution model” indicates the implemented models of non-coding DNA evolution [“All” means that all the reversible nucleotide substitution models are considered, JC, …, GTR], codon models [“ECMs” indicates empirical codon models, MG94 model refers to Muse and Gaut (.
Coding sequences are simulated by nucleotide substitution models, avoiding stop codons.
dN/dS can vary across codons.
dN/dS can vary across branches.