| Literature DB >> 29423346 |
Stephanie J Spielman1, Sergei L Kosakovsky Pond1.
Abstract
We introduce LEISR (Likehood Estimation of Individual Site Rates, pronounced "laser"), a tool to infer relative evolutionary rates from protein and nucleotide data, implemented in HyPhy. LEISR is based on the popular Rate4Site (Pupko et al., 2002) approach for inferring relative site-wise evolutionary rates, primarily from protein data. We extend the original method for more general use in several key ways: (i) we increase the support for nucleotide data with additional models, (ii) we allow for datasets of arbitrary size, (iii) we support analysis of site-partitioned datasets to correct for the presence of recombination breakpoints, (iv) we produce rate estimates at all sites rather than at just a subset of sites, and (v) we implemented LEISR as MPI-enabled to support rapid, high-throughput analysis. LEISR is available in HyPhy starting with version 2.3.8, and it is accessible as an option in the HyPhy analysis menu ("Relative evolutionary rate inference"), which calls the HyPhy batchfile LEISR.bf.Entities:
Keywords: Comparative sequence analysis; Evolutionary rate; Phylogenetics; Protein evolution
Year: 2018 PMID: 29423346 PMCID: PMC5804317 DOI: 10.7717/peerj.4339
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Nucleotide and Protein models, both generalist and specialist, available for use in LEISR, as of HyPhy version 2.3.8.
Future HyPhy versions are expected to include more models. Users can alternatively define and fit other parametric and empirical models with the use of HBL, the HyPhy batch language.
| Data type | Models |
|---|---|
| Nucleotide | GTR ( |
| Protein, Generalist | LG ( |
| Protein, Specialist | mtMet ( |
Figure 1Mean R2 values (across 10 replicates) between inferred evolutionary rates across platforms and simulations.
Bars represent the standard error of the mean. Note that the y-axis of this figure begins at R2 = 0.85. All code to generate simulations and reproduce figures is available from https://github.com/sjspielman/leisr_validation.
Figure 2Inferred evolutionary rates for a single simulation replicate with 100 taxa.
The line shown in (A–C) is y = x. All code to generate simulations and reproduce figures is available from https://github.com/sjspielman/leisr_validation.
Figure 3Inferred evolutionary rates with LEISR and Rate4Site on an empirical alignment of mammalian HRH1 receptors.
The line shown is y = x. Code to infer rates and reproduce this figure is available from https://github.com/sjspielman/leisr_validation.