| Literature DB >> 33045094 |
Nicolas Rodrigue1, Thibault Latrille2, Nicolas Lartillot2.
Abstract
In recent years, codon substitution models based on the mutation-selection principle have been extended for the purpose of detecting signatures of adaptive evolution in protein-coding genes. However, the approaches used to date have either focused on detecting global signals of adaptive regimes-across the entire gene-or on contexts where experimentally derived, site-specific amino acid fitness profiles are available. Here, we present a Bayesian site-heterogeneous mutation-selection framework for site-specific detection of adaptive substitution regimes given a protein-coding DNA alignment. We offer implementations, briefly present simulation results, and apply the approach on a few real data sets. Our analyses suggest that the new approach shows greater sensitivity than traditional methods. However, more study is required to assess the impact of potential model violations on the method, and gain a greater empirical sense its behavior on a broader range of real data sets. We propose an outline of such a research program.Entities:
Keywords: Dirichlet process; Markov chain Monte Carlo; fitness landscape; nearly neutral evolution
Mesh:
Year: 2021 PMID: 33045094 PMCID: PMC7947879 DOI: 10.1093/molbev/msaa265
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Figure 1:Posterior distributions of ω (red, under MG-M0) and (blue, under MutSel-M0*) on simulated data sets with 10% of sites evolved under adaptive evolution (see Materials and Methods section).
Figure 2:Posterior distributions of ω (red, under MG-M0) and (blue, under MutSel-M0*) on -Globin, Adh, Vwf, Adora3, Rbp3, S1pr1 data sets (see Materials and Methods section).
Figure 3:Site-specific posterior probabilities of ω (red, under MG-M3) and (blue, under MutSel-M3*) being greater than 1 on data sets simulated under the pure mutation–selection framework.
Figure 4:Site-specific posterior probabilities of ω (red, under MG-M3) and (blue, under MutSel-M3*) being greater than 1 on data sets simulated with 30 sites (marked with at top of panels) under an adaptive regime, and the remaining 270 sites under the pure mutation–selection framework.
Figure 5:Site-specific posterior probabilities of ω (red, under MG-M3) and (blue, under MutSel-M3*) being greater than 1 on -Globin, Adh, and Vwf.
Figure 6:Site-specific posterior probabilities of ω (red, under MG-M3) and (blue, under MutSel-M3*) being greater than 1 on Adora3, Rbp3, and S1pr1.
Amino acid sites under positive selection.
| Data | Model | Sites |
|---|---|---|
| MG-M3 | 7, 11, 42, 48, 50 54, 67, 85, 123 | |
|
| ||
| MutSel-M3* | 7, 10, 11, | |
| MG-M3 | – | |
|
| ||
| MutSel-M3* | 9, 39, 49, 57, 68, 69, | |
| MG-M3 | – | |
|
| ||
| MutSel-M3* | 5, 9, 26, 41, | |
| MG-M3 | – | |
|
| ||
| MutSel-M3* | 2, | |
| MG-M3 | – | |
|
| ||
| MutSel-M3* | – | |
| MG-M3 | – | |
| S1 | ||
| MutSel-M3* |
|
Note.—Numbers in italic font are at the 0.9 level, in plain font at the 0.95 level, and in bold font at 0.99 level.