| Literature DB >> 27744408 |
Nicolas Rodrigue1, Nicolas Lartillot2.
Abstract
Codon substitution models have traditionally attempted to uncover signatures of adaptation within protein-coding genes by contrasting the rates of synonymous and non-synonymous substitutions. Another modeling approach, known as the mutation-selection framework, attempts to explicitly account for selective patterns at the amino acid level, with some approaches allowing for heterogeneity in these patterns across codon sites. Under such a model, substitutions at a given position occur at the neutral or nearly neutral rate when they are synonymous, or when they correspond to replacements between amino acids of similar fitness; substitutions from high to low (low to high) fitness amino acids have comparatively low (high) rates. Here, we study the use of such a mutation-selection framework as a null model for the detection of adaptation. Following previous works in this direction, we include a deviation parameter that has the effect of capturing the surplus, or deficit, in non-synonymous rates, relative to what would be expected under a mutation-selection modeling framework that includes a Dirichlet process approach to account for across-codon-site variation in amino acid fitness profiles. We use simulations, along with a few real data sets, to study the behavior of the approach, and find it to have good power with a low false-positive rate. Altogether, we emphasize the potential of recent mutation-selection models in the detection of adaptation, calling for further model refinements as well as large-scale applications.Entities:
Keywords: Dirichlet process; Markov chain Monte Carlo; epistasis; fitness landscape; nearly neutral evolution
Mesh:
Substances:
Year: 2016 PMID: 27744408 PMCID: PMC5854120 DOI: 10.1093/molbev/msw220
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1Posterior distributions of ω (for MG model, panels A, B, and C) and (for MutSelYN model in D, E, and F, and MutSelDP in G, H, and I) using three different sets of amino acid profiles (those obtained from SAMHD1 in left panels, those obtained from TRIM5α in middle panels, and those obtained from BRCA1 in right panels). Three simulation conditions were used: the nearly neutral regime (green), an adaptive regime (red) and an epistatic regime (blue).
Posterior Means and 95% Credibility Intervals, in Parentheses, of ω (with the MG model) and (with MutSelYN and MutSelDP models) on Six Mammalian Genes is shown in square brackets.
| Data | MG | MutSelYN | MutSelDP |
|---|---|---|---|
| S1pr1-67-325 | 0.049 (0.042, 0.055) | 0.058 (0.051, 0.065) | 0.681 (0.538, 0.857) [0.001] |
| Rbp3-54-412 | 0.190 (0.177, 0.203) | 0.193 (0.181, 0.206) | 0.726 (0.654, 0.806) [0.000] |
| Vwf-62-392 | 0.205 (0.188, 0.220) | 0.212 (0.199, 0.226) | 0.960 (0.865, 1.063) [0.220] |
| Samhd1-67-543 | 0.309 (0.288, 0.332) | 0.324 (0.300, 0.348) | |
| Trim5 | 0.454 (0.426, 0.484) | 0.468 (0.439, 0.498) | |
| Brca1-64-941 | 0.783 (0.750, 0.817) | 0.802 (0.770, 0.837) |
Note.—With the MutSelDP model, the posterior probability is shown in square brackets.
Fig. 2Summary of posterior means of 100 replicates for each boxplot. Results for simulations under the nearly-neutral regime are in green, whereas results for four different degrees of epistatic regimes (with increasing percentage of possible pairwise amino acids being in contact) are in blue, and results of four different rates of the Red-Queen are in red. Simulations were based on three initial sets of amino acid profiles, taken from SAMHD1 in A, TRIM5α in B, and BRCA1 in C.
Number of Replicates among sets of 100 Where (and )
| Samhd1-67-543 | Trim5 | Brca1-64-941 | |
|---|---|---|---|
| Nearly-neutral | 0 (0) | 1 (0) | 2 (0) |
| Epistatic | |||
| 1 | 0 (0) | 0 (0) | 0 (0) |
| 2 | 0 (0) | 0 (0) | 0 (0) |
| 3 | 0 (0) | 0 (0) | 3 (3) |
| 4 | 0 (0) | 3 (3) | 1 (1) |
| Adaptive | |||
| 25 | 95 (89) | 97 (95) | 96 (85) |
| 50 | 100 (100) | 100 (100) | 100 (100) |
| 75 | 100 (100) | 100 (100) | 100 (100) |
| 100 | 100 (100) | 100 (100) | 100 (100) |