| Literature DB >> 26069460 |
M J Rivas1, S Domínguez-García1, A Carvajal-Rodríguez1.
Abstract
The study of local adaptation is a main focus of evolutionary biology since it may contribute to explain the current species diversity. The genomic scan procedures permit for the first time to study the connection between specific DNA patterns and processes as natural selection, genetic drift, recombination, mutation and gene flow. Accordingly, the information on genomes from non-model organisms increases and the interest on detecting the signal of natural selection in the DNA sequences of different populations also raises. The main goal of the present work is to explore a sequence-based method for detecting natural selection in divergent populations connected by migration. In doing so, we rely on a recently published statistic based upon th e definition of haplotype allelic classes (HAC). The original measure was modified to be more sensitive to intermediate frequencies in non-model species. A linkage-disequilibrium-based method was also assayed and individual-based simulations were performed to test the methods. The results suggest that the HAC-based methods and, specifically, the new proposed method are quite powerful for detecting the footprint of moderate divergent selection. They are also robust to reasonable model misspecification. One obvious advantage of the new algorithm is that it does not require knowledge of the allelic state.Entities:
Keywords: Detection of selection; Divergent populations; Gene flow; Local adaptation; Selective sweep; Single nucleotide polymorphism
Year: 2015 PMID: 26069460 PMCID: PMC4460224 DOI: 10.2174/1389202916666150313230943
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Parameter values in the simulations.
| N | t | s | θ | P |
|---|---|---|---|---|
| 1000 | 100 | 0 | 12 | 0 |
| 10000 | 500 | ± 0.15 | 60 | 4 |
| 5000 | 12 | |||
| 10000 | 60 |
N: Population size; t: number of generations; s: coefficient of selection; θ mutation rate, p: recombination rate.
Performance for the different selection detection methods in the long-term (t = 10,000) and weak selection (α = 600) cases. Data from population 1 (initial frequency of favored allele 1/1000). Values correspond to percentages of detection through replicates. Threshold for each test is between brackets.
| S | θ | ρ | %Svd | %SvdM | %Omega |
|---|---|---|---|---|---|
| 81 | 12 | 0 | 35 (1.43) | 27 (2.03) | 4.86 (8.42) |
| 82 | 12 | 4 | 70 (0.67) | 61 (0.86) | 4.60 (8.46) |
| 78 | 12 | 12 | 67 (0.45) | 69 (0.47) | 4.32 (9.40) |
| 411 | 60 | 0 | 44.5 (6.64) | 37.5 (9.55) | 19.84 (164.84) |
| 408 | 60 | 4 | 79 (3.51) | 71 (4.28) | 14.58 (212.08) |
| 374 | 60 | 60 | 63 (0.84) | 78 (0.57) | 23.95 (232.36) |
S: Window size in the selective case for the HAC-based methods. q: Mutation rate. r: Recombination rate. OmegaPlus conditions: Grid 1000; Minwin 1000; Maxwin 20000.
Performance for the HAC-based selection detection methods in the long-term (t = 10,000) and weak selection (α = 600) cases. Data from populations 1 and 2 joined (metapopulation scenario). Values correspond to percentages of detection through replicates. Threshold for each test is between brackets
| S | θ | ρ | %Svdmetapop | %SvdMmetapop |
|---|---|---|---|---|
| 103 | 12 | 0 | 19 (0.64) | 42 (1.0) |
| 102 | 12 | 4 | 40 (0.38) | 64 (0.54) |
| 97 | 12 | 12 | 50 (0.22) | 80 (0.29) |
| 523 | 60 | 0 | 11 (3.80) | 43 (5.83) |
| 511 | 60 | 4 | 39 (2.70) | 75 (2.02) |
| 449 | 60 | 60 | 69 (0.40) | 94 (0.40) |
S: Window size in the selective case. θ Mutation rate. p: Recombination rate.
Performance for the HAC-based selection detection methods in the very short-term (t = 100) and weak selection (α = 600) cases. Data from population 1 (initial frequency of favored allele 1/1000) or from the metapopulation (last column). Values correspond to percentages of detection through replicates. Threshold for each test is between brackets.
| S | θ | ρ | % Svd | %SvdM | %SvdMmetapop |
|---|---|---|---|---|---|
| 48 | 12 | 0 | 15 (0.22) | 54 (0.13) | 59 (0.08) |
| 50 | 12 | 4 | 41 (0.16) | 40 (0.19) | 37 (0.13) |
| 33 | 12 | 12 | 35 (0.11) | 26 (0.14) | 21 (0.10) |
| 104 | 60 | 0 | 14 (0.17) | 5 (0.26) | 9 (0.21) |
| 113 | 60 | 4 | 46 (0.18) | 59 (0.17) | 37 (0.13) |
| 182 | 60 | 60 | 45 (0.31) | 57 (0.27) | 60 (0.18) |
S: Window size in the selective case. q: Mutation rate. r: Recombination rate.
Performance for the HAC-based selection detection methods in the short-term (t = 500) and weak selection (α = 600) cases. Data from population 1 (initial frequency of favored allele 1/1000) or from the metapopulation (last column). Values correspond to percentages of detection through replicates. Threshold for each test is between brackets.
| S | θ | ρ | % Svd | %SvdM | %SvdMmetapop |
|---|---|---|---|---|---|
| 38 | 12 | 0 | 46 (0.17) | 5 (0.27) | 6 (0.28) |
| 31* | 12 | 4 | 8 (0.15) | 3 (0.21) | 3 (0.14) |
| 36 | 12 | 12 | 58 (0.15) | 36 (0.19) | 21 (0.16) |
| 125 | 60 | 0 | 43 (0.20) | 21 (0.22) | 20 (0.41) |
| 153 | 60 | 4 | 79 (0.15) | 70 (0.22) | 59 (0.18) |
| 165 | 60 | 60 | 69 (0.22) | 67 (0.19) | 61 (0.15) |
S: Window size in the selective case.α: Mutation rate.
p: Recombination rate. *: in the selective case only 37 replicates having a minimum of 25 SNPs.
Performance for the HAC-based selection detection methods in the short-term (t = 100-500) and strong selection (α = 6000) cases. Data from population 1 (initial frequency of favored allele 1/1000) or from the metapopulation (last column). Values correspond to percentages of detection through replicates. Threshold for each test is between brackets.
| S | t | θ | ρ | % Svd | %SvdM | %SvdMmetapop |
|---|---|---|---|---|---|---|
| 155 | 100 | 60 | 0 | 76 (0.09) | 81 (0.15) | 61 (0.10) |
| 73 | 500 | 60 | 0 | 28 (0.08)* | 15 (0.16)* | 14 (0.15) |
| 198 | 100 | 60 | 4 | 30(0.45) | 81.5 (0.19) | 82.5 (0.12) |
| 40 | 500 | 60 | 4 | 63 (0.038) | 65 (0.035) | 59 (0.07) |
| 194 | 100 | 60 | 60 | 30 (0.375) | 76 (0.19) | 78.5 (0.11) |
| 67 | 500 | 60 | 60 | 90 (0.14) | 96 (0.09) | 70 (0.13) |
S: Window size in the selective case. t: number of generations. θ: Mutation rate. p: Recombination rate. *: only 46 runs having a minimum of 25 SNPs.
Difference (Dsel) between the real position of the selective site and the localization of the maximum SvdM value. And the same difference in the neutral case (Dneu). Populations in equilibrium (t = 10,000), α = 600, p = 60.
| ρ |
|
|
|
|---|---|---|---|
| 0 | 0 | 0.448 | 0.471 |
| 4 | 0.400 | 0.469 | |
| 60 | 0.294 | 0.494 | |
| 0 | 0.01 | 0.467 | 0.460 |
| 4 | 0.413 | 0.462 | |
| 60 | 0.283 | 0.486 | |
| 0 | 0.1 | 0.354 | 0.374 |
| 4 | 0.301 | 0.370 | |
| 60 | 0.164 | 0.392 | |
| 0 | 0.25 | 0.219 | 0.221 |
| 4 | 0.164 | 0.220 | |
| 60 | 0.051 | 0.246 | |
| 0 | 0.5 | 0.042 | 0.028 |
| 4 | 0.009 | 0.030 | |
| 60 | 0.012 | 0.006 |