| Literature DB >> 27069794 |
Tara N Furstenau1, Reed A Cartwright1.
Abstract
Under models of isolation-by-distance, population structure is determined by the probability of identity-by-descent between pairs of genes according to the geographic distance between them. Well established analytical results indicate that the relationship between geographical and genetic distance depends mostly on the neighborhood size of the population which represents a standardized measure of gene flow. To test this prediction, we model local dispersal of haploid individuals on a two-dimensional landscape using seven dispersal kernels: Rayleigh, exponential, half-normal, triangular, gamma, Lomax and Pareto. When neighborhood size is held constant, the distributions produce similar patterns of isolation-by-distance, confirming predictions. Considering this, we propose that the triangular distribution is the appropriate null distribution for isolation-by-distance studies. Under the triangular distribution, dispersal is uniform over the neighborhood area which suggests that the common description of neighborhood size as a measure of an effective, local panmictic population is valid for popular families of dispersal distributions. We further show how to draw random variables from the triangular distribution efficiently and argue that it should be utilized in other studies in which computational efficiency is important.Entities:
Keywords: Correlograms; Fine scale; Identity-by-descent; Individual based; Kinship coefficients; Neighborhood size; Simulation; Triangular distribution
Year: 2016 PMID: 27069794 PMCID: PMC4824897 DOI: 10.7717/peerj.1848
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Estimated neighborhood sizes are similar across all dispersal distributions.
Estimates of allele diversity, , effective population density, , dispersal, s2, and neighborhood size. Neighborhood size is estimated two different ways. is where is estimated from . is twice the inverse of the slope of a and the log of distance. The expected neighborhood size (4πσ2⋅1) is 12.56, 28.28, 50.26, and 201.06 for σ = 1, 1.5, 2, and 4, respectively.
| Ray | 1.82 | 0.91 | 0.99 | 11.31 | 13.07 | 1.83 | 0.91 | 2.33 | 26.79 | 31.16 |
| Exp | 2.09 | 1.04 | 1.04 | 13.70 | 14.32 | 2.04 | 1.02 | 2.26 | 29.04 | 29.00 |
| Nor | 1.94 | 0.97 | 0.98 | 11.94 | 13.49 | 1.91 | 0.95 | 2.31 | 27.69 | 30.61 |
| Tri | 1.82 | 0.91 | 1.00 | 11.37 | 13.58 | 1.83 | 0.92 | 2.36 | 27.18 | 31.02 |
| Gam 1 | 2.07 | 1.04 | 1.05 | 13.63 | 14.41 | 2.01 | 1.00 | 2.32 | 29.22 | 30.07 |
| Gam 2 | 1.89 | 0.94 | 0.98 | 11.62 | 12.80 | 1.85 | 0.92 | 2.32 | 26.98 | 30.13 |
| Gam 4 | 1.83 | 0.92 | 1.00 | 11.49 | 12.88 | 1.87 | 0.94 | 2.32 | 27.31 | 28.27 |
| Gam 8 | 1.80 | 0.90 | 1.01 | 11.45 | 13.31 | 1.79 | 0.90 | 2.32 | 26.16 | 29.91 |
| Lom 2.4 | 2.97 | 1.49 | 1.06 | 19.70 | 13.41 | 2.62 | 1.31 | 2.16 | 35.53 | 26.65 |
| Lom 2.6 | 2.88 | 1.44 | 0.97 | 17.61 | 13.23 | 2.47 | 1.24 | 2.34 | 36.25 | 26.11 |
| Lom 2.8 | 2.73 | 1.36 | 1.04 | 17.78 | 12.82 | 2.41 | 1.21 | 2.22 | 33.66 | 25.30 |
| Lom 3 | 2.72 | 1.36 | 1.00 | 17.07 | 14.28 | 2.36 | 1.18 | 2.34 | 34.71 | 28.50 |
| Par 2.4 | 1.98 | 0.99 | 0.98 | 12.18 | 11.71 | 1.93 | 0.97 | 2.19 | 26.56 | 27.12 |
| Par 2.6 | 1.95 | 0.98 | 1.04 | 12.74 | 13.82 | 1.81 | 0.91 | 2.28 | 25.98 | 27.95 |
| Par 2.8 | 1.90 | 0.95 | 0.97 | 11.57 | 12.25 | 1.85 | 0.93 | 2.25 | 26.16 | 30.85 |
| Par 3 | 1.89 | 0.95 | 0.99 | 11.80 | 13.56 | 1.89 | 0.94 | 2.24 | 26.54 | 29.79 |
| Ray | 1.97 | 0.99 | 4.07 | 50.39 | 58.81 | 2.02 | 1.01 | 16.11 | 204.93 | 236.23 |
| Exp | 2.02 | 1.01 | 4.08 | 51.88 | 49.60 | 2.09 | 1.05 | 16.16 | 212.48 | 154.94 |
| Nor | 1.95 | 0.97 | 4.08 | 49.87 | 55.00 | 2.04 | 1.02 | 16.04 | 205.76 | 189.69 |
| Tri | 1.94 | 0.97 | 4.11 | 50.13 | 54.57 | 2.09 | 1.04 | 16.09 | 210.87 | 245.02 |
| Gam 1 | 2.03 | 1.01 | 4.06 | 51.74 | 52.25 | 2.16 | 1.08 | 16.15 | 218.67 | 257.28 |
| Gam 2 | 1.89 | 0.95 | 4.12 | 48.88 | 54.39 | 2.02 | 1.01 | 16.08 | 204.41 | 214.04 |
| Gam 4 | 1.94 | 0.97 | 4.08 | 49.80 | 55.60 | 1.98 | 0.99 | 15.94 | 197.97 | 191.02 |
| Gam 8 | 1.89 | 0.94 | 4.06 | 48.21 | 52.47 | 2.02 | 1.01 | 16.11 | 203.96 | 231.04 |
| Lom 2.4 | 2.48 | 1.24 | 3.98 | 62.01 | 47.94 | 2.19 | 1.09 | 16.06 | 220.82 | 180.03 |
| Lom 2.6 | 2.36 | 1.18 | 3.94 | 58.49 | 48.10 | 2.15 | 1.07 | 15.45 | 208.62 | 219.14 |
| Lom 2.8 | 2.27 | 1.13 | 4.16 | 59.23 | 51.08 | 2.14 | 1.07 | 15.81 | 212.44 | 241.05 |
| Lom 3 | 2.24 | 1.12 | 3.97 | 56.05 | 47.23 | 2.07 | 1.04 | 16.55 | 215.21 | 211.19 |
| Par 2.4 | 1.93 | 0.97 | 4.13 | 50.12 | 48.20 | 2.03 | 1.02 | 16.04 | 204.74 | 192.65 |
| Par 2.6 | 1.95 | 0.97 | 4.11 | 50.29 | 51.74 | 2.03 | 1.02 | 15.91 | 203.23 | 189.19 |
| Par 2.8 | 1.98 | 0.99 | 4.02 | 49.95 | 47.73 | 1.95 | 0.97 | 15.53 | 189.90 | 219.90 |
| Par 3 | 1.98 | 0.99 | 4.10 | 50.92 | 49.58 | 2.01 | 1.00 | 16.30 | 205.48 | 169.53 |
Figure 1Identity-by-descent is similar between different dispersal models.
Each plot shows the average probability of identity-by-descent for pairs of individuals in each distance class. Each panel represents simulations run with different σ parameters (gray box) for different groups of dispersal distributions.
Figure 2Kinship coefficients are similar between different dispersal models.
Each plot shows the average kinship coefficient for pairs of individuals over the log of the distance between them. Each panel represents simulations run with different σ parameters (gray box) for different groups of dispersal distributions. The gray dashed line is at zero so values above the line are more similar than the sample as a whole while values below the line are less similar than the population as a whole.
Figure 3Slopes of genetic differentiation are similar between different dispersal models.
Each plot shows the average differentiation, a, for pairs of individuals over the log of the distance between them. Each panel represents simulations run with different σ parameters (gray box) for different groups of dispersal distributions.
Figure 4Estimated neighborhood sizes are similar across all dispersal distributions.
Neighborhood size is estimated two different ways. (A) N is where is estimated from . The dot is the average from all populations samples and the bars represent the middle 50% of estimates from individual samples. (B) The slope estimates, , of a and the log of distance. The dots represent the slope estimate from the combined data from all samples and the bars represent the middle 50% of slopes from individual samples.
Triangular dispersal algorithm is the most efficient.
Execution time and relative time for 109 dispersal events from different dispersal functions ordered from most to least efficient.
| Dispersal function | CPU seconds | Relative time |
|---|---|---|
| Triangular | 21.853 | 1 |
| Rayleigh | 27.713 | 1.268 |
| Exponential | 106.434 | 4.870 |
| Half Normal | 106.771 | 4.886 |
| Gamma | 119.357 | 5.462 |
| Pareto | 127.218 | 5.822 |
| Lomax | 127.376 | 5.829 |