| Literature DB >> 32724541 |
Bin Li1,2, Sakiko Yaegashi2,3, Thaddeus M Carvajal2, Maribet Gamboa2, Ming-Chih Chiu2, Zongming Ren1, Kozo Watanabe2.
Abstract
Adaptive divergence is a key mechanism shaping the genetic variation of natural populations. A central question linking ecology with evolutionary biology is how spatial environmental heterogeneity can lead to adaptive divergence among local populations within a species. In this study, using a genome scan approach to detect candidate loci under selection, we examined adaptive divergence of the stream mayfly Ephemera strigata in the Natori River Basin in northeastern Japan. We applied a new machine-learning method (i.e., random forest) besides traditional distance-based redundancy analysis (dbRDA) to examine relationships between environmental factors and adaptive divergence at non-neutral loci. Spatial autocorrelation analysis based on neutral loci was employed to examine the dispersal ability of this species. We conclude the following: (a) E. strigata show altitudinal adaptive divergence among the populations in the Natori River Basin; (b) random forest showed higher resolution for detecting adaptive divergence than traditional statistical analysis; and (c) separating all markers into neutral and non-neutral loci could provide full insight into parameters such as genetic diversity, local adaptation, and dispersal ability.Entities:
Keywords: STRUCTURE; adaptive divergence; altitude; aquatic insect; local adaptation; random forest
Year: 2020 PMID: 32724541 PMCID: PMC7381564 DOI: 10.1002/ece3.6398
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
FIGURE 1Map of 11 sampling sites and photograph of species Ephemera strigata in the Natori River Basin in northeastern Japan
FIGURE 2Subpopulation structure of Ephemera strigata as determined using STRUCTURE with hierarchical iterations. Dashed boxes indicate subpopulations, and solid boxes indicate final populations. Numbers at the top of boxes indicate the number of individuals assigned to the populations. A total of 14 groups (K) were defined from 216 individuals
Genetic diversity and divergence measured using the following: (1) all loci, (2) only neutral loci, and (3) only non‐neutral loci
|
|
|
| |
|---|---|---|---|
| All loci | 0.1358 | 0.1357 | 0.029 |
| Neutral loci | 0.1173 | 0.1155 | 0.021 |
| Non‐neutral loci | 0.4379 | 0.3523 | 0.039 |
H = total expected heterozygosity; H = mean expected heterozygosity within sites; and F ST = Wright's fixation index among sites.
Sample size, AUC, OOB error rates, and key factors defined by random forest for each non‐neutral locus (sample size was shown with abundant category/rare category to show data imbalance)
| Locus | Sample size ( | AUC | OOB | Key factor |
|---|---|---|---|---|
| 56 | 202/14 | 0.85 | 5.12% | Altitude |
| 254 | 175/41 | 0.79 | 12.54% | Altitude |
| 89 | 199/17 | 0.74 | 11.48% | Altitude |
| 247 | 204/12 | 0.67 | 7.72% | River width |
| 36 | 182/34 | 0.52 | 13.43% | Stream order |
| 90 | 152/64 | 0.51 | 35.22% | Latitude |
| 98 | 174/42 | 0.51 | 22.78% | Latitude |
| 97 | 130/86 | 0.51 | 33.09% | Distance to river mouth |
| 260 | 185/31 | 0.50 | 15.41% | River width |
| 289 | 200/16 | 0.50 | 12.98% | River width |
FIGURE 3Relative importance of environmental variables based on the random forest model for three non‐neutral loci (56, 89, and 254)
FIGURE 4Distance‐based redundancy analysis (dbRDA) describing the influence of environmental heterogeneity on genetic variation at a non‐neutral locus (254)
FIGURE 5Isolation by distance calculated using geographic (a) and riverine (b) distance. Solid lines indicate correlations between Wright's fixation index (FST) and geographic (r = .11, p = .33) or riverine distance (r = .06, p = .49) calculated with the Mantel tests
FIGURE 6Spatial autocorrelation at 4‐km distance classes based on geographic distance for neutral loci. Dashed lines indicate permutated 95% confidence intervals, and error bars indicate jackknifed 95% confidence intervals. * indicates significant spatial autocorrelation (p < .05)