| Literature DB >> 24665338 |
Wesley A Larson1, Lisa W Seeb1, Meredith V Everett1, Ryan K Waples1, William D Templin2, James E Seeb1.
Abstract
Recent advances in population genomics have made it possible to detect previously unidentified structure, obtain more accurate estimates of demographic parameters, and explore adaptive divergence, potentially revolutionizing the way genetic data are used to manage wild populations. Here, we identified 10 944 single-nucleotide polymorphisms using restriction-site-associated DNA (RAD) sequencing to explore population structure, demography, and adaptive divergence in five populations of Chinook salmon (Oncorhynchus tshawytscha) from western Alaska. Patterns of population structure were similar to those of past studies, but our ability to assign individuals back to their region of origin was greatly improved (>90% accuracy for all populations). We also calculated effective size with and without removing physically linked loci identified from a linkage map, a novel method for nonmodel organisms. Estimates of effective size were generally above 1000 and were biased downward when physically linked loci were not removed. Outlier tests based on genetic differentiation identified 733 loci and three genomic regions under putative selection. These markers and genomic regions are excellent candidates for future research and can be used to create high-resolution panels for genetic monitoring and population assignment. This work demonstrates the utility of genomic data to inform conservation in highly exploited species with shallow population structure.Entities:
Keywords: Chinook salmon; RAD sequencing; SNPs; effective population size; genetic stock identification; population genomics; western Alaska
Year: 2014 PMID: 24665338 PMCID: PMC3962296 DOI: 10.1111/eva.12128
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
Figure 1Map of sampling locations. See Table 1 for additional details about each sampling site.
Populations analyzed in this study with year sampled, sample size (N), observed heterozygosity (HO), and expected heterozygosity (HE).
| Sampling location | Region | Year | GPS coordinates | |||
|---|---|---|---|---|---|---|
| Tubutulik River | Norton Sound | 2009 | 64.740, −161.888 | 56 | 0.248 | 0.252 |
| Anvik River | Lower Yukon R | 2007 | 62.681, −160.214 | 54 | 0.260 | 0.261 |
| Kogrukluk River | Kuskokwim R | 2007 | 60.841, −157.846 | 57 | 0.251 | 0.258 |
| Koktuli River | Bristol Bay | 2010 | 59.935, −156.427 | 56 | 0.256 | 0.259 |
| Big Salmon River | Upper Yukon R | 2007 | 61.867, −134.917 | 47 | 0.232 | 0.232 |
Figure 2Individual-based principal component analysis for all populations and 10 944 SNPs. The five intermediate individuals from the Anvik River were removed from further analyses (see).
Estimates of effective population size (Ne) for five populations calculated with 1118 RAD-derived SNPs that were placed on the linkage map and 39 of the 43 SNPs that were in linkage equilibrium from Templin et al. (2011). Estimates with RAD SNPs are calculated using only comparisons between loci on different linkage groups (Ne linkage removed) and all comparisons (Ne all data). The ratio of effective population size to census size (Ne/N) and effective population size to census size multiplied by generation length (Ne/NG) for each population is also reported (G is generation length and N is an approximate value of yearly escapement for each population, see methods). The Ne used for these calculations is Ne linkage removed (column 2). We did not calculate Ne/N or Ne/NG for the Bristol Bay and upper Yukon populations because confidence intervals included infinity, suggesting our point estimates may not be completely representative.
| Population | G | Source of | Source of G | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Tubutulik River | 1909 (1295–3602) | 808 (674–1009) | Inf (174–Inf) | 5.43 | 3100 | 0.62 | 0.11 | Banducci et al. ( | Lingnau ( |
| Anvik River | 516 (451–604) | 505 (443–586) | 209 (65-Inf) | 5.48 | 1700 | 0.30 | 0.06 | Howard et al. ( | Sandone ( |
| Kogrukluk River | 2026 (1375–3825) | 1723 (1233–2842) | Inf (134–Inf) | 5.20 | 12 000 | 0.17 | 0.03 | Williams and Shelden ( | Howard et al. ( |
| Koktuli River | Inf (6055-Inf) | 26 071 (3733-Inf) | Inf (Inf-Inf) | 5.13 | 6000 | N/A | N/A | Woody ( | Howard et al. ( |
| Big Salmon River | 13 101 (1505-Inf) | 4243 (1806-Inf) | 520 (70-Inf) | 5.65 | 5000 | N/A | N/A | Mercer and Wilson ( | Howard et al. ( |
Pairwise FST values calculated using 10 944 SNPs and number of genomic regions that were under putative selection (in parentheses). All pairwise comparisons are significantly differentiated (P < 0.01).
| Tubutulik River | Anvik River | Kogrukluk River | Koktuli River | |
|---|---|---|---|---|
| Anvik River | 0.030 (20) | |||
| Kogrukluk River | 0.027 (20) | 0.005 (20) | ||
| Koktuli River | 0.028 (20) | 0.006 (23) | 0.003 (20) | |
| Big Salmon River | 0.098 (24) | 0.075 (25) | 0.075 (21) | 0.077 (25) |
Results from two amovas with 10 944 SNPs.
| Source of variation | d.f. | Percentage of variation |
|---|---|---|
| Among groups | 2 | 5.26 |
| Among populations within groups | 2 | 0.45 |
| Within populations | 529 | 94.32 |
| Among groups | 1 | 2.41 |
| Among populations within groups | 2 | 0.43 |
| Within populations | 436 | 97.18 |
Figure 3Individual-based principal component analysis for the Anvik River, Kogrukluk River, and Koktuli River populations using (A) 10 944 RAD SNPs and (B) 39 SNPs from Templin et al. (2011).
Results of leave-one-out tests for individual assignment with four SNP panels. Panels are: (1) 39 EST: 39 SNPs previously developed for Chinook salmon from expressed sequence tags (ESTs, Templin et al. 2011), (2) 39 RAD: 39 randomly chosen SNPs from the complete data set of 10 944 RAD SNPs, (3) 10 944 RAD: the complete data set of RAD SNPs, and (4) 10 211 RAD no outliers: the full set of RAD SNPs with the 733 outlier SNPs that were found to be under putative selection removed. Individuals were considered to be correctly assigned if the assignment probability to population of origin was higher than to any other population. See Table S3 for assignment probabilities for each individual.
| Regions | % Correct assignment | |||
|---|---|---|---|---|
| 39 EST | 39 RAD | 10 944 RAD | 10 211 RAD no outliers | |
| Tubutulik River | 67 | 65 | 100 | 100 |
| Anvik River | 46 | 30 | 91 | 89 |
| Kogrukluk River | 34 | 30 | 93 | 93 |
| Koktuli River | 29 | 30 | 98 | 95 |
| Big Salmon River | 96 | 87 | 100 | 100 |
Figure 4Regions of the genome under putative selection as inferred by pairwise FST across all population pairs. Each vertical line represents a linkage group, and the length of the line is proportional to the size of the linkage group in cM. Shaded areas indicate regions which are significantly diverged in at least one population pair indicating putative selection. The color of the shading corresponds to the number of significant pairwise population comparisons with red and purple indicating over half of the population pairs are divergent in the given region.