| Literature DB >> 29928295 |
Charles D Waters1, Jeffrey J Hard2, Marine S O Brieuc1,3, David E Fast4, Kenneth I Warheit5, Curtis M Knudsen6, William J Bosch4, Kerry A Naish1.
Abstract
A novel application of genomewide association analyses is to use trait-associated loci to monitor the effects of conservation strategies on potentially adaptive genetic variation. Comparisons of fitness between captive- and wild-origin individuals, for example, do not reveal how captive rearing affects genetic variation underlying fitness traits or which traits are most susceptible to domestication selection. Here, we used data collected across four generations to identify loci associated with six traits in adult Chinook salmon (Oncorhynchus tshawytscha) and then determined how two alternative management approaches for captive rearing affected variation at these loci. Loci associated with date of return to freshwater spawning grounds (return timing), length and weight at return, age at maturity, spawn timing, and daily growth coefficient were identified using 9108 restriction site-associated markers and random forest, an approach suitable for polygenic traits. Mapping of trait-associated loci, gene annotations, and integration of results across multiple studies revealed candidate regions involved in several fitness-related traits. Genotypes at trait-associated loci were then compared between two hatchery populations that were derived from the same source but are now managed as separate lines, one integrated with and one segregated from the wild population. While no broad-scale change was detected across four generations, there were numerous regions where trait-associated loci overlapped with signatures of adaptive divergence previously identified in the two lines. Many regions, primarily with loci linked to return and spawn timing, were either unique to or more divergent in the segregated line, suggesting that these traits may be responding to domestication selection. This study is one of the first to utilize genomic approaches to demonstrate the effectiveness of a conservation strategy, managed gene flow, on trait-associated-and potentially adaptive-loci. The results will promote the development of trait-specific tools to better monitor genetic change in captive and wild populations.Entities:
Keywords: captive rearing; conservation; domestication selection; genomewide association analysis; managed gene flow; random forest
Year: 2018 PMID: 29928295 PMCID: PMC5999212 DOI: 10.1111/eva.12599
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
Figure 1Schematic illustrating the initiation (the founding P 1 generation) and subsequent propagation (F 1–F 4 generations) of the integrated and segregated hatchery lines of anadromous Chinook salmon at the Cle Elum Supplementation and Research Facility (modified from Waters et al., 2015). Each box denotes the number of spawners (wild environment) and the number of broodstock (hatchery environment) for each year surveyed. Linear arrows indicate the contribution of wild spawners or hatchery broodstock to the subsequent generation. Circular arrows represent unobserved mating between wild‐born (unmarked) and hatchery‐born (marked) spawners in the wild environment. Fish from the two lines are differentially marked, so only hatchery‐born fish from the integrated line are permitted to spawn in the wild. Dark gray boxes represent wild adults, light gray boxes represent natural‐origin adults with hatchery, wild, or hybrid ancestry, and white boxes represent adults born in the hatchery
Individual traits measured in adult Chinook salmon returning to the upper Yakima River. Fish are measured at the Roza Dam Adult Monitoring Facility (RAMF) and again if used as broodstock at the Cle Elum Supplementation and Research Facility (CESRF). Measurements from RAMF were used in the analyses for fork length and weight
| Trait category | Traits (units) | Locations measured | |
|---|---|---|---|
| Life history | Age at maturity (years) | RAMF | CESRF |
| Return timing (day of year) | RAMF | ||
| Spawn timing (day of year) | CESRF | ||
| Morphometric | Fork length (cm) | RAMF | CESRF |
| Weight (kg) | RAMF | CESRF | |
| Growth | Daily growth coefficient (no units) | RAMF to CESRF | |
Results of random forest association analyses for six phenotypic traits. For each trait, the number of individuals analyzed, the total number of predictor loci identified by the backward purging approach and the number of those that were mapped, and the percent trait variation explained by the predictor loci are given
| Trait | Individuals | Predictor loci (mapped) | Percent variation explained |
|---|---|---|---|
| Age at maturity | 383 | 30 (20) | 24.9 |
| Return timing | 383 | 26 (15) | 29.2 |
| Spawn timing | 379 | 68 (36) | 26.7 |
| Fork length | 380 | 44 (20) | 32.0 |
| Weight | 381 | 37 (15) | 31.5 |
| Daily growth coefficient | 372 | 35 (20) | 31.8 |
Denotes the out‐of‐bag classification error rate for age at maturity, rather than variation explained, as the trait was analyzed using classification trees. Here, classification error rate refers to the percentage of hold‐out samples whose ages were misclassified using the tree.
Figure 2Graphical representation of four Chinook salmon chromosomes (center panels, a–d) showing the map positions (cM) of loci associated with six fitness‐related traits, as identified by random forest analyses. Loci associated with different traits mapped to the same regions, including loci on Ots01 and Ots08 that were associated with both fork length and weight (highlighted in yellow). Divergence (F ST) of the F 3 INT and F 3 SEG hatchery lines when compared to the P 1 founders is displayed in the left and right panels of each figure, respectively. The F 3 generation is shown because it is the most recent hatchery generation for which there are relatively large sample sizes (>50), and thus has greater statistical support for all outlier tests. The black line denotes the moving average of F ST across the chromosome, with regions exhibiting significant levels of divergence (i.e., outlier regions) from the P 1 Founders in red (Waters et al., 2015, 2017). The centromere of each chromosome is shaded with diagonal black lines. Black circles represent outlier loci previously identified by F TEMP and Bayescan, blue triangles correspond to trait‐associated loci, and gray points are all other study loci. Locations where trait‐associated loci are in close proximity to outlier loci or regions are marked with black arrows, including one outlier locus on Ots10 that was also associated with spawn timing (circled)
Sample sizes from each hatchery line, regression coefficients (standard errors) of terms, and test statistics with p‐values based on linear or generalized linear models for each phenotypic trait. Significant p‐values are in bold font. The reference level for the intercept is integrated females. Due to the form of the models, coefficients for age at maturity and weight were exponentiated (except for βfork length) and refer to the proportionate response compared to the reference level. βfork length is the allometric coefficient
| Age at maturity (years) | Return timing (day of year) | Fork length (cm) | Weight (kg) | Spawn timing (day of year) | DGC | |
|---|---|---|---|---|---|---|
|
| 1135 | 984 | 984 | 984 | 60 | 60 |
|
| 247 | 200 | 200 | 200 | 52 | 48 |
| βIntercept | 4.00 (0.078) | 156.57 (0.77) | 72.34 (0.17) | 1.28e−05 (1.98e−06) | 265.75 (1.01) | −0.05 (0.003) |
| βLine | 1.00 (0.047) | 3.35 (1.86) | −0.70 (0.41) | 0.98 (0.01) | −9.26 (1.36) | 0.01 (0.004) |
| βSex | 0.92 (0.028) | −0.87 (1.33) | 0.44 (0.30) | 0.99 (0.005) | −1.46 (1.47) | −0.03 (0.004) |
| βLine*Sex | 0.98 (0.071) | 7.34 (3.25) |
0.74 (0.72) | 0.98 (0.01) | −0.79 (2.35) | 0.02 (0.007) |
| βfork length | N/A | N/A | N/A | 2.98 (0.03) | N/A | N/A |
Figure 3Loci and regions of chromosome Ots12 showing signatures of adaptive divergence based on measures of pairwise F ST between each generation of each line and the P 1 founders. The results are given for the integrated (top panel) and segregated (bottom panel) hatchery lines through the F 1, F 2, F 3, and F 4 generations. Blue squares are loci that were identified as outliers by Bayescan (Foll & Gaggiotti, 2008) and orange triangles are outliers identified by F TEMP, a method designed to detect selection in a single population over time (Therkildsen et al., 2013). The red line represents the kernel‐smoothed moving average of F ST, and the gray shaded area is the 95% confidence interval. Genomic regions exhibiting significant levels of divergence (i.e., outlier regions) from the P 1 founders occur where the moving average of F ST exceeds the 95% confidence intervals. The centromere of the chromosome is shaded with diagonal black lines. The black circle designates a locus predictive of return timing, Ot005185_Ots12p, which was also identified as an outlier by Bayescan and, in the segregated line, by F TEMP. Negative F ST values occur due to finite sample sizes and slight variance in sample sizes between populations (Weir & Cockerham, 1984)