| Literature DB >> 22408722 |
Morten T Limborg, Scott M Blankenship, Sewall F Young, Fred M Utter, Lisa W Seeb, Mette H H Hansen, James E Seeb.
Abstract
Recent advances in molecular interrogation techniques now allow unprecedented genomic inference about the role of adaptive genetic divergence in wild populations. We used high-throughput genotyping to screen a genome-wide panel of 276 single nucleotide polymorphisms (SNPs) for the economically and culturally important salmonid Oncorhynchus mykiss. Samples included 805 individuals from 11 anadromous and resident populations from the northwestern United States and British Columbia, and represented two major lineages including paired populations of each life history within single drainages of each lineage. Overall patterns of variation affirmed clear distinctions between lineages and in most instances, isolation by distance within them. Evidence for divergent selection at eight candidate loci included significant landscape correlations, particularly with temperature. High diversity of two nonsynonymous mutations within the peptide-binding region of the major histocompatibility complex (MHC) class II (DAB) gene provided signatures of balancing selection. Weak signals for potential selection between sympatric resident and anadromous populations were revealed from genome scans and allele frequency comparisons. Our results suggest an important adaptive role for immune-related functions and present a large genomic resource for future studies.Entities:
Keywords: Candidate genes; Interleukin; Local adaptation; MHC; Salmonid; Steelhead
Year: 2012 PMID: 22408722 PMCID: PMC3297173 DOI: 10.1002/ece3.59
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Wild rainbow trout (Oncorhynchus mykiss) in their natural environment (Photo by Finn Sivebæk).
Figure 2Map of sampling locations. An approximate projection of the current divide between the inland and coastal lineages is shown by a thick broken line (From Behnke 1992).
Sample information and summary statistics. Sample size (n), expected (H) and observed heterozygosity (H), allelic richness (A), and percent polymorphic SNPs are given. Statistics are given for pooled samples for locations with temporal replicates
| Population name | ID | Year | Lineage | Life history | Percent polymorphic SNPs | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Sustut River | SUST96 | 1996 | Coastal | Anadromous | 50 | 0.20 | 0.20 | 1.62 | 70% |
| Sustut River | SUST97 | 1997 | Coastal | Anadromous | 45 | |||||
| 2 | Skagit River | SKAG | 2007 | Coastal | Anadromous | 59 | 0.25 | 0.24 | 1.80 | 89% |
| 3 | Skagit River | SKAGRES | 2009 | Coastal | Resident | 23 | 0.09 | 0.09 | 1.45 | 55% |
| 4 | Green River | GREEN | 2007 | Coastal | Anadromous | 35 | 0.22 | 0.22 | 1.73 | 80% |
| 5 | Sol Duc River | SOL | 2009 | Coastal | Anadromous | 94 | 0.23 | 0.23 | 1.76 | 92% |
| 6 | Chehalis River | CHEH | 2007 | Coastal | Anadromous | 95 | 0.22 | 0.21 | 1.69 | 82% |
| 7 | Deschutes River | DESCH | 1999 | Inland | Anadromous | 95 | 0.18 | 0.18 | 1.60 | 75% |
| 8 | Crooked Fork Creek | CROOK99 | 1999 | Inland | Anadromous | 48 | 0.18 | 0.18 | 1.59 | 82% |
| Crooked Fork Creek | CROOK01 | 2001 | Inland | Anadromous | 47 | |||||
| 9 | Twisp River | TWISP | 2008 | Inland | Anadromous | 81 | 0.19 | 0.18 | 1.65 | 85% |
| 10 | Twisp River | TWISPRES07 | 2007 | Inland | Resident | 25 | 0.20 | 0.18 | 1.71 | 86% |
| Twisp River | TWISPRES08 | 2008 | Inland | Resident | 13 | |||||
| 11 | Deadman Creek | DEADM97 | 1997 | Inland | Anadromous | 76 | 0.19 | 0.18 | 1.57 | 64% |
| Deadman Creek | DEADM99 | 1999 | Inland | Anadromous | 19 |
Appendix 1SNP exclusion pipeline. Ellipses isolate numbers of SNPs in the various categories. Solid arrows connect the stages of analysis, and broken arrows identify sets of SNPs that were excluded at each stage.
TaqMan assays for O. mykiss developed in the Washington Department of Fish and Wildlife (WDFW) Molecular Genetics Laboratory for genotyping steelhead. Gene targets were identified by BLASTX alignments in the Swiss-Prot database with E-values <1 and amino acid identities >25%
| Assay name | Gene target | Design strand | Primers | Dye label | Probe sequences | Allele calls | Ascertainment set |
|---|---|---|---|---|---|---|---|
| Omy_BAC-F5-284 | Unknown | F | ACAACGCCAACAACTTTCTCTTG | VIC | CAGTAGGGCGGCAAG | C | 1 |
| R | CCTCATTTACTGTAGGACCATGCA | FAM | ACAGTAGGACGGCAAG | T | |||
| Omy_Il-1b_-028 | Interleukin 1 beta_2 | F | ACTGTCTGGCTAGAGCACATTG | VIC | CTGAGGCAACTTTTGT | T | 2 |
| R | ATCTTCTACCACCGCACTGTTTTAA | FAM | TGAGGCAGCTTTTGT | C | |||
| Omy_UT11_2a-018 | Clock homolog | F | GCCTCCTGTGTATGAGTATGTTAAGTT | VIC | CTGAATAATGGTAAGTTTTC | T | 2 |
| R | GCCAGCTACTTGTGTACTACATTCA | FAM | TGAATAATGGTACGTTTTC | G | |||
| Omy_UT11_2a-114 | Clock homolog | F | GCTGGCCTGTATTTGGTTTGTG | VIC | TGGTCGGTCTAAGTGC | C | 2 |
| R | TTGGAGTTGAGTTAGGCACTATAGGA | FAM | TGGTCAGTCTAAGTGC | T | |||
| Omy_UT11_2a-132 | Clock homolog | F | GCTGGCCTGTATTTGGTTTGTG | VIC | AGGCAACAATGACTAATT | C | 2 |
| R | CCATTTGGAGTTGAGTTAGGCACTA | FAM | AAGGCAACAATTACTAATT | A | |||
| Omy_u07-79.166 | Unknown | F | CCCGCTATATTATTTGATCACCCTTGA | VIC | ACTTGGGAATACCCCAGCC | G | 1 |
| R | ATTTAAATCCATTTCTAAAAATAAGCAAACCTAACCA | FAM | CTTGGGAATAACCCAGCC | T | |||
| Omy_u09-52.284 | Unknown | F | TTTGTGTGTATTGTTGTGACTTG | VIC | ACTGCATTGTTGTAGCTAG | T | 1 |
| R | TGATGTTATTGCAGGTCTAGCGAAA | FAM | CTGCATTGTTGTCGCTAG | G | |||
| Omy_u09-53.469 | Unknown | F | ACAGCCTGAGCGTTTGCA | VIC | TTGCAGCCCTTATTGTG | T | 1 |
| R | GGAAACTGGGAGAGATCAAAGGA | FAM | TTGCAGCCCTTGTTGTG | C | |||
| Omy_09-55-233 | Unknown | F | CTCGTTTGATAGAAGAAACAAAGTGAAAGTG | VIC | AGCACTGACATCTGC | A | 1 |
| R | CCAACATCTTTGGGCCTAAACAAGA | FAM | AGCACTGACATCCGC | G | |||
| Omy_u09-56.073 | Unknown | F | CCCACTACATCCTCATCCAAGGT | VIC | AGCGGCATTCTC | C | 1 |
| R | CTCACTGCAAATCCAACTTCATCAT | FAM | CAGCGTCATTCTC | A | |||
| Omy_u09-56.119 | Unknown | F | CCAAGGTGGACCCACCAG | VIC | AGTGAGCTGAAACAGAGCA | T | 1 |
| R | GCTGAGTTTATAGGTCAGTCATTATACATATTGA | FAM | TGAGCTGAAGCAGAGCA | C | |||
| Omy_M09AAC-301 | Polymerase (DNA directed), | F | CATTGATGGTTATTGTCATGCGTTTCA | VIC | CTGACAGATTTTGAAGTCT | T | 1 |
| epsilon 2 (p59 subunit) | R | GCAGTAGAGATAGAAATTGAGCACACT | FAM | CTGACAGATTTTGACGTCT | G | ||
| Omy_M09AAF-098 | Exostosin like | F | CGGCGCCCGTCAGTA | VIC | CACAGGAGTAGTTAGAGTTA | T | 1 |
| R | CAACCACCAGTCATGGCTCTA | FAM | CACAGGAGTAGTTCGAGTTA | G | |||
| Omy_M09AAD-076 | Family with sequence | F | ACTGTTACCACTCTCTCATCAACCT | VIC | CACCAACCACTGGTGAA | T | 1 |
| similarity 203, member A | R | GGGTCCAGGAGGTTTTTAAACAACAT | FAM | CCAACCGCTGGTGAA | C | ||
| Omy_09AAH-172 | Rab-like protein 2A | F | GGCGTGAGCTTTGTGGTAGTA | VIC | AGGGATGCATACTCTG | T | 1 |
| R | GCTGGACAGGAGCGGTTC | FAM | AGGGATGCATGCTCTG | C | |||
| Omy_09AAI-092 | Ubiquitin-like modifier | F | GGGCCTTGCTTTGCTTCTG | VIC | ATCTACTGAGTCTCTGCTGC | G | 1 |
| activating enzyme 3 | R | ACGGCTTAGATCCTCCTTGGA | FAM | CTACTGAGTGTCTGCTGC | C |
Ascertainment set 1 included steelhead from Kamchatka, Russia (n = 3), Alaska, USA (n = 3), British Columbia, Canada (n = 3), Washington, USA (n = 180). Ascertainment set 2 included rainbow trout from Kamchatka, Russia (n = 10), Alaska, USA (n = 10), Washington, USA (n = 70), and cutthroat trout from Montana, USA (n = 10).
Tested environmental variables and data sources
| Environmental variable | SUST | SKAG | GREEN | SOL | CHEH | DESCH | CROOK | TWISP | DEADM |
|---|---|---|---|---|---|---|---|---|---|
| Precipitation (mm) | 965 | 873 | 1082 | 2501 | 1413 | 279 | 979 | 395 | 263 |
| Maximum temperature (°C) | 21.0 | 22.7 | 24.6 | 20.0 | 24.8 | 32.2 | 28.0 | 29.5 | 26.2 |
| Minimum temperature (°C) | –13.3 | 0.9 | 1.7 | 1.9 | 1.0 | –3.5 | –8.7 | –9.6 | –9.2 |
| Elevation (m) | 1352 | 9 | 22 | 9 | 28 | 397 | 1049 | 494 | 336 |
| Latitude | 56.58 | 48.44 | 47.29 | 47.91 | 46.80 | 44.82 | 46.51 | 48.37 | 50.74 |
| Longitude | –126.45 | –122.34 | –122.17 | –124.54 | –123.17 | –121.09 | –114.68 | –120.14 | –120.92 |
Sources:
Precipitation and temperature data for the BC samples obtained from http://www.genetics.forestry.ubc.ca/cfcg/climate-models.html
http://www.prism.oregonstate.edu/products/matrix.phtml?vartype=ppt&view=maps.
http://www.prism.oregonstate.edu/products/matrix.phtml?vartype=tmax&view=maps.
http://www.prism.oregonstate.edu/products/matrix.phtml?vartype=tmin&view=maps.
Obtained from Google Earth using coordinates.
Figure 3Multi dimensional scaling (MDS) plot showing: (a) spatial population structure for all populations including the two resident populations, and (b) a similar plot without the SKAGRES population. Population symbols follow Figure 2.
Figure 4Outlier tests for identifying signatures of selection. (a) FST-based global test assuming hierarchical structure by grouping all anadromous populations within each lineage into two major groups. (b) FCT-based global test assuming hierarchical structure as in (a). Local outlier tests for the Skagit River (c) and Twisp River (d) anadromous and resident population pairs are also shown. All outliers above the 99% confidence threshold are labeled including two interleukin genes at the 95% threshold (c and d) shown in italic. Plotted heterozygosity values are scaled by estimates of within population heterozygosity (h0) and locus specific FST as: (H1 = h0/[1 –FST]) as described in Excoffier et al. (2009).
Figure 5Frequency plots of major allele frequencies for loci detected as outliers (P < 0.01) in the global genome scan including nine anadromous populations. (a) Allele frequencies for five outliers detected at both the FST and FCT level. (b) Allele frequencies for three outliers only detected at the FST level.
Figure 6Frequency plots of major allele frequencies for loci detected as outliers (P < 0.01) between sympatric and resident population pairs. (a) Allele frequencies for six markers detected as outliers between the Twisp River populations. (b) Allele frequencies for one outlier detected between the populations within the Skagit River. Arrows denote populations included in the local genome scans.
Results from Bayesian inference of locus-specific landscape correlations. Gray cells denote a locus–parameter relationship with a log10 (BF) between 1.3 and 2.0, which can be interpreted as a P-value between 0.01 and 0.05. Black cells represent decisive relationships with log10 (BF) > 2.0 or equivalent P-values below 0.01. Here, global outliers include loci at the 5% significance level (see Table S1, Supporting information)
Appendix 4Frequency plots of major allele frequencies for five MHC-related SNPs. Omy_DAB_431 and Omy_DABb are nonsynonymous mutations in the peptide-binding region of a MHC class II gene, while Omy_DABc represents a synonymous mutation in the same gene. Omy_UBA3a and Omy_UBA3b show allele frequencies for two SNPs residing within a MHC class I gene.
For each population, number of reconstructed haplotypes, observed, and expected levels of homozygosity (F value) are given with results from the Ewens–Watterson homozygosity test for deviation from neutrality at an MHC class II gene (see text for more details). P-values below 0.05 are highlighted in bold, and P-values between 0.05 and 0.10 are shown in italic
| Population | No. of haplotypes | Observed | Expected | |
|---|---|---|---|---|
| CHEH | 6 | 0.289 | 0.467 | 0.104 |
| CROOK | 6 | 0.295 | 0.468 | 0.113 |
| DEADM | 5 | 0.317 | 0.527 | |
| DESCH | 5 | 0.255 | 0.530 | |
| GREEN | 5 | 0.328 | 0.463 | 0.187 |
| SKAG | 5 | 0.313 | 0.500 | |
| SKAGRES | 2 | 0.841 | 0.777 | 0.569 |
| SOL | 6 | 0.277 | 0.467 | |
| SUST | 4 | 0.500 | 0.606 | 0.363 |
| TWISP | 6 | 0.248 | 0.457 | |
| TWISPRES | 6 | 0.312 | 0.409 | 0.274 |
| Mean | 5.1 | 0.361 | 0.516 | 0.173 |
| SD | 1.2 | 0.173 | 0.101 | 0.168 |