| Literature DB >> 30622641 |
Maria E López1,2, Laura Benestan3, Jean-Sebastien Moore3, Charles Perrier4, John Gilbey5, Alex Di Genova6, Alejandro Maass6, Diego Diaz6, Jean-Paul Lhorente7, Katharina Correa7, Roberto Neira2, Louis Bernatchez3, José M Yáñez1,7,8.
Abstract
Selective breeding and genetic improvement have left detectable signatures on the genomes of domestic species. The elucidation of such signatures is fundamental for detecting genomic regions of biological relevance to domestication and improving management practices. In aquaculture, domestication was carried out independently in different locations worldwide, which provides opportunities to study the parallel effects of domestication on the genome of individuals that have been selected for similar traits. In this study, we aimed to detect potential genomic signatures of domestication in two independent pairs of wild/domesticated Atlantic salmon populations of Canadian and Scottish origins, respectively. Putative genomic regions under divergent selection were investigated using a 200K SNP array by combining three different statistical methods based either on allele frequencies (LFMM, Bayescan) or haplotype differentiation (Rsb). We identified 337 and 270 SNPs potentially under divergent selection in wild and hatchery populations of Canadian and Scottish origins, respectively. We observed little overlap between results obtained from different statistical methods, highlighting the need to test complementary approaches for detecting a broad range of genomic footprints of selection. The vast majority of the outliers detected were population-specific but we found four candidate genes that were shared between the populations. We propose that these candidate genes may play a role in the parallel process of domestication. Overall, our results suggest that genetic drift may have override the effect of artificial selection and/or point toward a different genetic basis underlying the expression of similar traits in different domesticated strains. Finally, it is likely that domestication may predominantly target polygenic traits (e.g., growth) such that its genomic impact might be more difficult to detect with methods assuming selective sweeps.Entities:
Keywords: Salmo salar; selective sweeps; single nucleotide polymorphisms
Year: 2018 PMID: 30622641 PMCID: PMC6304691 DOI: 10.1111/eva.12689
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
Atlantic salmon populations analyzed in this study with geographical origin and sample size (n)
| Population code | Geographical origin |
|
| % | Observations |
|---|---|---|---|---|---|
| Can‐W | Canada | 46 | 44 | 95.7 | Wild population from Saint‐Jean River, Québec, Canada |
| Can‐D | Canada | 40 | 37 | 92.5 | Farmed population selected for fast growth rate and characterized for late sexual maturity |
| Sct‐W | Scotland | 47 | 41 | 87.2 | Wild population from the East Coast of Scotland. |
| Sct‐D | Scotland | 50 | 43 | 86 | Farmed population selected for fast growth rate and characterized for early sexual maturity |
aInitial number of individuals. bNumber of individuals passing sample call rate of 90%. cPercentage of individuals passing sample call rate of 90%.
Figure 1Population genetic structure. (a) Principal components analysis (PCA) of genetic differentiation among the 165 individuals, based on 55,406 polymorphic markers. Each point represents one individual, and different colors represent populations: Canadian domestic = dark green, Canadian wild = light green, Scottish domestic = dark blue, and Scottish wild = light blue. (b) Admixture analysis of the Atlantic salmon populations. The analysis was performed using the program ADMIXTURE with K = 4, based on 55,406 polymorphic markers. Each color represents a different theoretical ancestral population and each vertical line represent a single individual
Figure 2Venn diagrams showing shared SNPs with evidence of selection among three independent tests in Canada populations (a) and Scottish populations (b). Also shown is the overlap of identified loci between both Canada and Scotland across all three tests (BayeScan, LFMM, Rsb [c])
Figure 3Manhattan plots showing the results of the BayeScan, LFMM, and Rsb outlier analyses applied across the domesticated strain/wild population pairs for Canada (Can; a–c) and Scotland (Sct; d–f). Salmon linkage groups were mapped are indicated on the x‐axis. The dashed red line indicates the significant threshold applied to detect the outlier SNPs from the entire dataset. In BAYESCAN analysis this line represents the q‐value threshold at log10(0.05) = 1.32, which corresponds to Bayes factor of 10. In LFMM, the threshold was set at z‐score = 10, while Rsb significant threshold was set at −log10 one‐tailed p‐value = 3. Green dots in Canadian populations are the SNP common between BAYESCAN and LFMM analysis. In the case of Scottish population, blue dots are loci shared between BAYESCAN and Rsb
Results of Bayescan analysis showing SNPs identified by chromosome
| Chromosome | Can‐D/Can‐W | Sct‐D/Sct‐W | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Initial SNPs number | Strong | Very strong | Decisive | Total | Initial SNPs number | Strong | Very strong | Decisive | Total | |
| Ssa01 | 4,649 | 7 | 13 | 22 | 42 | 10,510 | 13 | 8 | 9 | 30 |
| Ssa02 | 1,687 | 3,341 | 2 | 2 | ||||||
| Ssa03 | 1,821 | 4 | 3 | 8 | 15 | 5,571 | 2 | 2 | ||
| Ssa04 | 1,839 | 2 | 5 | 20 | 27 | 3,954 | 4 | 5 | 1 | 10 |
| Ssa05 | 2,658 | 1 | 1 | 2 | 5,237 | 1 | 1 | |||
| Ssa06 | 1,759 | 1 | 1 | 4,672 | ||||||
| Ssa07 | 1,448 | 1 | 1 | 3,836 | ||||||
| Ssa08 | 525 | 2 | 2 | 1,243 | ||||||
| Ssa09 | 4,166 | 2 | 1 | 3 | 7,981 | 3 | 2 | 2 | 7 | |
| Ssa10 | 3,626 | 1 | 1 | 6,882 | 2 | 5 | 2 | 9 | ||
| Ssa11 | 2,590 | 2 | 1 | 3 | 5,400 | 4 | 4 | 1 | 9 | |
| Ssa12 | 3,027 | 5,440 | 2 | 1 | 2 | 5 | ||||
| Ssa13 | 2,902 | 2 | 6 | 5 | 13 | 5,811 | 1 | 2 | 1 | 4 |
| Ssa14 | 2,683 | 1 | 1 | 2 | 5,583 | 8 | 5 | 2 | 15 | |
| Ssa15 | 2,674 | 1 | 4 | 5 | 10 | 6,150 | 1 | 1 | ||
| Ssa16 | 2,547 | 2 | 2 | 4,669 | 4 | 5 | 1 | 10 | ||
| Ssa17 | 1,501 | 1 | 1 | 2,696 | 3 | 3 | ||||
| Ssa18 | 1,954 | 4 | 3 | 7 | 4,037 | 5 | 8 | 3 | 16 | |
| Ssa19 | 2,269 | 4,625 | 1 | 1 | ||||||
| Ssa20 | 2,581 | 3 | 1 | 4 | 5,022 | 2 | 3 | 5 | ||
| Ssa21 | 1,172 | 2 | 9 | 4 | 15 | 2,894 | 2 | 4 | 6 | |
| Ssa22 | 2,076 | 3,921 | ||||||||
| Ssa23 | 1,369 | 1 | 1 | 2 | 2,995 | 1 | 1 | 2 | 4 | |
| Ssa24 | 1,384 | 5 | 5 | 2,982 | 1 | 1 | 1 | 3 | ||
| Ssa25 | 1,569 | 1 | 1 | 3,316 | 9 | 13 | 16 | 38 | ||
| Ssa26 | 765 | 2,003 | 2 | 2 | ||||||
| Ssa27 | 1,014 | 2,873 | 2 | 1 | 1 | 4 | ||||
| Ssa28 | 744 | 1 | 1 | 2,042 | 5 | 5 | ||||
| Ssa29 | 1,307 | 2,982 | 1 | 1 | ||||||
| Unknown | 893 | 1,918 | 4 | |||||||
| Total | 61,199 | 31 | 58 | 71 | 160 | 130,586 | 64 | 79 | 54 | 197 |
Results of LFMM and Rsb Analyses SNPs identified by chromosome
| Chromosome | Can‐D/Can‐W | Sct‐D/Sct‐W | ||||
|---|---|---|---|---|---|---|
| Initial SNPs number | LFMM | Rsb | Initial SNPs number | LFMM | Rsb | |
| Ssa01 | 4,649 | 15 | 16 | 10,510 | 6 | |
| Ssa02 | 1,687 | 1 | 3,341 | |||
| Ssa03 | 1,821 | 11 | 21 | 5,571 | ||
| Ssa04 | 1,839 | 14 | 1 | 3,954 | 1 | 53 |
| Ssa05 | 2,658 | 2 | 4 | 5,237 | ||
| Ssa06 | 1,759 | 6 | 1 | 4,672 | 3 | |
| Ssa07 | 1,448 | 2 | 3,836 | |||
| Ssa08 | 525 | 4 | 1,243 | |||
| Ssa09 | 4,166 | 7 | 12 | 7,981 | 1 | |
| Ssa10 | 3,626 | 2 | 6,882 | |||
| Ssa11 | 2,590 | 3 | 5,400 | |||
| Ssa12 | 3,027 | 3 | 1 | 5,440 | 2 | |
| Ssa13 | 2,902 | 4 | 5,811 | 1 | ||
| Ssa14 | 2,683 | 4 | 5,583 | 1 | ||
| Ssa15 | 2,674 | 4 | 6,150 | |||
| Ssa16 | 2,547 | 6 | 4,669 | 2 | ||
| Ssa17 | 1,501 | 1 | 2,696 | |||
| Ssa18 | 1,954 | 1 | 1 | 4,037 | ||
| Ssa19 | 2,269 | 3 | 4,625 | 1 | ||
| Ssa20 | 2,581 | 2 | 5,022 | 1 | ||
| Ssa21 | 1,172 | 2 | 2,894 | |||
| Ssa22 | 2,076 | 7 | 1 | 3,921 | ||
| Ssa23 | 1,369 | 3 | 2 | 2,995 | ||
| Ssa24 | 1,384 | 1 | 1 | 2,982 | 2 | |
| Ssa25 | 1,569 | 3 | 3,316 | 1 | ||
| Ssa26 | 765 | 2 | 2,003 | |||
| Ssa27 | 1,014 | 2 | 14 | 2,873 | 1 | |
| Ssa28 | 744 | 1 | 2,042 | |||
| Ssa29 | 1,307 | 1 | 2,982 | |||
| Unknown | 893 | 2 | 1,918 | |||
| Total | 61,199 | 115 | 79 | 130,586 | 9 | 67 |
Figure 4Kernel‐smoothing calculated in 500‐kb with steps of 50 kb. (a) Pattern of heterozygosity in domestic (blue) and wild (green) Canadian populations. (b) Pattern of between domestic/wild Canadian populations. (c) Pattern of heterozygosity in domestic (blue) and wild (green) Scottish populations. (d) Pattern of between domestic/wild Scottish populations
Figure 5Distribution of the number total of outliers along the Atlantic salmon chromosome considering Canadian (CAN in black) and Scottish population (SCT)
Summary of genomic regions under selection and associated genes in Canadian populations
| Chromosome | Position SNPs | Test | Gene | Protein name |
|---|---|---|---|---|
| Ssa01 | 118333490 | RSB | LOC106563782 | PREDICTED: unconventional myosin‐Vb‐like |
| Ssa01 | 119321920 | LFMM_BYS | opn4x1b2 | Melanopsin |
| Ssa01 | 119472753 | LFMM_BYS | gdnf | Glial cell‐derived neurotrophic factor |
| Ssa01 |
120337869 | BYS | LOC106564357 | PREDICTED: polypeptide N‐acetylgalactosaminyltransferase 9‐like |
| Ssa03 |
35341493 | RSB | LOC106600280 | PREDICTED: matrix metalloproteinase‐14‐like |
| Ssa03 | 58772401 | LFMM_BYS | LOC106600794 | PREDICTED: supervillin‐like |
| Ssa03 | 58814947 | LFMM_BYS | LOC106600792 | PREDICTED: uncharacterized protein LOC106600792 |
| Ssa04 | 1988842 | LFMM_BYS | LOC106602233 | PREDICTED: uncharacterized protein LOC106602233 |
| Ssa04 |
2491374 | BYS | LOC106602228 | PREDICTED: dual specificity testis‐specific protein kinase 2‐like |
| Ssa04 | 41701375 | RSB | LOC106603279 | PREDICTED: unconventional myosin‐XVIIIa‐like |
| Ssa04 | 54112595 | LFMM | LOC106603471 | PREDICTED: collagen alpha‐1(XXVI) chain‐like |
| Ssa06 | 37003521 | LFMM | LOC106607315 | PREDICTED: unconventional myosin‐X‐like |
| Ssa08 | 19722947 | LFMM_BYS | LOC106610360 | PREDICTED: TBC domain‐containing protein kinase‐like protein |
| Ssa17 | 54473681 | LFMM_BYS | LOC106576598 | PREDICTED: plexin‐B2‐like |
| Ssa18 | 13286009 | RSB | coda1 | Collagen alpha‐1XIII chain |
| Ssa18 | 62851227 | BYS | LOC106577990 | PREDICTED: E3 ubiquitin‐protein ligase TRIM39‐like |
| Ssa20 | 15597033 | BYS | LOC106580056 | PREDICTED: transient receptor potential cation channel subfamily M member 3‐like |
| Ssa22 |
29365659 | LFMM | LOC106583145 | PREDICTED: collagen alpha‐1(VII) chain‐like |
| Ssa24 | 12213523 | BYS | LOC106585083 | PREDICTED: autism susceptibility gene 2 protein‐like |
| Ssa25 | 19817836 | LFMM | ube2f | Ubiquitin‐conjugating enzyme E2F (putative) |
| Ssa27 |
10232307 | RSB | brd2 | Bromodomain containing 2 |
| Ssa27 | 10251922 | RSB | LOC106588396 | PREDICTED: collagen alpha‐2(XI) chain‐like |
Summary of genomic regions under selection and associated genes in Scottish populations
| Chromosome | Position SNPs | Test | Gene | Protein name |
|---|---|---|---|---|
| Ssa01 |
48495280 | BYS | LOC106604517 | PREDICTED: myopalladin‐like |
| Ssa03 | 50571649 | BYS | LOC106600700 | PREDICTED: transient receptor potential cation channel subfamily M member 4‐like |
| Ssa04 | 44415367 | BYS_RSB | LOC106603326 | PREDICTED: SH3 domain‐binding glutamic acid‐rich‐like protein |
| Ssa04 |
44432033 | BYS_RSB | LOC106603328 | PREDICTED: bromodomain and WD repeat‐containing protein 3‐like |
| Ssa04 |
45658450 | RSB | LOC106603377 | PREDICTED: long‐chain fatty acid–CoA ligase 4‐like |
| Ssa04 |
47256953 | RSB | LOC106603586 | PREDICTED: E3 ubiquitin‐protein ligase Siah2 |
| Ssa04 |
51300396 | RSB | LOC106603511 | PREDICTED: active breakpoint cluster region‐related protein‐like |
| Ssa10 |
16255371 | BYS | LOC106613666 | PREDICTED: unconventional myosin‐IXb‐like |
| Ssa14 | 47278390 | BYS | tekt2 | Tektin 2 (testicular) |
| Ssa14 | 65526017 | BYS | LOC106570217 | PREDICTED: collagen alpha‐1(IX) chain‐like |
| Ssa14 |
65541964 | BYS | LOC106570216 | PREDICTED: collagen alpha‐1(XXII) chain‐like |
| Ssa14 |
79347457 | BYS | LOC106570512 | PREDICTED: collagen alpha‐2(IX) chain‐like |
| Ssa18 |
13282526 | BYS | coda1 | Collagen alpha‐1XIII chain |
| Ssa18 | 47314912 | BYS | LOC106577539 | PREDICTED: neurexin‐2‐like |
| Ssa20 | 15586930 | BYS | LOC106580056 | PREDICTED: transient receptor potential cation channel subfamily M member 3‐like |
| Ssa24 |
12125145 | RSB | LOC106585083 | PREDICTED: autism susceptibility gene 2 protein‐like |
| Ssa25 | 19817836 | BYS | ube2f | Ubiquitin‐conjugating enzyme E2F (putative) |
| Ssa25 |
32313916 | BYS | mspd2 | Motile sperm domain‐containing protein 2 |
| Ssa27 | 18583143 | BYS | LOC106588644 | PREDICTED: zona pellucida sperm‐binding protein 3‐like |