| Literature DB >> 32318091 |
Marina Naval-Sanchez1, Sean McWilliam1, Bradley Evans2, José M Yáñez3, Ross D Houston4, James W Kijas1.
Abstract
The introduction of wild Atlantic salmon into captivity, and their subsequent artificial selection for production traits, has caused phenotypic differences between domesticated fish and their wild counterparts. Identification of regions of the genome underling these changes offers the promise of characterizing the early biological consequences of domestication. In the current study, we sequenced a population of farmed European Atlantic salmon and compared the observed patterns of SNP variation to those found in conspecific wild populations. This identified 139 genomic regions that contained significantly elevated SNP homozygosity in farmed fish when compared to their wild counterparts. The most extreme was adjacent to versican, a gene involved in control of neural crest cell migration. To control for false positive signals, a second and independent dataset of farmed and wild European Atlantic salmon was assessed using the same methodology. A total of 81 outlier regions detected in the first dataset showed significantly reduced homozygosity within the second one, strongly suggesting the genomic regions identified are enriched for true selection sweeps. Examination of the associated genes identified a number previously characterized as targets of selection in other domestic species and that have roles in development, behavior and olfactory system. These include arcvf, sema6, errb4, id2-like, and 6n1-like genes. Finally, we searched for evidence of parallel sweeps using a farmed population of North American origin. This failed to detect a convincing overlap to the putative sweeps present in European populations, suggesting the factors that drive patterns of variation under domestication and early artificial selection were largely independent. This is the first analysis on domestication of aquaculture species exploiting whole-genome sequence data and resulted in the identification of sweeps common to multiple independent populations of farmed European Atlantic salmon.Entities:
Keywords: Atlantic salmon; domestication; evolution; selection sweep; whole genome sequence
Year: 2020 PMID: 32318091 PMCID: PMC7147387 DOI: 10.3389/fgene.2020.00264
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Origin of the whole genome sequences used.
| 1 | EU | Farmed | Pool | Scotland | 4*22 | M/F | This Study |
| EU | Wild | Pool | Eidselva | 4*20 | M | ||
| EU | Wild | Pool | Aardselva | 4*20 | M | ||
| EU | Wild | Pool | Flekke | 4*20 | M | ||
| EU | Wild | Pool | Gloppenelva | 4*20 | M | ||
| EU | Wild | Pool | Sulsdagen | 4*20 | M | ||
| EU | Wild | Pool | Vorma | 4*20 | M | ||
| 2 | EU | Farmed | Indiv | Chile | 13 | M/F | |
| EU | Wild | Indiv | Jølstra | 3 | 2M/1F | ||
| EU | Wild | Indiv | Naustra | 3 | 2M/1F | ||
| EU | Wild | Indiv | Namsen | 3 | 2M/1F | ||
| EU | Wild | Indiv | Argadsvassdraget | 3 | 2M/1F | ||
| 3 | NA | Farmed | Indiv | Tasmanian | 20 | M/F |
FIGURE 1Distribution of pooled heterozygosity (ZHp) for Dataset 1. (A) Number and proportion of SNP shared between farmed and wild European Atlantic salmon. (B) Genomic relationship assessed using PCA of 28 pooled samples in Dataset1. PC1 explains 1.59% and PC2 1.13% of total variability. (C) Genome-wide plot of -ZHp estimated for farmed fish. (D) Genome-wide plot of -ZHp estimated for farmed fish.
FIGURE 2Selective sweep analysis for Dataset 1. (A) Genome wide distribution of ΔZHp derived using both farmed and wild fish. Positive values indicate regions putatively under positive selection in farmed animals. (B) The –ZHp value for each genomic window was plotted for both farmed (x axis) and wild populations (y axis) to show their correlation (r2 = 0.457) and outlier behavior. Windows indicated in red have elevated homozygosity in farmed Atlantic salmon pools compared to wild (p-val ≤ 0.05) while windows indicated in blue are the opposite (p-val ≤ 0.05). (C) A 430 kb region of Atlantic salmon chromosome 20 is shown spanning two windows exhibiting the highest positive ΔZHp values detected genome wide (5.79 and 5.89, Table 2). Genotypic status is shown as separate horizontal tracts for 20 pools of wild fish and 4 pools of farmed fish using the integrated genome viewer (IGV). SNP polymorphism within a pooled sample is indicated using dark blue. SNP homozygosity within a pool is indicated using either gray (where the fixed allele was the same as the reference genome) or cyan (where the fixed allele was the non-reference allele).
Putative selective sweeps detected in farmed Atlantic salmon (Dataset 1).
| 20 | 9375001 | 9525000 | −2.72 | 3.18 | 5.89 | Vcan | LOC106580143 versican core protein-like | Cell adhesion, proliferation, migration. Neural crest cells migration. Wound healing | ||
| 20 | 9300001 | 9450000 | −3.00 | 2.79 | 5.79 | – | – | |||
| – | ||||||||||
| 27 | 26325001 | 26475000 | −2.50 | 2.53 | 5.02 | – | – | |||
| 9 | 95175001 | 95325000 | −2.36 | 2.20 | 4.55 | Eif4h gsonmt 00055384001 atp1b3a | if4h Eukaryotic translation initiation factor 4H LOC106612405 E3 ubiquitin-protein ligase rififylin-likeLOC106612413 sodium/potassium-transporting ATPase subunit beta-3-like | Atp1b3:osmoregulation |
FIGURE 3Common signals of selection in two independent datasets. (A) Putative selection sweeps in farmed fish from Dataset 1 (139 regions p-val < 0.001) contain higher fixation than expected in farmed but not in wild individuals in Dataset 2. The blue and green bars denote farmed and wild fish data respectively. The distribution of –ZHp values is shown for all windows genome wide (A, distribution plots with white interior) and separately only for the 139 windows putatively under selection in farmed salmon within Dataset 1 (B, distribution plots with black interior). (B) Correlation between ΔZHp for Dataset 1 and Dataset 2 identified genomic windows with outlier behavior in one or both datasets (r2 = 0.09; p-val < 2.2 × 10–16). Windows colored red denote significant positive ΔZHp in both Dataset 1 and Dataset 2 (Supplementary Table S7). Green windows were significantly positive only within Dataset 1 and blue only within Dataset 2. Outliers with significantly negative ΔZHp were not colored but are listed in Supplementary Table S8.
FIGURE 4Selective sweep analysis in farmed Atlantic salmon of North American origin. (A) The genome wide distribution of ZHp in farmed Atlantic salmon from Tasmania (Dataset 3). (B) Genome windows with elevated homozygosity in Dataset 3 do not exhibit similarity increased homozygosity in either Datasets 1 or 2. The blue and green bars denote farmed and wild fish data respectively. The distribution of –ZHp values is shown for all windows genome wide (A, distribution plots with white interior) and separately only for the 79 windows putatively under selection in farmed Atlantic salmon within Dataset 3 (B, distribution plots with black interior).
Candidate Selective sweeps detected in North American Atlantic Salmon samples from Tasmania (Dataset 3).
| 5 | 32325001 | 32475000 | 3.34 | ubl3 atl3 dipa fibp slc7a3a snx12 fosl2 prs23 efemp2b foxn3 cof2 | ubl3; ubiquitin-like 3LOC106604707; serine protease 23-like%2C transcript variant LOC106604709; atlastin-3-likeLOC106604711; protein YIF1A-like%2C transcript variant LOC106604713; coiled-coil domain-containing protein 85B-like%2C transcript variant LOC106604715; acidic fibroblast growth factor intracellular-binding protein-likesnx12; sorting nexin 12%2C transcript variant slc7a3; solute carrier family 7 (cationic amino acid transporter%2C y+ system)%2C member 3%2C transcript variant LOC106604708; cofilin-2-likeLOC106604710; forkhead box protein N2-like%2C transcript variant LOC106604712; fos-related antigen 1-like%2C transcript variant LOC106604716; EGF-containing fibulin-like extracellular matrix protein 2%2C transcript variant |
| 21 | 17625001 | 17775000 | 3.26 | i20ra orf2p i10r2 crfb2 btn1a1 crfb1a | 20ra; Interleukin-20 receptor alpha chainLOC106581740; uncharacterized LOC106581740%2C transcript variant LOC106581741; interleukin-10 receptor subunit beta-likeLOC106581720; CD276 antigen homolog%2C transcript variant LOC106581739; uncharacterized protein C21orf62-like%2C transcript variant |
| 15 | 45075001 | 45225000 | 3.25 | LOC106571716; uncharacterized protein K02A2.6-likeLOC106571620; C-C chemokine receptor type 6-like | |
| 12 | 72000001 | 72150000 | 3.24 | gsonmt00065759001 arl6ip5b trnt1 nft-vtr mitfa gsonmt00065604001 | |
| 24 | 30900001 | 31050000 | 3.19 | cep78 naa35 gnaq vps13a agtpbp1 gna14 | naa35; |
| 15 | 45150001 | 45300000 | 3.09 | LOC106571620; C-C chemokine receptor type 6-like | |
| 25 | 16725001 | 16875000 | 3.07 | gypc | LOC106586280; glycophorin-C-like%2C transcript variant |
| 25 | 16800001 | 16950000 | 3.05 | Unknown | LOC106586279; contactin-associated protein-like 5 |
| 25 | 17100001 | 17250000 | 3.03 | gsonmt00061935001 |