| Literature DB >> 33933591 |
C Peñaloza1, T Manousaki2, R Franch3, A Tsakogiannis2, A K Sonesson4, M L Aslam4, F Allal5, L Bargelloni3, R D Houston6, C S Tsigenopoulos7.
Abstract
SNP arrays are powerful tools for high-resolution studies of the genetic basis of complex traits, facilitating both selective breeding and population genomic research. The European seabass (Dicentrarchus labrax) and the gilthead seabream (Sparus aurata) are the two most important fish species for Mediterranean aquaculture. While selective breeding programmes increasingly underpin stock supply for this industry, genomic selection is not yet widespread. Genomic selection has major potential to expedite genetic gain, particularly for traits practically impossible to measure on selection candidates, such as disease resistance and fillet characteristics. The aim of our study was to design a combined-species 60 K SNP array for European seabass and gilthead seabream, and to test its performance on farmed and wild populations from numerous locations throughout the species range. To achieve this, high coverage Illumina whole-genome sequencing of pooled samples was performed for 24 populations of European seabass and 27 populations of gilthead seabream. This resulted in a database of ~20 million SNPs per species, which were then filtered to identify high-quality variants and create the final set for the development of the 'MedFish' SNP array. The array was then tested by genotyping a subset of the discovery populations, highlighting a high conversion rate to functioning polymorphic assays on the array (92% in seabass; 89% in seabream) and repeatability (99.4-99.7%). The platform interrogates ~30 K markers in each species, includes features such as SNPs previously shown to be associated with performance traits, and is enriched for SNPs predicted to have high functional effects on proteins. The array was demonstrated to be effective at detecting population structure across a wide range of fish populations from diverse geographical origins, and to examine the extent of haplotype sharing among Mediterranean farmed fish populations. In conclusion, the new MedFish array enables efficient and accurate high-throughput genotyping for genome-wide distributed SNPs for each fish species, and will facilitate stock management, population genomics approaches, and acceleration of selective breeding through genomic selection.Entities:
Keywords: Aquaculture; European seabass; SNP array; gilthead seabream
Mesh:
Year: 2021 PMID: 33933591 PMCID: PMC8276775 DOI: 10.1016/j.ygeno.2021.04.038
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736
Summary of the European seabass and gilthead seabream populations sampled for sequencing and SNP discovery.
| Species | Origin | Region | Country | Pool ID | № individuals per pool | N° pools prepared |
|---|---|---|---|---|---|---|
| European seabass | Farmed | Mediterranean | France | Sba_farm_1 | 12 | 1 |
| Spain | Sba_farm_2 | 25 | 2 | |||
| Spain | Sba_farm_3 | 25 | 2 | |||
| Italy | Sba_farm_4 | 25 | 2 | |||
| Croatia | Sba_farm_5 | 25 | 2 | |||
| Croatia | Sba_farm_6 | 25 | 2 | |||
| Greece | Sba_farm_7 | 25 | 2 | |||
| Greece | Sba_farm_8 | 25 | 2 | |||
| Greece | Sba_farm_9 | 25 | 2 | |||
| Greece | Sba_farm_10 | 25 | 2 | |||
| Greece | Sba_farm_11 | 25 | 2 | |||
| Greece | Sba_farm_12 | 23 | 1 | |||
| Cyprus | Sba_farm_13 | 25 | 2 | |||
| Egypt | Sba_farm_14 | 15 | 1 | |||
| Wild | Mediterranean | France | Sba_wild_1 | 25 | 2 | |
| Spain | Sba_wild_2 | 11 | 1 | |||
| Morocco | Sba_wild_3 | 25 | 2 | |||
| Italy | Sba_wild_4 | 25 | 2 | |||
| Croatia | Sba_wild_5 | 12 | 1 | |||
| Greece | Sba_wild_6 | 25 | 2 | |||
| Greece | Sba_wild_7 | 25 | 2 | |||
| Cyprus | Sba_wild_8 | 15 | 1 | |||
| Turkey | Sba_wild_9 | 25 | 2 | |||
| Turkey | Sba_wild_10 | 25 | 2 | |||
| Gilthead seabream | Farmed | Mediterranean | France | Sbr_farm_1 | 25 | 2 |
| Spain | Sbr_farm_2 | 25 | 2 | |||
| Spain | Sbr_farm_3 | 25 | 2 | |||
| Italy | Sbr_farm_4 | 25 | 2 | |||
| Croatia | Sbr_farm_5 | 25 | 2 | |||
| Greece | Sbr_farm_6 | 14 | 1 | |||
| Greece | Sbr_farm_7 | 13 | 1 | |||
| Greece | Sbr_farm_8 | 25 | 2 | |||
| Greece | Sbr_farm_9 | 25 | 2 | |||
| Greece | Sbr_farm_10 | 25 | 2 | |||
| Israel | Sbr_farm_11 | 25 | 2 | |||
| Egypt | Sbr_farm_12 | 15 | 1 | |||
| Wild | Atlantic | France | Sbr_wild_1 | 25 | 2 | |
| Spain | Sbr_wild_2 | 25 | 2 | |||
| Spain | Sbr_wild_3 | 25 | 2 | |||
| Mediterranean | Spain | Sbr_wild_4 | 25 | 2 | ||
| Spain | Sbr_wild_5 | 25 | 2 | |||
| Tunisia | Sbr_wild_6 | 25 | 2 | |||
| Italy | Sbr_wild_7 | 25 | 2 | |||
| Italy | Sbr_wild_8 | 25 | 2 | |||
| Greece | Sbr_wild_9 | 25 | 2 | |||
| Greece | Sbr_wild_10 | 25 | 2 | |||
| Greece | Sbr_wild_11 | 25 | 2 | |||
| Greece | Sbr_wild_12 | 25 | 2 | |||
| Greece | Sbr_wild_13 | 25 | 2 | |||
| Turkey | Sbr_wild_14 | 25 | 2 | |||
| Turkey | Sbr_wild_15 | 25 | 2 | |||
Summary of SNP selection approach. A variable number of SNPs was selected along chromosomes according to the local nucleotide diversity (π) estimates for non-overlapping genomic windows.
| Genomic window diversity class | Range | № of SNPs sampled per window | № SNPs sampled per MAF category per window | ||
|---|---|---|---|---|---|
| >0.3 | 0.2–0.3 | 0.1–0.2 | |||
| Class 1 | π ≤0.001 | 1 | 1 | 0 | 0 |
| Class 2 | 0.001 < π ≤0.002 | 2 | 2 | 0 | 0 |
| Class 3 | 0.002 < π ≤0.003 | 3 | 2 | 1 | 0 |
| Class 4 | 0.003 < π ≤0.004 | 4 | 2 | 2 | 0 |
| Class 5 | π >0.004 | 5 | 2 | 2 | 1 |
Fish samples genotyped using the combined species MedFish SNP array.
| Species | Origin | Population ID | Country | N° individuals typed | N° individuals passing QC |
|---|---|---|---|---|---|
| European seabass | Farmed | Sba_farm_1 | France | 12 | 8 |
| Sba_farm_2 | Spain | 25 | 18 | ||
| Sba_farm_3 | Spain | 25 | 16 | ||
| Sba_farm_4 | Italy | 24 | 24 | ||
| Sba_farm_6 | Croatia | 25 | 16 | ||
| Sba_farm_7 | Greece | 25 | 18 | ||
| Sba_farm_8 | Greece | 25 | 16 | ||
| Sba_farm_9 | Greece | 24 | 16 | ||
| Sba_farm_10 | Greece | 25 | 17 | ||
| Sba_farm_11 | Greece | 24 | 11 | ||
| Sba_farm_12 | Greece | 23 | 6 | ||
| Sba_farm_13 | Cyprus | 23 | 14 | ||
| Sba_farm_14 | Egypt | 14 | 12 | ||
| Wild | Sba_wild_Mediterranean | 208 | 184 | ||
| gilthead seabream | Farmed | Sbr_farm_1 | France | 24 | 19 |
| Sbr_farm_2 | Spain | 18 | 17 | ||
| Sbr_farm_3 | Spain | 25 | 19 | ||
| Sbr_farm_5 | Croatia | 19 | 19 | ||
| Sbr_farm_6 | Greece | 13 | 12 | ||
| Sbr_farm_7 | Greece | 13 | 13 | ||
| Sbr_farm_8 | Greece | 21 | 19 | ||
| Sbr_farm_9 | Greece | 24 | 22 | ||
| Sbr_farm_10 | Greece | 20 | 17 | ||
| Sbr_farm_11 | Israel | 13 | 9 | ||
| Sbr_farm_12 | Egypt | 15 | 14 | ||
| Wild | Sbr_wild_Atlantic | 28 | 27 | ||
| Sbr_wild_Mediterranean | 245 | 221 |
Fig. 1Schematic representation of the distribution of array markers in the European seabass (left) and gilthead seabream (right) genomes after following a SNP selection strategy based on local nucleotide diversity. (A) Chromosome number. (B) Levels of diversity (π) estimated over 70 Kb and 85 Kb windows in the European seabass and gilthead seabream, respectively. Red bars represent regions with high nucleotide diversity. (C) Genome-wide distribution of markers on the combined-species SNP chip. Light blue bars represent windows for which 1–3 SNPs were selected. Red bars represent windows for which more than four SNPs were selected.
Number of SNPs for each species of each Axiom quality class.
| Conversion typea | № European seabass (%) | № gilthead seabream (%) |
|---|---|---|
| Polymorphic high resolution | 26,466 (88.55%) | 26,369 (88.47%) |
| No minor homozygote | 993 (3.32%) | 75 (0.25%) |
| Monomorphic high resolution | 26 (0.09%) | 36 (0.12%) |
| Off-target-variant (OTV) | 50 (0.17%) | 78 (0.26%) |
| Call rate below threshold (97%) | 889 (2.97%) | 1292 (4.33%) |
| Other | 1464 (4.90%) | 1957 (6.57%) |
The categories are based on cluster properties and QC metrics.
aThe Conversion type follows Thermo Fisher's terminology:
PolyHighResolution = Class with the highest quality probes. SNP is polymorphic and the presence of both the major and minor homozygous clusters is observed.
NoMinorHom = similar to a PolyHighResolution, but no evidence of individuals with minor homozygous genotypes, presumably due to a low genotype frequency.
MonoHighResolution = SNP can reliably be scored as monomorphic.
Off-target variant (OTV) = SNPs where additional (i.e. more than three) clusters are observed, making genotype calling ambiguous.
CallRateBelowThreshold = SNP with the expected number of clusters (usually 3, one for each possible genotype), but where the proportion of samples scored at the SNP falls below a user-defined threshold.
Other = SNPs that do not fall in any of the above categories.
Fig. 2Evaluation of SNP ascertainment on pairwise FST estimates in European seabass and gilthead seabream pooled population samples. Pairwise FST values obtained for (A) European seabass and (B) gilthead seabream populations based on a ‘non-ascertained’ marker panel (below the diagonal) and an ‘ascertained’ panel containing the array SNPs (above the diagonal). Principle coordinate analysis of pairwise FST among (C) European seabass and (D) gilthead seabream populations, and comparison between results obtained from a ‘non-ascertained’ (left panel) vs ‘ascertained’ (right panel) set of SNPs.
Fig. 3MDS analysis performed on individuals from farmed and wild (A) European seabass and (B) gilthead seabream populations. The different point symbols separate samples by origin in (i) farms from the West of the Mediterranean (■), (ii) farms from the centre of the Mediterranean (●), and (iii) farms from the East of the Mediterranean (▲), from (iv) wild populations (+).
Fig. 4Heatmaps indicating the percentage of shared haplotypes between pairs of (A) European seabass and (B) gilthead seabream farmed populations. The colorbar on the right indicates the percentage of shared haplotypes. Entries are shown for pairs of farms with a statistically significant excess of shared haplotypes (p-value <0.05).