| Literature DB >> 28533337 |
Alejandro P Gutierrez1, Frances Turner2, Karim Gharbi2, Richard Talbot2, Natalie R Lowe1, Carolina Peñaloza1, Mark McCullough3, Paulo A Prodöhl3, Tim P Bean4, Ross D Houston5.
Abstract
SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster (Crassostrea gigas) and European flat oyster (Ostrea edulis), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples (n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families (n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs.Entities:
Keywords: Pacific oyster; aquaculture; array; flat oyster; single nucleotide polymorphism (SNP)
Mesh:
Year: 2017 PMID: 28533337 PMCID: PMC5499128 DOI: 10.1534/g3.117.041780
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Detail of populations sampled for sequencing and SNP discovery
| Population | Location (Lat, Long) | Population | Location (Lat, Long) | ||
|---|---|---|---|---|---|
| Guernsey, England | 49.497, −2.502 | 47 | Croatia | 42.855, 17.688 | 14 |
| Maldon, England | 51.724, 0.710 | 15 | Lough Foyle, Ireland | 55.130, −7.087 | 15 |
| Sea Salter, England | 51.378, 1.212 | 13 | Lake Grevelingen, The Netherlands | 51.709, 4.017 | 15 |
| Ifremer, France | n/a | 16 | Larne, Northern Ireland | 54.817, −5.751 | 14 |
| Hatchery 1 (Marinove), France | 46.987, −2.238 | 29 | Mersea, England | 51.776, 0.9646 | 15 |
| Hatchery 2 (SATMAR), France | 46.948, −2.052 | 26 | Baie de Quiberon, France | 47.548, −2.996 | 15 |
| Hatchery 3 (France Naissain), France | 47.514, −2.666 | 29 | Rossmore (Cork), Ireland | 51.883, −8. 247 | 15 |
| Hatchery 4 (Novostrea), France | 46.954, −2.044 | 28 | Sveio, Norway | 59.519, 5.227 | 15 |
| Swansea Bay, England | 51.604, −3.981 | 15 | |||
| Tralee, Ireland | 52.316, −10. 028 | 13 | |||
| Damariscotta, Maine | 44.028, −69.534 | 14 | |||
Lat, latitude; Long, longitude.
Descriptive population genetic estimates for the sampled C. gigas populations included in the validation of the array
| MAF > 0 | |||||
|---|---|---|---|---|---|
| Sample | # SNPs | Average MAF | Ho | He | |
| UK (combined) | 56 | 27,313 | 0.186 | 0.294 | 0.298 |
| GSF + parents | 38 | 26,549 | 0.19 | 0.308 | 0.304 |
| Maldon | 9 | 22,079 | 0.216 | 0.308 | 0.303 |
| Sea Salter | 9 | 22,821 | 0.214 | 0.317 | 0.302 |
| Average within UK populations | 23,816 | 0.207 | 0.311 | 0.303 | |
| France (combined) | 52 | 26,891 | 0.182 | 0.240 | 0.254 |
| Ifremer | 13 | 23,010 | 0.203 | 0.312 | 0.328 |
| Hatchery 1 | 10 | 21,479 | 0.217 | 0.321 | 0.303 |
| Hatchery 2 | 10 | 20,141 | 0.221 | 0.322 | 0.307 |
| Hatchery 3 | 10 | 21,730 | 0.215 | 0.302 | 0.302 |
| Hatchery 4 | 9 | 22,052 | 0.214 | 0.317 | 0.301 |
| Average within French populations | 21,682 | 0.214 | 0.315 | 0.308 | |
| All populations (combined) | 108 | 27,697 | 0.182 | 0.268 | 0.283 |
MAF, minor allele frequency; #, number; SNPs, single nucleotide polymorphisms; Ho, level of genetic variability in terms of observed heterozygosity; He, level of genetic variability in terms of expected heterozygosity; GSF, Guernsey Sea Farm.
Values were obtained by the analysis of the combined dataset, not the average of the individual populations.
Values represent the within-population average.
Descriptive population genetic estimates for the sampled O. edulis populations included in the validation of the array
| MAF > 0 | |||||
|---|---|---|---|---|---|
| Sample | # SNPs | Average MAF | Ho | He | |
| Croatia | 9 | 8,474 | 0.234 | 0.323 | 0.320 |
| Foyle_IRL | 10 | 10,013 | 0.224 | 0.319 | 0.311 |
| Grevelingen_NLD | 10 | 9,946 | 0.224 | 0.319 | 0.310 |
| Larne_NIRL | 10 | 8,927 | 0.231 | 0.354 | 0.316 |
| Mersea_UK | 10 | 9,980 | 0.224 | 0.318 | 0.310 |
| Quiberon_FR | 10 | 9,973 | 0.226 | 0.315 | 0.312 |
| Rossmore_IRL | 10 | 9,846 | 0.228 | 0.327 | 0.314 |
| Sveio_NOR | 10 | 9,118 | 0.226 | 0.322 | 0.313 |
| Swansea_UK | 9 | 9,696 | 0.224 | 0.319 | 0.311 |
| Tralee_IRL | 10 | 9,980 | 0.219 | 0.317 | 0.306 |
| Maine_USA | 10 | 9,614 | 0.221 | 0.317 | 0.305 |
| Average within population | 9,597 | 0.225 | 0.323 | 0.312 | |
| All populations (combined) | 108 | 11,151 | 0.210 | 0.292 | 0.311 |
MAF, minor allele frequency; #, number; SNPs, single nucleotide polymorphisms; Ho, level of genetic variability in terms of observed heterozygosity; He, level of genetic variability in terms of expected heterozygosity.
Values represent the within-population average.
Values were obtained by the analysis of the combined dataset, not the average of the individual populations.
Figure 1Identity-by-state clustering of selected O. edulis populations. Neth, The Netherlands; N.Ire, Northern Ireland.
Figure 2Identity-by-state-based clustering of the three nuclear C. gigas families. Samples in purple (wrong pedigree “wp”) were not assigned to any of the three families.
Figure 3Distribution of SNPs on the C. gigas genome. Number of scaffolds containing SNPs (primary axis) and the average length of the scaffolds holding an increasing number of SNPs (secondary axis). SNP, single nucleotide polymorphism.
Figure 4Decay of linkage disequilibrium (LD) with physical distance between markers among all the sampled C. gigas populations. Fr, France.
Figure 5Decay of linkage disequilibrium (LD) among the three C. gigas families.