| Literature DB >> 24524230 |
Ross D Houston1, John B Taggart, Timothé Cézard, Michaël Bekaert, Natalie R Lowe, Alison Downing, Richard Talbot, Stephen C Bishop, Alan L Archibald, James E Bron, David J Penman, Alessandro Davassi, Fiona Brew, Alan E Tinch, Karim Gharbi, Alastair Hamilton.
Abstract
BACKGROUND: Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array.Entities:
Mesh:
Year: 2014 PMID: 24524230 PMCID: PMC3923896 DOI: 10.1186/1471-2164-15-90
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the sequencing experiments for SNP discovery
| Samples (number) | Farmed (40), Wild (16), Haploid (1) | Farmed (160) | Farmed (72) |
| Sequencing | Illumina 100 bp PE | Illumina 100 bp S&PE | Illumina 100 bp PE |
| Initial putative SNPs | 472,072 | 467,268 | 816,570 |
| SNPs for array design | 99,097 | 83,151 | 229,754 |
| Final SNPs on array | 73,800 | 54,197 | 156,979 |
Quantity and source of the SNPs on the array at different stages of quality filtering
| Total candidate SNPs | 73,800 | 54,197 | 156,979 | 4,714 | 286,021 |
| Low quality clusters** | 9,795 | 8,192 | 21,609 | 219 | 39,429 |
| Monomorphic | 9,010 | 18,157 | 83,368 | 811 | 110,910 |
| High quality polymorphic SNPs | 54,995 | 27,848 | 52,002 | 3,684 | 135,682 |
| Mendelian error | 1,292 | 756 | 1,595 | 66 | 3,649 |
| Final total filtered SNPs | 53,703 | 27,092 | 50,407 | 3,618 | 132,033 |
*The row total is lower than sum of the individual column totals because a proportion of these SNPs were from multiple categories (e.g. discovered in both the RR and RAD experiments); see Additional file 1: Table S1.
**‘Quality’ refers to the cluster properties of the SNP when genotyped on the Axiom array. Low quality SNPs are those with cluster properties that fall below a threshold value (e.g. < 97% call rate).
Figure 1Source of the SNPs on the ssalar01 array. Proportion of total SNPs derived from each of the SNP discovery categories (RR-Seq, RAD-Seq, RNA-Seq and other). ‘Putative SNPs’ comprise the 286,021 putative SNPs placed on the array, and ‘QC-filtered SNPs’ comprise the 132,033 final quality-control filtered SNPs used for analysis. Note that some SNPs were detected in multiple discovery categories (see Additional file 1: Table S1).
Frequency of the filtered SNPs in the tested populations (four yeargroups of farmed Scottish fish, two populations of farmed Norwegian, and a combination of the wild fish)
| Overall | 68 | 132,033 | 122,063 |
| Farmed Scottish | 39 | 130,062 | 120,157 |
| Year-group 1 | 10 | 121,849 | 109,487 |
| Year-group 2 | 10 | 117,182 | 105,752 |
| Year-group 3 | 9 | 117,185 | 117,184 |
| Year-group 4 | 10 | 117,290 | 105,111 |
| Farmed Norwegian | 16 | 108,885 | 101,536 |
| Population 1 | 8 | 97,631 | 97,631 |
| Population 2 | 8 | 74,154 | 74,154 |
| Wild | 13 | 119,526 | 110,320 |
*Note that only unrelated animals from each population were included; i.e. the offspring from trios (Farmed Scottish) or linkage mapping families (Wild) were removed. The full list of animals is given in Additional file 2.
Figure 2Population segregation of SNPs and minor allele frequency. (A) Sharing of the QC-filtered SNPs (with minor allele frequency higher than 0.05) between the different Atlantic salmon populations depicted by a Venn diagram (number of SNPs given in parentheses). (B) Distribution of the minor allele frequency of the final QC-filtered SNPs across all unrelated animals in the test population.
Figure 3Genomic distribution of SNPs and comparison of linkage maps. (A) Number of final QC-filtered SNPs per reference genome contig; Number of final QC-filtered SNPs contained per reference genome contig. (B) Scatterplot of number of SNPs per chromosome comparing the current study to the map of Lien et al. [3]. Note that chromosomes 2, 6, 22 and 23 are not included because the number of SNPs on those chromosomes is underestimated in the current study (see ‘Methods’).
Number of SNPs assigned to the Atlantic salmon chromosomes using sire-based linkage mapping (chromosome and linkage group nomenclature as given in [43])
| 1 | 17 | 3,411 |
| 2 | 1 | 774 |
| 3 | 11 | 2,007 |
| 4 | 28 | 2,189 |
| 5 | 12 | 1,496 |
| 6 | 4 | 657 |
| 7 | 24 | 633 |
| 8 | 19 | 634 |
| 9 | 10 | 2,266 |
| 10 | 2 | 2,015 |
| 11 | 9 | 1,641 |
| 12 | 6 | 1,804 |
| 13 | 5 | 1,942 |
| 14 | 3 | 1,576 |
| 15 | 8 | 1,830 |
| 16 | 23 | 1,398 |
| 17 | 22 | 1,085 |
| 18 | 16 | 1,443 |
| 19 | 13 | 1,247 |
| 20 | 25 | 1,381 |
| 21 | 14 | 950 |
| 22 | 32 | 652 |
| 23 | 18 | 589 |
| 24 | 7 | 2,134 |
| 25 | 20 | 950 |
| 26 | 21 | 927 |
| 27 | 15 | 980 |
| 28 | 33 | 807 |
| 29 | 31 | 796 |
| Total | 40,214 |
Figure 4Clustering of samples based on genetic similarity. Clustering of samples based on genome-wide identity-by-state and multidimensional scaling to detect population structure.
Figure 5Use of Y-specific probes to predict phenotypic sex. Correspondence between genetic sex of the fish (based on the Y-specific probes on the array) and phenotypic sex (where known).