| Literature DB >> 24571138 |
Serap Gonen1, Natalie R Lowe, Timothé Cezard, Karim Gharbi, Stephen C Bishop, Ross D Houston.
Abstract
BACKGROUND: Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24571138 PMCID: PMC4028894 DOI: 10.1186/1471-2164-15-166
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
SNP and individual filtering procedure
| Raw RAD-Seq processing | 0 | 28,415 | 0 | 96 |
| Missing Genotypes | 14,778 | 13,637 | 0 | 96 |
| SNPs (> 50%) | ||||
| Missing Genotypes | 0 | 13,637 | 15 | 81 |
| Individuals (> 25%) | ||||
| Excess heterozygosity | 4,895 | 8,742 | 0 | 81 |
| (PSVs; > 70%) | ||||
| Mendelian errors | 485 | 8,257 | 0 | 81 |
| SNPs (≥ 2) | ||||
| Mendelian errors | 0 | 8,257 | 4 | 77 |
| Individuals (> 200) |
Stringent quality control filtering was applied to the initial set of 28,415 putative SNPs generated from the raw RAD-Seq reads. Filtering parameters for SNPs included removing excess missing data (> 50%), excess Mendelian errors (≥ 2 individuals) and excess heterozygosity (putative PSVs; > 70%). The final number of SNPs left for linkage map construction was 8,257. Individuals were removed if they showed excess missing genotypes (> 25% of the SNPs) and/or excess Mendelian errors (> 200 SNPs). 77 individuals remained for linkage map construction post-filtering.
Figure 1Relationship between read depth and call rate. The number of reads per individual following exclusion of PCR duplicates (x-axis) plotted against the proportion of SNP genotypes successfully called for all putative SNPs (y-axis). The red lines on the graph indicate the thresholds below which individuals were removed from the current analysis due to an excess of missing genotypes.
Number of SNPs assigned to linkage groups
| 1 | 2 | 244 | 59 | 23 | 73 | 68 |
| 2 | 10 | 350 | 102 | 86 | 88 | 79 |
| 3 | 14 | 197 | 31 | 35 | 68 | 76 |
| 4 | 6 | 283 | 47 | 43 | 84 | 79 |
| 5 | 13 | 306 | 67 | 84 | 84 | 85 |
| 6 | 12 | 257 | 72 | 46 | 81 | 71 |
| 7 | 24 | 138 | 27 | 47 | 51 | 39 |
| 8 | 15 | 520 | 78 | 47 | 69 | 84 |
| 9 | 11 | 226 | 50 | 64 | 67 | 49 |
| 10 | 9 | 394 | 95 | 94 | 113 | 103 |
| 11 | 3 | 336 | 48 | 92 | 98 | 103 |
| 12 | 5 | 224 | 24 | 29 | 74 | 68 |
| 13 | 19 | 197 | 49 | 58 | 43 | 59 |
| 14 | 21 | 152 | 38 | 40 | 47 | 33 |
| 15 | 27 | 132 | 31 | 39 | 40 | 43 |
| 16 | 18 | 209 | 33 | 58 | 65 | 67 |
| 17 | 1 | 442 | 70 | 78 | 130 | 129 |
| 18 | 23 | 155 | 36 | 33 | 55 | 51 |
| 19 | 8 | 42 | 8 | 0 | 13 | 15 |
| 20 | 25 | 115 | 20 | 33 | 26 | 23 |
| 21 | 26 | 113 | 25 | 25 | 30 | 35 |
| 22 | 17 | 158 | 19 | 44 | 33 | 64 |
| 23 | 16 | 215 | 58 | 49 | 66 | 55 |
| 24 | 7 | 169 | 22 | 17 | 57 | 56 |
| 25 | 20 | 237 | 65 | 77 | 62 | 73 |
| 28 | 4 | 220 | 19 | 32 | 80 | 83 |
| 30 | 29 | 116 | 43 | 23 | 36 | 37 |
| 31 | 28 | 132 | 34 | 37 | 42 | 39 |
| 32 | 22 | 179 | 41 | 66 | 58 | 51 |
SNPs were assigned to a linkage group using previously mapped anchor markers in CRI-MAP. SNPs were then ordered on each linkage group in Onemap for each individual mapping parent separately.
Number of SNPs showing sex-specific segregation patterns and the total map length for each mapping parent
| Br5 mother | 1,688 | 2,807 |
| Br5 father | 1,952 | 2,170 |
| Br6 mother | 1,804 | 2,358 |
| Br6 father | 1,895 | 1,426 |
6,458 SNP markers were assigned to a linkage group using the CRI-MAP software. The table gives the total number of markers segregating for each parent from this subset of assigned markers.
Figure 2Example linkage maps. Maps for linkage group 13 for: (i) Br5 mother; (ii) Br5 father, (iii) Br6 mother; (iv) Br6 father. Map lengths are shorter in the male parent maps and markers are more widely spaced in the female parent maps. Marker names on the drawn maps are coded as RAD1-RADX depending on the ordered position of the marker on the linkage group.
Figure 3Comparison of marker clustering between male and female linkage maps. For each linkage group for each parent, the five intervals with the highest percentage of markers were identified. For each of these intervals, an average percentage of markers was calculated across all linkage groups and both families Br5 and Br6. Blue bars = Male parent average percentages; Red bars = Female parent average percentages. A greater clustering of markers in interval one is apparent in male parents.
Reference genome contigs assigned to Atlantic salmon linkage groups
| 1 | 2 | 177 | 2,030 | 20 | |
| 2 | 10 | 262 | 3,410 | 19 | |
| 3 | 14 | 156 | 2,170 | 3/10 | |
| 4 | 6 | 230 | 2,880 | 11 | |
| 5 | 13 | 209 | 2,530 | NA | |
| 6 | 12 | 211 | 3,090 | 17/9 | |
| 7 | 24 | 94 | 1,160 | 13 | |
| 8 | 15 | 175 | 2,220 | 18 | |
| 9 | 11 | 180 | 2,160 | 2 | |
| 10 | 9 | 298 | 3,850 | 1 | |
| 11 | 3 | 232 | 3,480 | 11/3 | |
| 12 | 5 | 173 | 2,110 | 20 | |
| 13 | 19 | 110 | 1,500 | 21/5 | |
| 14 | 21 | 111 | 1,570 | 16 | |
| 15 | 27 | 118 | 1,330 | 10/20 | |
| 16 | 18 | 159 | 1,900 | 6 | |
| 17 | 1 | 254 | 3,780 | 14/6 | |
| 18 | 23 | 103 | 1,300 | 8 | |
| 19 | 8 | 33 | 454 | NA | |
| 20 | 25 | 60 | 809 | 16 | |
| 21 | 26 | 97 | 1,330 | 2 | |
| 22 | 17 | 125 | 1,530 | NA | |
| 23 | 16 | 164 | 2,210 | 19 | |
| 24 | 7 | 109 | 1,370 | 4 | |
| 25 | 20 | 196 | 3,100 | 13 | |
| 28 | 4 | 165 | 2,200 | 7 | |
| 30 | 29 | 76 | 992 | 21 | |
| 31 | 28 | 84 | 1,130 | 5 | |
| 32 | 22 | 120 | 1,520 | 17 | |
Atlantic salmon reference genome contigs were assigned to linkage groups by Blastn alignment against mapped RAD contigs. Column 5 shows the stickleback linkage groups orthologous to the Atlantic salmon linkage groups identified by this study. Column 6 shows the Atlantic salmon - rainbow trout orthologous linkage groups as defined by Phillips et al. [44] (bold) and Danzmann et al. [6] (bold italic) individually, and those identified in both studies (italic). 50% of the stickleback - rainbow trout relationships suggested in this table have been identified in another study comparing rainbow trout and stickleback only [45].
*Grand total includes genome contigs assigned to more than one linkage group only once, thus is less than the sum of genome contigs per linkage group.
**Grand total given includes sequence data of genome contigs assigned to more than one linkage group only once, thus is less than the sum of sequence (kb) assigned per linkage group.
Homeology between linkage groups
| 4/11§* | 6/3 | 25 | E |
| 10/25 | 9/20 | 11 | G/H |
| 1/12§* | 2/5 | 9 | B |
| 9/21* | 11/26 | 9 | J |
| 3/11 | 14/3 | 8 | M |
| 22/24§* | 17/7 | 8 | K |
| 1/6§* | 2/12 | 5 | D |
| 3/15 | 14/27 | 4 | B |
| 2/23 | 10/16 | 3 | M,J/K |
| 2/18 | 10/23 | 3 | M |
| 7/25 | 24/20 | 3 | I |
| 19/28§* | 8/4 | 3 | - |
| 5/17 | 13/1 | 2 | I |
| 9/28 | 11/4 | 2 | G/H |
| 3/14 | 14/21 | 1 | - |
| 5/28 | 13/4 | 1 | G/H |
| 6/32§ | 12/22 | 1 | L |
| 16/17§ | 18/1 | 1 | D |
| 17/31 | 1/28 | 1 | D |
| 22/23§* | 17/16 | 1 | K |
| 22/30 | 17/29 | 1 | K |
| 17/32 | 1/22 | 1 | - |
| 12/17 | 5/18 | 1 | - |
| 10/32 | 9/22 | 1 | - |
| 9/25 | 11/20 | 1 | G/H,I |
| 4/6 | 6/12 | 1 | - |
| 2/25 | 10/20 | 1 | - |
| 2/6 | 10/12 | 1 | - |
| 9/14 | 11/21 | 1 | - |
| 3/13/17 | 14/19/1 | 1 | 3&13 = M |
| 1/6/32 | 2/12/22 | 1 | 1&6 = D |
112 Atlantic salmon reference genome contigs showed alignment to two or more linkage groups. The number of times two or more Atlantic salmon linkage groups shared an Atlantic salmon genome contig was counted, and is presented in column three. Column four shows the proto-Acinopterygian ancestral linkage group shared between Atlantic salmon linkage groups in column one, as defined by Danzmann et al. [6].
§ Also identified in [8]; *Also identified in [44].
RAD-Seq SNPs located proximal to a putative gene
| Br5 Mother | 1,311 | 212 | 16.2 | 541 | 41.3 |
| Br6 Mother | 1,399 | 227 | 16.2 | 621 | 44.4 |
| Br5 Father | 1,833 | 327 | 17.8 | 843 | 46.0 |
| Br6 Father | 1,817 | 298 | 16.4 | 815 | 44.9 |
The number of RAD-Seq SNPs located within or close to genes based on direct alignment of mapped RAD contigs (Stage 1; columns three and four) or Atlantic salmon reference genome contigs (Stage 2; columns five and six) against the stickleback nucleotide gene sequences downloaded from Ensembl BioMart (http://www.ensembl.org/biomart/martview/, Database = Ensembl Genes 72, Dataset = Gasterosterus aculeatus genes (BROADS1)). Column two gives the total number of sex specific segregating SNPs which were ordered and positioned within a linkage group using the Onemap software.
Figure 4Distribution of RAD paired-end contig lengths. Paired-end (PE) RAD-sequencing generates a read from the P1 adaptor which is located at the restriction site and a PE contig generated from sequencing at the P2 adaptor at the sheared end of the fragment. The most frequent length of the PE contigs was between 450 and 600 bp. A small number of PE contigs were over 650 bp in length; these are not shown in the figure.
Figure 5Comparison of the number of SNPs per linkage group with a previously published map. The number of SNPs per linkage group (assigned using the CRI-MAP software) in the current study was plotted against the number of SNPs per linkage group in the SNP map study conducted by Lien et al. [11]. The number of SNPs identified per linkage group in the two studies were highly correlated (r = 0.83).