| Literature DB >> 30951561 |
Cameron M Nugent1, Jong S Leong2, Kris A Christensen3, Eric B Rondeau3, Matthew K Brachmann1, Anne A Easton1, Christine L Ouellet-Fagg1, Michelle T T Crown4, William S Davidson4, Ben F Koop2, Roy G Danzmann1, Moira M Ferguson1.
Abstract
We have generated a high-density, high-throughput genotyping array for characterizing genome-wide variation in Arctic charr (Salvelinus alpinus). Novel single nucleotide polymorphisms (SNPs) were identified in charr from the Fraser, Nauyuk and Tree River aquaculture strains, which originated from northern Canada and fish from Iceland using high coverage sequencing, reduced representation sequencing and RNA-seq datasets. The array was designed to capture genome-wide variation from a diverse suite of Arctic charr populations. Cross validation of SNPs from various sources and comparison with previously published Arctic charr SNP data provided a set of candidate SNPs that generalize across populations. Further candidate SNPs were identified based on minor allele frequency, association with RNA transcripts, even spacing across intergenic regions and association with the sex determining (sdY) gene. The performance of the 86,503 SNP array was assessed by genotyping Fraser, Nauyuk and Tree River strain individuals, as well as wild Icelandic Arctic charr. Overall, 63,060 of the SNPs were polymorphic within at least one group and 36.8% were unique to one of the four groups, suggesting that the array design allows for characterization of both within and across population genetic diversity. The concordance between sdY markers and known phenotypic sex indicated that the array can accurately determine the sex of individuals based on genotype alone. The Salp87k genotyping array provides researchers and breeders the opportunity to analyze genetic variation in Arctic charr at a more detailed level than previously possible.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30951561 PMCID: PMC6450613 DOI: 10.1371/journal.pone.0215008
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sources of fish, sequence data types used in the design and testing of the Arctic charr genotyping array.
| Source of fish | Methodology | Individuals | Structure | Publication |
|---|---|---|---|---|
| Fraser aquaculture strain (Coastal Zones Research Institute, New Brunswick, Canada) | GBS | 91 | Full sib family | Nugent |
| mRNA-seq | 18 | Two full sib families | Norman | |
| Fraser aquaculture strain (Alma Aquaculture Research Station) | GBS | 108 | Two half sib families | |
| Array testing | 33 | Population | ||
| Nauyuk aquaculture strain (Icy Waters, Yukon, Canada) | GBS | 24 | Population | |
| Array testing | 42 | Population | ||
| Nauyuk–Tree River aquaculture strain hybrids (Icy Waters) | RAD-seq | 238 | Nine hybrid Families | Christensen |
| Fraser–Nauyuk aquaculture strain hybrids | RAD-seq | 67 | Two hybrid families | Christensen et al. 2018 |
| Nauyuk–Tree River aquaculture strain hybrids | High coverage sequencing | 8 | Population | Christensen |
| Tree River aquaculture strain (Icy Waters) | Array testing | 18 | Population | |
| Lake Þingvallavatn Iceland | GBS | 320 | Four full sib families | Parsons |
| Array testing | 95 | Population | ||
| Lake Vatnshlíðarvatn, Iceland | GBS | 362 | Four full sib families | Parsons |
| Array testing | 64 | Population | ||
| Lake Mývatn, Iceland | GBS | 9 | Population | |
| Mývatn lava caves, Iceland | GBS | 39 | Population | |
| Array testing | 20 | Population | ||
| Lake Galtaból, Iceland | Array testing | 57 | Population | |
| Lake Mjóavatn, Iceland | Array testing | 31 | Population | |
| Lake Svínavatn, Iceland | Array testing | 90 | Population | |
| River Fljótaá, Iceland | Array testing | 32 | Population |
* Three of the Fraser strain individuals in the test set were also parents of the two half sib families utilized in GBS in SNP discovery.
† Twenty-four of the Nauyuk strain individuals in the test set were also used in SNP discovery.
⧧All 20 of the Mývatn lava cave fish in the test set were also used for SNP discovery
Summary of the number of candidate SNPs derived from each data source and the number of SNPs from each data source that were included in the final array design.
| Data Source | Number included on array | Candidates for array design | Conversion rate |
|---|---|---|---|
| 3,149 | 5,451 | 57.8% | |
| 368 | 583 | 63.1% | |
| 3,875 | 8,734 | 44.36% | |
| 6,046 | 14,959 | 40.5% | |
| 10,491 | 14,922 | 70.3% | |
| 62,568 | 59,277 + 13,912 | 85.5% | |
| 6 | 6 | 100% | |
| 86,503 | 117,844 | 73.4% |
*The initial candidate list was 103,932 SNPs. When this initial set of candidates failed to yield enough SNPs recommended for use on the array 13,912 high coverage SNPs were added to the list of candidates
The number of polymorphic and monomorphic SNPs observed within the different test groups of Arctic charr.
The number of polymorphisms unique to each strain (Unique to strain) and the number shared with at least one other strain (Multiple strains) are shown. ‘Total recommended’ indicates markers classified as ‘recommended’ by the Axiom Analysis Suite’s best practices workflow. Polymorphic markers include the Axiom Analysis Suite, ‘polymorphic high resolution’ and ‘no minor homozygote’. Across the four groups, 79,692 unique SNPs were recommended for use.
| Fraser | Nauyuk | Tree River | Icelandic | ||
|---|---|---|---|---|---|
| Unique to strain | 1,864 | 10,924 | 8,551 | 1,864 | |
| Multiple strains | 17,898 | 34,250 | 29,865 | 12,329 | |
| Total observed | 19,762 | 45,174 | 38,416 | 14,193 | |
| 42,898 | 25,151 | 25,531 | 54,602 | ||
| 62,660 | 70,325 | 63,947 | 68,795 |
Fig 1A Venn diagram depicting the number of unique and shared polymorphic SNPs across the four groups of Arctic charr in the test set.
In total, 63,060 of the 86,503 SNPs on the Salp87k genotyping were identified as polymorphic while 21,785 were polymorphic within only one of the groups.
Summary table of Arctic charr reference genome coverage (GCA_002910315.2) by markers included on the 87k Arctic charr SNP genotyping array.
| Chromosome name | NCBI | Number of markers | Average base pair gap between markers | Total chromosome length | Average markers per megabase |
|---|---|---|---|---|---|
| NC_036838.1 | 2275 | 25.5 | 58017 | ||
| NC_036839.1 | 1641 | 26.5 | 43539 | ||
| NC_036840.1 | 1525 | 23.6 | 36001 | ||
| NC_036841.1 | 1035 | 27.3 | 28293 | ||
| NC_036842.1 | 3230 | 28.0 | 90519 | ||
| NC_036843.1 | 1082 | 27.3 | 29596 | ||
| NC_036844.1 | 1414 | 26.2 | 37081 | ||
| NC_036845.1 | 1351 | 22.4 | 30249 | ||
| NC_036846.1 | 937 | 27.7 | 26025 | ||
| NC_036847.1 | 1449 | 23.7 | 34303 | ||
| NC_036848.1 | 2117 | 25.9 | 54842 | ||
| NC_036849.1 | 1285 | 25.4 | 32654 | ||
| NC_036850.1 | 924 | 24.3 | 22457 | ||
| NC_036851.1 | 1768 | 28.9 | 51124 | ||
| NC_036852.1 | 468 | 29.8 | 13981 | ||
| NC_036853.1 | 1923 | 26.5 | 50975 | ||
| NC_036854.1 | 1961 | 27.6 | 54096 | ||
| NC_036855.1 | 2682 | 25.1 | 67329 | ||
| NC_036856.1 | 1623 | 26.4 | 42871 | ||
| NC_036857.1 | 1721 | 24.3 | 41841 | ||
| NC_036858.1 | 2617 | 27.8 | 72741 | ||
| NC_036859.1 | 1692 | 22.6 | 38229 | ||
| NC_036860.1 | 3350 | 23.9 | 79996 | ||
| NC_036861.1 | 292 | 23.6 | 6905 | ||
| NC_036862.1 | 1445 | 26.0 | 37604 | ||
| NC_036863.1 | 1814 | 27.3 | 49633 | ||
| NC_036864.1 | 443 | 25.7 | 11433 | ||
| NC_036865.1 | 962 | 27.2 | 26198 | ||
| NC_036866.1 | 1943 | 25.7 | 49931 | ||
| NC_036867.1 | 1491 | 26.0 | 38733 | ||
| NC_036868.1 | 1213 | 27.0 | 32734 | ||
| NC_036869.1 | 1029 | 25.4 | 26194 | ||
| NC_036870.1 | 1351 | 23.7 | 32007 | ||
| NC_036871.1 | 1594 | 24.1 | 38481 | ||
| NC_036872.1 | 1503 | 25.3 | 38085 | ||
| NC_036873.1 | 311 | 28.7 | 8959 | ||
| NC_036874.1 | 909 | 23.7 | 21596 | ||
| NC_036875.1 | 1327 | 31.0 | 41233 | ||
| NC_036876.1 | 798 | 24.5 | 19547 |
Fig 2Visual representation of the distribution of the array markers among the 39 chromosomes of the Arctic charr genome (GCA_002910315.2).
The black bars in the background represent the chromosome sequence, and each dot represents the location of SNP on the Arctic charr genotyping array. A total of 58,495 SNPs from the array are located along the chromosomes, while 26,425 additional SNPs are found on the unplaced contigs.
Fig 3The distribution of polymorphic markers across the Arctic charr genome identified in four test groups.
A: Fraser strain, Panel B: Nauyuk strain, Panel C: Tree River strain, Panel D: Icelandic.