| Literature DB >> 28079141 |
Qifan Zeng1, Qiang Fu1, Yun Li1, Geoff Waldbieser2, Brian Bosworth2, Shikai Liu1, Yujia Yang1, Lisui Bao1, Zihao Yuan1, Ning Li1, Zhanjiang Liu1.
Abstract
Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such as SNP arrays. In this work, we developed a high-density SNP array with 690,662 unique SNPs (herein 690 K array) that were relatively evenly distributed across the entire genome, and covered 98.6% of the reference genome sequence. Here we also report linkage mapping using the 690 K array, which allowed mapping of over 250,000 SNPs on the linkage map, the highest marker density among all the constructed linkage maps. These markers were mapped to 29 linkage groups (LGs) with 30,591 unique marker positions. This linkage map anchored 1,602 scaffolds of the reference genome sequence to LGs, accounting for over 97% of the total genome assembly. A total of 1,007 previously unmapped scaffolds were placed to LGs, allowing validation and in few instances correction of the reference genome sequence assembly. This linkage map should serve as a valuable resource for various genetic and genomic analyses, especially for GWAS and QTL mapping for genes associated with economically important traits.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28079141 PMCID: PMC5228154 DOI: 10.1038/srep40347
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sample and data size for development of 690 K SNP array.
| Project | Species | Sample size | Data type | Data accession | Reads (x106) |
|---|---|---|---|---|---|
| Gene-associated SNP mining | Channel, blue catfish | 66 | RNA-seq | SRA025099 | 506 |
| Studies on catfish domestication | Channel catfish | 150 | WGS | SRX275877 | 2,347 |
| Gonad development | Channel catfish | 279 | RNA-seq | SRP018265, SRP067841, Unpublished | 1,185 |
| Barbel regeneration | Channel catfish | 90 | RNA-seq | SRX1003145 | 395 |
| Skin regeneration | Channel catfish | 10 | RNA-seq | SRX1003286 | 193 |
| ESC chanllenge | Channel catfish | 180 | RNA-seq | SRP009069 | 197 |
| Columnaris challenge | Channel catfish | 218 | RNA-seq | SRP017689, SRP012586 | 1,108 |
| Feed deprivation | Blue catfish | 30 | RNA-seq | SRP020252 | 194 |
| SNP mining with GBS | Blue catfish | 190 | GBS | SRP039640 | 359 |
| Total | − | 1,213 | − | − | 6,484 |
Summary of SNPs identified in channel and blue catfish.
| Channel catfish | | |||||||
|---|---|---|---|---|---|---|---|---|
| Thompson | Hatchery | Marion | USDA103 | Wild | Subtotal | Blue catfish | Total | |
| Number of putative SNPs | 5,723,886 | 4,270,527 | 4,817,089 | 4,900,634 | 5,488,642 | 8,644,975 | 3,850,662 | 9,662,958 |
| Number of uniquely mapped SNPs | 3,992,954 | 2,861,190 | 3,283,718 | 3,345,046 | 3,852,203 | 5,667,615 | 3,138,197 | 6,535,576 |
| Number of SNPs without adjacent SNPs in 35 bp | 1,487,079 | 1,032,160 | 1,184,548 | 1,225,949 | 1,442,459 | 2,270,690 | 347,958 | 2,512,422 |
| Number of SNPs passed flanking sequence complexity inspection | 1,455,201 | 1,009,026 | 1,155,782 | 1,196,716 | 1,414,789 | 2,199,952 | 340,921 | 2,435,602 |
| Number of bi-allelic SNPs | 1,443,954 | 1,000,560 | 1,146,365 | 1,187,187 | 1,403,727 | 2,184,946 | 332,934 | 2,418,168 |
| Number of SNPs passed Affymetrix | 1,224,483 | 846,622 | 969,303 | 1,004,560 | 1,191,020 | 1,839,478 | 294,593 | 2,046,175 |
Summary of the catfish 690 K SNP array.
| SNP array | Number |
|---|---|
| Number of genic SNPs | 238,484 |
| Number of intergenic SNPs | 452,178 |
| Number of SNPs tiled with single probe | 687,757 |
| Number of SNPs tiled with two probes | 2,905 |
| Channel catfish-specific SNPs | 581,002 |
| Blue catfish-specific SNPs | 44,694 |
| Inter-species SNPs | 19,124 |
| Channel-blue both possessed | 45,842 |
| Strain-specific SNPs | 48,434 |
| Domesticate strain (Thompson) | 12,672 |
| Domesticate strain (Hatchery) | 9,498 |
| Domesticate strain (Marion) | 10,309 |
| Domesticate strain (USDA 103) | 9,333 |
| Wild population | 6,622 |
| Total number of SNPs on array | 690,662 |
Figure 1Summary of SNP metrics.
(A) Distribution of SNP probes based on p-convert values. The x-axis represents the p-convert value; the y-axis represents the number of SNPs; (B) Distribution of SNPs based on MAF. The x-axis represents the Minor Allele Frequency (MAF); the y-axis represents the number of SNPs. (C) Distribution of inter-SNPs spacing for SNPs included on the array. The x-axis represents the length of intervals of SNPs; the y-axis represents the number of SNPs.
Figure 2Genome distribution of SNPs on the 690 K SNP array.
Each of the 29 channel catfish chromosomes was laid out in the x-axis with one million base pairs intervals. The number of SNPs residing in the interval was plotted on the y-axis.
SNP metrics summary.
| Backcross hybrids | Channel catfish | Total | |
|---|---|---|---|
| Samples processed | 84 | 396 | 480 |
| Samples passed QC | 81 (96.4%) | 392 (98.9%) | 473 (98.5%) |
| PolyHighResolution | 292,185 (42.1%) | 326,411 (47.1%) | 401,815 (57.9%) |
| NoMinorHom | 211,980 (30.6%) | 141,410 (20.4%) | 133,803 (19.3%) |
| Total Polymorphic SNPs | 504,265 (72.7%) | 467,821 (67.5%) | 535,618 (77.2%) |
| MonoHighResolution | 93,058 (13.4%) | 111,047 (16.0%) | 62,262 (8.9%) |
| Total converted SNPs | 597,323 (86.1%) | 578,868 (83.5%) | 597,880 (86.1%) |
Figure 3Sample structure identified by multidimensional scaling analysis of IBS distances.
The coordinates were the first two clustering dimensions. Large dots represent parent samples.
Summary of the sex-average linkage map of channel catfish.
| Linkage group | Chromosome | Sex-average map | |||
|---|---|---|---|---|---|
| Mapped markers | Unique positions | Genetic length (cM) | Marker interval | ||
| 1 | 1 | 13,426 | 1,198 | 122.48 | 0.1 |
| 2 | 20 | 6,626 | 1,031 | 97.624 | 0.09 |
| 3 | 2 | 12,838 | 1,340 | 136.613 | 0.1 |
| 4 | 4 | 7,754 | 1,090 | 111.502 | 0.1 |
| 5 | 13 | 9,869 | 1,017 | 91.712 | 0.09 |
| 6 | 3 | 11,507 | 1,312 | 126.625 | 0.1 |
| 7 | 11 | 8,000 | 832 | 83.681 | 0.1 |
| 8 | 7 | 10,999 | 1,216 | 112.518 | 0.09 |
| 9 | 8 | 8,516 | 1,267 | 122.109 | 0.1 |
| 10 | 26 | 6,138 | 850 | 80.721 | 0.09 |
| 11 | 5 | 9,256 | 1,364 | 135.765 | 0.1 |
| 12 | 12 | 8,264 | 1,010 | 84.4 | 0.08 |
| 13 | 18 | 9,330 | 1,286 | 109.138 | 0.08 |
| 14 | 28 | 6,728 | 899 | 86.76 | 0.1 |
| 15 | 15 | 7,672 | 999 | 92.755 | 0.09 |
| 16 | 9 | 9,293 | 1,180 | 126.838 | 0.11 |
| 17 | 6 | 10,052 | 1,286 | 114.924 | 0.09 |
| 18 | 14 | 13,475 | 1,226 | 113.228 | 0.09 |
| 19 | 23 | 7,429 | 1,038 | 89.178 | 0.09 |
| 20 | 10 | 10,797 | 1,050 | 110.144 | 0.1 |
| 21 | 21 | 10,074 | 1,021 | 91.477 | 0.09 |
| 22 | 17 | 5,628 | 771 | 103.989 | 0.13 |
| 23 | 19 | 10,492 | 995 | 98.423 | 0.1 |
| 24 | 24 | 5,361 | 778 | 88.577 | 0.11 |
| 25 | 16 | 7,873 | 994 | 101.438 | 0.1 |
| 26 | 22 | 7,161 | 863 | 91.43 | 0.11 |
| 27 | 27 | 7,907 | 864 | 86.565 | 0.1 |
| 28 | 25 | 6,084 | 1,068 | 104.627 | 0.1 |
| 29 | 29 | 4,538 | 746 | 89.494 | 0.12 |
| Total | 253,087 | 30,591 | 3,004.735 | 0.1 | |
Figure 4Illustration of sex-average linkage map.
Summary of the sex-specific linkage map of channel catfish.
| Linkage group | Female-specific map | Male-specific map | F:M ratio | ||||
|---|---|---|---|---|---|---|---|
| Unique positions | Genetic length (cM) | Marker interval (cM) | Unique positions | Genetic length (cM) | Marker interval (cM) | ||
| 1 | 880 | 150.398 | 0.17 | 764 | 96.932 | 0.13 | 1.55 |
| 2 | 914 | 128.927 | 0.14 | 576 | 80.311 | 0.14 | 1.61 |
| 3 | 1,005 | 167.41 | 0.17 | 797 | 107.838 | 0.14 | 1.55 |
| 4 | 798 | 133.264 | 0.17 | 608 | 94.902 | 0.16 | 1.4 |
| 5 | 797 | 107.165 | 0.13 | 564 | 76.823 | 0.14 | 1.39 |
| 6 | 1,020 | 147.84 | 0.14 | 1,088 | 154.592 | 0.14 | 0.96 |
| 7 | 593 | 98.666 | 0.17 | 515 | 69.364 | 0.13 | 1.42 |
| 8 | 951 | 139.04 | 0.15 | 642 | 89.167 | 0.14 | 1.56 |
| 9 | 1,006 | 150.924 | 0.15 | 754 | 97.856 | 0.13 | 1.54 |
| 10 | 643 | 94.239 | 0.15 | 498 | 72.248 | 0.15 | 1.3 |
| 11 | 1,012 | 145.177 | 0.14 | 883 | 127.862 | 0.14 | 1.14 |
| 12 | 718 | 97.291 | 0.14 | 640 | 72.927 | 0.11 | 1.33 |
| 13 | 975 | 123.13 | 0.13 | 663 | 95.595 | 0.14 | 1.29 |
| 14 | 690 | 102.484 | 0.15 | 517 | 72.634 | 0.14 | 1.41 |
| 15 | 754 | 112.926 | 0.15 | 607 | 73.845 | 0.12 | 1.53 |
| 16 | 939 | 154.868 | 0.16 | 639 | 101.333 | 0.16 | 1.53 |
| 17 | 924 | 134.912 | 0.15 | 760 | 96.545 | 0.13 | 1.4 |
| 18 | 1,030 | 152.931 | 0.15 | 610 | 74.318 | 0.12 | 2.06 |
| 19 | 814 | 103.42 | 0.13 | 552 | 75.193 | 0.14 | 1.38 |
| 20 | 773 | 134.035 | 0.17 | 592 | 84.287 | 0.14 | 1.59 |
| 21 | 753 | 94.889 | 0.13 | 612 | 89.754 | 0.15 | 1.06 |
| 22 | 700 | 128.097 | 0.18 | 502 | 87.469 | 0.17 | 1.46 |
| 23 | 808 | 126.172 | 0.16 | 578 | 75.567 | 0.13 | 1.67 |
| 24 | 632 | 104.576 | 0.17 | 462 | 72.957 | 0.16 | 1.43 |
| 25 | 693 | 113.383 | 0.16 | 662 | 89.252 | 0.13 | 1.27 |
| 26 | 637 | 104.682 | 0.16 | 504 | 78.919 | 0.16 | 1.33 |
| 27 | 777 | 108.956 | 0.14 | 528 | 68.471 | 0.13 | 1.59 |
| 28 | 755 | 112.341 | 0.15 | 697 | 98.77 | 0.14 | 1.14 |
| 29 | 619 | 110.117 | 0.18 | 525 | 69.86 | 0.13 | 1.58 |
| Total | 23,610 | 3,582.26 | 0.15 | 18,339 | 2,545.591 | 0.14 | 1.41 |
Figure 5Concordance of SNP marker positions on reference sequence with those on genetic linkage map.
The x-axis represents the physical position (Mb) of markers, the y-axis represents the genetic positions of markers on the linkage map (cM). Dark blue dots represent unique markers from previously mapped scaffolds; orange dots represent markers from previously unmapped genome or transcriptome sequences; markers in light blue circles were duplicated markers from previously mapped scaffolds.
Figure 6Genetic map suggested reorganization of scf00172 (LG8), scf00249 (LG11), scf00274 (LG12), scf00439 and scf00436 (LG22).