| Literature DB >> 33329724 |
Tianfu Yang1, Michelle Miller2, David Forgacs3, James Derr3, Paul Stothard1.
Abstract
Genomic technologies have been increasingly applied in livestock production due to their utility in production management and animal genetic improvement. The current project aimed to develop genomic resources for the Canadian bison industry, specifically a parentage verification tool and a subspecies composition tool. Both products stand to help with building and maintaining purebred and crossbred bison populations, and in turn bison conservation and production. The development of this genomic toolkit proceeded in two stages. In the single-nucleotide polymorphism (SNP) discovery and selection stage, raw sequence information from 41 bison samples was analyzed, and approximately 52.5 million candidate biallelic SNPs were discovered from 21 samples with high sequence quality. A set of 19,954 SNPs (2,928 for parentage verification and 17,026 for subspecies composition) were then selected for inclusion on an Axiom myDesign custom array. In the refinement and validation stage, 480 bison were genotyped using the custom SNP panel, and the resulting genotypes were analyzed to further filter SNPs and assess tool performance. In various tests using real and simulated genotypes, the two genomic tools showed excellent performance for their respective tasks. Final SNP sets consisting of 191 SNPs for parentage and 17,018 SNPs for subspecies composition are described. As the first SNP-based genomic toolkit designed for the Canadian bison industry, our results may provide a new opportunity in improving the competitiveness and profitability of the industry in a sustainable manner.Entities:
Keywords: SNP genotyping; bison; genomic tools; parentage verification; subspecies composition
Year: 2020 PMID: 33329724 PMCID: PMC7714993 DOI: 10.3389/fgene.2020.585999
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
List of sequenced bison.
| ID | Type | Location | Source | # Raw reads | % Reads mapped* | Average sequencing depth |
| P1 | Plains | Caprock Canyons State Park, TX | Existing | 33,573,898 | 38% | 0.43 |
| P2 | Plains | Caprock Canyons State Park, TX | Existing | 35,711,512 | 44% | 0.53 |
| P3 | Plains | Caprock Canyons State Park, TX | Existing | 63,428,284 | 46% | 1.01 |
| P4 | Plains | Caprock Canyons State Park, TX | Existing | 71,988,836 | 50% | 1.24 |
| P5 | Plains | the Greater Yellowstone Area | Existing | 345,862,044 | 12% | 1.21 |
| P6 | Plains | the Greater Yellowstone Area | Existing | 300,481,696 | 26% | 2.68 |
| P7 | Plains | Yellowstone National Park | Existing | 171,167,934 | 88% | 5.51 |
| P8 | Plains | Yellowstone National Park | Existing | 193,911,748 | 87% | 6.19 |
| P9 | Plains | Yellowstone National Park | Existing | 285,445,650 | 83% | 8.67 |
| P10 | Plains | Yellowstone National Park | Existing | 329,693,516 | 84% | 10.12 |
| P11 | Plains | Caprock Canyons State Park, TX | New | 764,693,316 | 66% | 23.47 |
| P12 | Plains | Caprock Canyons State Park, TX | New | 843,431,132 | 62% | 23.94 |
| P13 | Plains | Cypress Hills, SK | New | 939,516,516 | 7% | 0.48 |
| P14 | Plains | Drumheller, AB | New | 1,080,363,884 | 8% | 0.59 |
| P15 | Plains | Elk Island National Park, AB | New | 872,769,454 | 63% | 25.40 |
| P16 | Plains | Elk Island National Park, AB | New | 874,658,572 | 67% | 27.36 |
| P17 | Plains | Junction of Bow and Belly Rivers, AB | New | 1,075,618,578 | 8% | 1.68 |
| P18 | Plains | Prince Albert, SK | New | 962,343,124 | 42% | 19.78 |
| P19 | Plains | Red Rock/YNP Turner Ranch | New | 899,967,488 | 78% | 37.39 |
| P20 | Plains | Santa Catalina Island, CA | New | 899,471,994 | 58% | 25.17 |
| P21 | Plains | Santa Catalina Island, CA | New | 1,268,225,668 | 44% | 26.43 |
| P22 | Plains | Swift Current, SK | New | 946,903,882 | 11% | 0.71 |
| P23 | Plains | Unknown | New | 610,183,530 | 11% | 0.48 |
| P24 | Plains | Wind Cave National Park, SD | New | 776,956,818 | 65% | 23.71 |
| P25 | Plains | Wind Cave National Park, SD | New | 853,107,274 | 66% | 26.35 |
| P26 | Plains | Yellowstone National Park | New | 973,793,586 | 73% | 36.37 |
| U1 | Unknown | Unknown | New | 540,095,712 | 2% | 0.08 |
| U2 | Unknown | Unknown | New | 838,346,982 | 1% | 0.12 |
| W1 | Wood | Elk Island National Park, AB | Existing | 12,968,260 | 41% | 0.18 |
| W2 | Wood | Elk Island National Park, AB | Existing | 17,859,638 | 48% | 0.29 |
| W3 | Wood | Elk Island National Park, AB | Existing | 58,711,530 | 45% | 0.91 |
| W4 | Wood | Elk Island National Park, AB | Existing | 75,729,836 | 38% | 0.99 |
| W5 | Wood | Alberta, Canada | New | 465,548,626 | 18% | 2.95 |
| W6 | Wood | Athabasca Lake, SK | New | 974,026,536 | 11% | 1.23 |
| W7 | Wood | Elk Island National Park, AB | New | 979,762,842 | 69% | 34.14 |
| W8 | Wood | Elk Island National Park, AB | New | 991,502,862 | 73% | 36.54 |
| W9 | Wood | Elk Island National Park, AB | New | 986,508,748 | 75% | 37.33 |
| W10 | Wood | Unknown | New | 1,013,582,276 | 4% | 0.44 |
| W11 | Wood | Unknown | New | 1,026,497,582 | 70% | 38.42 |
| W12 | Wood | Wood Buffalo National Park | New | 1,000,759,832 | 24% | 8.75 |
| W13 | Wood | Wood Buffalo National Park | New | 1,014,848,338 | 39% | 15.27 |
FIGURE 1Sequence alignment and SNP calling workflow.
FIGURE 2SNP selection procedure for the custom panel for parentage and subspecies composition analysis.
Confidence level of the subspecies label in the validation population.
| Confidence level | Description | Count in the validation population | ||
| Plains | Wood | Hybrid | ||
| 1 | Absolutely confident. | 203 | 57 | 0 |
| 2 | Somewhat confident. | 31 | 0 | 1 |
| 3 | Less confident. | 31 | 0 | 72 |
| 4 | No data about the confidence | 61 | 5 | 0 |
FIGURE 3Workflow of SNP selection for parentage verification, based on genotype data from the validation population.
FIGURE 4The distribution of the SNPs for parentage verification on autosomes.
FIGURE 5Probability of exclusion (PE) and probability of non-exclusion (PN) for different numbers of top parentage SNPs. Multi-locus PE is the probability to exclude (1) a random unrelated parent when the other parent is known (Q1), (2) a random unrelated parent when the other parent is unknown (Q2), or (3) a random unrelated offspring (Q3).
FIGURE 6Multi-dimensional scaling (MDS) visualization of genetic distances between plains and wood bison determined using the custom panel.
FIGURE 7Multi-dimensional scaling (MDS) visualization of genetic distances between plains and wood bison determined using the custom panel. Cluster inference was performed by k-means clustering.
FIGURE 8Multi-dimensional scaling (MDS) visualization of genetic distances between simulated populations determined using the custom panel.
Estimated subspecies composition for reference population and simulated population.
| # | Population | Plains score (%) | |||
| Median | Mean | SD | |||
| 1 | Reference – Plains | 203 | 100.00 | 99.09 | 2.28 |
| 2 | Reference – Wood | 57 | 0.87 | 2.16 | 3.02 |
| 3 | Plains bison | 31 | 98.67 | 94.81 | 9.81 |
| 4 | Simulated–Plains | 500 | 99.94 | 99.85 | 0.20 |
| 5 | Simulated–Wood | 500 | 0.00 | 0.17 | 0.26 |
| 6 | Simulated – F1 | 500 | 49.95 | 49.97 | 0.43 |
| 7 | Simulated – F2 | 500 | 50.04 | 50.05 | 0.54 |
| 8 | Simulated–Backcross–Plains | 500 | 75.00 | 74.99 | 0.48 |
| 9 | Simulated–Backcross–Wood | 500 | 25.03 | 25.03 | 0.49 |