| Literature DB >> 28751500 |
Hengxing Ba1, Boyin Jia1, Guiwu Wang1, Yifeng Yang1, Gilead Kedem2, Chunyi Li3.
Abstract
Sika deer are an economically valuable species owing to their use in traditional Chinese medicine, particularly their velvet antlers. Sika deer in northeast China are mostly farmed in enclosure. Therefore, genetic management of farmed sika deer would benefit from detailed knowledge of their genetic diversity. In this study, we generated over 1.45 billion high-quality paired-end reads (288 Gbp) across 42 unrelated individuals using double-digest restriction site-associated DNA sequencing (ddRAD-seq). A total of 96,188 (29.63%) putative biallelic SNP loci were identified with an average sequencing depth of 23×. Based on the analysis, we found that the majority of the loci had a deficit of heterozygotes (FIS >0) and low values of Hobs, which could be due to inbreeding and Wahlund effects. We also developed a collection of high-quality SNP probes that will likely be useful in a variety of applications in genotyping for cervid species in the future.Entities:
Keywords: GenPred; Genomic Selection; SNP discovery; STACKS; Shared Data Resources; ddRAD-seq; genetic diversity; sika deer
Mesh:
Year: 2017 PMID: 28751500 PMCID: PMC5592941 DOI: 10.1534/g3.117.300082
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Filtering results for STACKS pipeline
| Quality Filtering Steps | Number of ddRAD Loci | % |
|---|---|---|
| Populating observed ddRAD loci for 42 animals | 7,576,300 | 100.0 |
| Removing ddRAD loci below the minimum depth of 5× | 7,456,091 | 98.4 |
| Removing ddRAD loci below the log likelihood threshold of −8 | 1,989,000 | 26.3 |
| Removing ddRAD loci below the minimum constraint of 32 (75%) individuals | 324,564 | 4.3 |
| Putative biallelic SNP (MAF ≥0.05) | 98,166 | 1.3 |
Figure 1Evaluation of putative SNP quality. (A) Distribution of SNP frequency on ddRAD tag positions. Distribution of the identified Ts/Tv is also illustrated. (B) Distribution of six types of substitutions. (C) Distribution of SNP frequency in major allele, minor allele, and ddRAD loci against their sequencing depth. (D) Distribution of SNP frequency of the number of SNPs in ddRAD loci tags.
Figure 2Population genetic parameters for putative SNP loci. (A) Frequency of SNPs across the MAF bins. The line depicts the cumulative density function across the MAF bins. (B) Frequency of SNPs across the heterozygosity bins. (C) Frequency of SNPs across the FIS bins.
Figure 3Population genetic structure. (A) STRUCTURE analysis for K = 2 and 3 for 96,188 SNPs. (B) PCA analysis for 96,188 SNPs. (C) STRUCTURE analysis for K = 2 and 3 for 13,739 SNPs. (D) PCA analysis for 13,739 SNPs.
Figure 4Comparison of SNP frequency distribution between all putative SNPs and high-quality SNPs.
Information of SNP frequency among four species
| Species | Study Population | Sample Size | Average SNP Frequency in Individual (kb−1) | Average Depth | Reference |
|---|---|---|---|---|---|
| Sika Deer | Deer from eight farms | 42 | 0.74 | 23× | This study |
| Milu (Pere David’s Deer) | Captive population | 5 | 0.51 | NA | |
| Panda | Wild population | 34 | 1.32 | 4.7× | |
| Holstein-Friesian Cows | Breeding population | 32 | 1.35 | 14.3× |
NA, not applicable.