| Literature DB >> 28819310 |
Xian Qiao1, Rui Su1,2,3,4,5, Yang Wang6,7, Ruijun Wang1,2,3,4, Ting Yang6,7, Xiaokai Li1, Wei Chen8,9, Shiyang He7, Yu Jiang10, Qiwu Xu6,11, Wenting Wan12, Yaolei Zhang6,11, Wenguang Zhang1,2, Jiang Chen6,7, Bin Liu13, Xin Liu6,11, Yixing Fan1, Duoyuan Chen7, Huaizhi Jiang14, Dongming Fang7, Zhihong Liu1,2,3,4, Xiaowen Wang7, Yanjun Zhang1,2,3,4, Danqing Mao7, Zhiying Wang1,2,3,4, Ran Di15, Qianjun Zhao15, Tao Zhong16, Huanming Yang7,17, Jian Wang7,17, Wen Wang5, Yang Dong7,8,9, Xiaoli Chen18,19, Xun Xu20,21, Jinquan Li22,23,24,25.
Abstract
Compared with the commercially available single nucleotide polymorphism (SNP) chip based on the Bead Chip technology, the solution hybrid selection (SHS)-based target enrichment SNP chip is not only design-flexible, but also cost-effective for genotype sequencing. In this study, we propose to design an animal SNP chip using the SHS-based target enrichment strategy for the first time. As an update to the international collaboration on goat research, a 66 K SNP chip for cashmere goat was created from the whole-genome sequencing data of 73 individuals. Verification of this 66 K SNP chip with the whole-genome sequencing data of 436 cashmere goats showed that the SNP call rates was between 95.3% and 99.8%. The average sequencing depth for target SNPs were 40X. The capture regions were shown to be 200 bp that flank target SNPs. This chip was further tested in a genome-wide association analysis of cashmere fineness (fiber diameter). Several top hit loci were found marginally associated with signaling pathways involved in hair growth. These results demonstrate that the 66 K SNP chip is a useful tool in the genomic analyses of cashmere goats. The successful chip design shows that the SHS-based target enrichment strategy could be applied to SNP chip design in other species.Entities:
Mesh:
Year: 2017 PMID: 28819310 PMCID: PMC5561203 DOI: 10.1038/s41598-017-09285-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic comparison between Illumina Infinium assay and solution hybrid selection (SHS)-based targeted sequencing method. (a) The schematic diagram of Illumina Infinium assay. (b) The schematic diagram of SHS-based targeted sequencing method.
Figure 2SNP selection and probe design workflow for the 66 K SNP chip.
Figure 3The characterization of final SNPs selected for designing the 66 K SNP chip. (a) The schematic graph of probe design. The distance between the center of probe and the target SNP is shown. (b) The number of SNPs corresponds to the lengths of scaffolds/chromosomes in the goat genome. (c) The frequency distribution of the spaces between neighboring SNPs. (d) The frequency distribution of the MAFs of the selected SNPs.
Detailed description of sequences generated from catches.
| List | amount or percentage |
|---|---|
| Aggregate length of target flanking region | 19 Mb |
| Aggregate length of baits | 11 Mb |
| Total raw unfiltered sequence | 2,100 Mb |
| Aligned to reference genome | 1,849 Mb (87%) |
| Uniquely aligned to reference genome | 1,666 Mb (78%) |
| Aligned to flanking region | 1,149 Mb (56%) |
| Aligned to off target region | 700 Mb (30%) |
| Target SNP coverage | 95.3–99.8% |
Figure 4The sequencing depths of target SNP and its flanking regions. The curve is the base-by-base sequencing coverage around the target SNP. The probes are marked as the horizontal blue bars. The red dashed line in 0 position indicated the target SNP location.
Detailed depth description of different type of position.
|
|
|
|---|---|
| MAF (<0.05) in 65 K target SNP | 109 (<0.2%) |
| Average sample missing in 65 K target SNP | 5 (1.26%) |
| Depth in 65 K target SNP | ~40X |
| Depth in 200 bp flanking region | ~20X |
| Depth not in target capture region | ~2X |
Figure 5The relationship between GC content and sequencing depths. (a) Average depth of target SNP was 37. (b) Average depth of target SNP was 69. The red lines indicated mean depth. (c) Normalized coverage distribution plots in four different GC% categories.
Figure 6Normalized coverage distribution plots.
Figure 7Population structure analysis of Inner Mongolia cashmere goat (Erlangshan). (a) Population structure analysis using STRUCTURE. Each sample is represented by a vertical bar. Enery color represents one ancestral population and the lenth of each colored seqment in each vertical bar represents the proportion contributed by ancestral populations. (b) PCA using all identified SNPs as markers. Each dots are index to samples, most samples cluster together.
Figure 8Manhattan and quantile-quantile plot of GWAS for fiber fineness in cashmere goat. (a) Manhattan plot of the compressed MLM model for fiber fineness. (b) Quantile-quantile plot of the compressed MLM model for fiber fineness.
Biological inferences analysis of top-hit SNP.
| ID | Symbol name | P value | Signal Pathway | Type | Reference |
|---|---|---|---|---|---|
| K04456 | AKT1 | 9.29E-06 | MAPK&Notch | Hair follicle | Dipoï, N. |
| Yang, Z. Z. | |||||
| Mauro, T. M. | |||||
| K09451 | ALX4 | 7.46E-05 | WNT&MAPK | Hair follicle | Kayserili, H. |
| Boras, K. | |||||
| K00844 | HK1 | 5.21E-06 | TGF&Shh | Hair follicle | Ellis, T. S. I. |
| K05101 | NT-3 growth factor receptor | 1.72E-06 | Hair follicle | Botchkarev, V. A. | |
| K02328 | POLD2 | 1.08E-05 | WNT | Skin | Baldeck, N. |
| K02726 | PSMA2 | 1.08E-05 | WNT | Skin | Park |
| K04440 | MAPK8 (JNKs) | 7.46E-05 | WNT&MAPK | Skin | Schumacher, M. |
| K04963 | RYR3 | 3.04E-05 | TGF&MAPK | Skin | Denda, S. |
| K20183 | VPS39 (TLP) | 5.36E-05 | MAPK | Skin | Wang, X. |
| K16491 | STARD9 | 5.36E-05 | MAPK | Skin | Lin. |