| Literature DB >> 28706593 |
Shudong Liu1, Sangang He2,3, Lei Chen2,3, Wenrong Li2,3, Jiang Di4, Mingjun Liu2,3.
Abstract
Knowledge of linkage disequilibrium (LD) is important for effective genome-wide association studies and accurate genomic prediction. Chinese Merino (Xinjiang type) is well-known fine wool sheep breed. However, the extent of LD across the genome remains unexplored. In this study, we calculated autosomal LD based on genome-wide SNPs of 635 Chinese Merino (Xinjiang type) sheep by Illumina Ovine SNP50 BeadChip. A moderate level of LD (r2 ≥ 0.25) across the whole genome was observed at short distances of 0-10 kb. Further, the ancestral effective population size (Ne ) was analyzed by extent of LD and found that Ne increased with the increase of generations and declined rapidly within the most recent 50 generations, which is consistent with the history of Chinese Merino sheep breeding, initiated in 1971. We also noted that even when the effective population size was estimated across different single chromosomes, Ne only ranged from 140.36 to 183.33 at five generations in the past, exhibiting a rapid decrease compared with that at ten generations in the past. These results indicated that the genetic diversity in Chinese Merino sheep recently decreased and proper protective measures should be taken to maintain the diversity. Our datasets provided essential genetic information to track molecular variations which potentially contribute to phenotypic variation in Chinese Merino sheep.Entities:
Keywords: Chinese Merino (Xinjiang Type) sheep; Effective population size; Genome-wide SNPs; Linkage disequilibrium
Year: 2017 PMID: 28706593 PMCID: PMC5486679 DOI: 10.1007/s13258-017-0539-2
Source DB: PubMed Journal: Genes Genomics ISSN: 1976-9571 Impact factor: 1.839
Summary of SNPs included in the analysis
| Chromosome | Length (Mb) | Number of SNP | Average SNP interval (Mb) | Longest SNP interval (Mb) | Shortest SNP interval (kb) |
|---|---|---|---|---|---|
| 1 | 299.637 | 5180 | 0.058 | 0.913 | 5.291 |
| 2 | 263.109 | 4856 | 0.054 | 0.984 | 1.936 |
| 3 | 242.770 | 4409 | 0.055 | 1.358 | 5.326 |
| 4 | 127.202 | 2375 | 0.054 | 0.557 | 5.405 |
| 5 | 116.343 | 2089 | 0.056 | 0.843 | 0.065 |
| 6 | 129.054 | 2295 | 0.056 | 2.966 | 5.342 |
| 7 | 108.923 | 1952 | 0.056 | 0.105 | 5.303 |
| 8 | 97.814 | 1827 | 0.054 | 0.464 | 5.385 |
| 9 | 100.791 | 1884 | 0.053 | 0.406 | 5.406 |
| 10 | 94.128 | 1621 | 0.058 | 3.419 | 5.286 |
| 11 | 66.878 | 1048 | 0.064 | 0.611 | 5.300 |
| 12 | 86.402 | 1507 | 0.057 | 1.168 | 5.288 |
| 13 | 89.063 | 1492 | 0.060 | 0.903 | 5.478 |
| 14 | 69.303 | 1014 | 0.068 | 1.329 | 0.037 |
| 15 | 90.000 | 1481 | 0.061 | 1.855 | 5.414 |
| 16 | 77.051 | 1372 | 0.056 | 0.424 | 5.691 |
| 17 | 78.614 | 1254 | 0.063 | 0.563 | 5.413 |
| 18 | 72.434 | 1239 | 0.058 | 0.675 | 5.366 |
| 19 | 64.803 | 1104 | 0.059 | 0.473 | 5.678 |
| 20 | 55.394 | 983 | 0.056 | 1.069 | 5.499 |
| 21 | 55.476 | 781 | 0.071 | 2.422 | 5.475 |
| 22 | 55.747 | 960 | 0.058 | 2.258 | 5.312 |
| 23 | 66.685 | 998 | 0.067 | 0.715 | 5.385 |
| 24 | 44.851 | 648 | 0.069 | 0.344 | 5.514 |
| 25 | 48.288 | 885 | 0.055 | 0.590 | 5.514 |
| 26 | 50.044 | 808 | 0.062 | 1.692 | 0.915 |
Statistical summary of linkage disequilibrium (LD) over various distances
| Distance | Average r2 | Number of SNP pairs |
|---|---|---|
| 0–10 KB | 0.25 | 1786 |
| 10–25 KB | 0.17 | 6771 |
| 25–50 KB | 0.11 | 19,821 |
| 50–100 KB | 0.07 | 42,771 |
| 100–500 KB | 0.03 | 330,617 |
| 0.5–1 MB | 0.02 | 406,873 |
| 1–5 MB | 0.015 | 2,398,635 |
| 5–10 MB | 0.010 | 2,657,037 |
Statistical information for average r 2 as distance between pairs of SNP up to 10 Mb for the genome
| CHR | SNP pairs distance | |||||||
|---|---|---|---|---|---|---|---|---|
| 0–10 KB | 10–25 KB | 25–50 KB | 50–100 KB | 100–500 KB | 0.5–1 MB | 1–5 MB | 5–10 MB | |
| CHR1 | 0.247 ± 0.299 | 0.162 ± 0.219 | 0.105 ± 0.176 | 0.066 ± 0.122 | 0.028 ± 0.002 | 0.019 ± 0.029 | 0.016 ± 0.020 | 0.011 ± 0.013 |
| CHR2 | 0.254 ± 0.306 | 0.183 ± 0.252 | 0.114 ± 0.186 | 0.075 ± 0.141 | 0.030 ± 0.058 | 0.020 ± 0.031 | 0.016 ± 0.021 | 0.011 ± 0.013 |
| CHR3 | 0.238 ± 0.304 | 0.156 ± 0.220 | 0.114 ± 0.184 | 0.074 ± 0.133 | 0.032 ± 0.061 | 0.021 ± 0.033 | 0.016 ± 0.021 | 0.011 ± 0.013 |
| CHR4 | 0.252 ± 0.326 | 0.169 ± 0.250 | 0.112 ± 0.181 | 0.073 ± 0.134 | 0.030 ± 0.056 | 0.019 ± 0.030 | 0.015 ± 0.019 | 0.010 ± 0.012 |
| CHR5 | 0.253 ± 0.292 | 0.145 ± 0.224 | 0.098 ± 0.176 | 0.060 ± 0.113 | 0.030 ± 0.051 | 0.020 ± 0.031 | 0.016 ± 0.021 | 0.010 ± 0.012 |
| CHR6 | 0.205 ± 0.267 | 0.144 ± 0.218 | 0.102 ± 0.156 | 0.062 ± 0.119 | 0.030 ± 0.055 | 0.020 ± 0.033 | 0.015 ± 0.020 | 0.011 ± 0.013 |
| CHR7 | 0.239 ± 0.294 | 0.163 ± 0.236 | 0.108 ± 0.171 | 0.063 ± 0.116 | 0.028 ± 0.048 | 0.020 ± 0.030 | 0.016 ± 0.020 | 0.011 ± 0.012 |
| CHR8 | 0.271 ± 0.326 | 0.150 ± 0.233 | 0.097 ± 0.152 | 0.058 ± 0.108 | 0.028 ± 0.048 | 0.019 ± 0.029 | 0.015 ± 0.019 | 0.010 ± 0.012 |
| CHR9 | 0.255 ± 0.307 | 0.182 ± 0.244 | 0.104 ± 0.169 | 0.064 ± 0.119 | 0.028 ± 0.054 | 0.018 ± 0.026 | 0.015 ± 0.019 | 0.010 ± 0.012 |
| CHR10 | 0.303 ± 0.346 | 0.181 ± 0.264 | 0.111 ± 0.196 | 0.073 ± 0.140 | 0.033 ± 0.074 | 0.022 ± 0.043 | 0.017 ± 0.025 | 0.012 ± 0.016 |
| CHR11 | 0.201 ± 0.291 | 0.077 ± 0.157 | 0.072 ± 0.157 | 0.036 ± 0.092 | 0.029 ± 0.054 | 0.019 ± 0.030 | 0.014 ± 0.018 | 0.009 ± 0.010 |
| CHR12 | 0.262 ± 0.277 | 0.170 ± 0.238 | 0.105 ± 0.180 | 0.062 ± 0.105 | 0.026 ± 0.049 | 0.018 ± 0.027 | 0.015 ± 0.018 | 0.010 ± 0.012 |
| CHR13 | 0.237 ± 0.340 | 0.185 ± 0.250 | 0.117 ± 0.201 | 0.076 ± 0.141 | 0.031 ± 0.059 | 0.020 ± 0.030 | 0.015 ± 0.023 | 0.010 ± 0.012 |
| CHR14 | 0.252 ± 0.312 | 0.102 ± 0.199 | 0.069 ± 0.147 | 0.037 ± 0.100 | 0.028 ± 0.053 | 0.019 ± 0.030 | 0.014 ± 0.019 | 0.009 ± 0.011 |
| CHR15 | 0.210 ± 0.286 | 0.181 ± 0.260 | 0.104 ± 0.176 | 0.058 ± 0.110 | 0.026 ± 0.047 | 0.019 ± 0.028 | 0.015 ± 0.018 | 0.010 ± 0.012 |
| CHR16 | 0.271 ± 0.341 | 0.154 ± 0.220 | 0.106 ± 0.172 | 0.061 ± 0.112 | 0.027 ± 0.045 | 0.020 ± 0.030 | 0.015 ± 0.019 | 0.010 ± 0.011 |
| CHR17 | 0.209 ± 0.293 | 0.160 ± 0.244 | 0.086 ± 0.165 | 0.059 ± 0.123 | 0.028 ± 0.054 | 0.019 ± 0.030 | 0.015 ± 0.019 | 0.010 ± 0.012 |
| CHR18 | 0.291 ± 0.309 | 0.144 ± 0.210 | 0.093 ± 0.165 | 0.072 ± 0.127 | 0.025 ± 0.043 | 0.017 ± 0.026 | 0.014 ± 0.018 | 0.010 ± 0.011 |
| CHR19 | 0.183 ± 0.220 | 0.172 ± 0.240 | 0.123 ± 0.205 | 0.064 ± 0.122 | 0.028 ± 0.051 | 0.019 ± 0.029 | 0.014 ± 0.017 | 0.010 ± 0.011 |
| CHR20 | 0.219 ± 0.298 | 0.142 ± 0.227 | 0.090 ± 0.145 | 0.052 ± 0.974 | 0.027 ± 0.045 | 0.018 ± 0.027 | 0.014 ± 0.018 | 0.010 ± 0.012 |
| CHR21 | 0.248 ± 0.307 | 0.099 ± 0.191 | 0.060 ± 0.136 | 0.041 ± 0.094 | 0.025 ± 0.046 | 0.017 ± 0.025 | 0.014 ± 0.018 | 0.010 ± 0.011 |
| CHR22 | 0.278 ± 0.325 | 0.164 ± 0.249 | 0.090 ± 0.161 | 0.059 ± 0.113 | 0.026 ± 0.049 | 0.017 ± 0.026 | 0.014 ± 0.017 | 0.009 ± 0.010 |
| CHR23 | 0.203 ± 0.319 | 0.111 ± 0.175 | 0.092 ± 0.155 | 0.055 ± 0.105 | 0.025 ± 0.043 | 0.018 ± 0.027 | 0.014 ± 0.017 | 0.009 ± 0.011 |
| CHR24 | 0.284 ± 0.292 | 0.091 ± 0.191 | 0.056 ± 0.133 | 0.033 ± 0.091 | 0.025 ± 0.041 | 0.0169 ± 0.025 | 0.013 ± 0.016 | 0.009 ± 0.010 |
| CHR25 | 0.317 ± 0.352 | 0.142 ± 0.216 | 0.092 ± 0.153 | 0.057 ± 0.104 | 0.028 ± 0.054 | 0.018 ± 0.027 | 0.014 ± 0.017 | 0.009 ± 0.010 |
| CHR26 | 0.154 ± 0.148 | 0.139 ± 0.207 | 0.099 ± 0.168 | 0.055 ± 0.105 | 0.028 ± 0.049 | 0.018 ± 0.028 | 0.014 ± 0.018 | 0.014 ± 0.011 |
CHR denotes chromosome
r 2 Means ± SE
Fig. 1Linkage disequilibrium (LD) decay for six different sample sizes from Chinese Merino (Xinjiang type) sheep across all autosomal chromosomes. The X axis represents physical distance; the Y axis represents the average r 2
Fig. 2Average r 2 estimates at physical distance of 50-kb for six (N = 635, 400, 200, 100, 50, 30) population samples. Histograms 1, 2, 3, 4, 5, and 6 represent N = 635, 400, 200, 100, 50, and 30 on the X axis, respectively. The Y axis represents the average r 2. Error bars ± SE. Bars with the same lower-case letter were not significantly different, and those with different lowercase letters were significantly different. Statistical analysis was implemented in SPSS v19 using least-significant differences (LSD) and analysis of variance (ANOVA)
List of candidate genes in biological processes related to wool traits
| Gene symbol | NCBI gene ID(chr) | GO term | GO name | Coordinates (bp) | Gene description |
|---|---|---|---|---|---|
| SETDB2 | 101116064 (10) | GO:0060429 | Epithelium development | 19,339,266–19,403,310 | SET domain, bifurcated 2 |
| VPS36 | 101121612 (10) | GO:0007219 | Notch signaling pathway | 21,876,927–21,896,255 | Vacuolar protein sorting 36 homolog |
| FOXO1 | 101107877 (10) | GO:0060429 | Epithelium development | 22,237,525–22,238,862 | Forkhead box O1 |
| FREM2 | 101101900 (10) | GO:0060429 | Epithelium development | 23,560,959–23,715,430 | FRAS1 related extracellular matrix protein 2 |
| POSTN | 101103329 (10) | GO:0007219 | Notch signaling pathway | 24,474,862–24,508,290 | Periostin, osteoblast specific factor |
| SMAD9 | 101105084 (10) | GO:0060429 | Epithelium development | 25,005,441–25,028,218 | SMAD family member 9 |
| SPG20 | 101105830 (10) | GO:0030509 | BMP signaling pathway | 25,400,303–25,453,683 | spastic paraplegia 20 |
| KL | 101110249 (10) | GO:0007173 | Epidermal growth factor receptor signaling pathway | 28,314,117–28,432,709 | klotho |
| BRCA2 | 101109980 (10) | GO:0060429 | Epithelium development | 28,872,321–28,924,398 | Breast cancer 2 |
| FRY | 101110521 (10) | GO:0060429 | Epithelium development | 28,986,741–29,316,033 | FRY microtubule binding protein |
| HMGB1 | 101112071 (10) | GO:0010631 | Epithelial cell migration | 30,583,831–30,588,285 | High mobility group protein B1 |
| POMP | 101113603 (10) | GO:0060429 | Epithelium development | 31,695,500–31,707,769 | Proteasome maturation protein |
| FLT1 | 443088 (10) | GO:0060429 | Epithelium development | 31,838,211–32,043,089 | Fms-related tyrosine kinase 1 |
| CDK8 | 101115551 (10) | GO:0060429 | Epithelium development | 33,310,224–33,390,657 | Cyclin-dependent kinase 8 |
| ATP8A2 | 101115632 (10) | GO:0043588 | Skin development | 33,449,390–33,534,534 | ATPase phospholipid transporting 8A2 |
| TNFRSF19 | 101116664 (10) | GO:0001942 | Hair follicle development | 34,607,296–34,694,188 | Tumor necrosis factor receptor superfamily, member 19 |
| FGF9 | 101116844 (10) | GO:0007173 | Epidermal growth factor receptor signaling pathway | 35,583,842–35,609,316 | Fibroblast growth factor 9 |
| LATS2 | 101117941 (10) | GO:0043588 | Skin development | 35,862,795–35,885,746 | Large tumor suppressor kinase 2 |
| IFT88 | 101118114 (10) | GO:0002064 | Epithelial cell development | 36,048,434–36,103,776 | Intraflagellar transport 88 |
| KLF5 | 101123641 (10) | GO:0002064 | Epithelial cell development | 48,247,059–48,264,092 | Kruppel-like factor 5 (intestinal) |
| MYCBP2 | 101102993 (10) | GO:0043473 | pigmentation | 52,632,844–52,881,924 | MYC binding protein 2, E3 ubiquitin protein ligase |
| SCEL | 101103405 (10) | GO:0043588 | Skin development | 53,073,650–53,289,357 | Sciellin |
| EDNRB | 443139 (10) | GO:0060429 | Epithelium development | 53,508,345–53,534,498 | Endothelin receptor type B |
| CCDC88C | 101112005 (10) | GO:0060429 | Epithelium development | 54,988,440–55,027,517 | Coiled-coil domain containing 88C |
| EIF3F | 101108947 (10) | GO:0007219 | Notch signaling pathway | 55,307,436–55,338,033 | Eukaryotic translation initiation factor 3 subunit F |
| SPRY2 | 780494 (10) | GO:0060429 | Epithelium development | 55,821,638–55,876,082 | Sprouty RTK signaling antagonist 2 |
| SLITRK6 | 101105831 (10) | GO:0002064 | Epithelial cell development | 61,223,444–61,227,178 | SLIT and NTRK like family member 6 |
| EFNB2 | 101115469 (10) | GO:0010631 | Epithelial cell migration | 81,604,311–81,649,792 | Ephrin B2 |
| FURIN | 780454 (18) | GO:0007219 | Notch signaling pathway | 20,886,453–20,893,295 | Furin (paired basic amino acid cleaving enzyme) |
| TJP1 | 443200 (18) | GO:0002064 | Epithelial cell development | 27,453,888–27,538,010 | Tight junction protein 1 |
| SIN3A | 101106112 (18) | GO:0010631 | Epithelial cell migration | 32,287,440–32,352,962 | SIN3 transcription regulator family member A |
| CSK | 101108956 (18) | GO:0060429 | Epithelium development | 32,781,761–32,811,209 | c-src tyrosine kinase |
| SET | 101118680 (18 | GO:0060429 | Epithelium development | 33,066,496–33,069,337 | SET nuclear proto-oncogene |
| STRA6 | 101120527 (18) | GO:0060429 | Epithelium development | 33,348,745–33,395,596 | Stimulated by retinoic acid 6 |
| PML | 101111580 (18) | GO:0060429 | Epithelium development | 33,464,431–33,515,176 | Promyelocytic leukemia |
| TGIF1 | 101105544 (18) | GO:0060429 | Epithelium development | 34,892,531–34,894,053 | TGFB-induced factor homeobox 1 |
| PRKD1 | 101102016 (18) | GO:0010631 | Epithelial cell migration | 39,035,982–39,190,566 | Protein kinase D1 |
| HECTD1 | 100135435 (18) | GO:0060429 | Epithelium development | 40,626,204–40,697,112 | HECT domain containing E3 ubiquitin protein ligase 1 |
| SNX6 | 101118137 (18) | GO:0007173 | Epidermal growth factor receptor signaling pathway | 43,896,580–43,993,579 | Sorting nexin 6 |
| PSMA6 | 100192424 (18) | GO:0007173 | Epidermal growth factor receptor signaling pathway | 44,447,953–44,468,701 | Proteasome subunit alpha 6 |
| NFKBIA | 780520 (18) | GO:0007219 | Notch signaling pathway | 44,535,115–44,538,036 | NFKB inhibitor alpha |
| TGM7 | 101106629 (18) | GO:0043588 | Skin development | 53,988,800–54,041,218 | Transglutaminase 7 |
| IFT140 | 101109504 (24) | GO:0060429 | Epithelium development | 1,173,624–1,189,733 | Intraflagellar transport 140 |
| LAT | 101110725 (24) | GO:0010631 | Epithelial cell migration | 25,748,047–25,870,792 | Linker for activation of T-cells |
| RASGEF1A | 101120047 (25) | GO:0007173 | Epidermal growth factor receptor signaling pathway | 13,514,772–13,521,910 | RasGEF domain family member 1 A |
| CDH23 | 101107570 (25) | GO:0002064 | Epithelial cell development | 27,373,089–27,820,109 | Cadherin-related 23 |
| DLG5 | 101110731 (25) | GO:0010631 | Epithelial cell migration; Epithelial cell development | 33,385,018–33,473,757 | Discs, large homolog 5 (Drosophila) |
Statistical summary of the effective population sizes of sheep
| SNP pairs distance | |||||||
|---|---|---|---|---|---|---|---|
| 25 KB | 50 KB | 100 KB | 500 KB | 1 MB | 5 MB | 10 MB | |
| Genetic distance | 0.00025 | 0.0005 | 0.001 | 0.005 | 0.01 | 0.05 | 0.1 |
| Generations ago | 2000 | 1000 | 500 | 100 | 50 | 10 | 5 |
| Ne | 4471.67 | 3358.00 | 2477.67 | 1206.44 | 837.07 | 253.51 | 159.77 |
Genetic distance: Morgan
N effective population size
Fig. 3Estimated N for Chinese Merino (Xinjiang type) sheep over time based on linkage disequilibrium data. Y axis represents the effective population size; X axis represents time elapsed (number of generations)
Effective population size of Chinese Merino (Xinjiang type) sheep over time estimated by different autosomal chromosomes
| CHR | Generations | ||||||
|---|---|---|---|---|---|---|---|
| 2000 | 1000 | 500 | 100 | 50 | 10 | 5 | |
| 1 | 4473.99 | 3302.02 | 2469.60 | 1225.03 | 831.14 | 246.26 | 157.76 |
| 2 | 4034.74 | 3096.04 | 2221.71 | 1128.94 | 781.77 | 237.21 | 152.78 |
| 3 | 4782.04 | 3300.12 | 2331.18 | 1106.14 | 753.72 | 232.59 | 151.07 |
| 4 | 4257.22 | 3204.97 | 2311.83 | 1148.25 | 799.50 | 247.53 | 161.54 |
| 5 | 4219.43 | 3245.12 | 2428.83 | 1187.06 | 803.48 | 241.31 | 155.83 |
| 6 | 5380.44 | 3705.36 | 2699.85 | 1204.32 | 804.64 | 245.25 | 157.86 |
| 7 | 4105.49 | 3221.11 | 2496.41 | 1220.42 | 825.24 | 246.26 | 157.76 |
| 8 | 4308.70 | 3537.53 | 2658.67 | 1256.06 | 846.30 | 252.73 | 161.65 |
| 9 | 4145.04 | 3244.30 | 2496.15 | 1240.66 | 866.59 | 257.47 | 165.29 |
| 10 | 3858.85 | 3134.65 | 2281.99 | 1075.61 | 729.60 | 222.27 | 140.36 |
| 11 | 5812.27 | 3531.54 | 2623.76 | 1230.01 | 833.19 | 260.53 | 170.43 |
| 12 | 4294.37 | 3278.60 | 2549.43 | 1288.44 | 885.75 | 259.55 | 165.40 |
| 13 | 3929.02 | 3035.82 | 2157.23 | 1081.27 | 761.41 | 245.00 | 158.34 |
| 14 | 4790.39 | 3482.79 | 2543.48 | 1221.29 | 835.29 | 256.60 | 168.31 |
| 15 | 4326.74 | 3324.72 | 2645.53 | 1308.70 | 878.24 | 260.25 | 166.21 |
| 16 | 4709.33 | 3492.97 | 2632.91 | 1287.08 | 855.59 | 254.07 | 163.91 |
| 17 | 4957.46 | 3770.99 | 2650.23 | 1262.34 | 851.89 | 256.57 | 164.39 |
| 18 | 4460.37 | 3701.68 | 2517.35 | 1301.85 | 896.69 | 265.78 | 169.56 |
| 19 | 4754.47 | 2968.20 | 2370.55 | 1204.71 | 832.40 | 261.01 | 167.05 |
| 20 | 4877.87 | 3948.13 | 3035.15 | 1367.48 | 917.02 | 268.13 | 167.80 |
| 21 | 4620.63 | 3712.66 | 2734.12 | 1354.49 | 925.61 | 271.74 | 172.57 |
| 22 | 4267.18 | 3552.59 | 2685.41 | 1327.79 | 913.80 | 275.73 | 177.36 |
| 23 | 6339.45 | 4193.07 | 2918.57 | 1404.77 | 941.00 | 275.63 | 176.53 |
| 24 | 3630.12 | 3282.15 | 2648.25 | 1366.43 | 936.47 | 286.40 | 183.33 |
| 25 | 4721.48 | 3788.17 | 2862.57 | 1304.14 | 882.97 | 274.49 | 176.85 |
| 26 | 6105.10 | 4064.54 | 2988.76 | 1320.36 | 885.42 | 263.82 | 167.80 |
Fig. 4Boxplot represents the trend in log10(N ) over time. The variability at each time point reflects variation in the estimates among the 26 autosomes. The X axis represents the number of generations in the past; the Y axis represents log10(N )