Literature DB >> 28706593

Estimates of linkage disequilibrium and effective population sizes in Chinese Merino (Xinjiang type) sheep by genome-wide SNPs.

Shudong Liu1, Sangang He2,3, Lei Chen2,3, Wenrong Li2,3, Jiang Di4, Mingjun Liu2,3.   

Abstract

Knowledge of linkage disequilibrium (LD) is important for effective genome-wide association studies and accurate genomic prediction. Chinese Merino (Xinjiang type) is well-known fine wool sheep breed. However, the extent of LD across the genome remains unexplored. In this study, we calculated autosomal LD based on genome-wide SNPs of 635 Chinese Merino (Xinjiang type) sheep by Illumina Ovine SNP50 BeadChip. A moderate level of LD (r2 ≥ 0.25) across the whole genome was observed at short distances of 0-10 kb. Further, the ancestral effective population size (Ne ) was analyzed by extent of LD and found that Ne increased with the increase of generations and declined rapidly within the most recent 50 generations, which is consistent with the history of Chinese Merino sheep breeding, initiated in 1971. We also noted that even when the effective population size was estimated across different single chromosomes, Ne only ranged from 140.36 to 183.33 at five generations in the past, exhibiting a rapid decrease compared with that at ten generations in the past. These results indicated that the genetic diversity in Chinese Merino sheep recently decreased and proper protective measures should be taken to maintain the diversity. Our datasets provided essential genetic information to track molecular variations which potentially contribute to phenotypic variation in Chinese Merino sheep.

Entities:  

Keywords:  Chinese Merino (Xinjiang Type) sheep; Effective population size; Genome-wide SNPs; Linkage disequilibrium

Year:  2017        PMID: 28706593      PMCID: PMC5486679          DOI: 10.1007/s13258-017-0539-2

Source DB:  PubMed          Journal:  Genes Genomics        ISSN: 1976-9571            Impact factor:   1.839


Introduction

Linkage disequilibrium (LD) is the nonrandom co-occurrence of alleles within a chromosome or haplotype, i.e., statistical associations between alleles at separate loci that differ from the expectation for independent, and randomly sampled alleles (Wall and Pritchard 2003). The effective population size (N ) refers to the size of an idealized population that has the same dispersion of gene frequency under random genetic drift or the same degree of inbreeding as the population under consideration (Wright 1938). Both of these are crucial parameters for evaluations of population genetic diversity and can provide a powerful method to characterize and understand the genetic architecture underlying complex traits. With the advent of high-density SNP chips, high-throughput genotyping provides substantial data for more accurate estimate of LD and N . These datasets help to improve our understanding of historical recombination within population (Reich and Lander 2001). LD over long distances can be used to evaluate recent N , while LD at short distances can be applied to estimate ancient N (Hayes et al. 2003). Accordingly, N provides important information for the protection of populations. In domestic livestock, many economic traits have been subjected to natural and artificial selection. These traits are passed on to offspring, and shape pattern of LD in populations. Thus, LD patterns throughout the genome reflect selection in individual breeds in domestic livestock species. In fact, the LD pattern in a population is the basis for genomic prediction, genome-wide association studies (GWAS), and quantitative trait loci (QTL) mapping for complex traits. The extent of LD has been examined in diverse taxa, including pigs (Amaral et al. 2008; Badke et al. 2012), horses (Corbin et al. 2010), cattle (Khatkar 2008; Mokry et al. 2014), and chickens (Qanbari et al. 2010). In the study of sheep, LD has been estimated using microsatellite markers across the genome in the early stage (Meadows et al. 2008).With the development of the Illumina Ovine SNP50 BeadChip, researchers investigated the extent of LD genome-wide in the wild big horn sheep (Miller et al. 2011), Dorper Spanish Churra sheep, Sunite (García-Gámez et al. 2012), German Mutton Merino, Dorper (Zhao et al. 2014), etc. Previous sheep studies revealed that LD in sheep persists for relatively shorter genomic distances compared with LD in other domestic species, and the behavior of LD varies among sheep breeds (Kijas et al. 2012). These findings provided insights into population structure, population diversity and genomic selection programs in sheep. Chinese Merino (Xinjiang type) is the most well-known sheep breed for fine wool production in China. It has been breeding for highly specific purposes, such as fine and soft wool and hornless females, by the introduction of Australian Merino sheep via stages of grading, crossing, and collective reproduction. However, to date, the extent of LD across the genome in Chinese Merino sheep remains unexplored. The objective of this study was to investigate the behavior of LD in the genome of Chinese Merino (Xinjiang type) using ovine 50 K SNP panels.

Materials and methods

Animals

A total of 635 female Chinese Merino (Xinjiang type) sheep were randomly selected. All sheep were born between 2006 and 2012 in Bohu farm and Gongnaisi farm (Xinjiang, China). A very small sample of ear tissue was obtained for genomic DNA preparation using the saturated phenolchloroform method (Sambrook and Russell 2002). DNA samples (2500 ng) with a 260/280 absorbance ratio of ≥1.8 and a DNA concentration of ≥50 ng/µl were submitted for genotyping. The DNA was genotyped using the Illumina Ovine SNP50 BeadChip (Illumina Inc., San Diego, CA,USA) (Teo et al. 2007), which contained 54,241 SNPs and at an average distance of 50.9 kb. The marker-QC (quality control) process included three steps: (i) control of the call rate (≥0.95); (ii) minor allele frequency (MAF) (≥0.05); (iii) correspondence with the Hardy–Weinberg equilibrium (HWE) (P-value > 0.00001).These parameters were calculated using PLINK v1.07.

LD estimation and functional annotation of genes

Two measures, r 2 (William 1974) and D′ (Lewontin 1964), have been proposed to estimate the extent of LD. In this study, we used r 2 as a measure of LD because it is more robust to allele frequency variation than D′ (Ardlie et al. 2002) and population size has a greater influence on D′ than r 2 (Zhao et al. 2007). The r 2 value can be expressed as follows: where PA1, PA2, PB1, and PB2 are the frequencies of each allele at loci A and B, and P11, P12, P21, and P22 are the frequencies of haplotypes A1B1, A1B2, A2B1, and A2B2, respectively. PLINK (Purcell et al. 2007) includes a set of options to calculate pair-wise LD between SNPs and to process these data. The average r 2 values for distances of 0–25 kb, 25–50 kb, 50–100 kb, 100 − 500 kb, 0.5–1 Mb, 1–5 Mb, and 5–10 Mb in 635 Chinese Merino (Xinjiang type) sheep were calculated. LD was estimated as the mean r 2 value for each autosomal chromosome. Moreover, LD was calculated for randomly sampled populations (N = 30, 50, 100, 200, and 400). The four highest average r 2 values for a distance of 0–10 kb were detected on the four individual autosomes (OAR25, 24, 18, and 10). Genes were further evaluated when they were located between the pair of SNPs with r 2 ≥ 0.25 or were within 100 kb upstream or downstream of the pair of SNPs. The annotated gene lists were then used as inputs for a UniProt (http://www.uniprot.org/) analysis to obtain NCBI gene and protein IDs. Functional annotations of identified genes were made using BLAST2.0 and a parameter score of alpha greater than 0.6 was used to obtain gene ontology (GO) annotations (Conesa et al. 2005).

Effective population size estimation and genetic diversity

LD data make it feasible to estimate N . Sved (1971) described the relationship between LD and N using the following equation: Here, r 2 is the LD between different markers, N is the effective population size, and c is the genetic distance between various markers measured in Morgans; n is the chromosome experimental sample size; α = 1 in the absence of mutation and α = 2 if mutation is taken into account; k = 4 for autosomes and k = 2 for the X chromosome. In contemporary studies, physical distance is used instead of genetic distance to estimate population size. A physical distance of 100 kb is approximately equivalent to a genetic distance of 0.1 cM. The genetic distance reflects the effective population size at a certain number of generations in the past according to 1/2c (Hayes et al. 2003). Generally, the formula that assumes the absence of mutation is used to estimate N . Hence, we used k = 4, and c = 1 to calculate N . It is generally agreed that abundant genetic diversity within a livestock species is a prerequisite for coping with potential changes in livestock farming conditions. Given that genetic diversity influences LD and , Chinese Merino (Xinjiang type) were also used to analyze the proportion of polymorphic loci (P ), inbreeding coefficient and gene diversity (H ) using PLINK v1.07 (Purcell et al. 2007).

Results

SNP statistics

After quality control, we identified 46,062 SNPs in Chinese Merino (Xinjiang type) sheep distributed over 26 autosomal chromosomes. SNP information for each autosomal chromosome is summarized in Table 1. The total autosomal chromosome length was 2650.80 Mb, with an average chromosome length of 101.95 Mb; the longest Ovis aries autosomal chromosome was OAR1 (299.64 Mb) and the shortest was OAR24 (44.85 Mb). The average distance between adjacent SNPs was 57.49 kb; the longest adjacent SNP interval was 3.42 Mb within OAR10 and the shortest interval was observed in OAR14.
Table 1

Summary of SNPs included in the analysis

ChromosomeLength (Mb)Number of SNPAverage SNP interval (Mb)Longest SNP interval (Mb)Shortest SNP interval (kb)
1299.63751800.0580.9135.291
2263.10948560.0540.9841.936
3242.77044090.0551.3585.326
4127.20223750.0540.5575.405
5116.34320890.0560.8430.065
6129.05422950.0562.9665.342
7108.92319520.0560.1055.303
897.81418270.0540.4645.385
9100.79118840.0530.4065.406
1094.12816210.0583.4195.286
1166.87810480.0640.6115.300
1286.40215070.0571.1685.288
1389.06314920.0600.9035.478
1469.30310140.0681.3290.037
1590.00014810.0611.8555.414
1677.05113720.0560.4245.691
1778.61412540.0630.5635.413
1872.43412390.0580.6755.366
1964.80311040.0590.4735.678
2055.3949830.0561.0695.499
2155.4767810.0712.4225.475
2255.7479600.0582.2585.312
2366.6859980.0670.7155.385
2444.8516480.0690.3445.514
2548.2888850.0550.5905.514
2650.0448080.0621.6920.915
Summary of SNPs included in the analysis

Extent of LD across the genome

Generally, at a distance of greater than 10 Mb, free recombination is assumed (De Roos et al. 2008; Zhao et al. 2014); accordingly, the range between markers was set at 0–10 Mb to estimate LD between SNP markers. The mean values of r for genetic distances of 0–10 kb, 10–25 kb, 25–50 kb, 50–100 kb, 100–500 kb, 0.5–1 Mb, 1–5 Mb, and 5–10 Mb were 0.25, 0.17, 0.11, 0.07, 0.03, 0.02, 0.015, and 0.010, respectively, as summarized in Table 2. There is a decline in average values of r with the increasing physical distance between SNPs.
Table 2

Statistical summary of linkage disequilibrium (LD) over various distances

DistanceAverage r2 Number of SNP pairs
0–10 KB0.251786
10–25 KB0.176771
25–50 KB0.1119,821
50–100 KB0.0742,771
100–500 KB0.03330,617
0.5–1 MB0.02406,873
1–5 MB0.0152,398,635
5–10 MB0.0102,657,037
Statistical summary of linkage disequilibrium (LD) over various distances The extent of LD was quite different among individual chromosomes. The average r for SNPs separated by intervals 0–10 kb, 10–25 kb, 25–50 kb, 50–100 kb, 100–500 kb, 0.5–1 Mb, 1–5 Mb and 5–10 Mb in each autosomal chromosome are presented in Table 3. However, when the genetic distance exceeded 100 kb, the mean r 2 values exhibited even smaller differences among chromosomes. The mean r 2 values were 0.017–0.022 within 0.5–1 Mb, and were maximal within 100 kb–1 Mb for OAR10.
Table 3

Statistical information for average r 2 as distance between pairs of SNP up to 10 Mb for the genome

CHRSNP pairs distance
0–10 KB10–25 KB25–50 KB50–100 KB100–500 KB0.5–1 MB1–5 MB5–10 MB
CHR10.247 ± 0.2990.162 ± 0.2190.105 ± 0.1760.066 ± 0.1220.028 ± 0.0020.019 ± 0.0290.016 ± 0.0200.011 ± 0.013
CHR20.254 ± 0.3060.183 ± 0.2520.114 ± 0.1860.075 ± 0.1410.030 ± 0.0580.020 ± 0.0310.016 ± 0.0210.011 ± 0.013
CHR30.238 ± 0.3040.156 ± 0.2200.114 ± 0.1840.074 ± 0.1330.032 ± 0.0610.021 ± 0.0330.016 ± 0.0210.011 ± 0.013
CHR40.252 ± 0.3260.169 ± 0.2500.112 ± 0.1810.073 ± 0.1340.030 ± 0.0560.019 ± 0.0300.015 ± 0.0190.010 ± 0.012
CHR50.253 ± 0.2920.145 ± 0.2240.098 ± 0.1760.060 ± 0.1130.030 ± 0.0510.020 ± 0.0310.016 ± 0.0210.010 ± 0.012
CHR60.205 ± 0.2670.144 ± 0.2180.102 ± 0.1560.062 ± 0.1190.030 ± 0.0550.020 ± 0.0330.015 ± 0.0200.011 ± 0.013
CHR70.239 ± 0.2940.163 ± 0.2360.108 ± 0.1710.063 ± 0.1160.028 ± 0.0480.020 ± 0.0300.016 ± 0.0200.011 ± 0.012
CHR80.271 ± 0.3260.150 ± 0.2330.097 ± 0.1520.058 ± 0.1080.028 ± 0.0480.019 ± 0.0290.015 ± 0.0190.010 ± 0.012
CHR90.255 ± 0.3070.182 ± 0.2440.104 ± 0.1690.064 ± 0.1190.028 ± 0.0540.018 ± 0.0260.015 ± 0.0190.010 ± 0.012
CHR100.303 ± 0.3460.181 ± 0.2640.111 ± 0.1960.073 ± 0.1400.033 ± 0.0740.022 ± 0.0430.017 ± 0.0250.012 ± 0.016
CHR110.201 ± 0.2910.077 ± 0.1570.072 ± 0.1570.036 ± 0.0920.029 ± 0.0540.019 ± 0.0300.014 ± 0.0180.009 ± 0.010
CHR120.262 ± 0.2770.170 ± 0.2380.105 ± 0.1800.062 ± 0.1050.026 ± 0.0490.018 ± 0.0270.015 ± 0.0180.010 ± 0.012
CHR130.237 ± 0.3400.185 ± 0.2500.117 ± 0.2010.076 ± 0.1410.031 ± 0.0590.020 ± 0.0300.015 ± 0.0230.010 ± 0.012
CHR140.252 ± 0.3120.102 ± 0.1990.069 ± 0.1470.037 ± 0.1000.028 ± 0.0530.019 ± 0.0300.014 ± 0.0190.009 ± 0.011
CHR150.210 ± 0.2860.181 ± 0.2600.104 ± 0.1760.058 ± 0.1100.026 ± 0.0470.019 ± 0.0280.015 ± 0.0180.010 ± 0.012
CHR160.271 ± 0.3410.154 ± 0.2200.106 ± 0.1720.061 ± 0.1120.027 ± 0.0450.020 ± 0.0300.015 ± 0.0190.010 ± 0.011
CHR170.209 ± 0.2930.160 ± 0.2440.086 ± 0.1650.059 ± 0.1230.028 ± 0.0540.019 ± 0.0300.015 ± 0.0190.010 ± 0.012
CHR180.291 ± 0.3090.144 ± 0.2100.093 ± 0.1650.072 ± 0.1270.025 ± 0.0430.017 ± 0.0260.014 ± 0.0180.010 ± 0.011
CHR190.183 ± 0.2200.172 ± 0.2400.123 ± 0.2050.064 ± 0.1220.028 ± 0.0510.019 ± 0.0290.014 ± 0.0170.010 ± 0.011
CHR200.219 ± 0.2980.142 ± 0.2270.090 ± 0.1450.052 ± 0.9740.027 ± 0.0450.018 ± 0.0270.014 ± 0.0180.010 ± 0.012
CHR210.248 ± 0.3070.099 ± 0.1910.060 ± 0.1360.041 ± 0.0940.025 ± 0.0460.017 ± 0.0250.014 ± 0.0180.010 ± 0.011
CHR220.278 ± 0.3250.164 ± 0.2490.090 ± 0.1610.059 ± 0.1130.026 ± 0.0490.017 ± 0.0260.014 ± 0.0170.009 ± 0.010
CHR230.203 ± 0.3190.111 ± 0.1750.092 ± 0.1550.055 ± 0.1050.025 ± 0.0430.018 ± 0.0270.014 ± 0.0170.009 ± 0.011
CHR240.284 ± 0.2920.091 ± 0.1910.056 ± 0.1330.033 ± 0.0910.025 ± 0.0410.0169 ± 0.0250.013 ± 0.0160.009 ± 0.010
CHR250.317 ± 0.3520.142 ± 0.2160.092 ± 0.1530.057 ± 0.1040.028 ± 0.0540.018 ± 0.0270.014 ± 0.0170.009 ± 0.010
CHR260.154 ± 0.1480.139 ± 0.2070.099 ± 0.1680.055 ± 0.1050.028 ± 0.0490.018 ± 0.0280.014 ± 0.0180.014 ± 0.011

CHR denotes chromosome

r 2 Means ± SE

Statistical information for average r 2 as distance between pairs of SNP up to 10 Mb for the genome CHR denotes chromosome r 2 Means ± SE

Sample size and LD estimates

In order to determine the proper sample size for LD evaluation, population subsets of five different sizes (N = 30, 50, 100, 200, and 400) were randomly selected from the population and the level of LD was estimated for each sub-population. As shown in Fig. 1, as the sample size increased, the physical distance increased and the degree of LD tended to decrease. However, as shown in Fig. 2, within a physical distance of 50 kb, when N = 200 and physical distance was fixed at 50 kb, LD was not significantly different from what observed for the overall samples (P > 0.05).
Fig. 1

Linkage disequilibrium (LD) decay for six different sample sizes from Chinese Merino (Xinjiang type) sheep across all autosomal chromosomes. The X axis represents physical distance; the Y axis represents the average r 2

Fig. 2

Average r 2 estimates at physical distance of 50-kb for six (N = 635, 400, 200, 100, 50, 30) population samples. Histograms 1, 2, 3, 4, 5, and 6 represent N = 635, 400, 200, 100, 50, and 30 on the X axis, respectively. The Y axis represents the average r 2. Error bars ± SE. Bars with the same lower-case letter were not significantly different, and those with different lowercase letters were significantly different. Statistical analysis was implemented in SPSS v19 using least-significant differences (LSD) and analysis of variance (ANOVA)

Linkage disequilibrium (LD) decay for six different sample sizes from Chinese Merino (Xinjiang type) sheep across all autosomal chromosomes. The X axis represents physical distance; the Y axis represents the average r 2 Average r 2 estimates at physical distance of 50-kb for six (N = 635, 400, 200, 100, 50, 30) population samples. Histograms 1, 2, 3, 4, 5, and 6 represent N = 635, 400, 200, 100, 50, and 30 on the X axis, respectively. The Y axis represents the average r 2. Error bars ± SE. Bars with the same lower-case letter were not significantly different, and those with different lowercase letters were significantly different. Statistical analysis was implemented in SPSS v19 using least-significant differences (LSD) and analysis of variance (ANOVA)

Functional annotation of genes

We performed a search for positional candidate genes for the regions with the four highest average r 2 values at 0–10 kb. In total, 280 Ensembl genes were found. Of these, 236 genes were located between pairs of SNPs (r 2 ≥ 0.25), and 44 genes were located within 100 kb upstream and downstream of pairs of SNPs. These genes were evaluated in a GO analysis. In total, 2353, 320, and 395 GO terms related to biological processes, cellular components, and molecular function, respectively, were identified. Ten GO terms (GO:0001942, GO:0043588, GO:0010631, GO: 0 002064, GO:0060429, GO:0043473,GO:0007173, GO:0016055, GO:0007219, and GO:0030509) were related to hair follicle development, skin development, epithelial cell migration, epithelial cell development, epithelium development, pigmentation, epidermal growth factor receptor signaling pathway, Wnt signaling pathway, Notch signaling pathway, and BMP signaling pathway, respectively (Table 4). Among them, 47 genes were related to epithelium and hair follicle development, of which 28 genes were located on OAR10, 14 genes were located on OAR18, 2 genes were located on OAR24, and 3 genes were located on OAR25.
Table 4

List of candidate genes in biological processes related to wool traits

Gene symbolNCBI gene ID(chr)GO termGO nameCoordinates (bp)Gene description
SETDB2101116064 (10)GO:0060429Epithelium development19,339,266–19,403,310SET domain, bifurcated 2
VPS36101121612 (10)GO:0007219Notch signaling pathway21,876,927–21,896,255Vacuolar protein sorting 36 homolog
FOXO1101107877 (10)GO:0060429GO:0007173GO:0016055Epithelium developmentEpidermal growth factor receptor signaling pathwayWnt signaling pathway22,237,525–22,238,862Forkhead box O1
FREM2101101900 (10)GO:0060429Epithelium development23,560,959–23,715,430FRAS1 related extracellular matrix protein 2
POSTN101103329 (10)GO:0007219Notch signaling pathway24,474,862–24,508,290Periostin, osteoblast specific factor
SMAD9101105084 (10)GO:0060429GO:0030509Epithelium developmentBMP signaling pathway25,005,441–25,028,218SMAD family member 9
SPG20101105830 (10)GO:0030509BMP signaling pathway25,400,303–25,453,683spastic paraplegia 20
KL101110249 (10)GO:0007173Epidermal growth factor receptor signaling pathway28,314,117–28,432,709klotho
BRCA2101109980 (10)GO:0060429Epithelium development28,872,321–28,924,398Breast cancer 2
FRY101110521 (10)GO:0060429GO:0016055Epithelium developmentWnt signaling pathway28,986,741–29,316,033FRY microtubule binding protein
HMGB1101112071 (10)GO:0010631Epithelial cell migration30,583,831–30,588,285High mobility group protein B1
POMP101113603 (10)GO:0060429Epithelium development31,695,500–31,707,769Proteasome maturation protein
FLT1443088 (10)GO:0060429Epithelium development31,838,211–32,043,089Fms-related tyrosine kinase 1
CDK8101115551 (10)GO:0060429GO:0007219Epithelium developmentNotch signaling pathway33,310,224–33,390,657Cyclin-dependent kinase 8
ATP8A2101115632 (10)GO:0043588Skin development33,449,390–33,534,534ATPase phospholipid transporting 8A2
TNFRSF19101116664 (10)GO:0001942GO:0043588GO:0002064GO:0060429Hair follicle developmentSkin developmentEpithelial cell developmentEpithelium development34,607,296–34,694,188Tumor necrosis factor receptor superfamily, member 19
FGF9101116844 (10)GO:0007173GO:0016055Epidermal growth factor receptor signaling pathwayWnt signaling pathway35,583,842–35,609,316Fibroblast growth factor 9
LATS2101117941 (10)GO:0043588GO:0060429GO:0016055Skin developmentEpithelium developmentWnt signaling pathway35,862,795–35,885,746Large tumor suppressor kinase 2
IFT88101118114 (10)GO:0002064GO:0060429GO:0007219Epithelial cell developmentEpithelium developmentNotch signaling pathway36,048,434–36,103,776Intraflagellar transport 88
KLF5101123641 (10)GO:0002064GO:0060429Epithelial cell developmentEpithelium development48,247,059–48,264,092Kruppel-like factor 5 (intestinal)
MYCBP2101102993 (10)GO:0043473pigmentation52,632,844–52,881,924MYC binding protein 2, E3 ubiquitin protein ligase
SCEL101103405 (10)GO:0043588GO:0060429Skin developmentEpithelium development53,073,650–53,289,357Sciellin
EDNRB443139 (10)GO:0060429GO:0043473Epithelium developmentPigmentation53,508,345–53,534,498Endothelin receptor type B
CCDC88C101112005 (10)GO:0060429GO:0016055Epithelium developmentWnt signaling pathway54,988,440–55,027,517Coiled-coil domain containing 88C
EIF3F101108947 (10)GO:0007219Notch signaling pathway55,307,436–55,338,033Eukaryotic translation initiation factor 3 subunit F
SPRY2780494 (10)GO:0060429GO:0007173Epithelium developmentEpidermal growth factor receptor signaling pathway55,821,638–55,876,082Sprouty RTK signaling antagonist 2
SLITRK6101105831 (10)GO:0002064GO:0060429Epithelial cell developmentEpithelium development61,223,444–61,227,178SLIT and NTRK like family member 6
EFNB2101115469 (10)GO:0010631GO:0060429Epithelial cell migrationEpithelium development81,604,311–81,649,792Ephrin B2
FURIN780454 (18)GO:0007219Notch signaling pathway20,886,453–20,893,295Furin (paired basic amino acid cleaving enzyme)
TJP1443200 (18)GO:0002064GO:0060429Epithelial cell developmentEpithelium development27,453,888–27,538,010Tight junction protein 1
SIN3A101106112 (18)GO:0010631GO:0060429GO:0016055Epithelial cell migrationEpithelium developmentWnt signaling pathway32,287,440–32,352,962SIN3 transcription regulator family member A
CSK101108956 (18)GO:0060429GO:0043473Epithelium developmentPigmentation32,781,761–32,811,209c-src tyrosine kinase
SET101118680 (18GO:0060429Epithelium development33,066,496–33,069,337SET nuclear proto-oncogene
STRA6101120527 (18)GO:0060429Epithelium development33,348,745–33,395,596Stimulated by retinoic acid 6
PML101111580 (18)GO:0060429Epithelium development33,464,431–33,515,176Promyelocytic leukemia
TGIF1101105544 (18)GO:0060429Epithelium development34,892,531–34,894,053TGFB-induced factor homeobox 1
PRKD1101102016 (18)GO:0010631Epithelial cell migration39,035,982–39,190,566Protein kinase D1
HECTD1100135435 (18)GO:0060429Epithelium development40,626,204–40,697,112HECT domain containing E3 ubiquitin protein ligase 1
SNX6101118137 (18)GO:0007173GO:0016055Epidermal growth factor receptor signaling pathwayWnt signaling pathway43,896,580–43,993,579Sorting nexin 6
PSMA6100192424 (18)GO:0007173GO:0016055Epidermal growth factor receptor signaling pathwayWnt signaling pathway44,447,953–44,468,701Proteasome subunit alpha 6
NFKBIA780520 (18)GO:0007219Notch signaling pathway44,535,115–44,538,036NFKB inhibitor alpha
TGM7101106629 (18)GO:0043588GO:0060429Skin developmentEpithelium development53,988,800–54,041,218Transglutaminase 7
IFT140101109504 (24)GO:0060429Epithelium development1,173,624–1,189,733Intraflagellar transport 140
LAT101110725 (24)GO:0010631GO:0002064GO:0007173GO:0060429Epithelial cell migrationEpithelial cell developmentEpidermal growth factor receptor signaling pathwayEpithelium development25,748,047–25,870,792Linker for activation of T-cells
RASGEF1A101120047 (25)GO:0007173Epidermal growth factor receptor signaling pathway13,514,772–13,521,910RasGEF domain family member 1 A
CDH23101107570 (25)GO:0002064GO:0060429Epithelial cell developmentEpithelium development27,373,089–27,820,109Cadherin-related 23
DLG5101110731 (25)GO:0010631GO:0002064GO:0060429Epithelial cell migration; Epithelial cell developmentEpithelium development33,385,018–33,473,757Discs, large homolog 5 (Drosophila)
List of candidate genes in biological processes related to wool traits

Effective population size and genetic diversity

We calculated N of seven generations of whole autosomal and individual autosomes, respectively. Table 5 summarizes the N of Chinese Merino (Xinjiang type) over time. Estimates of N for Chinese Merino (Xinjiang type) showed an increasing trend with the increase of generations, as shown in Fig. 3. N at 2000 generations ago was approximately 4,471.67 and decreased to 159.77 at five generations ago. The effective population size of Chinese Merino (Xinjiang type) differed depending on chromosome, and this was mainly caused by differences among chromosomes in LD between SNP markers (Table 6; Fig. 4). In the 5th generation (from the present), N was 140.36–183.33. This indicated that the effective population size of Chinese Merino (Xinjiang type) decreased over time, and this decrease was more rapid at 100–50 years ago and recently stabilized. Within the same generation, there were significant differences in effective population size between chromosomes (ANOVA, P < 0.001, F-statistics, SPSSv19). When we calculated the effective population size for each chromosome, we found the value of generations became bigger. Within the current generation, OAR10 exhibited the smallest effective population compared with the other chromosomes. Due to OAR3 exhibit small effective population after 50 generations, OAR3 genetic information was selected in Chinese Merino (Xinjiang type) since 50 generations.
Table 5

Statistical summary of the effective population sizes of sheep

SNP pairs distance
25 KB50 KB100 KB500 KB1 MB5 MB10 MB
Genetic distance0.000250.00050.0010.0050.010.050.1
Generations ago2000100050010050105
Ne 4471.673358.002477.671206.44837.07253.51159.77

Genetic distance: Morgan

N effective population size

Fig. 3

Estimated N for Chinese Merino (Xinjiang type) sheep over time based on linkage disequilibrium data. Y axis represents the effective population size; X axis represents time elapsed (number of generations)

Table 6

Effective population size of Chinese Merino (Xinjiang type) sheep over time estimated by different autosomal chromosomes

CHRGenerations
2000100050010050105
14473.993302.022469.601225.03831.14246.26157.76
24034.743096.042221.711128.94781.77237.21152.78
34782.043300.122331.181106.14753.72232.59151.07
44257.223204.972311.831148.25799.50247.53161.54
54219.433245.122428.831187.06803.48241.31155.83
65380.443705.362699.851204.32804.64245.25157.86
74105.493221.112496.411220.42825.24246.26157.76
84308.703537.532658.671256.06846.30252.73161.65
94145.043244.302496.151240.66866.59257.47165.29
103858.853134.652281.991075.61729.60222.27140.36
115812.273531.542623.761230.01833.19260.53170.43
124294.373278.602549.431288.44885.75259.55165.40
133929.023035.822157.231081.27761.41245.00158.34
144790.393482.792543.481221.29835.29256.60168.31
154326.743324.722645.531308.70878.24260.25166.21
164709.333492.972632.911287.08855.59254.07163.91
174957.463770.992650.231262.34851.89256.57164.39
184460.373701.682517.351301.85896.69265.78169.56
194754.472968.202370.551204.71832.40261.01167.05
204877.873948.133035.151367.48917.02268.13167.80
214620.633712.662734.121354.49925.61271.74172.57
224267.183552.592685.411327.79913.80275.73177.36
236339.454193.072918.571404.77941.00275.63176.53
243630.123282.152648.251366.43936.47286.40183.33
254721.483788.172862.571304.14882.97274.49176.85
266105.104064.542988.761320.36885.42263.82167.80
Fig. 4

Boxplot represents the trend in log10(N ) over time. The variability at each time point reflects variation in the estimates among the 26 autosomes. The X axis represents the number of generations in the past; the Y axis represents log10(N )

Statistical summary of the effective population sizes of sheep Genetic distance: Morgan N effective population size Estimated N for Chinese Merino (Xinjiang type) sheep over time based on linkage disequilibrium data. Y axis represents the effective population size; X axis represents time elapsed (number of generations) Effective population size of Chinese Merino (Xinjiang type) sheep over time estimated by different autosomal chromosomes Boxplot represents the trend in log10(N ) over time. The variability at each time point reflects variation in the estimates among the 26 autosomes. The X axis represents the number of generations in the past; the Y axis represents log10(N ) In general, sheep breeds have roughly comparable levels of polymorphism, with a polymorphic loci (P ) value ranging from 0.767 (Border Leicester) (Kijas et al. 2014) to 0.973 (Badger Faced) (Beynon et al. 2015). Accordingly, expected heterozygosity (H ) was relatively high in all sheep breeds (range 0.306–0.380) (Beynon et al. 2015). For the 46,062 SNPs analyzed in this study, estimates of polymorphic loci and expected heterozygosity were 0.994 and 0.393, respectively. The mean inbreeding coefficient was 0.017, which was lower than that of other sheep breeds in previous studies (Kijas et al. 2012).

Discussion

LD maps have improved the power and precision of association mapping, and are used to define optimal marker spacing. In this study, we analyzed LD in 635 Chinese Merino (Xinjiang type) using 46,062 SNPs distributed over 26 autosomal chromosomes. LD was distinct from populations, and with the sample size increased, the extent of LD droped down. When N = 200, the estimated LD within a physical distance of 50 kb was close to that of the general population. We used r 2 as a measure of LD because it is more robust to allele frequency variation than D′ (Ardlie et al. 2002). For Chinese Merino (Xinjiang type) sheep, a moderate level of LD (r 2 ≥ 0.25) was only observed at marker distances of 0–10 kb. When moving from 10 to 500 kb, average r 2 declined from 0.25 to 0.03. Marker pairs with r 2 ≥ 0.25 were, on average, separated by 10 kb. However, not all marker pairs within 10 kb featured by r 2 ≥ 0.25. Our calculated r 2 values were smaller than those observed in Poll Dorset and Border Leicester sheep, but greater than those of Australia Merino and Suffolk breeds for 0–10 kb (Kijas et al. 2014). Compared to LD calculated by Zhao et al. (2014) within 0–10 Mb, Chinese Merino (Xinjiang type) LD was less than that of German Mutton Merino, but greater than that of Sunite sheep. Meanwhile, the extent of LD was different from the same fragment length on different autosomal chromosomes, which was consistent with previous results of sheep and cows (Edea et al. 2015; Lu et al. 2012; Zhao et al. 2014). Potential causes of these differences include variations in recombination rates between and within chromosomes, heterozygosity, genetic drift, and selection for economic traits. Meat, wool, and milk are the most important economic traits in livestock production. In our analysis, we found 280 genes in 0–10 kb with maximum r 2 values for the first four autosomes (OAR25, 24, 18, and 10), of which 47 genes were involved in biological processes related to wool traits, particularly TNFRSF19, FOXO1, SMAD9, and FGF9. TNFRSF19 is located on OAR10 from 34,589,405 to 34,690,170 bp in sheep. It is a member of the tumor necrosis factor receptor superfamily (Hu et al. 1999) and plays a crucial role in hair follicle development, skin development, epithelial cell development, and epithelium development (Kojima et al. 2000; Pispa et al. 2008). FOXO1 functions on epithelium development and the epidermal growth factor receptor signaling pathway, as well as Wnt signaling pathway (Guan et al. 2015; Mori et al. 2014; Xu et al. 2015). This gene, on OAR10 from 22,235,779 to 22,242,617 bp in sheep, belongs to the forkhead transcription factors. The SMAD9 gene, located on OAR10 from 25,001,877 to 25,031,583 bp in sheep, belongs to the SMAD family. It is involved in epithelium development and the BMP signaling pathway (Tsukamoto et al. 2014; Yoshimoto et al. 2005). FGF9 located on OAR10 from 35,583,772 to 35,610,032 bp, is a member of the fibroblast growth factor family. This gene is involved in the epidermal growth factor receptor signaling pathway and the Wnt signaling pathway (Loke et al. 2013; Zheng et al. 2012). Wang et al. (2014) found 28 significant SNPs in Chinese Merino (Junken type) mainly through a GWAS approach and from these SNPs, 25 genes were found to be affected. The 25 genes are dispersed located on OAR1, OAR2, OAR3, OAR4, OAR5, OAR6, OAR7, OAR8, OAR9, OAR10, OAR13, OAR23 and OAR25. In contrast, we found 280 genes locating on four chromosomes, and these genes showed a non-random distribution. The main reason is that Wang et al. used quantitative trait correlation analysis to detect genes under selection, while we detected genes directly from the perspective of genetic linkage. The genetic difference of these two breeds may also contribute to the distinction of the results. Wang et al. found candidate genes harboured in OAR10 and OAR25. Noteablly, after careful study, we also found some of these candidate genes located at LD blocks, such as CFDP2, TPTE2, NBEA as well as SLC25A5. The LD pattern in a population is generally shaped by selection, mutation rate, recombination rate, consanguinity, genetic drift, and other factors. Moreover, LD and effective population size are closely related to each other. These data can help us understand the evolutionary history and genetic mechanisms of complex traits formation (Reich and Lander 2001). In the present study, we found that level of LD in Chinese merino sheep was lower and the effective population size of Chinese Merino (Xinjiang type) increased with the number of generations increase. The effective population size within the most recent 100 generations decreased more slowly, and a relatively rapid decrease was observed after 100 generations. The rate of decay increased at 50 generations, which corresponds to the time of Chinese Merino (Xinjiang type) breeding. N estimated using each chromosome increased with the number of generations raised and decreased with LD drop down. These results were consistent with the analysis of N for the whole autosomal genome. The effective population size estimated by OAR10 was smaller than other autosomes. The corresponding r 2 value was high. QTL related to horn characters is located on this chromosome, and as expectation, we detected RXFP2 gene, which is a candidate for horn phenotypes (Montgomery et al. 1996). This was consistent with the history of long-term breeding of hornless Merino sheep. OAR3 exhibited a relatively small effective population size after 50 generations (corresponding to the completion of Xinjiang type Merino breeding). This chromosome has QTL related to greasy fleece weight and staple length (Ponz et al. 2001). Keratin family genes related to wool quality were also located on this chromosome (McLaren et al. 1997). Factors that influence LD also influence N according to the formulae used to estimate these parameters. For example, artificial selection will lead to higher LD, and a high LD would result in decreased N . Therefore, the small effective population estimated for OAR10 and three may be explained by selection for characters related to wool quality on the basis of hornless sheep. This breed had a larger effective population size than that of Sunite sheep calculated by Zhao et al. (2014) within the same 2000 generations. This indicated that Chinese Merino (Xinjiang type) sheep may have originated from a large effective population. These results may be related to natural selection of Chinese Merino sheep. We did not estimate present-day N in this study. Therefore, we selected 10 Mb as the maximal fragment length. The average r 2 for Chinese Merino (Xinjiang type) population was 0.010. N of Chinese Merino (Xinjiang type) was determined in this estimate, which may estimates the effective population size in current populations. For genetic diversity, Al-Mamun et al. (2015) examined crosses of Merino × Border Leicester with Poll Dorset (MxBxP; N = 231) and reported an expected heterozygosity of 0.38, but an estimated N of 152, which is lower than that of other sheep breeds. In our study, Chinese Merino (Xinjiang type) displayed the highest genetic diversity as measured by polymorphic loci and gene diversity. Historically, Chinese Merino (Xinjiang type) did not undergo intensive selection, unlike the other sheep breeds. Hence, it is reasonable to assume that Chinese Merino (Xinjiang type) sustain higher levels of genetic variability compare with other sheep breeds. However, the N of Chinese Merino (Xinjiang type) was much lower than that of other sheep breeds, particularly Badger Faced and Llandovery White Faced sheep breeds (Beynon et al. 2015). This difference may be explained by the history of Chinese Merino sheep breeding, in which a few Australia Merino rams were introduced, led to a decrease in N , while the large number of native ewes facilitated the maintenance of high genetic diversity in Chinese Merino. In short, in Chinese Merino (Xinjiang type) population, LD decayed as genomic distance increased and the observed LD values were consistent with previously estimates in other sheep populations. After 50 generations (in the past), N decreased more rapidly than in recent generations and proper protective measures should be taken to maintain the breed resources genetic diversity.
  40 in total

1.  Evaluation of linkage disequilibrium measures between multi-allelic markers as predictors of linkage disequilibrium between single nucleotide polymorphisms.

Authors:  H Zhao; D Nettleton; J C M Dekkers
Journal:  Genet Res       Date:  2007-02       Impact factor: 1.588

2.  Posttranslational modifications of FOXO1 regulate epidermal growth factor receptor tyrosine kinase inhibitor resistance for non-small cell lung cancer cells.

Authors:  Zhi-hong Xu; Wen-wen Shun; Jun-biao Hang; Bei-li Gao; Jia-an Hu
Journal:  Tumour Biol       Date:  2015-06-03

3.  Linkage disequilibrium over short physical distances measured in sheep using a high-density SNP chip.

Authors:  James W Kijas; Laercio Porto-Neto; Sonja Dominik; Antonio Reverter; Rowan Bunch; Russell McCulloch; Ben J Hayes; Rudiger Brauning; John McEwan
Journal:  Anim Genet       Date:  2014-07-17       Impact factor: 3.169

4.  Regulation of ocular lens development by Smad-interacting protein 1 involving Foxe3 activation.

Authors:  Aki Yoshimoto; Yuka Saigou; Yujiro Higashi; Hisato Kondoh
Journal:  Development       Date:  2005-09-14       Impact factor: 6.868

5.  Linkage disequilibrium and persistence of phase in Holstein-Friesian, Jersey and Angus cattle.

Authors:  A P W de Roos; B J Hayes; R J Spelman; M E Goddard
Journal:  Genetics       Date:  2008-07-13       Impact factor: 4.562

6.  Estimation of linkage disequilibrium in four US pig breeds.

Authors:  Yvonne M Badke; Ronald O Bates; Catherine W Ernst; Clint Schwab; Juan P Steibel
Journal:  BMC Genomics       Date:  2012-01-17       Impact factor: 3.969

7.  Smad9 is a new type of transcriptional regulator in bone morphogenetic protein signaling.

Authors:  S Tsukamoto; T Mizuta; M Fujimoto; S Ohte; K Osawa; A Miyamoto; K Yoneyama; E Murata; A Machiya; E Jimi; S Kokabu; T Katagiri
Journal:  Sci Rep       Date:  2014-12-23       Impact factor: 4.379

8.  Linkage disequilibrium and haplotype block structure in a composite beef cattle breed.

Authors:  Fabiana Barichello Mokry; Marcos Eli Buzanskas; Maurício de Alvarenga Mudadu; Daniela do Amaral Grossi; Roberto Hiroshi Higa; Ricardo Vieira Ventura; Andressa Oliveira de Lima; Mehdi Sargolzaei; Sarah Laguna Conceição Meirelles; Flávio Schramm Schenkel; Marcos Vinicius Gualberto Barbosa da Silva; Simone Cristina Méo Niciura; Maurício Mello de Alencar; Danísio Munari; Luciana Correia de Almeida Regitano
Journal:  BMC Genomics       Date:  2014-10-27       Impact factor: 3.969

9.  A genotype calling algorithm for the Illumina BeadArray platform.

Authors:  Yik Y Teo; Michael Inouye; Kerrin S Small; Rhian Gwilliam; Panagiotis Deloukas; Dominic P Kwiatkowski; Taane G Clark
Journal:  Bioinformatics       Date:  2007-09-10       Impact factor: 6.937

10.  Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel.

Authors:  Mehar S Khatkar; Frank W Nicholas; Andrew R Collins; Kyall R Zenger; Julie A L Cavanagh; Wes Barris; Robert D Schnabel; Jeremy F Taylor; Herman W Raadsma
Journal:  BMC Genomics       Date:  2008-04-24       Impact factor: 3.969

View more
  12 in total

1.  Goat CMTM2: mRNA expression profiles of different alternative spliced variants and associations analyses with growth traits.

Authors:  Libang He; Zihong Kang; Yuxin Kang; Weixuan Xiang; Chuanying Pan; Hong Chen; Haijing Zhu; Lei Qu; Xianyong Lan; Xiaoyue Song
Journal:  3 Biotech       Date:  2020-02-22       Impact factor: 2.406

2.  Genome-Wide Linkage Disequilibrium and the Extent of Effective Population Sizes in Six Chinese Goat Populations Using a 50K Single Nucleotide Polymorphism Panel.

Authors:  Haile Berihulay; Rabiul Islam; Lin Jiang; Yuehui Ma
Journal:  Animals (Basel)       Date:  2019-06-13       Impact factor: 2.752

3.  Genome-Wide Runs of Homozygosity, Effective Population Size, and Detection of Positive Selection Signatures in Six Chinese Goat Breeds.

Authors:  Rabiul Islam; Yefang Li; Xuexue Liu; Haile Berihulay; Adam Abied; Gebremedhin Gebreselassie; Qing Ma; Yuehui Ma
Journal:  Genes (Basel)       Date:  2019-11-17       Impact factor: 4.096

4.  Linkage disequilibrium and within-breed genetic diversity in Iranian Zandi sheep.

Authors:  Seyed Mohammad Ghoreishifar; Hossein Moradi-Shahrbabak; Nahid Parna; Pourya Davoudi; Majid Khansefid
Journal:  Arch Anim Breed       Date:  2019-04-02

5.  A Combined Multi-Cohort Approach Reveals Novel and Known Genome-Wide Selection Signatures for Wool Traits in Merino and Merino-Derived Sheep Breeds.

Authors:  Sami Megdiche; Salvatore Mastrangelo; Mohamed Ben Hamouda; Johannes A Lenstra; Elena Ciani
Journal:  Front Genet       Date:  2019-10-25       Impact factor: 4.599

6.  Genome-Wide Analysis Revealed Homozygosity and Demographic History of Five Chinese Sheep Breeds Adapted to Different Environments.

Authors:  Adam Abied; Lei Xu; Bahlibi W Sahlu; Feng Xing; Abulgasim Ahbara; Yabin Pu; Jiang Lin; Haile Berihulay; Rabiul Islam; Xiaohong He; Joram M Mwacharo; Qianjun Zhao; Yuehui Ma
Journal:  Genes (Basel)       Date:  2020-12-09       Impact factor: 4.096

7.  The genomic structure of isolation across breed, country and strain for important South African and Australian sheep populations.

Authors:  Cornelius Nel; Phillip Gurman; Andrew Swan; Julius van der Werf; Margaretha Snyman; Kennedy Dzama; Klint Gore; Anna Scholtz; Schalk Cloete
Journal:  BMC Genomics       Date:  2022-01-04       Impact factor: 3.969

8.  Linkage disequilibrium in Brazilian Santa Inês breed, Ovis aries.

Authors:  Amanda Botelho Alvarenga; Gregori Alberto Rovadoscki; Juliana Petrini; Luiz Lehmann Coutinho; Gota Morota; Matthew L Spangler; Luís Fernando Batista Pinto; Gleidson Giordano Pinto Carvalho; Gerson Barreto Mourão
Journal:  Sci Rep       Date:  2018-06-11       Impact factor: 4.379

9.  The Effect of Integrating Genomic Information into Genetic Evaluations of Chinese Merino Sheep.

Authors:  Chen Wei; Hanpeng Luo; Bingru Zhao; Kechuan Tian; Xixia Huang; Yachun Wang; Xuefeng Fu; Yuezhen Tian; Jiang Di; Xinming Xu; Weiwei Wu; Hanikezi Tulafu; Maerziya Yasen; Yajun Zhang; Wensheng Zhao
Journal:  Animals (Basel)       Date:  2020-03-28       Impact factor: 2.752

10.  Detection of a 4 bp Mutation in the 3'UTR Region of Goat Sox9 Gene and Its Effect on the Growth Traits.

Authors:  Libang He; Yi Bi; Ruolan Wang; Chuanying Pan; Hong Chen; Xianyong Lan; Lei Qu
Journal:  Animals (Basel)       Date:  2020-04-13       Impact factor: 2.752

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.