
Genome-Wide Specific Selection in Three Domestic Sheep Breeds.

Huihua Wang1, Li Zhang1, Jiaxve Cao1, Mingming Wu1, Xiaomeng Ma1, Zhen Liu1, Ruizao Liu1, Fuping Zhao1, Caihong Wei1, Lixin Du1.   

Abstract

BACKGROUND: Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed.
RESULTS: We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study.
CONCLUSIONS: Our results indicate that combining the locus-specific branch length and di statistical approaches can narrow the search range for breed-specific selection. We obtained many credible candidate genes, which not only confirm the results of previous reports but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.

Year:  2015        PMID: 26083354      PMCID: PMC4471085          DOI: 10.1371/journal.pone.0128688

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

China is the largest mutton producer in the world. According to 2012 statistics from the Food and Agriculture Organization of the United Nations, China accounts for almost one third of the world's mutton yield (http://faostat.fao.org/). One reason is that China has a large number of Muslim and Mongolian residents, for whom mutton is the main meat source. Meanwhile, a growing number of Han Chinese also eat mutton. As demand for mutton rises, the deficit in domestic supply grows and annual imports become ever larger. China does not have its own commercial mutton sheep breed, and the average meat production capacity of traditional Chinese breeds is lower than that of breeds in other countries. Therefore, a specialized Chinese sheep breed for meat production needs to be developed. Meat production traits have significant economic importance. Hybridization can quickly improve the meat quality of Chinese sheep, but cannot stabilize the inheritance of desirable traits. Identifying genomic regions that influence meat performance would enable improvement of local Chinese varieties by cross-breeding. This would have real significance, not only for addressing the weakness in Chinese mutton production but also for improving mutton production throughout the world. Selection signal methods have become a popular way to mine genomes for evidence of selection. To detect selection specific to one population, pairwise FST combined with a haplotype approach, such as REHH (relative extended haplotype homozygosity), XPEHH (cross-population extended haplotype homozygosity) [1] or RSB (computed across pairs of populations) [2], can determine which population was selected when dealing with two groups. However, this becomes relatively complex with multiple groups. Global FST, although applicable to multiple groups, cannot determine which breeds have undergone selection.
At present, two methods are better suited to this problem: locus-specific branch lengths (LSBL) and di, both of which detect locus-specific divergence for each breed. LSBL is generally suitable for three or four groups [3], whereas di is suitable for three or more groups [4]. In our previous study, we identified candidate genes associated with growth and meat production traits by using Illumina Ovine SNP50 BeadChip technology and genome-wide association study (GWAS) methodology to analyze three sheep populations, comprising one indigenous Chinese sheep breed and two well-known commercial mutton sheep breeds [5]. Here we applied the same data to identify regions under artificial selection using LSBL and di statistics.

Materials and Methods

Population samples and quality control

We analyzed SNP (single-nucleotide polymorphism) data from our previous GWAS [5]. A total of 322 sheep from three breeds were analyzed: 61 Chinese Mongolian fat-tailed (CMF), 161 German Mutton Merino (GMM) and 100 African white Dorper (AWD) sheep. The selected sheep showed no family structure and included no half-sib families. Two SNP sets were used. For the first set, SNPs were retained only if they met all of the following three criteria: (1) minor allele frequency > 0.01; (2) Hardy–Weinberg equilibrium P-value > 0.000001; (3) location on an autosome. After quality control, 322 individuals and 46,752 SNPs remained in the genetic diversity analysis dataset. The first SNP set was then pruned using the indep-pairwise option, with a non-overlapped window size of 25 SNPs, a step of 5 SNPs and a pairwise r2 threshold of 0.1, resulting in 10,260 independent SNP markers. This pruned set formed the second SNP set, which was used for population analysis.
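The three quality-control criteria can be sketched as a per-SNP filter. This is a minimal illustration and not the authors' pipeline: the function names are hypothetical, genotype counts are assumed to be available per SNP, and a 1-df chi-square test stands in for the Hardy–Weinberg test, whose exact form is not stated in the text.

```python
import math


def maf(n_aa, n_ab, n_bb):
    """Minor allele frequency from genotype counts (AA, AB, BB)."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)
    return min(p, 1 - p)


def hwe_chi2_pvalue(n_aa, n_ab, n_bb):
    """Chi-square (1 df) Hardy-Weinberg P-value.

    Assumption: the paper does not say which HWE test was used;
    the chi-square test is only an illustrative stand-in.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    if min(expected) == 0:
        return 1.0  # monomorphic SNP: no departure can be measured
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of a chi-square with 1 df is erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(chi2 / 2))


def passes_qc(chrom, n_aa, n_ab, n_bb, autosomes=range(1, 27)):
    """Keep a SNP only if it is autosomal, MAF > 0.01 and HWE P > 1e-6."""
    return (
        chrom in autosomes
        and maf(n_aa, n_ab, n_bb) > 0.01
        and hwe_chi2_pvalue(n_aa, n_ab, n_bb) > 1e-6
    )
```

For example, `passes_qc(1, 80, 160, 82)` keeps a common, well-behaved SNP, whereas a SNP with no heterozygotes among 322 animals (`passes_qc(1, 161, 0, 161)`) fails the Hardy–Weinberg criterion.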

Population analyses

Principal component analysis (PCA) was conducted using snpStats in R (http://cran.r-project.org). We constructed two neighbor-joining trees: one from uncorrected p-distances between individuals using SplitsTree software [6], and one from pairwise FST between populations using the R package ape [7].

Statistical analyses

We first calculated pairwise FST between breeds for each locus of the first SNP set using Genepop 4.2.2 software [8]. Breed-specific population differentiation within 300 kb windows across the 26 autosomes was then calculated using locus-specific branch lengths (LSBL) [3] and di statistics [4]. As described in Shriver et al. 2004 [3], LSBL (LGMM, LAWD, LCMF) were calculated from single-locus pairwise FST distances, where LGMM = (GMM-AWD FST + GMM-CMF FST − AWD-CMF FST)/2, LAWD = (GMM-AWD FST + AWD-CMF FST − GMM-CMF FST)/2 and LCMF = (GMM-CMF FST + AWD-CMF FST − GMM-AWD FST)/2. Akey et al. [4] first described how to calculate the di statistic for each SNP: $d_i = \sum_{j \neq i} \frac{F_{ST}^{ij} - E[F_{ST}^{ij}]}{sd[F_{ST}^{ij}]}$, where $E[F_{ST}^{ij}]$ and $sd[F_{ST}^{ij}]$ denote the expected value and standard deviation of pairwise FST values between breeds i and j calculated from all SNPs. Only windows with a minimum of three SNPs were considered. For each breed, windows of significance were determined as those with LSBL or di values falling into the 99th percentile of the empirical distribution.
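Both statistics are simple functions of the per-SNP pairwise FST values, and the sketch below shows one way they could be computed. It is illustrative only: the function names and the dictionary layout are assumptions, and the sample standard deviation is used where the text simply says "standard deviation".

```python
from statistics import mean, stdev


def lsbl(fst_ab, fst_ac, fst_bc):
    """Locus-specific branch lengths for populations A, B and C.

    Each branch is half of (the two distances touching that population
    minus the distance between the other two populations).
    """
    l_a = (fst_ab + fst_ac - fst_bc) / 2
    l_b = (fst_ab + fst_bc - fst_ac) / 2
    l_c = (fst_ac + fst_bc - fst_ab) / 2
    return l_a, l_b, l_c


def d_statistic(fst_by_pair, focal):
    """Per-SNP d_i for the focal breed.

    fst_by_pair maps a breed pair, e.g. ("GMM", "AWD"), to a list of
    per-SNP FST values (hypothetical layout). For every pair that
    includes the focal breed, FST is standardized by the genome-wide
    mean and standard deviation of that pair, then summed across pairs.
    """
    n_snps = len(next(iter(fst_by_pair.values())))
    d = [0.0] * n_snps
    for pair, values in fst_by_pair.items():
        if focal not in pair:
            continue
        mu, sd = mean(values), stdev(values)
        for k, value in enumerate(values):
            d[k] += (value - mu) / sd
    return d
```

As a check on the LSBL definition, any two branch lengths sum to the pairwise FST between the corresponding breeds: with A = GMM, B = AWD and C = CMF, `lsbl(0.19, 0.13, 0.14)` gives branches of about 0.09, 0.10 and 0.04, and 0.09 + 0.10 recovers the GMM-AWD distance of 0.19.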

Gene annotation

We used the latest sheep genome release Ovis_aries_v3.1 (http://www.livestockgenomics.csiro.au/sheep/oar3.1.php) to identify relationships between significant selection windows and ovine genes. Owing to the structural imperfection and incompleteness of the sheep genome sequence (before October, 2012), we also referenced genomic information from other species such as human, cow, mouse and rat.

Results

Population stratification

In the present study, we first performed principal component analysis on a pruned set of 10,260 genome-wide SNPs to characterize the pattern of individual clustering in the sample set. As shown in Fig 1, PC1 (which accounts for 13.01% of the total variance) and PC2 (which accounts for 9.47% of the total variance) both separate all three population samples from each other, consistent with our former study [5].
Fig 1

A. Animals clustered on the basis of principal component (PC) analysis using individual genotypes B. Scree-plot of proportion of variance.

We then calculated pairwise FST [9] for the SNP data generated from the three sheep population samples (Fig 2). According to Wright's criteria [10], we found medium divergence between CMF and GMM (FST = 0.13) and between CMF and AWD (FST = 0.14), and high divergence (FST = 0.19) between AWD and GMM populations. We constructed a simple three-branch phylogeny from pairwise FST values (Fig 2) and also a neighbor-joining (NJ) tree among the individuals (S1 Fig). The results clearly showed that there were no conflicts concerning the origins of individuals assigned to each breed.
Fig 2

Three-branch phylogeny constructed from pairwise FST.

Correlation of two locus specific analysis approaches

Locus-specific branch lengths (LSBL) [3] and di statistics [4] are both summary statistics that measure locus-specific divergence in allele frequencies for each breed based on unbiased estimates of pairwise FST [11]. LSBL is suited to the analysis of three populations, and di is preferred for the analysis of more than three populations. When there are three populations, both approaches can be used. In this study, we calculated genome-wide LSBL and di values. The maximal LCMF and dCMF values were higher than those of the other two breeds (Table 1). The mean LAWD and LGMM were higher than the mean LCMF, and the branch lengths of AWD and GMM were longer than that of CMF (Table 1, Fig 2). In other words, the CMF breed has more loci with shorter LSBL than the other two breeds. Histograms of the distribution of LSBL and di statistics for each breed are shown in Fig 3. AWD and GMM have similar LSBL distributions, whereas GMM and CMF have similar di distributions. Further, we used Pearson's product-moment correlation to estimate the correlation between LSBL and di statistics within each breed. All three breeds showed significant correlation (P-value < 2.2e-16) between the two approaches. The correlations for AWD (r = 0.85) and GMM (r = 0.84) were higher than that for CMF (r = 0.68).
Table 1

Descriptive statistics of LSBL and di values for each breed

Value   Mean (SD)       Min      Max
LAWD    0.079 (0.13)    -0.113   0.838
LGMM    0.071 (0.13)    -0.125   0.914
LCMF    0.045 (0.10)    -0.083   0.940
dAWD    -0.003 (1.62)   -1.859   9.728
dGMM    -0.012 (1.61)   -1.809   10.1
dCMF    -0.010 (1.46)   -1.785   11.352
Fig 3

Histogram of the LSBL and d statistics distribution for each breed.

We also investigated the correlation between LSBL and di statistics in bins of 5000 SNPs, ordered from high to low LSBL value (Fig 4). The highest correlation (r > 0.9) occurred in the bin containing the top 1–5000 SNPs in all breeds. The correlation then declined sharply in the bin of SNPs ranked 5001–10000.
Fig 4

Correlation between LSBL and d statistics.
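The binned comparison behind Fig 4 can be sketched as follows: SNPs are ranked by LSBL from high to low, cut into consecutive bins, and Pearson's r between LSBL and di is computed within each bin. The function names and the handling of a short final bin are assumptions made for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def binned_correlation(lsbl_values, d_values, bin_size=5000):
    """Correlation of LSBL and d_i within bins of SNPs ranked by LSBL.

    SNPs are sorted from highest to lowest LSBL and split into
    consecutive bins of bin_size; a trailing bin with fewer than
    three SNPs is skipped (an assumption, not stated in the text).
    """
    ranked = sorted(zip(lsbl_values, d_values), key=lambda p: p[0], reverse=True)
    correlations = []
    for start in range(0, len(ranked), bin_size):
        chunk = ranked[start:start + bin_size]
        if len(chunk) < 3:
            continue
        xs, ys = zip(*chunk)
        correlations.append(pearson_r(xs, ys))
    return correlations
```

With 46,752 SNPs and the default bin size, this would return ten correlations per breed, the first of which corresponds to the top 1–5000 SNPs.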

Detecting breed specific selection regions

For each breed, we performed two locus-specific analyses to identify candidate regions involved in selection. Both statistics were calculated for autosomal SNPs in 300 kb windows, with a minimum of three SNPs per window, and with populations defined by breed. In total, 46,752 SNPs were evaluated within 7734 windows, ordered from 1 to 7734, averaging 5.97 SNPs per window (SD = 1.6). We defined candidate selection regions as those that fell into the upper 99th percentile of the empirical distribution. Within each breed, 78 windows were considered putative signatures of selection. S1 Fig shows the genome-wide distribution of the two analyses. In total, 259 of the windows met this criterion under both approaches in three breeds. Venn diagrams were produced for the three breeds for LSBL and di, respectively (Fig 5A). Fewer windows overlapped between breeds for LSBL than for the di approach, which suggests that LSBL has a greater ability to detect breed-specific selection than di.
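The windowing and thresholding step can be sketched as below. This is a simplified illustration under stated assumptions: windows are fixed, non-overlapping 300 kb intervals indexed from the start of each chromosome, the per-window summary is the mean of the per-SNP statistic, and the top 1% is taken by rank with rounding up. The function names and the input layout are hypothetical.

```python
from collections import defaultdict
from math import ceil

WINDOW_BP = 300_000  # 300 kb windows


def window_means(snps, min_snps=3):
    """Mean statistic per window.

    snps is an iterable of (chromosome, position_bp, value) tuples.
    Windows with fewer than min_snps SNPs are dropped.
    Returns {(chromosome, window_index): mean_value}.
    """
    buckets = defaultdict(list)
    for chrom, pos, value in snps:
        buckets[(chrom, pos // WINDOW_BP)].append(value)
    return {
        key: sum(values) / len(values)
        for key, values in buckets.items()
        if len(values) >= min_snps
    }


def top_percent_windows(means, fraction=0.01):
    """Windows whose mean falls in the top `fraction` of all windows."""
    n_keep = ceil(len(means) * fraction)
    ranked = sorted(means, key=means.get, reverse=True)
    return set(ranked[:n_keep])
```

Under this rounding rule, 7734 windows yield 78 windows in the top 1%, which agrees with the count reported for each breed.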
Fig 5

A. Former: Venn diagram of selection windows from d approach in three breeds, Latter: Venn diagram of selection windows from LSBL approach in three breeds; B. Venn diagrams of each breed’s selection windows from LSBL and d approaches; C. Venn diagram of specific selection windows in three breeds.

To detect breed-specific selection regions, we merged the window lists generated by the two approaches and identified three subsets of 54 (AWD), 58 (GMM) and 45 (CMF) windows that showed the strongest signature of selection by displaying both high LSBL and high di values (Fig 5B). Because the correlation between the two statistics is lower for CMF than for AWD and GMM, the number of overlapping windows for CMF is smaller than for the other breeds. Finally, five of the final selected windows were selected in two breeds (Fig 5C). Fig 6 shows the LSBL and di values of SNPs in the five overlapping selection windows and in two nearby windows. The plot of LSBL values shows three clusters in each window, but these clusters are not clear for di. Together, the overlapping windows include 23 SNPs. We then investigated the diversity of these SNPs. The distribution of genotypes for each SNP in the three breeds forms a stepladder, with two extreme types and one intermediate type (S3 Fig). Fig 7 illustrates a representative SNP (OAR13_67857725.1) in window 5305. There is clearly a large difference in genotype proportions between AWD and CMF; an overlapping selection window therefore means that the two breeds sharing it differ in this region, and that one or both may have undergone selection.
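The merging step amounts to set operations on window identifiers: an intersection of the two approaches within each breed, followed by a search for windows retained in more than one breed. The sketch below uses hypothetical function names and toy window numbers chosen only for illustration.

```python
def merge_approaches(lsbl_windows, d_windows):
    """Per-breed windows flagged by both LSBL and d_i.

    Both arguments map a breed name to a set of window identifiers.
    """
    return {breed: lsbl_windows[breed] & d_windows[breed] for breed in lsbl_windows}


def shared_between_breeds(selected):
    """Windows retained in more than one breed, with the breeds involved."""
    owners = {}
    for breed, windows in selected.items():
        for window in windows:
            owners.setdefault(window, []).append(breed)
    return {w: sorted(breeds) for w, breeds in owners.items() if len(breeds) > 1}


# Toy example (window numbers are illustrative, not the real lists).
lsbl_top = {"GMM": {1246, 5913, 10}, "CMF": {1246, 2099}, "AWD": {5305}}
d_top = {"GMM": {1246, 5913}, "CMF": {1246, 2099, 7}, "AWD": {5305, 8}}

merged = merge_approaches(lsbl_top, d_top)
overlapping = shared_between_breeds(merged)
specific = {breed: windows - set(overlapping) for breed, windows in merged.items()}
```

In this toy input, window 1246 is retained for both GMM and CMF and is therefore reported as overlapping, while the remaining windows stay breed-specific.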
Fig 6

The two statistics per SNP in three regions, each with three consecutive windows and the selected window in the middle. GMM: green dot, AWD: yellow dot, CMF: red dot.

Fig 7

The diversity of OAR13_67857725.1 SNP in 3 sheep breeds.

We used the latest sheep genome release, Ovis_aries_v3.1 (http://www.livestockgenomics.csiro.au/sheep/oar3.1.php), to identify relationships between significant selection windows and ovine genes. We removed uncharacterized genes and genes that overlapped among the three breeds. In total, 478 non-overlapping selected genes were annotated: 164, 201 and 113 in the GMM, AWD and CMF breeds, respectively (Table 2). Because of selective sweeps, or the hitchhiking effect (the effect of a strongly selected allele at one locus on the frequencies of neutral alleles at a linked locus), fewer genes were in fact selected [12]. We therefore performed a further screen of each selection window, selecting genes located in or near the peak-value SNP of each window. This yielded 46, 51 and 32 candidate genes for the GMM, AWD and CMF breeds, respectively (Table 2, S1, S2 and S3 Tables). We did not screen overlapping windows.
Table 2

Annotation details for breed-specific and overlapping selected regions.

Breed                  No. of selected windows   No. of genes in windows   No. of genes within or near peak SNP
Overlapping selected
GMM & AWD              2                         9
GMM & CMF              2                         3
AWD & CMF              1                         5
Specific selected
GMM                    55                        164                       46
AWD                    51                        201                       51
CMF                    41                        113                       32

Specific selection genes in each breed

Here we identified many genes under selection in each breed. We focused on production, meat, reproduction and health traits because these are highly valued traits in mutton sheep production, and the candidate genes we identified are enriched for these main traits. Below we list some genes previously reported to be important for various traits, grouped by breed (Table 3).
Table 3

Information on the main candidate genes of the three breeds.

Breed  Window  Chr  Region (Mb)    LSBL  di    Candidate gene
GMM    746     1    234.6–234.9    0.33  3.20  IGSF10
GMM    764     1    240–240.3      0.34  4.22  PLSCR2
GMM    1574    2    213.6–213.9    0.43  4.03  FAM113B
GMM    1881    3    61.8–62.1      0.38  3.31  EDAR
GMM    1984    3    93.9–94.2      0.42  4.61  EXOC6B
GMM    2150    3    145.2–145.5    0.37  3.78  PDZRN4
GMM    2212    3    165–165.3      0.35  3.29  NTN4
GMM    2366    3    213–213.3      0.37  3.92  MICAL3
GMM    2999    5    66.9–67.2      0.36  4.24  CCNB2
GMM    3454    6    103.2–103.5    0.42  3.93  STK32B
GMM    4579    10   55.2–55.5      0.34  3.31  EIF3F
GMM    4794    11   39.9–40.2      0.35  3.64  PSMD3, THRA, MSL1*
GMM    5342    13   74.7–75        0.32  3.10  TRHR
GMM    5913    16   31.8–32.1      0.50  5.66  GHR
GMM    6058    17   4.8–5.1        0.48  4.54  TMEM154
GMM    6673    19   59.1–59.4      0.36  3.34  EEFSEC
GMM    7019    22   15–15.3        0.30  2.85  PLCE1
GMM    7041    22   22.2–22.5      0.33  3.28  SUFU
GMM    7402    24   25.8–26.1      0.47  4.90  ATP2A1, APOBR
GMM    7403    24   26.1–26.4      0.45  4.78  ALDOA
AWD    756     1    237.6–237.9    0.35  3.43  HMGB1
AWD    1708    3    7.5–7.8        0.39  3.85  SPTAN1
AWD    2001    3    99.3–99.6      0.32  2.86  IL1RL1
AWD    2367    3    213.3–213.6    0.41  4.58  TRIOBP
AWD    3170    6    13.8–14.1      0.32  2.97  POL*
AWD    3606    7    34.2–34.5      0.30  3.44  SPTBN5
AWD    3749    7    78.6–78.9      0.35  2.86  SLC8A3
AWD    4479    10   21.9–22.2      0.45  4.47  TPTE2
AWD    4504    10   29.7–30        0.39  4.59  B3GALTL
AWD    5270    13   51.9–52.2      0.41  3.67  TMC2
AWD    5438    14   21.3–21.6      0.32  2.63  RPGRIP1L, FTO
AWD    5452    14   25.5–25.8      0.34  2.95  SETD6
AWD    5546    14   57.9–58.2      0.31  2.76  RPL7*
AWD    6360    18   29.4–29.7      0.36  3.35  CIB2
AWD    6623    19   43.5–43.8      0.33  2.76  DNAH3
AWD    6649    19   51.9–52.2      0.35  3.22  SCAP
AWD    7013    22   12.3–12.6      0.33  3.75  NUDT9
CMF    286     1    89.2–89.4      0.32  4.30  SLC16A1
CMF    2099    3    129.6–129.9    0.46  6.02  CRADD
CMF    2322    3    198.6–198.9    0.24  2.55  DERA
CMF    2858    5    22.5–22.8      0.26  2.42  SLC27A6
CMF    3235    6    33.9–34.2      0.28  3.31  FAM190A
CMF    3239    6    36–36.3        0.24  2.71  HERC3
CMF    3380    6    79.5–79.8      0.27  2.73  TECRL
CMF    3605    7    33.9–34.2      0.26  2.85  TYRO3
CMF    3608    7    34.8–35.1      0.24  2.71  CAPN3
CMF    4834    11   52.8–53.1      0.28  2.63  SOCS3
CMF    5918    16   33.6–33.9      0.22  2.64  PRKAA1
CMF    7363    24   11.7–12        0.25  2.86  SHISA9*
CMF    7407    24   27.6–27.9      0.27  3.08  PHKG1

* Candidate gene in our former GWAS study [5] (underlined in the original table).

Specific selection genes in GMM breed

Production traits: Two important genes, TRHR and APOBR, are candidates for association with body mass [13, 14]. PDS5B showed negative covariance between average daily weight gain and backfat thickness [15]. IGSF10 is differentially expressed in cattle with high and low residual feed intake [16]. Meat traits: GHR is a well-known gene that affects not only meat production and quality but also reproduction traits in many species [17, 18]. STK32B is a QTL (quantitative trait locus) for marbling score in Hanwoo [19]. ALDOA, which encodes a glycolytic metabolic enzyme, was expressed at around 2-fold lower levels in the longissimus muscle of Wagyu-sired fetuses at day 195 compared with Piedmontese-sired fetuses [20]. FAM113B is expressed in dairy cattle at a level at least twice that in beef cattle [21]. NTN4 was down-regulated in differentiated adipocytes compared with intramuscular fibroblast-like cells [22]. Reproduction traits: PLSCR2 is a candidate endometrial gene in the regulation of conceptus growth and elongation [23]. EIF3F gene transcripts were more highly enriched in brilliant cresyl blue (BCB)+ oocytes compared with BCB− oocytes [24]. CCNB2 was identified as significantly associated with developmental competence of bovine oocytes [25]. PDZRN4 is associated with sperm motility in Holstein-Friesian cattle and EEFSEC is related to buffalo bull fertility [26, 27]. Health traits: TMEM154 can reduce lentivirus susceptibility in sheep [28], and GWAS indicates that this gene is associated with susceptibility to and control of ovine lentivirus [29]. MICAL3 is associated with immune response traits in Canadian Holstein cattle [30]. ATP2A1 is associated with pseudomyotonia, a muscle function disorder, in cattle [31]. PSMD3 shows significant association with mean corpuscular volume [32]. Other traits: GMM is a Merino fine-wool sheep, so wool traits were also selected during breeding. Unsurprisingly, we found three important genes involved in wool traits.
EDAR is associated with hair thickness in humans [33]. A mutation in Mpzl3, a gene encoding a predicted adhesion protein, is responsible for the rough coat phenotype in mice, with severe skin and hair abnormalities [34]. THRA is located at quantitative. Meanwhile, we found three genes that appear to be associated with milk traits: EXOC6B is a candidate gene for teat morphology and function [35], PLCE1 is associated with total protein weight in milk, and SUFU is associated with the mammary system, somatic cell count and survival [36].

Specific selection genes in AWD breed

Production traits: FTO is associated with BMI in humans and with growth rate and fat mass in pigs [37-40]. SCAP, part of the INSIG-SCAP-SREBP pathway, is involved in obesity risk in Chinese children [41]. Mutations in B3GALTL can cause disproportionate short stature and developmental delay in humans [42]. Reproduction traits: SLC8A3 is a transporter that can potentially increase the availability of L-alanine and L-histidine for gap junctional transfer in oocytes [43]. SETD6 is involved in the transcriptional regulation of gonadotropin-releasing hormone [44]. Health traits: SPTAN1 is a candidate gene for parasite resistance in livestock [45]. CIB2 is associated with interleukin levels in African Americans [46]. HMGB1 is involved in mastitis in dairy cattle [47]. TRIOBP and TMC2 can cause recessive hearing loss in humans [48, 49]. NUDT9 is a candidate gene for an inherited cataract in sheep [50]. Mutations in SPTBN5 and RPGRIP1L cause retinitis pigmentosa [51, 52]. A SNP mutation in DNAH3 is involved in recurrent airway obstruction in European horses [53]. A functional SNP in IL1RL1 is associated with asthma in humans [54]. Other traits: TPTE2 may be directly or indirectly related to epithelial cells or skin development [44] and is a candidate gene associated with wool traits in Chinese Merino sheep [55].

Specific selection genes in CMF breed

Production traits: TECRL is associated with withers height in racing quarter horses [56]. SLC27A6 is part of the peroxisome proliferator-activated receptor (PPAR) signaling pathway, which is associated with carcass conformation in cattle [57]. Meat traits: FAM190A is a QTL associated with weight after slaughter in Hanwoo cattle [58]. CRADD is associated with muscle compactness [59]. PHKG1 causes high glycogen content and low meat quality in pig skeletal muscle [60]. CAPN3 is related to meat quality traits in chickens [61]. Reproduction traits: TYRO3 modulates female reproduction by influencing gonadotropin-releasing hormone [62]. SLC16A1 plays an important role in the transport of mevalonate and ketone bodies [63] and may be involved in differences in reproductive efficiency in cattle [64]. Health traits: SOCS3 is associated with the somatic cell score trait in cattle and is expressed in goat milk fat globules in response to experimental intramammary infection with Staphylococcus aureus [65]. Other traits: Among milk traits, PRKAA1 is associated with fat percentage and may affect fat metabolism, with consequences for milk production traits in cattle [66]. DERA is a positional candidate gene for milk fat percentage in the German Holstein-Friesian population [67]. HERC3 is associated with milk production performance in Chinese Holstein cattle [68].

Overlapping selection regions

According to the above analysis, an overlapping window indicates that the two breeds selected in it differ in that region. Table 4 lists the 17 annotated genes in these overlapping regions.
Table 4

The genes in overlapping selection windows.

                                    LSBL                   di
Chr  Window  SNP No.  Region (Mb)   GMM    AWD    CMF      GMM   AWD   CMF    Selected genes
1    288     4        89.7–90.0     -0.07  0.30   0.24     0.75  3.29  3.17   LRIG2, RPS6
2    1246    5        114.3–114.6   0.33   -0.09  0.26     3.84  0.85  3.60   -
2    1588    4        219.6–219.9   0.29   0.30   -0.04    3.54  3.52  1.80   BCS1L, CYP27A1, PRKAG3, RNF25, STK36, TTLL4, WNT10A, WNT6, ZNF142
6    3241    6        37.2–37.5     0.37   0.00   0.31     5.20  2.55  5.14   FAM184B, NCAPG, LCORL
13   5305    4        62.7–63.0     -0.02  0.49   0.24     2.58  6.08  4.91   RALY, EIF2S2, CHMP4B

Bold font indicates candidate genes. Underlined font indicates values in the top first percentile.

Firstly, two overlapping windows were detected between the GMM and CMF breeds. Window 1246 contains no genes. In the other window, we identified two well-known genes, NCAPG and its near neighbor LCORL, within 37.2–37.5 Mb on OAR6, which are reported to be involved in fetal growth, stillbirth and carcass size in sheep and other livestock (Table 4). GWAS revealed that these two genes are associated with body weight in Australian Merino sheep [69]. Kijas et al. suggest that variation in the NCAPG/LCORL region also influences production traits in sheep [70]. In horses, GWAS indicates LCORL/NCAPG as a candidate region for withers height [71]. In cattle, LCORL and NCAPG are associated with feed intake and weight gain [72] and body frame size [73]. Xu et al. found that LCORL/NCAPG has undergone positive selection in five distinct cattle breeds [74]. Secondly, two windows differ between the AWD and CMF breeds. One region, 89.7 to 90.0 Mb on OAR1 (Table 4), coincides with the LRIG2 and RPS6 genes. RPS6 is a candidate gene in a QTL region affecting growth and reproduction traits in swine [75]. The other region, from 62.7 to 63.0 Mb on OAR13, includes three genes, RALY, EIF2S2 and CHMP4B (Table 4). Another nearby gene, ASIP, regulates pigmentation in mice, while duplication of ASIP in sheep controls a series of alleles for black and white coat color [76]. The ASIP region is one of four known melanoma-susceptibility regions and includes four genes (RALY, EIF2S2, CHMP4B and ASIP) [77]. In the study by Kijas et al., SNP s51670.1 had the peak global FST value in a similar region on OAR13, with ASIP as the candidate gene [78]; here this SNP also had the peak LSBL and di values in window 5305 (Fig 7). In a recent GWAS analysis, ASIP was associated with white versus non-white coat-color variation in sheep [79].
Thirdly, only one region differed between AWD and GMM, at 219.6–219.9 Mb on OAR2 (Table 4). Nine genes are involved (Table 4), three of which have been reported previously. The most important gene is PRKAG3 (protein kinase, AMP-activated, gamma 3 noncatalytic subunit), which increases fatty acid oxidation and glucose uptake to satisfy muscle energy demands [80] and is a candidate gene associated with meat quality and production traits in pigs [81] and cattle [82]. A mutation in PRKAG3 is associated with excess glycogen content in pig skeletal muscle [83]. A recent GWAS indicated that PRKAG3 affects meat pH and color in crossbred commercial pig lines [84]. The other two genes are WNT10A and WNT6, which are strongly co-expressed in human SW480 cells [85]. Wnt6 is an early negative regulator of limb chondrogenesis and ectoderm development in the chicken embryo [86]. Interestingly, Christodoulides et al. identified a proband with early-onset obesity who is heterozygous for a WNT10 C256Y mutation, which blocks adipogenesis [87].

Discussion

Three breeds of sheep were investigated in this study: CMF comes from China, GMM originates from Germany and AWD was originally developed in South Africa. The FST results showed high genetic divergence between GMM and AWD (FST = 0.19) and medium divergence between CMF and GMM (FST = 0.13) or AWD (FST = 0.14). This is consistent with sheep being first domesticated in Asia, in the Fertile Crescent, and then dispersing to Europe and Africa [88]. The PCA and neighbor-joining tree clearly separate these three population samples from each other. In the present study, we used two locus-specific analysis approaches to detect candidate regions targeted by selection. Both are calculated for each breed from pairwise FST. As previously described, the di approach measures the standardized locus-specific deviation in levels of population structure [4], whereas the LSBL approach geometrically isolates allele frequency change [3]. We first compared the values of these two statistics. The two methods were highly correlated, especially in the selected regions. For example, the highest correlation (r > 0.9) occurred among the top 1–5000 SNPs ranked by LSBL in all three breeds. Furthermore, the correlation between LSBL and di was high in AWD and GMM but lower in CMF. This might be related to the evolutionary history of the three breeds. AWD and GMM are notable commercial breeds that were developed under strict selection pressure, whereas CMF is a local breed that has been selected mainly for body weight and conformation in recent years [5]. We then calculated the mean value of each statistic for autosomal SNPs in 300 kb windows for each breed. Interestingly, LSBL had a greater ability to detect specific selection than di. We merged the window lists generated by the two approaches to identify breed-specific selection regions.
In total, 142 windows showed the strongest signature of selection, five of which overlapped between breeds. An overlapping window means that the two breeds differ in that region and that one or both may have undergone selection. We defined candidate genes as those in selection windows located at or near a peak-value SNP. Some genes had been identified in earlier sheep selection studies, such as NF1 and ASIP [78], and RNF180 and GHR [89]. GHR, identified in the GMM breed, is an important growth-related gene that affects not only meat production and quality but also reproduction traits [17, 18]. Two genes, TPTE2 [55] and TMEM154 [29], had been detected in sheep by GWAS. In our previous study, four genes, POL, RPL7, MSL1 and SHISA9, were associated with growth and meat production traits [5]. We note that the two studies share only a few results, although they used the same data. Because the sample sizes were too small, we combined the data from the three populations into a single dataset in our GWAS, whereas here we detected selection specific to each breed separately. Therefore, our study provides additional information for interpreting selection in different domestic sheep breeds. Production, meat, reproduction and health traits of sheep were investigated because these are highly valued traits in mutton sheep production, so the candidate genes are enriched for these main traits. For production traits, two genes, APOBR and FTO, are associated with BMI [14, 37]. For reproduction traits, we found no major genes controlling prolificacy, such as GDF9 and BMPR1B; however, we found some genes that can influence development of the oocyte and sperm. For example, EIF3F, CCNB2 and SLC8A3 affect oocyte development [24, 25, 43], and PDZRN4 and EEFSEC affect sperm [26, 27]. For meat traits, ALDOA, STK32B and FAM190A are related to marbling in cattle [19, 20, 58]. For wool traits, EDAR was selected in the GMM breed and is associated with hair thickness [33].
AWD is characterized by molting, and TPTE2 is related to epithelial cells or skin development [44]. For health traits, we noticed that candidate genes related to disease resistance traits were more common in the commercial mutton breeds than in Chinese Mongolian sheep. This suggests that the artificial selection of Mongolian sheep has not received sufficient attention. An important gene was found, TMEM154, which can control and reduce lentivirus susceptibility in sheep [28, 29]. Currently, there is no vaccine to prevent ovine lentivirus infection and no cost-effective treatment for infected animals. This gene should therefore be used in breeding projects. In the AWD breed, we found many genes associated with disease (other than immune-related genes), including sensory disorders and respiratory system diseases. Interestingly, some genes related to milk traits were selected in the GMM and CMF breeds, both of which are from the Northern hemisphere, but not in AWD. It is worth mentioning that the early growth of Chinese Mongolian sheep is slow compared with commercial breeds. This is because the Chinese Mongolian sheep is a fat-tailed sheep, and deposition of tail fat reduces early growth speed. We therefore focused on the pathways and genes associated with fat formation. Interestingly, five such genes (SOCS2, SOCS3, PPP1CC, PHKG1 and PRKAA1) are in the insulin signaling pathway. SOCS2 and SOCS3 (suppressor of cytokine signaling 2 and 3) regulate insulin signaling in different tissues by acting on the insulin receptor and insulin receptor substrates [90]. PPP1CC, also known as PPP1G, is a subunit of protein phosphatase 1. It is a glycogen-associated phosphatase responsible for dephosphorylation and subsequent inactivation of glycogen synthase and is universal in skeletal muscle [91]. PHKG1 causes high glycogen content and low meat quality in pig skeletal muscle [60].
PRKAA1/2 acts as an energy sensor that responds to an increased AMP/ATP ratio and is known to regulate substrates that mediate metabolic activity, for example by phosphorylating acetyl-CoA carboxylase (ACACA, also known as ACC) [92]. Furthermore, studies have shown that PDGF promotes proliferation and inhibits differentiation of preadipocytes [93, 94]. Real-time quantitative PCR indicates that PDGFD is expressed at a higher level in adipose tissue than in other normal human tissues, except the thyroid [95]. Insulin also stimulates cell growth and differentiation, and promotes the storage of substrates in fat, liver and muscle by stimulating lipogenesis, glycogen and protein synthesis, and inhibiting lipolysis, glycogenolysis and protein breakdown [96]. We therefore suggest that these genes affect fat-tail formation, but this requires further study.

In this study, we also found regions that were differentially selected between breeds; however, the statistics alone could not determine in which breed the candidate gene was selected. In some cases, known breed phenotypes suggest an answer. For instance, CMF has a black head and legs, while AWD is white, so ASIP, a key pigmentation gene, may provide evidence for selection in CMF. By the same reasoning, the LCORL/NCAPG region was selected in GMM, which grows faster and has a larger carcass than CMF. Of course, not all genes can be assigned in this way; for PRKAG3, which affects meat pH and color, the relevant data were lacking. These genes, in addition to RPS6, WNT10A and WNT6, require further study.

Conclusions

In the present study, we used two approaches, LSBL and di statistics, to detect regions under selection in three different sheep breeds (populations). These approaches clearly identified selected regions in each breed and provided many candidate genes, including some well-known genes. Overall, growth, meat and health traits are undergoing different levels of selection in these three breeds, and the focus of selection differs for each breed according to origin, local preferences and environment.
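
As an illustration of the two statistics named above, the sketch below computes locus-specific branch lengths (LSBL) and a di-style score from pairwise FST values. It uses the commonly cited definitions: for populations A, B and C, LSBL_A = (FST_AB + FST_AC - FST_BC) / 2, and di for a focal population is the sum, over the other populations, of the pairwise FST standardized by its genome-wide mean and standard deviation. The function names, the per-SNP (rather than windowed) calculation, the use of the population standard deviation, and all FST values are assumptions made for illustration; they are not taken from this study's pipeline or data.

```python
from statistics import mean, pstdev


def lsbl(fst_ab, fst_ac, fst_bc):
    """Return the branch lengths (A, B, C) for one locus from three pairwise FST values."""
    a = (fst_ab + fst_ac - fst_bc) / 2
    b = (fst_ab + fst_bc - fst_ac) / 2
    c = (fst_ac + fst_bc - fst_ab) / 2
    return a, b, c


def di_scores(fst_by_pair, focal):
    """Return one di-style score per locus for the focal population.

    fst_by_pair maps frozenset({pop1, pop2}) to a list of per-locus FST values.
    Each pairwise FST involving the focal population is standardized by its
    mean and standard deviation across loci, then summed over pairs.
    """
    n_loci = len(next(iter(fst_by_pair.values())))
    scores = [0.0] * n_loci
    for pair, values in fst_by_pair.items():
        if focal not in pair:
            continue
        mu = mean(values)
        sd = pstdev(values)  # assumption: population SD; a sample SD would also be defensible
        if sd == 0:
            continue  # no variation across loci, so this pair contributes nothing
        for k, value in enumerate(values):
            scores[k] += (value - mu) / sd
    return scores


# Hypothetical per-SNP FST values for four loci (made-up numbers, not study data).
toy_fst = {
    frozenset({"GMM", "AWD"}): [0.05, 0.10, 0.40, 0.08],
    frozenset({"GMM", "CMF"}): [0.04, 0.12, 0.35, 0.06],
    frozenset({"AWD", "CMF"}): [0.06, 0.09, 0.07, 0.05],
}

# LSBL for the third locus, with A = GMM, B = AWD, C = CMF.
branch = lsbl(0.40, 0.35, 0.07)

# di-style scores for GMM across the four loci.
di_gmm = di_scores(toy_fst, "GMM")

print("LSBL (GMM, AWD, CMF):", branch)
print("di for GMM:", di_gmm)
```

In this toy input the third locus is strongly differentiated in both comparisons involving GMM but not between AWD and CMF, so it receives both the longest GMM branch and the highest GMM score. That is the pattern both statistics are designed to flag as breed-specific.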

Neighbor-Joining (NJ) phylogeny for 322 sheep. (TIF)

Genomic distribution of LSBL and di in 3 sheep breeds. (TIF)

The diversity of 23 SNPs of 3 sheep breeds. (TIF)

The main candidate genes of specific selections in GMM. (DOCX)

The main candidate genes of specific selections in AWD. (DOCX)

The main candidate genes of specific selections in CMF. (DOCX)
  80 in total

Review 1.  Genetic hitchhiking versus background selection: the controversy and its implications.

Authors:  Wolfgang Stephan
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-04-27       Impact factor: 6.237

2.  A genome-wide scan study identifies a single nucleotide substitution in ASIP associated with white versus non-white coat-colour variation in sheep (Ovis aries).

Authors:  M-H Li; T Tiirikka; J Kantanen
Journal:  Heredity (Edinb)       Date:  2013-09-11       Impact factor: 3.821

3.  The influence of follicle size, FSH-enriched maturation medium, and early cleavage on bovine oocyte maternal mRNA levels.

Authors:  Marina Mourot; Isabelle Dufort; Catherine Gravel; Omran Algriany; Steph Dieleman; Marc-André Sirard
Journal:  Mol Reprod Dev       Date:  2006-11       Impact factor: 2.609

4.  Mutation in Mpzl3, a novel [corrected] gene encoding a predicted [corrected] adhesion protein, in the rough coat (rc) mice with severe skin and hair abnormalities.

Authors:  Tongyu Cao; Peter Racz; Kornelia M Szauter; Gergely Groma; Garrett Y Nakamatsu; Benjamin Fogelgren; Eszter Pankotai; Qing-Ping He; Katalin Csiszar
Journal:  J Invest Dermatol       Date:  2007-02-01       Impact factor: 8.551

5.  genepop'007: a complete re-implementation of the genepop software for Windows and Linux.

Authors:  François Rousset
Journal:  Mol Ecol Resour       Date:  2008-01       Impact factor: 7.090

6.  Genomic signatures reveal new evidences for selection of important traits in domestic cattle.

Authors:  Lingyang Xu; Derek M Bickhart; John B Cole; Steven G Schroeder; Jiuzhou Song; Curtis P Van Tassell; Tad S Sonstegard; George E Liu
Journal:  Mol Biol Evol       Date:  2014-11-26       Impact factor: 16.240

7.  Dominant and recessive deafness caused by mutations of a novel gene, TMC1, required for cochlear hair-cell function.

Authors:  Kiyoto Kurima; Linda M Peters; Yandan Yang; Saima Riazuddin; Zubair M Ahmed; Sadaf Naz; Deidre Arnaud; Stacy Drury; Jianhong Mo; Tomoko Makishima; Manju Ghosh; P S N Menon; Dilip Deshmukh; Carole Oddoux; Harry Ostrer; Shaheen Khan; Sheikh Riazuddin; Prescott L Deininger; Lori L Hampton; Susan L Sullivan; James F Battey; Bronya J B Keats; Edward R Wilcox; Thomas B Friedman; Andrew J Griffith
Journal:  Nat Genet       Date:  2002-02-19       Impact factor: 38.330

8.  Ectodermal Wnt6 is an early negative regulator of limb chondrogenesis in the chicken embryo.

Authors:  Poongodi Geetha-Loganathan; Suresh Nimmagadda; Bodo Christ; Ruijin Huang; Martin Scaal
Journal:  BMC Dev Biol       Date:  2010-03-25       Impact factor: 1.978

9.  Detection of QTL for Carcass Quality on Chromosome 6 by Exploiting Linkage and Linkage Disequilibrium in Hanwoo.

Authors:  J-H Lee; Y Li; J-J Kim
Journal:  Asian-Australas J Anim Sci       Date:  2012-01       Impact factor: 2.509

10.  Genome-wide association scan shows genetic variants in the FTO gene are associated with obesity-related traits.

Authors:  Angelo Scuteri; Serena Sanna; Wei-Min Chen; Manuela Uda; Giuseppe Albai; James Strait; Samer Najjar; Ramaiah Nagaraja; Marco Orrú; Gianluca Usala; Mariano Dei; Sandra Lai; Andrea Maschio; Fabio Busonero; Antonella Mulas; Georg B Ehret; Ashley A Fink; Alan B Weder; Richard S Cooper; Pilar Galan; Aravinda Chakravarti; David Schlessinger; Antonio Cao; Edward Lakatta; Gonçalo R Abecasis
Journal:  PLoS Genet       Date:  2007-07       Impact factor: 5.917

