Genome-wide genetic diversity and differentially selected regions among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep.

Lifan Zhang, Michelle R Mousel, Xiaolin Wu, Jennifer J Michal, Xiang Zhou, Bo Ding, Michael V Dodson, Nermin K El-Halawany, Gregory S Lewis, Zhihua Jiang.

Abstract

Sheep are among the major economically important livestock species worldwide because the animals produce milk, wool, skin, and meat. In the present study, the Illumina OvineSNP50 BeadChip was used to investigate genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the United States. After quality-control filtering of SNPs (single nucleotide polymorphisms), we used 48,026 SNPs for analysis, including 46,850 SNPs on autosomes that were in Hardy-Weinberg equilibrium and 1,176 SNPs on chromosome X. Phylogenetic analysis based on all 46,850 autosomal SNPs clearly separated Suffolk from Rambouillet, Columbia, Polypay, and Targhee, which was not surprising as Rambouillet contributed to the synthesis of the latter three breeds. Based on pair-wise estimates of F(ST), significant genetic differentiation appeared between Suffolk and Rambouillet (F(ST) = 0.1621), while Rambouillet and Targhee had the closest relationship (F(ST) = 0.0681). A scan of the genome revealed 45 and 41 differentially selected regions (DSRs) between Suffolk and Rambouillet and among Rambouillet-related breed populations, respectively. Our data indicated that regions 13 and 24 between Suffolk and Rambouillet might be good candidates for evaluating breed differences. Furthermore, the ovine genome v3.1 assembly was used as a reference to link functionally known homologous genes to economically important traits covered by these differentially selected regions. In brief, our present study provides a comprehensive genome-wide view of within- and between-breed genetic differentiation, biodiversity, and evolution among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives.

Year: 2013; PMID: 23762451; PMCID: PMC3677876; DOI: 10.1371/journal.pone.0065942

Source DB: PubMed; Journal: PLoS One; ISSN: 1932-6203; Impact factor: 3.240


Introduction

During the last five years, the animal genome community has made significant progress in mapping, sequencing, assembly, and annotation of the ovine genome. Based on BAC (bacterial artificial chromosome) end sequences, Dalrymple and colleagues [1] first reported a virtual sheep genome by painting a total of 84,624 sheep BACs (about 5.4-fold genome coverage) to orthologous regions in the human genome, which were assembled into 1,172 sheep BAC comparative genome contigs that covered 91.2% of the human genome. In 2009, Goldammer and coworkers [2] constructed a cytogenetic map of the sheep genome with 566 loci, which helped link and order genome regions, such as sequence contigs, genes, and polymorphic DNA markers to ovine chromosomes. Approximately two years ago, the International Sheep Genomics Consortium (ISGC) began assembly of a draft reference genome of sheep (Ovis aries) using both Sanger sequencing and the next-generation sequencing platforms [3]. This large scale sequencing of the ovine genome led to discovery of more than 2.8 million ovine single nucleotide polymorphisms (SNPs; http://www.ncbi.nlm.nih.gov/SNP/). In collaboration with the ISGC, Illumina developed the OvineSNP50 Genotyping BeadChip that contains a total of 54,241 SNPs with a marker placed approximately every 46 Kb along the sheep genome (www.illumina.com). The Illumina OvineSNP50 Genotyping BeadChip has been successfully used in sheep and goat genome research. For example, BeadChip analysis revealed that the PITX3 gene is responsible for microphthalmia [4]. A similar approach also helped identify the dentin matrix protein 1 gene (DMP1) as responsible for inherited rickets in Corriedale sheep [5] and the solute carrier family 13 (sodium/sulphate symporters), member 1 (SLC13A1) gene for chondrodysplasia in Texel sheep [6]. 
Both OvineSNP50 BeadChips and microsatellite markers were used to refine two quantitative trait loci (QTL) mapped on OAR5 and 13 for resistance to Haemonchus contortus in sheep [7]. Other applications of OvineSNP50 BeadChip include investigating gene drivers of pigmentation in Merino sheep [8], long range linkage disequilibrium analysis in wild sheep [9], inbreeding coefficient and pairwise relatedness in Finnsheep [10], and genomic selection in different sheep breeds from around the world by the ISGC [11]. It is well known that Suffolk and Rambouillet were developed in England and France, respectively, but the breeding history of American synthetic breeds may be unfamiliar to readers. In brief, Columbia was one of the first breeds of sheep developed in the United States. In 1912, rams of the long wool breeds were crossed with high quality Rambouillet ewes to produce large ewes yielding more pounds of wool and more pounds of lamb. The original cross was made at Laramie, Wyoming, and then moved to the Sheep Experiment Station, Dubois, Idaho, in 1918. Subsequently, Columbia sheep were released to the public [12]. Polypay sheep were developed at the U.S. Sheep Experiment Station starting in 1968. The objective was to develop a breed with a reproductive capacity markedly superior to that of domestic Western range breeds. The final composition of the Polypay is 1/4 Dorset × 1/4 Finnsheep × 1/4 Targhee × 1/4 Rambouillet. The first “Polypay” ewes and rams were sold 1975–1977 [13]. Targhee sheep were developed at the U.S. Sheep Experiment Station, Dubois, Idaho in 1926. A group of cross-bred ewes, consisting of Rambouillet, Lincoln, and Corriedale blood, was bred to USSES Rambouillet rams. After three years, first generation ewes were carefully selected and bred intensely. The U.S. Targhee Sheep Association was founded in 1951 (http://www.ustargheesheep.org/). 
Generally speaking, sheep breeds can be classified into three groups: meat, wool, or dual-purpose breeds based on their breeding objectives. For example, Suffolk is a typical meat breed as the animals possess large body size, rapid growth rate, and high cutability carcasses (http://u-s-s-a.org/). On the other hand, Rambouillet sheep represent a fine wool breed with a well-developed flocking instinct, an extended breeding season, and high-quality fleece (http://www.countrylovin.com/ARSBA/facts.htm). Columbia, Targhee, and Polypay are, however, considered as dual-purpose breeds, because they are fast-growing, high-quality market lambs that also yield heavy, medium-wool fleeces with good staple length [12]–[13] (http://www.sheepusa.org/). Previously, microsatellite markers were the main source of markers used to investigate genetic diversity of sheep breeds. For instance, Bayesian cluster analysis on microsatellite genotypes of 666 animals for 28 U.S. sheep breeds derived from 222 producers located in 38 states was able to distinguish meat vs. wool producers due to physiological differences rather than geographic origin [14]. In the present study, our goal was to test the power of the Illumina OvineSNP50 Genotyping BeadChip in evaluating genetic diversity, genome selection, and breed differentiation among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds. These results may provide new guidance for the synthesis of new breeds with different breeding objectives.

Results

Illumina OvineSNP50 BeadChip Genotyping Basics

Among the 54,241 SNPs on the Illumina OvineSNP50 BeadChip that were genotyped on the 94 sheep DNA samples, 695 SNPs had no calls, 1,019 SNPs were genotyped in fewer than 95% of the individuals, 1,235 SNPs were monomorphic in all breeds, 350 SNPs could not be assigned to chromosome locations, and 2,057 SNPs had MAF ≤0.05 across the whole dataset. After excluding these SNPs, the remaining 48,885 SNPs, including 47,597 autosomal SNPs and 1,288 SNPs on chromosome X, were used for further analysis. Of the 1,288 SNPs on chromosome X, 1,176 showed no heterozygous calls in males, while 112 were heterozygous in some rams. The latter might be related to a homologous region between chromosomes X and Y, because the heterozygous calls in our data do not appear to be randomly distributed. As suggested by Gautier et al. [15], we further excluded a total of 747 autosomal SNPs, which showed significant (P<0.01) deviations from Hardy-Weinberg equilibrium (HWE), due to the small number of samples. We did not test chromosome X because males carry only one copy of each SNP on chromosome X. As a consequence, 46,850 autosomal SNPs were included in the linkage disequilibrium, genetic diversity, and DSR analyses, while the 1,176 chromosome X SNPs without heterozygous males were used for the DSR analysis only. As shown in Figure S1, the final 46,850 and 1,176 SNPs were uniformly distributed across the autosomes (1–26) and the X chromosome, and their distribution was comparable to the initial distribution of the 54,241 SNPs on these chromosomes, although the number of SNPs on each chromosome differed between the two sets.
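The filtering steps above can be checked with simple bookkeeping. The following sketch only re-adds the counts reported in the text; the dictionary labels and variable names are illustrative and are not taken from the authors' pipeline.

```python
# SNP counts reported in the text for each quality-control step.
TOTAL_SNPS = 54_241
removed = {
    "no calls": 695,
    "genotyped in fewer than 95% of individuals": 1_019,
    "monomorphic in all breeds": 1_235,
    "no chromosome assignment": 350,
    "MAF <= 0.05": 2_057,
}

# SNPs remaining after the five basic filters.
after_basic_qc = TOTAL_SNPS - sum(removed.values())

# Reported split of the remaining SNPs.
autosomal, chr_x = 47_597, 1_288
assert after_basic_qc == autosomal + chr_x

# Further exclusions: HWE failures (autosomes) and X-linked SNPs
# that were heterozygous in some rams.
hwe_failures = 747
x_het_in_rams = 112
final_autosomal = autosomal - hwe_failures
final_x = chr_x - x_het_in_rams

print(after_basic_qc, final_autosomal, final_x, final_autosomal + final_x)
```

The totals reproduce the 48,885 SNPs after basic filtering and the 46,850 autosomal plus 1,176 chromosome X SNPs (48,026 in all) used in the analyses.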

r² Measurements by Chromosomes

The r² values for pairs of loci were measured along with the physical distance separating the loci and averaged within each breed. As the sheep genome is currently estimated to be 2.86 Gb in size (http://genome.ucsc.edu/), the 46,850 SNPs used in the linkage disequilibrium (LD) analysis have an average inter-marker distance of approximately 60 Kb. As shown in Figure S2, the average within-population pairwise r² dropped quickly toward its asymptotic value by the time physical distances reached 200 Kb. The decreasing trends of the average r² values were similar among the five breeds, but the Suffolk breed had the highest r², followed by Columbia, Rambouillet, Targhee, and Polypay, respectively.
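As an illustration of the quantities in this paragraph, the sketch below computes r² for one pair of SNPs as the squared Pearson correlation of genotype allele counts, a common approximation for unphased data, and repeats the inter-marker spacing arithmetic. The estimator actually used by the authors is not described in this section, so the function `r_squared` and the toy genotypes are assumptions.

```python
import math

def r_squared(geno_a, geno_b):
    """Squared Pearson correlation between two SNPs coded as 0/1/2 allele counts.

    Individuals with a missing call (None) at either SNP are skipped.
    Returns NaN when either SNP is monomorphic in the retained individuals.
    """
    pairs = [(a, b) for a, b in zip(geno_a, geno_b)
             if a is not None and b is not None]
    n = len(pairs)
    if n == 0:
        return math.nan
    mean_a = sum(a for a, _ in pairs) / n
    mean_b = sum(b for _, b in pairs) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in pairs)
    var_a = sum((a - mean_a) ** 2 for a, _ in pairs)
    var_b = sum((b - mean_b) ** 2 for _, b in pairs)
    if var_a == 0 or var_b == 0:
        return math.nan
    return cov * cov / (var_a * var_b)

# Toy genotypes for eight hypothetical rams at two SNPs.
snp_1 = [0, 1, 2, 1, 0, 2, 1, 1]
snp_2 = [0, 1, 2, 1, 0, 2, 1, 0]
print(r_squared(snp_1, snp_2))

# Average spacing implied by a 2.86 Gb genome and 46,850 SNPs.
GENOME_SIZE_BP = 2.86e9
N_SNPS = 46_850
mean_spacing_kb = GENOME_SIZE_BP / N_SNPS / 1000
print(round(mean_spacing_kb, 1))  # about 61 Kb, i.e. "approximately 60 Kb"
```

Averaging such pairwise values within distance bins, separately for each breed, would give decay curves of the kind summarized in Figure S2.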

The Genetic Structure at the Individual Level

Figure 1 demonstrates a neighbor-joining (NJ) tree based on allele sharing distances (ASD) among 94 rams derived from Columbia, Polypay, Rambouillet, Suffolk, and Targhee breeds. The results clearly showed that there were no conflicts about the origin of individuals assigned to each breed. Also, the individuals from different sheep breeds were clearly clustered with closer genetic distances observed among Targhee, Columbia, Rambouillet, and Polypay breeds as compared to the Suffolk population.
Figure 1

Neighbor-Joining tree relating the 94 individuals.

The tree was constructed using allele sharing distances averaged over 46,850 SNPs. Different colors in labels represent the breed of origin of the individuals. S, R, C, P, and T represent Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds, respectively. The meanings of S, R, C, P, and T are the same in the following figures.
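The allele sharing distance underlying such a tree can be sketched as follows. This assumes one common definition, in which the per-locus distance is one minus half the number of alleles shared identical by state, averaged over loci; the exact formula used by the authors is not given in this section, and the ram labels and genotypes are invented for illustration.

```python
def allele_sharing_distance(ind_a, ind_b):
    """Average allele sharing distance between two individuals.

    Genotypes are coded 0/1/2 (copies of one allele). Per locus the number of
    shared alleles is 2 - |a - b|, so the distance is |a - b| / 2. Loci with a
    missing call (None) in either individual are skipped.
    """
    total = 0.0
    n_loci = 0
    for a, b in zip(ind_a, ind_b):
        if a is None or b is None:
            continue
        total += abs(a - b) / 2
        n_loci += 1
    if n_loci == 0:
        raise ValueError("no loci genotyped in both individuals")
    return total / n_loci

def asd_matrix(genotypes):
    """Pairwise distance table for a dict of {individual: genotype list}."""
    names = sorted(genotypes)
    return {(x, y): allele_sharing_distance(genotypes[x], genotypes[y])
            for x in names for y in names}

# Hypothetical genotypes: two Suffolk-like rams and one Rambouillet-like ram.
rams = {
    "S1": [0, 0, 2, 1],
    "S2": [0, 1, 2, 1],
    "R1": [2, 2, 0, None],
}
matrix = asd_matrix(rams)
print(matrix[("S1", "S2")], matrix[("S1", "R1")])
```

A neighbor-joining algorithm would then take this symmetric matrix as input; in the toy data the two S rams are much closer to each other than either is to R1.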


Assessing the Genetic Structure at the Population Level

As shown in Figure S3, the gene diversity, heterozygosity, and polymorphism information content (PIC) among the five sheep populations were 0.3291–0.3576, 0.3496–0.3722, and 0.2619–0.2837, respectively. The Polypay and Targhee populations had the highest gene diversity, heterozygosity, and PIC, while the Suffolk population had the lowest values for these indexes. Classical F-statistics showed that most variation originated from individuals within a breed, while only 11% of the variation resulted from differences between breeds (Table S1). In particular, Targhee sheep had the highest within-breed variation. Furthermore, multidimensional scaling plots clearly showed the genetic relationships between Suffolk and Rambouillet and among the Rambouillet-related sheep breeds (Figure 2). Based on pair-wise estimates of F(ST), significant genetic differentiation appeared between Suffolk (meat breed) and Rambouillet (fine wool breed) (F(ST) = 0.1621). In comparison, the Rambouillet-related breeds were not significantly separated (F(ST) = 0.0681–0.0952). In particular, Rambouillet and Targhee had the closest relationship (F(ST) = 0.0681) (Figure 2).
Figure 2

Multidimensional scaling plots for the genetic differentiations between Suffolk and Rambouillet (left) and among four Rambouillet-related breeds (right).

F(ST) represents the pair-wise F(ST) between any two sheep breeds.
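For a single biallelic SNP, the three diversity indexes reported above can be sketched with the textbook formulas: gene diversity as expected heterozygosity 1 − p² − q², heterozygosity as the observed fraction of heterozygotes, and PIC as 1 − p² − q² − 2p²q². Whether the authors applied a small-sample correction is not stated in this section, so the uncorrected forms and the function name `snp_diversity` are assumptions.

```python
def snp_diversity(genotypes):
    """Return (gene_diversity, observed_heterozygosity, pic) for one SNP.

    Genotypes are coded 0/1/2 as copies of one allele; missing calls (None)
    are ignored.
    """
    called = [g for g in genotypes if g is not None]
    n = len(called)
    if n == 0:
        raise ValueError("no called genotypes")
    p = sum(called) / (2 * n)          # frequency of the counted allele
    q = 1 - p
    gene_diversity = 1 - p * p - q * q  # expected heterozygosity, 2pq
    observed_het = sum(1 for g in called if g == 1) / n
    pic = gene_diversity - 2 * p * p * q * q
    return gene_diversity, observed_het, pic

# Four hypothetical rams: one of each homozygote and two heterozygotes.
gene_div, obs_het, pic = snp_diversity([0, 1, 1, 2])
print(gene_div, obs_het, pic)
```

Breed-level values such as those in Figure S3 would be obtained by averaging these per-SNP quantities over all 46,850 autosomal SNPs within each breed.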


Population Differences in Minor Allele Frequencies

Based on the minor allele frequency (MAF) of SNPs, different levels of variation were observed across breeds. As shown in Figure S4, over 80% of the SNPs with one allele in all breeds had a MAF > 0.10. In all MAF ranges, the proportion of loci differed significantly among the sheep breeds (χ² = 204.1084–1510.9140, P < 0.0001), indicating that each sheep breed had a different number of SNPs in each MAF range.
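A comparison of this kind can be sketched as a chi-square test on a breed-by-MAF-range table of SNP counts. The bin edges, the counts, and the function names below are hypothetical; the MAF ranges shown in Figure S4 and the exact test layout used by the authors are not given in this section.

```python
def maf_bin_index(maf, upper_edges=(0.1, 0.2, 0.3, 0.4, 0.5)):
    """Index of the first bin whose upper edge is >= maf (edges are assumed)."""
    for i, edge in enumerate(upper_edges):
        if maf <= edge:
            return i
    raise ValueError("MAF must not exceed 0.5")

def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a count table.

    Rows are breeds and columns are MAF ranges. Every row and column total
    must be non-zero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Toy counts for two breeds across three MAF ranges.
counts = [
    [10, 20, 30],
    [30, 20, 10],
]
stat, dof = chi_square(counts)
print(stat, dof)
```

The statistic is then compared with the chi-square distribution with the returned degrees of freedom to obtain a P value.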

Characterization of DSRs between Suffolk and Rambouillet

Based on the SNP F(ST) estimates, a total of 45 DSRs were identified between the Suffolk and Rambouillet genomes, which contained the top 0.1% of markers (48 SNPs on autosomes and 6 SNPs on chromosome X; Table 1). Further examination of these DSRs identified 608 unique known genes, including 507 from autosomal DSRs and 101 from X chromosome DSRs (Table S2). The GO analysis revealed pathways enriched for a wide range of biological processes, such as regulation of organelle/cytoskeleton organization, translational elongation, protein catabolic processes, and cilium morphogenesis (Table S3). Among these 45 DSRs between Suffolk and Rambouillet, 13 also appeared as DSRs in cattle (Table S4). Among autosomal DSRs, the highest F(ST) signal (OAR3_163921101, F(ST) = 0.95, region 13) (Table 1 and Figure 3) was located at 153.26 Mb on ovine chromosome 3 (Ovine v3.1 genome), where glutamate receptor interacting protein 1 (GRIP1) resides. On the other hand, the DSR with the highest number of top 0.1% SNPs (named region 24, spanning 28.58 to 29.84 Mb) was located on chromosome 10 (Table 1 and Figure 3). Furry homolog (Drosophila) (FRY), which included four of the top 0.1% SNPs of this region, is an evolutionarily conserved protein implicated in cell division and morphology [16]. Additionally, selection signals were detected for genes associated with economically important traits, namely MITF and GHR.
Table 1

Differential Genomic Regions between Suffolk and Rambouillet.

Region | Chr | OvineSNP50 BeadChip position (bp) | Ovine version 3.1 position (bp) | Peak SNP (Fst) | Top 0.1% | Top 5% and Gene (digits run together) | Candidates
1 | 1 | 223,692,105–224,089,918 | 207,151,194–207,545,071 | OAR1_224053727 (0.83) | 1 | 30 | NA
2 | 1 | 250,742,595–252,518,595 | 232,582,008–233,998,801 | OAR1_251993749 (0.80) | 1 | 73 | NA
3 | 1 | 257,381,496–260,183,641 | 238,188,431–240,583,116 | OAR1_258333225 (0.76) | 1 | 46 | NA
4 | 2 | 202,599,712–203,260,925 | 191,179,157–191,834,956 | OAR2_202880426 (0.89) | 1 | 53 | NA
5 | 2 | 204,712,172–204,901,274 | 193,176,635–193,374,050 | OAR2_204724536 (0.78) | 1 | 31 | NA
6 | 2 | 238,542,709–240,388,073 | 225,842,690–227,694,352 | OAR2_239994767 (0.78) | 1 | 13 | NA
7 | 3 | 26,522,262–27,337,380 | 24,478,117–25,283,037 | s53615 (0.73) | 1 | 31 | NA
8 | 3 | 36,636,298–39,008,465 | 34,218,962–36,296,085 | s07478 (0.73) | 1 | 129 | PLB1
9 | 3 | 51,184,039–53,141,887 | 48,017,454–50,434,716 | OAR3_51654982 (0.83) | 1 | 110 | HSPA4
10 | 3 | 59,599,168–61,257,663 | 56,387,458–57,906,999 | OAR3_59779674 (0.83) | 1 | 224 | NA
11 | 3 | 146,832,060–147,380,630 | 137,368,282–137,726,913 | OAR3_147229298 (0.88) | 1 | 22 | NA
12 | 3 | 154,771,125–155,555,840 | 144,946,795–145,685,656 | OAR3_154940822 (0.78) | 1 | 43 | NA
13 | 3 | 163,186,840–164,185,125 | 152,644,200–153,519,437 | OAR3_163921101 (0.95) | 2 | 74 | GRIP1, HELB
14 | 3 | 179,997,804–181,039,892 | 167,562,727–168,548,617 | OAR3_180711543 (0.94) | 1 | 410 | ANKS1B
15 | 3 | 216,437,540–217,530,305 | 201,198,665–202,134,428 | OAR3_217179573 (0.78) | 1 | 311 | HEBP1
16 | 4 | 22,765,057–24,779,248 | 21,632,745–23,648,876 | OAR4_23178649 (0.79), OAR4_23188917 (0.79) | 2 | 33 | NA
17 | 5 | 15,497,764–16,376,239 | 13,268,366–14,154,057 | s48780 (0.73) | 1 | 431 | ARHGEF18
18 | 6 | 7,001,684–8,507,164 | 5,094,411–6,363,162 | OAR6_7822475 (0.78) | 1 | 26 | NA
19 | 6 | 74,570,690–77,508,495 | 68,044,077–71,068,886 | s32922 (0.83) | 1 | 018 | NA
20 | 6 | 109,205,577–109,456,098 | 99,193,263–99,428,191 | OAR6_109386725 (0.78) | 1 | 34 | NA
21 | 6 | 111,022,637–112,892,917 | 100,900,651–102,765,374 | s61454 (0.76) | 1 | 415 | PPP2R2C
22 | 7 | 18,503,023–20,430,110 | 17,777,251–19,581,841 | OAR7_18600852 (0.78) | 1 | 212 | THSD4
23 | 8 | 47,992,613–49,442,593 | 44,527,752–45,986,304 | OAR8_48260342 (0.73) | 1 | 20 | NA
24 | 10 | 28,598,904–29,867,192 | 28,584,727–29,842,383 | OAR10_29223007 (0.89), OAR10_29341212 (0.89) | 4 | 89 | FRY
25 | 10 | 34,081,682–36,999,163 | 33,723,894–36,238,012 | DU468275_284 (0.80) | 1 | 022 | SGCG
26 | 10 | 80,478,253–82,187,881 | 73,523,887–75,018,763 | OAR10_80709863 (0.89) | 1 | 17 | NA
27 | 11 | 49,315,455–51,153,763 | 46,334,232–48,164,176 | s53904 (0.83) | 1 | 532 | NA
28 | 11 | 62,887,032–64,089,979 | 58,151,039–59,339,383 | OAR11_63882013 (0.76) | 1 | 30 | NA
29 | 15 | 28,713,885–31,235,158 | 27,396,963–29,749,473 | s10361 (0.80) | 3 | 654 | TMPRSS4, PVRL1
30 | 15 | 55,184,101–56,938,649 | 50,486,417–51,939,057 | s34973 (0.73) | 1 | 218 | NA
31 | 16 | 15,458,830–17,547,585 | 14,215,465–15,879,198 | OAR16_16902918 (0.73) | 1 | 111 | RNF180
32 | 16 | 32,916,237–34,666,518 | 30,290,785–31,926,643 | OAR16_34620156 (0.78) | 1 | 412 | GHR
33 | 17 | 172,595–2,807,057 | 47,789–2,338,716 | s47560 (0.78) | 1 | 29 | NA
34 | 17 | 64,389,126–64,669,860 | 58,960,116–59,261,632 | OAR17_64627979 (0.83) | 1 | 20 | NA
35 | 19 | 1,721,354–4,144,231 | 1,700,469–3,900,485 | OAR19_2610848 (0.73) | 1 | 27 | NA
36 | 19 | 5,578,148–5,861,149 | 5,354,385–5,635,464 | OAR19_5820545 (0.74) | 1 | 20 | NA
37 | 19 | 31,917,811–33,355,170 | 30,269,881–31,674,797 | OAR19_33278780 (0.89) | 1 | 12 | MITF
38 | 21 | 47,142,810–49,259,843 | 42,654,067–44,580,177 | OAR21_47788299 (0.89) | 1 | 173 | OVOL1
39 | 22 | 25,234,645–28,159,147 | 21,379,260–23,858,085 | OAR22_26729825 (0.83) | 1 | 046 | CNNM2
40 | 23 | 14,719,599–17,399,614 | 13,512,270–16,274,959 | OAR23_15999547 (0.73) | 1 | 13 | NA
41 | 23 | 25,829,771–28,768,759 | 24,678,908–27,575,686 | OAR23_27272887 (0.83) | 1 | 016 | NA
42 | X | 51,771,664–53,703,822 | 51,193,144–53,231,464 | OARX_52914005_X (1.00) | 2 | 235 | SHROOM4
43 | X | 60,238,540–64,779,887 | 56,827,620–61,295,149 | OARX_63571789 (1.00) | 2 | 642 | MED12, FAM155B
44 | X | 97,956,675–99,324,958 | 77,903,961–79,702,349 | S47111 (0.78) | 1 | 218 | NA
45 | X | 103,934,729–106,069,046 | 83,501,497–85,670,129 | OARX_105162278 (0.80) | 1 | 16 | NA

Note: A total of 45 genomic regions contained the top 0.1% of SNPs ranked by F(ST) (48 SNPs on autosomes and 6 SNPs on chromosome X). The candidate genes given are those in which the top 0.1% SNPs of each region are located.
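The ranking step behind a table of this kind can be illustrated with a minimal sketch. It uses a simple Nei-style per-SNP estimate, F(ST) = (H_T − H_S)/H_T, for two equally weighted populations and then keeps the highest-ranked fraction of SNPs. This is a stand-in chosen for illustration: the estimator, smoothing, and region-merging rules used by the authors are not described in this section, and the SNP names and allele frequencies below are invented.

```python
import math

def nei_fst(p1, p2):
    """Per-SNP F(ST) for two populations from the frequency of one allele."""
    h_s = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2  # mean within-population
    p_bar = (p1 + p2) / 2
    h_t = 2 * p_bar * (1 - p_bar)                      # pooled heterozygosity
    if h_t == 0:
        return 0.0  # SNP fixed for the same allele in both populations
    return (h_t - h_s) / h_t

def top_fraction(fst_by_snp, fraction=0.001):
    """Names of the SNPs in the top `fraction` of the F(ST) ranking."""
    k = max(1, math.ceil(len(fst_by_snp) * fraction))
    ranked = sorted(fst_by_snp, key=fst_by_snp.get, reverse=True)
    return ranked[:k]

# Hypothetical allele frequencies in (Suffolk, Rambouillet).
freqs = {
    "snp_a": (1.0, 0.0),
    "snp_b": (0.5, 0.5),
    "snp_c": (0.9, 0.1),
    "snp_d": (0.6, 0.4),
}
fst = {name: nei_fst(p1, p2) for name, (p1, p2) in freqs.items()}

# With only four toy SNPs a large fraction is used; the paper uses 0.1%.
print(top_fraction(fst, fraction=0.5))
```

With the default fraction of 0.001, a panel of 46,850 SNPs would yield a few dozen markers, the same order of magnitude as the counts reported in the note above.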

Figure 3

Genome-wide distribution of F(ST) between Suffolk and Rambouillet.

Based on OvineSNP50 BeadChip positions, smoothed F(ST) values show strong selection signals in regions 13 and 24. S-R represents Suffolk-Rambouillet, while R-C-P-T means Rambouillet-Columbia-Polypay-Targhee.


Characterization of DSRs among Rambouillet-related Breeds

The genome-wide distribution of F(ST) among Rambouillet, Columbia, Polypay, and Targhee is shown in Figure 4. Among these four Rambouillet-related breeds, 41 DSRs were identified with the top 0.1% of markers ranked by SNP F(ST) (46 on autosomes and 1 on chromosome X; Table 2). These DSRs harbor a total of 526 unique genes, including 524 from autosomal DSRs and 2 from chromosome X DSRs (Table S5). Interestingly, GO analysis revealed that the enriched pathways were mainly related to cell adhesion processes (Table S6). Among these four sheep breeds, the OAR21_19719146 SNP (F(ST) = 0.65, region 37; Table 2), which belongs to the potassium channel tetramerisation domain containing 14 (KCTD14) gene, ranked highest, but unfortunately little is known about this gene. Our data also show that sheep and cattle may share eight of the DSRs identified among Rambouillet, Columbia, Polypay, and Targhee (Table S7).
Figure 4

Genome-wide distribution of F(ST) among Rambouillet, Columbia, Polypay and Targhee.

R-C-P-T means Rambouillet-Columbia-Polypay-Targhee.

Table 2

Differential Genomic Regions among Rambouillet, Columbia, Polypay and Targhee.

Region | Chr | OvineSNP50 BeadChip position (bp) | Ovine version 3.1 position (bp) | Peak SNP (Fst) | Top 0.1% | Top 5% and Gene (digits run together) | Candidates
1 | 1 | 30,388,117–30,798,950 | 29,732,489–30,143,388 | OAR1_30552484 (0.41) | 1 | 30 | NA
2 | 1 | 31,470,501–32,656,978 | 30,814,041–31,918,917 | OAR1_31791056 (0.44) | 1 | 26 | C10RF168
3 | 1 | 43,609,775–44,153,575 | 42,128,684–42,656,182 | s54617 (0.48) | 1 | 25 | IL23R
4 | 1 | 65,172,267–67,951,620 | 61,589,657–64,314,437 | OAR1_66495276 (0.42) | 1 | 124 | CLCA1
5 | 1 | 202,259,180–203,821,806 | 187,627,690–189,001,556 | s48685 (0.44) | 1 | 216 | NA
6 | 1 | 234,341,600–234,884,834 | 217,261,740–217,734,182 | OAR1_234474464 (0.50) | 3 | 43 | GOLIM4
7 | 1 | 235,109,323–236,540,864 | 217,827,095–219,332,381 | OAR1_235893252 (0.42) | 1 | 94 | NA
8 | 1 | 238,519,182–238,893,868 | 221,199,906–221,519,414 | OAR1_238532462 (0.43) | 1 | 20 | NA
9 | 1 | 243,197,038–243,803,482 | 225,691,787–226,307,550 | OAR1_243366256 (0.41) | 1 | 83 | NA
10 | 2 | 52,036,310–54,944,048 | 48,501,546–51,384,849 | s67253 (0.40) | 1 | 133 | TDRD7
11 | 2 | 199,204,762–200,621,949 | 187,885,054–189,186,475 | OAR2_200574953 (0.44) | 1 | 31 | CNTNAP5
12 | 2 | 218,777,904–220,621,478 | 206,643,029–208,373,192 | OAR2_219736519 (0.41) | 1 | 313 | ADAM23
13 | 2 | 221,160,555–223,548,439 | 208,910,057–211,274,257 | s71750 (0.43) | 2 | 616 | NA
14 | 2 | 240,718,168–241,220,099 | 228,019,818–228,451,875 | OAR2_241048971 (0.42) | 1 | 34 | COL4A4
15 | 2 | 258,064,261–258,169,201 | 244,129,054–244,237,213 | s36316 (0.42) | 1 | 20 | NA
16 | 3 | 57,803,870–59,844,188 | 54,668,289–56,584,259 | OAR3_59312021 (0.41) | 1 | 14 | NA
17 | 3 | 79,812,234–81,533,150 | 75,538,807–77,148,543 | OAR3_79936157 (0.43) | 1 | 217 | NA
18 | 3 | 101,572,092–102,519,143 | 95,521,406–96,371,620 | s65088 (0.53) | 1 | 232 | LOXL3
19 | 3 | 124,569,037–125,411,739 | 116,863,473–117,540,039 | OAR3_125387325 (0.44) | 1 | 33 | NA
20 | 5 | 41,108,751–43,897,574 | 37,314,853–40,020,855 | OAR5_42607507 (0.43) | 1 | 020 | NA
21 | 5 | 50,423,296–51,450,144 | 46,337,373–47,282,405 | OAR5_50467325 (0.41) | 1 | 218 | NA
22 | 5 | 52,451,051–55,344,981 | 48,225,599–51,058,240 | OAR5_53870555 (0.42) | 1 | 166 | PCDHB4
23 | 5 | 110,241,515–110,742,508 | 101,265,722–101,776,198 | OAR5_110440932 (0.40) | 1 | 20 | NA
24 | 8 | 82,084,037–82,635,994 | 76,042,356–76,469,230 | s02095 (0.42) | 1 | 24 | NA
25 | 8 | 91,759,596–92,094,265 | 85,080,887–85,410,438 | s19702 (0.43) | 1 | 31 | NA
26 | 9 | 76,165,857–78,371,606 | 71,768,469–73,808,972 | OAR9_77597438 (0.48) | 1 | 39 | RIMS2
27 | 10 | 92,069,072–93,254,917 | 84,440,241–85,596,458 | DU434120_194 (0.58) | 2 | 69 | ARHGEF7, TUBGCP3
28 | 11 | 13,571,422–16,047,304 | 13,680,542–15,833,930 | OAR11_13845361 (0.42) | 1 | 436 | NA
29 | 11 | 21,539,612–22,907,346 | 20,832,059–22,055,035 | s50268 (0.41) | 1 | 319 | NA
30 | 11 | 42,268,122–42,641,540 | 39,778,473–40,158,382 | OAR11_42371614 (0.45) | 1 | 316 | NA
31 | 11 | 46,846,457–49,449,235 | 44,059,749–46,470,862 | s17065 (0.40) | 1 | 433 | NA
32 | 13 | 25,313,379–26,260,934 | 22,757,476–23,728,650 | OAR13_25941147 (0.45) | 1 | 25 | NA
33 | 17 | 53,094,617–53,266,879 | 48,796,359–48,949,569 | OAR17_53158609 (0.42) | 1 | 31 | NA
34 | 18 | 44,151,635–46,578,380 | 41,525,466–43,758,521 | OAR18_44175536 (0.42) | 1 | 25 | NA
35 | 20 | 45,499,537–49,145,527 | 41,875,844–45,195,888 | OAR20_47285319 (0.52) | 3 | 022 | NA
36 | 21 | 9,818,736–10,702,527 | 8,402,965–9,308,676 | s42617 (0.45) | 1 | 38 | ME3
37 | 21 | 18,653,273–20,194,612 | 16,451,437–17,888,494 | OAR21_19719146 (0.65) | 1 | 516 | KCTD14
38 | 21 | 46,857,187–48,167,168 | 42,551,316–43,585,536 | s67834 (0.52) | 1 | 451 | BATF2
39 | 25 | 31,163,834–32,984,881 | 29,829,111–31,596,285 | OAR25_32856981 (0.48) | 1 | 212 | NA
40 | 26 | 13,102,143–13,432,722 | 10,556,336–10,883,077 | OAR26_13165931 (0.41) | 1 | 30 | NA
41 | X | 112,364,319–112,895,373 | 107,382,577–107,865,244 | OARX_112730737 (0.44) | 1 | 52 | NA

Note: A total of 41 genomic regions contained the top 0.1% of SNPs ranked by F(ST) (46 on autosomes and 1 on chromosome X). The candidate genes given are those in which the top 0.1% SNPs of each region are located.


Discussion

In the present study, we used the Illumina OvineSNP50 Genotyping BeadChip to analyze the genetic diversity and genome selection among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds from the USDA, ARS, U.S. Sheep Experiment Station. We found that close genetic relationships exist among the Rambouillet-related breeds (Rambouillet, Columbia, Polypay, and Targhee), while Suffolk sheep are well separated from them. The F(ST) results showed a significant genetic difference between Suffolk and Rambouillet (F(ST) = 0.1621). Between these two distinct breeds, 45 DSRs and 608 candidate genes were identified using the Ovine Genome v3.1 Assembly as a reference. In addition, 41 DSRs and 526 genes were identified among the four Rambouillet-related breeds. Polypay (Finn × Targhee × Rambouillet × Dorset) [13], Targhee (Rambouillet × Lincoln × Corriedale), and Columbia (Lincoln × Rambouillet) are three breeds that were originally developed at the U.S. Sheep Experiment Station decades ago. Rambouillet and Suffolk were developed in France and England, respectively. In the present study, the highest gene diversity, heterozygosity, and PIC were observed in Polypay and Targhee. This is not surprising because these two sheep breeds are the most recently developed breeds, and we expect them to retain greater heterozygosity than the three other sheep breeds. Also, we found that F(ST) averaged 0.1140, but the Rambouillet-related breeds were not significantly separated, suggesting that the genetic differentiation is mainly between Suffolk and the Rambouillet-related breeds. Not unexpectedly, cluster analysis also clearly showed that Suffolk is genetically distant from the other four sheep breeds (Figure 1). These results are reasonable because Columbia, Targhee, and Polypay are Rambouillet-related breeds. In particular, pair-wise F(ST) estimation suggested that the Targhee should be considered genetically most similar to Rambouillet (Figure 2).
These results are reasonable because they are supported by our records and the breed selection history of Columbia, Targhee, and Rambouillet. Recently, the same SNP chip was used to assign population of origin among wild sheep, including bighorn and thinhorn sheep [9], and to determine the historic selection of 74 sheep breeds [11]. In cattle, the bovine SNP chip has also been used to reveal genetic history or population diversity [17]–[18]. Our results further confirm that the SNP chip is a powerful tool for discovering population genetic diversity in livestock, and these data provide strong evidence of the genetic structure in these five sheep breeds. Meat, wool, and dual-purpose breeds of sheep were developed because these are highly valued traits in sheep production. Based on genetic distance, our results indicated that the meat breed (Suffolk) is very distant from the fine wool breed (Rambouillet), which is very much in line with the functional purpose of the different breeds. For example, sheep selected for meat production generally have greater body weights. Mature weights of Suffolk rams, which have been historically selected for meat production, range from 113 to 159 kg, and the fleece is considered a medium wool type with a staple length of 5 to 8.75 cm (http://u-s-s-a.org/). In comparison, mature Rambouillet rams, which have been bred to produce high quality wool, are smaller and weigh between 113 and 135 kg, while the fleece staple length varies from 5 to 10 cm and fiber diameter ranges from 18.5 to 24.5 microns (http://www.countrylovin.com/ARSBA/facts.htm). In the present study, a genome-wide scan or differentiation analysis using F(ST) revealed 45 chromosomal regions with evidence for selection. Interestingly, three regions, 19, 24, and 37, are almost identical to regions identified in the 74 sheep breeds examined in [11], implying that important genomic selection might have occurred in these regions.
Interestingly, region 24, discovered in this study, includes the RXFP2 gene, which is involved in horn morphology and had a selection signal that was reconstituted only when comparing horned with polled populations [11], [19]. Kijas et al. (2012) [11] indicated that this gene had the strongest selection signal due to the long-standing nature of selection. In our study, however, only two Rambouillet rams had horns. Therefore, the sample size was most likely too small to detect a difference in our study. Not unexpectedly, GHR, an important growth-related gene, was identified (region 32 on OAR 16). It is well known that this gene affects body growth and decreases fatness [20], and its genetic variations are associated with growth traits in sheep or cattle [20]–[21]. Therefore, our study provides additional information for interpreting the difference in growth ability between Suffolk and Rambouillet. The highest ranked SNPs (F(ST) > 0.90) were located in glutamate receptor interacting protein 1 (GRIP1) and ankyrin repeat and sterile alpha motif domain containing 1B (ANKS1B). Many studies have found that GRIP1 plays an important role in receptor trafficking, synaptic organization, and transmission in glutamatergic and GABAergic synapses, and in modulating autistic phenotype [22]–[24]. Unfortunately, little is known about the function of GRIP1 in livestock. Our results might provide a new clue to its role in sheep production. Recently, the ANKS1B gene was shown to be associated with body weight index and waist circumference in human GWAS studies [25], and Parker et al. [26] indicated that this gene may underlie the QTL associated with body weight in mice. Our study also suggests that the ANKS1B gene might be a good candidate for growth traits. However, additional studies are required to confirm this speculation. Interestingly, we discovered the MITF gene in DSRs (region 37 on OAR 19). This gene accounts for pigmentation phenotypes in cattle [27].
However, ASIP, which controls a series of alleles of black and white coat color [28], was not included in our DSRs. In sheep, gene duplications might also cause black fleece [28]. In this study, the Suffolk has a black head and legs, while the Rambouillet does have recessive black. It appears that the key pigmentation gene may provide evidence for selection between the two sheep breeds. In this study we also identified FRY, a gene involved in growing wing hairs [29] and bristles [30] in Drosophila. Mutations in FRY resulted in a strong multiple hair cell phenotype that consisted of clusters of epidermal hairs and branched hairs [31]. However, little is known about the role of FRY in livestock. Rambouillet is often considered a fine wool breed, while Suffolk has rather poor quality wool. Therefore, our results provide strong evidence for the role of FRY in sheep wool development. Additionally, 13 of the 45 DSRs identified in sheep correspond to DSRs in cattle, suggesting that these genes are targets for selection across multiple species. As described above, Columbia, Polypay, and Targhee are related to Rambouillet sheep. Among these four sheep breeds, a total of 41 DSRs were identified (Table 2). Interestingly, GO term analyses of functionally known genes in these regions discovered pathways related to homophilic/cell adhesion, translational elongation, germ cell development, sexual reproduction, and macromolecule biosynthetic processes. Some signature genes suggested strong selection given their roles, such as CNTNAP5, ADAM23, and PCDHB4 in cell adhesion, and ME3, RIMS2, and TDRD7 in cellular respiration, cellular macromolecule/protein localization, and multicellular organism reproduction, respectively. These might result from long-term selection for improved reproduction and wool traits in these four sheep breeds [32]–[33].
In summary, we revealed the genetic diversity among Suffolk, Rambouillet, Columbia, Polypay, and Targhee sheep breeds of the United States using the Illumina OvineSNP50 BeadChip. DSRs between Suffolk and Rambouillet and among Rambouillet-related sheep breeds were also identified, and a list of candidate genes in these regions was produced based on the Ovine Genome v3.1 Assembly. Estimation of genome-wide diversity and identification of DSRs provide a powerful method to identify genes related to economically important traits that have been enriched during long-term selection for different breeding objectives. Furthermore, our results provide a foundation for further investigation of sheep evolution and gene functions in the near future.

Materials and Methods

Ethics Statement

The U.S. Sheep Experiment Station Animal Institutional Care and Use Committee specifically approved this study (Protocol number: 11–01). All efforts were made to minimize any discomfort during blood collection.

Sheep, DNA Preparation, and Genotyping on Illumina OvineSNP50 BeadChips

In the present study, blood samples were collected from 19 Columbia, 19 Polypay, 16 Rambouillet, 18 Suffolk, and 22 Targhee rams at the U.S. Sheep Experiment Station in Dubois, Idaho. Rams were produced from unique dams and 12, 12, 9, 10, and 17 unique sires of the Columbia, Polypay, Rambouillet, Suffolk, and Targhee breeds, respectively. The number of sheep per breed in this study is similar to the average number of sheep per breed that Kijas and coworkers [11] used to quantify breed mixture and selection using the OvineSNP50 BeadChip. Blood was collected via jugular venipuncture into EDTA-coated vacutainer tubes. Thereafter, DNA was extracted from 200 µL of whole blood with the GenElute Blood Genomic DNA extraction kit (Sigma, St. Louis, MO) according to the manufacturer’s instructions. All DNA samples were genotyped with standard procedures at GeneSeek (Lincoln, NE, US) on the OvineSNP50 genotyping BeadChip. Basic information on the 54,241 SNPs on the BeadChip, including SNP name, chromosome, and map location, was provided by the service provider. The genotype quality control process was as previously described [34].

Population Genetic Basics Analysis

Analysis of minor allele frequencies (MAF) was conducted with the chi-squared test using SAS Software for Windows v9.2 (SAS Institute Inc., Cary, NC). An exact test for Hardy-Weinberg Equilibrium (HWE) [35] of polymorphic SNPs was further carried out within each breed separately. We also computed the r2 measure between each marker pair within each breed separately using Haploview 4.1 [36]. Allele-sharing distances among individual sheep were computed with PowerMarker V3.25 [37], and the neighbor-joining tree was then constructed with MEGA 5 [38]. Gene diversity, heterozygosity, polymorphism information content (PIC), and classical F-statistics [39] were calculated using PowerMarker V3.25. FSTAT 2.9.3.2 [40] was used to evaluate population relatedness using pair-wise estimates of F(ST). An identity-by-state (IBS) distance matrix (D) was constructed with Plink v1.07 [41], and multidimensional scaling (MDS) analysis of the 46,850 autosomal SNPs was then performed in R 2.14.0 (www.r-project.org).
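The per-SNP summary statistics and the IBS distance named above can be illustrated with a minimal sketch. It assumes genotypes coded as 0, 1, or 2 copies of one allele (None for a missing call), uses the textbook biallelic formulas for gene diversity and PIC, and takes D as 1 minus the mean proportion of alleles shared identical by state. The function names are hypothetical, and PowerMarker and Plink may apply corrections or missing-data handling that this sketch omits.

```python
def allele_freq(genotypes):
    """Frequency of the counted allele; genotypes are 0/1/2, None = missing."""
    called = [g for g in genotypes if g is not None]
    return sum(called) / (2 * len(called))


def maf(genotypes):
    """Minor allele frequency of a biallelic SNP."""
    p = allele_freq(genotypes)
    return min(p, 1 - p)


def gene_diversity(genotypes):
    """Expected heterozygosity for a biallelic SNP: 1 - p^2 - q^2."""
    p = allele_freq(genotypes)
    q = 1 - p
    return 1 - p * p - q * q


def pic(genotypes):
    """Polymorphism information content, biallelic case:
    1 - (p^2 + q^2) - 2 * p^2 * q^2."""
    p = allele_freq(genotypes)
    q = 1 - p
    return 1 - (p * p + q * q) - 2 * p * p * q * q


def ibs_distance(ind_a, ind_b):
    """Assumed definition: D = 1 - mean share of alleles identical by state,
    averaged over SNPs called in both individuals."""
    shared = [
        (2 - abs(a - b)) / 2
        for a, b in zip(ind_a, ind_b)
        if a is not None and b is not None
    ]
    return 1 - sum(shared) / len(shared)
```

A pairwise matrix built from `ibs_distance` over all rams is the kind of input that a classical MDS routine expects.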

Detection of Differentially Selected Regions (DSRs)

Fisher’s exact test was first performed in R 2.14.0 to compare allele frequencies between Suffolk and Rambouillet and among Rambouillet-related breed populations. A SNP with a P value < 0.05 after Bonferroni correction was considered statistically significant. SNP- and population-specific F(ST) values were then estimated with the model proposed by Nicholson et al [42] and Flori et al [43]. The DSR algorithm was described previously [11], but with slight modifications: 1) raw values were ranked and used to identify regions; 2) the significant SNPs with the 0.1% or 5% highest F(ST) values were selected as the top significant SNPs; 3) centered on the top significant SNP (0.1%), neighboring markers were included until more than three consecutive SNPs ranking outside the top significant 5% were encountered. If a candidate region was longer than 3 Mb, we considered only the range from 1.5 Mb upstream to 1.5 Mb downstream of the top SNP (0.1%), and any two overlapping regions were combined into one region. SNP-specific F(ST) values were smoothed over each chromosome with a local variable bandwidth kernel estimator [44]. Genes in these DSRs were examined for potential involvement in phenotypes using the Ovine Genome v3.1 Assembly (http://www.livestockgenomics.csiro.au/cgi-bin/gbrowse/oarv3.1/). The functional annotation of target genes for the gene ontology was performed using DAVID bioinformatics resources [45]. Allele frequencies per breed for all DSRs can be found at www.animalgenome.org/repository/pub/USDA2013.0411/.

Supporting Information

Distributions of SNPs on different chromosomes. (TIF)
Decay of average pairwise r2. (TIF)
Genetic diversity analysis in different sheep breeds. (TIF)
Minor allele frequencies (MAF) with 46,850 SNPs for different sheep breeds. (TIF)
Classical F-statistics in different sheep breeds. (XLS)
List of ovine positional candidate genes based on the predicted protein coding genes from DSRs between Suffolk and Rambouillet. (XLS)
Gene ontology analysis related to the ovine positional candidate genes from DSRs between Suffolk and Rambouillet. (XLS)
Selection signals identified in both sheep and cattle using the ovine positional candidate genes from DSRs between Suffolk and Rambouillet. (XLS)
List of ovine positional candidate genes based on the predicted protein coding genes from DSRs among Rambouillet, Columbia, Polypay, and Targhee. (XLS)
Gene ontology analysis related to the ovine positional candidate genes from DSRs among Rambouillet, Columbia, Polypay, and Targhee. (XLS)
Selection signals identified in both sheep and cattle using the ovine positional candidate genes from DSRs among Rambouillet, Columbia, Polypay, and Targhee. (XLS)
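The region-growing steps of the DSR detection described in the Materials and Methods can be sketched roughly as follows. This is an illustrative reading of the text, not the authors' code: the function names are hypothetical, the sketch handles one chromosome with SNPs sorted by position, the rank cutoffs are computed over the SNPs supplied (the paper ranks raw values, presumably genome-wide), and a region is assumed to end at the last SNP that still ranks inside the top 5%.

```python
def top_fraction_cutoff(values, frac):
    """Smallest value still inside the top `frac` of `values` (at least one)."""
    n_top = max(1, int(len(values) * frac))
    return sorted(values, reverse=True)[n_top - 1]


def _extend(fst, core, step, flank_cut, max_misses):
    """Walk away from the core SNP; stop once more than `max_misses`
    consecutive SNPs fall below the flank cutoff."""
    edge, misses = core, 0
    j = core + step
    while 0 <= j < len(fst):
        if fst[j] >= flank_cut:
            edge, misses = j, 0
        else:
            misses += 1
            if misses > max_misses:
                break
        j += step
    return edge


def find_dsrs(positions, fst, core_frac=0.001, flank_frac=0.05,
              max_misses=3, max_len=3_000_000, half_window=1_500_000):
    """Return merged (start_bp, end_bp) candidate regions on one chromosome."""
    core_cut = top_fraction_cutoff(fst, core_frac)
    flank_cut = top_fraction_cutoff(fst, flank_frac)
    regions = []
    for i, value in enumerate(fst):
        if value < core_cut:
            continue  # only top-ranked SNPs seed a region
        left = _extend(fst, i, -1, flank_cut, max_misses)
        right = _extend(fst, i, +1, flank_cut, max_misses)
        start, end = positions[left], positions[right]
        if end - start > max_len:
            # Long regions are cut back to 1.5 Mb either side of the top SNP.
            start = max(start, positions[i] - half_window)
            end = min(end, positions[i] + half_window)
        regions.append((start, end))
    # Combine any overlapping regions into one.
    merged = []
    for start, end in sorted(regions):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged
```

With real data, `fst` would hold the SNP-specific F(ST) estimates; the kernel smoothing step mentioned in the text is not part of this sketch.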
  40 in total

1.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

2.  A note on exact tests of Hardy-Weinberg equilibrium.

Authors:  Janis E Wigginton; David J Cutler; Goncalo R Abecasis
Journal:  Am J Hum Genet       Date:  2005-03-23       Impact factor: 11.025

3.  PowerMarker: an integrated analysis environment for genetic marker analysis.

Authors:  Kejun Liu; Spencer V Muse
Journal:  Bioinformatics       Date:  2005-02-10       Impact factor: 6.937

4.  GRIP1 in GABAergic synapses.

Authors:  Rong-Wen Li; David R Serwanski; Celia P Miralles; Xuejing Li; Erik Charych; Raquel Riquelme; Richard L Huganir; Angel L de Blas
Journal:  J Comp Neurol       Date:  2005-07-18       Impact factor: 3.215

5.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

6.  The tricornered Ser/Thr protein kinase is regulated by phosphorylation and interacts with furry during Drosophila wing hair development.

Authors:  Ying He; Xiaolan Fang; Kazuo Emoto; Yuh-Nung Jan; Paul N Adler
Journal:  Mol Biol Cell       Date:  2004-12-09       Impact factor: 4.138

7.  Genetic parameters among weight, prolificacy, and wool traits of Columbia, Polypay, Rambouillet, and Targhee sheep.

Authors:  C M Bromley; G D Snowder; L D Van Vleck
Journal:  J Anim Sci       Date:  2000-04       Impact factor: 3.159

8.  A genome scan for QTL affecting resistance to Haemonchus contortus in sheep.

Authors:  G Sallé; P Jacquiet; L Gruner; J Cortet; C Sauvé; F Prévot; C Grisez; J P Bergeaud; L Schibler; A Tircazes; D François; C Pery; F Bouvier; J C Thouly; J C Brunel; A Legarra; J M Elsen; J Bouix; R Rupp; C R Moreno
Journal:  J Anim Sci       Date:  2012-07-05       Impact factor: 3.159

9.  Genetic relationship between milk score and litter weight for Targhee, Columbia, Rambouillet, and Polypay sheep.

Authors:  R M Sawalha; G D Snowder; J F Keown; L D Van Vleck
Journal:  J Anim Sci       Date:  2005-04       Impact factor: 3.159

10.  Development of the Polypay breed of sheep.

Authors:  C V Hulet; S K Ercanbrack; A D Knight
Journal:  J Anim Sci       Date:  1984-01       Impact factor: 3.159
