Literature DB >> 28855565

Genome-wide genetic structure and differentially selected regions among Landrace, Erhualian, and Meishan pigs using specific-locus amplified fragment sequencing.

Zhen Li1, Shengjuan Wei1, Hejun Li2, Keliang Wu3, Zhaowei Cai4, Dongfeng Li1, Wei Wei1, Qifa Li1, Jie Chen1, Honglin Liu1, Lifan Zhang5.   

Abstract

As typical Chinese indigenous pig breeds, Erhualian and Meishan have been widely used to produce new strain or breed in the world. However, the genetic basis of characteristics of these pig breeds is still limited. Moreover, considering cost and output of sequencing, it is necessary to further develop cost-effective method for pig genome screening. To contribute on this issue, we developed a SLAF-seq (specific-locus amplified fragment sequencing) method for pigs and applied it to analyze the genetic difference among Landrace, Erhualian, and Meishan pigs. A total of 453.75 million reads were produced by SLAF-seq. After quality-control, 165,670 SNPs (single nucleotide polymorphisms) were used in further analysis. The results showed that Landrace had distinct genetic relationship compared to Erhualian (F ST = 0.5480) and Meishan (F ST = 0.5800), respectively, while Erhualian and Meishan held the relatively close genetic relationship (F ST = 0.2335). Furthermore, a genome-wide scanning revealed 268 differentially selected regions (DSRs) with 855 genes and 256 DSRs with 347 genes between Landrace and the two Chinese indigenous pig breeds and between Erhualian and Meishan, respectively. This study provides a new cost-effective method for pig genome study and might contribute to a better understanding on the formation mechanism of genetic difference among pigs with different geographical origins.

Entities:  

Mesh:

Year:  2017        PMID: 28855565      PMCID: PMC5577042          DOI: 10.1038/s41598-017-09969-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

In 2009, the pig genome was being sequenced and annotated for public utility[1], which made it possible for analysis and application of genome-wide genetic variants of pig. Shortly afterwards, Illumina developed the PorcineSNP60 Genotyping BeadChip, which included around 60,000 loci by genotyping 158 individuals across multiple pig breeds[2]. Due to the advantages of high density and inexpensive price, the Illumina PorcineSNP60 Genotyping BeadChip has been widely used in multiple research areas. For example, identifying candidate genes associated with economically important traits in different pig populations[3-5], detecting inbreeding depression of reproductive traits in Iberian pigs[6], mining genetic diversity and selection signatures in Chinese and Western pig breeds[7], and discovering population structure of local pig breeds from Russia, Belorussia, Kazakhstan, and Ukraine[8]. These studies greatly enhance our knowledge for understanding the information implied in the pig genome. Simultaneously, with the rapid development of the next generation sequencing technology, genome resequencing has also been introduced into the pig genome research. i.e., identifying candidate loci related to economically important traits between Korean and European origin breeds[9], digging signatures of selection in European or Asian domestic pig genome[10, 11], and measuring coancestry and fitness in Sus cebifrons and Pietrain pigs to design optimal breeding or conservation programs[12]. As a full-scanning approach for discovering all possible genome-wide variants, genome resequencing is considered as a more comprehensive solution for resolving the mystery of the genome. But so far, to some extent, the high costs of sequencing and bioinformatics analysis from genome resequencing have limited its application in population genetics of domestic animals, particularly in some research groups with insufficient funds. Alternatively, other approaches based on reduced representation genome sequencing strategy to reduce complexity in the genome for resequencing can be applied in pig genome-wide study[13-15]. For example, genome reducing and sequencing (GGRS), which contains more number of loci but similar or lower costs than that of Illumina PorcineSNP60 Genotyping BeadChip, has recently been used to identify selection signatures in Yorkshire and Landrace pigs[16] and discover the genetic diversity of Chinese indigenous pig breeds of the Taihu Lake region[17]. Meanwhile, another reduced representation genome sequencing, named specific-locus amplified fragment sequencing (SLAF-seq), was also exploited. As a reduced representation library and high-throughput sequencing based approach, SLAF-seq can pre-design the experimental project through bioinformatics and selection of fragments with a specific length in constructing library. More importantly, SLAF-seq has been considered as a high cost-effective way for large-scale genotyping due to its advantages of high genotyping accuracy, high efficiency for marker discovery, low cost, and high capacity for large populations[18]. Therefore, SLAF-seq is widely applied for revealing loci or genes associated with characteristic traits of multiple breeds. i.e., fruit traits in cucumber[19], tolerance to low-phosphorus stress in soybean[20], and disease, growth and carcass traits in chicken[21-23]. Obviously, these data support that reduced representation genome sequencing strategy is a convenient and effective way for genome-wide study in plants and animals. As representatives of Chinese pigs, Erhualian and Meishan are two native pig breeds with high reproductive performance and excellent meat quality such as high litter size and intramuscular fat content. Most recently, many single nucleotide polymorphisms (SNPs) and candidate genes associated with reproductive or meat quality traits in Erhualian pigs have been identified using Illumina PorcineSNP60 Genotyping BeadChip[24, 25]. Additionally, due to high costs, a limited number of pigs were used (usually from 4 to 10 individuals) for genome resequencing and most of the studies classified Erhualian and Meishan as one of domesticated pig breeds rather than separate groups for further analysis[10, 26, 27]. Knowledge of the genetic difference between Erhualian and Meishan pig breeds is lacking. Furthermore, a genome-wide view of possible candidate loci or genes associated with the characteristics of Erhualian and Meishan pigs compared with other distant breeds such as European origin pigs has still not been well investigated. Taken together, for the first time, a SLAF-seq approach for pigs was developed in this study and then was performed to analyze the genetic difference among Landrace, Erhualian, and Meishan pigs, thus providing a cost-effective way for pig genome-wide study and new guidance for understanding the genetic basis of germplasm characteristics of these pig breeds.

Results

Development of SLAF markers in the pig genome

In order to exploit SLAF markers, in silico restriction analysis was firstly performed on the current pig reference genome (Sscrofa 10.2). As a result, two restriction enzymes, RsaI and HaeIII, were used as enzyme combination for developing around 500,000 SLAFs. Finally, a total of 516,733 SLAF tags were predicted using in silico restriction analysis, which showed similar average SLAF distance among different chromosomes (Supplementary Table S1).

Specific-locus amplified fragment sequencing (SLAF) basics

To obtain the actual SLAF markers used in this study, SLAF-seq was performed in 28 Landrace, 26 Erhualian, and 27 Meishan individuals using the same enzyme combination as in silico restriction analysis. As shown in Table 1 and Supplementary Table S2, a total of 453.75 million reads were obtained from all individuals, which showed that average Q30 and GC content were 87.67% and 44.55%, respectively. Similar to the number of expected SLAFs, 461,261 unique SLAF tags, with average SLAF numbers and sequencing depth as 398,034 (12.74×), 344,471 (6.10×), and 369,160 (6.13×) in Landrace, Erhualian, and Meishan pig populations, respectively, were identified further (Table 1 and Supplementary Table S3). Moreover, compared to the distribution of the expected 516,733 SLAF tags, similar uniformly distribution of 461,261 SLAF tags was also found on different chromosomes (Supplementary Figure S1). In addition, Oryza sativa indica was used as a control during sequencing, the results found that the percentage of digestion normally and paired-end mapped reads of control were 91.73% and 90.68%, respectively, indicating that SLAF-seq process was normal and available.
Table 1

Characteristics of SLAF-seq among Landrace, Erhualian, and Meishan.

ItemLandraceErhualianMeishanAverage of individuals Oryza sativa indica
Statistics of reads
Q30 percentage (%)83.26–87.4087.03–89.2187.51–89.8987.6788.24
GC percentage (%)42.93–46.8944.26–46.7642.74–45.3244.5542.48
Number of reads7,314,304–11,830,0782,572,513–5,014,9553,151,950–5,353,3325,601,9721,452,138
Average of reads9,038,4723,601,4843,964,589
Statistics of SLAFs
Number of SLAFs352,574–420,742316,233–382,547349,762–389,120371,217
Average of SLAFs398,034344,471369,160
Average depth12.746.106.13
Statistics of SNPs
Number of SNPs1,615,995–2,604,0781,160,477–1,663,7071,322,101–1,701,1531,680,561
Heterozygosity (%)4.81–8.125.81–8.455.20–7.236.51
Ratio of Integrity (%)30.07–48.4521.59–30.9524.60–31.6531.26
Average of SNPs2,176,1311,302,9271,530,285
Characteristics of SLAF-seq among Landrace, Erhualian, and Meishan.

Identification of SNPs in Landrace, Erhualian, and Meishan pigs

After genomic mapping and SNP calling, a total of 5,373,997 SNPs were discovered using all individuals (Supplementary Table S4). Moreover, an average of 2,176,131, 1,302,927, and 1,530,285 SNPs were discovered in Landrace, Erhualian, and Meishan pig populations, respectively (Table 1 and Supplementary Table S4). A series of quality control filtering of SNPs were performed to identify 165,670 SNPs used in the further analysis (Supplementary Table S5). Specifically, 161,377 autosomal SNPs were included in linkage disequilibrium and differentially selected region (DSR) analyses, while the 4,293 SNPs on chromosome X were used for DSRs analysis only (Supplementary Figure S2). To reduce the effect from ascertainment bias, a subset of 107,626 autosomal SNPs with minor allele frequency (MAF) ≥ 0.2 was produced and used in genetic structure analysis (Supplementary Figure S2).

Measuring of linkage disequilibrium by chromosomes

Linkage disequilibrium (r2) for pairs of loci was measured by chromosomes in each pig population. As shown in Supplementary Figure S3, the average r2 values dropped quickly when physical distances reached 40 kilo base pairs (Kb). With the increasing of physical distance, the similar decreasing trends of the average r2 values were observed among three populations. In addition, average r2 values of Landrace and Meishan were higher than that of Erhualian.

Accessing genetic structure, heterozygosity and gene diversity

As expected, population structure analysis revealed that Landrace and Meishan formed the first two independent populations (K = 2) followed by the Erhualian (K = 3) (Fig. 1A). Moreover, a multidimensional scaling plot clearly showed that Landrace was a distant population compared with Erhualian and Meishan (Fig. 1B). Based on pair-wise estimates of F ST described by Weir and Cockerham[28], the significant genetic differentiation appeared between Landrace and Erhualian (F ST = 0.5480), between Landrace and Meishan (F ST = 0.5800), and between Erhualian and Meishan (F ST = 0.2335), respectively, demonstrating that Erhualian and Meishan hold closer genetic relationship compared to Landrace. As shown in Supplementary Figure S4, observed heterozygosity of Landrace, Erhualian, and Meishan was 0.1956, 0.2410, and 0.2432, respectively, while the gene diversity of Landrace, Erhualian, and Meishan was 0.2276, 0.2952, and 0.2776, respectively. Erhualian and Meishan had the higher observed heterozygosity and gene diversity than that of Landrace.
Figure 1

Population structure analysis of the genetic differentiation among Landrace, Erhualian, and Meishan. (A) Population structure among Landrace, Erhualian, and Meishan using STRUCTURE software. (B) Multidimensional scaling plots among Landrace, Erhualian, and Meishan using PLINK software. F ST represents the pair-wise F ST between any two pig populations.

Population structure analysis of the genetic differentiation among Landrace, Erhualian, and Meishan. (A) Population structure among Landrace, Erhualian, and Meishan using STRUCTURE software. (B) Multidimensional scaling plots among Landrace, Erhualian, and Meishan using PLINK software. F ST represents the pair-wise F ST between any two pig populations.

Identification of DSRs between Landrace and the two Chinese indigenous pig breeds

To obtain the genomic difference among different pig breeds, a SNP-based F ST estimation was performed to identify DSRs, which defined regions with shared differentially selection signals across breeds. As these two Chinese indigenous pig breeds have similar origin, Erhualian and Meishan were firstly merged as one population compared with Landrace. As a result, a total of 268 DSRs were identified between Landrace and the two Chinese indigenous pig breeds, which included 237 DSRs in autosomes and 31 DSRs in chromosome X (Supplementary Table S6). The DSR (named region 135) with both the richest top SNPs and the highest W value was located on chromosome 7 (Table 2 and Fig. 2A–C). Moreover, 855 unique known genes, including 511 genes from autosomal DSRs and 344 genes from X chromosome DSRs (Table 2 and Supplementary Table S6), were discovered further, which included many genes associated with economically important traits, i.e., ESR1 and KIT. Furthermore, the gene functional analysis showed that the biological processes between Landrace and the two Chinese indigenous pig breeds were enriched in flavonoid related metabolic process, pigment biosynthetic process, response to hormone, and regulation of developmental process (Supplementary Table S7).
Table 2

Top 20 percent of autosomal differentially selected regions (DSRs) among Landrace, Erhualian, and Meishan.

RegionW valueChrPositionTop 0.5%Top 5%GeneCandidates
DSRs between Landrace and the two Chinese indigenous pig breeds
28.12116639564..168943730161 ESR1
128.021125860631..1262372341135 ALDH1A2,AQP9
157.821157931791..1586470022168 NA
279.071245581640..2464934273206 SMARCA2
3411.891248939813..24934167111173 FAM189A2,C1H9orf135
517.561305534056..305560192491 RAPGEF1
638.742132479091..132493356471 CSNK1G3
648.292139541646..139596463092 ACSL6,MEIKIN
687.562153497601..153536938061 PRELID2
759.52340440733..406713079725 SRRM2,TCEB2,BICDL2, FLYWCH1,CCDC64B, LOC100737121,LOC102158577
778.60341967307..428420439448 KCTD5,UNKL,PDPK1,RNPS1
788.41342957871..43071250472 IARS
8211.03377090252..773744237157 ARHGAP25,BMP10, APLF
837.87378080175..78118235260 NA
897.86447500407..47599740270 NA
927.69453117484..53787644372 MMP16
937.95455932479..56325201333 CA2,LOC100154871
979.45490369121..903991190172 TBX19,SFT2D2
1228.81680274787..80286579620 NA
1237.60680425263..80462131430 NA
1267.93682230546..823558610101 NA
1279.18682692715..82761603097 TXLNA,LOC100739580
1289.25682958401..83041048184 ZBTB8A,RBBP4, ZBTB8OS
12911.94683940808..840776746141 LOC100523068
1309.63684350867..84495848391 C6H1orf94
13516.74755136..27480711231 EXOC2
1418.48760246223..60417535383 ANPEP,ARPIN
1597.89855208909..55760237567 LOC102159670
16010.37857463293..5802701111112 HOPX,THEGL, REST
1627.70860649748..62169164940 NA
16610.48869657439..699889364167 CENPC,UBA6, LOC100512727
1677.86870578162..712335669410 AMTN,UGT2A1,LOC100515222
16910.27871666052..720138051343 SLC4A4,LOC100739482
1758.188148318210..148486319758 CSN1S2,ODAM, PRR27
1779.10944454513..44465001521 DIXDC1
1827.721152521631..52714330362 NA
1868.961220061666..20149830274 BRCA1
1897.581342228668..42395388072 CCDC66,FAM208A
1918.4113194337969..194393374080 NA
1928.7613200757712..2013534850131 NA
1958.4613210377658..210418642090 NA
1978.7013216483385..216515468622 NA
1989.561422924788..23026160090 NA
2038.8614135519785..135551251091 NA
2057.9114137758835..137835354351 CCDC172
2079.2914144278853..1443305210122 BUB3
2339.621843951415..443442723102 BBS9,LOC100738822
DSRs between Erhualian and Meishan
79.63115312287..15312564420 NA
2910.25226179507..264974903150 NA
3510.892122905147..123003827370 NA
4510.222143611291..143611845171 SLC25A48
469.5134355611..4371774512 RADIL,MMD2
529.87390528360..90824884338 CCDC104,CLHC1, SMEK2, MTIF2
5611.873102297392..102494238454 PPM1B,PREPL, SLC3A1
589.913102678119..102743126421 LRPPRC
599.833106283461..106442746181 SLC8A1
619.973113055246..113120757411 LOC100626840
6410.873115856309..116048881674 YPEL5
678.83418251390..18266000060 NA
759.1155199728..5220416342 SGSM3,MKL1
7810.20519478109..19576660481 NA
9810.376102930166..103055274512 ZNF521
9915.016103154776..1035288191402 NA
10012.806103780912..104046778934 PSMA8,KCTD1,TAF4B
10411.466124785239..124938047642 PTGFR
11010.20734828000..34927976312 GRM4
1119.08739781167..39790945330 NA
11312.53791745010..91808309650 NA
11710.977105180004..105206797332 IFT43,TGFB3
1189.18810911848..10912137402 PROM1,LOC102160409
1219.71813697594..14004707320 NA
13414.91940038695..402282681228 CWF19L2,ELMOD1, ALKBH8,LOC100626234
1388.79995612266..95753391070 NA
13912.759105189739..105220813700 NA
14310.809142936175..1430290982100 NA
14610.06103770676..3770960150 NA
1479.171031350537..31364218221 C10H9orf3
1648.861150583012..509645152172 KLF12,LOC100622863
1779.001246196899..46237519460 NA
18014.141251152923..51247795802 NA
1829.991312498982..12503925071 THRB
18311.921313052089..13079543090 NA
18515.3513189288665..1893545901101 LIPI
18612.3913190678701..1907003370130 NA
18810.5113216737510..216756384230 NA
19216.191416460290..164865691050 NA
19511.061421329899..21678452584 CLCN3,NEK1
20110.681447963670..485813781115 MN1,PITPNB,TTC28
20212.121448900016..4966310921517 EWSR1,KREMEN1,NIPSP1, RHBDD3
21810.3915152011389..152018640221 UBE2F
2238.721624195627..244179362151 NA
2309.001682918895..82946838320 NA
2358.671711918899..11919532070 NA
2369.231715112581..15187900333 TMEM230,CDS2
2379.471715425824..15480579181 GPCPD1
24010.061747494168..47511555411 PPP1R16B
2529.811841156162..41171465262 AOAH

Note: The candidate genes were given within the SNPs of top SNP (0.5% or 5%) for each region.

Figure 2

Global distribution of F ST among Landrace, Erhualian, and Meishan. (A) Global distribution of F ST between Landrace and the two Chinese indigenous pig breeds on autosomes 1–18. A representative DSR (region 135 of SSC7) with both the richest top SNPs and the highest W value was indicated. (B) The W-statistics was performed for DSRs between Landrace and the two Chinese indigenous pig breeds. (C) Smoothed F ST showed strong selection signals in region 135 between Landrace and the two Chinese indigenous pig breeds. L-EM represents Landrace-the two Chinese indigenous pig breeds, while E-M means Erhualian-Meishan. (D) Global distribution of F ST between Erhualian and Meishan on autosomes 1–18.

Top 20 percent of autosomal differentially selected regions (DSRs) among Landrace, Erhualian, and Meishan. Note: The candidate genes were given within the SNPs of top SNP (0.5% or 5%) for each region. Global distribution of F ST among Landrace, Erhualian, and Meishan. (A) Global distribution of F ST between Landrace and the two Chinese indigenous pig breeds on autosomes 1–18. A representative DSR (region 135 of SSC7) with both the richest top SNPs and the highest W value was indicated. (B) The W-statistics was performed for DSRs between Landrace and the two Chinese indigenous pig breeds. (C) Smoothed F ST showed strong selection signals in region 135 between Landrace and the two Chinese indigenous pig breeds. L-EM represents Landrace-the two Chinese indigenous pig breeds, while E-M means Erhualian-Meishan. (D) Global distribution of F ST between Erhualian and Meishan on autosomes 1–18.

DSRs associated with female characters between Landrace and the two Chinese indigenous pig breeds

In order to further investigate DSRs or genes related to female characters, a female based genome-wide screening was performed between Landrace and the two Chinese indigenous pig breeds. A dataset with MAF ≥ 0.1 containing 175,952 SNPs from autosomes and 8,320 SNPs from chromosome X was used in this analysis. Based on the SNP F ST estimates, 241 DSRs in autosomes and 60 DSRs in chromosome X were revealed. Further examination of these DSRs identified 443 unique known genes from autosomes and 327 genes from chromosome X (Supplementary Table S8). GO terms of these genes from DSRs were involved in chromatin organization, odontogenesis, animal organ development, and reproductive system development (Supplementary Table S9).

Characterization of DSRs between Erhualian and Meishan

A genome-wide distribution of F ST between Erhualian and Meishan on autosomes is shown in Fig. 2D. Between these two pig breeds, 256 DSRs, including 252 DSRs in autosomes and 4 DSRs in chromosome X, were discovered. Moreover, 341 unique known genes from autosomal DSRs and 6 genes from X chromosome DSRs were discovered further (Supplementary Table S10). The gene enrichment analysis found that DSRs between Erhualian and Meishan were enriched in sodium-independent organic anion transport, glycerophospholipid metabolic process, and phosphorus or phospholipid related metabolic process (Supplementary Table S11).

Discussion

In the present study, we used SLAF-seq to analyze the genetic structure and differentially selected regions among Landrace, Erhualian, and Meishan pigs. By using over 160,000 SNPs developed in this study, our results clearly indicated that there was a closer genetic relationship between Erhualian and Meishan compared with the genetic relationship between Landrace and Erhualian or Meishan. Moreover, 268 DSRs including 855 genes were discovered between Landrace and the two Chinese indigenous pig breeds, while 256 DSRs including 347 genes were found between Erhualian and Meishan. As such, our data provide a comprehensive view of the genome-wide basis for genetic difference among these pig breeds. Basically, as a high-resolution method for genome-wide genotyping, SLAF-seq is considered as an enhanced reduced representation sequencing and possesses several remarkable features[18], i.e., accurate genotyping by depth paired-end sequencing, reduced sequencing costs, pre-design strategy for fragmentation efficiency, and double barcode system for large populations. In this study, a total of 516,733 SLAFs from the pig reference genome (Sscrofa 10.2) were predicted using RsaI and HaeIII enzyme combination. With more than 6 × sequencing depth but below $80 cost for each individual, 5,373,997 SNPs were discovered from Landrace, Erhualian, and Meishan pig populations by SLAF-seq. After filtering, 165,670 SNPs were available for analyzing the genetic difference among these pig populations, which would have an average inter-marker distance of approximately 17 Kb and several multiples of the number of SNPs from Illumina PorcineSNP60 Genotyping BeadChip. Compared with the published pig GGRS, which contained 34,789–105,550 available SNPs for different objective analysis[14, 17], our SLAF-seq method might provide more genomic variation information. More importantly, our data explored the genetic difference between Landrace and the two Chinese pig breeds using reduced representation sequencing strategy, which has not been well studied by GGRS. Considering the huge difference in genome variation between European and Chinese pig breeds, our data demonstrate that SLAF-seq is a powerful approach and has great potential for further studies in more breeds. Therefore, the SLAF-seq approach could be considered as a more competitive choice for pig genome study. It is well known that Landrace was originally developed in Denmark, while Erhualian and Meishan were domesticated in China, implying that Landrace was as a geographically distant breed relative to Erhualian and Meishan. As expected, both population structure and cluster analysis showed that Landrace was genetically distant from two Chinese indigenous pig breeds (Fig. 1). Moreover, pair-wise F ST estimation supported Landrace had distant genetic relationship compared with two Chinese indigenous pig breeds (F ST = 0.5480 for Erhualian and F ST = 0.5800 for Meishan), demonstrating that there was a great genetic differentiation between Landrace and two Chinese indigenous pig breeds. These results were consistent with the data from other genetic markers such as mitochondrial DNA and microsatellite markers[29, 30], which showed European and Chinese pig breeds exist long genetic distances. In particular, we found there was close but significant genetic differentiation (F ST = 0.2335) between Erhualian and Meishan. Actually, a significant genetic differentiation between Erhualian and Meishan had been observed using GGRS methods[17]. Therefore, our data provide a further evidence that Erhualian and Meishan should be two pig breeds, although they had been classified as the same pig breed named Taihu pigs in the annals of history as both pig breeds lived in Taihu Lake regions, eastern of China. Additionally, higher observed heterozygosity and gene diversity were found in these two Chinese indigenous pig breeds compared with Landrace, implying that these two Chinese indigenous pig breeds had more abundant genetic diversity. This is not surprising because Chinese indigenous breeds had more genetic diversity than that of European pig breeds[30]. Accordingly, this study provides a global view of the genetic structure and relationship among these three pig populations. Generally speaking, Erhualian and Meishan are two of the most prolific pig breeds known in the world[24, 31, 32] and have been mainly considered as the maternal line for synthetic strain selection. Meanwhile, these two pig breeds are used to build specific populations for identifying candidate genes or QTLs (quantitative trait loci) related to reproductive traits, i.e., Iberian x Meishan F2 population[33], Large White × Meishan cross gilts[34], and Duroc x Erhualian resource population[35]. Intuitively, compared to Landrace, a major of the maternal sources in European pig breeds, Erhualian and Meishan pigs possess the better farrowing capacity, which results in far more litter size[36, 37]. To further identify the candidates related to female reproductive traits, a female based dataset was used at the same time. Compared with the candidate genes from all individuals, a total of 502 genes from female based DSRs were found overlapping (Supplementary Tables S6 and S8). Many important genes related to female reproductive traits such as ESR1, RBBP4, and BRCA1, were discovered in both DSR datasets. In the previous study, as a key gene for mediating estrogen, ESR1 with its PvuII site polymorphism had been shown to be associated with litter size in different pig populations, i.e., Meishan synthetic line[38], Landrace population[39], a population of Erhualian and Xiang pig breeds[40], and Large White x Meishan F2 crossbred gilts[41]. However, other studies did not find significant associations of ESR1 with litter size in a synthetic line of Duroc and Large White origin or a Meishan x Large White F2 population[42, 43], demonstrating that the role of ESR1 in sow prolificacy among different pig populations was still controversial. Therefore our data might provide additional information for interpreting the difference in litter size between Landrace and the two Chinese indigenous pig breeds. RBBP4, a biomarker of oocyte competence, had different expression levels between in vivo- and in vitro-matured oocytes in bovine[44] and could affect bipolar spindle assembly by regulating histone deacetylation during oocyte maturation in mouse[45]. However, little is known about the function of RBBP4 in pig reproductive process. Our data suggest that RBBP4 might have an important role in sow reproductive performance, but confirmation of this assumption warrants further study. Interestingly, BRCA1, a key DNA repair protein, was identified in DSRs. Previously, genetic variants of BRCA1 were mainly related to breast or ovarian related diseases in human and cattle[46, 47]. Besides, reducing numbers of primordial follicles and simultaneously increasing DNA double-strand break repair with age were observed in BRCA1-deficient mice compared to the wild-type mice, showing that BRCA1-deficient led to ovarian aging[48, 49]. In women, BRCA1 mutation produced lower numbers of eggs compared with control[50]. These data indicated BRCA1 might play a critical role in regulating ovarian development and the number of eggs. Hence our results might give us a novel clue for its role in pig reproductive performance. Furthermore, a DSR with both the richest top SNPs and the highest W value in autosomes was found on Sus scrofa chromosome 7 (SSC7, 0.05 to 0.27 Mb) between Landrace and the two Chinese indigenous pig breeds (Fig. 2, Table 2 and Supplementary Table S6). Interestingly, EXOC2, a component of the exocyst complex, was found in this region. Previous studies indicated that EXOC2 played a role in filopodia formation by binding the GTPase RalA in Swiss 3T3 cells and mediated vesicle trafficking by interacting with the Rho guanine nucleotide exchange factor (GEF)-H1 in HeLa cells[51, 52]. Additionally, genetic variants of EXOC2 or adjacent EXOC2 had been considered as contributing to pigmentary traits such as hair color, skin pigmentation and tanning ability in Europeans as well as vitamin D status in the Caucasian population[53-57]. In this study, Landrace has white hair color and skin, while Erhualian and Meishan have colored hair and skin[37], indicating that there is great difference in pigmentary traits between Landrace and the two Chinese indigenous pig breeds. Our data suggest that EXOC2 might be a good pigmentation trait candidate of pigs. However, the biological mechanism of EXOC2 has not been well characterized in regulating human pigmentation. Moreover, little is known about the role of EXOC2 in pigs. Therefore, confirmation and elucidation the role of EXOC2 in pigmentation trait in pigs requires further investigation. Actually, another key well-documented pigmentary gene, KIT was also discovered in DSRs (Supplementary Table S6). In the past two decades, KIT had been deeply investigated for its mutation associated with coat color of the dominant white in pigs[58-60]. Moreover, as a colored breed, Meishan had been found to have different KIT allele distinguish from white breed such as Landrace and Large White[58, 60]. Obviously, our data provide further evidence for the role of KIT in pigmentary traits. More interestingly, we discovered some genes associated with growth or meat quality traits in DSRs, i.e., ACSL6, RAPGEF1, BMPER, and KDR (Table 2 and Supplementary Table S6). ACSL6, a member of the long-chain acyl-CoA synthetase gene family for catalyzing the formation of acyl-CoA from fatty acids, ATP, and CoA, was shown to be associated with dry matter intake and metabolic mid-test body weight in cattle[61]. RAPGEF1, a guanine nucleotide releasing protein, was discovered to play an important role in skeletal muscle differentiation[62] and was significantly associated with type 2 diabetes in the Korean population and Finns[63, 64]. These studies demonstrated that ACSL6 and RAPGEF1 might have important function in body growth or development of skeletal muscle and fat. Hence our results might provide new ideas for their function in affecting pig growth and development. In addition, BMPER and KDR, two key genes in the development of angiogenesis and vasculogenesis[65, 66], were identified in DSRs. It is not surprising because our previous study had found genetic variants in the promoters of BMPER and KDR affected intramuscular fat deposition in Erhualian pigs[67, 68]. Moreover, BMPER was found to be associated with rump length and body size in Qinchuan cattle population[69]. Here our data suggest that BMPER and KDR might contribute to the regulation of pig growth and meat quality, but additional studies are needed to further confirm this speculation. In a word, our results provide a comprehensive clue for discovering the genetic difference between Landrace and the two Chinese indigenous pig breeds. As described above, Erhualian and Meishan are two famous pig breeds with the typical characters of Chinese indigenous breeds such as high productive performance and good meat quality. However, as two different pig breeds, some characteristics, i.e., head wrinkles and colors of limbs and nose[37], are very different between these two pig breeds. In this study, the genetic differentiation (F ST = 0.2335) between Erhualian and Meishan was observed, further confirming that there exists a significant genetic difference between these two pig breeds. But until now, only variants from single gene such as FUT1 have been used to compare the difference in gene frequencies between these two pig breeds[70]. In a genome-wide level, the genetic difference between these two pig breeds is still largely unknown. Our present study identified 256 DSRs containing 347 genes between Erhualian and Meishan. Importantly, GO terms associated with these genes were enriched in sodium-independent organic anion transport, glycerophospholipid metabolic process, and phospholipid related metabolic process. Some important candidates, i.e., SLCO3A1 in sodium-independent organic anion transport, CDS2, CRLS1, and MTMR7 in glycerophospholipid metabolic process, and SPTLC1, SGMS2, and CRLS1 in phospholipid related metabolic process, were discovered from enrichment analysis. The difference of the biological processes between these two pig breeds might result from a long-term artificial selection based on different breeding objectives, but further investigation of the cause of breed difference is required. In summary, a SLAF-seq approach for pigs was developed to reveal the genetic structure and relationship among Landrace, Erhualian, and Meishan pigs. Meanwhile, DSRs including hundreds of genes between Landrace and the two Chinese indigenous pig breeds and between Erhualian and Meishan were identified further. Consequently, our study not only provides a cost-effective approach for pig genome-wide screening, but also establishes the genetic basis for further investigating important gene functions from DSRs among these pig breeds in future research.

Materials and Methods

Animal

Blood or ear tissue samples were collected from 28 Landrace (8 male and 20 female), 26 Erhualian (8 male and 18 female), and 27 Meishan (Small Meishan, 8 male and 19 female) at the Xiang Xin livestock and poultry Co., Ltd (Shanghai), Changzhou Erhualian production cooperation (Jiangsu), and Taicang pig breeding farm (Jiangsu), respectively. Total DNA was extracted by the phenol-chloroform extraction method, with concentration and purity measured using the NanodropTM 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA) and electrophoresis. All animal care and handling procedures were approved by the Animal Ethics Committee at Nanjing Agricultural University, China. All methods were performed in accordance with the guidelines and regulations of the Animal Ethics Committee at Nanjing Agricultural University.

SLAF library construction and sequencing

A simulated restriction enzyme digestion was carried out on the current pig genome (Sscrofa 10.2) to identify expected SLAF yield, avoid repetitive SLAFs, and obtain the relatively uniform distribution of restriction fragments in the genome. As a result, genomic DNA was digested with RsaI and HaeIII restriction enzyme combination. Meanwhile, in order to assess the experimental procedure, Oryza sativa indica (http://rapdb.dna.affrc.go.jp/) was used as a control for evaluating the effectiveness of enzyme digestion and paired-end mapped reads. In brief, SLAF library construction and sequencing for each individual was conducted as described previously[18] with slight modifications: DNA fragments of 314–364 base pair (bp) were selected as SLAFs and used for paired-end sequencing by Illumina HiSeq 2500 system (Illumina, Inc., San Diego, CA, USA) at Beijing Biomarker Technologies Corporation.

Genome mapping, SNP calling and filtering

Raw paired-end reads were mapped to the pig reference genome (Sscrofa 10.2) using BWA software[71]. In general, SLAF groups produced by reads were mapped to the same position. If an accession was only partly digested by the restriction enzymes, reads mapped to the reference genome should have overlaps with two SLAF tags and were assigned to both of the SLAF tags in the same accession. SNP calling was performed by both GATK and samtools analysis[72, 73], and a locus was defined as a SNP if it was simultaneously called from these two packages. Filtering high-quality SNPs was conducted by PLINK v1.07[74]. The analysis of Hardy-Weinberg Equilibrium (HWE) for SNPs from autosomes and chromosome X was performed by Pedstats tools and HWE package for R[75, 76], respectively.

Population genetic basics analysis

Population structure analysis was performed using STRUCTURE with 10,000 iterations via the correlated allele model[77], and then plotted by DISTRUCT software[78]. Linkage disequilibrium analysis was computed between each marker pair within each breed separately using Haploview 4.1[79]. The population relatedness of pair-wise F ST and gene diversity were calculated using the HIERFSTAT package for R. Observed heterozygosity and IBS matrix of distance were estimated by PLINK v1.07[74], and then multidimensional scaling (MDS) analysis of autosomal SNPs was determined by R 3.2.4.

Detection of differentially selected regions (DSRs)

The DSR algorithm was carried out as described previously[3, 80] with slight modifications: 1) raw values were ranked; 2) Fisher’s exact test was executed in R 3.2.4 to compare the allele frequencies between Landrace and the two Chinese indigenous pig breeds and between Erhualian and Meishan, respectively. SNPs with P values < 0.05 were considered as statistically significant after Bonferroni correction; 3) F ST of SNP was estimated using the model proposed by Nicholson et al.[81] and Flori et al.[82], and then the significant SNPs with 0.5% or 5% highest F ST values were selected as the top significant SNPs; 4) by placing the top significant SNPs (0.5%) in the center of a DSR, adjacent SNPs were collected to determine the region boundaries until more than two consecutive SNPs were not in the top significant 5% threshold. The overlapped DSRs were combined as the same DSR. When a region contained more than five SNPs from the top significant SNPs (5%) but without top significant SNPs (0.5%), we also considered it as a DSR in our study. In addition, the W-statistics was used to identify the relative importance of DSRs[83] and smoothed F ST values were obtained by a local variable bandwidth kernel estimator[84].

Gene functional enrichment analysis

Genes in these DSRs were identified using the Sscrofa 10.2 assembly (www.animalgenome.org/cgi-bin/gbrowse/pig) and NCBI database. The functional enrichment analysis of target genes was performed using Panther bioinformatics resources (www.pantherdb.org). Terms with P values less than 0.01 and the enrichment value more than 1 were selected as significant or enriched terms.

Data accessibility

The datasets have been submitted to the SRA database of NCBI (accession number SRP090907). Supplementary Figures Dataset 1 Dataset 2 Dataset 3 Dataset 4 Dataset 5 Dataset 6 Dataset 7 Dataset 8 Dataset 9 Dataset 10 Dataset 11
  79 in total

1.  RBBP4 regulates histone deacetylation and bipolar spindle assembly during oocyte maturation in the mouse.

Authors:  Ahmed Z Balboula; Paula Stein; Richard M Schultz; Karen Schindler
Journal:  Biol Reprod       Date:  2015-03-18       Impact factor: 4.285

2.  The effect of variants in the promoter of BMPER on the intramuscular fat deposition in longissimus dorsi muscle of pigs.

Authors:  Zhi Liu; Wenxing Sun; Yongyan Zhao; Chunying Xu; Yingying Fu; Yan Li; Jie Chen
Journal:  Gene       Date:  2014-03-22       Impact factor: 3.688

3.  Genome-wide study refines the quantitative trait locus for number of ribs in a Large White × Minzhu intercross pig population and reveals a new candidate gene.

Authors:  Long-Chao Zhang; Jing-Wei Yue; Lei Pu; Li-Gang Wang; Xin Liu; Jing Liang; Hua Yan; Ke-Bin Zhao; Na Li; Hui-Bi Shi; Yue-Bo Zhang; Li-Xian Wang
Journal:  Mol Genet Genomics       Date:  2016-06-15       Impact factor: 3.291

4.  Molecular basis for the dominant white phenotype in the domestic pig.

Authors:  S Marklund; J Kijas; H Rodriguez-Martinez; L Rönnstrand; K Funa; M Moller; D Lange; I Edfors-Lilja; L Andersson
Journal:  Genome Res       Date:  1998-08       Impact factor: 9.043

5.  Genetic determinants of hair, eye and skin pigmentation in Europeans.

Authors:  Patrick Sulem; Daniel F Gudbjartsson; Simon N Stacey; Agnar Helgason; Thorunn Rafnar; Kristinn P Magnusson; Andrei Manolescu; Ari Karason; Arnar Palsson; Gudmar Thorleifsson; Margret Jakobsdottir; Stacy Steinberg; Snaebjörn Pálsson; Fridbert Jonasson; Bardur Sigurgeirsson; Kristin Thorisdottir; Rafn Ragnarsson; Kristrun R Benediktsdottir; Katja K Aben; Lambertus A Kiemeney; Jon H Olafsson; Jeffrey Gulcher; Augie Kong; Unnur Thorsteinsdottir; Kari Stefansson
Journal:  Nat Genet       Date:  2007-10-21       Impact factor: 38.330

6.  Genome-wide association study of growth traits in Jinghai Yellow chicken hens using SLAF-seq technology.

Authors:  W H Wang; J Y Wang; T Zhang; Y Wang; Y Zhang; K Han
Journal:  Anim Genet       Date:  2015-09-13       Impact factor: 3.169

7.  Whole-genome resequencing analyses of five pig breeds, including Korean wild and native, and three European origin breeds.

Authors:  Jung-Woo Choi; Won-Hyong Chung; Kyung-Tai Lee; Eun-Seok Cho; Si-Woo Lee; Bong-Hwan Choi; Sang-Heon Lee; Wonjun Lim; Dajeong Lim; Yun-Gyeong Lee; Joon-Ki Hong; Doo-Wan Kim; Hyeon-Jeong Jeon; Jiwoong Kim; Namshin Kim; Tae-Hun Kim
Journal:  DNA Res       Date:  2015-06-27       Impact factor: 4.458

8.  SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing.

Authors:  Xiaowen Sun; Dongyuan Liu; Xiaofeng Zhang; Wenbin Li; Hui Liu; Weiguo Hong; Chuanbei Jiang; Ning Guan; Chouxian Ma; Huaping Zeng; Chunhua Xu; Jun Song; Long Huang; Chunmei Wang; Junjie Shi; Rui Wang; Xianhu Zheng; Cuiyun Lu; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

9.  A genome-wide association study identifies novel alleles associated with hair color and skin pigmentation.

Authors:  Jiali Han; Peter Kraft; Hongmei Nan; Qun Guo; Constance Chen; Abrar Qureshi; Susan E Hankinson; Frank B Hu; David L Duffy; Zhen Zhen Zhao; Nicholas G Martin; Grant W Montgomery; Nicholas K Hayward; Gilles Thomas; Robert N Hoover; Stephen Chanock; David J Hunter
Journal:  PLoS Genet       Date:  2008-05-16       Impact factor: 5.917

10.  High-Density Genetic Mapping Identifies New Major Loci for Tolerance to Low-Phosphorus Stress in Soybean.

Authors:  Dan Zhang; Hongyan Li; Jinshe Wang; Hengyou Zhang; Zhenbin Hu; Shanshan Chu; Haiyan Lv; Deyue Yu
Journal:  Front Plant Sci       Date:  2016-03-30       Impact factor: 5.753

View more
  5 in total

1.  Genomic structure of a crossbred Landrace pig population.

Authors:  Letícia Borges Joaquim; Tatiane Cristina Seleguim Chud; Jorge Augusto Petroli Marchesi; Rodrigo Pelicioni Savegnago; Marcos Eli Buzanskas; Ricardo Zanella; Mauricio Egidio Cantão; Jane Oliveira Peixoto; Mônica Correa Ledur; Renato Irgang; Danísio Prado Munari
Journal:  PLoS One       Date:  2019-02-28       Impact factor: 3.240

2.  Genome-wide association study identifying genetic variants associated with carcass backfat thickness, lean percentage and fat percentage in a four-way crossbred pig population using SLAF-seq technology.

Authors:  Huiyu Wang; Xiaoyi Wang; Dawei Yan; Hao Sun; Qiang Chen; Mingli Li; Xinxing Dong; Yuchun Pan; Shaoxiong Lu
Journal:  BMC Genomics       Date:  2022-08-15       Impact factor: 4.547

3.  Genome-wide detection of copy number variation in American mink using whole-genome sequencing.

Authors:  Pourya Davoudi; Duy Ngoc Do; Bruce Rathgeber; Stefanie M Colombo; Mehdi Sargolzaei; Graham Plastow; Zhiquan Wang; Karim Karimi; Guoyu Hu; Shafagh Valipour; Younes Miar
Journal:  BMC Genomics       Date:  2022-09-13       Impact factor: 4.547

4.  Retrotransposon Insertion Polymorphisms (RIPs) in Pig Reproductive Candidate Genes.

Authors:  Zhanyu Du; Enrico D'Alessandro; Emmanuel Asare; Yao Zheng; Mengli Wang; Cai Chen; Xiaoyan Wang; Chengyi Song
Journal:  Genes (Basel)       Date:  2022-07-28       Impact factor: 4.141

5.  Analysis of Population Structure and Differentially Selected Regions in Guangxi Native Breeds by Restriction Site Associated with DNA Sequencing.

Authors:  Zhuliang Yang; Jixian Deng; Dongfeng Li; Tiantian Sun; Li Xia; Wenwen Xu; Linghu Zeng; Hesheng Jiang; Xiurong Yang
Journal:  G3 (Bethesda)       Date:  2020-01-07       Impact factor: 3.154

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.