Literature DB >> 25327778

A genome-wide scan for selection signatures in Yorkshire and Landrace pigs based on sequencing data.

Zhen Wang1, Qiang Chen, Yumei Yang, Hongjie Yang, Pengfei He, Zhe Zhang, Zhenliang Chen, Rongrong Liao, Yingying Tu, Xiangzhe Zhang, Qishan Wang, Yuchun Pan.   

Abstract

Pigs have experienced dramatic selection due to domestication, which has led to many different phenotypes when compared to their wild counterparts, especially in the last several decades. Currently, genome-wide scans in both cattle and humans showing positive selection footprints have been investigated. However, few studies have focused on porcine selection footprints, particularly on a genome-wide scale. Surveying for selection footprints across porcine genomes can be quite valuable for revealing the genetic mechanisms of phenotypic diversity. Here, we employed a medium sequencing depth (5-20x/site per individual, on average) approach called genotyping by genome reducing and sequencing (GGRS) to detect genome-wide selection signatures of two domestic pig breeds (Yorkshire and Landrace) that have been under intensive selection for traits of muscle development, growth and behavior. The relative extended haplotype homozygosity test, which identifies selection signatures by measuring the characteristics of haplotypes' frequency distribution within a single population, was also applied to identify potential positively selected regions. As a result, signatures of positive selection were found in each breed. However, most selection signatures were population specific and related to genomic regions containing genes for biological categories including brain development, metabolism, growth and olfaction. Furthermore, the result of the gene set enrichment analysis indicated that selected regions of the two breeds presented a different over-representation of genes in the Gene Ontology annotations and Kyoto Encyclopedia of Genes and Genomes pathways. Our results revealed a genome-wide map of selection footprints in pigs and may help us better understand the mechanisms of selection in pig breeding.
© 2014 The Authors. Animal Genetics published by John Wiley & Sons Ltd on behalf of Stichting International Foundation for Animal Genetics.

Entities:  

Keywords:  REHH test; genome reducing and sequencing; pig genome; selective sweep

Mesh:

Year:  2014        PMID: 25327778      PMCID: PMC4276287          DOI: 10.1111/age.12229

Source DB:  PubMed          Journal:  Anim Genet        ISSN: 0268-9146            Impact factor:   3.169


Introduction

Yorkshire and Landrace, two commercial breeds of pigs used worldwide, have been subject to intensive selection for particular production attributes in the last decades. With this in mind, identifying genomic loci under selection that correlate with these production attributes would be beneficial for future pig breeding as well as for the identification of porcine genes related to biological processes and traits of interest. The advent of high-throughput and cost-effective genotyping techniques allows us to thoroughly explore the patterns of genetic variation in domestic animals at the genome level. One of the strategies for studying genetic variation has been carried out of the genome to phenotypes and aims mainly to detect selection signatures based on identified patterns of linkage disequilibrium (LD; Ennis 2007), which are inconsistent with the hypothesis of genetic neutrality. As the concept of a selective sweep was first proposed by Smith & Haigh (1974) to detect selection signatures, it has been validated by other researchers (Barton 1995; Durrett & Schweinsberg 2004; Przeworski et al. 2005; Pennings & Hermisson 2006). It was assumed that, under the conditions of selection, the frequency of a beneficial allele rises and the allele is driven to fixation within a short period of time in a random-mating population of constant size. So far, different statistics have been established to detect different kinds of selection signatures, for instance, extended haplotype homozygosity (EHH; Sabeti et al. 2002) for recent selection and Tajimas’ D (Tajima 1989) for relatively ancient selection. Although many methods, such as Tajima's D (Tajima 1989), the Hudson–Kreitman–Aguade (HKA) test (Hudson et al. 1987) and Fay and Wu's H test (Fay & Wu 2000), were developed to detect selection signatures; they were not designed for locating genome-wide SNPs. Some other methods, including FST (Wright 1965), EHH (Sabeti et al. 2002) and iHS (integrated haplotype score; Voight et al. 2006), are suitable for detecting selection signatures at the genome level. Unfortunately, their applications are limited as the high-throughput genotyping techniques were not widely used in domestic animals at that time. Recently, the development of sequencing techniques and SNP chips provides us with high-density markers and the ability to identify selection signatures genome wide. For example, genome-wide scans of selection signatures have been well studied in cattle (Prasad et al. 2008; Flori et al. 2009; Gautier et al. 2009). However, most of these studies applied SNP chips for the identification of SNP genotypes. Although SNP chips are widely used, this technique also brings some bias because of the small size of samples, which leads to a frequency-specific distortion in detecting SNPs. Additionally, Amaral et al.'s (2011) study showed that sequencing of genomic pools by next-generation sequencing was a cost-effective approach to identifying selection signatures without the effect of ascertainment bias. Consequently, next-generation sequencing provides us with the opportunity to estimate the genome-wide genetic diversity of a breed. To date, we have devised a new method for SNP genotyping, called genotyping by genome reducing and sequencing (GGRS), which is cost-effective, can genotype outbred species and is highly reproducible (Chen et al. 2013). This method can identify more than 70 000 SNPs for only $80 (USD)/sample. Among various statistics used in discovering positive selection signatures from the SNP data, iHS and REHH (relative extended haplotype homozygosity) are two of the most widely used tools (Enard et al. 2014; Fagny et al. 2014; Garke et al. 2014). Unfortunately, iHS requires both the genotype of the selected mutation and a known ancestor allele, which makes it difficult to apply in many cases. We therefore used the REHH test for our analysis (Walsh et al. 2006; Zhang et al. 2006). Yorkshire and Landrace may have selection footprints in their genome resulting from intensive selection for production attributes over the past few decades. To date, some research involving selective sweep analyses in pigs has revealed strong signatures of selection affecting genomic regions that harbor genes underlying economic traits such as body length, disease resistance, pork yield, muscle development and fertility (Amaral et al. 2011; Li et al. 2014). Some specific genes related to coat color (Johansson Moller et al. 1996; Fang et al. 2009), growth (Van Laere et al. 2003), RNA processing and regulation (Groenen et al. 2012), olfaction and hypoxia (Li et al. 2013) have also been found previously to show a correlation between domestication and selection. Therefore, identification of the regions that have been subjected to selective breeding would be beneficial for identifying genes related to traits of interest and biological processes. Artificial selection is not only an important aspect reflecting the domestication process, but also still important for the ongoing improvement of particular traits of value to humans during breed formation. Therefore, a scan of genome-wide selection signatures will help us identify porcine genes related to biological processes and traits of interest and, as well, allow us to better understand the mechanisms of selection in pig breeding. Here, we investigated patterns of selection in these two pig breeds and found several regions related to the traits of growth, muscle development and disease resistance. By analyzing nucleotide diversity, we aimed to identify genomic regions exhibiting signatures of selection and candidate genes reported in proximity to the genomic positions showing the most significant indications of selection and to gain further insight into the genome-wide footprints of pig selection. The functions associated with the putative genes under selection were also investigated by gene set enrichment analysis of Gene Ontology (GO) annotations and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways.

Materials and methods

DNA samples collection and sequencing data preparation

Unrelated or distantly related pigs were chosen according to pedigree information so as to have as few related individuals as possible in the sample set. A total of 70 DNA samples of Yorkshire (n = 34) and Landrace (n = 36) were obtained from breeding farms in Xiangxin, Shanghai, China. The protocol of acquiring each individual genotype, that is, DNA extraction, sequencing and SNP calling, was carried out using GGRS (http://klab.sjtu.edu.cn/GGRS/) (Chen et al. 2013), and the missing genotypes were imputed by iBLUP (http://klab.sjtu.edu.cn/iBLUP/) (Yang et al. 2014). The sequencing library (fragments ranging from 200 to 300 bp) was sequenced by an Illumina Hiseq2000 instrument with a paired-end (2 × 100 bp) pattern; the sequencing process is given in detail by the manufacturer (Illumina). GGRS is one approach of next-generation sequencing technology and can genotype species cost-effectively and with high reproducibility, especially for outbred species with a large genome size, for example, in the pig. Compared with SNP chips, it is not only able to identify novel SNPs, rather than genotypes of ascertainment, but also to discover high-density SNPs with lower cost. These advantages are a benefit to identifying selection signatures without the effect of ascertainment bias (Amaral et al. 2011). iBLUP is a genotype imputation method that imputes missing genotypes using identity-by-descent and linkage disequilibrium information. This method can impute missing genotypes with greater accuracy than can other common imputation methods, for example, beagle. Even at a high missing rate of 70%, it retained an accuracy of 0.95, whereas that of beagle is lower at 0.82 (Yang et al. 2014). Approximately 347 million 100-bp reads were generated from DNA samples. The raw reads with a base average quality score of at least 20 (error rate of base calling of 1 in 100) and the first 65 bp of at least 30 (error rate of base call of 1 in 1000) were aligned to the pig genome reference. A total of approximately 2.01% of the porcine genome met the alignment quality parameters (see GGRS, Chen et al. 2013), and the average sequencing depth of the whole genome was 5.97 × 102 170 SNPs, which could be identified in more than 30 samples included in the final analysis, with an average sequencing depth of more than 5×. The distance between approximately 88% pairs of adjacent SNPs was <50 Kb. The percentage of those with a distance of more than 150 Kb was <5% (Fig. S1). After imputing the missing genotypes using iBLUP, those genotyped SNPs were phased by fastphase (Scheet & Stephens 2006) for further positive selection analysis.

REHH test

The REHH test, which was first proposed by Sabeti et al. (2002), was used to detect the recent positive selection signatures by evaluating how LD decays across the genome. Under the pressure of positive selection, regions of selection present unusually rapid rises in allele frequency and long-range LD over a short period of time. Thus, Sabeti et al. (2002) defined a region of interest in the genome as the ‘core region’, which contains a set of ‘core haplotypes’ and has a strong LD among SNPs. Their general tests for selection are based on comparing a core haplotype with both higher frequency and higher EHH with other core haplotypes at the same locus. According to selection signatures theory, the core haplotypes harboring the beneficial allele would have a higher frequency due to the hitchhiking effect (Sabeti et al. 2002). For this reason, we discarded core haplotypes with a frequency <0.25. However, the long-range haplotype may be due to low local recombination rates rather than the recent positive selection. We therefore applied the REHH test (Sabeti et al. 2002), which corrects for local variation in recombination rate, to detect selection signatures. As fully phased haplotype data were required for the analysis, we first reconstructed haplotypes for every chromosome in the two breeds separately using the default parameters of fastphase (Scheet & Stephens 2006). Then, fully phased haplotype data were further analyzed using sweep v.1.1 (http://www.broadinstitute.org/mpg/sweep/index.html) with the default parameter marker H = 0.04 (one type of distance to match) to identify core regions. Furthermore, we placed the data in order, according to the frequency of all the haplotypes, into 20 bins to calculate the significance of the REHH value, obtaining P-values by log-transforming the REHH in the bin to reach normality and calculating the mean and standard deviation. Finally, core haplotypes with extreme REHH values (threshold level: P < 0.01) were regarded as significant.

Identifying QTL overlapping with selection signatures

The candidate regions (the top five with lowest P-values) of selection were regarded as overlapping if their locations were included within the QTL for porcine traits. We wrote a Perl script to identify the distribution of the candidate regions of selection in QTL using PigQTLdb (Hu et al. 2013).

Functional gene set enrichment analysis (FGSEA)

Our FGSEA of KEGG pathways and GO terms was performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID; Dennis et al. 2003; Huang da et al. 2009). First, we retrieved the Ensembl IDs of the genes, which were considered to be overlapping if their positions were contained inside the boundaries of 250 kb upstream or downstream of the candidate regions of selection, based on the annotation for Sus scrofa assembly 10.2 available from the Ensembl database (http://www.ensembl.org/index.html). Because of the incomplete GO annotation for the pig, corresponding human orthologous Ensembl IDs were retrieved using a Perl script and were used to get enrichment function categories. The DAVID was used to analyze enrichment in the KEGG pathways and the GO terms (http://www.genome.jp/kegg/, http://www.geneontology.org/). Finally, the enriched pathways with P-values <0.05 and GO terms with P-values <0.01 [with a false discovery rate (FDR) of <25%] were used for further analysis in our study.

Results and Discussion

Marker and core haplotype statistics

A total of 10 932 and 11 185 core regions with 72 989 and 71 354 SNPs (71.4% and 69.8%) spanning 1437 Mb and 1377 Mb (63.5% and 60.9%) of the genome were detected in Yorkshire and Landrace respectively. Their mean lengths of core region were calculated as 131.5 ± 287.6 kb and 123.1 ± 274.4 kb, with a maximum of 4139.2 kb and 4505.4 kb in chromosomes X and 4 respectively. The distance between approximately 72% of pairs of adjacent core regions was <50 Kb, and close to 10% of pairs had a distance of more than 150 Kb. The position information for the core regions for each pig breed is listed in Tables S1 and S2. The distribution of the size and number of SNPs in the core regions is depicted in Fig.1. As shown in Fig.1(a) and (b), for both breeds, there was a high frequency of core regions with a length <50 kb. This is probably because of the advantage of the sequencing technology we used, which can identify novel SNPs. This resulted in regions with a higher density of SNP markers. According to EHH, a region with a high density of SNPs indicates a stronger LD among these SNPs and contributes to a higher possibility for them to form into core regions even with a short length. Therefore, the core regions took up a high percentage of the length and number of SNPs in the genomes. This suggests that the sequencing technique could influence the final numbers of core region because of the high density of SNPs identified compared to SNP chips. This would be beneficial for finding regions under selection, even with a small length. As the maximum number of SNPs in core regions was set to 20 (the core region would retain only 20 SNPs, even if it included more than 20), the frequency of 20 SNPs in a core region was increased, as shown in Fig.1(c) and (d).
Figure 1

Distribution of length of core regions and the number of SNPs forming the core regions in Yorkshire and Landrace. (a) and (b) show the distribution of length of core regions, separately, in Yorkshire and Landrace respectively. The Y-axis represents frequency, and the X-axis represents the length of core regions. (c) and (d) show the distribution of the number of SNPs forming the core regions, separately, in Yorkshire and Landrace respectively. The Y-axis represents the frequency, and the X-axis represents the number of SNPs.

Distribution of length of core regions and the number of SNPs forming the core regions in Yorkshire and Landrace. (a) and (b) show the distribution of length of core regions, separately, in Yorkshire and Landrace respectively. The Y-axis represents frequency, and the X-axis represents the length of core regions. (c) and (d) show the distribution of the number of SNPs forming the core regions, separately, in Yorkshire and Landrace respectively. The Y-axis represents the frequency, and the X-axis represents the number of SNPs.

Genome-wide scanning for selection signatures

To identify outlying core haplotypes potentially representing candidate regions under positive selection, we estimated REHH using sweep. The results of this test are presented in Table1, which also includes the number of candidate regions and SNPs putatively under selection for each chromosome in both breeds. In total, 251 and 269 candidate core regions with lengths >0.5 kb displayed outlying peaks at a threshold level of 0.01 in Yorkshire and Landrace respectively. The total length and number of SNPs in candidate regions of the two breeds were nearly the same, approximately 1.3% and 1.7% respectively of the whole genome. This is not consistent with the previous study of Amaral et al. (2011), who estimated that approximately 7% of the porcine genome has been affected by selection events. The differences come from mainly two aspects: First, the P-valve used in our study (P < 0.01) was stricter than was theirs (P < 0.05), and second, the usage of SNPs and identification methods of selection signatures was different, which may have led to underestimating the proportion.
Table 1

Summary of significant (P ≤ 0.01) core region (CR) and SNPs distribution in Yorkshire and Landrace.

PopulationChrNo. CRCR SNPs1CR length2 (kb)Chr length (Mbp)CR length/Chr length3Chr SNPs (n)CR SNPs/Chr SNPs4
Yorkshire1139173693.918295.50.01390280.015
2190242169.685140.10.01595570.020
3130192832.248123.60.02375060.017
4136172089.096136.30.01557960.023
5317445.340100.50.00437360.008
6186342529.724123.30.02112 9100.014
7168213167.319136.40.02358170.029
8172358.1941200.00336870.005
9184272978.478132.50.02252020.035
10707941.01466.70.01431480.022
11448611.47979.80.00837950.012
128511773.87357.40.01340280.021
137993345.927145.20.02346610.017
14113191084.485148.50.00770940.016
154810547.061134.50.00449680.010
16406868.42377.40.01127960.014
174981005.97464.40.01630790.016
18323434.74154.30.00820060.016
X202220.651125.90.00233560.006
Total176125130097.6302262.30.013102 1700.017
Landrace180111 823 125295.50.00690280.009
2137242 477 535140.10.01895570.014
38114911 766123.60.00775060.011
46512719 409136.30.00557960.011
557121 623 945100.50.01637360.015
610818598 300123.30.005129100.008
7126161 154 729136.40.00858170.022
84271 025 2671200.00936870.011
9232314 361 383132.50.03352020.045
1070141 188 16066.70.01831480.022
1194142 351 95679.80.02937950.025
1210517960 26957.40.01740280.026
1373121 880 105145.20.01346610.016
14122241 048 849148.50.00770940.017
15132213 050 612134.50.02349680.027
16175383 36577.40.00527960.006
17358720 06364.40.01130790.011
188113 43354.30.00020060.004
X508541 350125.90.00433560.015
Total16342692 683 36212262.30.012102 1700.016

Number of SNPs forming significant core regions.

Total length covered by significant core regions.

Proportion of total significant core region lengths on chromosome length.

Proportion of total number of SNPs forming significant core regions on number of SNPs used.

Summary of significant (P ≤ 0.01) core region (CR) and SNPs distribution in Yorkshire and Landrace. Number of SNPs forming significant core regions. Total length covered by significant core regions. Proportion of total significant core region lengths on chromosome length. Proportion of total number of SNPs forming significant core regions on number of SNPs used. The positional information of the candidate selected regions for the two breeds is listed in Tables S3 and S4. The signals were vastly overrepresented on parts of chromosome 9. This pattern of distribution was similar to bovine (Qanbari et al. 2010). Studies on selection signatures in pigs were also carried out by other researchers who relied on the Porcine60K SNP chip (Wilkinson et al. 2013). Based on comparisons with Wilkinson et al.'s data (five genomic regions showing selection in five or more breeds), we found that the selected region identified in their study (SSC7, 54.00–57.00 Mb) was consistent with our results (SSC7, 53.8–54.00 Mb). The different statistical method used in their study (FST, comparing different populations) and ours (REHH, studying within a population) might account for the low overlap rate. However, the results of Wilkinson et al.'s study may still provide a piece of evidence of the reliability of our study.

Identifying QTL overlapping with positively selected regions

To test whether the selection signatures we detected were a result of human's selection during pig breeding, we explored the pig QTL database (http://www.animalgenome.org/cgi-bin/QTLdb/SS/download?file=bedSS_10.2). We identified any overlapping of the outlying core regions (the top five with lowest P-value) with published QTL in pigs. The overlapping pig QTL for the core regions with lowest P-values (top five) is shown in Table S5. Interestingly, we found that the core regions showing the most significant indications of selection were contained in hundreds of reported QTL related to production, reproduction, health and meat quality. The number of QTL relating to meat quality (such as loin muscle area, average backfat thickness, meat color, pH value and drip loss; details shown in Table S5) is especially greater than others, with a proportion of 71% (Fig. S2). This observation is in accordance with the history of meat quality selection (such as lean meat percentage) in pig breeding programs (Vidal et al. 2005; Amaral et al. 2011). Furthermore, this indicates that selection during pig breeding has left a detectable footprint in the pig genome.

Genes within positively selected regions

We further investigated the genes in candidate regions of selection and identified corresponding genes by comparing their genomic locations with the available annotation of the porcine genome (Sscrofa 10.2). We extended core regions in both directions up to 250 kb. A summary of statistics for the five positively selected core regions with lengths >0.5 kb and with the lowest P-values (top five) using the REHH test is shown in Table2. These top five positively selected regions harbored genes related to muscle growth or immune response, such as BARX2, which is an important regulator of muscle growth, regeneration and maintenance (Meech et al. 2012), and APON, which is speculated to play a role in either the regulation of steroidogenesis or immunosuppression (O'Bryan et al. 2004).
Table 2

Summary statistics for five core haplotypes of two breeds showing the lowest P-value after the relative extended haplotype homozygosity (REHH) test.

PopulationChrPosition startPosition endCore length (kb)REHH P-valueCandidate geneFunction
Yorkshire16313348031374433.9630.00000043NA1NA
96231360262547091233.4890.00000135BARX2Controls cell adhesion and remodeling of the actin cytoskeleton in myoblast fusion and chondrogenesis
SNORD112Small nucleolar RNAs (snoRNAs), like SNORD112, guide the formation of 2-prime O-methylation of ribosomal RNA (rRNA) and small nuclear RNAs (snRNAs) through a specific RNA duplex at each modification site
5492743649301812.7450.00000430XPNPEP3Encodes a protein that belongs to the family of X-pro- aminopeptidases that utilize a metal cofactor and remove the N-terminal amino acid from peptides with a proline residue in the penultimate position
ST13The assembly process of glucocorticoid receptor
SLC25A17Encodes a peroxisomal membrane protein
RPL31Encodes a ribosomal protein
MCHR1Encodes an integral plasma membrane protein that binds melanin-concentrating hormone
637712730377171844.4540.00000667ZNF507Encodes a zinc finger protein
412353977512362269182.9160.00000957NANA
Landrace11715197807155924639.4660.00000037NANA
142459293924779049186.110.00000040P2RX2Encodes a ligand-gated ion channel receptor
PUS1Stabilizes the secondary and tertiary structure of many RNAs
MMP17Involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction and tissue remodeling
1177297111772994612.350.00000158ITGBL1Encodes a beta-integrin-related protein
5851233818514116417.7830.00000294U6Encodes U6 spliceosomal RNA
52328899624074802785.8060.00000541APOFEncodes olipoprotein
APONEncodes ovarian and testicular apolipoprotein
MIPEncodes a major intrinsic protein of lens fiber
STAT6Encodes a signal transducer and activator
MYL6Encodes a myosin light polypeptide
CSEncodes citrate synthase

NA, no genes were found.

Summary statistics for five core haplotypes of two breeds showing the lowest P-value after the relative extended haplotype homozygosity (REHH) test. NA, no genes were found. We screened the subset of genes in all core regions displaying extreme REHH values. Interestingly, some regions overlapped with genes previously detected as being under selection. For example, in Amaral et al.'s (2011) study, they identified a few genes relating to neuron function, growth, muscle development, metabolism and disease, such as MAPK8IP3, L3MBTL2, SLC22A17, ENO2, CACNG7, SPHK and FGFR2, under positive selection, which were also detected in our study. Among them, the MAPK8IP3 gene is involved in the MAPK signaling pathway, which is essential in regulating many cellular processes including inflammation, cell differentiation, cell proliferation and death (Wilkinson & Millar 2000). According to Miyamoto et al.'s (2011) study, SLC22A17, in cooperation with LCN2, is involved in the acquisition of aggressive behavior among endometrial carcinoma cells. In another study (Groenen et al. 2012), the ERI2 gene, which is located around position 26Mb on SSC3 and encodes ERI1 (exoribonuclease family member 2), was detected in a selective sweep region. This was also observed in our study. Although the exact function of ERI2 is unknown, the ERI1 exoribonuclease family members have been shown to be involved in the degradation of mRNA (Kupsco et al. 2006). What's more, the results suggest a large number of genes related to olfaction were within candidate selection regions. This finding is consistent with Amaral et al.'s (2011) study, in which a significant enrichment of genes related to olfaction within positive selection regions was observed. There has been a significant expansion of the olfactory receptor gene family in the porcine genome, and this probably reflects the strong reliance of pigs on their sense of smell while searching for food (Groenen et al. 2012). We speculated that a well-developed sense of smell might spur the pigs’ appetite and increase in their food intake and, as a result, further accelerate their growth. This observation is consistent with the history of the domestic pig breeds currently studied, which have higher growth rates than do the ancestral wild boar populations. In addition, in our study, we observed several genes related to brain and neuron functions overlapped within regions likely experiencing positive selection in pigs. For example, the NRXN2 gene is a member of the neurexin gene family encoding polymorphic presynaptic proteins that are implicated in synaptic plasticity and memory processing (Rozic et al. 2012). It was reported that neurexin genes are related to neurodevelopmental disorders affecting cognition and behavior, such as the diseases of autism spectrum disorder (Kim et al. 2008), intellectual disability (Ching et al. 2010) and schizophrenia (Rujescu et al. 2009). Because it is expected that farmers would have selected for more docile animals, the process of domestication leads to a relative change in behavior (Price 1999). Our results may support the hypothesis that these positive signatures might be a result of domestication (Amaral et al. 2011). This is similar to the dog, which has inferior observational learning skills compared to the wolf (Frank 1980).

Involved biological processes under selection

We then sought to investigate the functions associated with the putative genes undergoing positive selection by analyzing over-represented annotations and pathways using DAVID (Dennis et al. 2003; Huang da et al. 2009). If the P-value was <0.01 for GO annotation and <0.05 for the KEGG pathway (with a FDR of <25%), that was considered significant. The significant GO terms and KEGG pathways of over-represented genes are shown in Table3 for Yorkshire and Landrace.
Table 3

Enrichment of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways among the positively selected regions.

PopulationCategoryTermP-value
YorkshireGOTERM_BP_2GO:0055114∼oxidation reduction0.0083
GOTERM_CC_2GO:0031012∼extracellular matrix0.0071
GOTERM_MF_2GO:0060589∼nucleoside triphosphatase regulator activity0.0042
LandraceKEGG_PATHWAYhsa00590: Arachidonic acid metabolism0.0172
GOTERM_BP_2GO:0006810∼transport0.0034
GOTERM_BP_2GO:0051234∼establishment of localization0.0039
GOTERM_BP_2GO:0019725∼cellular homeostasis0.0072
Enrichment of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways among the positively selected regions. The results suggest that the two breeds presented a different over-representation of genes with GO annotations and KEGG pathways. The GO terms for Yorkshire included ‘oxidation reduction’, ‘extracellular matrix’ and ‘nucleoside triphosphatase regulator activity’, whereas the Landrace GO terms included ‘transport’, ‘establishment of localization’ and ‘cellular homeostasis’. These results indicate that the two breeds displayed different types of biological processes under the selected regions. Previous studies have reported that, in pigs, the genetic selection for lean, large muscle blocks and fast growth is associated with an increased prevalence of metabolic diseases, such as mulberry heart disease (Rice & Kennedy 1989) and porcine stress syndrome. These diseases are linked to cardiovascular inadequacy, which may result in oxidative stress (Brambilla et al. 2002). Among our findings, genes over-represented in oxidation reduction (P = 0.0083) may relate to selection against the above diseases. Moreover, our results of genes over-represented in transport (P = 0.0034) and establishment of localization (P = 0.0039) revealed genes involved in growth that overlap with positive selection regions, such as TBRG1, which is related to cell growth and differentiation (Garcia-Alai et al. 2010). As pigs with higher growth rates and a higher proportion of muscle are preferred in current breeding programs, they need to have an efficient gene network (such as transport) to support their high growth system. In addition, the pathway of arachidonic acid metabolism (P = 0.0172) was identified in Landrace. Arachidonic acid, routinely added to infant formula along with docosahexaenoic acid, is a natural component of breast milk. It plays an important role in growth and development during the perinatal period (Innis 2005, 2007). The observation that this particular pathway is also related to porcine growth is in agreement with the history of domestic pig breeds currently studied, which have higher growth rates than do their wild relatives. Yorkshire and Landrace are two of several European domestic pig breeds that are more similar to their ancestors and yet are highly differentiated in terms of genotypes and phenotypes (Amaral et al. 2011). The different phenotypes selected during the breeding of two pigs may leave footprints of selection in different parts of the pig genome. This may be an explanation of why the two breeds presented a different over-representation of genes with GO annotations and KEGG pathways. However, our dataset included approximately only 2% of the genome, which may have limited the utility of the gene set enrichment analyses. The current annotation of the pig genome has a limited availability of GO terms and genes mapped in the KEGG pathways, further decreasing the sensitivity of the analysis. Therefore, we could provide only suggestive evidence for the over-represented annotations and pathways affected by positive selection. In conclusion, this study provides a genome-wide map of selection signatures in Yorkshire and Landrace genomes and yields insight into the mechanisms of selection in pig breeding. Our results show that genes related to metabolism, olfaction and nerves may also experience positive selection. Furthermore, there are indications that selection has impacted different genes and pathways in the two breeds studied.
  50 in total

1.  Hitchhiking under positive Darwinian selection.

Authors:  J C Fay; C I Wu
Journal:  Genetics       Date:  2000-07       Impact factor: 4.562

2.  Detecting recent positive selection in the human genome from haplotype structure.

Authors:  Pardis C Sabeti; David E Reich; John M Higgins; Haninah Z P Levine; Daniel J Richter; Stephen F Schaffner; Stacey B Gabriel; Jill V Platko; Nick J Patterson; Gavin J McDonald; Hans C Ackerman; Sarah J Campbell; David Altshuler; Richard Cooper; Dominic Kwiatkowski; Ryk Ward; Eric S Lander
Journal:  Nature       Date:  2002-10-09       Impact factor: 49.962

3.  DAVID: Database for Annotation, Visualization, and Integrated Discovery.

Authors:  Glynn Dennis; Brad T Sherman; Douglas A Hosack; Jun Yang; Wei Gao; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2003-04-03       Impact factor: 13.583

4.  Approximating selective sweeps.

Authors:  Richard Durrett; Jason Schweinsberg
Journal:  Theor Popul Biol       Date:  2004-09       Impact factor: 1.570

5.  A test of neutral molecular evolution based on nucleotide data.

Authors:  R R Hudson; M Kreitman; M Aguadé
Journal:  Genetics       Date:  1987-05       Impact factor: 4.562

6.  The hitch-hiking effect of a favourable gene.

Authors:  J M Smith; J Haigh
Journal:  Genet Res       Date:  1974-02       Impact factor: 1.588

7.  Evolution of canine information processing under conditions of natural and artificial selection.

Authors:  H Frank
Journal:  Z Tierpsychol       Date:  1980

8.  Identification of a novel apolipoprotein, ApoN, in ovarian follicular fluid.

Authors:  Moira K O'Bryan; Lynda M Foulds; James F Cannon; Wendy R Winnall; Julie A Muir; Kim Sebire; A Ian Smith; Hooi-Hong Keah; Milton T W Hearn; David M de Kretser; Mark P Hedger
Journal:  Endocrinology       Date:  2004-07-15       Impact factor: 4.736

9.  A regulatory mutation in IGF2 causes a major QTL effect on muscle growth in the pig.

Authors:  Anne-Sophie Van Laere; Minh Nguyen; Martin Braunschweig; Carine Nezer; Catherine Collette; Laurence Moreau; Alan L Archibald; Chris S Haley; Nadine Buys; Michael Tally; Göran Andersson; Michel Georges; Leif Andersson
Journal:  Nature       Date:  2003-10-23       Impact factor: 49.962

10.  Response to oxidative stress as a welfare parameter in swine.

Authors:  Gianfranco Brambilla; Cinzia Civitareale; Alfredo Ballerini; Maurizio Fiori; Massimo Amadori; Laura Ivonne Archetti; Michaela Regini; Marco Betti
Journal:  Redox Rep       Date:  2002       Impact factor: 4.412

View more
  6 in total

1.  Whole-genome sequencing of European autochthonous and commercial pig breeds allows the detection of signatures of selection for adaptation of genetic resources to different breeding and production systems.

Authors:  Samuele Bovo; Anisa Ribani; Maria Muñoz; Estefania Alves; Jose P Araujo; Riccardo Bozzi; Marjeta Čandek-Potokar; Rui Charneca; Federica Di Palma; Graham Etherington; Ana I Fernandez; Fabián García; Juan García-Casco; Danijel Karolyi; Maurizio Gallo; Vladimir Margeta; José Manuel Martins; Marie J Mercat; Giulia Moscatelli; Yolanda Núñez; Raquel Quintanilla; Čedomir Radović; Violeta Razmaite; Juliette Riquet; Radomir Savić; Giuseppina Schiavo; Graziano Usai; Valerio J Utzeri; Christoph Zimmer; Cristina Ovilo; Luca Fontanesi
Journal:  Genet Sel Evol       Date:  2020-06-26       Impact factor: 4.297

2.  In silico approaches to discover the functional impact of non-synonymous single nucleotide polymorphisms in selective sweep regions of the Landrace genome.

Authors:  Donghyun Shin; Kyung-Hye Won; Ki-Duk Song
Journal:  Asian-Australas J Anim Sci       Date:  2018-05-31       Impact factor: 2.509

3.  Genome-Wide Signatures of Selection Detection in Three South China Indigenous Pigs.

Authors:  Shuqi Diao; Shuwen Huang; Zitao Chen; Jinyan Teng; Yunlong Ma; Xiaolong Yuan; Zanmou Chen; Hao Zhang; Jiaqi Li; Zhe Zhang
Journal:  Genes (Basel)       Date:  2019-05-07       Impact factor: 4.096

4.  Genome-Wide Detection of Selective Signatures in Chicken through High Density SNPs.

Authors:  Zhuang Liu; Congjiao Sun; Liang Qu; Kehua Wang; Ning Yang
Journal:  PLoS One       Date:  2016-11-07       Impact factor: 3.240

5.  Genome-wide genetic variation discovery in Chinese Taihu pig breeds using next generation sequencing.

Authors:  Z Wang; Q Chen; R Liao; Z Zhang; X Zhang; X Liu; M Zhu; W Zhang; M Xue; H Yang; Y Zheng; Q Wang; Y Pan
Journal:  Anim Genet       Date:  2016-07-27       Impact factor: 3.169

6.  Genome-wide genetic structure and differentially selected regions among Landrace, Erhualian, and Meishan pigs using specific-locus amplified fragment sequencing.

Authors:  Zhen Li; Shengjuan Wei; Hejun Li; Keliang Wu; Zhaowei Cai; Dongfeng Li; Wei Wei; Qifa Li; Jie Chen; Honglin Liu; Lifan Zhang
Journal:  Sci Rep       Date:  2017-08-30       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.