Literature DB >> 25781935

A genome-wide association study in large white and landrace pig populations for number piglets born alive.

Sarah Bergfelder-Drüing1, Christine Grosse-Brinkhaus1, Bianca Lind2, Malena Erbe3, Karl Schellander1, Henner Simianer3, Ernst Tholen1.   

Abstract

The number of piglets born alive (NBA) per litter is one of the most important traits in pig breeding due to its influence on production efficiency. It is difficult to improve NBA because the heritability of the trait is low and it is governed by a high number of loci with low to moderate effects. To clarify the biological and genetic background of NBA, genome-wide association studies (GWAS) were performed using 4,012 Large White and Landrace pigs from herdbook and commercial breeding companies in Germany (3), Austria (1) and Switzerland (1). The animals were genotyped with the Illumina PorcineSNP60 BeadChip. Because of population stratifications within and between breeds, clusters were formed using the genetic distances between the populations. Five clusters for each breed were formed and analysed by GWAS approaches. In total, 17 different significant markers affecting NBA were found in regions with known effects on female reproduction. No overlapping significant chromosome areas or QTL between Large White and Landrace breed were detected.

Entities:  

Mesh:

Year:  2015        PMID: 25781935      PMCID: PMC4363374          DOI: 10.1371/journal.pone.0117468

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Reproduction traits of livestock are important because of the major role they play in the economic success of production [1]. The efficiency of pig production largely depends on the number of piglets born alive (NBA) and the number of piglets weaned (NPW). Up to the present, selection based on traditional breeding programmes using Best Linear Unbiased Prediction (BLUP) has been successful in improving maternal reproductive traits such as NBA. However, genetic improvement of female reproduction traits is difficult and complex because of low heritability and sex limited expression and because phenotyping is only possible late in a sow’s life. These conditions constitute a challenge for traditional animal breeding programmes. The exploration of the genetic architecture of reproduction traits is necessary because of the complex genetic and biological processes involved [1,2]. Since the very beginning of quantitative trait loci (QTL) mapping [3], about 10,000 QTL for 653 different traits have been identified in the pig genome (PigQTLdb, http://www.animalgenome.org/cgi-bin/QTLdb/SS/index, [4]). Most of the reported QTL affect production and meat quality traits. For reproduction traits, 137 QTL were identified for total number born, 110 QTL for body weight at birth and 106 QTL for NBA (July 2014). Several studies have investigated the biological foundation in regard to the high impact of NBA on pig production. Genes such as retinol binding protein 4 (RBP4), estrogen receptor 1 and 2 (ESR1, ESR2) and porcine insulin-like growth factor 2 (IGF2) were identified to be positively associated with NBA [1,5-7], but these genes explain only a relatively small proportion of the genetic variation of NBA. In the past, genome-wide scans using microsatellites were performed to identify regions affecting the potentially interesting traits. The development of the PorcineSNP60 BeadChip [8] allows the detection of QTL and candidate genes in a higher resolution. In a recent study Onteru et al. [9] have detected novel QTL regions for pig reproduction traits which do not overlap with QTL intervals previously reported using microsatellites. In Europe, the two breeds Large White (LW) and Landrace (LR) are typical dam lines in commercial pig breeding programmes. However, differences between the two breeds were found in several studies which investigated reproduction traits such as NBA. For example, it was shown that LW sows had slightly higher NBA compared to LR sows [10-12]. Moreover, most breeding companies have their own LW and LR populations with different breeding objectives. Breeding stock is not normally exchanged between organisations. This leads one to expect differences between the breeding companies and their breeding stock. In order to map QTL affecting NBA, genome-wide association studies (GWAS) were performed in LW and LR populations of different breeding companies located in Germany, Switzerland, and Austria. The aims of the study were to reveal genetic similarities and differences between LW and LR populations of different breeding organisations, to identify significant associated SNPs for NBA, and to clarify the biological relevance of these significant markers.

Materials and Methods

Animals and phenotype data

The study included a total of 4,012 LW and LR pigs from herdbook and commercial breeding companies across Germany (3), Austria (1) and Switzerland (1). Data of 2,365 (boars: 1,435, sows: 930) LW and 1,647 (boars: 1,159, sows: 488) LR animals born between 1990 and 2011 were recorded (Table 1). The frequencies of years of birth of all animals are shown by gender in Fig. 1. Breeding values for NBA were routinely estimated by the breeding companies using a standard animal repeatability model and were provided for the study.
Table 1

Number of genotyped animals.

Landrace Large White
Country N Boar Sow N Boar Sow
Germany 1288 925363 1146 790356
Austria 266 141125 191 14843
Switzerland 93 93- 1028 497531
Σ 1647 1159488 2365 1435930
Fig 1

Frequencies of years of birth from all animals by gender.

SNP Quality control

Tissue samples (semen or hair follicle) of the pigs were genotyped with the Illumina PorcineSNP60 Bead Chip [8] in the laboratory Life & Brain GmbH, Bonn. SNPs were excluded from further analysis under the following conditions: a) Minor allele frequency (MAF) < 0.5%, b) Call rate < 95% and c) strong deviation from the Hardy-Weinberg-Equilibrium (p < 10−3). Quality control was performed as implemented in the GenABEL package [13] within defined population clusters.

Population structure

GWAS were performed within breeds (LW or LR) and clusters comprising different sub-populations. In order to visualize possible population stratifications, multidimensional scaling (MDS) plots of an identity-by-state (IBS) matrix were generated containing the two most important principal components of the underlying genetic variation. These two-dimensional MDS plots of the IBS matrix revealed the overall genetic distances between the animals. Based on the visualized genetic distances, animals of the LW and LR populations were analysed separately. In addition, four sub-populations were identified within the LW and LR breeds. Additional GWAS were performed within these clusters, which comprise animals from one to four different breeding organisations.

Genome-wide association study

The GWAS were based on an combined approach developed by Amin et al. [14] and Price et al. [15] and implemented in the R-Package GenABEL [13,14,16]. In order to control population stratification the “Genome-wide Rapid Analysis using Mixed Models and Regression” (GRAMMAR) [14] combined with EIGENSTRAT [15] was used. A similar, combined procedure was suggested by Zhao et al. [17]. In a first step, the phenotypic data (breeding values) were corrected for the fixed effect “breeding organization” and a polygenetic effect (a) by means of Equation (1): with y and y as vectors of pre-corrected and original estimated breeding values (EBVs), respectively, β as solution vector of the fixed effect ‘breeding organisation’, and a as random additive polygenic (a ∼ N (0,G×σ )) effect, which estimates the contribution from the polygene (breeding value) with G as the genomic kinship matrix and the additive genetic variance σ . X and Z are the corresponding design matrices for the fixed and random effects. The genomic kinship (Gij) was estimated by applying the method suggested by Astle & Balding [18]: with L as the number of SNP, p as the allelic frequency at l-th locus (major allele) and g / g as the genotype of j-th / i-th individual at the l-th locus, coded as 0, 1/2 and 1, corresponding to the rare homozygous, heterozygous, and common homozygous genotype. Ignoring the covariance between animals from one family can lead to a high number of false-positive SNPs. The residuals computed with GRAMMAR are corrected for polygenic relationships between the animals and can be used as a new phenotype in association analyses [14,16]. In a second step, these familial correlation-free residuals were included in a simple linear regression as new phenotype for association test (2): with y as the vector pre-corrected EBVs from (1), μ as the mean, g is the vector of genotypes at the marker, k as the marker genotype effect and e as the vector of random residuals. In order to verify remaining population stratification, the inflation factor λ, which depends on the squared original test statistic of the i-th SNP () was calculated as Aluchenko et al.[13] and Price et al. [19] showed that an inflation factor λ in the range of 1.0 to 1.05 is an indicator of a sufficiently corrected population stratification which can be analysed with an acceptable risk of false positive results. Preliminary results of our analysis showed that λ deviates considerably from this optimum. This implies that serious population stratifications still exist. In order to correct for this problem, model 2 was extended by principal components (PC) estimated from the genomic kinship (EIGENSTRAT) [13,15] which were included as fixed covariables. The genomic kinship matrix was used to reveal the PC reflecting the axes of genetic variation and describing the stratification of the populations involved in this study. These PC were used to adjust the phenotype and the genotype for population stratification. The estimation of the PC and the association analysis was performed with the function ‘egscore’ as implemented in the R-package GenABEL [13]. The number of PC used in this step is variable and depends on the ability to correct different levels of population stratifications. The number of PC was increased stepwise and after each step the level of population stratification was quantified via the inflation factor λ. The final number of PC was chosen so that the inflation factor λ [20] was nearest to 1. The inflation factor λ and the observed versus the expected p-values for each SNP are illustrated in quantile-quantile (Q-Q) plots for each cluster. Two regression lines are fitted which represent the optimal (λ = 1) and the calculated inflation factor λ. In case of unstratified population structures, no visible differences can be observed between the two regression lines. In order to reduce the risk of false-positive associations, the p-values of the SNP significance tests were corrected using the Bonferroni-adjustment. Thresholds for genome-wide and chromosome-wide significance levels were 5%. Variance of the pre-corrected EBVs (σ2 y*) explained by each SNP was calculated approximately using following formula: with χ as the test statistic for each SNP resulted from association test and N as the number of animals. This formula resulted from the transformation of a student’s t-distribution into a z-distribution [21]. In our analysis, r cannot be interpreted as the proportion of explained phenotypic variance of NBA—as is usually the case—, because pre-corrected EBVs were analyzed instead of phenotypes. However, r might be a rough indicator of the explained proportion of the additive genetic variance of NBA and could be used to rank the importance of QTL only. Pig Sscrofa 10.2 (International Swine Genome Sequencing Consortium) [22] was used to annotate the significant associated SNPs. The search for biologically relevant genes was performed with Ensembl BioMart [23,24]. For that, a 2 Mb window around a significant region was chosen.

Results

Population structure analysis

MDS plots were used to visualize the genomic distances between the animals (Figs. 2, 3, and 4). Fig. 2 revealed that the breeds LW and LR had a large genetic distance and should be regarded as more or less genetically disconnected. Each breed was analysed separately because of distinct genetic differences between LW and LR. Additionally performed visual inspections of the breed specific MDS plots of LW and LR populations led to various cluster definitions (Figs. 3, 4).
Fig 2

MDS Plot of Landrace (left) and Large White (right) populations of 5 European breeding companies.

Fig 3

MDS plot of Large White population, each colour represents one breeding company, circles show two different clusters.

Fig 4

MDS Plot of Landrace population of 5 European breeding companies, circles indicate different clusters.

The animals of the breed LW (LW_1) were grouped into four sub-clusters (Fig. 3). Cluster LW_3 and LW_2b contained only animals of one breeding organisation, whereas cluster LW_2a covered genetically overlapping pigs of three breeding organisations. In addition, cluster LW_2 combined the clusters LW_2a and LW_2b, which overlap only to a small extent. In the LR population (cluster LR_1) four sub-clusters were assigned (Fig. 4). Cluster LR_2 was formed by excluding the breeding company (cluster LR_3) with the highest deviation from the LR_1 dataset. In addition, two distinct sub-populations were extracted from cluster LR_3 which form cluster LR_3a and LR_3b.

Quality control

SNP quality control was performed within the various clusters. The quantity of remaining genetic markers lay between 39,408 and 45,303 (LW) and 42,205 and 46,066 (LR) clusters. The number of animals ranged between 553 and 2,272 for LW or 206 and 1,598 for LR clusters. More detailed information about each cluster is given in Table 2.
Table 2

Dataset and results of association analyses.

Dataset Association analyses *
Data set N animal N marker PC λ Chromosome (Genome)-wide significant SNPs σ2 y* (%) MAF
LW_12272394083721.0043 (0)0.7–0.90.8–21.1
LW_21719432162561.0055 (0)1.1–1.40.5–22.5
LW_2a73845242741.0024 (3)2.4–4.60.6–21.2
LW_2b938453031511.0042 (0)1.8–2.116.3–17.6
LW_3553435491091.0040 (0)--
LR_11598427212931.0042 (0)1.1–1.331.4–39.4
LR_21144460661851.0010 (0)00
LR_345442205761.0092 (0)4.2–4.81.2–3.7
LR_3a20643416261.0150 (0)--
LR_3b24844013221.0091 (0)8.02.2

* = Numbers of chromosome-wide and genome-wide significant associated SNPs with NBA (p>0.05%); PC = number of principal components; λ = inflation factor; MAF = minor allele frequency; σ2 y* = Variance of the pre-corrected EBVs.

* = Numbers of chromosome-wide and genome-wide significant associated SNPs with NBA (p>0.05%); PC = number of principal components; λ = inflation factor; MAF = minor allele frequency; σ2 y* = Variance of the pre-corrected EBVs.

Influence of population stratification

In order to ensure the power and accuracy of GWAS, it is essential to take possible population stratifications [13,25,26] into consideration. Therefore, associations between SNP and NBA were estimated within the genetically more or less overlapping clusters. In addition, PC which condensed the genetic relationships between the animals was used in the statistical model as covariates to correct for existing population stratification. Depending on the cluster, different numbers of PC were required in order to avoid negative effects of population stratification on the validity of the GWAS analysis. The number of PC used in the analyses of various clusters ranged from 22 (LR_3b) to 372 (LW_1). Genomic inflation factors in all clusters were close to one (Table 2). Cluster specific Q-Q plots (Fig. 5) contain regression lines which were calculated by a linear regression of expected test statistics (independent variable) on observed test statistic (dependent variable). The slopes of these lines correspond to the calculated inflation factor, which is close to 1 in all clusters analysed. This shows that possibly existing stratifications of the populations do not adversely affect the validity of corresponding GWAS analysis.
Fig 5

Q-Q plots of all association studies for all breed clusters.

Genome-wide association analyses

The Manhattan plots show the p-values of the SNP association test for the target trait NBA ordered according to the genomic positions (representative by Fig. 6; S1 Fig., S2 Fig., S3 Fig., S4 Fig., S5 Fig., S6 Fig., S7 Fig., S8 Fig., and S9 Fig.). 14 different chromosome-wide and three genome-wide significant SNPs were detected in the analysed clusters. Three of these SNPs had a MAF below 1%.
Fig 6

Manhattan plot of genome wide association study for NBA in LW_1.

SNPs which were significant in both breeds or in different clusters containing animals from different breeding organisations would have been of particular interest. However, no significant markers or chromosome regions were found to be shared by the breeds. Moreover, only a small number of SNPs were found to be identical in the different clusters of each breed. These SNPs and cluster specific significant markers will be described in the following sections. Large White. In LW_1 three chromosome-wide significant markers were found on SSC5 and SSC10. Each of these markers explained less than 1.0% of σ2 y*. The population LW_1 was subdivided into clusters LW_2 (animals from four breeding organisations) and LW_3 (one breeding organisation). In LW_2 and LW_3 no genome-wide significant SNPs were found. However, within cluster LW_2 five QTL were detected on a chromosome-wide significant level. Each of these QTL explained between 1.1 to 1.3% of σ2 y* of the target trait NBA (S1 Fig. and S4 Fig., Tables 2 and 3). Because of a smaller degree of genetic overlapping, LW_2 was further subdivided into clusters LW_2a and LW_2b. The analyses of these clusters revealed three (LW_2a) and two (LW_2b) chromosome-wide significant SNPs for NBA (Tables 2 and 3), which explained 2.4 to 4.6% and 1.8 to 2.2% of σ2 y*. Three of the QTL detected in dataset LW_2a were significant on a genome-wide level (S2 Fig., S3 Fig.). One of the significant SNP associations on SSC9, identified in cluster LW_2, was confirmed by the analysis of sub-cluster LW_2a. Additionally, three SNPs which were found on SSC5 and SSC10 in LW_1 were also identified in LW_2. This was to be expected, because LW_2 is a subset of the larger cluster LW_1 and LW_2a is one of LW_2.
Table 3

Statistic of significant SNPs in LW.

SNP SSC N A MAF B Var C p-value D Cluster
ALGA001816037382.03.40.001** LW_2a
ASGA00236855227220.90.80.02LW_1
5171922.31.20.01LW_2
MARC01035935227221.10.80.05LW_1
5171922.51.10.03LW _2
MARC010498257380.62.50.03LW _2a
ALGA0055303917190.61.30.004LW _2
97380.64.6<0.001** LW _2a
ASGA00468111073821.32.60.02LW _2a
MARC00700301022720.80.90.007LW _1
1017190.81.10.01LW _2
MARC00434801093816.32.20.01LW _2b
DRGA00106011093817.61.90.04LW _2b
ASGA0090608107385.72.40.05LW _2a
H3GA00308531117181.91.10.03LW _2
MARC0006510117380.63.0<0.001** LW _2a
ASGA0079878187380.63.50.0004** LW _2a

A Number of analysed animals,

B minor allele frequency (MAF),

C σ2 y* = Variance of the pre-corrected EBVs (Var, %),

D nominal p-value and corresponding significant thresholds:

**genome-wide significant (pgem < 0.05).

A Number of analysed animals, B minor allele frequency (MAF), C σ2 y* = Variance of the pre-corrected EBVs (Var, %), D nominal p-value and corresponding significant thresholds: **genome-wide significant (pgem < 0.05). Landrace. In the data set LR_1 two SNPs reached the chromosome-wide significance threshold of 5% (SI 5, Tables 2 and 4). These associations were located on chromosome 9 and 11, they explained up to 1.3% of σ2 y*. After visual inspection of the MDS plots, LR_1 was subdivided into clusters LR_2 and LR_3 which contained 4 or 1 breeding organisations, respectively. In the case of LR_2, no SNP reached the genome- or chromosome-wide significance level (S6 Fig., Tables 2 and 4). On the other hand, association test performed for cluster LR_3 resulted in two SNPs with chromosome-wide significance, explaining up to 4.8% of σ2 y*. These significant SNPs were located on SSC 7 and SSC16 (S7 Fig., Tables 2 and 4). Although cluster LR_3 contained only animals from one breeding organisation, two genetically disconnected sub-clusters (LR_3a and LR_3b) were identified. Association tests in LR_3a resulted in no significant SNPs. For LR_3b and LR_3, one marker located on SSC16 reached the chromosome-wide significance level and explained up to 8.0% of σ2 y* (S9 Fig., Tables 2 and 4).
Table 4

Statistic of significant SNPs in LR.

SNP SSC N A MAF B Var C p-value D Cluster
CASI000675074543.74.20.04LR_3
MARC00709529159831.31.30.01LR _1
H3GA003098511159839.41.10.05LR _1
ASGA0072103164541.24.80.005LR _3
162482.28.00.01LR _3b

A Number of analysed animals,

B minor allele frequency (MAF),

C σ2 y* = Variance of the pre-corrected EBVs (Var, %),

D nominal p-value and corresponding significant thresholds.

A Number of analysed animals, B minor allele frequency (MAF), C σ2 y* = Variance of the pre-corrected EBVs (Var, %), D nominal p-value and corresponding significant thresholds.

Discussion

Population stratification

In the present study, a combined GWAS-approach was used to identify QTL influencing NBA in two maternal pig breeds. When analysing such large scale heterogeneous data, it is of major importance to correct for potential population stratifications in order to ensure the accuracy of the statistical analysis. Several studies have shown that ignoring population stratification will lead to an inflation of false positive QTL and to a loss of statistical power [13,25,26]. In order to avoid such negative effects, our study analysed several clusters compromising animals from only one or from genetically overlapping breeding organisations. As a first result, it was found that animals of the LW and LR breed in the present study do not genetically overlap. This can be seen in the corresponding MDS plot (Fig. 2). For this reason both breeds were analysed separately. In addition, sub-clusters within the two breeds were identified. These sub-clusters are presumably the result of the different selection strategies used by the different breeding organisations. Sub-populations from a limited number of breeding organisations were investigated to identify common regions affecting the target trait NBA. This is a generally accepted procedure and has been utilised in several GWAS in pigs and cattle [27,28]. The defined clusters were statistically evaluated with an approach that combines the GRAMMAR [14] und EIGENSTRAT [15] methods. Within the GRAMMAR approach estimated breeding values for the trait NBA are pre-corrected for the effects ‘breeding organisation’ and ‘familial correlations’, taking into account the genomic “true” relationship between animals. This approach has two advantages: a) the genomic kinship matrix shows the true proportion of shared alleles whereas a pedigree based kinship matrix displays the expected proportion and b) familial correlations are removed from the new phenotype by calculating environmental residuals for association test [14,29]. This is especially important for analysing EBVs as dependent variables because in this case distinct correlations between the EBVs of relatives can be expected. Despite these corrections, the inflation factor, which was calculated according to model 1 (GRAMMAR approach), deviates considerably from the optimum of λ = 1 in each cluster. Therefore, in the second part of the combined approach (EIGENSTRAT), the detection of QTL is based on a model which includes a number of genomic PC depending on the cluster as fixed covariates. This method (EIGENSTRAT) has been applied in several other studies [26-28,30]. The PC condenses the genomic covariance structure of the animals into a series of factors with decreasing importance. The PC act as a correction factor for possible population stratification, but on the other hand, they also reduce the genetic variation which can be used to detect QTL. Although this method leads to an efficient elimination of population stratification, it remains unclear if the inclusion of a high number of PC (>10) leads to an unacceptable loss of utilizable genetic variation. This might have a considerable impact on the power of the association tests [25,28]. In order to balance the two conflicting objectives—removal of population stratification and retention of utilizable genetic variation—, we increased the number of PC stepwise until an acceptable solution was found. The effects of increasing the number of PC were monitored by evaluating the level of the inflation factor λ, which is an indicator of the remaining population stratification. Generally, a value of λ between 1.00 and 1.05 is regarded as tolerable [13,19]. This acceptable range was reached in all analyses after the inclusion of 22 to 372 PC. Aulchenko et al. [13] suggested including 10 PC in the GWAS model in human, which can be regarded as a compromise between correcting for population stratification and retention of utilizable genetic variation. As expected, the number of significant markers increased substantially when 10 PC were used. However, the inflation factors in all analysis were below one, so that the results were not further interpreted.

Minor allele frequency

In GWAS, SNPs with a MAF lower than 1% are frequently excluded from the data set. In the present study a threshold of 0.5% was chosen instead, which can be justified by the findings of Tabangin et al. [31] and Stephens & Balding [32]. Tabangin et al. [31] found that rare SNPs did not show significantly higher false-positive results than common SNPs. They concluded that the removal of SNPs with a low MAF would not be necessary to reduce false-positive results. Stephens & Balding [32] pointed out that the consideration of the p-value alone is not sufficient to characterize the association between the SNP and trait. The statistical power in association tests is of high importance in order to quantify the true dimension of the association. This power is influenced by the MAF and is reduced when SNPs with low MAFs are removed [32,33]. Only five out of a total 17 significant SNPs in the present study had a MAF of < 1%. These SNPs were located in regions where trait specific QTL or genes have been mapped (Tables 3, 4, and 5). Their physiological role could indicate a functional relevance regarding the variation of the trait examined here. Gorlov et al. [33] and Cargill et al. [34] found in their analyses that the proportion of functional SNPs was highest among SNPs with a low MAF. The elimination of rare SNPs could thus decrease the potential for genetic improvement when using genomic selection in animal breeding.
Table 5

Results of annotation for all analyses with previously reported candidate genes, QTL or association in SNP region.

SSC SNP Position (Mbp) Genes in SNP Region * Previously reported QTL or Associations in SNP region ** Cluster
3ALGA001816027925965-NBA, CLNLW_2a
5ASGA0023685876762 PPARα, Fbln1 NSBLW_1,LW_2
5MARC0103593961240 PPARα, Fbln1 NSBLW_1,LW_2
5MARC010498291550413 VEZT -LW_2a
7CASI0006750115511369 FLRT2 -LR_3
9MARC007095214861213-TNBLR_2
9ALGA0055303139041276 PTGS2,PLA2G4A CLNLW_2,LW_2a
10ASGA004681118203672 AHCTF1 -LW_2a
10MARC007003032526661-CLNLW_1,LW_2
10MARC004348063867699 ITGB1 CLN, FSHLW_2b
10DRGA001060163869377 ITGB1 CLN, FSHLW_2b
10ASGA009060876815569 CTSL -LW_2a
11H3GA003085382720--LW_2
11H3GA00309853733271 FLT1 -LR_1
11MARC000651074240078-NSBLW_2a
16ASGA00721036470509-NBALR_3,LR_3b
18ASGA007987847312409-NBALW_2a

SSC = Sus scrofa; TNB = total number born; NBA = number born alive, NSB = number of stillborn piglets; CLN = corpus luteum number, FSH = plasma follicle-stimulating hormone.

* The declaration of gene symbols can be obtained from Ensembl or http://www.ncbi.nlm.nih.gov/gene.

** The QTL information was obtained using http://www.animalgenome.org/cgi-bin/gbrowse/pig/.

SSC = Sus scrofa; TNB = total number born; NBA = number born alive, NSB = number of stillborn piglets; CLN = corpus luteum number, FSH = plasma follicle-stimulating hormone. * The declaration of gene symbols can be obtained from Ensembl or http://www.ncbi.nlm.nih.gov/gene. ** The QTL information was obtained using http://www.animalgenome.org/cgi-bin/gbrowse/pig/.

Significant markers for NBA: Across population

In LW, SNPs significant across sub-populations were found in the analysis of clusters LW_1 and LW_2 as well as in LW_2 and LW_2a, which had a certain proportion of animals in common contain shared proportions of identical animals. A remarkably low number of QTL were found in the genomically homogeneous cluster LW_2b, which consists of animals from only one breeding organisation. The high number of PC (151) with negative impact on the utilizable genetic variation might explain this result. In addition, the year of birth of the pigs from this breeding organisation covers the years 1990 to 2011 (Fig. 1). This long period of selection might influence the frequency of important genes and/or the linkage phase between marker and QTL, but not necessarily the genomic population structure displayed by the MDS plots (Figs. 2 and 3). The LR population of one breeding organisation (LR_3) was genetically disconnected, so that two sub-clusters (LR_3a and LR_3b) were formed and analysed separately. The genetic disconnection can be explained by the import of breeding animals into this breeding organization in the past. Within the different LR clusters, only one SNP located on SSC16 was found in two clusters, LR_3 and its subset LR_3b.

Significant markers for NBA: Position and biological relevance

Detailed information about significant SNPs and the results of annotation for all analyses with previously reported candidate genes, QTL or association in SNP regions are given in Table 5. In the analysis of LW_2a, one SNP significantly associated with NBA on SSC3 at 27.9 Mb was located within a region where QTL have been found for NBA and ovulation rate (OR) in previous studies [9,35] (Table 5). Up to the present, no gene with an influence on these reproductive traits has been located in this chromosome region. At the distal end of the p-arm of SSC5 two significant markers (ASGA0023685, MARC0103593) were found in LW_1 as well as in LW_2 (Table 5). In the cluster LW_2b, these two markers slightly exceeded the 5% significance threshold. The gene peroxisome proliferator activated receptor α (PPARα), which is part of a nuclear hormone receptor family, was mapped within the 2 Mb window around these marker positions. In Polish LR and Pietrain, it has been shown that the expression of PPARα is significantly higher in endometrial tissue at early stage of pregnancy than during the estrous cycle [36]. Gene expression was lower at day 10–12 and 22–30 of pregnancy when the maternal recognition of pregnancy and the end of the implantation of the fetus in the endometrium take place. The study concluded that PPARα is involved in these two important events. A second gene (Fibulin-1, Fbln1), involved in building blood vessel walls, is located at 1.07–1.16 Mb on SSC5 (Table 5). The importance of this gene was illustrated by a perinatal mortality of mice with homozygous knock-out phenotype [37]. Vezatin (VEZT) was located at 92.2 to 92.3 Mb which was next to the found marker at 91.5 Mb on SSC5 when analyzing cluster LW_2a. The physiological role of VEZT has not been established in pigs, but Hyenne et al. [38] reported a function of VEZT during preimplantation of mice embryos. They inhibited the expression of this gene and found developmentally arrested embryos with limited cell-cell interactions which failed to form a young blastocyst. This finding underlines the potential importance of VEZT for maternal reproduction. In cluster LR_3, one chromosome-wide significant marker (CASI0006750) was found at 115.5 Mb on SSC7 with a MAF of 2%. Fibronectin leucine-rich repeat transmembrane protein (Flrt2) was mapped close to this marker (114.35–114.36 Mb) (Table 5) which is involved in the embryonic development of the heart. Mice homozygous null embryos were developmentally arrested and died at mid-gestation caused by cardiac insufficiency [39]. At position 14.8 Mb the marker MARC0070952 was found on SSC9 in LR_1 and in LR_2, but in LR_2 the marker exceeds the chromosome-wide 5% p-value threshold only by a small amount (p = 5,5%). In pigs, Onteru et al. [9] detected one QTL affecting TNB in this region (Table 5). Up to the present, no genes with an influence on reproduction in pigs have been identified in this chromosome region. A second detected marker on SSC9 was found in the overlapping clusters LW_2 and LW_2a (ALGA0055303, 139.0 Mb) with a genome-wide significance in LW_2a although the MAF was below 1%. In a previous study, QTL for corpus luteum number have been detected in this chromosome region of SSC9 [35]. Additionally, prostaglandin-endoperixode synthase 2 (PTGS2, also known as cyclooxygenase II), was mapped in this area of SSC9 (140.2–140.3 Mb) (Table 5). PTSG2-null mice showed defects in the mentioned reproduction traits [40,41], e.g. implantation failure [41]. Ashworth et al. [42] investigated the role of PTSG2 in the estrous cycle and early pregnancy of pigs. They concluded that this gene has an impact on placental attachment and embryo survival in pigs. An early estrogen exposure at the beginning of the pig’s pregnancy leads to an altered PTSG2 expression. This could be one of the reasons for a total embryonic loss during implantation due to endocrine disruption of pregnancy [42]. Additionally, it has been shown that PTGS2 is important for the regulation of ovulation and fertilization which determine the number of preimplanted embryos [41,43,44] and therefore influences litter size in pigs. Phospholipase A2 group 4A (PLA G4A) is required for a normal PTGS2 induction [45,46]. PLA G4A is also mapped in the chromosome region of the significant associated marker, which was found in LW_2 and LW_2a on SSC9 (140.4–140.6 Mb) (Table 5). Knocking out this gene leads to reduced litter sizes in mice caused by defects during implantation [47-51]. Kurusu et al. [52] also found a significantly reduced number of oocytes and preimplanted embryos in PLA G4A -/- mice in comparison to PLA G4A +/+ mice leading to a reduction in litter size. The SNP ASGA0046811 at position 18.2 Mb on SSC10 was significantly associated with NBA in LW_2a. The gene AT hook containing transcription factor 1 (AHCTF1 also known as ELYS), was mapped close to this marker (17.3–17.4 Mb) (Table 5). The function of this gene in pigs is not clarified yet. Okita et al. [53] showed that AHCTF1 deficient mice with a homozygous genotype for this mutation died after implantation phase. They observed impaired proliferation of the inner cells of the embryos and concluded that this gene is an important factor for the proliferation and survival of the inner cells and thus for the survival of the mouse embryo [53]. SNP MARC0070030 mapped on SSC10 at 32.5MB was found in LW_1 and LW_2, but had a MAF below 1% in both sub-populations. This marker is located in a previously described QTL for corpus luteum number [35] which is one of the main factors influencing NBA [54]. In the upstream chromosome region of SSC10, the SNPs DRGA0010601 and MARC0043480 (63.8 Mb) were associated with NBA in LW_2b. QTL affecting ovulation rate and plasma follicle-stimulating hormone (FSH) concentration were detected within that chromosome region in previous studies [35,55] (Table 5). In addition, integrin β 1 (ITGβ1) was mapped close to these markers (61.4–61.5 Mb). It has been shown that the G allele of ITGβ1 has an effect on litter size in LW and LR [56]. Cathepsin L1 (CTSL1, at 76.9–77.0 Mb) is located close to the significant marker which was identified at position 76.8 Mb and was found to be associated with NBA in LW_2a. In pigs, this gene has the function of regulating the transport of macromolecules between mother and embryo. This is essential for the nutrition and development and thus the survival of the embryo [57]. On the p-arm of SSC11 one marker was found to be associated with NBA in LW_2. This is the first time that a QTL for NBA has been reported in this region. The chromosome-wide significant SNP H3GA0030985 was found at position 3.7 Mb in LR_1. The FMS-like tyrosine kinase 1 (Flt1) gene, which is one of the two receptors for vascular endothelial growth factor (VEGF)-A [58], was mapped at 5.3–5.5 Mb. It has a major impact on embryonic vascular development and on the cyclic blood vessel proliferation in the female reproduction tract [59]. An adequate vascular development is a key factor for the fetal-maternal exchange of nutrients, gases and wastes [60]. It has been shown that a targeted change of VEGF-A in mice leads to embryonic death [61,62]. Fong et al. [63] found that the gene Flt1 has an essential function in embryonic vasculature. This was underlined by the fact that mutant mice homozygous in the Flt1 locus did not survive the embryonic stage. Death was caused by abnormal vascular channels which these mutant embryos had developed. Furthermore, Ferrara et al. [59] suggested that Flt1 appears as a “decoy” receptor for VEGF-A agonist during embryogenesis. In LW_2a, one marker was found on the q-arm of SSC11 in the QTL region which was reported to be responsible for the number of stillborn piglets in LW and French LR populations by Tribout et al. [64]. The SNP ASGA0072103, located on SSC 16, had chromosome-wide significance in LR_3 and LR_3b. Tribout et al. [64] detected a QTL affecting NBA at this position in LW and French LR populations. In the same study, a QTL for NBA was found on SSC18 [64]. This supports the findings of the present study. We detected a SNP with genome-wide significance at position 47.3 Mb on SSC18 with a MAF of 0.6%. The results reported by Tribout et al. [64] and our own findings indicate that this chromosome region may have an impact on NBA in Large White populations.

Conclusion

A distinct genetic stratification between different pig breeds and pig sub-populations was detected in our data set. This might be characteristic for commercial pig populations from competing pig breeding organisation with different breeding goals. In summary, we found 17 different SNPs in the various sub-clusters. Five of the SNPs had a low MAF (<1%). Taking into account the long selection history for fertility traits and the low heritability of NBA, this result was to be expected. Most of the significant SNPs were detected in chromosome regions where candidate genes or QTL affecting litter size had been mapped in previous studies. Against this background, the removal of SNPs with a low MAF jeopardises the potential for genetic progress in genomic selection programs. Because of the low MAF of many QTL, the probability of finding many SNPs which act as QTL across breeds or sub-clusters was low. This assumption was supported by the low number of across sub-cluster QTL in our study. It appears that in each sub-population litter size is influenced by different alleles. Because there are no such overlapping QTL regions, it is questionable if the combination of genetically divergent sub-populations is a useful strategy for detecting relevant QTL or improving the accuracy of genomic selection.

Manhattan plot of genome wide association study for NBA in LW_2.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LW_2a.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LW_2b.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LW_3.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LR_1.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LR_2.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LR_3.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LR_3a.

(TIFF) Click here for additional data file.

Manhattan plot of genome wide association study for NBA in LR_3b.

(TIFF) Click here for additional data file.
  59 in total

1.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  GenABEL: an R library for genome-wide association analysis.

Authors:  Yurii S Aulchenko; Stephan Ripke; Aaron Isaacs; Cornelia M van Duijn
Journal:  Bioinformatics       Date:  2007-03-23       Impact factor: 6.937

4.  Anovulation in cyclooxygenase-2-deficient mice is restored by prostaglandin E2 and interleukin-1beta.

Authors:  B J Davis; D E Lennard; C A Lee; H F Tiano; S G Morham; W C Wetsel; R Langenbach
Journal:  Endocrinology       Date:  1999-06       Impact factor: 4.736

5.  Renal abnormalities and an altered inflammatory response in mice lacking cyclooxygenase II.

Authors:  J E Dinchuk; B D Car; R J Focht; J J Johnston; B D Jaffee; M B Covington; N R Contel; V M Eng; R J Collins; P M Czerniak
Journal:  Nature       Date:  1995-11-23       Impact factor: 49.962

6.  High affinity VEGF binding and developmental expression suggest Flk-1 as a major regulator of vasculogenesis and angiogenesis.

Authors:  B Millauer; S Wizigmann-Voos; H Schnürch; R Martinez; N P Møller; W Risau; A Ullrich
Journal:  Cell       Date:  1993-03-26       Impact factor: 41.582

7.  Role of the Flt-1 receptor tyrosine kinase in regulating the assembly of vascular endothelium.

Authors:  G H Fong; J Rossant; M Gertsenstein; M L Breitman
Journal:  Nature       Date:  1995-07-06       Impact factor: 49.962

8.  Genetic mapping of quantitative trait loci for growth and fatness in pigs.

Authors:  L Andersson; C S Haley; H Ellegren; S A Knott; M Johansson; K Andersson; L Andersson-Eklund; I Edfors-Lilja; M Fredholm; I Hansson
Journal:  Science       Date:  1994-03-25       Impact factor: 47.728

9.  The secretory phospholipase A2 gene is a candidate for the Mom1 locus, a major modifier of ApcMin-induced intestinal neoplasia.

Authors:  M MacPhee; K P Chepenik; R A Liddell; K K Nelson; L D Siracusa; A M Buchberg
Journal:  Cell       Date:  1995-06-16       Impact factor: 41.582

10.  Accounting for population stratification in practice: a comparison of the main strategies dedicated to genome-wide association studies.

Authors:  Matthieu Bouaziz; Christophe Ambroise; Mickael Guedj
Journal:  PLoS One       Date:  2011-12-21       Impact factor: 3.240

View more
  11 in total

1.  Genome-wide association analyses for boar taint components and testicular traits revealed regions having pleiotropic effects.

Authors:  Christine Große-Brinkhaus; Leonie C Storck; Luc Frieden; Christiane Neuhoff; Karl Schellander; Christian Looft; Ernst Tholen
Journal:  BMC Genet       Date:  2015-04-09       Impact factor: 2.797

2.  Identification of Stem Leydig Cells Derived from Pig Testicular Interstitium.

Authors:  Shuai Yu; Pengfei Zhang; Wuzi Dong; Wenxian Zeng; Chuanying Pan
Journal:  Stem Cells Int       Date:  2017-01-24       Impact factor: 5.443

3.  miR-205 Expression Elevated With EDS Treatment and Induced Leydig Cell Apoptosis by Targeting RAP2B via the PI3K/AKT Signaling Pathway.

Authors:  Yang Cui; Rui Chen; Lin Ma; Wenjing Yang; Mingyue Chen; Yanghai Zhang; Shuai Yu; Wuzi Dong; Wenxian Zeng; Xianyong Lan; Chuanying Pan
Journal:  Front Cell Dev Biol       Date:  2020-06-09

4.  Genome-wide associations and detection of potential candidate genes for direct genetic and maternal genetic effects influencing dairy cattle body weight at different ages.

Authors:  Tong Yin; Sven König
Journal:  Genet Sel Evol       Date:  2019-02-06       Impact factor: 4.297

Review 5.  Survey of SNPs Associated with Total Number Born and Total Number Born Alive in Pig.

Authors:  Siroj Bakoev; Lyubov Getmantseva; Faridun Bakoev; Maria Kolosova; Valeria Gabova; Anatoly Kolosov; Olga Kostyunina
Journal:  Genes (Basel)       Date:  2020-04-30       Impact factor: 4.096

6.  Genetic and genomic characterization of vulva size traits in Yorkshire and Landrace gilts.

Authors:  Flor-Anita Corredor; Leticia P Sanglard; Richard J Leach; Jason W Ross; Aileen F Keating; Nick V L Serão
Journal:  BMC Genet       Date:  2020-03-12       Impact factor: 2.797

7.  Genome-Wide Association Analysis Reveals Genetic Loci and Candidate Genes for Chest, Abdominal, and Waist Circumferences in Two Duroc Pig Populations.

Authors:  Shenping Zhou; Rongrong Ding; Zhanwei Zhuang; Haiyu Zeng; Shuxian Wen; Donglin Ruan; Jie Wu; Yibin Qiu; Enqin Zheng; Gengyuan Cai; Jie Yang; Zhenfang Wu; Ming Yang
Journal:  Front Vet Sci       Date:  2022-01-28

8.  Genetic determinants of pig birth weight variability.

Authors:  Xuemin Wang; Xiaolei Liu; Dadong Deng; Mei Yu; Xiaoping Li
Journal:  BMC Genet       Date:  2016-01-27       Impact factor: 2.797

9.  Association between single-nucleotide polymorphisms within candidate genes and fertility in Landrace and Duroc pigs.

Authors:  Nina Hårdnes Tremoen; Maren Van Son; Ina Andersen-Ranberg; Eli Grindflek; Frøydis Deinboll Myromslien; Ann Helen Gaustad; Dag Inge Våge
Journal:  Acta Vet Scand       Date:  2019-12-03       Impact factor: 1.695

10.  Integrated analysis of lncRNA, miRNA and mRNA reveals novel insights into the fertility regulation of large white sows.

Authors:  Huiyan Hu; Qing Jia; Jianzhong Xi; Bo Zhou; Zhiqiang Li
Journal:  BMC Genomics       Date:  2020-09-14       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.