Literature DB >> 35565484

A Whole Genome Sequencing-Based Genome-Wide Association Study Reveals the Potential Associations of Teat Number in Qingping Pigs.

Zezhang Liu1, Hong Li2, Zhuxia Zhong1, Siwen Jiang1.   

Abstract

Teat number plays an important role in the reproductive performance of sows and the growth of piglets. However, the quantitative trait loci (QTLs) and candidate genes for the teat number-related traits in Qingping pigs remain unknown. In this study, we performed GWAS based on whole-genome single-nucleotide polymorphisms (SNPs) and insertions/deletions (Indels) for the total number of teats and five other related traits in 100 Qingping pigs. SNPs and Indels of all 100 pigs were genotyped using 10× whole genome resequencing. GWAS using General Linear Models (GLM) detected a total of 28 SNPs and 45 Indels as peak markers for these six traits. We also performed GWAS for the absolute difference between left and right teat number (ADIFF) using Fixed and random model Circulating Probability Unification (FarmCPU). The most strongly associated SNP and Indel with a distance of 562,788 bp were significantly associated with ADIFF in both GLM and FarmCPU models. In the 1-Mb regions of the most strongly associated SNP and Indel, there were five annotated genes, including TRIML1, TRIML2, ZFP42, FAT1 and MTNR1A. We also highlighted TBX3 as an interesting candidate gene for SSC14. Enrichment analysis of candidate genes suggested the Wnt signaling pathway may contribute to teat number-related traits. This study expanded significant marker-trait associations for teat number and provided useful molecular markers and candidate genes for teat number improvement in the breeding of sows.

Entities:  

Keywords:  Qingping pig; TBX3; Wnt signaling pathway; teat number; whole genome resequencing

Year:  2022        PMID: 35565484      PMCID: PMC9100799          DOI: 10.3390/ani12091057

Source DB:  PubMed          Journal:  Animals (Basel)        ISSN: 2076-2615            Impact factor:   3.231


1. Introduction

Porcine teats are located from the anterior to posterior limb bud and symmetrical to the abdominal midline, and teat number is an important trait for the mothering ability of a sow, which can affect the piglets’ weight gain and mortality. It has been shown that teat number is a quantitative trait with a medium level of heritability (0.32) [1]. Using genetic markers can speed up the genetic improvement of teat number. Thus far, 655 teat number quantitative trait loci (QTLs) in pigs have been reported and included in the PigQTL database [2]. Previous genome-wide association studies (GWAS) found microsatellite markers or single nucleotide polymorphisms (SNPs) in popular breeds, including Durocs [1,3], Large Whites [4], and their crosses with Meishan pigs [5,6]. GWAS were also performed for teat number-related traits in several Chinese native pig breeds, including Erhualian [7], Sushan [8] and Beijing black [9]. However, the genetic architecture for teat number in Qingping pig, a Chinese native pig breed, is still not clear. In 2000, Wada et al. revealed two QTLs for teat number on SSC1 and SSC7 by QTL analysis of 265 F2 offspring of the Meishan and Göttingen miniature pig [10]. Since then, an increasing number of studies have used microsatellite markers to confirm the QTL on SSC7 in other pig populations, including Meishan × Duroc F2 resource population [5], F2 populations of Yorkshire boars and Meishan sows [11], and Meishan × Large White F2 pigs [12]. Additionally, a number of studies detected associated SNPs for teat number near or within this QTL on SSC7 in Duroc pig [1,3,13,14], White Duroc × Erhualian F2 resource population [15], a commercial swine population [16], and Large White pig [17]. Notably, VRTN (located at 103.4 Mb on SSC7 on Sscrofa10.2) was a credible candidate gene in this major QTL for teat number on SSC7. These studies also indicated that QTLs for teat number-related traits are distributed across the genome on every chromosome. Several other genes have also been annotated as teat number-associated genes, such as Lysine Demethylase 6B (KDM6B) [15], TOX High Mobility Group Box Family Member 3 (TOX3) [17], Estrogen Receptor 1 (ESR1) and Nuclear Receptor Subfamily 5 Group A Member 1 (NR5A1) [8]. However, most previous studies have not used the high-density single-nucleotide polymorphisms (SNPs) detected by high-throughput sequencing data to investigate teat number-related traits. The purpose of the current study was to detect genome-wide associations for teat number-related traits in Qingping pigs using whole-genome SNPs and insertions/deletions (Indels) based on whole-genome resequencing.

2. Materials and Methods

2.1. Sample and Sequencing

In this study, all 100 pigs were raised indoors in the Qingping pig Conservation Farm in Yichang, Hubei, China. Ear tissues were collected and stored in liquid nitrogen until further analysis. Genomic DNA samples were extracted from ear tissues, using a standard phenol–chloroform method. Sequencing libraries were constructed and sequenced by the Novogene Bioinformatics Institute (Novogene, Beijing, China). High-throughput sequencing was performed as paired-end 150 sequencing using a HiSeq 4000 sequencing system (Illumina, San Diego, CA, USA).

2.2. Phenotypic Data

In this study, three independent persons observed all 100 pigs and counted the teat number of the left and right lines. A total of six teat number-related traits were obtained: (i) the number of teats on the left side (LTN); (ii) the number of teats on the right side (RTN); (iii) the total number of teats (TNUM = LTN + RTN); (iv) the maximum number of teats in LTN and RTN (MAXAP); (v) the difference between the two sides (L-R = LTN − RTN) and (vi) the absolute difference between left and right teat number (ADIFF = |LTN − RTN|). The mean, standard deviation, minimum value, maximum value, and coefficient of variance for each trait were calculated using R (version 3.6.0) (R Core Team, Vienna, Austria).

2.3. Genotyping and Quality Control

Clean reads from all 100 pigs were aligned to the Sscrofa11.1 reference genome by the BWA software (version: 0.7.8) (Wellcome Trust Sanger Institute, Hinxton, UK) [18]. To reduce mismatches generated by PCR amplification before sequencing, duplicated reads were removed using SAMtools (Wellcome Trust Sanger Institute, Hinxton, UK) [19]. SNPs and Indels calling were initially performed to generate a gvcf file using UnifiedGenotyper in GATK (version 3.6) (Broad Institute, Cambridge, MA, USA) [20]. SNPs and Indels were divided using SelectVariants in GATK. Hard filtering of SNPs was applied to the raw variant set using “QUAL < 30.0 || QD < 2.0 || FS > 60.0 || MQ < 40.0 || SOR > 3.0 || ReadPosRankSum < −8.0”. Hard filtering of Indels was applied to the raw variant set using “QUAL < 30.0 || QD < 2.0 || FS > 200.0 || SOR > 10.0 || ReadPosRankSum < −20.0 || MQ < 40.0 || MQRankSum < −12.5”. SNPs and Indels in the VCF files were quality-filtered using VCFtools (v0.1.16) (Wellcome Trust Sanger Institute, Hinxton, UK) to remove variants with sequencing depth less than 3 [21]. SNPs and Indels were further filtered using PLINK 2.0 (Complete Genomics, Mountain View, CA, USA) [22]. Missing genotypes were imputed using beagle.03Jul19.b33.jar [23], followed by filtering SNPs and Indels again using PLINK 2.0 to obtain the high-quality common SNPs and Indels of 100 pigs for further analysis (individual call rate > 0.90; minor allele frequency > 0.05; call rate > 0.90, SNPs and Indels in Hardy–Weinberg equilibrium (p > 1 × 10−6) and excluding SNPs and Indels located on the sex chromosomes).

2.4. SNP-Based Heritability

The phenotypic variance explained by genome-wide SNPs (SNP-based heritability) was estimated using GREML in Genome-wide Complex Trait Analysis (GCTA) [24]. Briefly, the genetic relationships between pairwise individuals from all the autosomal SNPs were estimated using the genetic relationship matrix (GRM) based on high-quality common SNPs, followed by GRM and phenotype for restricted maximum likelihood (REML) analysis to estimate the variance explained by the SNPs.

2.5. Principal Component Analysis

Principal component analysis (PCA) was used to explore the population structure of Qingping pigs and determine whether principal components (PCs) should be added to the GWAS. PCA was performed using MVP.Data.PC in rMVP R (version 3.6.0) (Huazhong Agricultural University, Wuhan, China) [25] package based on high quality common SNPs. PC1 and PC2 were visualized using MVP.PCAplot in rMVP R (version 3.6.0) package.

2.6. GWAS Using General Linear Model (GLM)

In the present study, GLM in rMVP R (version 3.6.0) package was used to perform the GWAS for teat number-related traits based on high-quality common SNPs and Indels [26]. Principal component analysis showed no discernible clustering. Therefore, no principal component was adjusted in the subsequent association analysis. Each SNP or Indel for teat number-related traits was tested by GLM as follows: where is the vector of each teat number-related trait in Qingping pigs; , a matrix of test SNP or Indel; , an incidence vector for ; , a vector of residuals following a normal distribution with a mean of zero and covariance, where is the identity matrix and is the residual variance.

2.7. GWAS Using FarmCPU

The Fixed Effect Model (FEM) and the Random Effect Model (REM) are used iteratively in FarmCPU [27], and FarmCPU in rMVP R (version 3.6.0) package was also used to perform GWAS for ADIFF. In the GLM model, some genetic markers were significantly associated with ADIFF with a whole-genome significant p-value (2.16 × 10−9 for SNPs and 2.44 × 10−8 for Indels). These markers can be used to define kinship in the REM step of FarmCPU to avoid the model over-fitting problem in FEM. The FEM is used to test each genetic marker, one at a time. Pseudo QTNs are included as covariates to control false positives. Specifically, the FEM could be expressed by the following equation: where is the phenotype of the individual; , , …, , the genotypes of . pseudo QTNs, initiated with no QTN; , , …, , the corresponding effects of the pseudo QTNs; , the genotype of the individual and genetic marker; , the corresponding effect of the genetic marker; , the residual having a distribution with zero mean and variance of . The REM is used to optimize the selection of pseudo QTNs from markers with whole genome significant p-values (2.16 × 10−9 for SNPs and 2.44 × 10−8 for Indels) and positions by using the SUPER algorithm [28]. The REM could be expressed by the following equation: where and . are the same as in FEM, and indicates the total additive genetic effect of the individual. The expectations of the individuals’ total genetic effects are zeros. The variance and covariance matrix of the individuals’ total genetic effects can be expressed by , where is an unknown genetic variance and is the kinship matrix calculated by pseudo QTNs. The FEM and REM are iterated until no new pseudo QTNs are added, or the specified maximum number of iterations is reached.

2.8. Comparison with Known QTLs and Haplotype Analysis

Pig QTLs based on Sscrofa11.1 were downloaded from the PigQTL database. The information of QTLs for teat number-related traits was obtained using R (version 3.6.0), followed by comparing the significant SNPs and Indels with these QTLs using R (version 3.6.0). For the strongest significant SNP (rs322863105) for ADIFF on SSC17, SNPs with suggestive significant p-values (1/557,540, 1.79 × 10−6) around rs322863105 were used for haplotype block analysis to evaluate the linkage disequilibrium (LD) patterns of selected SNPs within this region using Haploview version 4.2 [29]. The effects of this SNP on ADIFF were plotted using ggplot2 R (version 3.6.0) package.

2.9. Annotation of Candidate Genes and Functional Enrichment Analysis

Candidate genes, including or close to the significant SNPs and Indels, were annotated using the biomaRt [30] R (version 3.6.0) package based on Sscrofa11.1. Candidate genes located in 1-Mb regions of significant SNPs and Indels were also annotated. Gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway enrichment analyses and visualizations were performed using the clusterProfiler R (version 3.6.0) package [31].

3. Results

3.1. Genotyping and Phenotypic Statistics

After whole-genome sequencing, 2.76 TB of sequences were generated, with a mean coverage of 98.47% at an average of 9.66-fold depth for 100 Qingping pigs in this study (Table S1). Clean data were mapped to the pig reference genome (Sscrofa11.1), with 36,482,281 SNPs and 4,859,001 Indels being called with GATK. After filtering, 23,193,931 SNPs and 2,053,221 Indels, with a distribution roughly proportional to autosomal chromosomes of pigs, were retained for subsequent analyses (Figure 1).
Figure 1

The distribution of SNPs and Indels on autosomal chromosomes of pigs.

Table 1 shows the descriptive statistics of teat number-related traits of Qingping pigs. The average teat number [standard deviation (SD)] was seen to be 14.78 (0.97), ranging from 14 to 17, which was higher than the values of Beijing Black Pig (13.6) [9], Japanese Duroc (13.73) [3], American (10.90) and Canadian Duroc (10.92) [14], but lower than the value of Erhualian pigs (19.13) [7]. The meansvalues (SD) for the other teat number-related traits, were 7.36 (0.48), 7.42 (0.61), 7.50 (0.59), −0.06 (0.51) and 0.22 (0.46) for LTN, RTN, MAXAP, L-R and ADIFF, with coefficient of variation (CV) values of 6.55, 8.17, 6.56 and 7.93% for LTN, RTN, TNUM and MAXAP, respectively.
Table 1

Summary of phenotypic data in terms of mean, standard deviation, minimum, maximum, coefficient of variance, and SNP-based heritability and standard error (SE) for each trait.

TraitNMeanSDMinMaxC.V. hSNP2 SE
LTN100 7.36 0.48 7.00 8.00 6.55 0.29 0.21
RTN100 7.42 0.61 6.00 9.00 8.17 0.19 0.21
TNUM100 14.78 0.97 14.00 17.00 6.56 0.36 0.23
MAXAP100 7.50 0.59 7.00 9.00 7.93 0.38 0.22
L-R100 −0.06 0.51 −2.00 2.00 NA0.00 0.20
ADIFF100 0.22 0.46 0.00 2.00 NA0.24 0.20

LTN: the number of teats on the left side; RTN: the number of teats on the right side; TNUM: the total number of teats (TNUM = LTN + RTN); MAXAP: the maximum number of teats in LTN and RTN (MAXAP); L-R: the difference between the two sides (L-R = LTN − RTN); ADIFF: the absolute difference between left and right teat number (ADIFF = |LTN − RTN|).

In Table 1, it was shown that the values of SNP-based heritability () for teat number-related traits were 0.29, 0.19, 0.36, 0.38, 0.00 and 0.24 for LTN, RTN, TNUM, MAXAP, L-R and ADIFF in Qingping pigs, respectively. Unfortunately, the standard errors (SE) were large, which reflects the low accuracy of the heritability estimates. Therefore, we compared the results of the teat number-related heritability estimates in Qingping pigs with those of other pig breeds. In Qingping pigs, the narrow-sense heritability for TNUM (0.36) (only considering the contribution of additive genetic effects) was consistent with the values of purebred Korean Yorkshire pigs (0.37) [4] and Duroc pigs (0.34 ± 0.05) [3]. In previous studies, the values of LTN, RTN, MAXAP and L-R were 0.16, 0.26, 0.261 and 0.00, which were consistent with our results [17]. Unexpectedly, ADIFF had a medium value (0.24), which was virtually null in previous studies [16,17]. These results suggest that estimates of the heritability of teat number-related traits in Qingping pigs were reliable, but caution is required due to the large standard errors.

3.2. GLM GWAS for Teat Number-Related Traits

PCA results indicated that Qingping pigs could not be clustered into groups (Figure S1), so GWAS was performed using GLM with no principal component. The Bonferroni correction assumes that each of the tests is independent, an is thereby inherently conservative when considering SNPs in LD. We calculated the effectively independent tests based on the estimated number of independent markers [32]. A total of 557,540 SNP and 28,629 Indel independent tests were suggested, with the threshold p-value of 8.97 × 10−8 (0.05/557,540) for SNPs and 1.75 × 10−6 (0.05/28,629) for Indels. Figure 2 and Figure 3 show the Manhattan plots for LTN, RTN, TNUM, MAXAP, L-R and ADIFF, and Figure S2 presents the Q-Q plots for these traits. A total of 28 significant SNPs and 45 significant Indels were detected as peak associated variants for LTN, RTN, TNUM, MAXAP, L-R and ADIFF (Table 2). Among the 28 SNPs identified, 10 SNPs were located within 11 genes and the others (18 SNPs) were located at 3798 to 46,070,668 bp from the nearest genes (Table 2). Among the 45 Indels identified, 21 Indels were located within 22 genes and the others (24 Indels) were located 2660 to 137,303 bp from the nearest genes (Table 2).
Figure 2

Manhattan plots of GLM GWAS for teat number-related traits in Qingping pigs, including LTN, RTN, TNUM, MAXAP, L-R and ADIFF based on SNPs. LTN: the number of teats on the left side; RTN: the number of teats on the right side; TNUM: the total number of teats (TNUM = LTN + RTN); MAXAP: the maximum number of teats in LTN and RTN (MAXAP); L-R: the difference between the two sides (L-R = LTN − RTN); ADIFF: the absolute difference between left and right teat number (ADIFF = |LTN − RTN|).

Figure 3

Manhattan plots of GLM GWAS for teat number-related traits in Qingping pigs, including LTN, RTN, TNUM, MAXAP, L-R and ADIFF based on Indels. LTN: the number of teats on the left side; RTN: the number of teats on the right side; TNUM: the total number of teats (TNUM = LTN + RTN); MAXAP: the maximum number of teats in LTN and RTN (MAXAP); L-R: the difference between the two sides (L-R = LTN − RTN); ADIFF: the absolute difference between left and right teat number (ADIFF = |LTN − RTN|).

Table 2

Significant SNPs and Indels of GLM GWAS for teat number-related traits.

SNPTraitSSCPositionEffectp-ValueQTLs *AnnotationGene (Distance from the Gene in bp)
chr9:130956708LTN91309567080.643.85 × 108 intronicPACC1(within), NENF(within)
rs345573243RTN18473999080.621.60 × 10824,290, 7470intronic OSBPL3(within)
rs703282466MAXAP31067887300.452.29 × 1085224, 8797, 8798, 4250, 4256intergenic LTBP1(63796), ENSSSCG00000050704(9889)
chr15:137183045TNUM15137183045−0.797.02 × 108223,293intronic MLPH(within)
rs345573243TNUM18473999080.966.25 × 10824,290, 7470intronic OSBPL3(within)
rs1108940033ADIFF1457969830.612.19 × 108 intergenic PHF3(500583),ENSSSCG00000049526(273685)
rs321204530ADIFF11770568410.591.44 × 1085223, 5255, 822, 845, 1250intronic MDGA2(within)
rs318957512ADIFF2978493480.505.79 × 108 intronic ADGRV1(within)
rs326371568ADIFF21235862260.623.87 × 10104255intergenic FAM170A(119517),PRR16(613282)
rs342451777ADIFF21272241220.578.31 × 1094255intergenic ENSSSCG00000042143(13463),ENSSSCG00000040936(55836)
rs325963999ADIFF3193018370.669.15 × 1095224, 7455, 7472intronic KATNIP(within)
rs326276043ADIFF6340518480.556.98 × 10824,289intergenic ENSSSCG00000050973(273191),CYLD(8093)
rs338649298ADIFF81295521620.592.22 × 109 intergenic SNCA(163855),ENSSSCG00000043431(95940)
rs321470648ADIFF9107693370.455.38 × 108 intergenic ENSSSCG00000046278(36887),ENSSSCG00000045225(26362)
rs1109963100ADIFF1029111790.803.37 × 1010 intergenic ENSSSCG00000042899(210778),BRINP3(52348)
rs339887165ADIFF13121353880.681.46 × 1087479intergenic ENSSSCG00000044771(16544),ENSSSCG00000051554(31818)
chr13:39266305ADIFF13392663050.721.88 × 1087479intronic DNAH12(within)
rs701874475ADIFF13134665423−0.512.19 × 1087479intergenic LMLN(8675),ENSSSCG00000050583(21280)
rs338558804ADIFF13179683904−0.453.49 × 108 intergenic ENSSSCG00000038062(120975),NRIP1(140958)
rs343864506ADIFF13187708682−0.543.72 × 108 intergenic ENSSSCG00000047308(444803),ENSSSCG00000050420(500018)
rs334271954ADIFF14629597260.525.70 × 108 intergenic FAM13C(33877),SLC16A9(66937)
rs1109225784ADIFF15125813240.641.96 × 109 intergenic U6(133156),U6(186459)
rs326978910ADIFF15840159340.383.38 × 1087468intronic OSBPL6(within)
rs334746473ADIFF15972003230.754.23 × 10107468intergenic ENSSSCG00000046205(186079),U2(688281)
rs322863105ADIFF177610979−0.886.01 × 1011 intergenic ENSSSCG00000045345(132678),ENSSSCG00000047202(438984)
chr17:8221026ADIFF1782210260.791.47 × 109 intergenic U6(19092),FAT1(227734)
rs330045817ADIFF178536301−0.713.76 × 1010 ncRNA_intronic FAT1(within)
rs324534432ADIFF17133646680.791.47 × 109 intergenic PSD3(100469),ENSSSCG00000046441(3798)
Indel
chr6:7472906LTN67472906−0.411.06 × 10624,289intronic CDYL2(within)
rs695882779LTN936905070.399.84 × 107 ncRNA_intronic ENSSSCG00000049604(within)
chr9:119650540LTN9119650540−0.303.62 × 107 intergenic ENSSSCG00000044083(337170),ENSSSCG00000050832(4698)
rs792699200LTN91308998690.514.41 × 107 intronic PACC1(within)
chr12:44292044LTN1244292044−0.379.55 × 1075227, 5261, 6472, 6479, 595, 2929intronic NOS2(within)
chr13:194617904LTN13194617904−0.311.30 × 106 intergenic KRTAP11-1(111382),ENSSSCG00000047315(5813)
rs709659410RTN5179697710.425.28 × 1072927intergenic KRT73(2682),KRT2(14544)
chr6:9186279RTN691862790.892.28 × 10724,289intronic WWOX(within)
rs790747253RTN15100810568−0.611.28 × 1067468intronic PGAP1(within)
chr18:48316684RTN18483166840.417.00 × 10824,290, 7470intronic STK31(within)
chr3:98429885MAXAP3984298850.502.93 × 1075224, 8797, 8798intergenic ENSSSCG00000045166(59604),ENSSSCG00000046007(449247)
rs793312568MAXAP31067920800.438.28 × 1085224, 8797, 8798, 4250, 4256intergenic LTBP1(67153),ENSSSCG00000050704(6530)
chr6:9186279MAXAP691862790.821.59 × 10624,289intronic WWOX(within)
chr6:30642639MAXAP6306426390.431.31 × 10624,289intergenic ENSSSCG00000047270(70049),ENSSSCG00000041426(189382)
chr9:131996847MAXAP9131996847−0.491.46 × 106 intronic ENSSSCG00000040650(within)
chr14:12750600MAXAP14127506000.431.31 × 106 intronic HMBOX1(within)
rs790747253MAXAP15100810568−0.601.17 × 1067468intronic PGAP1(within)
rs711984029MAXAP1713041965−0.471.68 × 106 intronic PSD3(within)
rs793312568TNUM31067920800.631.41 × 1065224, 8797, 8798, 4250, 4256intergenic LTBP1(67153),ENSSSCG00000050704(6530)
chr5:75592729TNUM5755927290.988.38 × 107 intronic NELL2(within)
rs792699200TNUM91308998691.009.67 × 107 intronic PACC1(within)
chr18:48316684TNUM18483166840.624.01 × 10724,290, 7470intronic STK31(within)
chr1:44096236LR1440962360.691.25 × 107 intergenic ENSSSCG00000042072(159101),ENSSSCG00000045405(51729)
chr1:44973455ADIFF1449734550.701.96 × 107 intronic ZUP1(within),RSPH4A(within)
chr1:46840905ADIFF1468409050.715.46 × 108 intergenic ENSSSCG00000050391(123491),U6(392319)
chr1:65957275ADIFF1659572750.431.14 × 107 intergenic FBXL4(21578),FAXC(274880)
chr1:77875820ADIFF1778758200.428.57 × 108 intergenic FYN(65243),U6(107381)
chr1:118273285ADIFF1118273285−0.641.96 × 1095223, 6481, 5255intergenic ENSSSCG00000049391(9019),ENSSSCG00000045826(5556)
rs1113667849ADIFF21235840990.558.50 × 1074255intergenic FAM170A(117391),PRR16(615405)
chr3:54080763ADIFF3540807630.441.70 × 1065224, 7455, 7472, 6465intergenic LONRF2(99093),REV1(54761)
chr3:122749868ADIFF31227498680.761.91 × 107 intergenic LRATD1(17941),ENSSSCG00000045589(179131)
chr6:9702570ADIFF697025700.613.15 × 10724,289intronic WWOX(within)
chr6:29725029ADIFF629725029−0.411.54 × 10724,289intergenic ENSSSCG00000034192(132519),CES5A(117783)
chr7:48226837ADIFF7482268370.671.74 × 1065257intronic RASGRF1(within)
chr8:3878010ADIFF83878010−0.484.61 × 1077477intronic ENSSSCG00000027349(within)
chr8:131927486ADIFF81319274860.681.46 × 108 intergenic AFF1(2666),ENSSSCG00000032190(41799)
chr13:134712100ADIFF13134712100−0.502.83 × 1077479intergenic ENSSSCG00000050583(20384),OSBPL11(6327)
chr13:188309252ADIFF131883092520.354.51 × 107 intergenic ENSSSCG00000050420(95234),ENSSSCG00000043493(500979)
chr13:191714830ADIFF131917148300.761.91 × 107 intergenic ENSSSCG00000051384(248028),ENSSSCG00000048685(53357)
chr15:12585471ADIFF15125854710.671.74 × 106 intergenic U6(137304),U6(182308)
chr15:84277014ADIFF15842770140.536.57 × 1097468intergenic ENSSSCG00000036052(46519),ENSSSCG00000038561(122000)
rs792656057ADIFF1669550278−0.608.69 × 1075228intronic GRIA1(within)
rs793561441ADIFF175622328−0.339.14 × 108 intronic PCM1(within)
rs700363122ADIFF1781737670.601.13 × 108 intergenic ENSSSCG00000047202(116838),U6(28063)
rs789477433ADIFF18255418410.438.35 × 10724,290intergenic ENSSSCG00000048651(270433),FAM3C(27695)

* QTL number in PigQTL database. Effect means additive effect. LTN: the number of teats on the left side; RTN: the number of teats on the right side; TNUM: the total number of teats (TNUM = LTN + RTN); MAXAP: the maximum number of teats in LTN and RTN (MAXAP); L-R: the difference between the two sides (L-R = LTN − RTN); ADIFF: the absolute difference between left and right teat number (ADIFF = |LTN − RTN|).

Compared with known QTLs for teat number-related traits in the PigQTL database, 14 significant SNPs and 24 significant Indels overlapped with known QTLs (Table 2). Interestingly, Manhattan plots showed similar trends between SNPs and Indels. In Table 2, there were several significant SNPs and Indels were located at the same teat number-related traits QTLs in the PigQTL database. For RTN, significant SNP (rs345573243) and Indel (chr18:48316684) on SSC18 were located at QTL 7470 and QTL 24290, and coincidentally, rs345573243 and chr18:48316684 also showed significant associations with TNUM. For MAXAP, significant SNP (rs703282466) and Indel (rs793312568) on SSC3 were located at QTL 4250 and 4256, and coincidentally, rs793312568 also exhibited a significant association with TNUM. For ADIFF, significant SNPs (rs326371568 and rs342451777) and Indel (rs1113667849) on SSC2 were located at QTL 4255; significant SNP (rs701874475) and Indel (chr13:134712100) on SSC13 at QTL 7479; significant SNP (rs326978910) and Indel (chr15:84277014) on SSC15 at QTL 7468. Additionally, five new significant SNPs were closed to significant Indels, including rs1108940033 was closed to chr1:44973455 and chr1:46840905 on SSC1, rs338649298 was closed to chr8:131927486 on SSC8, rs343864506 was closed to chr13:188309252 on SSC13, rs1109225784 was closed to chr15:12585471 on SSC15, chr17:8221026 was closed to rs700363122 on SSC17.

3.3. FarmCPU GWAS for ADIFF

The GLM GWAS results of ADIFF showed significant associations even using whole genome SNPs or Indels to decide the thresholds (SNP: 0.05/2,319,3931; Indels: 0.05/2,053,221). However, Q-Q plots and genomic inflation factors (λSNP = 1.37, λIndel = 1.36) indicated possible false positives (Figure S2). FarmCPU, a powerful and efficient GWAS model, was used to control false positives and retain true positives. Q-Q plots and genomic inflation factors (λSNP = 0.98, λIndel = 0.86) were improved by FarmCPU (Figure S3). In Figure 4a and Table 3, 9 SNPs and 9 Indels were shown to be significantly associated variants for ADIFF on SSC1, 2, 3, 6, 8, 10, 11, 12, 13, 14, 15 and 17, with six SNPs and five Indels included in known QTLs (Table 3).
Figure 4

FarmCPU GWAS for ADIFF. (a) Manhattan plots of the GWAS based on SNPs and Indels. (b) Difference analysis of the strongest significant SNP (rs322863105) on SSC17 in GLM, which was retested in FarmCPU. (c) Haplotype block analysis of selected suggestive significant SNPs associated with ADIFF on SSC17 in GLM, including rs322863105. ADIFF: the absolute difference between left and right teat number (ADIFF = |LTN − RTN|).

Table 3

Significant SNPs and Indels of FarmCPU GWAS for ADIFF.

SNPSSCPositionEffectp-ValueQTLs *AnnotationGene (Distance from the Gene in bp)
rs325963999 #3193018370.27 7.82 × 10155224, 7455, 7472intronic KATNIP(within)
rs6936227086395405830.17 8.58 × 10924,289intergenic UQCRFS1(166710), ENSSSCG00000050718(41760)
rs3261348058885439010.13 3.81 × 10157477, 1100intergenic ENSSSCG00000044017(86061),SLC7A11(63983)
rs1109963100 #1029111790.33 3.74 × 1013 intergenic ENSSSCG00000042899(210778),BRINP3(52348)
rs111387539512114631440.16 7.51 × 1095227, 1128intergenic ABCA8(15848),ENSSSCG00000045738(20026)
rs343773900131101540620.24 5.42 × 10157479intronic PLD1(within)
rs33397051513132158230−0.18 3.99 × 10137479ncRNA_exonic ENSSSCG00000047632(within)
rs1109225784 #15125813240.32 1.46 × 1016 intergenic U6(133156),U6(186459)
rs322863105 #177610979−0.34 3.34 × 1013 intergenic ENSSSCG00000045345(132678),ENSSSCG00000047202(438984)
Indel
chr1:77875820 #1778758200.17 1.02 × 107 intergenic FYN(65243),U6(107381)
chr1:118273285 #1118273285−0.35 1.74 × 10105223, 6481, 5255intergenic ENSSSCG00000049391(9019),ENSSSCG00000045826(5556)
rs788352632262598411−0.12 3.81 × 1010909ncRNA_intronic ENSSSCG00000048292(within)
chr11:6896071116896071−0.13 1.22 × 1085260ncRNA_intronic ENSSSCG00000036846(within)
rs787621311131108980970.23 2.27 × 1077479intronic FNDC3B(within)
rs70171775614382222830.26 9.42 × 1010 intergenic RBM19(17536),ENSSSCG00000042669(3247)
chr14:914165561491416556−0.30 1.34 × 109 intergenic ENSSSCG00000047278(204580),CXCL12(99865)
chr15:84277014 #15842770140.22 2.92 × 1097468intergenic ENSSSCG00000036052(46519),ENSSSCG00000038561(122000)
rs700363122 #1781737670.40 9.10 × 1016 intergenic ENSSSCG00000047202(116838),U6(28063)

* QTL number in in PigQTL database. # Duplicate signals between GLM and FarmCPU. Effect means additive effect. ADIFF: the absolute difference between left and right teat number (ADIFF = |LTN − RTN|).

Three of the 9 significant SNPs were located within four genes and six SNPs were located 15,848 to 133,156 bp from the nearest genes (Table 3). Three of the 9 significant Indels were located within three genes and six Indels were located 3267 to 99,873 bp from the nearest genes (Table 3). Compared with the GLM GWAS results for ADIFF, 4 SNPs (rs325963999, rs1109963100, rs1109225784, rs322863105) and four Indels (chr1:77875820, chr1:118273285, chr15:84277014, rs700363122) were duplicated in the FarmCPU GWAS results, suggesting the reliability of the results (Table 3). The strongest significant SNP in GLM for ADIFF on SSC17 (rs322863105, p-value = 6.01 × 10−11) also showed significant association with ADIFF in FarmCPU (p-value = 3.34 × 10−13). The effect of rs322863105 on the ADIFF was estimated by genotyping Qingping pigs for this SNP. Individuals with the TT genotype had a lower ADIFF, suggesting LTN and RTN were more symmetrical (Figure 4b). Linkage analysis of the suggestive significant SNPs around this SNP identified one haplotype block of 8 kb between rs338532551 and rs322792299, including rs322863105 (Figure 4c). Three annotated genes were contained in the 1-Mb region around rs322863105, including tripartite motif family like 1 (TRIML1), tripartite motif family like 2 (TRIML2), and ZFP42 zinc finger protein (ZFP42). Moreover, a peak Indel (rs700363122) close to this SNP showed significant associations with ADIFF in the results of both GLM (1.13 × 10−8) and FarmCPU (9.10 × 10−16), with two annotated genes in the 1-Mb region around this Indel, including FAT Atypical Cadherin 1 (FAT1), and Melatonin Receptor 1A (MTNR1A).

3.4. Functional Enrichment of Candidate Genes

Annotated genes within 1-Mb regions of significant SNPs and Indels were defined as candidate genes. A total of 397 annotated genes were found in these regions (Table S2). In Figure 5a, GO enrichment analysis showed the enrichment of these candidate genes in epidermis development (p = 2.31 × 10−7), epidermal cell differentiation (p = 4.61 × 10−9), and skin development (p = 1.12 × 10−7). In Figure 5b KEGG pathway analysis revealed the enrichment of candidate genes in the pathways, such as the Sphingolipid signaling pathway (p = 1.27 × 10−3), ECM-receptor interaction (p = 4.79 × 10−3), and Glycine, serine and threonine metabolism (p = 5.43 × 10−3). Furthermore, we also paid attention to the Wnt signaling pathway (Figure 5c), due to its important role in initiating mammary morphogenesis and all subsequent stages of mammary formation as previously reported [33].
Figure 5

Enrichment results of candidate genes within 1-Mb regions of significant SNPs and Indels. (a) Dotplot of GO term enrichment. (b) Dotplot of KEGG pathway enrichment. (c) Wnt signaling pathway.

4. Discussion

During the first month after birth, piglets only have sow milk as a source of nutrients, which contributes to the regulation of their basal metabolism and temperature. Therefore, the sow’s ability to produce milk can influence the health and growth of piglets, probably with a long-term effect post-weaning. The sows’ lactation performance can be improved by enhancing the growth of the mammary gland and sows with a low prolactin/progesterone ratio before farrowing were reported to have a higher colostrum yield [34]. Additionally, milk production could be increased by adding L-arginine to the diets of lactating primiparous sows [35]. Moreover, for primiparous sows, teat suckling only for the first 2 days postpartum ensures the optimal mammary development and milk yield in the next lactation [36]. Furthermore, the lactating ability of sows can also be improved by increasing the teat number. As shown in the present study, teat number is heritable, with a genetic correlation to numerous microsatellite sites and SNPs (especially a QTL on SSC7). The of most teat number-related traits in Qingping pigs was moderate, except for L-R, suggesting it is feasible to improve teat number in pigs through genetic selection. SNPs and Indels associated with teat number-related traits might play an essential role in teat number improvement. Several candidate genes were reported to be related to mammary gland development and breast cancer. CDYL2, including an Indel significantly associated with LTN on SSC6, positively regulates breast cancer cell migration, invasion and epithelial-to-mesenchymal transition through p65/NF-κB and STAT3 [37]. FAM3C, which is in the 1-Mb region of an Indel on SSC18 and associated with ADIFF, encodes Interleukin-Like Epithelial-Mesenchymal Transition Inducer for the proliferation and migration of breast cancer cells [38]. WWOX, including an Indel (chr6:9186279) significantly associated with RTN, is known to play a role in breast cancer [39]. TRIML2 and MTNR1A were close to SNP (rs330045817) and Indel (rs700363122), respectively, on SSC17. As mentioned above, these two different types of variants were close to each other and significantly associated with ADIFF in both GLM and FarmCPU models. TRIML2 was significantly associated with prognosis, with a higher expression in triple-negative breast cancer cell lines than in normal mammary cell lines [40]. Common variants in MTNR1A may contribute to breast cancer susceptibility [41]. TBX5, encoding T-Box Transcription Factor 5, was close to the significant Indel (rs701717756) for ADIFF on SSC14. In a large German family, TBX3 and TBX5 duplication was reported to be associated with Ulnar-Mammary syndrome [42]. Importantly, TBX3 was the placode marker required for the formation of mammary placodes and the development of fetal mammary glands in all mammals [43]. Although not located in the 1-Mb region of rs701717756, TBX3 was close to this Indel with a distance of 625,516 bp. Unlike earlier studies, the present study failed to detect significant association signals in genome regions around VRTN on SSC7, which was reported as a credible candidate gene for the teat number and the vertebra number [17,44]. Zhuang et al. suggested that the genetic heterogeneity of variants in VRTN may exist in different populations and VRTN may not be a strong or the only causal gene for teat number based on their finding that VRTN mutation was significantly associated with the teat number in Canadian Duroc pigs, but not in American Duroc pigs [14]. Moreover, VRTN mutation on SSC7 was also not significantly associated with the teat number in Chinese pig breeds, including Beijing Black pig and Sushan pig [8,9], but significantly associated with the vertebra number in Beijing Black pig. Furthermore, VRTN was reported to modulate somite segmentation [44]. These reports suggested that VRTN plays a more important role in vertebra number and varies in its role in teat number among populations with different genetic backgrounds. Therefore, the teat number in Qingping pigs is speculated to involve other genes or pathways. The GO enrichment results showed significant enrichment in skin development, epidermis development, and epidermal cell differentiation. The mammary gland is an epithelial organ, and epithelial–stromal crosstalk is a key aspect of mammary morphogenesis [45]. In KEGG enrichment analysis, the Wnt signaling pathway did not reach a significance level of 0.05 (0.078). Interestingly, candidate genes, including WNT11, WNT16 and FZD3, were located at key positions in this pathway (Figure 5c). WNT16 and FZD3 were also involved in GO terms, skin development and epidermis development. The Wnt signaling cascade is implicated in almost all stages of mammary development and is pivotal for the specification and morphogenesis of the mammary gland [46]. WNT11 was expressed in stromal cells and basal cells in the adult mammary gland [46]. Chu et al. reported that WNT11 and FZD3 were expressed in mammary buds at E12.5 and E15.5 [47]. Importantly, Wnt signaling interacted with TBX3 in mammary placode development [47]. These reports suggested that candidate genes might affect teat number during mammary gland morphogenesis. Unlike other breeds, the teat number of Qingping pigs showed a medium heritability (0.24) for ADIFF, implying the potential involvement of candidate genes in mammary gland morphogenesis. Therefore, our candidate genes related to mammary gland morphogenesis and development can be assumed to contribute to teat number improvement and even may influence milk production.

5. Conclusions

In this study, GLM and FarmCUP GWAS were carried out to detect associated SNPs and Indels for 6 teat number-related traits. We found a total of 33 SNPs and 50 Indels for teat number. The most significant SNP and Indel were located on SSC17. Six candidate genes were enriched in the Wnt signaling pathway with an important role in mammary gland morphogenesis and development. A novel candidate gene on SSC14, TBX3, was detected as a mammary placode marker. These findings contribute to our understanding of the genetic architecture of teat number and provide genetic markers for genetic improvement of teat number in Qingping pigs.
  44 in total

1.  Detection of quantitative trait loci for teat number and female reproductive traits in Meishan × Large White F2 pigs.

Authors:  J P Bidanel; A Rosendo; N Iannuccelli; J Riquet; H Gilbert; J C Caritez; Y Billon; Y Amigues; A Prunier; D Milan
Journal:  Animal       Date:  2008-06       Impact factor: 3.240

2.  Genotyping by sequencing reveals a new locus for pig teat number.

Authors:  L Wang; Y Zhang; T Zhang; L Zhang; H Yan; X Liu; L Wang
Journal:  Anim Genet       Date:  2017-04-03       Impact factor: 3.169

3.  Genome-wide association study and genomic predictions for exterior traits in Yorkshire pigs1.

Authors:  Jungjae Lee; SeokHyun Lee; Jong-Eun Park; Sung-Ho Moon; Sung-Woon Choi; Gwang-Woong Go; Dajeong Lim; Jun-Mo Kim
Journal:  J Anim Sci       Date:  2019-07-02       Impact factor: 3.159

4.  Genome-wide association QTL mapping for teat number in a purebred population of Duroc pigs.

Authors:  A Arakawa; N Okumura; M Taniguchi; T Hayashi; K Hirose; K Fukawa; T Ito; T Matsumoto; H Uenishi; S Mikawa
Journal:  Anim Genet       Date:  2015-07-22       Impact factor: 3.169

5.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

6.  Genome-wide association study and accuracy of genomic prediction for teat number in Duroc pigs using genotyping-by-sequencing.

Authors:  Cheng Tan; Zhenfang Wu; Jiangli Ren; Zhuolin Huang; Dewu Liu; Xiaoyan He; Dzianis Prakapenka; Ran Zhang; Ning Li; Yang Da; Xiaoxiang Hu
Journal:  Genet Sel Evol       Date:  2017-03-29       Impact factor: 4.297

7.  FAM3C-YY1 axis is essential for TGFβ-promoted proliferation and migration of human breast cancer MDA-MB-231 cells via the activation of HSF1.

Authors:  Weili Yang; Biaoqi Feng; Yuhong Meng; Junpei Wang; Bin Geng; Qinghua Cui; Hongquan Zhang; Yang Yang; Jichun Yang
Journal:  J Cell Mol Med       Date:  2019-03-19       Impact factor: 5.310

Review 8.  The molecular basis of mammary gland development and epithelial differentiation.

Authors:  Priscila Ferreira Slepicka; Amritha Varshini Hanasoge Somasundara; Camila O Dos Santos
Journal:  Semin Cell Dev Biol       Date:  2020-10-17       Impact factor: 7.499

Review 9.  Mammary Development and Breast Cancer: A Wnt Perspective.

Authors:  Qing Cissy Yu; Esther M Verheyen; Yi Arial Zeng
Journal:  Cancers (Basel)       Date:  2016-07-13       Impact factor: 6.639

10.  Revealing New Candidate Genes for Teat Number Relevant Traits in Duroc Pigs Using Genome-Wide Association Studies.

Authors:  Yang Li; Lei Pu; Liangyu Shi; Hongding Gao; Pengfei Zhang; Lixian Wang; Fuping Zhao
Journal:  Animals (Basel)       Date:  2021-03-13       Impact factor: 2.752

View more
  1 in total

1.  Identification of Genomic Regions and Candidate Genes for Litter Traits in French Large White Pigs Using Genome-Wide Association Studies.

Authors:  Jianmei Chen; Ziyi Wu; Ruxue Chen; Zhihui Huang; Xuelei Han; Ruimin Qiao; Kejun Wang; Feng Yang; Xin-Jian Li; Xiu-Ling Li
Journal:  Animals (Basel)       Date:  2022-06-19       Impact factor: 3.231

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.