Literature DB >> 31138106

Determination of genetic associations between indels in 11 candidate genes and milk composition traits in Chinese Holstein population.

Jianping Jiang1,2, Lin Liu3, Yahui Gao1, Lijun Shi1, Yanhua Li1,3, Weijun Liang1, Dongxiao Sun4.   

Abstract

BACKGROUND: We have previously identified 11 promising candidate genes for milk composition traits by resequencing the whole genomes of 8 Holstein bulls with extremely high and low estimated breeding values for milk protein and fat percentages (high and low groups), including FCGR2B, CENPE, RETSAT, ACSBG2, NFKB2, TBC1D1, NLK, MAP3K1, SLC30A2, ANGPT1 and UGDH those contained 25 indels between high and low groups. In this study, the purpose was to further examine whether these candidates have significant genetic effects on milk protein and fat traits.
RESULTS: With PCR product sequencing, 13 indels identified by whole genome resequencing were successfully genotyped. With association analysis in 769 Chinese Holstein cows, we found that the indel in FCGR2B was significantly associated with milk yield, protein yield and protein percentage (P = 0.0041 to 0.0297); five indels in CENPE and one indel in MAP3K1 were markedly relevant to milk yield, fat yield and protein yield (P < 0.0001 to 0.0073); polymorphism in RETSAT was evidently associated with milk yield, fat yield, protein yield and protein percentage (P = 0.0001 to 0.0237); variant in ACSBG2 affected fat yield and protein percentage (P = 0.0088 and 0.0052); one indel in TBC1D1 was with respect to fat percentage and protein percentage (P = 0.0224 and 0.0209). Significant associations were shown between indels in NLK and protein yield and protein percentage (P = 0.0012 to 0.0257); variant in UGDH was related to the milk yield (P = 0.0312). The two exonic indels in FCGR2B and CENPE were predicted to change the mRNA and protein secondary structures, and resulted in the corresponding protein dysfunction.
CONCLUSION: Our findings presented here provide the first evidence for the associations of eight functional genes with milk yield and composition traits in dairy cattle.

Entities:  

Keywords:  Candidate gene; Dairy cattle; Genetic effect; Indel; Milk composition traits

Mesh:

Substances:

Year:  2019        PMID: 31138106      PMCID: PMC6537361          DOI: 10.1186/s12863-019-0751-y

Source DB:  PubMed          Journal:  BMC Genet        ISSN: 1471-2156            Impact factor:   2.797


Backgroud

In dairy cattle, milk yield and milk composition traits are the most important economic traits, which are controlled by numerous environmental factors and genes [1-4]. Over the past decades, unraveling the major genes and causal mutations with large effect on milk yield and composition traits is one of the important research fields for researchers. Quantitative trait locus (QTL) mapping and genome-wide association study (GWAS) have been widely applied to identify the QTLs, candidate genes and mutations affecting milk production traits in dairy cattle [5-8], and a large number of QTLs and genetic associations have been detected using such two approaches so far (http://www.animalgenome.org/cgi-bin/QTLdb/index). In recent years, short insertion and deletion (indel), as the second main form of genomic variation, has been increasingly paid more attention and has made great contribution to investigations on genetic and phenotypic diversities in human, chicken, pig and dairy cattle [9-13]. A previous study found that 2–18 base pairs (bp) indel located upstream of TAL bHLH transcription factor 1 (TAL1) was responsible for the T-cell acute lymphoblastic leukemia (T-ALL) [14]. In chicken, the 9–15 bp indel of premelanosome protein (PMEL17) gene was confirmed to be the causative mutation for the plumage color (Dominant white, Dun and Smoky) [10]. In pig, an intronic inserted retrotransposon of sperm flagellar 2 (SPEF2) led to the immotile short-tail sperm defect [15]. In Belgian blue cattle, a 11-bp indel in myostatin (MSTN) gene resulted in double-muscled phenotype [9], and an exonic 15-bp insertion in coagulation factor XI (F11) gene caused the factor XI deficiency in Japanese black cattle [11]. However, up to now, limited research of indel polymorphisms associated with milk production traits in dairy cattle has been reported [16]. With the rapidly emergence of next-generation sequencing (NGS), whole genome resequencing has been an important tool in the efforts to detect polymorphsims which were contributed to the complex traits or economic traits in human and domestic animals [17-20]. In our previous whole genome resequencing study, we identified over 0.9 million short indels and 3625 common differential indels with the same allelic distribution directions based on the 8 Holstein bulls with extremely high or low estimated breeding values (EBVs) of milk protein and fat percentages (high and low groups) [21]. Based on this, 11 genes were identified as the promising candidates affecting milk compositions traits in dairy cattle, including FCGR2B, CENPE, RETSAT, ACSBG2, NFKB2, TBC1D1, NLK, MAP3K1, SLC30A2, ANGPT1 and UGDH, which contained 25 differential indels [21]. Thus, the aim of this study was to further validate whether these identified indels in the 11 genes significantly impact on milk yield and compositions traits in Chinese Holstein population.

Results

Indel verification and genotyping

Based on two DNA pools from 40 Holstein sires, with PCR product sequencing, 22 of 25 indels identified by whole genome resequencing [21] were confirmed as true ones (Additional file 2), among them, four indels were identified for the first time (Table 1). Subsequently, 13 indels in 8 genes were successfully genotyped and performed for association analysis. Of the 13 indels, two indels, including rs381714237 in FCGR2B, ss2137349053 in CENPE were located in the exons, whilst, the remaining 11 indels were located in the intronic regions. Chi-squared test showed that all the 13 indels were in Hardy-Weinberg equilibrium (P > 0.05). The genotype frequencies and allele frequencies of the 13 indels were summarized in Table 2.
Table 1

Detailed information of 24 indels of 11 genes identified in Chinese Holstein cattle

IndelGeneLocationGenBank no.Position in UMD_3.1Indel SequenceConfirmed?
1 N ins FCGR2B exon7rs381714237chr3:7930047G1
3 N ins CENPE exon58ss2137349053chr6:23018080AGA1
3 N del CENPE exon68chr6:23026632–23026634TAG3
3 N ins CENPE intron13rs385060942chr6:22983076GTT1
1 N ins CENPE intron13ss2137349051chr6:22983397T1
1 N del CENPE intron18rs384082187chr6:22989805A2
21 N ins CENPE intron24rs377812754chr6:22996564–22996573ACTTAAGTATATAACCTTAAC2
2 N del CENPE intron41rs453960300chr6:23018994–23018995CC1
1 N del CENPE intron49rs378415122chr6:23036105C1
4 N ins CENPE intron51ss2137349056chr6:23040582ACAC4
2 N del RETSAT 3’UTRrs136527375chr11:49489416–49489417AA2
9 N ins RETSAT intron6rs134985825chr11:49485899ATTCTGGGG1
1 N ins ACSBG2 intron7rs377943075chr7:19476990G1
2 N ins NFKB2 5′ regulatory regionchr26:22891203GG3
1 N del TBC1D1 intron1rs136639319chr6:58898979T1
1 N ins NLK intron1rs137724510chr19:20180649T2
2 N del NLK intron1rs379188781chr19:20189055–20189056AT1
1 N del NLK intron3rs135129224chr19:20264835A2
4 N del NLK intron3rs134444531chr19:20276109–20276112AAAA1
5 N del MAP3K1 intron16ss2137349058chr20:22365627–22365631CATTT1
6 N del SLC30A2 intron2ss2137349049chr2:127640012–127640017TTTTTG2
2 N ins ANGPT1 intron1ss2137349057chr14:59305051AT2
1 N ins UGDH intron7rs383327605chr6:60236955T2
1 N ins UGDH intron2ss2019489562chr6:60252782G1

Note: 1 indels were genotyped successfully; 2 indels were failed to genotype using MALDI-TOF MS; 3 indels were not polymorphic in current population; 4 primers of indel were failed to design

Table 2

The genotypic and allelic frequencies of 13 indels of 8 genes

LocusGeneGenotypeGenotype frequenciesAlleleAllele frequencies
rs381714237 FCGR2B del/del0.115del0.369
del/ins0.508ins0.631
ins/ins0.377
ss2137349053 CENPE del/del0.208del0.462
del/ins0.507ins0.538
ins/ins0.284
rs385060942 CENPE del/del0.206del0.462
del/ins0.512ins0.538
ins/ins0.282
ss2137349051 CENPE del/del0.277del0.534
del/ins0.513ins0.466
ins/ins0.210
rs453960300 CENPE ins/ins0.212ins0.467
ins/del0.511del0.533
del/del0.277
rs378415122 CENPE ins/ins0.210ins0.467
ins/del0.514del0.533
del/del0.276
rs134985825 RETSAT del/del0.276del0.524
del/ins0.497ins0.476
ins/ins0.227
rs377943075 ACSBG2 del/del0.082del0.276
del/ins0.388ins0.724
ins/ins0.530
rs136639319 TBC1D1 ins/ins0.025ins0.186
ins/del0.321del0.814
del/del0.654
rs379188781 NLK ins/ins0.374ins0.608
ins/del0.468del0.392
del/del0.158
rs134444531 NLK ins/ins0.265ins0.532
ins/del0.532del0.468
del/del0.202
ss2137349058 MAP3K1 ins/ins0.207ins0.466
ins/del0.518del0.534
del/del0.275
ss2019489562 UGDH del/del0.220del0.466
del/ins0.491ins0.534
ins/ins0.289
Detailed information of 24 indels of 11 genes identified in Chinese Holstein cattle Note: 1 indels were genotyped successfully; 2 indels were failed to genotype using MALDI-TOF MS; 3 indels were not polymorphic in current population; 4 primers of indel were failed to design The genotypic and allelic frequencies of 13 indels of 8 genes

Associations between indels and five milk production traits

The results of associations between the 13 indels and five milk production traits were shown in Table 3. It was observed that all these indels were significantly associated with at least one of the milk traits (P < 0.0001 to P = 0.0312) as described below.
Table 3

Association results of the thirteen indels in eight genes on the five milk production traits (least squares mean ± SE)

LocusGeneGenotype (No.)MYFYFPPYPP
rs381714237 FCGR2B del/del(86)10,891 ± 99.80385.05 ± 4.213.59 ± 0.040319.32 ± 3.10ab2.95 ± 0.014a
ins/del(380)10,702 ± 64.80383.94 ± 2.893.66 ± 0.027315.86 ± 2.11Aa2.98 ± 0.009ab
ins/ins(282)10,834 ± 68.20385.61 ± 3.033.64 ± 0.028321.83 ± 2.20Bb2.99 ± 0.009b
P value 0.0297* 0.79260.1738 0.0041** 0.0198*
ss2137349053 CENPE del/del(153)10,403 ± 55.66A374.36 ± 2.82Aa3.61 ± 0.034308.40 ± 2.65A2.97 ± 0.024
del/ins(373)10,661 ± 47.88B378.61 ± 2.43Aa3.57 ± 0.027315.70 ± 2.14Ba2.97 ± 0.017
ins/ins(209)10,852 ± 51.03C385.99 ± 2.59B3.58 ± 0.031321.05 ± 2.37Bb2.97 ± 0.020
P value < 0.0001** < 0.0001** 0.3733 < 0.0001** 0.9882
rs385060942 CENPE del/del(152)10,494 ± 84.00Aa376.49 ± 3.60Aa3.60 ± 0.034310.45 ± 2.62B2.96 ± 0.012
del/ins(377)10,717 ± 65.77Aa381.44 ± 2.93ab3.57 ± 0.027318.18 ± 2.13Aa2.97 ± 0.009
ins/ins(208)10,871 ± 74.83B387.82 ± 3.26Bb3.59 ± 0.031322.24 ± 2.38Aa2.97 ± 0.010
P value < 0.0001** 0.0045** 0.6063 < 0.0001** 0.6166
ss2137349051 CENPE ins/ins(206)10,817 ± 75.26A386.81 ± 3.28A3.60 ± 0.031321.11 ± 2.39A2.97 ± 0.010
del/ins(382)10,586 ± 65.52Ba378.41 ± 2.92Ba3.59 ± 0.027314.42 ± 2.12Ba2.98 ± 0.009
del/del(156)10,397 ± 83.94Bb374.49 ± 3.61Ba3.61 ± 0.034308.61 ± 2.63Bb2.97 ± 0.012
P value < 0.0001** 0.0010** 0.7289 < 0.0001** 0.9571
rs453960300 CENPE ins/ins(158)10,380 ± 83.33Aa375.15 ± 3.583.63 ± 0.034307.63 ± 2.61Aa2.98 ± 0.012
ins/del(381)10,586 ± 66.10b380 ± 2.953.61 ± 0.027313.34 ± 2.15b2.98 ± 0.009
del/del(207)10,764 ± 74.83Bc385.96 ± 3.263.61 ± 0.031318.18 ± 2.38Bc2.97 ± 0.010
P value < 0.0001** 0.0073** 0.7589 0.0002** 0.9461
rs378415122 CENPE ins/ins(158)10,373 ± 83.79Aa373.03 ± 3.60Aa3.62 ± 0.034306.81 ± 2.62Aa2.97 ± 0.012
ins/del(386)10,547 ± 65.79Aa377.75 ± 2.93a3.60 ± 0.027312.15 ± 2.13Aa2.97 ± 0.009
del/del(207)10,785 ± 74.23B386.21 ± 3.23Bb3.60 ± 0.030319.19 ± 2.36B2.97 ± 0.010
P value < 0.0001** 0.0004** 0.8913 < 0.0001** 0.9535
rs134985825 RETSAT del/del(205)10,774 ± 75.77Aa383.94 ± 3.30ab3.58 ± 0.031318.23 ± 2.40ab2.96 ± 0.010a
del/ins(370)10,563 ± 66.51B378.10 ± 2.95Aa3.60 ± 0.027314.44 ± 2.15Aa2.98 ± 0.009ab
ins/ins(169)10,823 ± 79.45Aa388.92 ± 3.43Bb3.61 ± 0.032323.7 ± 2.50Bb2.99 ± 0.011b
P value 0.0003** 0.0009** 0.6500 0.0001** 0.0237*
rs377943075 ACSBG2 del/del(62)10,746 ± 111.22378.40 ± 4.68Aa3.56 ± 0.045318.61 ± 3.423.00 ± 0.016ab
del/ins(294)10,776 ± 69.04387.18 ± 3.03ab3.61 ± 0.028319.22 ± 2.192.99 ± 0.009Ab
ins/ins(401)10,884 ± 63.81391.00 ± 2.84Bb3.62 ± 0.026319.31 ± 2.052.97 ± 0.008Ba
P value 0.1334 0.0088** 0.43930.9749 0.0052**
rs136639319 TBC1D1 ins/ins(19)10,752 ± 182.68391.92 ± 7.473.65 ± 0.073ab320.14 ± 5.452.98 ± 0.026ab
del/ins(242)10,564 ± 72.44383.75 ± 3.183.65 ± 0.030a314.38 ± 2.322.99 ± 0.010a
del/del(493)10,659 ± 64.49380.06 ± 2.883.58 ± 0.027b314.42 ± 2.12.96 ± 0.009b
P value 0.24930.1198 0.0224* 0.5384 0.0209*
rs379188781 NLK ins/ins(277)10,676 ± 70.45377.09 ± 3.113.56 ± 0.029318.23 ± 2.262.99 ± 0.009Aa
del/ins(347)10,627 ± 67.41381.09 ± 3.003.60 ± 0.028314.48 ± 2.182.97 ± 0.009b
del/del(117)10,749 ± 90.28384.24 ± 3.833.57 ± 0.037315.92 ± 2.792.95 ± 0.013Bb
P value 0.33890.10200.26150.1279 0.0047**
rs134444531 NLK ins/ins(197)10,748 ± 78.14384.18 ± 3.403.58 ± 0.032322.04 ± 2.47Aa2.99 ± 0.011a
del/ins(395)10,723 ± 65.31386.23 ± 2.913.61 ± 0.027317.93 ± 2.12a2.97 ± 0.009ab
del/del(150)10,563 ± 85.88381.58 ± 3.683.63 ± 0.035312.32 ± 2.68Bb2.96 ± 0.012b
P value 0.07330.30260.4078 0.0012** 0.0257*
ss2137349058 MAP3K1 ins/ins(152)10,482 ± 82.27Aa372.63 ± 3.54Aa3.58 ± 0.034310.01 ± 2.58Aa2.97 ± 0.011
del/ins(380)10,613 ± 66.30Aa376.47 ± 2.94Aa3.56 ± 0.027314.12 ± 2.14Aa2.97 ± 0.009
del/del(202)10,909 ± 77.27Bb390.47 ± 3.35Bb3.59 ± 0.032323.13 ± 2.44Bb2.97 ± 0.011
P value < 0.0001** < 0.0001** 0.4863 < 0.0001** 0.8495
ss2019489562 UGDH del/del(405)10,860 ± 64.35a387.55 ± 2.893.64 ± 0.027318.31 ± 2.112.99 ± 0.009
del/ins(4)9902 ± 384.75b376.58 ± 15.493.81 ± 0.153299.74 ± 11.303.02 ± 0.056
ins/ins(342)10,798 ± 66.31ab388.28 ± 2.953.64 ± 0.027315.05 ± 2.132.97 ± 0.009
P value 0.0312* 0.72670.50770.05320.0567

Note:*significant association at the significance level of 0.05; **significant association at the significance level of 0.01

The different superscripts (A,B within the same column with different superscripts indicate P < 0.01; a,b indicate P < 0.05) adjusted after correction for multiple testing indicate significant differences among the genotypes

MY milk yield, FY fat yield, FP fat percentage, PY protein yield, PP protein percentage

Association results of the thirteen indels in eight genes on the five milk production traits (least squares mean ± SE) Note:*significant association at the significance level of 0.05; **significant association at the significance level of 0.01 The different superscripts (A,B within the same column with different superscripts indicate P < 0.01; a,b indicate P < 0.05) adjusted after correction for multiple testing indicate significant differences among the genotypes MY milk yield, FY fat yield, FP fat percentage, PY protein yield, PP protein percentage

Exonic indels

The exonic indel rs381714237 in FCGR2B was associated with milk yield (P = 0.0297), protein yield (P = 0.0041) and protein percentage (P = 0.0198). The other exonic indel ss2137349053 in CENPE, was strongly associated with milk yield (P < 0.0001), fat yield (P < 0.0001) and protein yield (P < 0.0001).

Intronic indels

The four intronic indels (rs385060942, ss2137349051, rs453960300 and rs378415122) in CENPE were significantly associated with milk yield (P < 0.0001), fat yield (P = 0.0004 to 0.0073) and protein yield (P < 0.0001 to 0.0002). Additionally, the five indels (four intronic indels and one exonic indel above) of CENPE gene were found to be highly linked (r2 > 0.98), and one haplotype block was inferred as presented in Fig. 1. Haplotype-based association analysis showed that the haplotype combination was evidently associated with milk yield, fat yield and protein yield as well (P < 0.0001 to P = 0.0076) (Table 4).
Fig. 1

Linkage disequilibrium estimated of the five indels in CENPE gene. The values in boxes are pair-wise indel correlations (r2)

Table 4

Haplotype analysis of CENPE gene (least squares mean ± SE)

Haplotype (No.)MYFYFPPYPP
H1H1 (198)10,874 ± 77.45Aa389.53 ± 3.38Aa3.60 ± 0.031321.43 ± 2.46Aa2.97 ± 0.011
HIH2 (356)10,734 ± 68.09Aa382.37 ± 3.02b3.58 ± 0.028316.68 ± 2.20Ab2.96 ± 0.009
H2H2 (140)10,498 ± 88.34Bb378.32 ± 3.78Bb3.62 ± 0.036309.40 ± 2.75B2.95 ± 0.012
P value 0.0003** 0.0076** 0.4045 < 0.0001** 0.7803

Note:*significant association at the significance level of 0.05; **significant association at the significance level of 0.01

The different superscripts (A,B within the same column with different superscripts indicate P < 0.01; a,b indicate P < 0.05) adjusted after correction for multiple testing indicate significant differences among the genotypes

MY milk yield, FY fat yield, FP fat percentage, PY protein yield, PP protein percentage

Linkage disequilibrium estimated of the five indels in CENPE gene. The values in boxes are pair-wise indel correlations (r2) Haplotype analysis of CENPE gene (least squares mean ± SE) Note:*significant association at the significance level of 0.05; **significant association at the significance level of 0.01 The different superscripts (A,B within the same column with different superscripts indicate P < 0.01; a,b indicate P < 0.05) adjusted after correction for multiple testing indicate significant differences among the genotypes MY milk yield, FY fat yield, FP fat percentage, PY protein yield, PP protein percentage Indel rs134985825 in the intron 6 of RETSAT showed remarkable effects on milk yield, protein yield, fat yield and protein percentage (P = 0.0001 to 0.0237). For ACSBG2, indel rs377943075 in the intron 7 was significantly associated with fat yield (P = 0.0088) and protein percentage (P = 0.0052). Variant rs136639319 in the intron 3 of TBC1D1was significantly associated with fat percentage (P = 0.0224) and protein percentage (P = 0.0209). For the indel rs379188781 in the intron 1 of NLK, it was found to be associated with protein percentage (P = 0.0047), the other indel rs134444531 in the intron 3 was associated with protein yield (P = 0.0012) and protein percentage (P = 0.0257). While, no LD was observed between such two indels (r2 = 0.14). For MAP3K1, indel ss2137349058 in the intron 16 was markedly associated with milk yield, fat yield and protein yield (P < 0.0001). For the intronic indel of UGDH gene, the indel ss2019489562 located in intron 2 was significantly associated with milk yield (P = 0.0312). Additionally, the significant additive, dominant and allele substitution effects of the 13 indels on the five milk traits were observed as well (Table 5).
Table 5

Genetic effects of thirteen indels in eight genes on five milk production traits

LocusGeneGene effectsMYFYFPPYPP
rs381714237 FCGR2B Additive effect(a) 66.13* 0.83−0.0072 2.99** 0.0051
Dominant effect(d)123.270.27− 0.06180.47 −0.0348**
Substitution effect(α)30.920.760.0105 2.85* 0.0150*
ss2137349053 CENPE Additive effect(a) 224.66** 5.81** −0.0174 6.32** − 0.0002
Dominant effect(d)33.43−1.57−0.02460.980.0028
Substitution effect(α) 227.29** 5.69** −0.0193 6.4** 0.0001
rs385060942 CENPE Additive effect(a) −188.63** −5.66** 0.007 −5.89** −0.0042
Dominant effect(d)34.37−0.71−0.02181.840.0065
Substitution effect(α) −191.51** −5.60** 0.0089 −6.05** −0.0048
ss2137349051 CENPE Additive effect(a) −209.61** − 6.16** 0.0093 − 6.25** −0.0009
Dominant effect(d)−20.82− 2.24−0.014−0.440.0022
Substitution effect(α) −208.06** −6.00** 0.0103 −6.22** −0.0011
rs453960300 CENPE Additive effect(a) − 192.46** −5.40** 0.0105 − 5.27** 0.0015
Dominant effect(d)13.90−0.55−0.01060.430.0017
Substitution effect(α) −193.46** −5.36** 0.0113 −5.31** 0.0014
rs378415122 CENPE Additive effect(a) −205.95** −6.59** 0.0069 −6.19** −0.0011
Dominant effect(d)−32.03−1.87−0.0068− 0.850.0022
Substitution effect(α) −203.68** −6.46** 0.0073 −6.13** −0.0012
rs134985825 RETSAT Additive effect(a)24.572.490.0156 2.74* 0.0163**
Dominant effect(d) − 235.31** −8.32** 0.0001 −6.52** 0.0036
Substitution effect(α)29.532.670.0156 2.88* 0.0162**
rs377943075 ACSBG2 Additive effect(a)−69.28 −6.30** −0.0272−0.35 0.0154*
Dominant effect(d)−39.322.480.01780.260.0108
Substitution effect(α)−56.34 −7.12* −0.033−0.440.0118
rs136639319 TBC1D1 Additive effect(a)47.54−1.85 −0.0342** 0.02 −0.0127**
Dominant effect(d)140.1510.020.03465.740.0103
Substitution effect(α)98.361.79−0.02162.11−0.0089
rs379188781 NLK Additive effect(a)−36.45−3.58−0.00571.16 0.0194**
Dominant effect(d)−85.740.430.0343−2.59−0.0039
Substitution effect(α)−55.71−3.480.0020.58 0.0185*
rs134444531 NLK Additive effect(a)92.551.30−0.0226 4.86** 0.0169*
Dominant effect(d)67.333.360.00840.75−0.0055
Substitution effect(α)97.881.57−0.0219 4.92** 0.0164*
ss2137349058 MAP3K1 Additive effect(a) −213.49** −8.92** −0.0046 −6.56** 0.0013
Dominant effect(d)−82.74−5.08*−0.0264−2.450.0043
Substitution effect(α)−208.37**−8.60**−0.0029−6.41**0.0011
ss2019489562 UGDH Additive effect(a)31.00−0.36− 0.00321.630.0097*
Dominant effect(d)−925.95*−11.330.1715−16.940.0462
Substitution effect(α)−42.62−1.270.01040.280.0134*

Note: (a), (d), (α) means Additive, Dominant and Substitution effect, respectively

* means the additive, dominant or allele substitution effect of the locus indicated differ at P < 0.05 and ** means the additive, dominant or allele substitution effect of the locus indicated differ at P < 0.01

MY milk yield, FY fat yield, FP fat percentage, PY protein yield, PP protein percentage

Genetic effects of thirteen indels in eight genes on five milk production traits Note: (a), (d), (α) means Additive, Dominant and Substitution effect, respectively * means the additive, dominant or allele substitution effect of the locus indicated differ at P < 0.05 and ** means the additive, dominant or allele substitution effect of the locus indicated differ at P < 0.01 MY milk yield, FY fat yield, FP fat percentage, PY protein yield, PP protein percentage

Prediction the mRNA and protein structures

Using a statistical folding algorithm, the alteration of the most stable mRNA secondary structures caused by the two exonic indels for FCGR2B and CENPE were observed for both the ins/ins and del/del genotypes. As illustrated in Fig. 2, obvious structural differences spanning the position 971–980 between the ins/ins and del/del genotypes of the indel rs381714237 in FCGR2B gene were observed. The free energy (∆G) of the ins allele was predicted to be higher (∆G = − 468.70 kcal/mol) than the del allele (∆G = − 470.30 kcal/mol). Correspondingly, the ins allele was deduced to form one larger single loop structure, which potentially decreasing the stability of mRNA (∆∆G = + 1.6 kcal/mol). It is worth mentioning that previous studies have evidenced that the ∆∆G ranged from − 3.9 kcal/mol to + 4.0 kcal/mol could affect the mRNA stability [22-28]. In addition, indel rs381714237 of FCGR2B was predicted to decrease the number of amino acid by 38, which might change protein structure and function. As a result, differences of the protein secondary structures were predicted between the FCGR2B proteins corresponding to alleles del and ins with regard to alpha helix (21.64% vs. 16.45%), extended strand (23.10% vs. 24.67%), beta turn (7.02% vs. 7.89%) and random coil (48.25% vs. 50.99%) using the SOPMA program.
Fig. 2

The predicted mRNA secondary structures corresponding to the exonic indel of FCGR2B gene

The predicted mRNA secondary structures corresponding to the exonic indel of FCGR2B gene For the non-frameshiting indel, subtle change of mRNA secondary structures between the two homozygous genotypes of indel ss2137349053 in CENPE was occurred (data not shown). The free energy was altered from − 1816.10 kcal/mol for the del allele to − 1818.90 kcal/mol for the ins allele. While, slight difference was predicted for the CENPE protein in accordance between the del/del and ins/ins genotypes, alpha helix (72.57% vs. 72.69%), and random coil (13.94% vs. 13.82%). There was no change of extended strand and beta turn for CENPE protein.

Discussion

In the present work, we confirmed that 13 indels belonging to 8 candidate genes (FCGR2B, CENPE, RETSAT, ACSBG2, TBC1D1, NLK, MAP3K1 and UGDH) for milk compositions identified by our previous whole genome resequencing study [21] showed significant genetic effects on at least one of milk traits in dairy cattle. As far as our knowledge, this is the first report to connect these genes to milk production traits of dairy cattle. Among the total 25 differential indels with the same allelic distribution directions between the bulls in high and low groups identified by our previous whole genome resequencing study [21], indel rs383700527 (3 N ins) located upstream of ACSBG2 gene was found to contribute to milk fat in a cis-regulatory manner (unpublished data). Thus, we investigated another 24 indels in the present study. Among them, one intronic indel (4 N ins in CENPE) was failed to be verified by Sanger sequencing due to the special characteristic of the flanking sequence with lower GC% and repetitive DNA sequences. Two indels (3 N del in CENPE and 2 N ins in NFKB2) didn’t show polymorphic in this study. Eight indels (1 N del and 21 N ins in CENPE, 2N del in RETSAT, 1 N ins and 1 N del in NLK, 6 N del in SLC30A2, 2 N ins in ANGPT1 and 1N ins in UGDH) were failed to be genotyped by using MALDI-TOF MS. The possible reason may be that MALDI-TOF MS for multiplex genotyping was relied on multiplex-PCR primers and extended primers to genotype multiple loci [29], simultaneously, the primer design was depended on sequence composition, molecular weight, annealing temperature and reaction efficiencies of each locus [29]. Hence, a total of 13 polymorphic indels were successfully genotyped and performed for association analysis.

Significant associations between candidate genes and milk production traits

Six indels in FCGR2B and CENPE

For indel rs381714237 in FCGR2B, we demonstrated that ins/ins genotype had higher protein percentage. As a regulator, FCGR2B was contributed to immune response [30]. Additionally, bovine mammary gland is a product of the innate immune system and active during lactation. Thus, these evidences indicated that FCGR2B might affect milk protein percentage through impacting the cows on immune response during lactation. For the five indels in CENPE, the association results revealed that ins/ins genotypes were dominant compared with del/del genotypes for milk yield, fat yield and protein yield. Previous report has found that CENPE acted as a monitor protein and was necessary for cell cycle [31]. Thus, it appeared that the CENPE might affect these traits through modulating bovine mammary gland development.

Seven indels located in six genes

Our association analysis confirmed that the ins/ins genotype of the indel rs134985825 in RETSAT gene increased milk yield, fat yield, and protein yield. RETSAT was considered as a regulator for liver metabolism, and was critical for lipid accumulation and adipogenesis promotion [32]. Previous research has investigated that the polymorphisms of RETSAT gene were associated with premium cut yields and backfat thickness in pig [33]. Taken together, we speculated that RETSAT might affect milk traits through influencing the lipid metabolism. Herein, we found that individuals with ins/ins genotype of indel rs377943075 in ACSBG2 showed higher fat yield than those with del/del genotype. The ACSBG2 gene encodes the protein that belongs to a member of the acyl-CoA synthetase family and participated in PPAR signaling pathway and involved in lipid metabolism and lipid droplet formation [34, 35]. Previous researchers have found that polymorphisms of ACSBG2 showed positive effects on yolk development and abdominal fat weight [36]. In current study, our results also showed a significant relationship between the indel rs136639319 in TBC1D1 and fat percentage as well as protein percentage. It was worth mentioning that TBC1D1, as a member of Rab GTPase-activating proteins (GAPs), was involved in translocation of GLUT4 to the plasma membrane. Polymorphisms in TBC1D1 have been observed to show significant effects on severe obesity or carcass in human [37] and chicken [38], respectively, suggesting exhibiting functions related to lipid and energy homeostasis as reported by Hargett et al. [39, 40]. Two intronic indel (rs379188781 and rs134444531) in NLK showed strong associations with protein yield and protein percentage. Interestingly, Cole et al. reported that one single nucleotide polymorphism (SNP) (ARS-BFGL-NGS-106227) significantly associated with protein percentage (P = 5.59 × 10− 8) was merely 90 kb away from the NLK gene [41]. Furthermore, NLK, as a member of MAPK subfamily, had an essential role in mediating the mTORC1 signaling pathway which was involved in milk protein synthesis [42, 43]. Together, these data suggested that significant variation of protein yield and protein percentage might be regulated by NLK. As for MAP3K1, individuals with del/del genotype of indel polymorphism ss2137349058 had higher milk yield, fat yield and protein yield. MAP3K1 was known to be involved in the MAPK signaling pathway, and was considered to be a metabolic stimuli inducing cell proliferation [44, 45]. Meanwhile, it functioned as a candidate gene for type 2 diabetes (T2D) by interacting with insulin signaling pathway [46]. Thus, we concluded that MAP3K1 might regulate milk composition traits by modulating bovine mammary gland development. The intronic indel ss2019489562 in UGDH showed significant effect on milk yield. UGDH encodes the protein that was implicated with biosynthesis of glycosaminoglycans, hyaluronan, chondroitin sulfate, and heparan sulfate. Previously, Xu et al. have demonstrated that two exonic SNPs in UGDH showed significant associations with milk production traits in Chinese Holstein population [47]. In particular, UGDH was close to the peak location of two reported QTLs for fat yield, fat percentage and protein yield [48-51]. Further, two previously reported significant SNPs for fat yield, protein yield, fat percentage and protein percentage [41] were near to the UGDH gene. Moreover, expression pattern in InnateDB showed that UGDH have the highest expression in liver which plays an indispensable role in metabolism of carbohydrates, fats and proteins in dairy cattle. Hence, these data demonstrated that UGDH gene might be a vital regulator for milk traits by affecting liver metabolism.

Conclusion

In the present study, we performed association analysis for the 13 short indels within 8 candidate genes for milk compositions identified by our previous whole genome resequencing study, including FCGR2B, CENPE, RETSAT, ACSBG2, TBC1D1, NLK, MAP3K1 and UGDH. As a result, the 13 indels were shown to have significant genetic effects on at least one of milk yield and composition traits. These results not only validated the candidate genes and indels from the previous whole genome resequencing work, but also provided novel molecular information for genetic improvement program of dairy cattle.

Methods

Ethics statement

All the procedures for sample collections and phenotypic observations of experimental individuals were carried out along with regular quarantine inspection of the farms and in strict accordance with the protocol reviewed and approved by the Institutional Animal Care and Use Committee (IACUC) at China Agricultural University, and the permit number is DK996.

Animals

The animals used for association analysis included a total of 769 Chinese Holstein cows those were daugters of 40 sire families. These daughters were collected from 22 herds of Beijing Sanyuanlvhe Dairy Farming Center, a leading dairy company in China. Phenotypic data of the five milk production traits including 305-day milk yield (MY), fat yield (FY), protein yield (PY), fat percentage (FP) and protein percentage (PP) those were calculated based on at least 6 test-day records in each lactation using a multiple trait random regression test-day model by the Dairy Data Center of China, Dairy Association of China (http://www.holstein.org.cn/). Genomic DNA was isolated from whole blood of cows and frozen semen of sires as previously described by Yang et al. [16].

Indels selection, PCR amplification, sequencing and genotyping

Of the 25 short indels that identified by our previous whole-genome resequencing study, 24 indels were investigated the associations with the five milk production traits except for a three-nucleotide insertion (3 N ins) in ACSBG2 gene. A total of 23 pairs of PCR primers were designed with Primer Premier 5.0 and Oligo 7.0 softwares based on the genomic sequences of the 11 candidate genes in Bos_taurus_UMD3.1 assembly (Additional file 1). To identify the twenty-four potential indel polymorphisms, two DNA pools for the above 40 sires were constructed with equal concentration of 50 ng/μl of each bull (20 individuals/pool). PCR products basd on the pooled DNA were purified with an EasyPure PCR Purification Kit (TransGen Biotech, Beijing, China) and then bi-directionally sequenced using ABI3730xl DNA Analyzer (Applied Biosystems, Foster City, CA, USA). To further confirm the position and sequence of the insertions and deletions, the purified PCR products were cloned into the pClone007 vector with a pClone 007 Vector Kit (TsingKe Biological Technology, Beijing, China). Positive clones including target indels were sequenced to search potential indels. The BLAST software (https://blast.ncbi.nlm.nih.gov/Blast.cgi) and Chromas 2.0 (Technelysium, Australia) were applied for sequence alignment to the reference sequence of the corresponding gene referring to Bos_taurus_UMD_3.1 assembly. Finally, genotyping for the identified indels in 769 chinese cows was performed by using the Sequenom MassArray matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS).

Bioinformatics analysis

To further explore the potential impact of the exonic indels in FCGR2B and CENPE on the mRNA secondary structures as well as the second structures of corresponding proteins, the online RNA FOLDING FORM (version 2.3) software [52] and SPOMA program (http://npsa-pbil.ibcp.fr/) [53] were used, respectively.

Association analysis

Allele frequencies and genotype frequencies between the insertion and deletion genotypes, as well as the Hardy-Weinberg equilibrium were determined through a chi-square test. Associations between the 13 investigated indels and the five milk production traits were carried out by applying the mixed procedure in SAS 9.2 [54] based on the following linear mixed regression model: Where Y is the phenotypic record for the analyzed trait of the cows, μ is the overall mean of the phenotypic record for each trait, hys is a fixed effect of herd, year and season, b is linear regression coefficient on calving month (M), M is effect of calving month, G is a fixed effect of indel genotype or haplotype, a is a random polygenic effect account for all known pedigree relationships, and e is a random residual. Also, we estimated the additive (a), dominance (d) and allele substitution (α) effects using the equation of Falconer & Mackay [55]: where AA, BB and AB were the least square means of the phenotypic values for corresponding genotypes, and p and q indicates the allele frequencies of the corresponding alleles. Multiple t-tests with Bonferroni correction were used to compare the effects of the genotypes on each indel. The linkage disequilibrium (LD) extent among the genotyped indels (five indels in CENPE gene and two indels in NLK gene) and haplotype blocks was estimated using Haploview 4.2 (Broad Institute of MIT and Harvard, Cambridge, MA, USA). Table S1. Primers used for pooled DNA sequencing for the 24 indels. (XLSX 42 kb) Results of sanger and clone sequencing of the thirteen indels. (DOCX 332 kb)
  53 in total

1.  The Dominant white, Dun and Smoky color variants in chicken are associated with insertion/deletion polymorphisms in the PMEL17 gene.

Authors:  Susanne Kerje; Preety Sharma; Ulrika Gunnarsson; Hyun Kim; Sonchita Bagchi; Robert Fredriksson; Karin Schütz; Per Jensen; Gunnar von Heijne; Ron Okimoto; Leif Andersson
Journal:  Genetics       Date:  2004-11       Impact factor: 4.562

2.  Using RNA-Seq SNP data to reveal potential causal mutations related to pig production traits and RNA editing.

Authors:  A M Martínez-Montes; A Fernández; D Pérez-Montarelo; E Alves; R M Benítez; Y Nuñez; C Óvilo; N Ibañez-Escriche; J M Folch; A I Fernández
Journal:  Anim Genet       Date:  2016-09-18       Impact factor: 3.169

3.  Rapidly cleaving bovine two-cell embryos have better developmental potential and a distinctive mRNA pattern.

Authors:  Ernesto Orozco-Lucero; Isabelle Dufort; Claude Robert; Marc-André Sirard
Journal:  Mol Reprod Dev       Date:  2013-11-27       Impact factor: 2.609

4.  An insertion mutation of the bovine Fii gene is responsible for factor XI deficiency in Japanese black cattle.

Authors:  Masaki Kunieda; Takehito Tsuji; Abdol Rahim Abbasi; Maryam Khalaj; Miho Ikeda; Keiko Miyadera; Hiroyuki Ogawa; Tetsuo Kunieda
Journal:  Mamm Genome       Date:  2005-05       Impact factor: 2.957

5.  Effect of polymorphisms in the FASN, OLR1, PPARGC1A, PRL and STAT5A genes on bovine milk-fat composition.

Authors:  A Schennink; H Bovenhuis; K M Léon-Kloosterziel; J A M van Arendonk; M H P W Visker
Journal:  Anim Genet       Date:  2009-08-31       Impact factor: 3.169

6.  Mutational analysis of MIR184 in sporadic keratoconus and myopia.

Authors:  Judith Lechner; Ha Ae Bae; Jasenka Guduric-Fuchs; Aine Rice; Gowthaman Govindarajan; Salina Siddiqui; Layal Abi Farraj; Shea Ping Yip; Maurice Yap; Manoranjan Das; Emmanuelle Souzeau; Doug Coster; Richard A Mills; Richard Lindsay; Tony Phillips; Paul Mitchell; Manir Ali; Chris F Inglehearn; Periasamy Sundaresan; Jamie E Craig; David A Simpson; Kathryn P Burdon; Colin E Willoughby
Journal:  Invest Ophthalmol Vis Sci       Date:  2013-08-05       Impact factor: 4.799

7.  Gene networks driving bovine mammary protein synthesis during the lactation cycle.

Authors:  Massimo Bionaz; Juan J Loor
Journal:  Bioinform Biol Insights       Date:  2011-05-04

8.  Genome wide association study identifies 20 novel promising genes associated with milk fatty acid traits in Chinese Holstein.

Authors:  Cong Li; Dongxiao Sun; Shengli Zhang; Sheng Wang; Xiaoping Wu; Qin Zhang; Lin Liu; Yanhua Li; Lv Qiao
Journal:  PLoS One       Date:  2014-05-23       Impact factor: 3.240

9.  Whole-genome sequencing of Berkshire (European native pig) provides insights into its origin and domestication.

Authors:  Mingzhou Li; Shilin Tian; Carol K L Yeung; Xuehong Meng; Qianzi Tang; Lili Niu; Xun Wang; Long Jin; Jideng Ma; Keren Long; Chaowei Zhou; Yinchuan Cao; Li Zhu; Lin Bai; Guoqing Tang; Yiren Gu; An'an Jiang; Xuewei Li; Ruiqiang Li
Journal:  Sci Rep       Date:  2014-04-14       Impact factor: 4.379

10.  Association Analysis of a Microsatellite Repeat in the TRIB1 Gene With Prostate Cancer Risk, Aggressiveness and Survival.

Authors:  Leire Moya; John Lai; Andrea Hoffman; Srilakshmi Srinivasan; Janaththani Panchadsaram; Suzanne Chambers; Judith A Clements; Jyotsna Batra
Journal:  Front Genet       Date:  2018-10-04       Impact factor: 4.599

View more
  3 in total

1.  Weighted Single-Step Genome-Wide Association Study Uncovers Known and Novel Candidate Genomic Regions for Milk Production Traits and Somatic Cell Score in Valle del Belice Dairy Sheep.

Authors:  Hossein Mohammadi; Amir Hossein Khaltabadi Farahani; Mohammad Hossein Moradi; Salvatore Mastrangelo; Rosalia Di Gerlando; Maria Teresa Sardina; Maria Luisa Scatassa; Baldassare Portolano; Marco Tolone
Journal:  Animals (Basel)       Date:  2022-04-29       Impact factor: 3.231

2.  Sequence-based genome-wide association study of individual milk mid-infrared wavenumbers in mixed-breed dairy cattle.

Authors:  Kathryn M Tiplady; Thomas J Lopdell; Edwardo Reynolds; Richard G Sherlock; Michael Keehan; Thomas Jj Johnson; Jennie E Pryce; Stephen R Davis; Richard J Spelman; Bevin L Harris; Dorian J Garrick; Mathew D Littlejohn
Journal:  Genet Sel Evol       Date:  2021-07-20       Impact factor: 4.297

3.  Genome-Wide Association Study Candidate Genes on Mammary System-Related Teat-Shape Conformation Traits in Chinese Holstein Cattle.

Authors:  Mudasir Nazar; Xubin Lu; Ismail Mohamed Abdalla; Numan Ullah; Yongliang Fan; Zhi Chen; Abdelaziz Adam Idriss Arbab; Yongjiang Mao; Zhangping Yang
Journal:  Genes (Basel)       Date:  2021-12-19       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.