Literature DB >> 30704154

Genetic Analyses Confirm SNPs in HSPA8 and ERBB2 are Associated with Milk Protein Concentration in Chinese Holstein Cattle.

Cong Li1, Miao Wang2, Wentao Cai3, Shuli Liu4, Chenghao Zhou5, Hongwei Yin6, Dongxiao Sun7, Shengli Zhang8.   

Abstract

Heat shock 70 kDa protein 8 (HSPA8) and erb-b2 receptor tyrosine kinase 2 (ERBB2) were the promising candidates for milk protein concentration in dairy cattle revealed through previous RNA sequencing (RNA-Seq) study. The objective of this post-RNA-Seq study was to confirm genetic effects of HSPA8 and ERBB2 on milk protein concentration in a large Chinese Holstein population and to evaluate the genetic effects of both genes on other milk production traits. There were 2 singlenucleotide polymorphisms (SNPs) identified for HSPA8 and 11 SNPs for ERBB2 by sequencing 17 unrelated Chinese Holstein sires. The SNP-rs136632043 in HSPA8 had significant associations with all five milk production traits (p = 0.0086 to p < 0.0001), whereas SNP-rs132976221 was remarkably associated with three yield traits (p < 0.0001). Nine (ss1996900615, rs109017161, rs109122971, ss1996900614, rs110133654, rs109941438, rs110552983, rs133031530, and rs109763505) of 11 SNPs in ERBB2 were significantly associated with milk protein percentage (p = 0.0177 to p < 0.0001). A 12 Kb haplotype block was formed in ERBB2 and haplotype associations revealed similar effects on milk protein traits. Our findings confirmed the significant genetic effects of HSPA8 and ERBB2 on milk protein concentration and other milk production traits and SNP phenotypic variances above 1% may serve as genetic markers in dairy cattle breeding programs.

Entities:  

Keywords:  association analyses; candidate genes; dairy cows; haplotypes; milk protein traits

Mesh:

Substances:

Year:  2019        PMID: 30704154      PMCID: PMC6409942          DOI: 10.3390/genes10020104

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Introduction

Molecular selective breeding strategies have been widely applied in animal breeding to improve the important economic traits of livestock. Identification of key genes or causal variations for economic traits is prerequisite to perform molecular selective breeding strategies [1,2]. Milk protein concentration is an important index to evaluate the nutritional value in cow’s milk. However, limited key genes or variations for milk protein concentration were found in dairy cattle [3,4,5,6,7,8,9]. Our initial RNA sequencing (RNA-Seq) study revealed that heat shock 70 kDa protein 8 (HSPA8) and erb-b2 receptor tyrosine kinase 2 (ERBB2) were candidate genes affecting milk protein traits in dairy cows [10]. It was observed that HSPA8 (Log2fold change = −1.13, q-value = 2.91 × 10−2) and ERBB2 (Log2fold change = 1.00, q-value = 5.25 × 10−3) were significant differentially expressed in bovine mammary tissues of cows in high and low milk protein percentage comparisons [10]. HSPA8 is located in BTA15 with a total length of 4426 bp, containing 9 exons and 8 introns, and encoding 650 amino acids. HSPA8 is an important gene in the MAPK pathway [11], which has a positive effect on protein synthesis by increasing the stability of mRNA through phosphorylation of the AU-rich element-binding protein [12]. HSPA8 is highly expressed in the lactating mammary gland [10,13], which has a role in regulating protein folding and processing in the endoplasmic reticulum [14], likely in support of active milk protein synthesis [13]. ERBB2 encodes a receptor of tyrosine kinases [15] and is located in BTA19 with a total length of 24,682 bp, containing 27 exons and 26 introns, and encoding 1255 amino acids. ERBB2 could activate PI3K signaling pathway by directly binding of PI3K regulatory subunit p85 to phosphorylated tyrosine residues, which is known to regulate milk protein synthesis [16]. Most importantly, HSPA8 is known to be downstream of ERBB2 in human [17,18], indicating the probably regulatory effect of ERBB2 to HSPA8 and working together in specific biological processes. Genetic analyses between selected genes and bovine milk production traits can provide valuable molecular information for the genetic improvement program for milk quality of dairy cattle. Therefore, the aims of this study were to identify genetic polymorphisms in HSPA8 and ERBB2, to explore the linkages among single-nucleotide polymorphisms (SNPs), and to conduct the genetic effects analyses in a large Chinese Holstein population.

2. Materials and Methods

2.1. Animal Population and Phenotypic Data

A total of 1027 Chinese Holstein cows from 17 sire families were used to construct the study population. Family size ranges from 25 to 187 daughters with an average of 60 daughters per sire (Table S1). Cows were selected from 17 dairy farms in the Beijing Sanyuan Lvhe Dairy Farm Center, where regular and standard performance testing (Dairy Herd Improvement, DHI) have been implemented since 1999. Five milk production traits (305 d milk yield, 305 d protein yield, 305 d fat yield, average 305 d protein percentage, and average 305 d fat percentage) were collected from individual animal via the complete DHI data and were used for the subsequent analyses.

2.2. Genomic DNA Extraction

Blood samples were collected from 1027 Chinese Holstein cows via coccygeal vein and stored at −20 °C. The tubule frozen semen samples of 17 sires were collected from Beijing Bull Station. Genomic DNA was extracted from blood samples with a TIANamp Blood DNA kit (TIANGEN Biotech, Beijing, China) and from the semen samples using a standard phenol-chloroform procedure [19]. The quantity and quality of isolated DNA were confirmed before further analysis.

2.3. SNP Identification and Genotyping

A DNA pool was constructed from aforementioned 17 Holstein bulls (50 ng/uL of each individual) to identify potential SNPs that were involved in HSPA8 and ERBB2 genes. A total of 13 and 26 pairs of primers (Table S2) were designed to amplify all exons and their partial flanking intronic sequences based on the reference sequences of the bovine HSPA8 (NCBI Reference Sequence: AC_000172.1) and ERBB2 (NCBI Reference Sequence: AC_000176.1) referring to Bos_taurus_UMD_3.1 assembly using Primer3 web Program v.0.4.0 (http://primer3.ut.ee), respectively. The polymerase chain reaction (PCR) was performed to amplify the pooled DNA from 17 sires with a final reaction volume of 25 μL, comprising of 50 ng genomic DNA, 0.5 μL of each primer, 2.5 μL 10× PCR buffer, 2.5 mM each of dNTP, and 1 U of Taq DNA polymerase (Takara Biotechnology Co., Ltd., Dalian, China). The PCR protocol was 5 min at 94 °C for initial denaturing, followed by 34 cycles at 94 °C for 30 s, 56 °C for 30 s, 72 °C for 30 s, and a final extension at 72 °C for 7 min. The amplification products were visualized by gel electrophoresis on 2% agarose gels, followed by photography under UV light. After that, 40 μL of each PCR product from the pooled DNA was bi-directionally sequenced using the ABI3730XL (Applied Biosystems, Foster City, CA, USA), and the sequences were aligned to the bovine reference sequences (UMD3.1) using BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) to identify potential SNPs. The subsequent genotyping analysis of the 1027 Chinese Holstein cows were performed with matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS, Squenom MassARRAY, Bioyong Technologies Inc. HK) assay.

2.4. Linkage Disequilibrium Analysis

Pair-wise linkage disequilibrium (LD) was measured for each pair of SNPs genotyped within the HSPA8 and ERBB2 genes based on the criterion of D prime (D′) using Haploview [20]. Genotypes were firstly imputed for each individual using the Beagle3.2 software program [21]. Briefly, an iterative algorithm was applied for fitting a haplotype Hidden Markov Model (HMM) to genotype data that alternated between model building and sampling. In the model-building step, current estimates of phased haplotypes are used for building a new haplotype HMM, in the sampling step, new haplotypes are sampled for each individual conditional upon the genotype data and current haplotype HMM. Estimated phased haplotypes for the initial iteration are obtained by imputing missing genotypes at random according to allele frequencies and randomly phasing heterozygous genotypes. Accordingly, haplotype blocks where SNPs are in high LD (D′ > 0.90) were determined based on confidence intervals methods [22]. A haplotype with a frequency >5% was considered as a distinguishable haplotype, while the haplotypes with relative frequency <5% were pooled into a single group. Haplotype blocks within relative SNPs were applied to subsequent analyses to detect their associations with phenotypes.

2.5. Association and Haplotype Analyses

Hardy–Weinberg equilibrium test was conducted on each identified SNP. Chi-square was used to compare the number of expected and observed genotypes with a significance level of 0.05. The genetic effects of each candidate SNP or haplotype on five milk production traits were analyzed with the mixed procedure of SAS (SAS Institute Inc., Cary, NC, USA) with the following statistical model: where, was the phenotypic value of each trait of cows (n = 1027 for each trait); μ was the overall mean; was the fixed effect of farm; was the fixed effect of year-season; was the fixed effect of parity; M was the covariate effect of calving month; b was the regression coefficient of M; was the fixed effect corresponding to the genotype of polymorphisms or haplotype; was the random polygenic effect, distributed as N (0, Aσa2), with the additive genetic relationship matrix A and the additive genetic variance σa2; and was the random residual, distributed as N (0, Iσe2), with identity matrix I and residual error variance σe2. The overall reliability of the whole model was symbolized as ‘Goodness of Fit’, and was calculated by as following formula, where, was the overall reliability; was sum of squares of variables; was total sum of squares; was residual sum of squares; was the residuals; was phenotypes of traits; was mean values of phenotypes of traits. The results showed that the of milk yield, milk protein percentage, milk fat percentage, milk protein yield and milk fat yield are 0.55, 0.50, 0.66, 0.73, 0.76, respectively, suggests that the model provide a good fit. The differences among the effects of single SNPs or haplotypes on each trait were compared with Bonferroni correction. The significant level of the multiple tests was equal to the raw P value divided by number of tests. The additive (a), dominance (d) and allele substitution (α) effects were estimated according to the equation proposed by Falconer & Mackay [23], i.e., , and , where AA and BB represent the two homozygous genotypes, AB is heterozygous genotype, and p and q are the allele frequencies of corresponding loci.

2.6. Phenotypic Variance

The proportion of phenotypic variance of the trait explained by a SNP was symbolized to show the effect of a SNP on a specific trait. The calculation formula is: where is the allele frequency of SNP, is the average effect of gene substitution calculated by the linear mixed model, and is the estimate of the phenotypic variance using the complete DHI data of Chinese dairy cattle population.

2.7. Ethics Approval and Consent to Participate

All protocols for collection of the blood and frozen semen samples of experimental individuals were approved by the Institutional Animal Care and Use Committee (IACUC) at China Agricultural University (Permit Number: DK996). We obtained written agreements from the cattle owners to use the samples and data.

3. Results

3.1. SNPs Identification

Two SNPs of rs136632043 and rs132976221 were identified for HSPA8 gene, with one located in the 3′ regulatory region (3′-UTR) and the other located in the intron (Table 1). A total of 11 SNPs was discovered for ERBB2 gene. Among these identified SNPs, eight SNPs (ss1996900615, rs109122971, ss1996900614, rs110133654, rs109941438, rs110552983, rs133031530 and rs109763505) were found within introns, one (rs133724008) was in the 5’ regulatory region (5′-UTR), and two SNPs (rs110735562 and rs109017161) were synonymous substitutions located in exons. All 13 identified SNPs of HSPA8 and ERBB2 genes were in Hardy–Weinberg equilibrium (p > 0.05, Table 2).
Table 1

Information for the identified single-nucleotide polymorphisms (SNPs) in HSPA8 and ERBB2 genes.

CHRRefSNPSNP LocusAllelesLocationPositionGene
15rs132976221g.1585A>CA/CIntron-334219120 HSPA8
15rs136632043g.4218T>GT/G3′-UTR34216487 HSPA8
19rs133724008g.873T>CT/C5′-UTR40720963 ERBB2
19ss1996900615g.20982delTC/.Intron-1940742818 ERBB2
19rs110735562g.21561A>GA/GExon-2140743397 ERBB2
19rs109017161g.22268T>CT/CExon-2340744104 ERBB2
19rs109122971g.23650T>CT/CIntron-2640745486 ERBB2
19ss1996900614g.19414A>GA/GIntron-1440741250 ERBB2
19rs110133654g.10727A>GA/GIntron-740732563 ERBB2
19rs109941438g.11680C>TC/TIntron-840733516 ERBB2
19rs110552983g.16431C>GC/GIntron-1440738267 ERBB2
19rs133031530g.22346A>TA/TIntron-2340744182 ERBB2
19rs109763505g.22400A>GA/GIntron-2340744236 ERBB2
Table 2

Genotypic and allelic frequencies and Hardy–Weinberg equilibrium test of SNPs of HSPA8 and ERBB2 genes in Chinese Holstein cattle.

GenePositionLocusGenotypesNFrequencyAlleleFrequencyHardy–Weinberg Equilibrium χ2 Test
HSPA8 Intron-3rs132976221 g.1585A>CCA4320.428A0.702p > 0.05
AA4930.488C0.298
CC850.084
HSPA8 3′ prime UTR (exon9)rs136632043 g.4218T>GGT2770.274G0.175p > 0.05
GG380.038T0.825
TT6960.688
ERBB2 5′ flanking regionrs133724008 g.873T>CCT4120.411C0.324 p > 0.05
CC1190.119T0.676
TT4710.470
ERBB2 Intron-19ss1996900615 g.20982delDEL.TC4710.469DEL0.604p > 0.05
DEL3720.370TC0.396
TC1620.161
ERBB2 Exon-21rs110735562 g.21561A>GAG3620.358A0.243p > 0.05
AA650.064G0.757
GG5850.578
ERBB2 Exon-23rs109017161 g.22268T>CCT4580.458C0.607p > 0.05
CC3780.378T0.393
TT1640.164
ERBB2 Intron-26rs109122971 g.23650T>CCT4730.473C0.568p > 0.05
CC3320.332T0.432
TT1960.196
ERBB2 Intron-14ss1996900614 g.19414A>GAG4700.463A0.394p > 0.05
AA1650.162G0.606
GG3810.375
ERBB2 Intron-7rs110133654 g.10727A>GAG4700.468A0.604p > 0.05
AA3720.370G0.396
GG1630.162
ERBB2 Intron-8rs109941438 g.11680C>TCT4520.455C0.605p > 0.05
CC3750.378T0.395
TT1660.167
ERBB2 Intron-14rs110552983 g.16431C>GCG4720.464C0.393p > 0.05
CC1640.161G0.607
GG3810.375
ERBB2 Intron-23rs133031530 g.22346A>TAT4450.451A0.390p > 0.05
AA1620.164T0.610
TT3790.384
ERBB2 Intron-23rs109763505 g.22400A>GAG4830.478A0.432p > 0.05
AA1950.193G0.568
GG3320.329

3.2. Single Locus-Based Association Analyses

SNP-rs136632043 was highly associated with all five milk production traits (p = 0.0086 to p < 0.0001; Table 3). SNP-rs132976221 also showed strong associations with three yield traits (milk yield, fat yield, and protein yield, p < 0.0001). There were six significant pairs of SNP-trait explaining phenotypic variations with greater than 1%, a range from 1.70 to 5.13% (Table 3). In addition, SNPs in HSPA8 also showed the corresponding significant additive, substitution, or dominant effects on target traits (Table S3).
Table 3

Associations of identified SNPs of HSPA8 and ERBB2 genes with milk production traits in Chinese Holstein cattle (Least square mean ± Standard error, LSM ± SE).

LocusGenotypesMilk YieldFat YieldFat PercentageProtein YieldProtein Percentage
HSPA8rs132976221 g.1585A>CAA(493)10307 ± 61.79 A361.58 ± 2.58 A3.572 ± 0.025323.23 ± 1.88 A3.169 ± 0.009
AC(432)10639 ± 62.63 B373.69 ± 2.62 B3.580 ± 0.025333.02 ± 1.91 B3.164 ± 0.009
CC(85)10465 ± 100.55 AB367.58 ± 4.24 AB3.576 ± 0.041330.38 ± 3.09 B3.180 ± 0.014
p-value <0.0001 <0.0001 0.9436 <0.0001 0.4773
Variance 4.76 × 10−2 3.74 × 10−2 1.48 × 10−4 5.13 × 10−2 8.83 × 10−5
HSPA8rs136632043 g.4218T>GGG(38)10632 ± 140.30 A403.60 ± 5.93 A3.797 ± 0.057 A338.32 ± 4.32 A3.192 ± 0.020 A
GT(277)10339 ± 67.43 B363.93 ± 2.82 B3.586 ± 0.027 B325.44 ± 2.06 B3.188 ± 0.010 A
TT(696)10532 ± 59.56 A364.61 ± 2.48 B3.536 ± 0.024 B328.70 ± 1.81 B3.154 ± 0.008 B
p-value 0.0022 <0.0001 <0.0001 0.0086 <0.0001
Variance1.28 × 10−3 4.16 × 10−2 3.25 × 10−2 1.46 × 10−3 1.70 × 10−2
ERBB2rs133724008 g.873T>CCC(119)10311 ± 90.86 a361.80 ± 3.81 a3.574 ± 0.037324.73 ± 2.78 A3.195 ± 0.013 A
CT(412)10469 ± 62.32 ab365.97 ± 2.60 ab3.542 ± 0.025327.50 ± 1.89 A3.150 ± 0.009 B
TT(471)10551 ± 62.29 b370.49 ± 2.60 b3.595 ± 0.025331.81 ± 1.89 B3.178 ± 0.009 A
p-value 0.0178 0.0290 0.0666 0.0069 <0.0001
Variance 1.49 × 10−2 9.02 × 10−39.40 × 10−59.89 × 10−3 1.78 × 10−2
ERBB2ss 1996900615 g.20982delDEL(372)10453 ± 63.86366.24 ± 2.663.594 ± 0.026327.94 ± 1.943.178 ± 0.009 a
DEL.TC(471)10478 ± 61.61367.51 ± 2.573.561 ± 0.025328.49 ± 1.873.158 ± 0.009 b
TC(162)10449 ± 83.25366.96 ± 3.493.561 ± 0.033328.21 ± 2.543.181 ± 0.012 a
p-value0.87560.87150.31570.9526 0.0140
Variance5.87 × 10−51.58 × 10−59.68 × 10−42.45 × 10−61.60 × 10−3
ERBB2rs110735562 g.21561A>GAA(65)10305 ± 115.13 A364.26 ± 4.86 AB3.592 ± 0.046323.00 ± 3.54 A3.180 ± 0.016
AG(362)10566 ± 66.63 B372.41 ± 2.78 A3.577 ± 0.027331.66 ± 2.02 B3.171 ± 0.009
GG(585)10410 ± 58.40 A363.35 ± 2.43 B3.565 ± 0.023325.84 ± 1.77 A3.165 ± 0.008
p-value 0.0060 0.0006 0.7432 0.0010 0.4977
Variance 1.79 × 10−2 6.48 × 10−39.48 × 10−4 2.05 × 10−2 2.37 × 10−3
ERBB2rs109017161 g.22268T>CCC(164)10465 ± 63.62367.53 ± 2.663.599 ± 0.026328.83 ± 1.943.180 ± 0.009 A
CT(458)10465 ± 61.90367.96 ± 2.593.569 ± 0.025327.77 ± 1.893.157 ± 0.009 B
TT(378)10422 ± 83.31363.93 ± 3.503.549 ± 0.034326.39 ± 2.553.178 ± 0.012 AB
p-value0.83820.43940.21870.5892 0.0062
Variance6.05 × 10−42.78 × 10−33.24 × 10−31.58 × 10−35.71 × 10−4
ERBB2rs109122971 g.23650T>CCC(332)10457 ± 66.19367.99 ± 2.773.608 ± 0.027328.41 ± 2.023.179 ± 0.009 A
CT(473)10488 ± 61.60368.74 ± 2.573.563 ± 0.025328.80 ± 1.873.157 ± 0.009 B
TT(196)10377 ± 77.66363.34 ± 3.263.564 ± 0.031325.82 ± 2.373.185 ± 0.011 A
p-value0.28430.18350.13680.3740 0.0038
Variance2.32 × 10−34.14 × 10−32.09 × 10−32.41 × 10−31.86 × 10−3
ERBB2ss 1996900614 g.19414A>GAA(165)10381 ± 82.85363.42 ± 3.483.565 ± 0.033325.46 ± 2.543.181 ± 0.012 A
AG(470)10502 ± 61.53368.89 ± 2.573.563 ± 0.025329.13 ± 1.873.157 ± 0.009 B
GG(381)10476 ± 63.60368.21 ± 2.663.598 ± 0.026329.29 ± 1.943.178 ± 0.009 A
p-value0.27450.21790.27960.2308 0.0078
Variance3.64 × 10−34.98 × 10−38.93 × 10−45.28 × 10−31.75 × 10−3
ERBB2rs110133654 g.10727A>GAA(372)10501 ± 64.00369.19 ± 2.683.592 ± 0.026330.40 ± 1.953.178 ± 0.009 A
AG(470)10490 ± 61.54366.60 ± 2.563.548 ± 0.025328.63 ± 1.873.155 ± 0.009 B
GG(163)10416 ± 83.71364.56 ± 3.523.549 ± 0.034327.67 ± 2.563.180 ± 0.012 A
p-value0.53600.32950.12910.4442 0.0049
Variance2.20 × 10−32.73 × 10−31.57 × 10−31.67 × 10−31.62 × 10−3
ERBB2rs109941438 g.11680C>TCC(375)10464 ± 63.85367.66 ± 2.673.593 ± 0.026329.19 ± 1.943.181 ± 0.009 A
CT(452)10482 ± 61.95367.23 ± 2.583.553 ± 0.025328.40 ± 1.883.157 ± 0.009 B
TT(166)10364 ± 82.72360.55 ± 3.473.543 ± 0.033324.45 ± 2.533.183 ± 0.012 A
p-value0.28840.07240.13410.1331 0.0034
Variance3.78 × 10−39.51 × 10−32.76 × 10−37.37 × 10−31.57 × 10−3
ERBB2rs110552983 g.16431C>GCC(164)10440 ± 82.87366.09 ± 3.483.559 ± 0.033327.50 ± 2.533.177 ± 0.012 AB
CG(472)10521 ± 61.62368.52 ± 2.583.551 ± 0.025330.20 ± 1.883.156 ± 0.009 A
GG(381)10511 ± 63.46370.25 ± 2.653.598 ± 0.026331.32 ± 1.933.179 ± 0.009 B
p-value0.54630.44210.10450.2855 0.0080
Variance1.90 × 10−32.49 × 10−31.07 × 10−34.35 × 10−35.06 × 10−4
ERBB2rs133031530 g.22346A>TAA(162)10420 ± 84.37363.01 ± 3.543.539 ± 0.034327.77 ± 2.583.182 ± 0.012 A
AT(445)10452 ± 64.28364.58 ± 2.683.544 ± 0.026327.28 ± 1.953.155 ± 0.009 B
TT(379)10457 ± 65.19365.70 ± 2.723.587 ± 0.026329.02 ± 1.983.181 ± 0.009 A
p-value0.88890.70970.11970.6089 0.0015
Variance4.20 × 10−41.04 × 10−32.21 × 10−31.45 × 10−41.60 × 10−3
ERBB2rs109763505 g.22400A>GAA(195)10476 ± 77.76368.62 ± 3.263.567 ± 0.031330.12 ± 2.383.182 ± 0.011 a
AG(483)10515 ± 61.66369.25 ± 2.573.563 ± 0.025330.07 ± 1.873.160 ± 0.009 b
GG(332)10495 ± 66.45369.09 ± 2.783.606 ± 0.027329.97 ± 2.033.180 ± 0.009 a
p-value0.84600.97760.17470.9973 0.0177
Variance1.65 × 10−44.54 × 10−51.50 × 10−35.58 × 10−66.97 × 10−4

Note: p-value refers to the results of association analysis between each SNP and milk production traits. Different letter (small letters: p < 0.05; capital letters: p < 0.01) superscripts (adjusted value after correction for multiple testing) indicate significant differences among the genotypes.

Nine SNPs (ss1996900615, rs109017161, rs109122971, ss1996900614, rs110133654, rs109941438, rs110552983, rs133031530, and rs109763505) in ERBB2 were significantly associated with milk protein percentage (p = 0.0177 to p < 0.0001; Table 3). SNPs of rs110735562 and rs133724008 had significant associations with three yield traits (milk yield, fat yield, and protein yield, p = 0.0290 to p < 0.0001), whereas SNP of rs133724008 was also significantly associated with milk protein percentage (p < 0.0001; Table 3). Phenotypic variations explained by the 11 SNPs in ERBB2 with greater than 1% were existed in four significant pairs of SNP-trait, ranging from 1.49 to 2.05% (Table 3). Nine SNPs (ss1996900615, rs109017161, rs109122971, ss1996900614, rs110133654, rs109941438, rs110552983, rs133031530, and rs109763505) had significant dominant effects on milk protein percentages (Table S3). SNP-rs110735562 had significant dominant effects on three yield traits. SNP-rs133724008 showed significant additive and substitution effects on three milk yield traits and significant dominant effects on milk protein percentage (Table S3).

3.3. LD and Haplotypes Analyses

One SNP (ss1996900615) identified for ERBB2 gene was insertion/deletion (InDel), whereas the remaining 10 SNPs were used to perform LD analysis. Pair-wise D′ measures showed that nine SNPs in ERBB2 were highly linked (D′ = 0.99~1.00) and one 12 Kb haplotype block comprising these nine SNPs were inferred (Figure 1), in which three main haplotypes were formed. The frequencies of the haplotypes ACGGGCTGC, GTCAATAAT, and GTCAGTAAT were 56.57%, 24.15%, and 15.04%, respectively (Table 4). Subsequently, haplotype-based analysis showed significant association of the haplotypes with all four milk production traits, except the milk yield (p = 0.0130 to p < 0.0001, Table 5). No LD was observed between the two identified SNPs for HSPA8 gene.
Figure 1

The haplotype blocks and pairwise linkage disequilibrium (LD) values (D′) for the ten SNPs in ERBB2. Legends: The values within boxes are pair wise SNP correlation (D′), bright red boxes without numbers indicate complete LD (D′ = 1). The brighter shade of red indicates higher LD.

Table 4

Main haplotypes and their frequencies observed in ERBB2 gene.

ERBB2 HaplotypesSNP1 G > ASNP2 T > CSNP3 C > GSNP4 A > GSNP5 A > GSNP6 T > CSNP7 A > TSNP8 A > GSNP9 T > CFrequency (%)
ACGGGCTGCACGGGCTGC56.57
GTCAATAATGTCAATAAT24.15
GTCAGTAATGTCAGTAAT15.04

Note: The Ref number of each SNP can be found in the haplotype Figure 1. SNP1 = rs110133654, SNP2 = rs109941438, SNP3 = rs110552983, SNP4 = ss1996900614, SNP5 = rs110735562, SNP6 = rs109017161, SNP7 = rs133031530, SNP8 = rs109763505, SNP9 = rs109122971.

Table 5

Haplotype associations of the nine SNPs in ERBB2 with milk production traits in Chinese Holstein cattle (LSM ± SE).

ERBB2 HaplotypesMilk YieldFat YieldFat PercentageProtein YieldProtein Percentage
H1H1(337)10464 ± 70.84366.88 ± 2.97 AC3.605 ± 0.029 a327.87 ± 2.16 A3.178 ± 0.010 A
H1H2(272)10536 ± 76.34371.71 ± 3.19 A3.603 ± 0.031 a330.23 ± 2.33 A3.174 ± 0.011 A
H1H3(169)10348 ± 82.03355.82 ± 3.45 B3.507 ± 0.033 b320.29 ± 2.51 B3.130 ± 0.011 B
H2H2(65)10296 ± 117.42362.97 ± 4.96 AB3.593 ± 0.047 ab322.42 ± 3.61 AB3.183 ± 0.016 A
H2H3(82)10416 ± 108.62357.93 ± 4.57 BC3.521 ± 0.044 ab323.81 ± 3.33 AB3.183 ± 0.015 A
H3H3(19)10122 ± 203.08360.35 ± 8.60 AB3.612 ± 0.082 ab319.76 ± 6.27 AB3.180 ± 0.028 AB
p-value0.0538<0.00010.01300.00090.0001

Note: p-value refers to the results of association analysis between each haplotype and milk production traits. Different letter (small letters: p < 0.05; capital letters: p < 0.01) superscripts (adjusted value after correction for multiple testing) indicate significant differences among the haplotypes. H1 = ACGGGCTGC, H2 = GTCAATAAT, H3 = GTCAGTAAT.

4. Discussion

The results of the present study confirmed the significant genetic effects of HSPA8 and ERBB2 genes on milk protein traits in a large population of dairy cattle. The two candidate genes also had remarkable impacts on other milk production traits, which may provide new insights into the enhancement of milk profiles via selection strategies. Two SNPs, rs110735562 and rs109017161 in exonic region of ERBB2 showed significant associations with milk protein traits are synonymous variations, which could modify mRNA stability and further affect protein expression [24,25]. The SNPs have affect the promoter activity and gene expression [26], and a previous study reported that two SNPs (rs209535817 and rs210440016) in the 5′-UTR region of bovine SAA2 (serum amyloid A2) gene impact the phenotype through altering the promoter activity [27]. Hence, it suggested that the genetic effect of SNP rs133724008 identified in the 5′-UTR of the ERBB2 gene on milk production traits was likely due to the impacts on its transcription. Generally, mature miRNAs are bound to mRNA ribosomal complex and regulate the expression of target genes by complementary recognition of the 3′-UTR of the mRNA [28,29]. Thus, whether there are relative miRNAs binding to the 3′-UTR (Position: Chr15, 34216278-34216505) of HSPA8 including identified SNPs were predicted by RNAhybrid software [30]. The results showed that miR-301a was targeted to the 3′-UTR including SNP rs136632043 (Position: Chr15, 34216487, Table S4), which indicated the significant associations of SNP rs136632043 in 3′-UTR of HSPA8 with milk production traits probably resulted from miR-301a or unknown potential regulatory mechanism. Although an intron does not hold a sequence for coding protein, an important function of SNPs in introns in altering gene transcriptional level has been elucidated [31,32]. Additionally, the significant associations between SNPs in introns with milk protein traits are also likely due to their LD with true causative variation. As a group of highly conserved and widely expressed proteins, heat-shock proteins (HSPs) play important physiological functions [33]. For example, HSPA8 has been validated as an evolutionarily conserved protein in swine and bovine [34]. HSPA8 is highly involved in many biological processes, including proteasomal degradation [35], catalyzing protein folding and clathrin uncoating [36], and other protein networks involved in protein catabolism, protein homeostasis, ubiquitination, carbohydrate metabolism and cell cycle control [37]. ERBB2 is a member of a family of transmembrane receptor tyrosine kinases involved in the regulation of cellular processes by modulation of several pathways, such as mTOR, MAPK, and PI3K/AKT pathways [38,39,40]. In consistent with the present study, the functional role of ERBB2 on milk production traits has been also identified as positional candidate gene for lactation persistency in Canadian Holstein cattle [41]. Therefore, the results of the current study and previously published research indicate that HSPA8 and ERBB2 are promising candidate genes that have strong genetic effects on milk production traits. In addition, both single SNP-based and haplotype-based association analyses also suggest that the novel SNPs in these two genes may be used as potential genetic markers for genetic improvement in dairy breeding schemes. Generally, a small proportion with less than 1% of the phenotypic variance was explained by polymorphisms underlying complex traits in livestock animals [42]. In the present study, six and four significant pairs of SNP-trait explaining phenotypic variations with greater than 1% were found in HSPA8 and ERBB2, respectively. These results suggest that the subset of these large-effect SNPs (rs132976221, rs136632043, rs133724008, and rs110735562) could be used as potential genetic markers for further marker-assisted selection (MAS) in milk production traits, especially for milk protein traits. All identified SNPs in HSPA8 and ERBB2 genes could be incorporated into the SNP panels for genomic selection of dairy cattle breeding schemes and could be used to improve frequencies of the genetic markers that are positively related to milk production traits of interest.
  41 in total

1.  The structure of haplotype blocks in the human genome.

Authors:  Stacey B Gabriel; Stephen F Schaffner; Huy Nguyen; Jamie M Moore; Jessica Roy; Brendan Blumenstiel; John Higgins; Matthew DeFelice; Amy Lochner; Maura Faggart; Shau Neen Liu-Cordero; Charles Rotimi; Adebowale Adeyemo; Richard Cooper; Ryk Ward; Eric S Lander; Mark J Daly; David Altshuler
Journal:  Science       Date:  2002-05-23       Impact factor: 47.728

2.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

3.  Genome-wide association analysis and pathways enrichment for lactation persistency in Canadian Holstein cattle.

Authors:  D N Do; N Bissonnette; P Lacasse; F Miglior; M Sargolzaei; X Zhao; E M Ibeagha-Awemu
Journal:  J Dairy Sci       Date:  2017-01-11       Impact factor: 4.034

4.  Effect of casein genes - beta-LGB, DGAT1, GH, and LHR - on milk production and milk composition traits in crossbred Holsteins.

Authors:  A Molee; C Poompramun; P Mernkrathoke
Journal:  Genet Mol Res       Date:  2015-03-30

5.  The protein-protein interaction network and clinical significance of heat-shock proteins in esophageal squamous cell carcinoma.

Authors:  Hong Sun; Xinyi Cai; Haofeng Zhou; Xiaoqi Li; Zepeng Du; Haiying Zou; Jianyi Wu; Lei Xie; Yinwei Cheng; Wenming Xie; Xiaomei Lu; Liyan Xu; Longqi Chen; Enmin Li; Bingli Wu
Journal:  Amino Acids       Date:  2018-04-27       Impact factor: 3.520

6.  Whole-genome association study for milk protein composition in dairy cattle.

Authors:  G C B Schopen; M H P W Visker; P D Koks; E Mullaart; J A M van Arendonk; H Bovenhuis
Journal:  J Dairy Sci       Date:  2011-06       Impact factor: 4.034

7.  A whole genome scan to map QTL for milk production traits and somatic cell score in Canadian Holstein bulls.

Authors:  D Kolbehdari; Z Wang; J R Grant; B Murdoch; A Prasad; Z Xiu; E Marques; P Stothard; S S Moore
Journal:  J Anim Breed Genet       Date:  2009-06       Impact factor: 2.380

8.  An intronic SNP in a RUNX1 binding site of SLC22A4, encoding an organic cation transporter, is associated with rheumatoid arthritis.

Authors:  Shinya Tokuhiro; Ryo Yamada; Xiaotian Chang; Akari Suzuki; Yuta Kochi; Tetsuji Sawada; Masakatsu Suzuki; Miyuki Nagasaki; Masahiko Ohtsuki; Mitsuru Ono; Hidehiko Furukawa; Masakazu Nagashima; Shinichi Yoshino; Akihiko Mabuchi; Akihiro Sekine; Susumu Saito; Atsushi Takahashi; Tatsuhiko Tsunoda; Yusuke Nakamura; Kazuhiko Yamamoto
Journal:  Nat Genet       Date:  2003-11-09       Impact factor: 38.330

Review 9.  Key stages in mammary gland development. Secretory activation in the mammary gland: it's not just about milk protein synthesis!

Authors:  Steven M Anderson; Michael C Rudolph; James L McManaman; Margaret C Neville
Journal:  Breast Cancer Res       Date:  2007       Impact factor: 6.466

10.  RNA-Seq reveals 10 novel promising candidate genes affecting milk protein concentration in the Chinese Holstein population.

Authors:  Cong Li; Wentao Cai; Chenghao Zhou; Hongwei Yin; Ziqi Zhang; Juan J Loor; Dongxiao Sun; Qin Zhang; Jianfeng Liu; Shengli Zhang
Journal:  Sci Rep       Date:  2016-06-02       Impact factor: 4.379

View more
  1 in total

1.  SERPINA1 gene identified in RNA-Seq showed strong association with milk protein concentration in Chinese Holstein cows.

Authors:  Cong Li; Wentao Cai; Shuli Liu; Chenghao Zhou; Hongwei Yin; Dongxiao Sun; Shengli Zhang
Journal:  PeerJ       Date:  2020-02-24       Impact factor: 2.984

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.