Literature DB >> 36268046

Genome-wide association analysis of milk production, somatic cell score, and body conformation traits in Holstein cows.

Peng Wang1, Xue Li2,3, Yihao Zhu1, Jiani Wei4, Chaoxin Zhang2,3, Qingfang Kong1, Xu Nie1, Qi Zhang5, Zhipeng Wang2,3.   

Abstract

Milk production and body conformation traits are critical economic traits for dairy cows. To understand the basic genetic structure for those traits, a genome wide association study was performed on milk yield, milk fat yield, milk fat percentage, milk protein yield, milk protein percentage, somatic cell score, body form composite index, daily capacity composite index, feed, and leg conformation traits, based on the Illumina Bovine HD100k BeadChip. A total of 57, 12 and 26 SNPs were found to be related to the milk production, somatic cell score and body conformation traits in the Holstein cattle. Genes with pleiotropic effect were also found in this study. Seven significant SNPs were associated with multi-traits and were located on the PLEC, PLEKHA5, TONSL, PTGER4, and LCORL genes. In addition, some important candidate genes, like GPAT3, CEBPB, AGO2, SLC37A1, and FNDC3B, were found to participate in fat metabolism or mammary gland development. These results can be used as candidate genes for milk production, somatic cell score, and body conformation traits of Holstein cows, and are helpful for further gene function analysis to improve milk production and quality.
Copyright © 2022 Wang, Li, Zhu, Wei, Zhang, Kong, Nie, Zhang and Wang.

Entities:  

Keywords:  Holstein cattle; body conformation traits; genome-wide association study; milk production traits; pleiotropic effect

Year:  2022        PMID: 36268046      PMCID: PMC9578681          DOI: 10.3389/fvets.2022.932034

Source DB:  PubMed          Journal:  Front Vet Sci        ISSN: 2297-1769


Introduction

Milk is a source of nutrients essential for human growth and development. The milk production traits are important for the dairy industry. Body conformation traits have been applied in several countries with the development of dairy cattle breeding since they are closely related to the health (1), productivity (2), lifetime (3), and calving ease (4) of cows. Some studies have identified the genetic correlation between body conformation traits and first lactation milk yield to be between 0.48 and 0.54 (5). These correlations are therefore very important for the dairy industry to improve the milk production traits and body conformation traits. The rapid development of sequencing technology has revealed the cause variants of complex traits using genome-wide association analysis (GWAS). A study by Schennink et al. (6) has revealed DGAT1 and SCD1 to be highly associated with the composition of milk-fat (long-chain fatty acid). Kiser et al. (7) verified the TFAP2A gene to be related to the production of colostrum in Jersey cattle. It reported the genes CDH2 and GABRG2 to be related to the milk fat percentage and milk protein traits, respectively, in dual-purpose Xinjiang brown cattle (8). Bouwman et al. (9) and Vanvanhossou et al. (10) have reported the VEPH1 gene to be associated with conformation. However, the identified genes have not explained all genetic variances. There is a need to continue the search for novel genes related to some quantitative traits. This study conducted GWAS using the Illumina Bovine HD100k(100k) BeadChip, for identifying important candidate genes or variants related to milk production, somatic cell score, and body conformation traits. There was an expectation for discovering novel genetic variations or candidate genes.

Materials and methods

Animal population

This experiment involved 1,313 cows from 7 different pastures in Heilongjiang Province. The use and care of the animals in this study were approved by the Animal Care Advisory Committee, Northeast Agricultural University (Harbin, China), and all the experimental procedures were according to the university's guidelines for animal research.

Genotypes data

The samples were collected from the tail roots near the hips of the cows. The DNA in the hair was extracted and genotyped using Illumina Bovine HD100k BeadChip, containing 95,256 SNPs. The markers with minor allele frequencies <0.05 and call rates <0.90 were filtered out and individuals with a call rate of 0.80 or greater were selected. These SNPs were distributed across 29 chromosomes.

Population stratification

The SNP genotypes of these individuals were used to estimate the population stratification based on principal component analysis (PCA), and Plink (version 1.9) (11) was used to analyze a total of 1,310 cows with 86,645 markers covering the whole genome to study the population structure (12). The software uses the default matrix construction method to construct G matrix and get the PCA results. We used R language (version 4.1.2)—ggplot 2 package to draw pictures. The PCA scatterplots (Figure 1) illustrate a clear population structure for the 1,310 individuals in the seven pastures cattle herds that comprised our study population.
Figure 1

Population structure demonstrated by principal component analysis.

Population structure demonstrated by principal component analysis. LD decay of cow.

Genome-wide association analysis

Combination with dairy herd improvement data of National Holstein cows in China, this study estimated the genomic estimated breeding values (GEBVs) of all animal milk production traits, somatic cell score, and body conformation traits, using single-step genomic best linear unbiased prediction (ssGBLUP). The ssGBLUP was developed to integrate all the information including genotypes, phenotypes, and pedigree information in one step, and each SNP effect was calculated using the FarmCPU method (13) based on the predicted GEBVs. The ssGBLUP method is an improvement of BLUP, in which the pedigree relationship matrix a−1 matrix must be replaced by H−1 (14). The specific model is as follows: Where y was each phenotypic value vector; b is the fixed effect of the field and the PCA effect to explain the population stratification, and u is a vector of animal effects. The e was a vector of random residual effects with e~N(0,I), and X, Z were incidence matrices for b and u, respectively. Where, the A matrix is pedigree relationship matrix, A22 is a numerator relationship matrix for genotyped animals, and G is a genomic relationship matrix (15). G−1 was obtained as the inverse of a combination of the G matrix and the corresponding A matrix. The w is the weight of A22 in the matrix, the default value is 0.05. The τ and ω are 1. We use DMU software to calculate the GEBV value. Both G and H matrices were derived using software default parameter setting by DMU software. G was calulated as: Where p is the allele frequency at locus i in all genotyped animals, is a normalizing constant (16) that sums expected variances across markers scaling G toward the A matrix (17), D is weight for each locus(I if same variance assumed), W is a design matrix as follows): Each SNP effect was calculated using the FarmCPU method (13) based on the predicted GEBVs. The FarmCPU method (13) in this study can be written as two models. The y is the GEBV value. The pseudoQTN is significant marker from previous loops that is null when the model begins. SNPi is testing marker in each loop. The K is the kinship between each individuals. The e is residual vector. For each trait, the threshold P-value for genome-wide significance was 5.99 × 10−7= 0.05/83446 using the Bonferroni multiple test method.

QTLs annotation analysis

The cattle QTL data were downloaded from the Cattle QTL database (https://www.animalgenome.org/cgi-bin/QTLdb/BT/index) referred to as the ARS-UCD1.2 assembly. The square of the correlation coefficient (r2) between the two loci is used to evaluate the range of LD measurement, because r2 is considered to be more robust and not affected by changes in allele frequency and population size (18). Haploview software was used to calculate the genotype correlation coefficient (r2) between all SNP pairs in the cow population to estimate the LD of the whole genome, and the LD decay map with distance of the cow population was visualized.

Results

The phenomenon of group stratification is an important research problem in the study of group association (19). In order to determine the population stratification level, we drew the population structure by principal component analysis (PCA). The PCA scatterplots shows the population structure of a 1,300 individual composed of seven pastures (Figure 1). Different colors represent different pastures. It can be seen that it is mainly divided into three clusters, but most of the cows in the seven pastures are gathered together, and only a few cows are separated. These clusters indicate that, although individuals may come from different ranches, they still retain close genetic relationships.

The genome-wide association study

Basic descriptive statistics of milk production traits, somatic cell score and body conformation traits (see Table 1). A total of 86,645 SNPs were retained after quality control for the GWAS (Table 2). The average physical distance between the adjacent SNP markers was approximately 29.58 kb, ranging between 26.37 kb (BTA19) and 32.02 kb (BTA8).
Table 1

Descriptive statistics of milk procuction trants and body conformation traits.

Statistic MY (kg) FP (%) PP (%) SCS BFCI FTLEG DCCI
Mean8382.993.853.314.0285.6885.4685.91
Standard Deviation1950.680.500.261.444.884.167.68
Minimum1505.0022.171.0065.2565.8056.18
Maximum15983.006.205.009.0098.3699.0099.95
Coefficient of Variation0.230.130.080.360.060.050.09
Table 2

Distribution of SNPs after quality control.

BTA Length (Mb) No. SNP (Chip data)No. SNP (after QC)Density length/SNP(kb)
1158.535556518830.56
2136.234688436731.20
3121.014508415829.10
4120.004049376031.92
5120.094523408329.41
6117.814364397729.62
7110.683903355131.17
8113.323805353932.02
9105.453695346930.40
10103.313626337630.60
11106.983801352230.38
1287.223044284230.69
1383.473064282229.58
1482.403045279629.47
1585.013119288529.47
1681.012826258631.33
1773.172668250629.20
1865.822605238927.55
1963.452726240626.37
2071.972737249828.81
2169.862573237429.43
2260.772201203829.82
2352.502110195126.91
2462.322259208129.95
2542.351726158926.65
2651.991823170830.44
2745.611699162428.09
2845.941735163028.18
2951.101871173129.52
Total2489.37903498344629.58
Descriptive statistics of milk procuction trants and body conformation traits. Distribution of SNPs after quality control. The p-value profiles of all the SNP markers associated with each trait are represented in Figures 3, 4 and included the Manhattan and Quantile-Quantile plots. In total, 95 genome-wide significant SNPs were detected for the milk production traits, such as milk yield (MY), milk fat yield (FY), milk fat percentage (FP), milk protein yield (PY), milk protein percentage (PP), somatic cell score (SCS), and body conformation traits (body form composite index, BFCI; daily capacity composite index, DCCI; feed and leg conformation, FTLEG). There were 57, 12 and 26 SNPs related to milk production, somatic cell score and body conformation traits, respectively. Among them, we mainly focused on the first few significant SNPs in each trait. In addition, we also found seven SNPs that overlap with multiple traits, such as PLEC is related to MY, FP and PP, PLEKHA5 is related to FP and FY, TONSL is connected with FY and SCS, LCORL is correlated with DCCI and FTLEG, PYGB is related to BFCI and FTLEG, and PTGER4 is related to BFCI, and DCCI (see Table 3).
Figure 3

Manhattan plots and Quantile-Quantile plots for the milk production and somatic cell score traits. MY (A,B), FY (C,D), FP (E,F), PP (G,H), PY (I,J) and SCS (K,L).

Figure 4

Manhattan plots and Quantile-Quantile plots for the body conformation traits. BFCI (A,B), DCCI (C,D) and FTLEG (E,F).

Table 3

The SNPs and candidate genes with pleiotropic effect in this study.

SNP name Traits Gene Distance(kb)* Gene full name Gene function
BovineHD1400000287MY, FP, PPPLECIntronPlectinRelated to the MY, FP, and PP traits in Chinese Holsteins (20).
BovineHD0500025853FP, FYPLEKHA5IntronPleckstrin homology domain containing, family A member 5Significantly associated with FP (21).
BovineHD1400011649MY, PYHNF4GIntronHepatocyte nuclear factor 4 gammaAssociated with childhood obesity (22). Key regulators of beef cattle carcass IMF (23).
BovineHD1400000206FY, SCSTONSL1.65 (U)Tonsoku like, DNA repair proteinRelated to milk yield (24, 25) and affect the gamma–linolenic acid, long–chain saturated fatty acids and milk fat percent of the Canadian Holstein cows (26).
BovineHD4100004660DCCI, FTLEGLCORL665.01 (D)Ligand dependent nuclear receptor corepressor likeAffect human height (27), pig body length (28), horse height (29), chicken carcass weight (30), and the growth and development of cattle (31). Associated with the human skeletal frame size (32).
BovineHD1300012605BFCI, FTLEGPYGB0.61 (D)Glycogen phosphorylase BInhibition of glycogen utilization (33)
BTA−50244–no–rsBFCI, DCCIPTGER4541.63 (D)Prostaglandin E receptor 4Relaxation to the smooth muscle (34), leading to the phosphorylation of glycogen synthase kinase−3 (35), involving in osteoporosis (36), and regulating lipid droplet size and mitochondrial activity in the white adipose tissue (37, 38).

*U, Upstream; D, Downstream.

Manhattan plots and Quantile-Quantile plots for the milk production and somatic cell score traits. MY (A,B), FY (C,D), FP (E,F), PP (G,H), PY (I,J) and SCS (K,L). Manhattan plots and Quantile-Quantile plots for the body conformation traits. BFCI (A,B), DCCI (C,D) and FTLEG (E,F). The SNPs and candidate genes with pleiotropic effect in this study. *U, Upstream; D, Downstream. As shown in Tables 4, 5, 12, 11, 15, 11, 12, and 17 genome-wide significant SNPs were detected for MY, FY, FP, PY, PP, and SCS, respectively. These significant SNPs are mainly distributed in BTA 1, BTA 2, BTA 5, BTA 6, BTA 11, BTA 14, and BTA 20, with as many as 9 SNPs on BTA 14.
Table 4

Genome-wide significant SNPs are associated with milk production traits.

Traits SNP name BTA Postion (Mb) MAF Nearest gene Distance (kb)* P–value SNP effect
MYBTB−00088434233.860.0599KCNH739.78 (U)5.87E−08−295.1002
MYHapmap40999–BTA−47831262.780.4824TMEM1630.81 (D)2.01E−07−131.0532
MY5–82810184–C–T–rs110495697582.400.4156ARNTL2Intron1.30E−13206.5174
MYBovineHD0600027996698.510.2000GPAT3144.85(D)5.23E−09−188.5885
MYBovineHD0800021118869.700.2790SLC39A14Intron5.53E−07−145.3961
MYBovineHD1400000287140.880.2240PLECIntron2.71E−07163.9574
MYBovineHD14000116491438.570.1725HNF4GIntron4.36E−07−154.3554
MYBTA−07375–no–rs1466.160.3824ERICH5Intron8.97E−10194.9580
MYBovineHD15000144071549.220.3053OR51L40.17(U)1.23E−07−155.6556
MYBovineHD17000129681745.450.2130SFSWAP90.47(D)5.35E−07−153.0319
MYHapmap55097–rs290109521826.880.4260GOT2436.24(D)1.95E−07136.2285
MYBovineHD2800000275281.650.2076URB245.14(D)2.75E−07−164.7354
FYHapmap24838–BTA−143176562.150.4137TMPO560.66(U)3.37E−083.4575
FYBovineHD0500025853590.660.3145PLEKHA5Intron1.91E−073.4453
FYARS–BFGL–NGS−10921714.970.3893PDE4AIntron1.80E−073.3773
FYBovineHD0700020203767.040.3893SGCD536.03(U)6.02E−09−3.6651
FYARS–BFGL–NGS−753501149.900.1893TCF7L119.24(D)2.79E−07−3.8169
FYARS–BFGL–NGS−297371247.060.1004CXCL1411.84(U)5.06E−096.6175
FYBovineHD1400000206140.490.2172TONSL1.65(U)2.76E−176.3376
FYARS–BFGL–NGS−55227145.690.4179KHDRBS3711.31(U)6.01E−093.7919
FYARS–BFGL–NGS−1152332058.830.4576TRIOIntron2.29E−093.6663
FYBovineHD23000116332340.580.1126ZNF496Intron7.11E−09−5.8266
FYARS–BFGL–NGS−95089278.940.2202AGA238.44(D)4.30E−07−3.7871
FPBTB−00035766181.920.3565EHHADHIntron3.93E−070.0443
FPHapmap40546–BTA−486222104.360.4168MARCHF4Intron7.92E−08−0.0460
FPBovineHD0400011775442.790.4855OR2AV1410.61(U)4.24E−07−0.0432
FPBovineHD0500025853590.660.3145PLEKHA5Intron8.76E−110.0609
FPHapmap35196–BES10_Contig207_566690.840.0679SDAD1Intron1.73E−070.0917
FPBovineHD0800011917839.970.0832SLC1A1Intron5.36E−070.0705
FPARS–BFGL–NGS−15823928.530.3385PKIBIntron2.70E−110.0611
FPBovineHD11000156761153.860.42291.28E−09−0.0479
FPBovineHD1400000287140.880.2240PLECIntron6.48E−370.1423
FPUA–IFASA−7269143.100.2313AGO2Intron2.01E−15−0.1058
FPBovineHD15000154381552.730.1485P2RY20.64(D)4.08E−08−0.0687
FPBovineHD15000175631560.500.4202KCNA4230.62(U)5.11E−070.04461
FPARS–BFGL–BAC−279302029.360.2080DDX6Intron8.92E−11−0.0795
FPBTA−50420–no–rs2036.050.2107EGFLAMIntron1.32E−11−0.0872
FPBTB−012630102042.720.1565CDH6363.33(D)2.61E−080.0769
PPBovineHD0100013692148.120.24208.70E−080.0222
PPBovineHD01000416071142.820.3676SLC37A1Intron9.80E−110.0252
PPARS–BFGL–NGS−80635231.590.4523COBLL16.23(D)1.02E−070.0182
PPARS–BFGL–NGS−117881582.230.3859C5H12orf7121.60(U)1.11E−13−0.0318
PPBovineHD0600008707629.630.3897BMPR1B36.41(D)1.62E−08−0.0220
PPARS–BFGL–BAC−157341348.970.3725BMP2179.62(U)5.84E−08−0.0210
PPBovineHD1400000287140.880.2240PLECIntron1.27E−160.0362
PPBovineHD14000137241446.180.4492EXT1Intron3.55E−080.0211
PPchr14_572506921455.090.1481NUDCD1Intron3.79E−08−0.0283
PPARS–BFGL–NGS−219211914.400.2817CCL1459.58(U)1.97E−07−0.0197
PPBovineHD20000093612032.690.2844OXCT1Intron5.35E−08−0.0294
PPBovineHD25000044792515.710.1275XYLT1Intron1.78E−100.0365
PYBovineHD0100027261195.180.3279FNDC3BIntron1.20E−113.3681
PYBovineHD0300017107356.640.3195LMO454.14(U)1.26E−07−2.5817
PYHapmap26317–BTC−059618680.530.1557EPHA5306.96(U)4.69E−08−3.2564
PYARS–BFGL–NGS−497411106.560.1069ZMYND195.22(D)4.52E−09−4.2183
PYBovineHD14000116491438.570.1725CRISPLD1226.89(D)4.93E−083.2889
PYBovineHD17000164491755.890.4740CCDC60Intron1.62E−07−2.2566
PYARS–USMARC–Parent–EF034086–no–rs2637.900.4607EMX263.30(D)4.29E−092.6483

*U, Upstream; D, Downstream.

Table 5

Genome–wide significant SNPs are associated with somatic cell score.

Traits SNP name BTA Postion (Mb) MAF Nearest gene Distance(kb) * P value SNP effect
SCSHapmap59481–rs29019616156.940.1893GCSAMIntron1.59E−10−0.1547
SCSBovineHD02000331552113.940.2847NYAP290.45(D)1.98E−080.1198
SCSBovineHD0600020300671.350.3580CEP1356.10(U)1.94E−10−0.1304
SCSBovineHD11000115471139.190.2080CCDC85A118.86(D)3.15E−080.1376
SCSBovineHD13000192521367.150.2282KIAA175521.69(D)1.73E−09−0.1423
SCSBovineHD1400000206140.490.2172TONSL1.65(U)4.03E−07−0.1159
SCSBovineHD14000115081438.010.2939PI15147.13(U)3.10E−09−0.1307
SCSBovineHD16000132291647.050.3996ACOT7Intron7.12E−080.1065
SCSBovineHD16000157831655.270.0657SERPINC1Intron2.54E−08−0.2208
SCSBTA−65815–no–rs1659.730.2267RASAL2Intron2.31E−10−0.1528
SCSUA–IFASA−53051959.210.1271SOX9289.75(D)3.61E−09−0.1751
SCSBovineHD20000173152061.610.4405CTNND211.32(U)1.18E−070.1036
SCSBTA−52343–no–rs2142.730.1042AKAP6Intron4.65E−09−0.1536
SCSHapmap46118–BTA−1082522219.450.4435GRM7Intron3.00E−08−0.1121
SCSARS–BFGL–NGS−245192510.590.1378GSPT10.93(D)2.31E−070.1394
SCSARS–BFGL–NGS−371892532.400.07786RCC1L267.69(U)3.28E−080.1948
SCSHapmap42542–BTA−407762627.930.2504SORCS120.95(D)5.60E−100.1395

*U, Upstream; D, Downstream.

Genome-wide significant SNPs are associated with milk production traits. *U, Upstream; D, Downstream. Genome–wide significant SNPs are associated with somatic cell score. *U, Upstream; D, Downstream. In addition, this study reported an interesting phenomenon where four SNPs were found to be related to multi-traits, including BovineHD0500025853 (BTA 5:90.66 Mb), BovineHD1400000206 (BTA 14:0.49 Mb), BovineHD1400000287 (BTA 14:0.88 Mb), and BovineHD1400011649 (BTA 14:38.57 Mb) (see Table 6). The bovinehd1400000287 SNP located in the 58th intron of the PLEC gene was found to be associated with MY, FP, and PP. The fat yield and the somatic cell score trait shared one SNP bovinehd1400000206 located 1.46 kb away from TONSL on BTA 14.
Table 6

Genome–wide significant SNPs are associated with body conformation traits.

Traits SNP name BTA Position (Mb) MAF Nearest gene Distance (kb)* P-value SNP effect
BFCIARS–BFGL–NGS−39319831.330.3836MPDZ122.75(D)4.59E−08−1.3217
BFCIBovineHD10000155741052.010.3450AQP975.39(U)3.07E−091.5722
BFCIBovineHD12000088031229.840.1481HSPH119.83(U)3.72E−081.9433
BFCIBovineHD13000126051342.810.4622PYGB0.61(D)8.06E−081.2185
BFCIARS–BFGL–NGS−662521650.240.0805MMEL131.03(U)2.79E−082.4780
BFCIBovineHD16000231011677.360.4538ATP6V1G347.95(U)9.44E−081.1905
BFCIBovineHD17000056231719.110.4050SLC7A11307.12(U)2.82E−081.4169
BFCIBovineHD19000150241953.080.3546RBFOX3Intron4.39E−071.2520
BFCIBTA−50244–no–rs2034.300.3710PTGER4541.63(D)5.84E−13−1.9115
BFCIBovineHD2200000513221.990.1302EOMES123.32(U)2.09E−071.8111
DCCIBovineHD0300021562373.790.4351NEGR1Intron1.03E−07−1.2304
DCCIBTB−00190417459.090.3496DNAJB9493.74(U)4.93E−111.6662
DCCIBovineHD4100004660638.220.4271LCORL665.01(D)2.39E−09−1.4509
DCCIARS–BFGL–BAC−150231231.340.4103MTUS2Intron6.74E−081.2696
DCCIBTB−005970651541.000.3527GALNT1864.26(U)9.91E−101.5227
DCCIBTA−50244–no–rs2034.300.3710PTGER4541.63(D)6.11E−08−1.2173
DCCIARS–BFGL–NGS−977472328.020.3840CDSN4.46(U)1.77E−091.4608
FTLEGBovineHD0100020157169.850.0962SNX437.98(U)2.04E−07−2.4836
FTLEGARS–BFGL–NGS−565841145.090.1309POFUT2Intron7.56E−082.0359
FTLEGBovineHD0300019080363.660.1248ADGRL2511.07(D)1.06E−082.5317
FTLEGBTB−01326707638.000.2737LCORL665.01(D)3.16E−11−2.0018
FTLEGBTB−00124923934.940.1851FRK243.36(D)3.42E−071.7253
FTLEGBovineHD13000126051342.810.4622PYGB0.61(D)2.23E−091.6065
FTLEGHapmap50322–BTA−340171378.200.1309CEBPB7.25(U)8.11E−08−2.2401
FTLEGBovineHD1600000840163.120.1191KLHDC8A11.96(D)3.74E−072.1807
FTLEGBovineHD16000083811628.910.1683TMEM63A1.14(D)7.79E−09−2.1564
FTLEGBovineHD20000118112041.040.3221SUB126.79(U)4.00E−11−1.9453
FTLEGBTA−14388–rs290231512223.200.4561IL5RAIntron8.59E−101.6383

*U, Upstream; D, Downstream.

Genome–wide significant SNPs are associated with body conformation traits. *U, Upstream; D, Downstream. This study detected 10, 7, and 11 significant SNPs related to BFCI, DCCI, and FTLEG, respectively. There were 4 SNPs distributed on BTA 16. Three SNPs were found to be possibly as pleiotropism SNPs, including BovineHD4100004660 (BTA 6:38.22 Mb), BovineHD1300012605 (BTA 13:42.81 Mb) and BTA-50244-no-rs (BTA 20:34.30 Mb), respectively. Of these significant SNPs, the BTA-50244-no-rs SNP related to BFCI (P = 5.84E-13) was located downstream of the PTGER4 gene.

QTL annotation analysis

The LD of cows decreases with the increase of distance, when the distance is extended to 200 Kb, the decline rate of LD of cows tends to be gentle, and the average r2 value of cows is 0.3 at this time (show as Figure 2). The 100 Kb range of SNP upstream and downstream of significant trait association obtained from genome-wide association analysis is compared with the data that has been verified in the current cattle QTL database. Our significant SNPs associated with MY, FY, FP, PP and SCS overlapped with 1332, 1177, 3042, 1288, 24 QTLs, respectively. But there are also very few QTLs about body conformation traits overlapped with significant SNPs.
Figure 2

LD decay of cow.

Discussion

Comparison with the other GWAS studies

In this study, FarmCPU was applied for screening the QTLs related to the milk production traits, health traits, and body conformation traits. A total of 95 significant SNPs were detected, located on the 93 candidate genes. Of these genes, EHHADH, SLC37A1, PLEKHA5, TONSL, PLEC, and IL5RA were reportedly related to milk production traits in other studies (15, 20, 21, 24, 39). However, this study did not detect some important candidate genes, such as DGAT1. Because in this study, the closest SNP on both flanks of DGAT1 are BovineHD1400000206 (109.2 kb) and ARS-BFGL-NGS-55227 (50.8 Mb), respectively. Of these, BovineHD1400000206 associated with fat yield (P value = 2.76E−17). But the nearest gene on this significantly SNP is the TONSL gene (1.65 kb), which is a neighboring gene to DGAT1. So, the DGAT1 gene was not detected in this study. The study by Ning et al. (40) used two models and a 70k SNP chip based on the Chinese Holsteins population and identified the DGAT1 gene to be related to milk (40). Kim et al. (41) also obtained DGAT1 affecting MY and FY in the Korean cattle population (41). Cole et al. (42) identified the PHKA2 gene to be highly significant for four body size traits (stature, strength, body depth, rump width) (42). The 770k BeadChip was used by An et al. (43) to identify five candidate genes (CSMD3, LAP3, SYN3, FAM19A5, and TIMP3) related to the body conformation traits. This study did not detect the above genes to be associated with body conformation traits. These inconsistencies might be due to differences in the detection platforms or algorithms used in the corresponding analysis, changes in the genetic background of the analyzed cattle, differences in the size and structure of the study population, or random or technical errors in some analyses. This also indicated that there are many important genetic markers or candidate genes in the bovine genome that are yet to be discovered.

Genetic analysis of pleiotropic genes

Organisms have hundreds of thousands of genes and tens of thousands of phenotypes. The relationship between genes and epigenetic factors is complex. There are various associations such as pleiotropism, multigenic effect, polygene effect and so on. Pleiotropy is defined as the phenomenon where a single locus affects two or more distinct phenotypic traits (44, 45). It is common in nature. For example, the DGAT1 gene is related to milk yield (40) and fat yield (26, 41). The genes PIK3R6 and PIK3R1 showed direct functional associations with height and body size (10). Production and health constitute fundamental dairy functions while body conformation traits are related to the functionality of the cow's body. So, the milk production traits and body conformation traits of dairy cows tend to complement each other. Certain identified regions related to conformation traits overlap with the performance traits such as reproduction (46), and milk production (47). Some genes in these regions were also involved in regulating the cell cycle or cell division, homeostasis, and lipid metabolism (10). This study also reported this interesting phenomenon where the PLEC, PLEKHA5, and TONSL genes were found to belong to the pleiotropism gene for milk traits, and the LCORL, and PTGER4 were pleiotropic genes for the body conformation traits. The PLEC gene (Plectin) can interlink different elements of the cytoskeleton. The PLEC gene was found to be associated with multiple traits, like MY, FP, and PP. Dan Wang et al. (20) also detected PLEC to have potential effects on the MY, FP, and PP traits, which could be useful for molecular breeding for milk production in Chinese Holsteins. The PLEKHA5 gene, located on BTA 5, was predicted to enable the activity of binding phosphatidylinositol phosphate (48). Jiang et al. (21) showed the PLEKHA5 gene to be significantly associated with FP using two different methods using 294,079 Holstein cows. The TONSL protein was considered to be an NF-κ negative regulator of B mediated transcription. Peters et al. (24), Nayeri et al. (25), and Atashi et al. (49) found this gene to be related to milk yield and the TONSL gene was found to reportedly affect the gamma-linolenic acid, long-chain saturated fatty acids and milk fat percent of the Canadian Holstein cows (26). Interesting, the TONSL gene is a neighboring gene to DGAT1 (flanking <200 kb), associated with the fat percentage of milk (26). Some studies on the LCORL gene showed it to affect human height (27), pig body length (28), horse height (29), chicken carcass weight (30), and the growth and development of cattle (31). This gene might have been a novel loci associated with the human skeletal frame size (32). PTGER4 encodes a protein that is one of the members of the G-protein coupled receptor family, which imparts relaxation to the smooth muscle (34), leading to the phosphorylation of glycogen synthase kinase-3 (35), involved in osteoporosis (36), and regulating lipid droplet size and mitochondrial activity in the white adipose tissue (37, 38).

Important candidate genes related to the fat metabolism or mammary gland development

Fatty acids are essential components of milk with known positive associations with human cardiovascular diseases and so on. This study identified genes such as GPAT3, ARNTL2, EHHADH, CEBPB, DNAJB9, ZNF496, AGO2, GALNT18, and NEGR1 as critical for obesity traits or adipose metabolism (see Table 7).
Table 7

Important candidate genes related to the fat metabolism or mammary gland development.

Gene name Location (BTA:Start–End, Mb) Full name Gene function
GPAT36:98.29–98.36Glycerol−3–phosphate acyltransferase 3Highly expressed in the adipose tissue with an important role in adipogenesis (50). Can be regulated by folic acid for controlling lactation and metabolic function of the dairy cows (51). Involved in fat and lipid metabolism in the Yunling cattle (52).
ARNTL25:82.47–82.55Aryl hydrocarbon receptor nuclear translocator like 2Influencing Mexican–Mestizo childhood obesity (53).
EHHADH1:81.88–81.93Enoyl–CoA hyd ratase and 3–hydroxyacyl CoA dehydrogenaseInvolved in fatty acid oxidation is essential for producing medium–chain dicarboxylic acids (54). Impact on the characteristics of milk fatty acid traits in Chinese Holstein (55). A pivotal gene in the fat–related pathway (56).
CEBPB13:78.20–78.21CCAAT enhancer binding protein betaInvolved in regulating the expression of fatty acid synthase in dairy cow mammary epithelial cells and milk fat synthesis (57).
DNAJB94:59.58–59.59DnaJ heat shock protein family (Hsp40) member B9The prognostic biomarkers of breast cancer (58). Correlated with the abdominal fat weight (59).
ZNF4967:40.57–40.61Zinc finger protein 496Associated with milk fat and fertility (60).
AGO214:3.06–3.14Argonaute RISC catalytic component 2Related to mitochondrial oxidation and obesity–associated pathophysiology (61).
GALNT1815:41.06–41.42Polypeptide N–acetylgalactosaminyltransferase 18Associated with milk protein and fat traits (62).
NEGR13:72.81–73.84Neuronal growth regulator 1Associated with obesity and BMI (body mass index) (6365).
SLC37A11:142.81–142.87Solute carrier family 37 member 1Over–expressed in the bovine mammary tissue (66). Increases milk yield, decreases phosphorus concentration (66).
FNDC3B1:95.12–95.41Fibronectin type III domain containing 3BBiomarker for the bovine mammary stem/progenitor cells, and Essential for the growth and maintenance of the mammary epithelium (67).

U, Upstream; D, Downstream.

Important candidate genes related to the fat metabolism or mammary gland development. U, Upstream; D, Downstream. GPAT3 is highly expressed in the adipose tissue with an important role in adipogenesis (50). This gene can be regulated by folic acid for controlling lactation and metabolic function of the dairy cows (51) and is also involved in fat and lipid metabolism in the Yunling cattle (52). EHHADH involved in fatty acid oxidation is essential for producing medium-chain dicarboxylic acids (54). Hence, this gene has a key impact on the characteristics of milk fatty acid traits in Chinese Holstein (55). In porcine adipogenesis, EHHADH has been proposed to be a pivotal gene in the fat-related pathway (56). The DNAJB9 gene is reportedly one of the prognostic biomarkers of breast cancer (58). Interestingly, DNAJB9 and DNAJB6 are members of the DNAJ gene family, with sequence similarity. The expression level of DNAJB6 in the chicken abdominal adipose tissue was significantly negatively correlated with the abdominal fat weight (59). ZNF496 is reportedly associated with milk concentration (milk fat) and fertility (60). According to Gao et al. (62), the GALNT18 gene was associated with milk protein and fat traits. According to the known gene functions, some candidate genes were expressed in the mammary gland, such as the SLC37A1, and FNDC3B genes (see Table 7). SLC37A1, over-expressed in the bovine mammary tissue relative to the 17 other tissue types (66) transports glucose-6-phosphate in one direction and phosphorus in the other (68). Glucose is known to be essential for lactose synthesis in mammary cells. Kemper et al. (66) identified the causative mutation increasing the expression of SLC37A1 leading to an increase in milk yield and decreasing the phosphorus concentration.

QTLs result overlapped with GWAS

Although many quantitative trait loci (QTLs) related to economically important traits in dairy cows have been identified, due to insufficient sample size and insufficient marker density used in QTL mapping research in history, not all genetic variations of these traits have been captured (69), in the study, we used GWAS to analyze the milk production traits, body conformation traits and somatic cells of dairy cows, and most of the results were also verified in the QTL analysis of dairy cows. Interestingly, our study found many SNP related to pleiotropy, but no repeated QTL regions were found in the QTL analysis (70). Also found the same phenomenon in the study of multiple traits of beef cattle. With these results, we can get some inspiration in verifying QTLs of some characteristics of interest shared among varieties (71).

Conclusions

A total of 95 significant SNPs were identified to be related to the milk production, somatic cell score, and body conformation traits in Holstein cattle. Among them, 7 significant SNPs located on the PLEC, PLEKHA5, TONSL, PTGER4, and LCORL genes showed pleiotropic effects on milk production or body conformation traits. In addition, some important candidate genes, including GPAT3, CEBPB, AGO2, SLC37A1, and FNDC3B, were also found to be related to the fat metabolism or involved in mammary gland development. The above genes however need to be consolidated as new potential genes through future validation.

Data availability statement

The original contributions presented in the study are included in the article or supplementary material, the variation data reported in this article have been deposited in the Genome Variation Map (GVM) in Big Data Center, Beijing Institute of Genomics (BIG), and Chinese Academy of Sciences, under accession numbers GVM000388 that are publicly accessible at https://bigd.big.ac.cn/gvm/getProjectDetail?project=GVM000388. The Bioproject accession number is PRJCA011726. Further inquiries can be directed to the corresponding author.

Ethics statement

The animal study was reviewed and approved by Animal Care Advisory Committee, Northeast Agricultural University (Harbin, China).

Author contributions

ZW, PW, and XL conceived the study and participated in its design. YZ, JW, QK, and XN were involved in the acquisition of data. XL, JW and QZ performed all data analysis. XL and ZW drafted the manuscript. ZW, PW, XL, YZ, CZ, QK, and XN contributed to the writing and editing. All authors read and approved the final manuscript.

Funding

This work was financially supported by the Natural Science Foundation of China (No. 32070571), the Academic Backbone Project of Northeast Agricultural University (No. 15XG14), and the NEAU Research Founding for Excellent Young Teachers (2010RCB29).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  70 in total

1.  Genetic parameters of conformation traits, milk yield, and herd life in Holsteins.

Authors:  T H Short; T J Lawlor
Journal:  J Dairy Sci       Date:  1992-07       Impact factor: 4.034

2.  Meta-analysis of genome-wide association studies for cattle stature identifies common genes that regulate body size in mammals.

Authors:  Aniek C Bouwman; Hans D Daetwyler; Amanda J Chamberlain; Carla Hurtado Ponce; Mehdi Sargolzaei; Flavio S Schenkel; Goutam Sahana; Armelle Govignon-Gion; Simon Boitard; Marlies Dolezal; Hubert Pausch; Rasmus F Brøndum; Phil J Bowman; Bo Thomsen; Bernt Guldbrandtsen; Mogens S Lund; Bertrand Servin; Dorian J Garrick; James Reecy; Johanna Vilkki; Alessandro Bagnato; Min Wang; Jesse L Hoff; Robert D Schnabel; Jeremy F Taylor; Anna A E Vinkhuyzen; Frank Panitz; Christian Bendixen; Lars-Erik Holm; Birgit Gredler; Chris Hozé; Mekki Boussaha; Marie-Pierre Sanchez; Dominique Rocha; Aurelien Capitan; Thierry Tribout; Anne Barbat; Pascal Croiseau; Cord Drögemüller; Vidhya Jagannathan; Christy Vander Jagt; John J Crowley; Anna Bieber; Deirdre C Purfield; Donagh P Berry; Reiner Emmerling; Kay-Uwe Götz; Mirjam Frischknecht; Ingolf Russ; Johann Sölkner; Curtis P Van Tassell; Ruedi Fries; Paul Stothard; Roel F Veerkamp; Didier Boichard; Mike E Goddard; Ben J Hayes
Journal:  Nat Genet       Date:  2018-02-19       Impact factor: 38.330

3.  Prostaglandin E receptor subtype 4 regulates lipid droplet size and mitochondrial activity in murine subcutaneous white adipose tissue.

Authors:  Fan Ying; Yin Cai; Yu Cai; Yu Wang; Eva Hoi Ching Tang
Journal:  FASEB J       Date:  2017-05-22       Impact factor: 5.191

4.  Effects of short and long milking intervals on milking characteristics and changes of milk constituents during the course of milking in crossbred Istrian × Awassi × East-Friesian ewes.

Authors:  Alen Dzidic; Jordan Kuehnl; Maja Simic; Rupert M Bruckmaier
Journal:  J Dairy Res       Date:  2022-02-16       Impact factor: 1.904

5.  Breast cancer prognostic biomarker using attractor metagenes and the FGD3-SUSD3 metagene.

Authors:  Tai-Hsien Ou Yang; Wei-Yi Cheng; Tian Zheng; Matthew A Maurer; Dimitris Anastassiou
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2014-09-23       Impact factor: 4.254

6.  A genome-wide association study reveals loci influencing height and other conformation traits in horses.

Authors:  Heidi Signer-Hasler; Christine Flury; Bianca Haase; Dominik Burger; Henner Simianer; Tosso Leeb; Stefan Rieder
Journal:  PLoS One       Date:  2012-05-16       Impact factor: 3.240

7.  Eigen decomposition expedites longitudinal genome-wide association studies for milk production traits in Chinese Holstein.

Authors:  Chao Ning; Dan Wang; Xianrui Zheng; Qin Zhang; Shengli Zhang; Raphael Mrode; Jian-Feng Liu
Journal:  Genet Sel Evol       Date:  2018-03-26       Impact factor: 4.297

8.  Genome-wide association for milk production and lactation curve parameters in Holstein dairy cows.

Authors:  Hadi Atashi; Mazdak Salavati; Jenne De Koster; Jim Ehrlich; Mark Crowe; Geert Opsomer; Miel Hostens
Journal:  J Anim Breed Genet       Date:  2019-10-01       Impact factor: 2.380

9.  Genetic determinants of height growth assessed longitudinally from infancy to adulthood in the northern Finland birth cohort 1966.

Authors:  Ulla Sovio; Amanda J Bennett; Iona Y Millwood; John Molitor; Paul F O'Reilly; Nicholas J Timpson; Marika Kaakinen; Jaana Laitinen; Jari Haukka; Demetris Pillas; Ioanna Tzoulaki; Jassy Molitor; Clive Hoggart; Lachlan J M Coin; John Whittaker; Anneli Pouta; Anna-Liisa Hartikainen; Nelson B Freimer; Elisabeth Widen; Leena Peltonen; Paul Elliott; Mark I McCarthy; Marjo-Riitta Jarvelin
Journal:  PLoS Genet       Date:  2009-03-06       Impact factor: 5.917

10.  Meta-analysis of genome-wide scans for human adult stature identifies novel Loci and associations with measures of skeletal frame size.

Authors:  Nicole Soranzo; Fernando Rivadeneira; Usha Chinappen-Horsley; Ida Malkina; J Brent Richards; Naomi Hammond; Lisette Stolk; Alexandra Nica; Michael Inouye; Albert Hofman; Jonathan Stephens; Eleanor Wheeler; Pascal Arp; Rhian Gwilliam; P Mila Jhamai; Simon Potter; Amy Chaney; Mohammed J R Ghori; Radhi Ravindrarajah; Sergey Ermakov; Karol Estrada; Huibert A P Pols; Frances M Williams; Wendy L McArdle; Joyce B van Meurs; Ruth J F Loos; Emmanouil T Dermitzakis; Kourosh R Ahmadi; Deborah J Hart; Willem H Ouwehand; Nicholas J Wareham; Inês Barroso; Manjinder S Sandhu; David P Strachan; Gregory Livshits; Timothy D Spector; André G Uitterlinden; Panos Deloukas
Journal:  PLoS Genet       Date:  2009-04-03       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.