| Literature DB >> 30092776 |
Jun He1,2, Yage Guo1,3, Jiaqi Xu1,4, Hao Li1,5, Anna Fuller1, Richard G Tait1, Xiao-Lin Wu6,7, Stewart Bauck1.
Abstract
BACKGROUND: SNPs are informative to estimate genomic breed composition (GBC) of individual animals, but selected SNPs for this purpose were not made available in the commercial bovine SNP chips prior to the present study. The primary objective of the present study was to select five common SNP panels for estimating GBC of individual animals initially involving 10 cattle breeds (two dairy breeds and eight beef breeds). The performance of the five common SNP panels was evaluated based on admixture model and linear regression model, respectively. Finally, the downstream implication of GBC on genomic prediction accuracies was investigated and discussed in a Santa Gertrudis cattle population.Entities:
Keywords: Admixture model; Cattle; Genomic breed composition; Genomic prediction; SNP
Mesh:
Year: 2018 PMID: 30092776 PMCID: PMC6085684 DOI: 10.1186/s12863-018-0654-3
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Descriptive statistics of genotype data for 29,609 animals used in the present study
| Cattle Breed | Number of Genotyped Animals a | Number of SNPs | Average MAF Mean (SD) | Breed Type |
|---|---|---|---|---|
| Holstein | 8,905 (8,863) | 49,463 | 0.295 (0.152) | Dairy |
| Jersey | 6,911 (6,860) | 49,463 | 0.256 (0.158) | Dairy |
| Akaushi | 198 (167) | 49,463 | 0.243 (0.158) | Beef |
| Angus | 4,713 (4,672) | 49,463 | 0.303 (0.152) | Beef |
| BM | 608 (583) | 49,463 | 0.305 (0.142) | Beef |
| Brangus | 1,819 (1,770) | 40,660 | 0.238 (0.161) | Beef |
| Hereford | 2,423 (2,412) | 49,463 | 0.270 (0.150) | Beef |
| RA | 2,229 (2,158) | 49,463 | 0.300 (0.151) | Beef |
| SG | 297 (291) | 49,463 | 0.301 (0.140) | Beef |
| Wagyu | 1,506 (1,506) | 40,660 | 0.188 (0.164) | Beef |
BM Beefmaster, RA Red Angus, SG Santa Gertrudis, MAF minor allele frequency, SD standard deviation of MAF
aIn the brackets are the number of animals in the reference set for each breed, after removing outliers
Summary statistics of expected progeny differences (EPD) and accuracies of EPD of nine quantitative traits for 1424 animals presented as Santa Gertrudis
| Trait | N | Min | Q25% | Median | Q75% | Max | Mean | SD |
|---|---|---|---|---|---|---|---|---|
| EPD | ||||||||
| BW, lb | 1424 | −8.411 | −0.725 | −0.263 | 0.381 | 6.742 | −0.14 | 0.992 |
| FAT, in | 1424 | −0.125 | −0.002 | 0.001 | 0.002 | 0.062 | 0 | 0.006 |
| HCW, lb | 1424 | − 35.79 | − 4.188 | − 1.01 | 3.795 | 39.95 | 0.145 | 7.041 |
| MARBa | 1424 | −0.329 | − 0.017 | 0.004 | 0.012 | 0.473 | −0.004 | 0.04 |
| MWW, lb | 1424 | −23.04 | −2.051 | 0.4555 | 2.783 | 19.63 | 0.311 | 4.393 |
| REA, sq. in | 1424 | −0.551 | −0.041 | 0.003 | 0.043 | 0.647 | 0.005 | 0.091 |
| SC, cm | 1424 | −1.044 | −0.069 | 0.017 | 0.072 | 1.24 | −0.008 | 0.152 |
| WW, lb | 1424 | −32.37 | −4.211 | −1.118 | 4.175 | 46.97 | 0.235 | 7.093 |
| YW, lb | 1424 | −45.82 | −5.263 | −1 | 5.62 | 54.39 | 0.702 | 9.901 |
| Accuracy of EPD | ||||||||
| BW | 1424 | 0.001 | 0.049 | 0.136 | 0.174 | 0.852 | 0.279 | 0.149 |
| FAT | 1424 | 0.001 | 0.006 | 0.023 | 0.081 | 0.714 | 0.097 | 0.125 |
| HCW | 1424 | 0.001 | 0.022 | 0.066 | 0.099 | 0.574 | 0.139 | 0.099 |
| MARB | 1424 | 0.001 | 0.003 | 0.014 | 0.059 | 0.628 | 0.069 | 0.095 |
| MWW | 1424 | 0.001 | 0.073 | 0.161 | 0.181 | 0.84 | 0.269 | 0.128 |
| REA | 1424 | 0.001 | 0.01 | 0.032 | 0.069 | 0.595 | 0.089 | 0.088 |
| SC | 1424 | 0.001 | 0.003 | 0.009 | 0.051 | 0.599 | 0.039 | 0.097 |
| WW | 1424 | 0.001 | 0.059 | 0.158 | 0.191 | 0.887 | 0.297 | 0.151 |
| YW | 1424 | 0.001 | 0.034 | 0.098 | 0.135 | 0.714 | 0.197 | 0.126 |
Min minimum value, Median median value (50% quantile), Max maximum value, QX% X% quantile, where X = 25 and 75, respectively, SD standard deviation, BW birth weight, WW weaning weight, HCW hot carcass weight, MARB marbling score, MWW maternal weaning weight, FAT fat thickness, REA ribeye area, SC scrotal circumference, YW yearling weight
a4.00 = Slight, 5.00 = Small, 6.00 = Modest, 7.00 = Moderate, 8.00 = Slightly Abundant
Fig. 1Average Euclidean distance of SNP allele frequencies for each of the five SNP panels. a = 1,000 SNPs; b = 3,000 SNPs; c = 5,000 SNPs; d = 10,000 SNPs; e = 15,708 SNPs. The X-axis represents average Euclidean distance (AED) between SNP allele frequencies, and the y-axis represents frequencies of SNPs for a given level of AED
Fig. 2Plots of -2logLikelihood for: (a) 198 animals presented as Akaushi (red circles) and (b) 2,423 animals presented as Hereford (green circles), obtained with assumed true allele frequencies of SNPs for each of the 10 breeds. The blue arrow below the x-axis indicates the cutoff value of the -2logLikelihood values for data cleaning. Assumed breeds from left to right are: (a) Akaushi, Wagyu, Santa Gertrudis, Beef Master, Holstein, Brangus, Hereford, Red Angus, Angus, and Jersey; (b) Hereford, Beef Master, Santa Gertrudis, Red Angus, Brangus, Angus, Holstein, Wagyu, Jersey, and Akaushi
Fig. 3Hierarchical clustering of 10 cattle breeds based on Euclidean distance of allele frequencies of SNPs on the 1 K SNP panel
Distribution of genomic breed composition (GBC) of 198 animals presented as Akaushi
| Akaushi breed coefficient | Admixture model | Linear Regression model | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 K | 3 K | 5 K | 10 K | 16 K | 1 K | 3 K | 5 K | 10 K | 16 K | |
| =1 | 166 | 166 | 166 | 166 | 166 | 57 | 125 | 142 | 150 | 151 |
| [0.9, 1.0) | 1 | 1 | 1 | 1 | 0 | 71 | 36 | 24 | 18 | 17 |
| [0.8, 0.9) | 9 | 8 | 9 | 9 | 11 | 41 | 13 | 9 | 7 | 8 |
| [0.7, 0.8) | 4 | 5 | 4 | 4 | 3 | 8 | 4 | 5 | 5 | 4 |
| [0.6, 0.7) | 1 | 0 | 1 | 1 | 1 | 3 | 4 | 1 | 1 | 3 |
| [0.5, 0.6) | 11 | 12 | 11 | 11 | 11 | 6 | 9 | 11 | 11 | 9 |
| [0.4, 0.5) | 0 | 0 | 0 | 0 | 0 | 5 | 1 | 0 | 0 | 0 |
| [0.3, 0.4) | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| [0.2, 0.3) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| [0.1, 0.2) | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| [0, 0.1) | 5 | 5 | 5 | 5 | 5 | 5 | 5 | 5 | 5 | 5 |
[x,y) = an interval of GBC in which the value is greater than (or equal to) x and less than y
Fig. 4Plots of genomic breed composition (GBC) of 198 animals presented as Akaushi based on an admixture model versus a linear regression model: (a) GBC were estimated using 1 K SNP panel; (b) GBC were estimated using 16 K SNP panel
Fig. 5Plots of genomic prediction accuracies on nine traits in 1424 beef cattle with varying level of genomic breed composition of the Santa Gertrudis breed (GBCSG). BW = birth weight; WW = weaning weight; HCW = hot carcass weight; MARB = marbling score; MWW = maternal weaning weight; FAT = fat thickness; REA = ribeye area; SC = Scrotal Circumference; YW = yearling weight; IVA = independent validation on 71 cattle with 0 ≤ GBCSG < 0.70 while SNP effects were estimated from the 1225 animals with GBCSG ≥0.90; IVB independent validation on 128 cattle with 0.70 ≤ GBCSG < 0.90 while SNP effects estimated from the 1225 animals with GBCSG > 0.90; LOOCV = leave-out cross-validation in the 1225 animals with GBCSG ≥0.90