| Literature DB >> 29776327 |
Biaty Raymond1,2, Aniek C Bouwman3, Chris Schrooten4, Jeanine Houwing-Duistermaat5,6, Roel F Veerkamp3.
Abstract
BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selected from a genome-wide association study (GWAS), and to test our hypothesis that many non-causal markers in WGS data have a diluting effect on accuracy of across-breed prediction.Entities:
Mesh:
Year: 2018 PMID: 29776327 PMCID: PMC5960108 DOI: 10.1186/s12711-018-0396-8
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Description of scenarios used in the study
| Scenario | Description |
|---|---|
|
| |
| Full_seq | All available markers in WGS data that met the quality check criteria were included |
| HD | SNPs on the commercial BovineHD SNP chip |
| 50k | SNPs on the commercial 50k chip |
| HD_Top | Top_markers added to HD |
| 50k_Top | Top_markers added to 50k |
|
| |
| Pval5 | All markers that had a − log10(p) value higher than 5 from the meta-GWAS analysis |
| Pval7 | All markers that had a − log10(p) value higher than 7 form the meta-GWAS analysis |
| COJO3 | Markers selected using the following COJO model parameters thresholds: conditional and joint |
| COJO8 | Markers selected using the following COJO model parameters thresholds: conditional and joint p-value threshold (p) = 5e–8, collinearity between selected markers = 0.9, window size = 10 Mb |
| Top_markers | The most significant markers identified in each defined QTL region from the meta-GWAS analysi; only those that passed QC are included here |
Number of markers in each scenario and their mean minor allele frequency in Dutch Holsteins (DH), New Zealand Holsteins (NZH) and New Zealand Jerseys (NZJ)
| Scenario | Number of SNPs | Mean MAF | ||
|---|---|---|---|---|
| DH | NZH | NZJ | ||
|
| ||||
| Full_seq | 14,341,737 | 0.18 | 0.17 | 0.15 |
| HD | 583,078 | 0.27 | 0.28 | 0.24 |
| 50k | 48,912 | 0.25 | 0.26 | 0.21 |
| HD_TOP | 583,194 | 0.27 | 0.28 | 0.24 |
| 50k_TOP | 49,045 | 0.25 | 0.25 | 0.21 |
|
| ||||
| Pval5 | 59,828 | 0.25 | 0.23 | 0.17 |
| Pval7 | 23,125 | 0.24 | 0.23 | 0.17 |
| COJO3 | 1570 | 0.18 | 0.17 | 0.13 |
| COJO8 | 360 | 0.20 | 0.21 | 0.13 |
| Top_markers | 133 | 0.23 | 0.24 | 0.16 |
Composition of reference and validation sets used in the study
| Reference set (size) | Validation set (size) |
|---|---|
| New Zealand Holsteins (957) | New Zealand Jerseys (595) |
| Dutch Holsteins (5553) | |
| New Zealand Jerseys (595) | New Zealand Holsteins (957) |
| Dutch Holsteins (5553) | |
| Dutch Holsteins (5553) | New Zealand Holsteins (957) |
| New Zealand Jerseys (595) |
Fig. 1Proportion of genetic variance captured by markers in the different scenarios. The standard errors are indicated at the top of each bar
Accuracies of prediction for all prediction scenarios in the study
| Across-breed, across-country | Across-breed, within-country | Within-breed, across-country | ||||
|---|---|---|---|---|---|---|
| Reference | DH | NZJ | NZH | NZJ | NZH | DH |
| Validation | NZJ | DH | NZJ | NZH | DH | NZH |
|
| ||||||
| Full_seq (14,341,737) | 0.08 | 0.02 | 0.09 | 0.06 | 0.41 | 0.51 |
| HD (583,078) | 0.10 | 0.05 | 0.07 | − 0.04 | 0.43 | 0.55 |
| 50k (48,912) | 0.06 | 0.05 | 0.13 | − 0.04 | 0.42 | 0.52 |
| HD_Top (583,194 | 0.10 | 0.05 | 0.07 | − 0.04 | 0.43 | 0.55 |
| 50k_Top (49,045) | 0.07 | 0.06 | 0.13 | − 0.03 | 0.43 | 0.53 |
|
| ||||||
| Pval5 (59,828) | 0.19 | 0.19 | 0.14 | 0.32 | 0.52 | 0.58 |
| Pval7 (23,125) | 0.01 | 0.15 | 0.07 | 0.35 | 0.46 | 0.47 |
| COJO3 (1570) | 0.20 | 0.22 | 0.14 | 0.31 | 0.45 | 0.52 |
| COJO8 (360) | 0.21 | 0.25 | 0.16 | 0.18 | 0.47 | 0.44 |
| Top_markers (133) | 0.23 | 0.18 | 0.08 | 0.27 | 0.41 | 0.47 |
| SE* | 0.04 | 0.01 | 0.04 | 0.03 | 0.01 | 0.01 |
Predictions were carried out either across breed and across country, across breed and within country or within breed and across country. The populations are Dutch Holsteins (DH), New Zealand Holsteins (NZH) and New Zealand Jerseys (NZJ)
The unselected marker sets are the whole-genome sequence (Full_seq), high-density markers (HD) and markers on the traditional 50k chip (50k). HD_Top and 50k_Top are scenarios in which some pre-selected markers from a meta-GWAS (Top_markers) are added to the HD and 50k markers respectively. Pval5 and Pval7 are marker sets pre-selected from a meta-GWAS based on their p values. The COJO scenarios are those containing markers that are assumed to be independently significant markers from a meta-GWAS at different significant levels. Top_markers contain the most significant markers in each QTL region from a meta-GWAS
*Standard error (SE) of estimates, did not differ across the different sets of markers, provided the reference and validation populations remained the same
Intercept and slope of regression when deregressed breeding values of validation animals were regressed onto their predicted genomic breeding values
| Across-breed, across-country | ||||
|---|---|---|---|---|
| Scenarios | DH reference, NZJ validation | NZJ reference, DH validation | ||
| Intercept | Slope | Intercept | Slope | |
| Full_seq | 0.28 | 0.38 (0.20) | − 0.01 | 0.36 (0.22) |
| HD | 0.39 | 0.41 (0.16) | − 0.02 | 0.73 (0.20) |
| 50k | 0.24 | 0.20 (0.14) | − 0.01 | 0.63 (0.16) |
| HD_Top | 0.39 | 0.41 (0.16) | − 0.02 | 0.74 (0.20) |
| 50k_Top | 0.28 | 0.23 (0.13) | − 0.02 | 0.73 (0.16) |
| Pval5 | 0.30 | 0.36 (0.08) | − 0.03 | 0.59 (0.04) |
| Pval7 | 0.01 | 0.01 (0.06) | − 0.04 | 0.42 (0.03) |
| COJO3 | 0.40 | 0.35 (0.07) | − 0.05 | 0.71 (0.04) |
| COJO8 | 0.13 | 0.40 (0.07) | − 0.03 | 0.81 (0.04) |
| Top_markers | 0.46 | 0.43 (0.08) | − 0.06 | 0.48 (0.03) |
The populations are Dutch Holsteins (DH), New Zealand Holsteins (NZH) and New Zealand Jerseys (NZJ). Standard errors for the slopes are given in parentheses
The unselected marker sets are the whole-genome sequence (Full_seq), high-density markers (HD) and markers on the traditional 50k chip (50k). HD_Top and 50k_Top are scenarios in which some pre-selected markers from a meta-GWAS (Top_markers) are added to the HD and 50k markers, respectively. Pval5 and Pval7 are marker sets pre-selected from a meta-GWAS based on their p values. The COJO scenarios are those containing markers that are assumed to be independently significant markers from a meta-GWAS at different significant levels. Top_markers contain the most significant markers in each QTL region from a meta-GWAS