| Literature DB >> 26895843 |
Oscar O M Iheshiulor1, John A Woolliams2,3, Xijiang Yu4, Robin Wellmann5, Theo H E Meuwissen6.
Abstract
BACKGROUND: Currently, genomic prediction in cattle is largely based on panels of about 54k single nucleotide polymorphisms (SNPs). However with the decreasing costs of and current advances in next-generation sequencing technologies, whole-genome sequence (WGS) data on large numbers of individuals is within reach. Availability of such data provides new opportunities for genomic selection, which need to be explored.Entities:
Mesh:
Year: 2016 PMID: 26895843 PMCID: PMC4759725 DOI: 10.1186/s12711-016-0193-1
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Population structure and parameters used in the simulation
| Number of chromosomes | 1 |
| Genome length | 1 Morgan |
| Mutation rate | 10−8/bp/meiosis |
| Effective population size (Ne) | 200 |
| Recombination | Haldane map function |
| QTL density | 45 or 132/Morgan |
| QTL effects | Normal distribution |
| Number of generations | 1950 + 50 or 1990 + 10 |
| Heritability | 0.30 and 0.07 |
Fig. 1Distribution of allele frequencies in populations A and B at 10 and 50 generations of divergence. A_10 (B_10) and A_50 (B_50) refer to different times of divergence (T = 10 or 50) between both populations. The plots are the result of one replicate. SNP alleles that were fixed in both populations were excluded
Fig. 2QTL variance for one of the replicates of populations A and B at 10 and 50 generations of divergence. QTL variance was calculated as 2pqa2. QTL that were fixed in both populations were excluded. Pop_A and Pop_B refers to populations A and B at T = 10 or 50 generations of divergence, respectively
Fig. 3Distribution of QTL variance for populations A and B at 10 and 50 generations of divergence. QTL variance was calculated as 2pqa2. A_10 (B_10) and A_50 (B_50) refer to different times of divergence (T = 10 or 50) between both populations. The plots are the result of one replicate. QTL that were fixed in both populations were excluded
Fig. 4Linkage disequilibrium (r2) and persistency of phase (r) as a function of genomic distance. a Average linkage disequilibrium (LD) between SNPs estimated according to [39]. Pop_A_10 and Pop_B_10 refer to divergence of populations A and B by 10 generations while _50 refers to divergence by 50 generations. b Persistency of LD phase (i.e. the correlation of LD between populations A and B, [40]). Calculations are within populations A and B at different times of divergence (T = 10 or 50). Values are binned at an interval of 50 kb. The plots are the result of one replicate of simulated WGS data. Calculations were done with PLINK [41]
Accuracy of genomic prediction (±SE) for a trait with a heritability of 0.30 for population A based on SNP-BLUP or MixP using the different datasets
| Dataset | 45 QTL/Morgan | 132 QTL/Morgan | ||
|---|---|---|---|---|
| Accuracy | % decrease | Accuracy | % decrease | |
| SNP-BLUP | ||||
| WGS data | 0.596 (±0.015) | 0.582 (±0.014) | ||
| data3000 | 0.578 (±0.016) | 3.0 | 0.575 (±0.014) | 1.2 |
| data2000 | 0.576 (±0.014) | 3.4 | 0.568 (±0.014) | 2.4 |
| data1000 | 0.564 (±0.014) | 5.4 | 0.555 (±0.015) | 4.6 |
| data200 | 0.473 (±0.015) | 20.0 | 0.468 (±0.014) | 19.6 |
| MixP | ||||
| WGS data | 0.632 (±0.018) | 0.587 (±0.014) | ||
| data3000 | 0.598 (±0.018) | 5.4 | 0.579 (±0.014) | 1.4 |
| data2000 | 0.591 (±0.015) | 6.5 | 0.573 (±0.014) | 2.4 |
| data1000 | 0.579 (±0.015) | 8.4 | 0.562 (±0.015) | 4.3 |
| data200 | 0.484 (±0.016) | 23.4 | 0.465 (±0.014) | 20.8 |
Accuracy of prediction was measured as the correlation between simulated true and predicted genetic values in the validation dataset
% decrease in accuracy of prediction relative to that obtained with WGS data
Accuracy of genomic prediction (±SE) for a trait with a heritability of 0.07 for population A based on SNP-BLUP or MixP using the different datasets
| Dataset | 45 QTL/Morgan | 132 QTL/Morgan | ||
|---|---|---|---|---|
| Accuracy | % decrease | Accuracy | % decrease | |
| SNP-BLUP | ||||
| WGS data | 0.413 (±0.024) | 0.347 (±0.019) | ||
| data3000 | 0.400 (±0.023) | 3.1 | 0.332 (±0.019) | 4.3 |
| data2000 | 0.394 (±0.021) | 4.6 | 0.326 (±0.019) | 6.1 |
| data1000 | 0.377 (±0.023) | 8.7 | 0.326 (±0.019) | 6.1 |
| data200 | 0.326 (±0.021) | 20.6 | 0.273 (±0.016) | 21.1 |
| MixP | ||||
| WGS data | 0.431 (±0.028) | 0.348 (±0.019) | ||
| data3000 | 0.407 (±0.025) | 5.6 | 0.333 (±0.019) | 4.3 |
| data2000 | 0.404 (±0.023) | 6.3 | 0.327 (±0.019) | 6.0 |
| data1000 | 0.382 (±0.024) | 11.4 | 0.328 (±0.019) | 5.7 |
| data200 | 0.338 (±0.023) | 21.6 | 0.276 (±0.016) | 20.7 |
Accuracy of prediction was measured as the correlation between simulated true and predicted genetic value in the validation dataset
% decrease in accuracy of prediction relative to that obtained with WGS data
Accuracy of across-population genomic prediction (SE) for a trait with a heritability of 0.30 or 0.07 when populations have diverged for T (10 or 50) generations: population A (reference) and population B (validation) based on SNP-BLUP or MixP using the different datasets
| Dataset | h2 = 0.30 | h2 = 0.07 | ||||||
|---|---|---|---|---|---|---|---|---|
| T = 10 | T = 50 | T = 10 | T = 50 | |||||
| SNP-BLUP | MixP | SNP-BLUP | MixP | SNP-BLUP | MixP | SNP-BLUP | MixP | |
| 45 QTL/Morgan | ||||||||
| WGS data | 0.396 (0.017) | 0.482 (0.023) | 0.276 (0.021) | 0.360 (0.030) | 0.270 (0.018) | 0.286 (0.022) | 0.171 (0.023) | 0.197 (0.029) |
| data3000 | −0.004 (0.015) | 0.008 (0.016) | 0.000 (0.022) | 0.026 (0.023) | 0.015 (0.016) | 0.015 (0.017) | −0.003 (0.019) | −0.003 (0.018) |
| data2000 | 0.001 (0.018) | 0.001 (0.027) | 0.001 (0.017) | −0.002 (0.017) | 0.030 (0.018) | 0.032 (0.018) | −0.003 (0.017) | −0.017 (0.023) |
| data1000 | 0.020 (0.019) | 0.009 (0.019) | −0.023 (0.019) | −0.005 (0.020) | −0.013 (0.015) | −0.017 (0.015) | −0.009 (0.014) | −0.005 (0.015) |
| data200 | 0.001 (0.013) | −0.010 (0.015) | −0.030 (0.021) | −0.012 (0.017) | 0.002 (0.021) | −0.001 (0.022) | −0.034 (0.021) | −0.044 (0.026) |
| 132 QTL/Morgan | ||||||||
| WGS data | 0.392 (0.021) | 0.403 (0.021) | 0.238 (0.019) | 0.246 (0.019) | 0.209 (0.016) | 0.211 (0.015) | 0.186 (0.018) | 0.186 (0.019) |
| data3000 | 0.022 (0.017) | 0.025 (0.017) | 0.049 (0.023) | 0.052 (0.015) | 0.005 (0.016) | 0.005 (0.017) | −0.018 (0.019) | −0.018 (0.019) |
| data2000 | 0.013 (0.017) | 0.021 (0.019) | −0.003 (0.017) | −0.004 (0.017) | −0.026 (0.014) | −0.018 (0.012) | 0.001 (0.015) | 0.001 (0.015) |
| data1000 | 0.022 (0.017) | 0.028 (0.018) | −0.001 (0.017) | −0.002 (0.017) | 0.009 (0.017) | 0.009 (0.017) | 0.017 (0.018) | 0.019 (0.017) |
| data200 | 0.005 (0.017) | 0.003 (0.015) | −0.022 (0.014) | −0.019 (0.015) | 0.002 (0.014) | 0.005 (0.017) | −0.005 (0.017) | −0.005 (0.017) |
Accuracy of prediction was measured as the correlation between simulated true and predicted genetic value in the validation dataset
Accuracy of genomic prediction (SE) for a trait with a heritability of 0.30 or 0.07 using a multi-breed reference population when populations have diverged for T (10 or 50) generations, based on SNP-BLUP or MixP using the different datasets
| Dataset | h2 = 0.30 | h2 = 0.07 | ||||||
|---|---|---|---|---|---|---|---|---|
| T = 10 | T = 50 | T = 10 | T = 50 | |||||
| SNP-BLUP | MixP | SNP-BLUP | MixP | SNP-BLUP | MixP | SNP-BLUP | MixP | |
| 45 QTL/Morgan | ||||||||
| WGS data | 0.654 (0.013) | 0.710 (0.015) | 0.627 (0.013) | 0.675 (0.019) | 0.475 (0.021) | 0.525 (0.025) | 0.448 (0.023) | 0.480 (0.028) |
| data3000 | 0.578 (0.016) | 0.602 (0.018) | 0.571 (0.016) | 0.592 (0.019) | 0.404 (0.023) | 0.414 (0.024) | 0.398 (0.024) | 0.409 (0.026) |
| data2000 | 0.571 (0.015) | 0.574 (0.016) | 0.564 (0.016) | 0.558 (0.017) | 0.400 (0.022) | 0.414 (0.025) | 0.387 (0.022) | 0.390 (0.024) |
| data1000 | 0.552 (0.017) | 0.551 (0.019) | 0.546 (0.017) | 0.549 (0.017) | 0.376 (0.022) | 0.377 (0.023) | 0.368 (0.022) | 0.367 (0.022) |
| data200 | 0.424 (0.017) | 0.406 (0.017) | 0.428 (0.019) | 0.423 (0.021) | 0.281 (0.019) | 0.273 (0.019) | 0.295 (0.020) | 0.289 (0.021) |
| 132 QTL/Morgan | ||||||||
| WGS data | 0.647 (0.011) | 0.662 (0.011) | 0.612 (0.014) | 0.624 (0.014) | 0.410 (0.016) | 0.412 (0.017) | 0.370 (0.018) | 0.374 (0.019) |
| data3000 | 0.569 (0.012) | 0.573 (0.013) | 0.573 (0.015) | 0.576 (0.015) | 0.343 (0.016) | 0.345 (0.016) | 0.332 (0.019) | 0.332 (0.020) |
| data2000 | 0.568 (0.012) | 0.570 (0.013) | 0.565 (0.015) | 0.567 (0.015) | 0.341 (0.018) | 0.341 (0.018) | 0.331 (0.019) | 0.330 (0.019) |
| data1000 | 0.537 (0.012) | 0.536 (0.012) | 0.542 (0.016) | 0.521 (0.015) | 0.313 (0.013) | 0.313 (0.013) | 0.322 (0.019) | 0.321 (0.020) |
| data200 | 0.427 (0.017) | 0.416 (0.017) | 0.430 (0.014) | 0.411 (0.014) | 0.233 (0.017) | 0.224 (0.017) | 0.246 (0.016) | 0.237 (0.017) |
Accuracy of prediction was measured as the correlation between simulated true and predicted genetic value in the validation dataset
Accuracy of genomic prediction (SE) for a trait with a heritability of 0.30 based on single- and multi-breed reference populations (RP) obtained with SNP-BLUP or MixP using the different datasets
| Dataset | Single-breed | Multi-breed | ||||||
|---|---|---|---|---|---|---|---|---|
| RP = 200 | RP = 400 | RP = 400 | ||||||
| T = 10 | T = 50 | |||||||
| SNP-BLUP | MixP | SNP-BLUP | MixP | SNP-BLUP | MixP | SNP-BLUP | MixP | |
| 45 QTL/Morgan | ||||||||
| WGS data | 0.596 (0.015) | 0.632 (0.018) | 0.696 (0.014) | 0.736 (0.017) | 0.654 (0.013) | 0.710 (0.015) | 0.627 (0.013) | 0.675 (0.019) |
| data3000 | 0.578 (0.016) | 0.598 (0.018) | 0.681 (0.013) | 0.719 (0.015) | 0.578 (0.016) | 0.602 (0.018) | 0.571 (0.016) | 0.592 (0.019) |
| data2000 | 0.576 (0.014) | 0.591 (0.015) | 0.679 (0.013) | 0.713 (0.014) | 0.571 (0.015) | 0.574 (0.016) | 0.564 (0.016) | 0.558 (0.017) |
| data1000 | 0.564 (0.014) | 0.579 (0.015) | 0.668 (0.013) | 0.697 (0.014) | 0.552 (0.017) | 0.551 (0.019) | 0.546 (0.017) | 0.549 (0.017) |
| data200 | 0.473 (0.015) | 0.484 (0.016) | 0.566 (0.014) | 0.576 (0.016) | 0.424 (0.017) | 0.406 (0.017) | 0.428 (0.019) | 0.423 (0.021) |
| 132 QTL/Morgan | ||||||||
| WGS data | 0.582 (0.014) | 0.587 (0.014) | 0.691 (0.011) | 0.702 (0.012) | 0.647 (0.011) | 0.662 (0.011) | 0.612 (0.014) | 0.624 (0.014) |
| data3000 | 0.575 (0.014) | 0.579 (0.014) | 0.681 (0.010) | 0.688 (0.011) | 0.569 (0.012) | 0.573 (0.013) | 0.573 (0.015) | 0.576 (0.015) |
| data2000 | 0.568 (0.014) | 0.573 (0.014) | 0.670 (0.012) | 0.677 (0.012) | 0.568 (0.012) | 0.570 (0.013) | 0.565 (0.015) | 0.567 (0.015) |
| data1000 | 0.555 (0.015) | 0.562 (0.014) | 0.654 (0.012) | 0.665 (0.012) | 0.537 (0.012) | 0.536 (0.012) | 0.542 (0.016) | 0.521 (0.015) |
| data200 | 0.468 (0.014) | 0.465 (0.014) | 0.568 (0.013) | 0.563 (0.013) | 0.427 (0.017) | 0.416 (0.017) | 0.430 (0.014) | 0.411 (0.014) |
Accuracy of prediction was measured as the correlation between simulated true and predicted genetic value in the validation dataset