| Literature DB >> 21933416 |
Fernanda V Brito1, José Braccini Neto, Mehdi Sargolzaei, Jaime A Cobuci, Flavio S Schenkel.
Abstract
BACKGROUND: The success of genomic selection depends mainly on the extent of linkage disequilibrium (LD) between markers and quantitative trait loci (QTL), the number of animals in the training set (TS) and the heritability (h2) of the trait. The extent of LD depends on the genetic structure of the population and the density of markers. The aim of this study was to calculate accuracy of direct genomic estimated breeding values (DGEBV) using best linear unbiased genomic prediction (GBLUP) for different marker densities, heritabilities and sizes of the TS in simulated populations that mimicked previously reported extent and pattern of LD in beef cattle.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21933416 PMCID: PMC3224120 DOI: 10.1186/1471-2156-12-80
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Parameters of the simulation process
| Population structure | POP1/POP2 |
|---|---|
| Number of generations(size) - phase 1 | 1000(1000) |
| Number of generations(size) - phase 2 | 1020(200) |
| Number of founder males from HG | 100 |
| Number of founder females from HG | 100 |
| Number of generations | 8/6 |
| Number of offspring per dam | 5 |
| Number of founder males from EG | 640/160 |
| Number of founder females from EG | 32000/8000 |
| Number of generations | 10 |
| Number of offspring per dam | 1 |
| Ratio of male | 50% |
| Mating system | Random |
| Replacement ratio for males | 60% |
| Replacement ratio for females | 20% |
| Selection/culling | EBV |
| BV estimation method | BLUP animal model |
| Ratio of missing sire and dam | 5% |
| Heritability of the trait | 10%, 25%, or 40% |
| Phenotypic variance | 1.0 |
| Number of chromosomes | 29 |
| Total length | 2333 cM |
| Number of markers | 40000/800000 |
| Marker distribution | Evenly spaced |
| Number of QTL | 750 |
| QTL distribution | Random |
| MAF for markers | 0.1 |
| MAF for QTL | 0.1 |
| Additive allelic effects for markers | Neutral |
| Additive allelic effects for QTL | Gamma distribution (shape = 0.40) |
| Rate of missing marker genotypes | 0.01 |
| Rate of marker genotyping error | 0.005 |
| Rate of recurrent mutation | 0.0001 |
POP1: population 1; POP2: population 2; EBV: estimated breeding value; BV: breeding value; QTL: quantitative trait loci; MAF: minor allele frequency.
The whole simulation process was repeated 10 times.
Figure 1Schematic representation of the simulation steps.
Average Linkage Disequilibrium (r) between adjacent SNP markers.
| POP1 (40 k) | POP2 (800 k) | ||||
|---|---|---|---|---|---|
| 1 | 146 | 2712 | 0.25(0.25) | 47539 | 0.33(0.30) |
| 2 | 126 | 2205 | 0.25(0.25) | 38629 | 0.33(0.30) |
| 3 | 116 | 2060 | 0.24(0.25) | 38824 | 0.31(0.29) |
| 4 | 111 | 2011 | 0.23(0.24) | 37889 | 0.30(0.28) |
| 5 | 119 | 1749 | 0.24(0.24) | 32230 | 0.32(0.30) |
| 6 | 112 | 1962 | 0.26(0.26) | 38405 | 0.32(0.29) |
| 7 | 101 | 1687 | 0.25(0.26) | 33209 | 0.31(0.29) |
| 8 | 104 | 1748 | 0.24(0.25) | 33833 | 0.32(0.30) |
| 9 | 95 | 1369 | 0.26(0.26) | 30472 | 0.31(0.29) |
| 10 | 96 | 1413 | 0.27(0.26) | 30513 | 0.32(0.29) |
| 11 | 102 | 1551 | 0.25(0.25) | 33513 | 0.30(0.28) |
| 12 | 78 | 1070 | 0.24(0.26) | 23437 | 0.31(0.29) |
| 13 | 83 | 1249 | 0.25(0.25) | 27385 | 0.32(0.30) |
| 14 | 82 | 1290 | 0.25(0.25) | 27452 | 0.33(0.30) |
| 15 | 75 | 1081 | 0.27(0.27) | 22531 | 0.33(0.30) |
| 16 | 73 | 1150 | 0.24(0.25) | 22891 | 0.33(0.30) |
| 17 | 70 | 1195 | 0.27(0.26) | 23281 | 0.31(0.29) |
| 18 | 63 | 987 | 0.23(0.23) | 19391 | 0.32(0.29) |
| 19 | 63 | 966 | 0.25(0.26) | 20719 | 0.31(0.29) |
| 20 | 68 | 1133 | 0.24(0.24) | 21775 | 0.35(0.31) |
| 21 | 63 | 1032 | 0.25(0.26) | 19720 | 0.32(0.30) |
| 22 | 60 | 946 | 0.26(0.26) | 19212 | 0.31(0.29) |
| 23 | 49 | 772 | 0.25(0.26) | 15958 | 0.31(0.29) |
| 24 | 60 | 901 | 0.24(0.24) | 18087 | 0.32(0.30) |
| 25 | 42 | 749 | 0.25(0.25) | 14861 | 0.31(0.29) |
| 26 | 48 | 840 | 0.25(0.26) | 15999 | 0.30(0.30) |
| 27 | 43 | 670 | 0.26(0.26) | 13563 | 0.32(0.30) |
| 28 | 40 | 704 | 0.25(0.26) | 13407 | 0.32(0.30) |
| 29 | 45 | 806 | 0.26(0.26) | 14420 | 0.33(0.30) |
| Overall | 2333 | 38008 | 0.25(0.26) | 749145 | 0.32(0.30) |
The results are for each of the 29 autosomes in the recent generations for moderate heritability (0.25) and POP1 (40 k) and POP2 (800 k) over 10 replicates. SD: Standard Deviation.
Figure 2Schematic representation of the simulated scenarios. n_TS: number of bulls in the training set. Accuracies of EBV for all n_TS were 0.79, 0.90 or 0.94 for heritability = 0.10, 0.25 or 0.40, respectively. The average progeny size was 73 for all n_TS and heritability levels.
Average Linkage Disequilibrium (r) for different distances between closely located SNP pairs.
| POP1 (40 k) | POP2 (800 k) | ||||||
|---|---|---|---|---|---|---|---|
| h2 | Distance | Pairs | Frequency | Pairs | Frequency | ||
| 0.00-0.10 | 5049 | 0.23 (0.24) | 1512 (29.95) | 4275 | 0.24 (0.25) | 1272 (29.75) | |
| 0.10-0.20 | 5074 | 0.17 (0.19) | 1063 (20.95) | 4474 | 0.18 (0.20) | 919 (20.54) | |
| 0.20-0.30 | 6265 | 0.14 (0.16) | 885 (14.13) | 4290 | 0.14 (0.16) | 644 (15.01) | |
| 0.30-0.40 | 4938 | 0.12 (0.14) | 510 (10.33) | 4358 | 0.12 (0.14) | 493 (11.31) | |
| 0.40-0.50 | 5042 | 0.10 (0.13) | 425 (8.43) | 4280 | 0.10 (0.13) | 359 (8.39) | |
| 0.10 | 0.50-0.60 | 6199 | 0.09 (0.11) | 370 (5.97) | 4196 | 0.09 (0.12) | 279 (6.65) |
| 0.60-0.70 | 4994 | 0.08 (0.10) | 228 (4.57) | 4208 | 0.09 (0.11) | 242 (5.75) | |
| 0.70-0.80 | 5975 | 0.07 (0.09) | 187 (3.69) | 4268 | 0.08 (0.10) | 190 (4.45) | |
| 0.80-0.90 | 6183 | 0.07 (0.09) | 167 (2.70) | 4334 | 0.07 (0.10) | 170 (3.92) | |
| 0.90-1.00 | 4917 | 0.06 (0.08) | 128 (2.60) | 4163 | 0.07 (0.09) | 149 (3.58) | |
| Overall | 3932610 | 0.008 (0.03) | 6151 (0.16) | 3126250 | 0.01 (0.03) | 6062 (0.19) | |
| 0.00-0.10 | 5248 | 0.22 (0.24) | 1495 (28.49) | 4494 | 0.24 (0.25) | 1375 (30.60) | |
| 0.10-0.20 | 5224 | 0.17 (0.19) | 1077 (20.62) | 4740 | 0.18 (0.20) | 1006 (21.22) | |
| 0.20-0.30 | 6528 | 0.13 (0.16) | 855 (13.10) | 4630 | 0.14 (0.17) | 724 (15.64) | |
| 0.30-0.40 | 5147 | 0.11 (0.14) | 461 (8.96) | 4560 | 0.12 (0.14) | 488 (10.70) | |
| 0.40-0.50 | 5112 | 0.09 (0.12) | 315 (6.16) | 4596 | 0.11 (0.13) | 406 (8.83) | |
| 0.25 | 0.50-0.60 | 6466 | 0.08 (0.11) | 343 (5.31) | 4469 | 0.10 (0.12) | 333 (7.45) |
| 0.60-0.70 | 5146 | 0.07 (0.09) | 186 (3.61) | 4418 | 0.09 (0.11) | 250 (5.66) | |
| 0.70-0.80 | 5227 | 0.07 (0.09) | 139 (2.66) | 4502 | 0.08 (0.10) | 226 (5.02) | |
| 0.80-0.90 | 6431 | 0.06 (0.08) | 178 (2.77) | 4359 | 0.08 (0.10) | 207 (4.75) | |
| 0.90-1.00 | 5129 | 0.06 (0.08) | 95 (1.85) | 4416 | 0.07 (0.09) | 154 (3.49) | |
| Overall | 4096952 | 0.006 (0.03) | 5689 (0.14) | 3283438 | 0.011 (0.03) | 6543 (0.20) | |
| 0.00-0.10 | 5439 | 0.22 (0.24) | 1517 (27.89) | 4191 | 0.24 (0.24) | 1265 (30.18) | |
| 0.10-0.20 | 5468 | 0.17 (0.19) | 1070 (19.57) | 4225 | 0.17 (0.20) | 865 (20.47) | |
| 0.20-0.30 | 6801 | 0.13 (0.16) | 886 (13.03) | 4263 | 0.14 (0.16) | 621 (14.57) | |
| 0.30-0.40 | 5435 | 0.11 (0.14) | 543 (9.99) | 4251 | 0.12 (0.14) | 457 (10.75) | |
| 0.40-0.50 | 5444 | 0.09 (0.12) | 361 (6.63) | 4286 | 0.10 (0.12) | 333 (7.77) | |
| 0.40 | 0.50-0.60 | 6782 | 0.08 (0.10) | 293 (4.32) | 4284 | 0.09 (0.11) | 270 (6.30) |
| 0.60-0.70 | 5377 | 0.07 (0.09) | 176 (3.27) | 4103 | 0.09 (0.11) | 232 (5.65) | |
| 0.70-0.80 | 5374 | 0.07 (0.09) | 150 (2.79) | 4260 | 0.08 (0.10) | 205 (4.81) | |
| 0.80-0.90 | 6733 | 0.06 (0.08) | 155 (2.30) | 4285 | 0.07 (0.09) | 136 (3.17) | |
| 0.90-1.00 | 5342 | 0.06 (0.07) | 75 (1.40) | 4199 | 0.06 (0.08) | 104 (2.48) | |
| Overall | 4290985 | 0.006 (0.03) | 5710 (0.13) | 3072154 | 0.010 (0.03) | 5305 (0.17) | |
The results are for chromosome 1 in the recent generations for three heritability levels and POP1 (40 k) and POP2 (800 k) for one replicate. For POP2 (800 k), 40 k markers were randomly sampled to represent the 800 k panel.
Average Linkage Disequilibrium (r) between adjacent SNPs pairs and distribution across different rranges.
| Number of SNP Pairs and (%) | ||||
|---|---|---|---|---|
| LD (r2) range | 0.10 | 0.25 | 0.40 | |
| POP1 (40 K) | 0.00-0.10 | 975 (36.97) | 1040 (38.35) | 1060 (37.80) |
| 0.10-0.20 | 463 (17.56) | 468 (17.26) | 501 (17.87) | |
| 0.20-0.30 | 305 (11.57) | 316 (11.65) | 331 (11.80) | |
| 0.30-0.40 | 232 (8.80) | 232 (8.55) | 248 (8.84) | |
| 0.40-0.50 | 192 (7.28) | 195 (7.19) | 182 (6.49) | |
| 0.50-0.60 | 126 (4.78) | 142 (5.24) | 136 (4.85) | |
| 0.60-0.70 | 110 (4.17) | 98 (3.61) | 107 (3.82) | |
| 0.70-0.80 | 83 (3.15) | 78 (2.88) | 81 (2.89) | |
| 0.80-0.90 | 62 (2.35) | 66 (2.43) | 79 (2.82) | |
| 0.90-1.00 | 89 (3.38) | 77 (2.84) | 79 (2.82) | |
| Average LD (r2) | 0.26 | 0.25 | 0.25 | |
| POP2 (800 K) | 0.00-0.10 | 13980 (29.54) | 14124 (29.71) | 14939 (31.03) |
| 0.10-0.20 | 8034 (16.98) | 7443 (15.66) | 8072 (16.77) | |
| 0.20-0.30 | 5388 (11.38) | 5442 (11.45) | 5571 (11.57) | |
| 0.30-0.40 | 4149 (8.77) | 4240 (8.92) | 4156 (8.63) | |
| 0.40-0.50 | 3250 (6.87) | 3418 (7.19) | 3321 (6.90) | |
| 0.50-0.60 | 2859 (6.04) | 2837 (5.97) | 2665 (5.54) | |
| 0.60-0.70 | 2359 (4.98) | 2479 (5.21) | 2274 (4.72) | |
| 0.70-0.80 | 1928 (4.07) | 2162 (4.55) | 2121 (4.41) | |
| 0.80-0.90 | 1929 (4.08) | 2032 (4.27) | 1924 (4.00) | |
| 0.90-1.00 | 3451 (7.29) | 3362 (7.07) | 3095 (6.43) | |
| Average LD (r2) | 0.33 | 0.33 | 0.32 | |
The results are for chromosome 1 in the recent generations for three heritability levels and POP1 (40 k) and POP2 (800 k) over 10 replicates.
Accuracy of direct genomic estimated breeding value.
| POP1(40 k) | POP2(800 k) | ||||||
|---|---|---|---|---|---|---|---|
| 0.10 | - | - | - | 0.44a,a | - | 0.30b,a | |
| PHE | 0.25 | - | - | - | 0.56a,b | - | 0.41b,b |
| 0.40 | - | - | - | 0.65a,c | - | 0.50b,c | |
| 0.10 | 0.37a,a | 0.45b,a | 0.56c,a | - | 0.43b,a | - | |
| EBV | 0.25 | 0.37a,a | 0.49b,b | 0.60c,b | - | 0.46d,ab | - |
| 0.40 | 0.39a,b | 0.51b,c | 0.64c,c | - | 0.48d,b | - | |
The accuracies of Direct Genomic Estimated Breeding Value (DGEBV) are for animals in the prediction set, considering two levels of marker densities, different numbers of bulls in the training set (TS), three heritability levels, and two alternate response variables used for calculating the marker effects - phenotype (PHE) or Estimated Breeding Value (EBV). The accuracy of EBV used for calculating the marker effects was 0.79, 0.90 or 0.94 for the heritability 0.10, 0.25 or 0.40, respectively, regardless the population and number of bulls in the TS. The average progeny size was 73 for all TS sizes, except for those were the phenotypic record was used to estimate the marker effects. The results are presented as the average over 10 replicates. Different letters indicate significant differences (p < 0.05) by t-test (the first letter indicates differences within rows, while the second letter indicates differences within columns).
Figure 3Accuracy of Direct Genomic Estimated Breeding Value (DGEBV) for 40 k Scenario. DGEBV as a function of number of animals in the training set, considering three levels of heritability and 40 k markers. The results are presented as the average of 10 replicates.