| Literature DB >> 20302681 |
Albart Coster1, John W M Bastiaansen, Mario P L Calus, Johan A M van Arendonk, Henk Bovenhuis.
Abstract
The objective of this simulation study was to compare the effect of the number of QTL and distribution of QTL variance on the accuracy of breeding values estimated with genomewide markers (MEBV). Three distinct methods were used to calculate MEBV: a Bayesian Method (BM), Least Angle Regression (LARS) and Partial Least Square Regression (PLSR). The accuracy of MEBV calculated with BM and LARS decreased when the number of simulated QTL increased. The accuracy decreased more when QTL had different variance values than when all QTL had an equal variance. The accuracy of MEBV calculated with PLSR was affected neither by the number of QTL nor by the distribution of QTL variance. Additional simulations and analyses showed that these conclusions were not affected by the number of individuals in the training population, by the number of markers and by the heritability of the trait. Results of this study show that the effect of the number of QTL and distribution of QTL variance on the accuracy of MEBV depends on the method that is used to calculate MEBV.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20302681 PMCID: PMC2851578 DOI: 10.1186/1297-9686-42-9
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Scenarios with different number of QTL and distribution of QTL variance.
| Scenario | Number of QTL | Distribution of QTL variance |
|---|---|---|
| 1 | low | unequal |
| 2 | intermediate | unequal |
| 3 | high | unequal |
| 4 | low | equal |
| 5 | intermediate | equal |
| 6 | high | equal |
Scenarios were numbered from 1 to 6, according to the number of QTL contributing 90% of the genetic variance.
Average (standard error) of number of polymorphic markers (nSNP), LD between adjacent markers (r2), number of QTL (nQTL), and average coefficient of determination of QTL (R2).
| Situation | nSNP | nQTL | ||
|---|---|---|---|---|
| low nQTL | 1431 (5.3) | 0.048 (< 0.001) | 35 (0.2) | 0.806 (0.003) |
| low nQTL MAF > 0.10 | 374 (2.1) | 0.145 (0.002) | 35 (0.2) | 0.715 (0.004) |
| int. nQTL | 1431 (5.3) | 0.048 (< 0.001) | 172 (1.0) | 0.811 (0.002) |
| int. nQTL MAF > 0.10 | 374 (2.1) | 0.145 (0.002) | 172 (1.0) | 0.717 (0.002) |
| high nQTL | 1431 (5.3) | 0.048 (< 0.001) | 343 (2.0) | 0.811 (0.001) |
| high nQTL MAF > 0.10 | 374 (2.1) | 0.145 (0.002) | 343 (2.0) | 0.717 (0.001) |
The simulated number of QTL was low, intermediate (int.) or high and markers with a MAF lower than 0.10 were either or not included in the marker data. The table summarizes 60 replicated simulations.
Average (standard error) accuracy of MEBV for individuals in the evaluation population.
| Method | unequal QTL variance | equal QTL variance | ||||
|---|---|---|---|---|---|---|
| low nQTL | int. nQTL | high nQTL | low nQTL | int. nQTL | high nQTL | |
| sc. 1 | sc. 2 | sc. 3 | sc. 4 | sc. 5 | sc. 6 | |
| BM | 0.77 (0.009) | 0.67 (0.010) | 0.60 (0.012) | 0.71 (0.004) | 0.67 (0.005) | 0.67 (0.006) |
| LARS | 0.75 (0.009) | 0.67 (0.005) | 0.65 (0.004) | 0.65 (0.005) | 0.63 (0.006) | 0.63 (0.006) |
| PLSR | 0.66 (0.009) | 0.66 (0.007) | 0.67 (0.007) | 0.68 (0.006) | 0.67 (0.006) | 0.66 (0.007) |
The MEBV were calculated with methods BM, LARS and PLSR. Simulated number of QTL was low (low nQTL), intermediate (int. nQTL) or high (high nQTL). The simulated variance of every tenth QTL was 81 times larger than variance of the remaining QTL (unequal QTL variance) or equal for all QTL (equal QTL variance). The averages and standard deviations were calculated using 60 replicated simulations.
Figure 1Plot of the accuracies of MEBV calculated with BM, LARS and PLSR as affected by the simulated number of QTL. The plots display the accuracies of 60 replicated simulations for number of QTL around 35, 172 and 343 plus the accuracies of 10 replicated simulation with number of QTL around 227 and 285 in the scenarios of unequal QTL variance. The variance of every tenth QTL was 81 times larger than variance of remaining QTL (unequal QTL variance) or equal for all QTL (equal QTL variance). The line is a LOESS smoother through accuracies on.
Average (standard error) change of accuracy of MEBV for individuals in the evaluation population as affected by alternative simulation situations.
| Method | unequal QTL variance | equal QTL variance | ||||
|---|---|---|---|---|---|---|
| low QTL | int. QTL | high QTL | low QTL | int. QTL | high QTL | |
| sc. 1 | sc. 2 | sc. 3 | sc. 4 | sc. 5 | sc. 6 | |
| BM | -0.14 (< 0.01) | -0.16 (0.01) | -0.08 (0.01) | -0.12 (< 0.01) | -0.16 (< 0.01) | -0.18 (< 0.01) |
| LARS | -0.14 (< 0.01) | -0.16 (< 0.01) | -0.15 (< 0.01) | -0.15 (< 0.01) | -0.14 (< 0.01) | -0.14 (< 0.01) |
| PLSR | -0.10 (< 0.01) | -0.11 (< 0.01) | -0.11 (< 0.01) | -0.11 (< 0.01) | -0.12 (< 0.01) | -0.11 (< 0.01) |
| BM | 0.05 (< 0.01) | 0.11 (0.01) | 0.16 (0.01) | 0.06 (< 0.01) | 0.08 (< 0.01) | 0.07 (< 0.01) |
| LARS | 0.04 (< 0.01) | 0.07 (< 0.01) | 0.06 (< 0.01) | 0.07 (< 0.01) | 0.07 (< 0.01) | 0.07 (< 0.01) |
| PLSR | 0.07 (< 0.01) | 0.06 (< 0.01) | 0.06 (< 0.01) | 0.06 (< 0.01) | 0.06 (< 0.01) | 0.07 (< 0.01) |
| BM | -0.03 (< 0.01) | -0.01 (< 0.01) | 0.02 (0.01) | -0.03 (< 0.01) | -0.03 (< 0.01) | -0.04 (< 0.01) |
| LARS | -0.02 (< 0.01) | 0.00 (< 0.01) | -0.01 (< 0.01) | 0.00 (< 0.01) | 0.00 (< 0.01) | 0.00 (< 0.01) |
| PLSR | -0.02 (< 0.01) | -0.01 (< 0.01) | -0.01 (< 0.01) | -0.03 (< 0.01) | -0.02 (< 0.01) | -0.01 (< 0.01) |
Simulated heritability was reduced from 0.5 to 0.25 (h2 = 0.25); the size of the training population was increased from 500 to 1,000 individuals (nTR = 1,000); only markers with a MAF above 0.10 were used to fit the models (MAF > 0.1). The simulated number of QTL was low (low nQTL), intermediate (int. nQTL) or high (high nQTL). The simulated variance of every tenth QTL was 81 times larger than variance of remaining QTL (unequal QTL variance) or equal for all QTL (equal QTL variance). Methods BM, LARS and PLSR were used to calculate the MEBV. The averages and standard deviations were calculated using 60 replicated simulations.
Average (standard error) of Mean Square Error of Prediction (MSEP) of MEBV for individuals in the evaluation population.
| Method | unequal QTL variance | equal QTL variance | ||||
|---|---|---|---|---|---|---|
| low nQTL | int. nQTL | high nQTL | low nQTL | int. nQTL | high nQTL | |
| sc. 1 | sc. 2 | sc. 3 | sc. 4 | sc. 5 | sc. 6 | |
| BM | 659 (26) | 4049 (108) | 10463 (343) | 79 (2) | 416 (6) | 850 (12) |
| LARS | 707 (24) | 4019 (71) | 8230 (124) | 91 (2) | 465 (6) | 927 (12) |
| PLSR | 993 (24) | 4242 (73) | 8405 (123) | 93 (2) | 458 (6) | 922 (14) |
Methods BM, LARS and PLSR were used to calculate the MEBV. The simulated number of QTL was low (low nQTL), intermediate (int. nQTL) or high (high nQTL). The simulated variance of every tenth QTL was 81 times larger than variance of remaining QTL (unequal QTL variance) or equal for all QTL (equal QTL variance). The averages and standard deviations were calculated using 60 replicated simulations.
Average (standard error) of the simulated additive genetic variance () in the evaluation population, and variance of MEBV calculated for individuals in the evaluation population.
| Method | unequal QTL variance | equal QTL variance | ||||
|---|---|---|---|---|---|---|
| low QTL | int. QTL | high QTL | low QTL | int. QTL | high QTL | |
| sc. 1 | sc. 2 | sc. 3 | sc. 4 | sc. 5 | sc. 6 | |
| 1623 (23) | 7210 (88) | 14193 (156) | 158 (2) | 767 (8) | 1538 (18) | |
| BM | 890 (38) | 2537 (168) | 2032 (283) | 81 (3) | 327 (13) | 575 (24) |
| LARS | 914 (31) | 3937 (164) | 7017 (293) | 75 (4) | 344 (15) | 715 (29) |
| PLSR | 1249 (49) | 5263 (198) | 10747 (393) | 129 (5) | 618 (21) | 1150 (46) |
The methods BM, LARS and PLSR were used to calculate the MEBV. The simulated number of QTL was low (low nQTL), intermediate (int. nQTL) or high (high nQTL). The simulated variance of every tenth QTL was 81 times larger than variance of remaining QTL (unequal QTL variance) or equal for all QTL (equal QTL variance). The averages and standard deviations were calculated using 60 replicated simulations.
Average (standard error) computation time required for fitting the MEBV models to the training population and calculating MEBV for the evaluation population, measured in seconds.
| Method | Normal | h | nTr = 1,000 | MAF > 0.10 |
|---|---|---|---|---|
| BM | 423.25 (3.73) | 429.57 (3.88) | 820.75 (9.05) | 109.49 (1.90) |
| LARS | 211.75 (3.28) | 210.92 (2.62) | 1058.38 (9.34) | 57.37 (1.80) |
| PLSR | 4.05 (0.10) | 4.10 (0.18) | 6.47 (0.15) | 0.81 (0.02) |
Situation normal: heritability equal to 0.5, size of the training population equal to 500 individuals, and all markers included in the data. Situation h2 = 0.25: heritability was decreased from 0.50 to 0.25. Situation nTr = 1,000: size of training population was increased from 500 to 1,000 individuals. Situation MAF > 0.10: markers with a MAF below 0.10 were excluded from the data. The table summarizes ten simulations for the scenario of intermediate number of QTL and equal QTL variance.
Average (standard error) accuracy of MEBV for individuals in the evaluation population.
| Method | unequal QTL variance | equal QTL variance | ||||
|---|---|---|---|---|---|---|
| low nQTL | int. nQTL | high nQTL | low nQTL | int. nQTL | high nQTL | |
| Standard | 0.80 (0.007) | 0.67 (0.006) | 0.57 (0.007) | 0.69 (0.005) | 0.62 (0.006) | 0.57 (0.006) |
| h2 = 0.25 | 0.68 (0.011) | 0.52 (0.006) | 0.56 (0.008) | 0.57 (0.004) | 0.51 (0.005) | 0.53 (0.006) |
| MAF>0.10 | 0.77 (0.008) | 0.69 (0.006) | 0.64 (0.007) | 0.67 (0.006) | 0.66 (0.005) | 0.61 (0.004) |
The simulated number of QTL was low (low nQTL), intermediate (int. nQTL) or high (high nQTL). The simulated variance of every tenth QTL was 81 times larger than variance of remaining QTL (unequal QTL variance) or equal for all QTL (equal QTL variance). The rows of the table correspond to the standard situation (h2 = 0.5, size of training population = 500 individuals, all markers included), the situation with h2 = 0.25, and the situation where markers with MAF < 0.10 were excluded from the data. Method BM was used to calculate the. The prior number of QTL was 35 QTL in the scenarios with a low number of QTL, 172 QTL in the scenarios with an intermediate number of QTL, and 343 QTL in the scenarios with a high number of QTL. The prior for QTL variance was the ratio of the total genetic variance (Table 2) and the number of QTL. The averages and standard deviations were calculated using 60 replicated simulations