| Literature DB >> 35705918 |
Mario P L Calus1, John M Henshall2, Rachel Hawken2, Jérémie Vandenplas3.
Abstract
BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary between ~ 0.1 and 0.4, and ignoring this variation may affect the genomic estimated breeding values of purebred selection candidates. Our aim was to validate a proposed gold standard to evaluate different approaches for estimating line proportions using simulated data, and to subsequently use this in actual 3-way crossbred broiler data to evaluate several other methods.Entities:
Mesh:
Year: 2022 PMID: 35705918 PMCID: PMC9199202 DOI: 10.1186/s12711-022-00728-4
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 5.100
Different quality measures of the estimated line B proportions derived with ADMIXTURE with various levels of pruning based on linkage disequilibrium (r2) in the simulated dataa
| Measure | Lines | r2 threshold for pruning | |||||
|---|---|---|---|---|---|---|---|
| 0.1 | 0.3 | 0.5 | 0.7 | 0.9 | No. | ||
| Number of SNPs | Close | 305 | 1364 | 2661 | 3781 | 4679 | 5720 |
| Distant | 281 | 1190 | 2425 | 3543 | 4499 | 5753 | |
| Unrelated | 260 | 936 | 1953 | 3016 | 3989 | 5740 | |
| Accuracy | Close | 0.560 | 0.694 | 0.696 | 0.689 | 0.682 | 0.675 |
| Distant | 0.787 | 0.890 | 0.901 | 0.897 | 0.890 | 0.886 | |
| Unrelated | 0.891 | 0.958 | 0.966 | 0.965 | 0.963 | 0.959 | |
| Dispersion bias | Close | 0.404 | 0.533 | 0.536 | 0.531 | 0.524 | 0.517 |
| Distant | 0.653 | 0.808 | 0.824 | 0.820 | 0.809 | 0.801 | |
| Unrelated | 0.810 | 0.916 | 0.930 | 0.930 | 0.925 | 0.922 | |
| Maximum error | Close | 0.404 | 0.330 | 0.332 | 0.334 | 0.341 | 0.348 |
| Distant | 0.285 | 0.192 | 0.181 | 0.181 | 0.186 | 0.190 | |
| Unrelated | 0.181 | 0.110 | 0.104 | 0.103 | 0.109 | 0.112 | |
| RMSE | Close | 0.132 | 0.106 | 0.106 | 0.107 | 0.108 | 0.109 |
| Distant | 0.083 | 0.056 | 0.053 | 0.054 | 0.056 | 0.058 | |
| Unrelated | 0.056 | 0.034 | 0.031 | 0.031 | 0.032 | 0.034 | |
RMSE root mean squared error
aAll 428 A(BC) crossbreds were used
Different quality measures of the estimateda line B proportions when compared against true or BOA estimated values for the simulated datab
| Measure | Reference | Lines | BOA | LR | ADM | REL_GP | REL_GP_noF |
|---|---|---|---|---|---|---|---|
| Accuracy | True | Close | 0.986 | 0.680 | 0.692 | 0.949 | 0.954 |
| Distant | 0.996 | 0.876 | 0.902 | 0.950 | 0.959 | ||
| Unrelated | 0.997 | 0.950 | 0.967 | 0.952 | 0.962 | ||
| BOA | Close | 0.690 | 0.704 | 0.936 | 0.942 | ||
| Distant | 0.878 | 0.903 | 0.948 | 0.956 | |||
| Unrelated | 0.948 | 0.965 | 0.951 | 0.961 | |||
| Dispersion bias | True | Close | 0.958 | 0.513 | 0.530 | 0.928 | 0.923 |
| Distant | 0.964 | 0.776 | 0.820 | 0.966 | 0.958 | ||
| Unrelated | 0.962 | 0.910 | 0.934 | 1.025 | 1.022 | ||
| BOA | Close | 0.536 | 0.555 | 0.943 | 0.938 | ||
| Distant | 0.804 | 0.849 | 0.996 | 0.987 | |||
| Unrelated | 0.942 | 0.966 | 1.061 | 1.058 | |||
| Maximum error | True | Close | 0.076 | 0.321 | 0.317 | 0.109 | 0.105 |
| Distant | 0.046 | 0.172 | 0.155 | 0.106 | 0.098 | ||
| Unrelated | 0.038 | 0.109 | 0.089 | 0.104 | 0.094 | ||
| BOA | Close | 0.312 | 0.308 | 0.124 | 0.120 | ||
| Distant | 0.178 | 0.159 | 0.114 | 0.105 | |||
| Unrelated | 0.114 | 0.089 | 0.109 | 0.099 | |||
| RMSE | True | Close | 0.019 | 0.109 | 0.106 | 0.036 | 0.035 |
| Distant | 0.012 | 0.061 | 0.053 | 0.035 | 0.032 | ||
| Unrelated | 0.010 | 0.037 | 0.030 | 0.035 | 0.032 | ||
| BOA | Close | 0.108 | 0.104 | 0.041 | 0.039 | ||
| Distant | 0.060 | 0.053 | 0.037 | 0.034 | |||
| Unrelated | 0.038 | 0.031 | 0.037 | 0.033 |
RMSE root mean squared error
aLine B proportions are estimated from estimated breed-origin-of-alleles (BOA), using linear regression on mean allele counts within line (LR), ADMIXTURE analysis (ADM) after pruning SNPs based on r2 > 0.5, the genomic relationship with maternal grandsire (REL_GP), or this relationship after adjusting all self-relationships to be 1 (REL_GP_noF)
bResults are based on the average 188 A(BC) crossbreds with both maternal grandparents included with genotypes in the data
Correlation among line B proportions estimated with different methodsa for the simulated data
| Lines | Method | LR | ADM | REL_GP | REL_GP_noF |
|---|---|---|---|---|---|
| Closely-related | BOA | 0.690 | 0.704 | 0.936 | 0.942 |
| LR | 1 | 0.943 | 0.633 | 0.644 | |
| ADM | 0.943 | 1 | 0.647 | 0.659 | |
| REL_GP | 0.633 | 0.647 | 1 | 0.994 | |
| REL_GP_noF | 0.644 | 0.659 | 0.994 | 1 | |
| Distantly-related | BOA | 0.878 | 0.903 | 0.948 | 0.956 |
| LR | 1 | 0.963 | 0.817 | 0.834 | |
| ADM | 0.963 | 1 | 0.844 | 0.859 | |
| REL_GP | 0.817 | 0.844 | 1 | 0.991 | |
| REL_GP_noF | 0.834 | 0.859 | 0.991 | 1 | |
| Unrelated | BOA | 0.948 | 0.965 | 0.951 | 0.961 |
| LR | 1 | 0.975 | 0.890 | 0.905 | |
| ADM | 0.975 | 1 | 0.917 | 0.930 | |
| REL_GP | 0.890 | 0.917 | 1 | 0.989 | |
| REL_GP_noF | 0.905 | 0.930 | 0.989 | 1 |
aLine B proportions are estimated from estimated breed-origin-of-alleles (BOA), using linear regression on mean allele counts within line (LR), ADMIXTURE analysis (ADM), the genomic relationship with maternal grandsire (REL_GP), or this relationship after adjusting all self-relationships to 1 (REL_GP_noF)
Fig. 1Observed line B proportions in the broiler data (histogram), versus the theoretically expected distribution (red line)
Different quality measures of the estimated line B proportions derived with ADMIXTURE with various levels of pruning based on linkage disequilibrium (r2) in the broiler data
| Measure | r2 threshold for pruning | |||||
|---|---|---|---|---|---|---|
| 0.1 | 0.3 | 0.5 | 0.7 | 0.9 | No. | |
| Number of SNPs | 3392 | 15,144 | 29,892 | 41,376 | 48,905 | 55,729 |
| Accuracy | 0.839 | 0.914 | 0.927 | 0.927 | 0.925 | 0.918 |
| Dispersion bias | 0.970 | 1.140 | 1.138 | 1.128 | 1.115 | 1.090 |
| Maximum error | 0.094 | 0.073 | 0.066 | 0.064 | 0.068 | 0.070 |
| RMSE | 0.024 | 0.018 | 0.017 | 0.017 | 0.017 | 0.018 |
RMSE root mean squared error
Different quality measures of the estimateda line B and C proportions for the broiler data
| Measure | LR | ADMIXTURE | REL_GP | REL_GP_noF |
|---|---|---|---|---|
| Accuracy | 0.917 | 0.927 | 0.861 | 0.902 |
| Dispersion bias | 1.091 | 1.138 | 1.146 | 1.232 |
| Max | 0.068 | 0.066 | 0.149 | 0.102 |
| RMSE | 0.018 | 0.017 | 0.023 | 0.020 |
RMSE root mean squared error
aLine proportions are estimated using linear regression on mean allele counts within line (LR), ADMIXTURE analysis after pruning SNPs based on r2 > 0.5, the genomic relationship with maternal grandsire (REL_GP), or this relationship after adjusting all self-relationships to 1 (REL_GP_noF)
Correlation among line B proportions estimated with different methodsa for the broiler data
| Method | LR | ADMIXTURE | REL_GP | REL_GP_noF |
|---|---|---|---|---|
| BOA | 0.917 | 0.927 | 0.861 | 0.902 |
| LR | 1 | 0.975 | 0.843 | 0.890 |
| ADMIXTURE | 0.975 | 1 | 0.864 | 0.911 |
| REL_GP | 0.843 | 0.864 | 1 | 0.970 |
| REL_GP_noF | 0.890 | 0.911 | 0.970 | 1 |
aLine B proportions are estimated from estimated breed-origin-of-alleles (BOA), using linear regression on mean allele counts within line (LR), ADMIXTURE analysis, the genomic relationship with maternal grandsire (REL_GP), or this relationship after adjusting all self-relationships to 1 (REL_GP_noF)