| Literature DB >> 30795734 |
Ling-Yun Chang1,2, Sajjad Toghiani3,4, Samuel E Aggrey5,6, Romdhane Rekaya3,6.
Abstract
BACKGROUND: It becomes clear that the increase in the density of marker panels and even the use of sequence data didn't result in any meaningful increase in the accuracy of genomic selection (GS) using either regression (RM) or variance component (VC) approaches. This is in part due to the limitations of current methods. Association model are well over-parameterized and suffer from severe co-linearity and lack of statistical power. Even when the variant effects are not directly estimated using VC based approaches, the genomic relationships didn't improve after the marker density exceeded a certain threshold. SNP prioritization-based fixation index (FST) scores were used to track the majority of significant QTL and to reduce the dimensionality of the association model.Entities:
Keywords: Genomic selection; High density panel; SNP prioritization
Mesh:
Substances:
Year: 2019 PMID: 30795734 PMCID: PMC6387489 DOI: 10.1186/s12863-019-0720-5
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Fig. 1Effects and distribution of the 200 simulated quantitative trait loci (QTL) along the ten chromosomes (a) and their associated FST scores distribution (b) when the LD between adjacent markers was equal to 0.7
Fig. 2Distribution of the 200 simulated QTL (in blue) and 10 K (a) and 5 K (b) preselected SNPs based on FST scores (in red) across the 10 chromosomes when LD between adjacent markers was equal to 0.7 (* indicates the top 10% QTL)
Functional genomic similarity under different subsets of FST based and randomly selected SNPs for the scenario when LDa between adjacent markers was equal to 0.7. Standard errors of Functional genomic similarity are listed between parentheses
| Genomic similarity | ||
|---|---|---|
| SNPs | FST based | Random |
| 2.5 K | 0.7013 (0.0020) | 0.6695 (0.0003) |
| 5 K | 0.6862 (0.0020) | 0.6687 (0.0003) |
| 10 K | 0.6752 (0.0010) | 0.6682 (0.0002) |
| 20 K | 0.6718 (0.0006) | 0.6678 (0.0001) |
| 40K | 0.6712 (0.0005) | 0.6675 (0.0001) |
| 80 K | 0.6708 (0.0004) | 0.6673 (0.0001) |
| 160 K | 0.6705 (0.0003) | 0.6672 (0.0001) |
| 400 K | 0.6671 (0.0003) | 0.6671 (0.0001) |
aLD linkage disequilibrium
Distribution of off-diagonal elements (OD) of the genomic relationships matrix corresponding to the training and validation individuals under different selection criteria of SNP markers (in %)
| 20 K SNPs | 40 K SNPs | 80 K SNPs | 400 K SNPs | Pedigree | ||||
|---|---|---|---|---|---|---|---|---|
| S1 | R2 | S | R | S | R | - | - | |
| OD < -0.05 | 15.47 | 1.79 | 7.30 | 1.64 | 2.42 | 0.66 | 0.11 | 0 |
| -0.05 < OD < - 0.03 | 11.71 | 8.80 | 11.97 | 8.56 | 9.54 | 6.35 | 3.30 | 0 |
| -0.03 < OD < - 0.01 | 14.96 | 23.79 | 19.60 | 23.93 | 23.16 | 24.72 | 24.43 | 0 |
| -0.01 < OD < 0.01 | 16.19 | 32.57 | 22.98 | 33.1 | 30.78 | 37.96 | 45.91 | 60.09 |
| 0.01 < OD < 0.03 | 14.85 | 22.75 | 19.98 | 22.85 | 22.46 | 23.39 | 22.72 | 32.55 |
| 0.03 < OD < 0.05 | 11.54 | 8.26 | 11.62 | 8.02 | 9.09 | 5.96 | 3.15 | 5.25 |
| OD > 0.05 | 15.28 | 2.04 | 7.25 | 1.90 | 2.56 | 0.95 | 0.39 | 2.11 |
1SNPs selected based on FSTscores; 2SNPs randomly selected
Fig. 3Distribution of off-diagonal elements of the additive relationship matrix using a) all 400 K SNP markers (in blue), b) 20 K SNPs prioritized based on their FST scores (in red), and c) pedigree information (in green)
Variance component estimates (standard deviation) under different subsets of FSTbased and randomly selected SNPs for populations1P1and P2(average over 5 replicates)
| P1(LD =0.3) | P2(LD = 0.7) | |||
|---|---|---|---|---|
| GV2 | RV3 | GV | RV | |
| FST based | ||||
| 2.5 K | 0.126 (0.017) | 0.728 (0.027) | 0.198 (0.029) | 0.736 (0.006) |
| 5 K | 0.149 (0.016) | 0.706 (0.030) | 0.204 (0.005) | 0.711 (0.001) |
| 10 K | 0.175 (0.023) | 0.684 (0.037) | 0.195 (0.009) | 0.697 (0.004) |
| 20 K | 0.203 (0.031) | 0.663 (0.044) | 0.195 (0.007) | 0.686 (0.007) |
| 40 K | 0.226 (0.041) | 0.649 (0.052) | 0.203 (0.009) | 0.677 (0.007) |
| 80 K | 0.247 (0.048) | 0.641 (0.055) | 0.217 (0.008) | 0.671 (0.008) |
| 160 K | 0.264 (0.045) | 0.642 (0.047) | 0.235 (0.008) | 0.670 (0.008) |
| Random | ||||
| 2.5 K | 0.104 (0.013) | 0.834 (0.012) | 0.155 (0.012) | 0.788 (0.006) |
| 5 K | 0.139 (0.016) | 0.796 (0.013) | 0.185 (0.013) | 0.757 (0.005) |
| 10 K | 0.173 (0.019) | 0.762 (0.006) | 0.215 (0.011) | 0.730 (0.012) |
| 20 K | 0.203 (0.023) | 0.733 (0.013) | 0.234 (0.010) | 0.712 (0.008) |
| 40 K | 0.227 (0.026) | 0.710 (0.015) | 0.242 (0.007) | 0.703 (0.005) |
| 80 K | 0.238 (0.027) | 0.770 (0.015) | 0.246 (0.008) | 0.699 (0.007) |
| 160 K | 0.242 (0.027) | 0.696 (0.016) | 0.250 (0.008) | 0.696 (0.006) |
| Full panel | ||||
| 400 K | 0.247 (0.027) | 0.692 (0.016) | 0.251 (0.007) | 0.695 (0.006) |
1P1: 200 QTLs and linkage disequilibrium (LD) between adjacent markers equal to 0.3 and P2: 200 QTLs and LD between adjacent markers equal to 0.7; 2genetic variance, 3residual variance
Accuracy of genomic prediction (standard deviation) under different subsets of FST based and randomly selected SNPs for populationsa P1 and P2 (average over 5 replicates)
| Accuracyb | ||
|---|---|---|
| P1 (LD = 0.3) | P2 (LD = 0.7) | |
| FST based | ||
| 2.5 K | 0.724 (0.021) | 0.805 (0.014) |
| 5 K | 0.736 (0.022) | 0.823 (0.012) |
| 10 K | 0.740 (0.023) | 0.828 (0.013) |
| 20 K | 0.741 (0.027) | 0.824 (0.013) |
| 40 K | 0.735 (0.027) | 0.815 (0.014) |
| 80 K | 0.728 (0.028) | 0.802 (0.012) |
| 160 K | 0.723 (0.031) | 0.784 (0.013) |
| Random | ||
| 2.5 K | 0.600 (0.054) | 0.669 (0.019) |
| 5 K | 0.640 (0.047) | 0.709 (0.015) |
| 10 K | 0.676 (0.036) | 0.736 (0.019) |
| 20 K | 0.695 (0.037) | 0.746 (0.014) |
| 40 K | 0.707 (0.034) | 0.754 (0.010) |
| 80 K | 0.712 (0.033) | 0.757 (0.013) |
| 160 K | 0.715 (0.031) | 0.759 (0.011) |
| Full panel | ||
| 400 K | 0.716 (0.032) | 0.760 (0.011) |
aP1: 200 QTLs and linkage disequilibrium (LD) between adjacent markers equal to 0.3 and P2: 200 QTLs and LD between adjacent markers equal to 0.7;b correlation between true and predicted breeding values
Variance component estimates, accuracy of genomic prediction, and heritability (standard deviation) under different subsets of FST based and randomly selected SNPs for weaning weight of beef cattle
| Accuracya | GVb | RVc | Heritability | |
|---|---|---|---|---|
| FST based | ||||
| 2.5 K | 0.36 (0.02) | 91.39 (6.28) | 321.41 (7.43) | 0.22 (0.01) |
| 5 K | 0.36 (0.02) | 119.32 (8.67) | 299.13 (7.87) | 0.29 (0.02) |
| 20 K | 0.33 (0.03) | 144.94 (15.74) | 286.10 (11.88) | 0.34 (0.03) |
| Random | ||||
| 2.5 K | 0.26 (0.04) | 83.75 (13.99) | 346.11 (12.57) | 0.19 (0.03) |
| 5 K | 0.25 (0.03) | 100.34 (17.60) | 332.13 (15.87) | 0.23 (0.04) |
| 20 K | 0.27 (0.01) | 120.67 (15.30) | 313.64 (11.87) | 0.28 (0.03) |
| Full panel | ||||
| 50 K | 0.27 (0.02) | 128.08 (17.86) | 306.69 (13.33) | 0.29 (0.04) |
acorrelation between adjusted phenotypes and predicted breeding values;b genetic variance,c residual variance