| Literature DB >> 26586567 |
Yoshinobu Uemoto1, Shinji Sasaki2, Takatoshi Kojima3, Yoshikazu Sugimoto4, Toshio Watanabe5.
Abstract
BACKGROUND: Genetic variance that is not captured by single nucleotide polymorphisms (SNPs) is due to imperfect linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTLs), and the extent of LD between SNPs and QTLs depends on different minor allele frequencies (MAF) between them. To evaluate the impact of MAF of QTLs on genomic evaluation, we performed a simulation study using real cattle genotype data.Entities:
Mesh:
Year: 2015 PMID: 26586567 PMCID: PMC4653875 DOI: 10.1186/s12863-015-0287-8
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Factors for different scenarios in a simulation study
| Scenario | ||||
|---|---|---|---|---|
| Factor | 1 | 2 | 3 | 4 |
| MAFa | All, High, Low | All, High, Low | All, High, Low | All, High, Low |
| QTL heritability | 0.2, 0.4, 0.8 | 0.4 | 0.4 | 0.4 |
| Number of QTLs | 500 | 50, 100, 300, 500, 1000, 2000 | 500 | 500 |
| Distribution of QTL effectb | EquV | Gamma, EquV | EquV | EquV |
| SNP densityc | 50 K | 50 K | 7 K, 50 K, 7K_to_HD, 50 K_to_HD, HD | 50 K |
| Prediction modeld | Model (1) with GY | Model (1) with GY | Model (1) with GV, GY, and GS, Model (2) | Model (1) with GY |
| Size of reference set | 1231 | 1231 | 1231 | 200, 400, 800, 1200 |
| Size of test set | 137 | 137 | 137 | 1168, 968, 568, 168 |
aMAF, Minor allele frequency; All, 0.01 ≤ MAF ≤ 0.5; High, 0.05 < MAF ≤ 0.5; Low, 0.01 ≤ MAF ≤ 0.05
bGamma, Gamma distribution model; EquV, Equal variance model
c7K, 50 K and HD, Illumina infinium BovineLDv1.1, BovineSNP50v2, and BovineHD BeadChips, respectively; 7 K_to_HD and 50 K_to_HD, Imputations were performed from 7 K and 50 K to HD, respectively
dGV, VanRaden's G matrix; GY, Yang's G matrix; GS, Speed's G matrix
Fig. 1Distribution of minor allele frequencies for SNPs under different SNP densities. The x-axis indicates the MAF of SNPs, and the y-axis represents the proportion of SNPs in each MAF category. 7 K, 50 K, and HD are SNP markers on Illumina infinium BovineLDv1.1, BovineSNP50v2, and BovineHD BeadChips, respectively
Fig. 2Proportion of linkage disequilibrium value (r2) between QTLs and adjacent SNPs. The plot on the right upper corner is the zoomed area of the bigger plot. The x-axis indicates the r2 value between QTLs and SNPs, and the y-axis represents the proportion of QTLs in each minor allele frequency (MAF) category (All, Low, and High). The r2 values between QTLs and both adjacent SNPs were calculated, and then the maximum value of r2 between two QTL-SNP intervals was chosen to plot in each QTL. The parameters used were the same as those under scenario 1
Fig. 3Results obtained from scenario 1. Estimated QTL heritability and correlation between true breeding and genomic estimated breeding values are calculated. The x-axis indicates the true QTL heritability, and the y-axis represents mean values of 300 replicates for the estimated QTL heritability (a) and the correlation between true breeding value (TBV) and genomic estimated breeding value (GEBV) (b). The results of varying minor allele frequency (MAF) categories (All, Low, and High) and QTL heritabilities (0.20, 0.40, and 0.80) are shown. The whiskers represent the standard deviation of 300 replicates
Fig. 4Results obtained from scenario 2. Estimated QTL heritability and correlation between true breeding and genomic estimated breeding values are calculated. The x-axis indicates the number of QTLs, and the y-axis represents mean values of 300 replicates for the estimated QTL heritability (a) and the correlation between true breeding value (TBV) and genomic estimated breeding value (GEBV) (b). The results of varying minor allele frequency (MAF) categories (All, Low, and High), number of QTLs (50, 100, 300, 500, 1000, and 2000), and distribution of QTL allele substitution effect (Gamma, gamma distribution model; EquV, equal variance model) are shown
Heritability estimation in scenario 3
| All MAFa | High MAFa | Low MAFa | |||||
|---|---|---|---|---|---|---|---|
| SNPb | Prediction modelc | Mean | SD | Mean | SD | Mean | SD |
| 7 K | Model (1) with GV | 0.28 | 0.05 | 0.32 | 0.05 | 0.20 | 0.06 |
| Model (1) with GY | 0.30 | 0.05 | 0.33 | 0.05 | 0.23 | 0.06 | |
| Model (1) with GS | 0.30 | 0.05 | 0.33 | 0.05 | 0.24 | 0.06 | |
| Model (2) | 0.30 | 0.05 | 0.33 | 0.05 | 0.23 | 0.06 | |
| 50 K | Model (1) with GV | 0.33 | 0.06 | 0.38 | 0.06 | 0.24 | 0.06 |
| Model (1) with GY | 0.36 | 0.06 | 0.39 | 0.06 | 0.30 | 0.06 | |
| Model (1) with GS | 0.38 | 0.06 | 0.40 | 0.06 | 0.34 | 0.07 | |
| Model (2) | 0.37 | 0.06 | 0.39 | 0.06 | 0.34 | 0.06 | |
| 7K_to_HD | Model (1) with GV | 0.34 | 0.06 | 0.39 | 0.06 | 0.24 | 0.06 |
| Model (1) with GY | 0.37 | 0.06 | 0.40 | 0.06 | 0.30 | 0.06 | |
| Model (1) with GS | 0.41 | 0.07 | 0.41 | 0.07 | 0.39 | 0.07 | |
| Model (2) | 0.39 | 0.06 | 0.40 | 0.06 | 0.38 | 0.06 | |
| 50K_to_HD | Model (1) with GV | 0.34 | 0.06 | 0.39 | 0.06 | 0.25 | 0.06 |
| Model (1) with GY | 0.37 | 0.06 | 0.41 | 0.06 | 0.30 | 0.07 | |
| Model (1) with GS | 0.41 | 0.07 | 0.42 | 0.07 | 0.40 | 0.07 | |
| Model (2) | 0.40 | 0.06 | 0.40 | 0.06 | 0.40 | 0.06 | |
| HD | Model (1) with GV | 0.35 | 0.06 | 0.39 | 0.06 | 0.25 | 0.06 |
| Model (1) with GY | 0.38 | 0.06 | 0.41 | 0.06 | 0.31 | 0.07 | |
| Model (1) with GS | 0.42 | 0.07 | 0.41 | 0.07 | 0.40 | 0.07 | |
| Model (2) | 0.40 | 0.06 | 0.40 | 0.06 | 0.41 | 0.06 | |
aMAF, Minor allele frequency; All MAF, 0.01 ≤ MAF ≤ 0.5; High MAF, 0.05 < MAF ≤ 0.5; Low MAF, 0.01 ≤ MAF ≤ 0.05
b7K, 50 K and HD, Illumina infinium BovineLDv1.1, BovineSNP50v2, and BovineHD BeadChips, respectively; 7 K_to_HD and 50 K_to_HD, Imputations were performed from 7 K and 50 K to HD, respectively
cGV, VanRaden's genome relationship matrix (GRM); GY, Yang's GRM; GS, Speed's GRM
Model fitness measured by Akaike information criterion (AIC) in scenario 3
| All MAFa | High MAFa | Low MAFa | |||||
|---|---|---|---|---|---|---|---|
| SNPb | Prediction modelc | Mean | SD | Mean | SD | Mean | SD |
| 7 K | Model (1) with GV | 6164 | 63 | 6145 | 66 | 6191 | 61 |
| Model (1) with GY | 6162 | 63 | 6145 | 66 | 6188 | 61 | |
| Model (1) with GS | 6162 | 63 | 6146 | 66 | 6187 | 61 | |
| Model (2) | 6163 | 63 | 6147 | 66 | 6186 | 61 | |
| 50 K | Model (1) with GV | 6159 | 63 | 6139 | 65 | 6188 | 62 |
| Model (1) with GY | 6155 | 63 | 6139 | 65 | 6181 | 62 | |
| Model (1) with GS | 6155 | 63 | 6142 | 65 | 6175 | 62 | |
| Model (2) | 6155 | 63 | 6140 | 65 | 6163 | 62 | |
| 7K_to_HD | Model (1) with GV | 6158 | 63 | 6138 | 65 | 6189 | 62 |
| Model (1) with GY | 6155 | 63 | 6138 | 65 | 6182 | 62 | |
| Model (1) with GS | 6156 | 63 | 6147 | 65 | 6171 | 62 | |
| Model (2) | 6154 | 63 | 6139 | 65 | 6155 | 62 | |
| 50K_to_HD | Model (1) with GV | 6157 | 63 | 6137 | 65 | 6188 | 62 |
| Model (1) with GY | 6154 | 63 | 6137 | 65 | 6181 | 62 | |
| Model (1) with GS | 6155 | 63 | 6146 | 65 | 6169 | 62 | |
| Model (2) | 6153 | 63 | 6138 | 65 | 6152 | 62 | |
| HD | Model (1) with GV | 6157 | 63 | 6136 | 65 | 6188 | 62 |
| Model (1) with GY | 6154 | 63 | 6137 | 65 | 6180 | 62 | |
| Model (1) with GS | 6155 | 63 | 6147 | 65 | 6168 | 62 | |
| Model (2) | 6152 | 63 | 6138 | 65 | 6150 | 62 | |
aMAF, Minor allele frequency; All MAF, 0.01 ≤ MAF ≤ 0.5; High MAF, 0.05 < MAF ≤ 0.5; Low MAF, 0.01 ≤ MAF ≤ 0.05
b7K, 50 K and HD, Illumina infinium BovineLDv1.1, BovineSNP50v2, and BovineHD BeadChips, respectively; 7 K_to_HD and 50 K_to_HD, Imputations were performed from 7 K and 50 K to HD, respectively
cGV, VanRaden's genome relationship matrix (GRM); GY, Yang's GRM; GS, Speed's GRM
Correlation between true breeding value and genomic breeding value in scenario 3
| All MAFa | High MAFa | Low MAFa | |||||
|---|---|---|---|---|---|---|---|
| SNPb | Prediction modelc | Mean | SD | Mean | SD | Mean | SD |
| 7 K | Model (1) with GV | 0.41 | 0.08 | 0.48 | 0.08 | 0.30 | 0.09 |
| Model (1) with GY | 0.42 | 0.08 | 0.48 | 0.08 | 0.32 | 0.09 | |
| Model (1) with GS | 0.42 | 0.08 | 0.48 | 0.08 | 0.33 | 0.09 | |
| Model (2) | 0.42 | 0.08 | 0.48 | 0.08 | 0.33 | 0.09 | |
| 50 K | Model (1) with GV | 0.43 | 0.08 | 0.50 | 0.08 | 0.32 | 0.09 |
| Model (1) with GY | 0.44 | 0.08 | 0.50 | 0.08 | 0.35 | 0.09 | |
| Model (1) with GS | 0.44 | 0.08 | 0.49 | 0.08 | 0.37 | 0.09 | |
| Model (2) | 0.44 | 0.08 | 0.50 | 0.08 | 0.41 | 0.09 | |
| 7K_to_HD | Model (1) with GV | 0.44 | 0.08 | 0.50 | 0.08 | 0.32 | 0.09 |
| Model (1) with GY | 0.45 | 0.08 | 0.50 | 0.08 | 0.35 | 0.09 | |
| Model (1) with GS | 0.44 | 0.08 | 0.48 | 0.08 | 0.38 | 0.08 | |
| Model (2) | 0.45 | 0.08 | 0.50 | 0.08 | 0.44 | 0.08 | |
| 50K_to_HD | Model (1) with GV | 0.44 | 0.08 | 0.51 | 0.08 | 0.32 | 0.09 |
| Model (1) with GY | 0.45 | 0.08 | 0.51 | 0.08 | 0.36 | 0.09 | |
| Model (1) with GS | 0.44 | 0.08 | 0.48 | 0.08 | 0.39 | 0.08 | |
| Model (2) | 0.46 | 0.08 | 0.51 | 0.08 | 0.46 | 0.08 | |
| HD | Model (1) with GV | 0.44 | 0.08 | 0.51 | 0.08 | 0.32 | 0.09 |
| Model (1) with GY | 0.45 | 0.08 | 0.51 | 0.08 | 0.36 | 0.08 | |
| Model (1) with GS | 0.44 | 0.08 | 0.48 | 0.08 | 0.39 | 0.08 | |
| Model (2) | 0.46 | 0.08 | 0.51 | 0.08 | 0.47 | 0.08 | |
aMAF, Minor allele frequency; All MAF, 0.01 ≤ MAF ≤ 0.5; High MAF, 0.05 < MAF ≤ 0.5; Low MAF, 0.01 ≤ MAF ≤ 0.05
b7K, 50 K and HD, Illumina infinium BovineLDv1.1, BovineSNP50v2, and BovineHD BeadChips, respectively; 7 K_to_HD and 50 K_to_HD, Imputations were performed from 7 K and 50 K to HD, respectively
cGV, VanRaden's genome relationship matrix (GRM); GY, Yang's GRM; GS, Speed's GRM
Fig. 5Results obtained from scenario 3. Estimated QTL heritability and correlation between true breeding and genomic estimated breeding values are calculated. The x-axis indicates the size of the reference set, and the y-axis represents mean values of 300 replicates for the estimated QTL heritability (a) and the correlation between true breeding value (TBV) and genomic estimated breeding value (GEBV) (b). The results of varying minor allele frequency (MAF) categories (All, Low, and High) and size of the reference set (200, 400, 800, and 1200) are shown. The whiskers represent the standard deviation of 300 replicates