| Literature DB >> 30458700 |
Qianqian Zhang1,2,3, Goutam Sahana4, Guosheng Su4, Bernt Guldbrandtsen4, Mogens Sandø Lund4, Mario P L Calus5.
Abstract
BACKGROUND: Availability of whole-genome sequence data for a large number of cattle and efficient imputation methodologies open a new opportunity to include rare and low-frequency variants (RLFV) in genomic prediction in dairy cattle. The objective of this study was to examine the impact of including RLFV that are within genes and selected from whole-genome sequence variants, on the reliability of genomic prediction for fertility, health and longevity in dairy cattle.Entities:
Mesh:
Year: 2018 PMID: 30458700 PMCID: PMC6247626 DOI: 10.1186/s12711-018-0432-8
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Reliability of genomic prediction using different marker sets
| Marker sets | Fertility | Health | Longevity |
|---|---|---|---|
| 50 k SNP array | 39.2 | 31.9 | 28.5 |
| 50 k + All genic RLFV | 40.3 | 32.6 | 27.7 |
| 50 k + RLFV in genes with significant association | 39.9 | 31.8 | 27.8 |
| 50 k + RLFV with medium-to-high impact annotations | 39.7 | 32.6 | 27.1 |
| 50 k + RLFV with high impact annotations | 38.4 | 31.5 | 26.3 |
Reliabilities are presented on a scale from 0 to 100%. Accuracies, i.e. the square roots of the reliabilities, ranged from 0.513 to 0.635 on a scale from 0 to 1, while the corresponding standard errors ranged from 0.011 to 0.014
Estimates of additive genetic variance from different models using various marker sets for fertility, health and longevity
| Marker sets/index traits | Average number of variants | Fertility | Health | Longevity | |||
|---|---|---|---|---|---|---|---|
| Variance components | 50 k | RLFV | 50 k | RLFV | 50 k | RLFV | |
| 50 k | 54,323 | 144.5 | – | 146.6 | – | 142.2 | – |
| 50 k + all genic RLFV | 1,650,799 | 113.2 | 27.0 | 119.2 | 25.1 | 95.2 | 42.9 |
| 50 k + RLFV in genes with significant association | 191,414 | 139.2 | 11.0 | 142.9 | 6.0 | 115.9 | 26.5 |
| 50 k + RLFV with medium-to-high impact annotations | 577,732 | 117.7 | 22.2 | 120.3 | 23.5 | 97.7 | 39.8 |
| 50 k + RLFV with high impact annotations | 81,717 | 131.6 | 17.0 | 123.2 | 21.4 | 105.8 | 31.9 |
Bias of the GEBV measured by regression slope in different methods of selection of rare and low-frequency variants (RLFV)
| Methods of selection of RLFV | Fertility | Health | Longevity |
|---|---|---|---|
| 50 k SNP array | 0.993 | 0.902 | 0.851 |
| 50 k + all genic RLFV | 1.046 | 0.950 | 0.939 |
| 50 k + RLFV in genes with significant association | 1.012 | 0.913 | 0.925 |
| 50 k + RLFV with medium-to-high impact annotations | 1.040 | 0.956 | 0.931 |
| 50 k + RLFV with high impact annotations | 1.038 | 0.935 | 0.902 |
Characteristics for one random replicate of each simulation scenario
| Characteristics | SQTN | MQTN | LQTN |
|---|---|---|---|
| Number of genes | 7–10 per chr | 1 per chr | 9 |
| Number of genic RLFV simulated as QTN | 21,468 | 1348 | 235 |
| Number of RLFV from 10 random selected genes from each chromosome | 29,166 | 27,290 | 28,308 |
| Number of RLFV from mapped genes | 78,080 | 81,010 | 80,999 |
Chr = chromosome
RLFV refer to rare and low-frequency variants and QTN refer to quantitative trait nucleotides. SQTN corresponds to the scenario with RLFV in seven to ten genes per chromosome simulated as causal variants; MQTN corresponds to the scenario with RLFV in one gene per chromosome simulated as causal variants; LQTN corresponds to the scenario with RLFV in nine randomly selected genes across the whole genome simulated as causal variants. The simulated total variances for the QTN in SQTN, MQTN and LQTN were 10% of the estimate of variance explained by 50 k markers for fertility index
Reliabilities of genomic prediction for one random replicate in each simulation scenario and different strategies for selection of rare and low-frequency variants (RLFV)
| Scenarios | SQTN | MQTN | LQTN |
|---|---|---|---|
| 50 k | 35.7 | 43.0 | 38.0 |
| 50 k + all simulated QTN | 42.5 | 46.4 | 43.6 |
| 50 k + simulated QTN and RLFV from 10 random selected genes from each chromosome | 41.3 | 44.1 | 37.9 |
| 50 k + RLFV in mapped genes | 39.7 | 44.3 | 40.1 |
SQTN corresponds to the scenario with RLFV in seven to ten genes per chromosome simulated as causal variants; MQTN corresponds to the scenario with RLFV in one gene per chromosome simulated as causal variants; LQTN corresponds to the scenario with RLFV in nine randomly selected genes across the whole genome simulated as causal variants. Reliabilities are presented on a scale from 0 to 100%. Accuracies, i.e. the square roots of the reliabilities, ranged from 0.597 to 0.681 on a scale from 0 to 1, while the corresponding standard errors ranged from 0.010 to 0.012
Bias of the GEBV measured by regression slope for one random replicate in each simulation scenario and different strategies for selection of rare and low-frequency variants (RLFV)
| Scenarios | SQTN | MQTN | LQTN |
|---|---|---|---|
| 50 k | 0.929 | 0.986 | 0.927 |
| 50 k + all simulated QTN | 1.009 | 1.010 | 0.963 |
| 50 k + simulated QTN and RLFV from 10 random selected genes from each chromosome | 0.995 | 1.016 | 0.938 |
| 50 k + RLFV in mapped genes | 0.986 | 1.013 | 0.967 |
SQTN corresponds to the scenario with RLFV in seven to ten genes per chromosome simulated as causal variants; MQTN corresponds to the scenario with RLFV in one gene per chromosome simulated as causal variants; LQTN corresponds to the scenario with RLFV in nine randomly selected genes across the whole genome simulated as causal variants