| Literature DB >> 29571285 |
Kathryn E Kemper1,2, Philip J Bowman3,4, Benjamin J Hayes3,5,6, Peter M Visscher7,8, Michael E Goddard9,3.
Abstract
BACKGROUND: Genomic prediction and quantitative trait loci (QTL) mapping typically analyze one trait at a time but this may ignore the possibility that one polymorphism affects multiple traits. The aim of this study was to develop a multivariate Bayesian approach that could be used for simultaneously elucidating genetic architecture, QTL mapping, and genomic prediction. Our approach uses information from multiple traits to divide markers into 'unassociated' (no association with any trait) and 'associated' (associated with one or more traits). The effect of associated markers is estimated independently for each trait to avoid the assumption that QTL effects follow a multi-variate normal distribution.Entities:
Mesh:
Year: 2018 PMID: 29571285 PMCID: PMC5866527 DOI: 10.1186/s12711-018-0377-y
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Number of records in the reference and validation datasets for Holstein, Jersey and Australian Red dairy cattle
| Breed | Traits | Total records | Reference | Validation | |||
|---|---|---|---|---|---|---|---|
| Bulls | Cows | YOB | Bulls | Cows | |||
| Holstein | FY, MY, PY | 11,789 | 3049 | 8478 | 2005 | 262 | – |
| Jersey | FY, MY, PY | 4793 | 770 | 3917 | 2005 | 105 | – |
| Australian Red* | FY, MY, PY | 361 | – | – | – | 114 | 247 |
Year-of-birth (YOB) for the animals included in the validation datasets is also provided
FY = fat yield (kg/lac), MY = milk yield (L/lac); PY = protein yield (kg/lac)
*Australian Red animals were only used for validation and never included in the reference population
Posterior mean number of SNPs allocated to each component of the mixture distribution for the two simulated traits when analyzed with BayesR or BayesMV
| pQTLa | Distributionb | Simulatedc | BayesRd | BayesMV |
|---|---|---|---|---|
| 0.0 | Unassociated | 12,725 | – | 12,643 |
| (trait1)1_(trait2)1 | 0 | 12,396 | 14 | |
| (trait1)2–4_(trait2)1 | 10 | 174 | 25 | |
| (trait1)1_(trait2)2–4 | 10 | 173 | 23 | |
| (trait1)2–4_(trait2)2–4 | 0 | 2 | 41 | |
| 1.0 | Unassociated | 12,735 | – | 12,728 |
| (trait1)1_(trait2)1 | 0 | 12,435 | 0 | |
| (trait1)2–4_(trait2)1 | 0 | 148 | 1 | |
| (trait1)1_(trait2)2–4 | 0 | 159 | 1 | |
| (trait1)2–4_(trait2)2–4 | 10 | 3 | 15 |
aQTL were independent (no pleiotropic QTL, pQTL = 0) or completely pleiotropic (pQTL = 1.0)
bSubscripts indicate distributions 1 to 4, where distributions 1 to 4 explain 0, 0.0001, 0.001 or 0.01 σP2, respectively
cThe number of simulated QTL is also provided
dJoint probabilities are the product of posterior probabilities (p and q)
Fig. 1QTL mapping in simulated data where all QTL are pleiotropic. The mean posterior probabilities (PP) of SNPs having a non-zero effect for any trait for multivariate (BayesMV, top) and univariate (BayesR, middle) methods with the –log10(P value) for the multi-trait single-SNP genome-wide association study (GWAS, bottom) are shown. Results for simulated QTL are highlighted in orange and their position marked with dashed vertical lines
Accuracy of genomic predictions for the analysis of the two simulated traits with BayesR and BayesMV for two pleiotropy scenarios
| Method | Pleiotropya | Trait | Holstein | Jersey | ||
|---|---|---|---|---|---|---|
| Accuracy | SE | Accuracy | SE | |||
| BayesR | pQTL = 0 | 1 | 0.88 | 0.07 | 0.78 | 0.15 |
| 2 | 0.88 | 0.04 | 0.82 | 0.10 | ||
| pQTL = 1.0 | 1 | 0.89 | 0.05 | 0.77 | 0.15 | |
| 2 | 0.89 | 0.04 | 0.80 | 0.14 | ||
| BayesMV | pQTL = 0 | 1 | 0.89 | 0.06 | 0.80 | 0.15 |
| 2 | 0.90 | 0.04 | 0.84 | 0.11 | ||
| pQTL = 1.0 | 1 | 0.97 | 0.02 | 0.96 | 0.04 | |
| 2 | 0.97 | 0.01 | 0.96 | 0.04 | ||
SE standard error (across replicates)
aQTL were independent (no pleiotropy, pQTL = 0) or completely pleiotropic (pQTL = 1.0) and accuracies are for validation within-(Holstein) or across (Jersey) breeds
Eigenvectors applied to each trait to construct linear combinations (LC1, 2 and 3) with zero error co-variance for the milk production traits
| LC1 | LC2 | LC3 | |
|---|---|---|---|
| Fat yield | 0.55 | 0.83 | − 0.01 |
| Milk yield | 0.59 | − 0.39 | − 0.70 |
| Protein yield | 0.59 | − 0.39 | 0.71 |
| Heritability (h2)a | 0.45 | 0.73 | 0.88 |
aEstimated heritability of the linear combinations
Posterior mean number of SNPsa in each distribution for milk production traits from BayesMV or BayesR
| Reference | Distributionb,c | BayesR | BayesMV |
|---|---|---|---|
| Hol_Jer | Unassociated | – | 627,911 |
| LC11_LC21_LC31 | 620,515 | 1 | |
| LC11_LC21_LC32–4 | 3504 | 11 | |
| LC11_LC22–4_LC31 | 2994 | 4 | |
| LC12–4_LC21_LC31 | 4913 | 64 | |
| LC11_LC22–4_LC32–4 | 21 | 47 | |
| LC12–4_LC21_LC32–4 | 29 | 685 | |
| LC12–4_LC22–4_LC31 | 25 | 218 | |
| LC12–4_LC22–4_LC32–4 | 1 | 3062 | |
| Holstein | Unassociated | – | 628,451 |
| LC11_LC21_LC31 | 621,268 | 0 | |
| LC11_LC21_LC32–4 | 2817 | 2 | |
| LC11_LC22–4_LC31 | 3110 | 4 | |
| LC12–4_LC21_LC31 | 4743 | 12 | |
| LC11_LC22–4_LC32–4 | 17 | 50 | |
| LC12–4_LC21_LC32–4 | 22 | 124 | |
| LC12–4_LC22–4_LC31 | 25 | 234 | |
| LC12–4_LC22–4_LC32–4 | 0 | 3124 | |
| Jersey | Unassociated | – | 630,779 |
| LC11_LC21_LC31 | 624,314 | 0 | |
| LC11_LC21_LC32–4 | 1957 | 3 | |
| LC11_LC22–4_LC31 | 1366 | 3 | |
| LC12–4_LC21_LC31 | 4335 | 1 | |
| LC11_LC22–4_LC32–4 | 6 | 101 | |
| LC12–4_LC21_LC32–4 | 14 | 28 | |
| LC12–4_LC22–4_LC31 | 10 | 30 | |
| LC12–4_LC22–4_LC32–4 | 0 | 1057 |
aThe posterior mean number of unassociated SNPs from BayesMV is shown with the joint probability of a non-zero effect on one or more traits. Joint probabilities are the product of posterior probabilities (p and q)
bTraits are three linear combinations (LC1, LC2, LC3) of fat, milk and protein yield
cSubscripts indicate distributions 1 to 4, each explaining 0, 0.0001, 0.001 or 0.01 of the genetic variance
Fig. 2Mean posterior probability and associations with beta-lactoglobulin concentration on bovine chromosome 11. The posterior probability (PP) for a SNP being associated with any trait from BayesMV and BayesR and the single-SNP association analysis results are shown. The PAEP coding region (formally LGB, lactoglobulin beta) is highlighted in grey
Accuracy and bias of genomic predictions for milk productiona traits using different reference populations and different analysis methods and when validated in Holstein, Jersey or Australian Red animals
| Analysis methodb | Reference dataset | Validation dataset | Accuracyc | Bias | ||||
|---|---|---|---|---|---|---|---|---|
| FY | MY | PY | FY | MY | PY | |||
| BayesRd | Holstein | Holstein | 0.63 | 0.62 | 0.58 | 1.22 | 0.89 | 1.02 |
| BayesR_LC | Holstein | Holstein | 0.65 | 0.62 | 0.57 | 1.17 | 0.91 | 0.99 |
| BayesMV | Holstein | Holstein | 0.65 | 0.63 | 0.59 | 1.21 | 0.89 | 1.03 |
| BayesRd | Hol_Jer | Holstein | 0.65 | 0.63 | 0.58 | 1.25 | 0.89 | 0.99 |
| BayesR_LC | Hol_Jer | Holstein | 0.65 | 0.62 | 0.58 | 1.14 | 0.90 | 0.97 |
| BayesMV | Hol_Jer | Holstein | 0.66 | 0.63 | 0.58 | 1.17 | 0.87 | 0.97 |
| BayesRd | Jersey | Jersey | 0.56 | 0.70 | 0.72 | 0.89 | 0.98 | 1.24 |
| BayesR_LC | Jersey | Jersey | 0.57 | 0.70 | 0.72 | 0.70 | 1.05 | 1.17 |
| BayesMV | Jersey | Jersey | 0.55 | 0.70 | 0.71 | 0.81 | 1.00 | 1.11 |
| BayesRd | Hol_Jer | Jersey | 0.56 | 0.69 | 0.71 | 0.93 | 0.95 | 1.18 |
| BayesR_LC | Hol_Jer | Jersey | 0.58 | 0.69 | 0.73 | 0.92 | 1.00 | 1.20 |
| BayesMV | Hol_Jer | Jersey | 0.55 | 0.66 | 0.69 | 0.92 | 0.96 | 1.15 |
| BayesRd | Hol_Jer | Aust Red | 0.26 | 0.22 | 0.10 | 0.89 | 0.56 | 0.38 |
| BayesR_LC | Hol_Jer | Aust Red | 0.28 | 0.20 | 0.12 | 0.87 | 0.53 | 0.41 |
| BayesMV | Hol_Jer | Aust Red | 0.26 | 0.14 | 0.07 | 0.75 | 0.34 | 0.25 |
aMilk production traits were fat yield (FY), milk yield (MY) and protein yield (PY)
bMethods were either BayesR on raw phenotypes (BayesR), linear combinations of traits analyzed with univariate BayesR (BayesR_LC) or the multivariate BayesMV method
cStandard errors are approximately 0.062 for Holstein, 0.098 for Jersey and 0.074 for Australian Red predictions
dUnivariate results from Kemper et al. [1]