| Literature DB >> 24715901 |
Gota Morota1, Prashanth Boddhireddy2, Natascha Vukasinovic2, Daniel Gianola3, Sue Denise2.
Abstract
Prediction of complex trait phenotypes in the presence of unknown gene action is an ongoing challenge in animals, plants, and humans. Development of flexible predictive models that perform well irrespective of genetic and environmental architectures is desirable. Methods that can address non-additive variation in a non-explicit manner are gaining attention for this purpose and, in particular, semi-parametric kernel-based methods have been applied to diverse datasets, mostly providing encouraging results. On the other hand, the gains obtained from these methods have been smaller when smoothed values such as estimated breeding value (EBV) have been used as response variables. However, less emphasis has been placed on the choice of phenotypes to be used in kernel-based whole-genome prediction. This study aimed to evaluate differences between semi-parametric and parametric approaches using two types of response variables and molecular markers as inputs. Pre-corrected phenotypes (PCP) and EBV obtained for dairy cow health traits were used for this comparison. We observed that non-additive genetic variances were major contributors to total genetic variances in PCP, whereas additivity was the largest contributor to variability of EBV, as expected. Within the kernels evaluated, non-parametric methods yielded slightly better predictive performance across traits relative to their additive counterparts regardless of the type of response variable used. This reinforces the view that non-parametric kernels aiming to capture non-linear relationships between a panel of SNPs and phenotypes are appealing for complex trait prediction. However, like past studies, the gain in predictive correlation was not large for either PCP or EBV. We conclude that capturing non-additive genetic variation, especially epistatic variation, in a cross-validation framework remains a significant challenge even when it is important, as seems to be the case for health traits in dairy cows.Entities:
Keywords: dairy cow; genetic variance; kernel method; non-additive effect; whole-genome prediction
Year: 2014 PMID: 24715901 PMCID: PMC3970026 DOI: 10.3389/fgene.2014.00056
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Correlations among six health traits: ketosis (KET), displaced abomasum (DA), retained placenta (RP), lameness (LAME), metritis (METR), and clinical mastitis (CM). Variable names followed by "_ebv" denote estimated breeding values (EBV).
Estimated ratios of variance components (weights) for ketosis (KET), displaced abomasum (DA), retained placenta (RP), lameness (LAME), metritis (METR), and clinical mastitis (CM) using parametric multiple-kernel learning.
| KET | PCP | 0.09 (0.10) | 0.13 (0.14) | 0.14 | 0.35 (0.24) | 0.24 | 0.36 | 0.40 |
| EBV | 0.25 (0.24) | 0.03 (0.04) | 0.01 | 0.29 (0.28) | 0.84 | 0.12 | 0.04 | |
| DA | PCP | 0.06 (0.08) | 0.09 (0.10) | 0.25 | 0.40 (0.18) | 0.16 | 0.22 | 0.62 |
| EBV | 0.39 (0.30) | 0.04 (0.05) | 0.30 | 0.73 (0.36) | 0.53 | 0.05 | 0.41 | |
| RP | PCP | 0.05 (0.07) | 0.09 (0.11) | 0.35 | 0.50 (0.18) | 0.11 | 0.18 | 0.71 |
| EBV | 0.27 (0.23) | 0.03 (0.03) | 0.07 | 0.37 (0.26) | 0.73 | 0.07 | 0.20 | |
| LAME | PCP | 0.06 (0.07) | 0.07 (0.09) | 0.39 | 0.52 (0.16) | 0.12 | 0.14 | 0.75 |
| EBV | 0.39 (0.30) | 0.03 (0.08) | 0.27 | 0.70 (0.38) | 0.56 | 0.05 | 0.39 | |
| METR | PCP | 0.06 (0.07) | 0.07 (0.08) | 0.21 | 0.33 (0.15) | 0.17 | 0.21 | 0.62 |
| EBV | 0.31 (0.26) | 0.05 (0.07) | 0.42 | 0.78 (0.34) | 0.39 | 0.07 | 0.54 | |
| CM | PCP | 0.06 (0.07) | 0.07 (0.09) | 0.26 | 0.39 (0.16) | 0.15 | 0.18 | 0.66 |
| EBV | 0.36 (0.29) | 0.02 (0.05) | 0.16 | 0.54 (0.34) | 0.66 | 0.04 | 0.29 | |
The epistatic kernel was created from the Hadamard product of additive and dominance kernels. Pre-corrected phenotype (PCP) and estimated breeding value (EBV) were used as phenotypes. V.
Figure 2Scatter plots of relationships among additive (.
Figure 3Correlations between off-diagonal elements of the additive genomic relationship matrix . Genotypes were both randomly sampled from the present study (Level = LD) and via a computer simulation locus by locus (Level = LE) with an average minor allele frequency equal to 0.35. The averages of the r2 linkage disequilibrium (LD) statistic between adjacent markers were 0.18 and 0.008 for the real and simulated datasets, respectively.
Predictive correlation for ketosis (KET), displaced abomasum (DA), retained placenta (RP), lameness (LAME), metritis (METR), and clinical mastitis (CM) using various kernels and the average of five 10-fold cross-validation.
| KET | PCP | 0.16 | 0.18 | 0.16 | 0.18 | |
| EBV | 0.85 | 0.86 | 0.84 | 0.86 | ||
| DA | PCP | 0.07 | 0.07 | 0.07 | ||
| EBV | 0.59 | 0.53 | 0.59 | 0.60 | ||
| RP | PCP | 0.03 | 0.05 | 0.05 | 0.05 | |
| EBV | 0.65 | 0.60 | 0.66 | 0.65 | ||
| LAME | PCP | 0.07 | 0.04 | 0.07 | 0.05 | |
| EBV | 0.64 | 0.58 | 0.65 | 0.64 | ||
| METR | PCP | 0.05 | 0.04 | 0.05 | 0.05 | |
| EBV | 0.48 | 0.43 | 0.50 | 0.49 | ||
| CM | PCP | 0.07 | 0.05 | 0.07 | 0.07 | |
| EBV | 0.72 | 0.68 | 0.73 | 0.73 | ||
Pre-corrected phenotype (PCP) and estimated breeding value (EBV) were target phenotypes. Kernels were: additive genomic relationship kernel (G), Gaussian additive kernel (GK), Gaussian dominance kernel (GK), multiple kernel learning using Gaussian additive, Gaussian dominance, and Gaussian additive by dominance kernels (GK), and fitting three parametric kernels (G, D, and G#D) simultaneously (ALL). The best prediction within trait and type of phenotype is italicized.
Figure 4Posterior density plots of the ratios of variance components for ketosis (KET), displaced abomasum (DA), retained placenta (RP), lameness (LAME), metritis (METR), and clinical mastitis (CM). Estimated breeding value was used as phenotype.