| Literature DB >> 31263059 |
Janeo Eustáquio de Almeida Filho1, João Filipi Rodrigues Guimarães2, Fabyano Fonsceca E Silva3, Marcos Deon Vilela de Resende4, Patricio Muñoz5, Matias Kirst6, Marcio Fernando Ribeiro de Resende Júnior7.
Abstract
The genetic merit of individuals can be estimated using models with dense markers and pedigree information. Early genomic models accounted only for additive effects. However, the prediction of non-additive effects is important for different forest breeding systems where the whole genotypic value can be captured through clonal propagation. In this study, we evaluated the integration of marker data with pedigree information, in models that included or ignored non-additive effects. We tested the models Reproducing Kernel Hilbert Spaces (RKHS) and BayesA, with additive and additive-dominance frameworks. Model performance was assessed for the traits tree height, diameter at breast height and rust resistance, measured in 923 pine individuals from a structured population of 71 full-sib families. We have also simulated a population with similar genetic properties and evaluated the performance of models for six simulated traits with distinct genetic architectures. Different cross validation strategies were evaluated, and highest accuracies were achieved using within family cross validation. The inclusion of pedigree information in genomic prediction models did not yield higher accuracies. The different RKHS models resulted in similar predictions accuracies, and RKHS and BayesA generated substantially better predictions than pedigree-only models. The additive-BayesA resulted in higher accuracies than RKHS for rust incidence and in simulated additive-oligogenic traits. For DBH, HT and additive-dominance polygenic traits, the RKHS- based models showed slightly higher accuracies than BayesA. Our results indicate that BayesA performs the best for traits with few genes with major effects, while RKHS based models can best predict genotypic effects for clonal selection of complex traits.Entities:
Keywords: BayesA; GenPred; Genomic Prediction; Genotypic Value; Oligogenic; Polygenic; RKHS; Shared Data Resources
Mesh:
Substances:
Year: 2019 PMID: 31263059 PMCID: PMC6686920 DOI: 10.1534/g3.119.201004
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Accuracy distribution of all genomic prediction models fitted for tree height (HT), diameter at breast height (DBH) and two measures of fusiform rust infection: presence or absence of rust (RFbin) and gall volume (RFgall). These results were achieved from three different 10-fold cross validation orientations: a) Across families: Each fold is a group of distinct families; b) Within families: The folds were grouped inside families; and c) Random sample: Each fold is a group of distinct individuals random sampled ignoring family information.
Figure 2Accuracy average of prediction of breeding values and genotypic values for all genomic prediction models fitted in six simulated traits: Oligogenic and Polygenic with three degrees of dominance (d2 = 0; d2 = 0.1 and d2 = 0.2). These results were achieved from three different 10-fold cross validation orientations: a) Across families: Each fold is a group of distinct families; b) Within families: The folds were grouped inside families; and c) Random sample: Each fold is a group of distinct individuals random sampled ignoring family information.
Average of accuracies for prediction of phenotypic values for all models based in pedigree-only, in markers-only and models combining pedigree and markers information
| Models | DBH | HT | RFbin | RFgall |
|---|---|---|---|---|
| Pedigree | 0.536 | 0.450 | 0.331 | 0.255 |
| Markers | 0.545 | 0.459 | 0.361 | 0.288 |
| Markers + Pedigree | 0.548 | 0.465 | 0.356 | 0.279 |
Only-Markers models: Additive-, Additive-dominance-BayesA, RKHS Ka and RKHS Ka-Kd; Only-Pedigree models: Additive-, Additive-dominance-Pedigree; Markers + Pedigree are the models id: 2,4,5,7,8,10,11 (Table S2).
Average of accuracies of breeding values, dominance deviation, genotypic values and phenotypic values prediction for all models based in pedigree-only, in markers-only and models combining pedigree and markers information
| Accuracy | Models | d2 = 0 | d2 = 0.1 | d2 = 0.2 | |||
|---|---|---|---|---|---|---|---|
| Olig | Poly | Olig | Poly | Olig | Poly | ||
| Breeding Value | Pedigree | 0.564 | 0.576 | 0.545 | 0.560 | 0.538 | 0.554 |
| Markers | 0.653 | 0.627 | 0.645 | 0.618 | 0.645 | 0.613 | |
| Markers + Pedigree | 0.646 | 0.626 | 0.639 | 0.615 | 0.638 | 0.610 | |
| Dominance Deviation | Pedigree | — | — | 0.179 | 0.202 | 0.271 | 0.259 |
| Markers | — | — | 0.175 | 0.169 | 0.273 | 0.243 | |
| Markers + Pedigree | — | — | 0.186 | 0.185 | 0.284 | 0.258 | |
| Genotypic Value | Pedigree | 0.556 | 0.567 | 0.488 | 0.521 | 0.481 | 0.479 |
| Markers | 0.652 | 0.626 | 0.586 | 0.575 | 0.569 | 0.537 | |
| Markers + Pedigree | 0.638 | 0.619 | 0.578 | 0.571 | 0.566 | 0.536 | |
| Phenotypic Value | Pedigree | 0.251 | 0.259 | 0.284 | 0.306 | 0.313 | 0.335 |
| Markers | 0.297 | 0.286 | 0.338 | 0.331 | 0.378 | 0.373 | |
| Markers + Pedigree | 0.290 | 0.282 | 0.335 | 0.331 | 0.373 | 0.373 | |
Only-Markers models: Additive-, Additive-dominance-BayesA, RKHS Ka and RKHS Ka-Kd; Only-Pedigree models: Additive-, Additive-dominance-Pedigree; Markers + Pedigree are the models id: 2,4,5,7,8,10,11 (Table S2).
Figure 3Average of phenotypic prediction accuracies and standard error for four markers-only models: additive- and additive-dominance-BayesA, RKHS-Ka and RKHS Ka-Kd for diameter at breast height (DBH), height (HT) and Rust resistance evaluated as gall volume (RFgall) and presence or absence (RFbin) in loblolly pine. The standard errors (s.e.=s.d.(x)/sqrt(10)) were calculated for each ten-fold procedure. The error bars are the averages of s.e. across the ten independent cross validations.
Figure 4Accuracy average of prediction of breeding values and genotypic values for four marker-only models: additive-, additive-dominance-BayesA, RKHS-Ka and RKHS Ka-Kd for six simulated traits: Oligogenic and Poligenic with three degrees of dominance (d2 = 0; d2 = 0.1 and d2 = 0.2). Error bars are standard error (s.e.) considering the 100 independent samples used to calculate the mean (s.e.=s.d.(x)/sqrt(100)).