| Literature DB >> 28454519 |
Patrick R N Lenz1,2, Jean Beaulieu3,4, Shawn D Mansfield5, Sébastien Clément3, Mireille Desponts6, Jean Bousquet4.
Abstract
BACKGROUND: Genomic selection (GS) uses information from genomic signatures consisting of thousands of genetic markers to predict complex traits. As such, GS represents a promising approach to accelerate tree breeding, which is especially relevant for the genetic improvement of boreal conifers characterized by long breeding cycles. In the present study, we tested GS in an advanced-breeding population of the boreal black spruce (Picea mariana [Mill.] BSP) for growth and wood quality traits, and concurrently examined factors affecting GS model accuracy.Entities:
Keywords: Black spruce; Gene SNPs; Genomic selection; Genomic-estimated breeding values; Tree improvement and breeding; Wood properties
Mesh:
Year: 2017 PMID: 28454519 PMCID: PMC5410046 DOI: 10.1186/s12864-017-3715-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genomic selection analyses based on all marker information (4,993 SNPs) and following different schemes of model building and application. “Pedigree” indicates that only pedigree information was used for prediction after model calibrations, which is also referred to as the conventional breeding approach in the body of the text; “Markers” indicates that SNP information was used for prediction. All results are from cross-validations using 10 replicates on randomly-selected trees not included in model fitting, but from the same families used to fit models. The first scheme considered all 734 trees from the two sites combined. Models in other schemes were trained on one site only and whether applied on the same or on the other site, respectively. For comparative purposes, the “Site_mean” scheme represents the mean of 5 models run on 359 trees corresponding to the mean number of trees per site. Model accuracy is the correlation between the cross-validated estimated breeding value (using independent sets of trees) and the “true” reference breeding value. The predictive ability is the correlation between the predicted and the actual phenotypes. Genetic gains are given in absolute values and percentages are given in brackets. Gain estimates are based on predicted phenotypes and a selection intensity of 5%. Annual gain estimates were based on assumptions of a conventional breeding cycle length of 28 years for pedigree selection (“Pedigree”), and a shortened cycle length of 9 years for selection with markers (“Markers”), with 4 years for crosses and production of seedlots that are full-siblings to the training population, followed by 1 year for selection of individuals using markers and genomic selection models, and 4 years for vegetative propagation of selected individuals for seedling production
| Trait | GS scenario | Accuracy (error) | Predictive ability (error) | Gaina (percent) | Gain ratio | Gain per year | Gain per year ratio | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Pedigree | Markers | Pedigree | Markers | Pedigree | Markers | M/Pb | Pedigree | Markers | M/Pb | ||
| Wood density | Combined sites | 0.89 (0.03) | 0.84 (0.02) | 0.45 (0.09) | 0.49 (0.07) | 34.74 (0.08) | 35.63 (0.09) | 1.03 | 1.24 | 3.96 | 3.19 |
| Site1 | 0.80 (0.08) | 0.77 (0.06) | 0.34 (0.16) | 0.38 (0.12) | 35.90 (0.09) | 31.87 (0.08) | 0.89 | 1.28 | 3.54 | 2.77 | |
| Site2 | 0.88 (0.05) | 0.82 (0.10) | 0.52 (0.15) | 0.56 (0.12) | 26.25 (0.06) | 29.48 (0.07) | 1.12 | 0.94 | 3.28 | 3.49 | |
| Site_mean | 0.85 (0.05) | 0.81 (0.05) | 0.42 (0.18) | 0.43 (0.16) | 30.15 (0.07) | 28.55 (0.07) | 0.95 | 1.08 | 3.17 | 2.94 | |
| Site1 → 2 | 0.79 (0.05) | 0.77 (0.08) | 0.42 (0.16) | 0.44 (0.19) | 24.60 (0.06) | 21.47 (0.05) | 0.87 | 0.88 | 2.39 | 2.72 | |
| Site2 → 1 | 0.85 (0.04) | 0.80 (0.08) | 0.36 (0.13) | 0.39 (0.11) | 40.02 (0.10) | 45.46 (0.11) | 1.14 | 1.43 | 5.05 | 3.53 | |
| Height | Combined sites | 0.88 (0.03) | 0.86 (0.03) | 0.56 (0.08) | 0.57 (0.07) | 105.17 (0.13) | 104.57 (0.13) | 0.99 | 3.76 | 11.62 | 3.09 |
| Site1 | 0.84 (0.05) | 0.81 (0.04) | 0.51 (0.11) | 0.51 (0.11) | 53.95 (0.07) | 52.17 (0.07) | 0.97 | 1.93 | 5.8 | 3.01 | |
| Site2 | 0.88 (0.02) | 0.85 (0.03) | 0.58 (0.09) | 0.58 (0.12) | 82.89 (0.10) | 78.69 (0.10) | 0.95 | 2.96 | 8.74 | 2.95 | |
| Site_mean | 0.85 (0.05) | 0.83 (0.05) | 0.55 (0.12) | 0.55 (0.12) | 87.00 (0.11) | 84.56 (0.11) | 0.97 | 3.11 | 9.4 | 3.02 | |
| Site1 → 2 | 0.84 (0.03) | 0.83 (0.04) | 0.56 (0.12) | 0.56 (0.13) | 75.93 (0.09) | 74.56 (0.09) | 0.98 | 2.71 | 8.28 | 3.06 | |
| Site2 → 1 | 0.85 (0.04) | 0.84 (0.03) | 0.51 (0.11) | 0.52 (0.11) | 63.67 (0.08) | 62.31 (0.08) | 0.98 | 2.27 | 6.92 | 3.05 | |
| Diameter (DBH) | Combined sites | 0.86 (0.03) | 0.83 (0.04) | 0.45 (0.06) | 0.43 (0.07) | 16.65 (0.14) | 14.81 (0.13) | 0.89 | 0.59 | 1.65 | 2.80 |
| Site1 | 0.76 (0.08) | 0.74 (0.08) | 0.38 (0.15) | 0.34 (0.15) | 12.09 (0.10) | 10.15 (0.09) | 0.84 | 0.43 | 1.13 | 2.63 | |
| Site2 | 0.87 (0.02) | 0.82 (0.04) | 0.53 (0.05) | 0.48 (0.08) | 14.70 (0.13) | 13.18 (0.12) | 0.90 | 0.53 | 1.46 | 2.75 | |
| Site_mean | 0.82 (0.05) | 0.79 (0.06) | 0.45 (0.09) | 0.42 (0.11) | 13.44 (0.12) | 11.84 (0.10) | 0.88 | 0.48 | 1.32 | 2.75 | |
| Site1 → 2 | 0.76 (0.05) | 0.75 (0.05) | 0.43 (0.09) | 0.43 (0.10) | 10.99 (0.10) | 9.65 (0.08) | 0.88 | 0.39 | 1.07 | 2.74 | |
| Site2 → 1 | 0.80 (0.05) | 0.78 (0.05) | 0.33 (0.13) | 0.34 (0.12) | 16.09 (0.14) | 13.75 (0.12) | 0.85 | 0.57 | 1.53 | 2.68 | |
| Microfibril anglec | Combined sites | 0.88 (0.03) | 0.84 (0.04) | 0.51 (0.11) | 0.51 (0.10) | −2.71 (−0.15) | −2.78 (−0.15) | 1.02 | −0.10 | −0.31 | 3.10 |
| Site1 | 0.83 (0.06) | 0.79 (0.04) | 0.47 (0.15) | 0.45 (0.13) | −2.47 (−0.12) | −2.29 (−0.11) | 0.92 | −0.09 | −0.25 | 2.78 | |
| Site2 | 0.86 (0.04) | 0.82 (0.04) | 0.54 (0.16) | 0.52 (0.15) | −1.72 (−0.11) | −1.80 (−0.12) | 1.09 | −0.06 | −0.20 | 3.33 | |
| Site_mean | 0.84 (0.06) | 0.80 (0.05) | 0.48 (0.15) | 0.48 (0.14) | −1.84 (−0.11) | −1.80 (−0.11) | 1.00 | −0.07 | −0.20 | 2.86 | |
| Site1 → 2 | 0.80 (0.03) | 0.76 (0.04) | 0.48 (0.13) | 0.47 (0.12) | −2.05 (−0.13) | −1.95 (−0.12) | 0.92 | −0.07 | −0.22 | 3.14 | |
| Site2 → 1 | 0.84 (0.04) | 0.81 (0.04) | 0.44 (0.13) | 0.45 (0.11) | −2.14 (−0.10) | −2.16 (−0.10) | 1.01 | −0.08 | −0.24 | 3.00 | |
aGenetic gains are given in absolute values; units are kg/m3 for wood density, cm for height, mm for diameter and degrees for microfibril angle
bM/P, markers to pedigree gain ratio
cNegative genetic gain for MFA indicates trait improvement
Variance component estimates from genomic selection analyses, combined-site and single-site analyses. “Pedigree” indicates variance component estimation based on pedigree and phenotypic information, and “Markers” indicate SNP information from all 4,993 SNPs
| Trait | Model |
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
| Wood density | Combined sites | Pedigree | - | - | 597.73 | 870.01 | 0.41 |
| Markers | 0.29 | 551.46 | - | 878.19 | 0.39 | ||
| Site 1 | Pedigree | - | - | 973.94 | 1407.01 | 0.41 | |
| Markers | 0.50 | 957.04 | - | 1398.98 | 0.41 | ||
| Site 2 | Pedigree | - | - | 653.68 | 216.32 | 0.75 | |
| Markers | 0.27 | 523.71 | - | 291.00 | 0.64 | ||
| Diameter (DBH) | Combined sites | Pedigree | - | - | 126.42 | 93.69 | 0.57 |
| Markers | 2.9E -2 | 55.88 | - | 135.02 | 0.29 | ||
| Site 1 | Pedigree | - | - | 123.22 | 129.34 | 0.49 | |
| Markers | 0.04 | 68.09 | - | 164.37 | 0.29 | ||
| Site 2 | Pedigree | - | - | 143.65 | 54.22 | 0.73 | |
| Markers | 0.03 | 60.26 | - | 106.18 | 0.36 | ||
| Height | Combined sites | Pedigree | - | - | 3503.73 | 1613.54 | 0.68 |
| Markers | 0.97 | 1851.53 | - | 2514.81 | 0.42 | ||
| Site 1 | Pedigree | - | - | 3232.04 | 1623.81 | 0.67 | |
| Markers | 1.14 | 2182.23 | - | 2227.79 | 0.49 | ||
| Site 2 | Pedigree | - | - | 4494.96 | 1222.76 | 0.79 | |
| Markers | 1.32 | 2530.77 | - | 2341.31 | 0.52 | ||
| Microfibril angle | Combined sites | Pedigree | - | - | 0.14 | 4.8E -2 | 0.74 |
| Markers | 3.5E -5 | 6.7E -2 | - | 8.8E -2 | 0.43 | ||
| Site 1 | Pedigree | - | - | 0.15 | 5.5E -2 | 0.73 | |
| Markers | 4.5E -5 | 8.6E -2 | - | 9.2E -2 | 0.48 | ||
| Site 2 | Pedigree | - | - | 0.15 | 3.1E -2 | 0.83 | |
| Markers | 4.3E -5 | 8.2E -2 | - | 7.2E -2 | 0.53 |
a σ 2 a is the additive genetic variance explained by marker loci
b V A is the additive genetic variance based on markers was estimated as
c σ 2 u is the polygenic variance estimated based on pedigree
d σ 2 e is the residual variance
e h 2i is the individual trait heritability
Accuracies of genomic selection models based on half-sib families using all 4,993 markers in a combined-site analysis, thus removing full-sib family linkage. Pedigree indicates that only pedigree information was used for predictions after model calibrations; markers indicates that SNP information was used. The predictive ability is the correlation between the predicted and the actual phenotypes. Genetic gains are given in absolute values and percentages are given in brackets. Gain estimates are based on predicted phenotypes and a selection intensity of 5%. Annual gain estimates were based on assumptions of a conventional breeding cycle length of 28 years for pedigree selection (“Pedigree”), and a shortened cycle length of 9 years for selection with markers (“Markers”), with 4 years for crosses and production of seedlots that are full-siblings to the training population, followed by 1 year for selection of individuals using markers and genomic selection models, and 4 years for vegetative propagation of selected individuals for seedling production
| Trait | Accuracy (error) | Predictive ability (error) | Gaina (percent) | Gain ratio | Gain per year | Gain per year ratio | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Pedigree | Markers | Pedigree | Markers | Pedigree | Markers | M/Pb | Pedigree | Markers | M/Pb | |
| Wood density | 0.77 (0.14) | 0.65 (0.18) | 0.38 (0.16) | 0.37 (0.15) | 26.48 (0.06) | 26.33 (0.06) | 0.99 | 0.95 | 2.93 | 3.08 |
| Height | 0.80 (0.12) | 0.76 (0.13) | 0.50 (0.19) | 0.49 (0.18) | 78.59 (0.10) | 76.70 (0.10) | 0.98 | 2.81 | 8.52 | 3.03 |
| Diameter (DBH) | 0.64 (0.17) | 0.63 (0.17) | 0.30 (0.20) | 0.31 (0.21) | 14.41 (0.13) | 12.19 (0.11) | 0.85 | 0.51 | 1.35 | 2.65 |
| Microfibril anglec | 0.40 (0.41) | 0.42 (0.28) | 0.18 (0.32) | 0.23 (0.25) | −2.01 (−0.11) | −1.97 (−0.11) | 1.00 | −0.07 | −0.22 | 3.14 |
aGenetic gains are given in absolute values; units are kg/m3 for wood density, cm for height, mm for diameter and degrees for microfibril angle
bM/P, markers to pedigree gain ratio
cNegative genetic gain for MFA indicates trait improvement
Fig. 1Accuracy of genomic selection models with reduced marker density. Accuracy estimates for subsets of markers are shown by correlations between the genomic-predicted breeding values (x-axis) and the true breeding values (y-axis) for tree height, diameter at breast height (DBH), wood density, and microfibril angle. Associated errors of accuracy estimates are presented in brackets. On the y-axis of the fifth row of plots, largest SNPs indicate SNPs with largest absolute effects. On the y-axis of the sixth row of plots, remaining SNPs indicate the subset of SNPs without those 250 SNPs with largest absolute effects
Fig. 2Effect of linkage group on accuracy of genomic selection models. Accuracy (black circles) and associated errors for models based on markers pertaining to the same individual linkage group. Dashed grey lines indicate the means of accuracy estimates of the 12 linkage groups; long-dashed black lines are the accuracies of models combining all markers of known map positions (2928 markers, see Methods) and spanning the entire genome
Fig. 3Accuracy of genomic selection models obtained using subsets of trees. Accuracy estimates for pedigree-based models (light grey line) and marker-based models (dark line), as well as their ratio (histograms). Estimates are given with their associated standard errors
Fig. 4Predicted genetic gain using subsets of trees to build pedigree-based models (light grey line) and marker-based models (dark line), and the corresponding coefficient of variation (error bars). The ratio of marker- to pedigree-predicted genetic gain is presented by histograms. Gain estimates are based on predicted phenotypes and a selection intensity of 5%