| Literature DB >> 35870886 |
Eduardo P Cappa1,2, Charles Chen3, Jennifer G Klutsch4,5, Jaime Sebastian-Azcona4,6, Blaise Ratcliffe7, Xiaojing Wei4, Letitia Da Ros8, Aziz Ullah4, Yang Liu7, Andy Benowicz9, Shane Sadoway10, Shawn D Mansfield8, Nadir Erbilgin4, Barb R Thomas4, Yousry A El-Kassaby7.
Abstract
BACKGROUND: Genomic prediction (GP) and genome-wide association (GWA) analyses are currently being employed to accelerate breeding cycles and to identify alleles or genomic regions of complex traits in forest trees species. Here, 1490 interior lodgepole pine (Pinus contorta Dougl. ex. Loud. var. latifolia Engelm) trees from four open-pollinated progeny trials were genotyped with 25,099 SNPs, and phenotyped for 15 growth, wood quality, pest resistance, drought tolerance, and defense chemical (monoterpenes) traits. The main objectives of this study were to: (1) identify genetic markers associated with these traits and determine their genetic architecture, and to compare the marker detected by single- (ST) and multiple-trait (MT) GWA models; (2) evaluate and compare the accuracy and control of bias of the genomic predictions for these traits underlying different ST and MT parametric and non-parametric GP methods. GWA, ST and MT analyses were compared using a linear transformation of genomic breeding values from the respective genomic best linear unbiased prediction (GBLUP) model. GP, ST and MT parametric and non-parametric (Reproducing Kernel Hilbert Spaces, RKHS) models were compared in terms of prediction accuracy (PA) and control of bias.Entities:
Keywords: Genome wide association analyses; Genomic prediction; Lodgepole pine; Quantitative genetic parameters; Single- and multiple-trait mixed models
Mesh:
Year: 2022 PMID: 35870886 PMCID: PMC9308220 DOI: 10.1186/s12864-022-08747-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Narrow-sense heritability estimates and their approximate standard error (SE), for each of the growth, wood quality, pest resistance, drought tolerance and chemical defense assessed traits at four progeny test sites in a lodgepole pine population. Heritabilies were estimated using the genomic- relationship matrix (-matrix) constructed using 25 K SNPs. See text for site and trait abbreviations
| Trait / Site | JUDY | VIRG | SWAN | TIME |
|---|---|---|---|---|
| 0.767 (0.201) | 0.503 (0.221) | 0.616 (0.153) | 0.432 (0.180) | |
| 0.495 (0.178) | 0.464 (0.207) | 0.186 (0.136) | 0.091 (0.137) | |
| 0.465 (0.174) | 0.399 (0.213) | 0.648 (0.165) | 0.678 (0.197) | |
| 0.617 (0.213) | 0.293 (0.216) | 0.203 (0.163) | 0.576 (0.190) | |
| 0.320 (0.171) | 0.064 (0.184) | 0.452 (0.153) | 0.219 (0.146) | |
| 0.537 (0.198) | 0.423 (0.216) | 0.638 (0.178) | 0.434 (0.167) | |
| 0.497 (0.222) | 0.005 (0.190) | 0.239 (0.173) | 0.400 (0.206) | |
| 0.140 (0.163) | 0.651 (0.211) | 0.353 (0.141) | 0.013 (0.008) | |
| 0.681 (0.189) | 0.355 (0.205) | 0.317 (0.168) | 0.503 (0.176) | |
| 0.298 (0.161) | 0.491 (0.211) | 0.686 (0.142) | 0.547 (0.174) | |
| 0.376 (0.196) | 0.281 (0.196) | 0.237 (0.146) | 0.374 (0.175) | |
| 0.523 (0.207) | 0.613 (0.228) | 0.495 (0.161) | 0.200 (0.148) | |
| 0.437 (0.184) | 0.671 (0.206) | 0.795 (0.149) | 0.661 (0.171) | |
| 0.233 (0.179) | 0.566 (0.201) | 0.373 (0.149) | 0.547 (0.175) | |
| 0.266 (0.183) | 0.326 (0.194) | 0.256 (0.151) | 0.247 (0.167) |
Logarithmic transformed
Fig. 1Genomic-based multiple-trait estimates of genetic correlation among the 15 traits studied. Colours reflect the genetic correlation strength, with red and green indicating negative correlations and light blue and violet reflecting positive correlations in lodgepole pine, respectively. See text for site and trait abbreviations
Fig. 2Genomic-based multiple-site estimates of genetic correlations between the four lodgepole pine progeny test sites. Genetic correlation estimates are shown in each cell below the diagonal, with colour and size of circle reflecting the genetic correlation strength. The red and blue circles reflect negative and positive correlations, respectively, and small (weaker) and larger (stronger) circles indicate the strength of the correlation, shown above diagonal. See text for trait abbreviations
Fig. 3Number of total significant (p-values < 1.99 × 10− 06) SNPs identified by the single-trait (blue) and multiple-trait (orange) GWA analyses in lodgepole pine. A total of 17 SNPs was identified for both GWA analyses for the traits HT (1), MPB (2), α-pinene (7), limonene (2), and β-phellandrene (5). See text for site and trait abbreviations
Number of significantly associated SNPs with a Bonferroni correction p-value cutoff of 1.99 × 10−06 for single-trait and multiple-trait GWA analyses (diagonal), and number of common significant SNP markers across different traits by single-trait (above diagonal) and multiple-trait (below diagonal) GWA analyses in lodgepole pine. See text for site and trait abbreviations
| HT | DBH | WGR | WD | δ | MPB | α-pinene | β-pinene | myrcene | limonene | β-phellandrene | terpinolene | total_monoterpenes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | – | – | – | – | – | – | – | – | – | – | – | – | |
| 1 | 83 | – | – | – | – | – | – | – | – | – | 1 | – | |
| – | – | 1 | – | – | – | – | – | – | – | – | – | – | |
| – | – | – | 3 | – | – | – | – | – | – | – | – | – | |
| – | – | – | – | 1 | – | – | – | – | – | – | – | – | |
| – | – | – | – | – | 16 | – | – | – | 2 | – | – | – | |
| – | 2 | – | – | – | – | 35 | – | – | – | – | – | – | |
| – | – | – | – | – | – | – | 1 | – | – | – | – | – | |
| – | – | – | – | – | – | – | – | 7 | 1 | – | – | 1 | |
| – | – | – | – | – | 4 | – | – | – | 10 | – | – | 1 | |
| – | 1 | – | 1 | – | – | 14 | – | – | – | 50 | – | – | |
| – | – | – | – | – | – | – | – | – | – | – | 8 | – | |
| – | – | – | 1 | – | – | – | – | 4 | – | – | – | 15 |
Fig. 4Quantile-quantile (Q-Q) plots for genome-wide association (GWA) analyses based in single-trait (ST, blue) and multi-trait (MT, orange) models for 13 traits studied in lodgepole pine. Q-Q plot is used to assess the number and magnitude of observed associations between genotyped SNPs and traits under study, compared to the association statistics expected under the null hypothesis of no association. See text for trait abbreviations
Fig. 5Scatterplots of p-values in -log10(p-value) scale for the single-trait (x-axis) and multiple-trait (y-axis) GWA analyses in lodgepole pine for 13 traits studied. Note as several points (markers) are positioned above the blue line (i.e., deviated from the diagonal (regression line y = x), suggesting the multiple-trait association analysis increased the power as compared to the single-trait analysis. See text for trait abbreviations
Fig. 6Average prediction accuracy and prediction bias across 13 traits using different genomic selection methods for five single- (ST) and two multiple-trait (MT) models in lodgepole pine. Common letters above box-plots are not significantly different (α = 0.05) according to the Tukey test. See text for ST and MT model abbreviations
Fig. 7Average prediction accuracy and prediction bias using different genomic selection methods for two single-trait (ST) and two multiple-trait (MT) models for 13 traits studied in lodgepole pine. Within each trait, common letters above box-plots are not significantly different (α = 0.05) according to the Tukey test. See text for model and trait abbreviations