| Literature DB >> 30654497 |
Liqiang He1,2, Jin Xiao3, Khalid Y Rashid4, Gaofeng Jia5, Pingchuan Li6, Zhen Yao7, Xiue Wang8, Sylvie Cloutier9, Frank M You10,11.
Abstract
Pasmo (Septoria linicola) is a fungal disease causing major losses in seed yield and quality and stem fibre quality in flax. Pasmo resistance (PR) is quantitative and has low heritability. To improve PR breeding efficiency, the accuracy of genomic prediction (GP) was evaluated using a diverse worldwide core collection of 370 accessions. Four marker sets, including three defined by 500, 134 and 67 previously identified quantitative trait loci (QTL) and one of 52,347 PR-correlated genome-wide single nucleotide polymorphisms, were used to build ridge regression best linear unbiased prediction (RR-BLUP) models using pasmo severity (PS) data collected from field experiments performed during five consecutive years. With five-fold random cross-validation, GP accuracy as high as 0.92 was obtained from the models using the 500 QTL when the average PS was used as the training dataset. GP accuracy increased with training population size, reaching values >0.9 with training population size greater than 185. Linear regression of the observed PS with the number of positive-effect QTL in accessions provided an alternative GP approach with an accuracy of 0.86. The results demonstrate the GP models based on marker information from all identified QTL and the 5-year PS average is highly effective for PR prediction.Entities:
Keywords: Septoria linicola; flax; genomic prediction; genomic selection; genotyping by sequencing; pasmo resistance; pasmo severity; quantitative trait loci; single nucleotide polymorphism
Mesh:
Substances:
Year: 2019 PMID: 30654497 PMCID: PMC6359301 DOI: 10.3390/ijms20020359
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Pasmo severity of 370 flax accessions across five years in the field condition.
| Data Set |
| Range | |
|---|---|---|---|
| PS-2012 | 5.57 ± 1.86 | 1.00–9.00 | 32.76 |
| PS-2013 | 5.69 ± 1.91 | 2.00–9.00 | 33.20 |
| PS-2014 | 6.86 ± 2.07 | 1.00–9.00 | 29.41 |
| PS-2015 | 6.11 ± 1.55 | 1.00–9.00 | 25.44 |
| PS-2016 | 6.72 ± 1.37 | 2.00–9.00 | 20.39 |
| PS-mean | 6.22 ± 1.32 | 1.80–9.00 | 21.27 |
: average pasmo severity across five years; s: standard deviation; CV: coefficient of variation.
Figure 1Dot plots (lower triangle), histograms (diagonal) and Pearson correlations (upper triangle) between six pasmo severity datasets. Best curves are fitted in dot plots and histograms. *** represents significance at the <0.001 probability level.
Figure 2Distribution of R2 (%) (phenotypic variation explained by individual QTL) in the three QTL marker sets.
Phenotypic variation of pasmo severity (PS) ( ± s) explained by the four marker sets.
| PS Dataset | Marker Set | |||
|---|---|---|---|---|
| SNP-500QTL | SNP-134QTL | SNP-67QTL | SNP-52347 | |
| PS-mean | 0.72 ± 0.04 | 0.27 ± 0.05 | 0.29 ± 0.05 | 0.54 ± 0.07 |
| PS-2012 | 0.64 ± 0.06 | 0.18 ± 0.05 | 0.16 ± 0.04 | 0.43 ± 0.08 |
| PS-2013 | 0.63 ± 0.06 | 0.12 ± 0.04 | 0.12 ± 0.04 | 0.38 ± 0.08 |
| PS-2014 | 0.65 ± 0.06 | 0.23 ± 0.05 | 0.20 ± 0.05 | 0.45 ± 0.08 |
| PS-2015 | 0.56 ± 0.06 | 0.20 ± 0.05 | 0.17 ± 0.04 | 0.44 ± 0.09 |
| PS-2016 | 0.53 ± 0.06 | 0.18 ± 0.05 | 0.18 ± 0.05 | 0.38 ± 0.07 |
Accuracy (r) and relative efficiency (RE) values of the 24 combinations representing the four marker sets and six pasmo severity (PS) datasets using RR-BLUP obtained using a random five-fold cross-validation.
| Marker Set | PS Dataset | ||
|---|---|---|---|
| SNP-500QTL | PS-mean | 0.92 ± 0.02a | 1.84 ± 0.04a |
| PS-2012 | 0.84 ± 0.03b | 1.68 ± 0.06b | |
| PS-2013 | 0.81 ± 0.04c | 1.62 ± 0.07c | |
| PS-2014 | 0.82 ± 0.04c | 1.63 ± 0.07c | |
| PS-2015 | 0.76 ± 0.05d | 1.52 ± 0.09d | |
| PS-2016 | 0.76 ± 0.05d | 1.52 ± 0.11d | |
| SNP-134QTL | PS-mean | 0.75 ± 0.06e | 1.49 ± 0.11e |
| PS-2012 | 0.68 ± 0.06f | 1.36 ± 0.11f | |
| PS-2013 | 0.60 ± 0.07ij | 1.19 ± 0.14ij | |
| PS-2014 | 0.60 ± 0.07i | 1.21 ± 0.14i | |
| PS-2015 | 0.47 ± 0.09o | 0.94 ± 0.18o | |
| PS-2016 | 0.56 ± 0.09l | 1.12 ± 0.17l | |
| SNP-67QTL | PS-mean | 0.76 ± 0.05d | 1.53 ± 0.1d |
| PS-2012 | 0.67 ± 0.06g | 1.35 ± 0.11g | |
| PS-2013 | 0.60 ± 0.07ij | 1.20 ± 0.14ij | |
| PS-2014 | 0.60 ± 0.07ij | 1.20 ± 0.14ij | |
| PS-2015 | 0.50 ± 0.09n | 1.00 ± 0.17n | |
| PS-2016 | 0.59 ± 0.08k | 1.17 ± 0.17k | |
| SNP-52347 | PS-mean | 0.67 ± 0.07g | 1.33 ± 0.14g |
| PS-2012 | 0.63 ± 0.06h | 1.27 ± 0.12h | |
| PS-2013 | 0.59 ± 0.07jk | 1.19 ± 0.14jk | |
| PS-2014 | 0.53 ± 0.08m | 1.06 ± 0.17m | |
| PS-2015 | 0.38 ± 0.09q | 0.77 ± 0.17q | |
| PS-2016 | 0.46 ± 0.09p | 0.93 ± 0.18p |
1 Different letters represent multiple test significance among the 24 combinations at the 0.05 probability level.
Figure 3Accuracy (r) (a) and relative efficiency (RE) (b) of RR-BLUP prediction models built with combinations of four marker sets using the five-year average PS dataset (PS-mean) and random five-fold cross-validations. Letters above box plots indicated statistical significance (p < 0.05) for r and RE among marker sets.
Figure 4Relationship between the genomic prediction accuracy (r) and the size of the training population based on the SNP-500QTL marker set, the PS-mean dataset and the RR-BLUP models. The dash line represents a prediction accuracy of 0.9.
Accuracy (r) and relative efficiency (RE) of genomic prediction for pasmo severity in different years using the RR-BLUP model built with the SNP-500QTL marker set and the PS-mean phenotypic data using all 370 accessions as training data set.
| PS Dataset for Prediction |
|
|
|---|---|---|
| PS-mean | 0.98 | 1.96 |
| PS-2012 | 0.73 | 1.46 |
| PS-2013 | 0.71 | 1.42 |
| PS-2014 | 0.81 | 1.62 |
| PS-2015 | 0.71 | 1.43 |
| PS-2016 | 0.77 | 1.55 |
Figure 5Relationship of observed pasmo severity (PS) with PS predicted by a GP model (a,c) or with PS predicted by the number of QTL with positive-effect alleles (NPQTL) (b,d). (a) Linear regression of observed PS (y) to predicted PS (x) using the genomic prediction model built with the PS-mean dataset and the SNP-500QTL marker set of all 370 accessions as training data set. (b) Linear regression of observed PS (y) to NPQTL (x) in the 370 flax accessions. (c) Relationship of observed PS of 93 randomly chosen accessions with the PS predicted by the genomic model constructed with the SNP-500QTL marker set and PS-mean dataset when a random subset of 277 accessions was used as training population. (d) Relationship of observed PS of 93 randomly chosen accessions with the PS predicted by NPQTL (Figure S2) The red dashed lines represent upper and lower boundaries of the 95% prediction intervals, that is, it is expected that the value of a sample lies within that prediction interval in 95% of the samples. The grey band represents the 95% confidence interval, that is, 95% of those intervals include the true value of the population mean.