| Literature DB >> 31620155 |
Xiaogang Liu1, Hongwu Wang1, Xiaojiao Hu1, Kun Li1, Zhifang Liu1, Yujin Wu1, Changling Huang1.
Abstract
Genomic selection (GS), a tool developed for molecular breeding, is used by plant breeders to improve breeding efficacy by shortening the breeding cycle and to facilitate the selection of candidate lines for creating hybrids without phenotyping in various environments. Association and linkage mapping have been widely used to explore and detect candidate genes in order to understand the genetic mechanisms of quantitative traits. In the current study, phenotypic and genotypic data from three experimental populations, including data on six agronomic traits (e.g., plant height, ear height, ear length, ear diameter, grain yield per plant, and hundred-kernel weight), were used to evaluate the effect of trait-relevant markers (TRMs) on prediction accuracy estimation. Integrating information from mapping into a statistical model can efficiently improve prediction performance compared with using stochastically selected markers to perform GS. The prediction accuracy can reach plateau when a total of 500-1,000 TRMs are utilized in GS. The prediction accuracy can be significantly enhanced by including nonadditive effects and TRMs in the GS model when genotypic data with high proportions of heterozygous alleles and complex agronomic traits with high proportion of nonadditive variancein phenotypic variance are used to perform GS. In addition, taking information on population structure into account can slightly improve prediction performance when the genetic relationship between the training and testing sets is influenced by population stratification due to different allele frequencies. In conclusion, GS is a useful approach for prescreening candidate lines, and the empirical evidence provided by the current study for TRMs and nonadditive effects can inform plant breeding and in turn contribute to the improvement of selection efficiency in practical GS-assisted breeding programs.Entities:
Keywords: association and linkage mapping; genomic selection; maize; nonadditive effect; population structure; trait-relevant marker
Year: 2019 PMID: 31620155 PMCID: PMC6759780 DOI: 10.3389/fpls.2019.01129
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of the high-density genetic map derived from the RIL and F2:3 populations.
| Pop. | Chr. | No. of bins | Physical length of map (Mb) | Map length (cM) | Average genetic length (cM) | Maximal genetic length (cM) |
|---|---|---|---|---|---|---|
| RIL | 1 | 436 | 301.0 | 309.9 | 0.7 | 3.6 |
| 2 | 301 | 237.0 | 240.6 | 0.8 | 3.0 | |
| 3 | 292 | 232.1 | 224.4 | 0.8 | 3.4 | |
| 4 | 233 | 246.9 | 156.8 | 0.7 | 4.5 | |
| 5 | 298 | 217.6 | 276.6 | 0.9 | 3.1 | |
| 6 | 202 | 169.2 | 128.3 | 0.6 | 3.8 | |
| 7 | 204 | 176.1 | 141.0 | 0.7 | 3.5 | |
| 8 | 209 | 175.5 | 150.7 | 0.7 | 3.0 | |
| 9 | 132 | 156.0 | 93.2 | 0.7 | 3.1 | |
| 10 | 143 | 149.9 | 89.6 | 0.6 | 4.5 | |
| Total | 2,450 | 2,061.3 | 1,811.3 | 0.7 | 4.5 | |
| F2:3 | 1 | 507 | 301.0 | 169.3 | 0.3 | 1.7 |
| 2 | 296 | 237.0 | 127.7 | 0.4 | 3.9 | |
| 3 | 279 | 232.1 | 128.1 | 0.5 | 3.6 | |
| 4 | 271 | 246.9 | 130.8 | 0.5 | 7.8 | |
| 5 | 336 | 217.6 | 125.3 | 0.4 | 2.9 | |
| 6 | 308 | 169.2 | 110.8 | 0.4 | 2.2 | |
| 7 | 233 | 176.1 | 95.4 | 0.4 | 3.0 | |
| 8 | 231 | 175.5 | 110.2 | 0.5 | 2.3 | |
| 9 | 181 | 156.0 | 111.6 | 0.6 | 7.1 | |
| 10 | 184 | 149.9 | 96.0 | 0.5 | 2.8 | |
| Total | 2,826 | 2,061.3 | 1,205.4 | 0.4 | 7.8 |
Pop.: the experimental populations.
Chr.: the number of chromosomes.
No. of bins: the number of bin markers on the chromosome.
Pleiotropic QTL (pQTL) for each agronomic trait in the biparental populations.
| pQTL | Chr. | Interval | Physical length | No. of QTL | Integrated QTL |
|---|---|---|---|---|---|
| pQTL1 | 1 | 213.1–251.7 | 38.6 | 4 |
|
| pQTL3 | 3 | 212.0–216.2 | 4.2 | 2 |
|
| pQTL4 | 4 | 33.7–130.7 | 97.0 | 2 |
|
| pQTL8 | 8 | 165.2–168.7 | 3.5 | 2 |
|
| pQTL9 | 9 | 17.5–93.3 | 57.8 | 6 |
|
The name of the pleiotropic QTL includes the information of the number of chromosomes.
Chr.: the number of chromosomes.
Interval: the confidence interval between two bin markers.
Physical length: the physical distance between two bin markers based on the B73 genome.
Figure 1Comparison of prediction accuracies between trait-relevant markers (TRMs) and randomly selected markers based on the results of association and linkage mapping using genotypic and phenotypic data of the training set within the experimental populations. (A) and (B) Plant height (PH) and grain yield per plant (GYP) in the natural population (N = 435); (C) and (D) PH and GYP in the RIL population (N = 212); (E) and (F) PH and GYP in the F2:3 population (N = 304). N is the number of individuals in each population. TRM: the prediction accuracy based on TRMs in the general genomic best linear unbiased prediction (GBLUP) model; RAN: the prediction accuracy based on randomly selected markers in the general GBLUP model. ALL: total of 38,299 single nucleotide polymorphisms (SNPs), 2,450 and 2,826 bin markers were used to perform the scheme of cross-validation in natural, recombinant inbred line (RIL), and F2:3 populations, respectively. The fivefold cross-validation scheme was implemented in this case.
Figure 2Prediction accuracy of models based on nonadditive effects and trait-relevant markers (TRMs). (A) and (B) Plant height (PH) and grain yield per plant (GYP) in the F2:3 population (N = 304). N is the number of individuals in each population. The capital letters A, D, and AA refer to additive, dominance, and additive-by-additive interaction effects, respectively. The A model that contains only additive effects is equivalent to the general genomic best linear unbiased prediction (GBLUP) model using trait-relevant markers (TRMs) to perform cross-validation. ALL: total of 2,826 bin markers were used to perform the scheme of cross-validation in F2:3 population. The fivefold cross-validation scheme was implemented in this case.
Proportions of variance components estimated by the models.
| Parameters | PH | GYP | ||||||
|---|---|---|---|---|---|---|---|---|
| A | A + AA | A + D | A + D + AA | A | A + AA | A + D | A + D + AA | |
| 0.564 | 0.542 | 0.526 | 0.510 | 0.182 | 0.172 | 0.171 | 0.161 | |
| 0.043 | 0.036 | 0.205 | 0.173 | |||||
| 0.032 | 0.019 | 0.127 | 0.032 | |||||
| 0.436 | 0.425 | 0.431 | 0.435 | 0.818 | 0.700 | 0.624 | 0.634 | |
additive genetic variance; :dominance variance; :additive-by-additive interaction variance; :estimated error variance.
PH: plant height; GYP: grain yield per plant.
The model containing various effects; A, additive genetic effect; D, dominance effect; AA, additive-by-additive interaction effect. The BLUE values of individuals in F2:3 population were used as phenotypic data. The extended GBLUP models were implemented for each trait in the F2:3 population using 2,000 TRMs. The numbers in the parentheses were standard deviation.
Figure 3Comparison of prediction accuracy between models based on trait-relevant markers (TRMs). (A) and (B) Plant height (PH) and grain yield per plant (GYP) in the natural population (N = 435). N is the number of individuals in each population; trait-relevant marker (TRM): the prediction accuracy based on TRMs in the general genomic best linear unbiased prediction (GBLUP) model; BC + PC: the prediction accuracy based on the BayesC model with PCs as fixed effects using TRMs; G + PC: the prediction accuracy based on the GBLUP model with PCs as fixed effects using TRMs. ALL: total of 38,299 SNPs were used to perform the scheme of cross-validation in natural population. The fivefold cross-validation scheme was implemented in this case.