| Literature DB >> 35631723 |
Jing Shao1,2,3, Yangfan Hao2, Lanfen Wang2, Yuxin Xie2, Hongwei Zhang2, Jiangping Bai1,3, Jing Wu1,2, Junjie Fu2.
Abstract
Due to insufficient identification and in-depth investigation of existing common bean germplasm resources, it is difficult for breeders to utilize these valuable genetic resources. This situation limits the breeding and industrial development of the common bean (Phaseolus vulgaris L.) in China. Genomic prediction (GP) is a breeding method that uses whole-genome molecular markers to calculate the genomic estimated breeding value (GEBV) of candidate materials and select breeding materials. This study aimed to use genomic prediction to evaluate 15 traits in a collection of 628 common bean lines (including 484 landraces and 144 breeding lines) to determine a common bean GP model. The GP model constructed by landraces showed a moderate to high predictive ability (ranging from 0.59-0.88). Using all landraces as a training set, the predictive ability of the GP model for most traits was higher than that using the landraces from each of two subgene pools, respectively. Randomly selecting breeding lines as additional training sets together with landrace training sets to predict the remaining breeding lines resulted in a higher predictive ability based on principal components analysis. This study constructed a widely applicable GP model of the common bean based on the population structure, and encouraged the development of GP models to quickly aggregate excellent traits and accelerate utilization of germplasm resources.Entities:
Keywords: common bean germplasm; genomic prediction; population structure
Year: 2022 PMID: 35631723 PMCID: PMC9144439 DOI: 10.3390/plants11101298
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1A workflow for genomic prediction of common bean germplasm. The above three ways are represented by the flow of the three numbers (1, 2, 3) in the figure. (1) The landraces from each of two subgene pools were used as training sets to predict breeding lines, respectively. (2) All landraces were used as training sets to predict breeding lines. (3) A part of the breeding lines was selected and included in the training sets.
Figure 2Different number of SNPs and different population sizes were used in ten-fold cross-validation using 100 replications for 15 agronomic traits in 484 common bean landraces (The figure shows four traits: DF, days to flowering; PH, plant height; SP, seeds per pod; and SH, seed height. Other traits are shown in Supplementary Figure S2).
The predictive ability of landraces as the training set to predict breeding lines using 15 traits.
| Traits | Landraces (An + M) | Landraces (An) | Landraces (M) | ||
|---|---|---|---|---|---|
| Breeding Line (An + M) | Breeding Line (An) | Breeding Line (M) | Breeding Line (An) | Breeding Line (M) | |
| DF | 0.3204 | 0.1699 | 0.2063 | 0.1832 | 0.2043 |
| DM | 0.2115 | 0.3917 | −0.0045 | 0.4370 | −0.1070 |
| PH | 0.6761 | 0.4201 | 0.6536 | 0.4035 | 0.6257 |
| SNN | 0.6492 | 0.3492 | 0.3865 | 0.2747 | 0.2640 |
| BN | 0.5365 | 0.4573 | 0.5907 | 0.4414 | 0.5954 |
| PP | 0.7797 | 0.1974 | 0.6856 | 0.2731 | 0.6967 |
| PL | 0.7646 | 0.5688 | 0.8426 | 0.5786 | 0.8441 |
| PW | 0.6702 | 0.5927 | 0.6848 | 0.5574 | 0.6374 |
| PDH | 0.7267 | 0.4896 | 0.5255 | 0.5088 | 0.5085 |
| SP | 0.7818 | 0.2422 | 0.4995 | 0.2645 | 0.3562 |
| GY | 0.5931 | 0.2096 | 0.4799 | 0.1810 | 0.4886 |
| GW | 0.7616 | 0.3387 | 0.6106 | 0.3325 | 0.6046 |
| SL | 0.7705 | 0.4458 | 0.7070 | 0.4744 | 0.7222 |
| SW | 0.7195 | 0.4672 | 0.6254 | 0.4007 | 0.6146 |
| SH | 0.8155 | 0.7401 | 0.6924 | 0.7330 | 0.6917 |
DF, days to flowering; DM, days to maturity; PH, plant height; SNN, stem node number; BN, branch number; PP, pod per plant; PL, pod length; PW, pod width; PDH, pod height; SP, seeds per pod; GY, grain yield per plant; GW, 100-grain weight; SL, seed length; SW, seed width; SH, seed height.
Figure 3The three principal components of a PCA performed on 9781 SNPs markers among 628 common bean accessions. The four colors (shapes) represent landraces and breeding lines in the two gene pools.
Two-step scenario for optimizing the training set by adding 50% of the breeding lines.
| Scenarios | Scenario 1 | Scenario 2 | ||||||
|---|---|---|---|---|---|---|---|---|
| Groups | G1 | G2 | G3 | G4 | G5 | G6 | G7 | G8 |
| Training sets | Landraces An + M(484) | Landraces An + M(484) | Landraces An + M(484) | Landraces An + M(484) | Landraces An + M(484) | Landraces An + M(484) | Landraces An + M(484) | Landraces An + M(484) |
| Additional | Breeding lines A1 (30) | Breeding lines A1 + M1 (72) | Breeding lines M1 (42) | Breeding lines A1 + M1 (72) | Breeding lines A1 + M1 (72) | Breeding lines random (72) | ||
| Testing sets | Breeding lines A2 (30) | Breeding lines A2 (30) | Breeding lines A2 (30) | Breeding lines M2 (42) | Breeding lines M2 (42) | Breeding lines M2 (42) | Breeding lines A2 + M2 (72) | Breeding lines remaining (72) |
The 50% breeding lines (An) were randomly selected as breeding lines A1 (30). The remaining breeding lines (An) are used as breeding lines A2 (30). The 50% breeding lines (M) were randomly selected as breeding lines M1 (42). The remaining breeding lines (M) are used as breeding lines M2 (42). The 50% breeding lines were randomly selected as breeding lines random (72). The remaining breeding lines are used as breeding lines remaining (72).
The model’s precision for 8 groups across two scenarios among 15 traits.
| Scenarios | Scenario 1 | Scenario 2 | ||||||
|---|---|---|---|---|---|---|---|---|
| Groups | G1 | G2 | G3 | G4 | G5 | G6 | G7 | G8 |
| DF | 0.2566 | 0.3590 | 0.3625 | 0.2109 | 0.3934 | 0.3800 | 0.4664 | 0.3686 |
| DM | 0.5108 | 0.6442 | 0.6237 | −0.0846 | 0.0547 | 0.0323 | 0.3868 | 0.4308 |
| PH | 0.4958 | 0.6294 | 0.6446 | 0.6463 | 0.7818 | 0.7795 | 0.7904 | 0.8260 |
| SNN | 0.3920 | 0.5399 | 0.5549 | 0.4497 | 0.5869 | 0.5865 | 0.7255 | 0.6850 |
| BN | 0.5538 | 0.6950 | 0.6820 | 0.6053 | 0.6921 | 0.6996 | 0.6779 | 0.6922 |
| PP | 0.2465 | 0.2771 | 0.2675 | 0.7380 | 0.7356 | 0.7398 | 0.7986 | 0.7385 |
| PL | 0.6740 | 0.8103 | 0.8111 | 0.8934 | 0.9390 | 0.9456 | 0.8929 | 0.7724 |
| PW | 0.6559 | 0.7840 | 0.7854 | 0.7662 | 0.8975 | 0.8966 | 0.8874 | 0.6407 |
| PDH | 0.3626 | 0.3772 | 0.3631 | 0.3312 | 0.3355 | 0.3316 | 0.6837 | 0.5964 |
| SP | 0.1611 | 0.3168 | 0.3504 | 0.5389 | 0.5855 | 0.5877 | 0.7864 | 0.7056 |
| GY | 0.2468 | 0.2496 | 0.2515 | 0.5795 | 0.6191 | 0.6210 | 0.6440 | 0.5802 |
| GW | 0.2613 | 0.5268 | 0.5274 | 0.6238 | 0.6165 | 0.6155 | 0.7804 | 0.6944 |
| SL | 0.3876 | 0.4674 | 0.4775 | 0.7364 | 0.7285 | 0.7226 | 0.7647 | 0.6956 |
| SW | 0.4314 | 0.6614 | 0.6680 | 0.7061 | 0.7596 | 0.7624 | 0.8137 | 0.7460 |
| SH | 0.7114 | 0.7988 | 0.7988 | 0.6650 | 0.7389 | 0.7290 | 0.8535 | 0.8186 |
DF, days to flowering; DM, days to maturity; PH, plant height; SNN, stem node number; BN, branch number; PP, pod per plant; PL, pod length; PW, pod width; PDH, pod height; SP, seeds per pod; GY, grain yield per plant; GW, 100-grain weight; SL, seed length; SW, seed width; SH, seed height.