| Literature DB >> 29167677 |
Ao Zhang1,2, Hongwu Wang2,3, Yoseph Beyene4, Kassa Semagn4, Yubo Liu1,2, Shiliang Cao5, Zhenhai Cui1, Yanye Ruan1, Juan Burgueño2, Felix San Vicente2, Michael Olsen4, Boddupalli M Prasanna4, José Crossa2, Haiqiu Yu1, Xuecai Zhang2.
Abstract
Genomic selection is being used increasingly in plant breeding to accelerate genetic gain per unit time. One of the most important applications of genomic selection in maize breeding is to predict and select the best un-phenotyped lines in bi-parental populations based on genomic estimated breeding values. In the present study, 22 bi-parental tropical maize populations genotyped with low density SNPs were used to evaluate the genomic prediction accuracy (rMG ) of the six trait-environment combinations under various levels of training population size (TPS) and marker density (MD), and assess the effect of trait heritability (h2 ), TPS and MD on rMG estimation. Our results showed that: (1) moderate rMG values were obtained for different trait-environment combinations, when 50% of the total genotypes was used as training population and ~200 SNPs were used for prediction; (2) rMG increased with an increase in h2 , TPS and MD, both correlation and variance analyses showed that h2 is the most important factor and MD is the least important factor on rMG estimation for most of the trait-environment combinations; (3) predictions between pairwise half-sib populations showed that the rMG values for all the six trait-environment combinations were centered around zero, 49% predictions had rMG values above zero; (4) the trend observed in rMG differed with the trend observed in rMG /h, and h is the square root of heritability of the predicted trait, it indicated that both rMG and rMG /h values should be presented in GS study to show the accuracy of genomic selection and the relative accuracy of genomic selection compared with phenotypic selection, respectively. This study provides useful information to maize breeders to design genomic selection workflow in their breeding programs.Entities:
Keywords: genomic selection; maize; marker density; training population size; trait heritability
Year: 2017 PMID: 29167677 PMCID: PMC5683035 DOI: 10.3389/fpls.2017.01916
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of 22 bi-parental populations used in the present study, including pedigree code, tester, number of families (Na) and number of polymorphic SNPs (Nb) of all populations, mean and standard deviation of all target trait-environment combinations (GY, grain yield; AD, anthesis date; PH, plant height; WW, well-watered environments; WS, water-stressed environments).
| 1 | P1 × P2 | T1 | 165 | 201 | 6.98 ± 1.91 | 2.34 ± 1.48 | 61.72 ± 8.28 | 86.78 ± 10.64 | 215.02 ± 37.06 | 151.68 ± 49.85 |
| 2 | P2 × P3 | T1 | 162 | 188 | 7.03 ± 2.12 | 2.69 ± 1.61 | 62.39 ± 8.57 | 84.48 ± 11.67 | 218.53 ± 37.91 | 152.50 ± 36.14 |
| 3 | P2 × P4 | T1 | 126 | 183 | 7.57 ± 2.64 | 2.34 ± 1.51 | 62.66 ± 8.55 | 86.3 ± 11.88 | 214.9 ± 39.08 | 160.57 ± 31.51 |
| 4 | P2 × P5 | T1 | 163 | 209 | 7.62 ± 2.32 | 2.76 ± 1.48 | 62.46 ± 8.87 | 82.17 ± 10.57 | 215.27 ± 37.9 | 156.78 ± 34.66 |
| 5 | P4 × P1 | T1 | 183 | 208 | 6.27 ± 2.37 | 2.81 ± 1.67 | 64.97 ± 8.25 | 83.88 ± 9.77 | 199.38 ± 46.76 | 155.99 ± 36.42 |
| 6 | P6 × P7 | T1 | 173 | 195 | 6.32 ± 2.15 | 2.70 ± 1.88 | 60.32 ± 7.43 | 74.18 ± 10.83 | 180.97 ± 34.53 | 160.08 ± 27.26 |
| 7 | P8 × P9 | T1 | 181 | 212 | 6.67 ± 1.98 | 2.55 ± 1.69 | 61.37 ± 6.75 | 75.22 ± 10.08 | 192.42 ± 38.06 | 169.55 ± 22.55 |
| 8 | P8 × P7 | T1 | 184 | 212 | 6.67 ± 1.92 | 2.59 ± 1.28 | 60.43 ± 5.15 | 81.88 ± 11.25 | 174.38 ± 40.46 | 139.08 ± 35.87 |
| 9 | P10 × P11 | T1 | 184 | 211 | 7.06 ± 1.91 | 1.75 ± 1.50 | 64.26 ± 8.33 | 89.67 ± 12.57 | 216.63 ± 44.08 | 154.39 ± 24.57 |
| 10 | P12 × P13 | T1 | 174 | 194 | 7.31 ± 2.23 | 2.44 ± 1.62 | 65.20 ± 8.17 | 88.47 ± 11.04 | 220.70 ± 45.49 | 166.41 ± 29.92 |
| 11 | P17 × P18 | T2 | 184 | 185 | 5.45 ± 1.91 | 1.99 ± 1.33 | 63.55 ± 8.10 | 86.28 ± 17.74 | 243.99 ± 35.67 | 177.80 ± 23.01 |
| 12 | P18 × P19 | T2 | 160 | 162 | 6.66 ± 2.84 | 2.38 ± 1.23 | 66.04 ± 9.09 | 84.84 ± 12.85 | 243.56 ± 38.78 | 180.78 ± 22.17 |
| 13 | P19 × P15 | T2 | 178 | 176 | 6.38 ± 2.73 | 1.90 ± 1.05 | 65.76 ± 8.86 | 84.83 ± 94.46 | 242.57 ± 46.75 | 172.52 ± 24.19 |
| 14 | P20 × P17 | T2 | 173 | 166 | 6.95 ± 2.86 | 2.03 ± 1.16 | 64.14 ± 8.77 | 87.85 ± 12.61 | 232.14 ± 42.01 | 166.06 ± 36.08 |
| 15 | P21 × P22 | T2 | 176 | 172 | 6.82 ± 2.87 | 2.04 ± 1.06 | 63.08 ± 8.20 | 85.00 ± 11.33 | 219.31 ± 33.64 | 156.47 ± 23.74 |
| 16 | P22 × P23 | T2 | 155 | 184 | 6.63 ± 2.61 | 1.75 ± 0.99 | 64.01 ± 8.43 | 85.22 ± 11.32 | 233.82 ± 41.48 | 158.36 ± 28.42 |
| 17 | P19 × P15 | T3 | 164 | 255 | 9.11 ± 0.91 | 5.55 ± 0.80 | 100.81 ± 1.56 | 98.97 ± 1.25 | 223.02 ± 7.81 | 192.59 ± 6.87 |
| 18 | P19 × P26 | T3 | 278 | 283 | 9.91 ± 1.31 | 3.66 ± 0.66 | 65.64 ± 0.95 | 79.93 ± 1.57 | 247.07 ± 10.13 | 192.29 ± 8.19 |
| 19 | P19 × P27 | T3 | 216 | 217 | 8.37 ± 1.26 | 4.64 ± 0.94 | 62.03 ± 1.59 | 85.59 ± 1.43 | 251.26 ± 10.49 | 223.70 ± 8.93 |
| 20 | P24 × P25 | T4 | 247 | 199 | 10.00 ± 1.31 | 4.88 ± 0.55 | 62.32 ± 1.04 | 85.12 ± 1.23 | 233.92 ± 10.11 | 211.86 ± 7.68 |
| 21 | P28 × P29 | T5 | 249 | 238 | 2.54 ± 0.48 | 1.54 ± 0.28 | 54.38 ± 0.76 | 64.79 ± 1.38 | 155.07 ± 9.90 | 126.86 ± 6.66 |
| 22 | P30 × P31 | T6 | 245 | 271 | 5.38 ± 0.72 | 1.56 ± 0.33 | 55.46 ± 0.91 | 70.82 ± 0.98 | 232.45 ± 10.94 | 169.63 ± 7.40 |
Figure 1Distribution of heritability of all the target trait-environment combinations across all the populations.
Figure 2Mean and standard deviation (SD) of r and r/h of all the target traits across all the 22 bi-parental populations. Values of r in each population were estimated, when training population size equaled to 90 and all the SNPs were used for prediction.
Figure 3Distribution of r across all the populations for all the target traits evaluated under WW condition under all the possible training population size (TPS) and marker density (MD) combinations. (A) TPS = 30; (B) TPS = 50; (C) TPS = 70; (D) TPS = 90. Three levels of MD, i.e., 50 SNPs, 100 SNPs and All SNPs, were used for prediction.
Figure 4Distribution of r across all the populations for all the target traits evaluated under WS condition under all the possible training population size (TPS) and marker density (MD) combinations. (A) TPS = 30; (B) TPS = 50; (C) TPS = 70; (D) TPS = 90. Three levels of MD, i.e., 50 SNPs, 100 SNPs and All SNPs, were used for prediction.
Figure 5Combination plot of r and h of all the 6 trait-environment combinations, all the 22 populations were divided into 4 subgroups sorted by the h of target traits from low to high, mean r of each subgroup was estimated under combinations of TPS = 90 and MD = All SNPs. (A) GY_WW; (B) GY_WS; (C) AD_WW; (D) AD_WS; (E) PH_WW; (F) PH_WS.
Figure 6Distribution of r values of predictions between the pairwise half-sib populations.
Correlation between r and the three factors (h, TPS and MD) for all the trait-environment combinations.
| GY_WW | 0.17 | 4.66E-03 | 0.29 | 1.49E-06 | 0.25 | 5.12E-05 |
| GY_WS | 0.49 | 2.24E-17 | 0.19 | 2.01E-03 | 0.24 | 7.57E-05 |
| AD_WW | 0.51 | 6.00E-19 | 0.26 | 1.37E-05 | 0.22 | 3.45E-04 |
| AD_WS | 0.72 | 3.91E-43 | 0.21 | 4.68E-04 | 0.16 | 8.28E-03 |
| PH_WW | 0.72 | 4.90E-43 | 0.19 | 2.05E-03 | 0.13 | 3.39E-02 |
| PH_WS | 0.75 | 4.93E-49 | 0.12 | 5.97E-02 | 0.9.0 | 1.45E-01 |
Variance analysis between r and the three factors (h, TPS and MD) for all the trait-environment combinations.
| GY_WW | 3.02 | 2.34E-03 | 8.58 | 1.14E-05 | 6.10 | 1.79E-05 | 82.31 |
| GY_WS | 24.04 | 8.09E-19 | 3.63 | 3.59E-03 | 4.84 | 2.42E-05 | 67.49 |
| AD_WW | 26.09 | 9.00E-22 | 7.18 | 2.40E-06 | 5.90 | 1.04E-06 | 60.82 |
| AD_WS | 51.56 | 1.54E-46 | 4.66 | 5.95E-06 | 1.46 | 3.15E-03 | 42.32 |
| PH_WW | 51.48 | 2.32E-47 | 3.67 | 6.36E-05 | 3.70 | 2.47E-06 | 41.15 |
| PH_WS | 56.32 | 4.79E-50 | 1.43 | 3.20E-02 | 0.87 | 2.06E-02 | 41.38 |