| Literature DB >> 27920780 |
Manish Roorkiwal1, Abhishek Rathore1, Roma R Das1, Muneendra K Singh1, Ankit Jain1, Samineni Srinivasan1, Pooran M Gaur1, Bharadwaj Chellapilla2, Shailesh Tripathi2, Yongle Li3, John M Hickey4, Aaron Lorenz5, Tim Sutton6, Jose Crossa7, Jean-Luc Jannink8, Rajeev K Varshney9.
Abstract
Genomic selection (GS) unlike marker-assisted backcrossing (MABC) predicts breeding values of lines using genome-wide marker profiling and allows selection of lines prior to field-phenotyping, thereby shortening the breeding cycle. A collection of 320 elite breeding lines was selected and phenotyped extensively for yield and yield related traits at two different locations (Delhi and Patancheru, India) during the crop seasons 2011-12 and 2012-13 under rainfed and irrigated conditions. In parallel, these lines were also genotyped using DArTseq platform to generate genotyping data for 3000 polymorphic markers. Phenotyping and genotyping data were used with six statistical GS models to estimate the prediction accuracies. GS models were tested for four yield related traits viz. seed yield, 100 seed weight, days to 50% flowering and days to maturity. Prediction accuracy for the models tested varied from 0.138 (seed yield) to 0.912 (100 seed weight), whereas performance of models did not show any significant difference for estimating prediction accuracy within traits. Kinship matrix calculated using genotyping data reaffirmed existence of two different groups within selected lines. There was not much effect of population structure on prediction accuracy. In brief, present study establishes the necessary resources for deployment of GS in chickpea breeding.Entities:
Keywords: chickpea; genetic gain; genomic prediction accuracy; genomic selection; population structure; prediction models; training population
Year: 2016 PMID: 27920780 PMCID: PMC5118446 DOI: 10.3389/fpls.2016.01666
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Analysis of variance (ANOVA) and genetic estimates for days to flower, days to maturity, 100 seed weight and seed yield.
| Days to flowering (DF) | ICRISAT-IR-12 | 38.93 | 2.13 | 5.48 | 30.20 | 0.95 | 14.12 | 15.14 | 5.48 |
| ICRISAT-IR-13 | 42.60 | 2.29 | 5.39 | 30.62 | 0.94 | 12.99 | 14.06 | 5.39 | |
| ICRISAT-RF-13 | 44.76 | 2.10 | 4.69 | 8.29 | 0.84 | 6.43 | 7.96 | 4.69 | |
| IARI-IR-12 | 66.46 | 0.81 | 1.22 | 232.01 | 0.99 | 22.92 | 22.95 | 1.22 | |
| IARI-IR-13 | 65.48 | 0.33 | 0.50 | 606.18 | 0.99 | 37.60 | 37.61 | 0.50 | |
| Days to maturity (DM) | ICRISAT-IR-12 | 103.11 | 2.05 | 1.99 | 1.73 | 0.54 | 1.28 | 2.36 | 1.99 |
| ICRISAT-IR-13 | 93.93 | 1.86 | 1.98 | 14.63 | 0.92 | 4.07 | 4.53 | 1.98 | |
| ICRISAT-RF-13 | 91.62 | 2.60 | 2.84 | 8.98 | 0.79 | 3.27 | 4.33 | 2.84 | |
| IARI-IR-12 | 153.24 | 0.93 | 0.61 | 11.74 | 0.96 | 2.24 | 2.32 | 0.61 | |
| IARI-IR-13 | 153.16 | 0.18 | 0.12 | 12.29 | 0.99 | 2.29 | 2.29 | 0.12 | |
| 100 seed weight (SDW; g) | ICRISAT-IR-12 | 26.65 | 1.17 | 4.39 | 57.16 | 0.99 | 28.36 | 28.70 | 4.39 |
| ICRISAT-IR-13 | 28.44 | 1.69 | 5.95 | 67.32 | 0.99 | 28.84 | 29.45 | 5.95 | |
| ICRISAT-RF-13 | 28.96 | 2.29 | 7.92 | 68.81 | 0.98 | 28.64 | 29.72 | 7.92 | |
| IARI-IR-12 | 32.29 | 0.52 | 1.62 | 68.59 | 0.99 | 25.65 | 25.70 | 1.62 | |
| IARI-IR-13 | 27.12 | 0.35 | 1.31 | 67.99 | 0.99 | 30.40 | 30.43 | 1.31 | |
| Seed yield (SY; g per plot) | ICRISAT-IR-12 | 122.13 | 13.55 | 11.10 | 1318.21 | 0.95 | 29.73 | 31.73 | 11.10 |
| ICRISAT-IR-13 | 134.47 | 18.94 | 14.09 | 385.92 | 0.76 | 14.61 | 20.29 | 14.09 | |
| ICRISAT-RF-13 | 119.23 | 14.20 | 11.91 | 567.20 | 0.89 | 19.97 | 23.26 | 11.91 | |
| IARI-IR-12 | 140.50 | 35.42 | 25.21 | 2854.12 | 0.82 | 38.03 | 45.62 | 25.21 | |
| IARI-IR-13 | 233.98 | 5.20 | 2.22 | 10304.00 | 0.99 | 43.38 | 43.44 | 2.22 |
SD, Standard deviation; CV, Coefficient of variation; GV, Genetic Variance; GCV, Genotypic Coefficient of Variation; H.
Figure 1Estimation of polymorphism information content (PIC) value and gene diversity of markers used (A) SilicoDArTs (B) DArT-SNP markers.
Figure 2Genome-wide LD heat map constructed using SilicoDArTs and DArT-SNP markers across the 315 elite lines of chickpea. Linkage disequilibrium was calculated using r2 = 0.2 as threshold. Genome-wide LD extend up to 500–2500 kb in CaLG02 and CaLG04. One huge LD block on CaLG04 was observed.
Figure 3Genetic relatedness among the 315 elite lines. Distance matrix was calculated using average linkage clustering. Heat map of the kinship matrix and dendrogram analysis using genotyping data clearly depicts the existence of two different group among the elite lines.
Effect of missing marker data and minor allele frequency on prediction accuracy.
| Ridge Regression | 0.681 | 0.679 | 0.669 | 0.669 | 0.676 | 0.671 | 0.660 | 0.674 | 0.670 |
| Kinship Gauss | 0.697 | 0.710 | 0.702 | 0.692 | 0.701 | 0.695 | 0.688 | 0.698 | 0.702 |
| Bayes Cπ | - | 0.678 | 0.662 | - | 0.688 | 0.674 | - | 0.668 | 0.683 |
| Bayes B | - | 0.674 | 0.655 | - | 0.680 | 0.660 | - | 0.663 | 0.663 |
| Bayes LASSO | 0.660 | 0.681 | 0.666 | 0.684 | 0.671 | 0.665 | 0.672 | 0.657 | 0.680 |
| Random Forest | 0.694 | 0.713 | 0.705 | 0.709 | 0.694 | 0.693 | 0.698 | 0.689 | 0.697 |
Miss, Missing marker data; MAF, Minor Allele Frequency.
Comparative analysis of prediction accuracies of different GS models for four yield related traits across chickpea population.
| Days to flowering (DF) | Ridge Regression | 0.665 | 0.005 | 0.556 | 0.006 | 0.674 | 0.005 | 0.663 | 0.006 | 0.823 | 0.003 |
| Kinship Gauss | 0.707 | 0.005 | 0.635 | 0.005 | 0.673 | 0.005 | 0.701 | 0.006 | 0.847 | 0.003 | |
| Bayes Cπ | 0.663 | 0.005 | 0.564 | 0.006 | 0.675 | 0.005 | 0.663 | 0.006 | 0.824 | 0.003 | |
| Bayes B | 0.647 | 0.005 | 0.560 | 0.006 | 0.673 | 0.005 | 0.664 | 0.006 | 0.825 | 0.003 | |
| Bayes LASSO | 0.666 | 0.005 | 0.562 | 0.006 | 0.673 | 0.005 | 0.664 | 0.006 | 0.827 | 0.003 | |
| Random Forest | 0.693 | 0.005 | 0.626 | 0.006 | 0.683 | 0.004 | 0.695 | 0.006 | 0.851 | 0.003 | |
| Days to maturity (DM) | Ridge Regression | 0.794 | 0.004 | 0.478 | 0.006 | 0.301 | 0.008 | 0.325 | 0.009 | 0.374 | 0.007 |
| Kinship Gauss | 0.808 | 0.004 | 0.539 | 0.006 | 0.304 | 0.008 | 0.320 | 0.008 | 0.394 | 0.007 | |
| Bayes Cπ | 0.799 | 0.004 | 0.495 | 0.006 | 0.304 | 0.009 | 0.324 | 0.009 | 0.379 | 0.007 | |
| Bayes B | 0.798 | 0.004 | 0.510 | 0.006 | 0.289 | 0.009 | 0.331 | 0.009 | 0.395 | 0.007 | |
| Bayes LASSO | 0.797 | 0.004 | 0.476 | 0.006 | 0.301 | 0.008 | 0.329 | 0.009 | 0.376 | 0.007 | |
| Random Forest | 0.815 | 0.004 | 0.531 | 0.007 | 0.254 | 0.009 | 0.300 | 0.009 | 0.407 | 0.007 | |
| 100 seed weight (SDW) | Ridge Regression | 0.893 | 0.002 | 0.797 | 0.004 | 0.816 | 0.004 | 0.898 | 0.002 | 0.909 | 0.002 |
| Kinship Gauss | 0.893 | 0.002 | 0.798 | 0.003 | 0.817 | 0.004 | 0.909 | 0.002 | 0.912 | 0.002 | |
| Bayes Cπ | 0.892 | 0.002 | 0.797 | 0.003 | 0.817 | 0.004 | 0.901 | 0.002 | 0.909 | 0.002 | |
| Bayes B | 0.887 | 0.002 | 0.792 | 0.004 | 0.816 | 0.004 | 0.903 | 0.002 | 0.908 | 0.002 | |
| Bayes LASSO | 0.892 | 0.002 | 0.799 | 0.004 | 0.817 | 0.004 | 0.900 | 0.002 | 0.909 | 0.002 | |
| Random Forest | 0.897 | 0.002 | 0.801 | 0.004 | 0.815 | 0.004 | 0.909 | 0.001 | 0.912 | 0.002 | |
| Seed yield (SY) | Ridge Regression | 0.523 | 0.006 | 0.172 | 0.008 | 0.166 | 0.008 | 0.604 | 0.005 | 0.222 | 0.008 |
| Kinship Gauss | 0.522 | 0.006 | 0.148 | 0.008 | 0.138 | 0.008 | 0.602 | 0.005 | 0.218 | 0.008 | |
| Bayes Cπ | 0.520 | 0.007 | 0.175 | 0.008 | 0.163 | 0.008 | 0.602 | 0.005 | 0.216 | 0.008 | |
| Bayes B | 0.517 | 0.006 | 0.171 | 0.008 | 0.168 | 0.008 | 0.597 | 0.005 | 0.209 | 0.009 | |
| Bayes LASSO | 0.524 | 0.006 | 0.182 | 0.008 | 0.163 | 0.007 | 0.598 | 0.006 | 0.216 | 0.008 | |
| Random Forest | 0.493 | 0.006 | 0.186 | 0.008 | 0.165 | 0.009 | 0.606 | 0.005 | 0.205 | 0.009 | |
SE, Standard Error.
Effect of population structure/size on prediction accuracy using six GS models for yield related traits.
| Ridge Regression | 0.665 ± 0.005 | 0.531 ± 0.011 | 0.639 ± 0.007 | 0.556 ± 0.006 | 0.418 ± 0.012 | 0.646 ± 0.007 | 0.674 ± 0.005 | 0.350 ± 0.017 | 0.377 ± 0.011 | 0.823 ± 0.003 | 0.477 ± 0.014 | 0.561 ± 0.010 | 0.663 ± 0.006 | 0.451 ± 0.011 | 0.518 ± 0.009 |
| Kinship Gauss | 0.707 ± 0.005 | 0.572 ± 0.010 | 0.637 ± 0.007 | 0.635 ± 0.005 | 0.444 ± 0.011 | 0.673 ± 0.007 | 0.673 ± 0.005 | 0.374 ± 0.018 | 0.366 ± 0.011 | 0.847 ± 0.003 | 0.510 ± 0.014 | 0.572 ± 0.009 | 0.701 ± 0.006 | 0.474 ± 0.011 | 0.507 ± 0.009 |
| Bayes Cπ | 0.663 ± 0.005 | 0.532 ± 0.011 | 0.635 ± 0.007 | 0.564 ± 0.006 | 0.425 ± 0.011 | 0.652 ± 0.007 | 0.675 ± 0.005 | 0.285 ± 0.018 | 0.373 ± 0.011 | 0.824 ± 0.003 | 0.413 ± 0.015 | 0.567 ± 0.010 | 0.663 ± 0.006 | 0.420 ± 0.011 | 0.507 ± 0.009 |
| Bayes B | 0.647 ± 0.005 | 0.527 ± 0.012 | 0.605 ± 0.007 | 0.560 ± 0.006 | 0.404 ± 0.012 | 0.629 ± 0.008 | 0.673 ± 0.005 | 0.283 ± 0.018 | 0.375 ± 0.010 | 0.825 ± 0.003 | 0.328 ± 0.016 | 0.567 ± 0.010 | 0.664 ± 0.006 | 0.449 ± 0.012 | 0.475 ± 0.009 |
| Bayes LASSO | 0.666 ± 0.005 | 0.539 ± 0.011 | 0.638 ± 0.007 | 0.562 ± 0.006 | 0.413 ± 0.011 | 0.645 ± 0.007 | 0.673 ± 0.005 | 0.341 ± 0.018 | 0.378 ± 0.011 | 0.827 ± 0.003 | 0.469 ± 0.013 | 0.566 ± 0.009 | 0.664 ± 0.006 | 0.432 ± 0.012 | 0.509 ± 0.009 |
| Random Forest | 0.693 ± 0.005 | 0.573 ± 0.011 | 0.624 ± 0.007 | 0.626 ± 0.006 | 0.441 ± 0.011 | 0.681 ± 0.007 | 0.684 ± 0.005 | 0.373 ± 0.017 | 0.363 ± 0.010 | 0.851 ± 0.003 | 0.443 ± 0.016 | 0.614 ± 0.008 | 0.695 ± 0.006 | 0.436 ± 0.013 | 0.516 ± 0.009 |
| Ridge Regression | 0.794 ± 0.004 | 0.499 ± 0.011 | 0.392 ± 0.013 | 0.478 ± 0.007 | 0.490 ± 0.011 | 0.385 ± 0.010 | 0.301 ± 0.008 | 0.096 ± 0.014 | 0.378 ± 0.011 | 0.374 ± 0.007 | 0.356 ± 0.014 | 0.254 ± 0.011 | 0.325 ± 0.009 | 0.195 ± 0.014 | 0.371 ± 0.010 |
| Kinship Gauss | 0.808 ± 0.004 | 0.508 ± 0.010 | 0.390 ± 0.013 | 0.539 ± 0.006 | 0.480 ± 0.010 | 0.405 ± 0.011 | 0.304 ± 0.008 | 0.099 ± 0.014 | 0.370 ± 0.011 | 0.394 ± 0.007 | 0.354 ± 0.015 | 0.293 ± 0.010 | 0.320 ± 0.008 | 0.195 ± 0.015 | 0.378 ± 0.011 |
| Bayes Cπ | 0.799 ± 0.004 | 0.489 ± 0.011 | 0.368 ± 0.013 | 0.496 ± 0.006 | 0.474 ± 0.011 | 0.375 ± 0.011 | 0.304 ± 0.009 | 0.053 ± 0.014 | 0.362 ± 0.011 | 0.379 ± 0.007 | 0.350 ± 0.014 | 0.248 ± 0.011 | 0.324 ± 0.009 | 0.188 ± 0.013 | 0.369 ± 0.010 |
| Bayes B | 0.798 ± 0.004 | 0.456 ± 0.012 | 0.369 ± 0.013 | 0.510 ± 0.006 | 0.453 ± 0.012 | 0.346 ± 0.011 | 0.289 ± 0.009 | 0.072 ± 0.014 | 0.352 ± 0.011 | 0.395 ± 0.007 | 0.355 ± 0.016 | 0.239 ± 0.010 | 0.331 ± 0.009 | 0.176 ± 0.015 | 0.356 ± 0.012 |
| Bayes LASSO | 0.797 ± 0.004 | 0.505 ± 0.011 | 0.386 ± 0.013 | 0.476 ± 0.006 | 0.476 ± 0.012 | 0.384 ± 0.010 | 0.301 ± 0.008 | 0.049 ± 0.013 | 0.367 ± 0.011 | 0.376 ± 0.007 | 0.354 ± 0.015 | 0.259 ± 0.010 | 0.329 ± 0.009 | 0.150 ± 0.014 | 0.377 ± 0.011 |
| Random Forest | 0.815 ± 0.004 | 0.466 ± 0.012 | 0.375 ± 0.011 | 0.531 ± 0.007 | 0.448 ± 0.012 | 0.405 ± 0.010 | 0.254 ± 0.009 | 0.056 ± 0.015 | 0.346 ± 0.011 | 0.407 ± 0.007 | 0.341 ± 0.016 | 0.288 ± 0.010 | 0.300 ± 0.009 | 0.179 ± 0.014 | 0.354 ± 0.010 |
| Ridge Regression | 0.893 ± 0.002 | 0.609 ± 0.009 | 0.678 ± 0.008 | 0.797 ± 0.004 | 0.548 ± 0.009 | 0.512 ± 0.010 | 0.816 ± 0.004 | 0.441 ± 0.013 | 0.335 ± 0.013 | 0.909 ± 0.002 | 0.701 ± 0.008 | 0.726 ± 0.007 | 0.898 ± 0.002 | 0.641 ± 0.008 | 0.732 ± 0.007 |
| Kinship Gauss | 0.893 ± 0.002 | 0.626 ± 0.008 | 0.676 ± 0.008 | 0.798 ± 0.004 | 0.530 ± 0.009 | 0.506 ± 0.010 | 0.817 ± 0.004 | 0.443 ± 0.013 | 0.325 ± 0.013 | 0.912 ± 0.002 | 0.718 ± 0.007 | 0.723 ± 0.008 | 0.909 ± 0.002 | 0.672 ± 0.008 | 0.731 ± 0.006 |
| Bayes Cπ | 0.892 ± 0.002 | 0.611 ± 0.009 | 0.656 ± 0.008 | 0.797 ± 0.003 | 0.551 ± 0.010 | 0.509 ± 0.011 | 0.817 ± 0.004 | 0.442 ± 0.012 | 0.343 ± 0.012 | 0.909 ± 0.002 | 0.708 ± 0.007 | 0.715 ± 0.008 | 0.901 ± 0.002 | 0.637 ± 0.009 | 0.726 ± 0.007 |
| Bayes B | 0.887 ± 0.002 | 0.588 ± 0.009 | 0.630 ± 0.009 | 0.792 ± 0.004 | 0.559 ± 0.009 | 0.501 ± 0.010 | 0.816 ± 0.004 | 0.445 ± 0.012 | 0.375 ± 0.012 | 0.908 ± 0.002 | 0.688 ± 0.007 | 0.699 ± 0.009 | 0.903 ± 0.002 | 0.646 ± 0.008 | 0.704 ± 0.007 |
| Bayes LASSO | 0.892 ± 0.002 | 0.614 ± 0.008 | 0.674 ± 0.008 | 0.799 ± 0.004 | 0.553 ± 0.009 | 0.514 ± 0.010 | 0.817 ± 0.004 | 0.442 ± 0.013 | 0.332 ± 0.012 | 0.909 ± 0.002 | 0.703 ± 0.008 | 0.727 ± 0.008 | 0.900 ± 0.002 | 0.632 ± 0.010 | 0.735 ± 0.007 |
| Random Forest | 0.897 ± 0.002 | 0.647 ± 0.007 | 0.725 ± 0.007 | 0.801 ± 0.004 | 0.562 ± 0.009 | 0.556 ± 0.010 | 0.815 ± 0.004 | 0.478 ± 0.013 | 0.319 ± 0.014 | 0.912 ± 0.002 | 0.727 ± 0.008 | 0.745 ± 0.008 | 0.909 ± 0.001 | 0.652 ± 0.009 | 0.742 ± 0.007 |
| Ridge Regression | 0.523 ± 0.006 | 0.267 ± 0.012 | 0.261 ± 0.012 | 0.172 ± 0.008 | 0.093 ± 0.013 | 0.063 ± 0.011 | 0.166 ± 0.008 | 0.153 ± 0.013 | 0.243 ± 0.011 | 0.222 ± 0.008 | 0.053 ± 0.013 | 0.241 ± 0.011 | 0.604 ± 0.005 | 0.399 ± 0.010 | 0.697 ± 0.006 |
| Kinship Gauss | 0.522 ± 0.006 | 0.218 ± 0.012 | 0.251 ± 0.013 | 0.148 ± 0.008 | 0.062 ± 0.011 | 0.177 ± 0.012 | 0.138 ± 0.008 | 0.199 ± 0.012 | 0.232 ± 0.012 | 0.218 ± 0.008 | 0.023 ± 0.013 | 0.242 ± 0.012 | 0.603 ± 0.005 | 0.453 ± 0.009 | 0.690 ± 0.006 |
| Bayes Cπ | 0.520 ± 0.007 | 0.246 ± 0.013 | 0.262 ± 0.012 | 0.175 ± 0.008 | −0.004 ± 0.011 | −0.002 ± 0.011 | 0.163 ± 0.008 | 0.144 ± 0.012 | 0.236 ± 0.012 | 0.216 ± 0.008 | −0.060 ± 0.012 | 0.235 ± 0.011 | 0.602 ± 0.005 | 0.403 ± 0.010 | 0.687 ± 0.006 |
| Bayes B | 0.517 ± 0.006 | 0.227 ± 0.013 | 0.285 ± 0.012 | 0.171 ± 0.008 | −0.003 ± 0.012 | −0.010 ± 0.011 | 0.168 ± 0.008 | 0.141 ± 0.013 | 0.227 ± 0.011 | 0.209 ± 0.009 | −0.063 ± 0.012 | 0.239 ± 0.011 | 0.597 ± 0.005 | 0.406 ± 0.012 | 0.675 ± 0.006 |
| Bayes LASSO | 0.524 ± 0.006 | 0.262 ± 0.013 | 0.262 ± 0.012 | 0.182 ± 0.008 | 0.004 ± 0.012 | 0.001 ± 0.010 | 0.163 ± 0.007 | 0.143 ± 0.012 | 0.240 ± 0.011 | 0.216 ± 0.008 | −0.061 ± 0.012 | 0.247 ± 0.012 | 0.598 ± 0.006 | 0.408 ± 0.010 | 0.690 ± 0.006 |
| Random Forest | 0.493 ± 0.006 | 0.190 ± 0.012 | 0.258 ± 0.012 | 0.186 ± 0.008 | 0.104 ± 0.012 | 0.077 ± 0.011 | 0.165 ± 0.009 | 0.192 ± 0.013 | 0.131 ± 0.011 | 0.205 ± 0.009 | 0.161 ± 0.012 | 0.186 ± 0.013 | 0.606 ± 0.005 | 0.457 ± 0.010 | 0.655 ± 0.007 |
Figure 4Regression of true breeding value on breeding values estimated with different methods (A) for ICRISAT location; (B) for IARI location.