| Literature DB >> 29868066 |
Norman Philipp1, Stephan Weise2, Markus Oppermann2, Andreas Börner2, Andreas Graner2, Jens Keilwagen3, Benjamin Kilian4, Yusheng Zhao1, Jochen C Reif1, Albert W Schulthess1.
Abstract
Genebanks are a rich source of genetic variation. Most of this variation is absent in breeding programs but may be useful for further crop plant improvement. However, the lack of phenotypic information forms a major obstacle for the educated choice of genebank accessions for research and breeding. A promising approach to fill this information gap is to exploit historical information gathered routinely during seed regeneration cycles. Still, this data is characterized by a high non-orthogonality hampering their analysis. By examining historical data records for flowering time, plant height, and thousand grain weight collected during 70 years of regeneration of 6,207 winter wheat (Triticum aestivum L.) accessions at the German Federal ex situ Genebank, we aimed to elaborate a strategy to analyze and validate non-orthogonal historical data in order to charge genebank information platforms with high quality ready-to-use phenotypic information. First, a three-step quality control assessment considering the plausibility of trait values and a standard as well as a weather parameter index based outlier detection was implemented, resulting in heritability estimates above 0.90 for all three traits. Then, the data was analyzed by estimating best linear unbiased estimations (BLUEs) applying a linear mixed-model approach. An in silico resampling study mimicking different missing data patterns revealed that accessions should be regenerated in a random fashion and not blocked by origin or acquisition date in order to minimize estimation biases in historical data sets. Validation data was obtained from multi-environmental orthogonal field trials considering a random subsample of 3,083 accessions. Correlations above 0.84 between BLUEs estimated for historical data and validation trials outperformed previous approaches and confirmed the robustness of our strategy as well as the high quality of the historical data. The results indicate that the IPK winter wheat collection reveals an extraordinary high phenotypic diversity compared to other collections. The quality checked ready-to-use phenotypic information resulting from this study is the first brick to extend traditional, conservation driven genebanks into bio-digital resource centers.Entities:
Keywords: bio-digital resource center; data quality assessment; genebank; genetic resources; historical data; winter wheat
Year: 2018 PMID: 29868066 PMCID: PMC5953327 DOI: 10.3389/fpls.2018.00609
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Second-degree statistics for flowering time (FT), plant height (PH), and thousand grain weight (TGW) before and after outlier correction, where refers to the variance of the years, denotes the genetic variance, is the variance of the residuals, Number of years is the average number of years when accessions were regenerated, and h2 refers to the heritability.
| Source | FT | PH | TGW |
|---|---|---|---|
| 72.36∗∗∗ | 228.75∗∗∗ | 17.91∗∗∗ | |
| 15.23∗∗∗ | 342.15∗∗∗ | 28.98∗∗∗ | |
| 9.08 | 102.62 | 16.13 | |
| Number of years | 5.13 | 5.05 | 4.42 |
| h2 | 0.90 | 0.94 | 0.89 |
| 71.95∗∗∗ | 230.99∗∗∗ | 15.02∗∗∗ | |
| 15.62∗∗∗ | 346.72∗∗∗ | 29.67∗∗∗ | |
| 6.48 | 98.53 | 13.76 | |
| Number of years | 5.09 | 5.04 | 4.30 |
| h2 | 0.92 | 0.95 | 0.90 |
Pearson’s correlation coefficients among best linear unbiased estimations of validation trials (BLUESValidation) and best linear unbiased estimations of the historical data (BLUESHistorical), the arithmetic means of the historical data (MeanHistorical), and the normalized rank product of the historical data (NRPHistorical) for flowering time (FT) and plant height (PH).
| Source | BLUESValidation | |
|---|---|---|
| FT | PH | |
| BLUESHistorical | 0.84∗∗∗ | 0.87∗∗∗ |
| MeanHistorical | 0.70∗∗∗ | 0.83∗∗∗ |
| NRPHistorical | 0.76∗∗∗ | 0.76∗∗∗ |