| Literature DB >> 26148055 |
Hongjun Liu1, Huangkai Zhou2, Yongsheng Wu3, Xiao Li4, Jing Zhao5, Tao Zuo6, Xuan Zhang2, Yongzhong Zhang1, Sisi Liu1, Yaou Shen1, Haijian Lin1, Zhiming Zhang1, Kaijian Huang3, Thomas Lübberstedt5, Guangtang Pan1.
Abstract
Genomic selection is a promising research area due to its practical application in breeding. In this study, impact of realized genetic relationship and linkage disequilibrium (LD) on marker density and training population size required was investigated and their impact on practical application was further discussed. This study is based on experimental data of two populations derived from the same two founder lines (B73, Mo17). Two populations were genotyped with different marker sets at different density: IBM Syn4 and IBM Syn10. A high-density marker set in Syn10 was imputed into the Syn4 population with low marker density. Seven different prediction scenarios were carried out with a random regression best linear unbiased prediction (RR-BLUP) model. The result showed that the closer the real genetic relationship between training and validation population, the fewer markers were required to reach a good prediction accuracy. Taken the short-term cost for consideration, relationship information is more valuable than LD information. Meanwhile, the result indicated that accuracies based on high LD between QTL and markers were more stable over generations, thus LD information would provide more robust prediction capacity in practical applications.Entities:
Mesh:
Year: 2015 PMID: 26148055 PMCID: PMC4493124 DOI: 10.1371/journal.pone.0132379
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genetic variance (σ 2) and heritability (h ) in different populations.
| Population | Number of progenies | Total genetic distance (cM) | GDD | PH | ||
|---|---|---|---|---|---|---|
|
|
|
|
| |||
| Syn10 | 194 | 11,198.50 | 895.96 | 0.79 | 279.18 | 0.92 |
| Syn4 | 244 | 6,240 | 1,074.74 | 0.81 | 298.81 | 0.90 |
| Mixed population | 438 | - | 1,022.71 | 0.80 | 296.21 | 0.90 |
**Significantly different from zero at the 0.01 level of probability.
Scenarios for GS tests.
| Scenario | Training population | Validation population | Prediction Type |
|---|---|---|---|
| 1 | Syn10 | Syn10 | Within population |
| 2 | Syn4 | Syn4 | Within population |
| 3 | Syn10 | Syn4 | Between populations |
| 4 | Syn4 | Syn10 | Between populations |
| 5 | Mixed population | Mixed population | Across populations |
| 6 | Mixed population | Syn10 | Across populations |
| 7 | Mixed population | Syn4 | Across populations |
a Mixed population was a combination of the Syn4 and Syn10 populations
b Lines for training and validation came from the same population
c Lines for training and validation came from different populations
d Lines for training came from both populations.
Average distance and LD between adjacent markers in different marker sets.
| Marker set | Physical Distance | Genetic Distance | Genetic Distance | LD | LD in Mixed | LD in Syn4 |
|---|---|---|---|---|---|---|
| ( | (Mb) | in Syn4 (cM) | in Syn10 (cM) | ( | Population ( | ( |
| 6,611 | 0.31 | 0.94 | 1.69 | 0.78 | 0.81 | 0.87 |
| 3,200 | 0.64 | 1.95 | 3.50 | 0.66 | 0.71 | 0.78 |
| 1,600 | 1.28 | 3.90 | 7.00 | 0.50 | 0.55 | 0.63 |
| 800 | 2.57 | 7.80 | 14.00 | 0.32 | 0.37 | 0.44 |
| 400 | 5.13 | 15.60 | 28.00 | 0.19 | 0.22 | 0.27 |
| 200 | 10.26 | 31.20 | 55.99 | 0.09 | 0.11 | 0.15 |
| 100 | 20.53 | 62.40 | 111.99 | 0.05 | 0.06 | 0.08 |
a Average physical distance (in Mb) between adjacent markers.
b Average genetic distance (in cM) between adjacent markers.
c Linkage disequilibrium as estimated by the mean pairwise r values between adjacent markers.
Fig 1LD decay measured in different populations.
Fig 2Prediction accuracies depending on marker number (N ) and training population size (N ) for different scenarios and traits.