| Literature DB >> 31638889 |
Hailiang Song1, Shaopan Ye2, Yifan Jiang1, Zhe Zhang2, Qin Zhang3, Xiangdong Ding4.
Abstract
BACKGROUND: For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP)-chip genotype data using combined populations with different genetic backgrounds or from different breeds have not shown a clear advantage over using within-population or within-breed predictions. The increasing availability of whole-genome sequencing (WGS) data provides new opportunities for combined population genomic prediction. Our objective was to investigate the accuracy of genomic prediction using imputation-based WGS data from combined populations in pigs. Using 80K SNP panel genotypes, WGS genotypes, or genotypes on WGS variants that were pruned based on linkage disequilibrium (LD), three methods [genomic best linear unbiased prediction (GBLUP), single-step (ss)GBLUP, and genomic feature (GF)BLUP] were implemented with different prior information to identify the best method to improve the accuracy of genomic prediction for combined populations in pigs.Entities:
Mesh:
Year: 2019 PMID: 31638889 PMCID: PMC6805481 DOI: 10.1186/s12711-019-0500-8
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Summary of the three Yorkshire populations, numbers of genotyped animals, and estimates of heritability ()
| Populationa (number of animals in the pedigree) | Origin | Trait | N-obs | Birth year | Genotyped animals | |
|---|---|---|---|---|---|---|
| LM (72,998) | USA | NBA | 5907/19,660 | 2004–2016 | 1641 | 0.08 (0.01) |
| TNB | 5907/19,660 | 1641 | 0.09 (0.01) | |||
| AGE | 28,827 | 2007–2016 | 1769 | 0.38 (0.02) | ||
| BFT | 28,827 | 1769 | 0.36 (0.02) | |||
| XD (51,964) | UK | NBA | 4842/18,369 | 2004–2015 | 762 | 0.07 (0.01) |
| TNB | 4842/18,369 | 762 | 0.07 (0.01) | |||
| ZX (16,914) | USA | AGE | 6721 | 2012–2016 | 320 | 0.19 (0.03) |
| BFT | 6721 | 320 | 0.24 (0.03) |
NBA number of piglets born alive, TNB total number of piglets born, AGE days to 100 kg, BFT backfat thickness, N-obs number of individuals/observations
aYorkshire populations from three Chinese pig breeding farms
Fig. 1Principal component analysis (PCA) for three Yorkshire populations. XD, LM and ZX represent three Yorkshire populations from three Chinese pig breeding farms; PC1 (3.9%) = first principal component (variance explained by PC1 = 3.9%); PC2 (2.1%) = second principal component (variance explained by PC2 = 2.1%)
Fig. 2Imputation accuracy for each minor allele frequency (MAF) interval. Genotype concordance rate (CR), which was defined as the proportion of genotypes of the imputed variants that were the same as the whole-genome sequencing variants. AR2, allelic R-squared for consistent variants between imputation and whole-genome sequencing. AR2_ALL, allelic R-squared for all imputed variants
Accuracy of genomic prediction for reproduction traits using different reference populations to predict 223 younger individuals of the LM population
| Reference seta (size of reference population) | Method | Trait | |
|---|---|---|---|
| NBA | TNB | ||
| LM (1418) | GBLUP_80K | 0.453 | 0.450 |
| GBLUP_WGS | 0.456 | 0.452 | |
| GBLUP_LD | 0.470 | 0.464 | |
| GFBLUP_QTL | 0.473 | 0.466 | |
| GFBLUP_GWAS | 0.464 | 0.459 | |
| ssGBLUP_80K | 0.649 | 0.664 | |
| ssGBLUP_WGS | 0.641 | 0.658 | |
| ssGBLUP_LD | 0.662 | 0.679 | |
| LM+XD (2180) | GBLUP_80K | 0.459 | 0.460 |
| GBLUP_WGS | 0.467 | 0.468 | |
| GBLUP_LD | 0.478 | 0.479 | |
| GFBLUP_QTL | 0.478 | 0.471 | |
| GFBLUP_GWAS | 0.462 | 0.455 | |
| ssGBLUP_80K | 0.646 | 0.664 | |
| ssGBLUP_WGS | 0.648 | 0.667 | |
| ssGBLUP_LD | 0.668 | 0.686 | |
| XD (762) | GBLUP_80K | 0.026 | − 0.016 |
| GBLUP_WGS | 0.042 | − 0.001 | |
| GBLUP_LD | 0.050 | 0.028 | |
| GFBLUP_QTL | − 0.017 | − 0.058 | |
| GFBLUP_GWAS | − 0.008 | − 0.009 | |
GBLUP_80K genomic BLUP based on the 80K SNP panel; GBLUP_WGS GBLUP based on imputed whole-genome sequencing (WGS) data; GBLUP_LD GBLUP based on WGS after LD pruning; GFBLUP_QTL genomic feature BLUP with a genomic feature matrix constructed based on QTL prior information; GFBLUP_GWAS genomic feature BLUP with a genomic feature matrix constructed based on GWAS prior information; ssGBLUP_80K, ssGBLUP_WGS, ssGBLUP_LD For ssGBLUP, the H matrix was constructed based on the different genomic relationship matrices of the 80K chip panel, WGS data and WGS data after LD pruning and termed ssGBLUP_80K, ssGBLUP_WGS and ssGBLUP_LD, respectively; NBA number of piglets born alive; TNB total number of piglets born
aYorkshire population LM, XD or LM plus the XD reference population
Accuracy of genomic prediction for reproduction traits using different reference populations to predict 196 younger individuals of the XD population
| Reference seta (size of reference population) | Method | Trait | |
|---|---|---|---|
| NBA | TNB | ||
| XD (566) | GBLUP_80K | 0.392 | 0.431 |
| GBLUP_WGS | 0.390 | 0.425 | |
| GBLUP_LD | 0.396 | 0.435 | |
| GFBLUP_QTL | 0.398 | 0.435 | |
| GFBLUP_GWAS | 0.378 | 0.425 | |
| ssGBLUP_80K | 0.441 | 0.465 | |
| ssGBLUP_WGS | 0.439 | 0.457 | |
| ssGBLUP_LD | 0.440 | 0.462 | |
| LM+XD (2207) | GBLUP_80K | 0.387 | 0.439 |
| GBLUP_WGS | 0.407 | 0.454 | |
| GBLUP_LD | 0.403 | 0.455 | |
| GFBLUP_QTL | 0.407 | 0.458 | |
| GFBLUP_GWAS | 0.429 | 0.494 | |
| ssGBLUP_80K | 0.451 | 0.480 | |
| ssGBLUP_WGS | 0.455 | 0.481 | |
| ssGBLUP_LD | 0.453 | 0.483 | |
| LM (1641) | GBLUP_80K | 0.121 | 0.214 |
| GBLUP_WGS | 0.204 | 0.275 | |
| GBLUP_LD | 0.152 | 0.228 | |
| GFBLUP_QTL | 0.163 | 0.232 | |
| GFBLUP_GWAS | 0.221 | 0.282 | |
GBLUP_80K genomic BLUP based on the 80K SNP panel; GBLUP_WGS GBLUP based on imputed whole-genome sequencing (WGS) data; GBLUP_LD GBLUP based on WGS after LD pruning; GFBLUP_QTL genomic feature BLUP with a genomic feature matrix constructed based on QTL prior information; GFBLUP_GWAS genomic feature BLUP with a genomic feature matrix constructed based on GWAS prior information; ssGBLUP_80K, ssGBLUP_WGS, ssGBLUP_LD For ssGBLUP, the H matrix was constructed based on the different genomic relationship matrices of the 80K chip panel, WGS data and WGS data after LD pruning and termed ssGBLUP_80K, ssGBLUP_WGS and ssGBLUP_LD, respectively; NBA number of piglets born alive; TNB total number of piglets born
aYorkshire population LM, XD or LM plus the XD reference population
Accuracy of genomic prediction for production traits using different reference populations to predict 270 younger individuals of the LM population
| Reference seta (size of reference population) | Method | Trait | |
|---|---|---|---|
| AGE | BFT | ||
| LM (1499) | GBLUP_80K | 0.625 | 0.432 |
| GBLUP_WGS | 0.633 | 0.436 | |
| GBLUP_LD | 0.642 | 0.445 | |
| GFBLUP_QTL | 0.642 | 0.444 | |
| GFBLUP_GWAS | 0.640 | 0.444 | |
| ssGBLUP_80K | 0.739 | 0.677 | |
| ssGBLUP_WGS | 0.738 | 0.663 | |
| ssGBLUP_LD | 0.745 | 0.671 | |
| LM+ZX (1819) | GBLUP_80K | 0.615 | 0.431 |
| GBLUP_WGS | 0.636 | 0.443 | |
| GBLUP_LD | 0.648 | 0.454 | |
| GFBLUP_QTL | 0.638 | 0.443 | |
| GFBLUP_GWAS | 0.635 | 0.452 | |
| ssGBLUP_80K | 0.737 | 0.677 | |
| ssGBLUP_WGS | 0.739 | 0.664 | |
| ssGBLUP_LD | 0.746 | 0.673 | |
| ZX (320) | GBLUP_80K | 0.028 | 0.067 |
| GBLUP_WGS | 0.098 | 0.111 | |
| GBLUP_LD | 0.164 | 0.142 | |
| GFBLUP_QTL | 0.164 | 0.144 | |
| GFBLUP_GWAS | 0.124 | 0.119 | |
GBLUP_80K genomic BLUP based on the 80K SNP panel; GBLUP_WGS GBLUP based on imputed whole-genome sequencing (WGS) data; GBLUP_LD GBLUP based on WGS after LD pruning; GFBLUP_QTL genomic feature BLUP with a genomic feature matrix constructed based on QTL prior information; GFBLUP_GWAS genomic feature BLUP with a genomic feature matrix constructed based on GWAS prior information; ssGBLUP_80K, ssGBLUP_WGS, ssGBLUP_LD For ssGBLUP, the H matrix was constructed based on the different genomic relationship matrices of the 80K chip panel, WGS data and WGS data after LD pruning and termed ssGBLUP_80K, ssGBLUP_WGS and ssGBLUP_LD, respectively; AGE days to 100 kg; BFT backfat thickness
aYorkshire population of LM, ZX or LM plus the ZX reference population