| Literature DB >> 36077985 |
Muhammad Yasir Nawaz1, Priscila Arrigucci Bernardes2, Rodrigo Pelicioni Savegnago3, Dajeong Lim4, Seung Hwan Lee5, Cedric Gondro3.
Abstract
This study evaluated the accuracy of sequence imputation in Hanwoo beef cattle using different reference panels: a large multi-breed reference with no Hanwoo (n = 6269), a much smaller Hanwoo purebred reference (n = 88), and both datasets combined (n = 6357). The target animals were 136 cattle both sequenced and genotyped with the Illumina BovineSNP50 v2 (50K). The average imputation accuracy measured by the Pearson correlation (R) was 0.695 with the multi-breed reference, 0.876 with the purebred Hanwoo, and 0.887 with the combined data; the average concordance rates (CR) were 88.16%, 94.49%, and 94.84%, respectively. The accuracy gains from adding a large multi-breed reference of 6269 samples to only 88 Hanwoo was marginal; however, the concordance rate for the heterozygotes decreased from 85% to 82%, and the concordance rate for fixed SNPs in Hanwoo also decreased from 99.98% to 98.73%. Although the multi-breed panel was large, it was not sufficiently representative of the breed for accurate imputation without the Hanwoo animals. Additionally, we evaluated the value of high-density 700K genotypes (n = 991) as an intermediary step in the imputation process. The imputation accuracy differences were negligible between a single-step imputation strategy from 50K directly to sequence and a two-step imputation approach (50K-700K-sequence). We also observed that imputed sequence data can be used as a reference panel for imputation (mean R = 0.9650, mean CR = 98.35%). Finally, we identified 31 poorly imputed genomic regions in the Hanwoo genome and demonstrated that imputation accuracies were particularly lower at the chromosomal ends.Entities:
Keywords: Hanwoo cattle; imputation accuracy; whole-genome sequence data
Year: 2022 PMID: 36077985 PMCID: PMC9454883 DOI: 10.3390/ani12172265
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 3.231
Figure 1IVDP pipeline for variant calling from Illumina whole-genome short sequencing reads.
Figure 2Plot of first two principal components of the large multi-breed reference including Hanwoo animals.
Concordance rate in percentage between imputed and real genotypes for imputation from 50K to 700K (991 700K reference).
| AAimp | ABimp | BBimp | |
|---|---|---|---|
|
| 99.57 | 0.41 | 0.02 |
|
| 0.48 | 99.03 | 0.48 |
|
| 0.03 | 0.33 | 99.65 |
imp: imputed genotypes; ref: reference genotypes.
Figure 3(A) Concordance and correlation of samples for one-step and two-step imputation. (B) Concordance by bins allele frequency (0.01 interval) for one-step and two-step imputation.
Concordance rate in percentage between imputed sequence and real sequence of 136 animals for one-step (224 WGS reference) and two-step imputation (224 WGS and 991 700K reference).
| One-Step | Two-Step | ||||||
|---|---|---|---|---|---|---|---|
| AAimp | ABimp | BBimp | AAimp | ABimp | BBimp | ||
|
| 97.14 | 2.65 | 0.22 |
| 95.69 | 3.98 | 0.33 |
|
| 1.10 | 94.85 | 4.05 |
| 1.4 | 91.82 | 6.78 |
|
| 0.03 | 1.07 | 98.91 |
| 0.02 | 1.09 | 98.89 |
imp: imputed genotypes; ref: reference genotypes.
Figure 4Plot of the first two principal components of the genomic relationship matrix of the sequence data and the imputed genotypes using the one- and two-step approaches.
Concordance rate in percentage between imputed and real genotypes for 224 Hanwoo and 6269 multi-breed animals in reference.
| AAimp | ABimp | BBimp | |
|---|---|---|---|
|
| 97.75 | 2.06 | 0.19 |
|
| 0.88 | 94.8 | 4.33 |
|
| 0.01 | 0.78 | 99.2 |
imp imputed genotypes; ref reference genotypes.
Concordance rate in percentage between imputed and real genotypes for multi-breed reference panels with (88 Hanwoo, 6269 multi-breed) and without Hanwoo (6269 multi-breed) samples.
| Multi-Breed without Hanwoo | Multi-Breed with Hanwoo | ||||||
|---|---|---|---|---|---|---|---|
| AAimp | ABimp | BBimp | AAimp | ABimp | BBimp | ||
|
| 76.03 | 18.97 | 5.01 |
| 90.45 | 8.84 | 0.71 |
|
| 12.29 | 56.95 | 30.76 |
| 5.15 | 82.15 | 12.7 |
|
| 0.71 | 8.09 | 91.2 |
| 0.1 | 3.23 | 96.67 |
imp: imputed genotypes; ref: reference genotypes.
Concordance rate in percentage between imputed and real genotypes with a reference panel of 88 purebred Hanwoo.
| AAimp | ABimp | BBimp | |
|---|---|---|---|
|
| 89.21 | 9.93 | 0.86 |
|
| 4.63 | 85.82 | 9.54 |
|
| 0.18 | 4.14 | 95.68 |
imp: imputed genotypes; ref: reference genotypes.
Concordance rate in percentage between imputed and real genotypes with a reference panel of 190 purebred Hanwoo.
| AAimp | ABimp | BBimp | |
|---|---|---|---|
|
| 93.69 | 5.91 | 0.40 |
|
| 2.47 | 89.54 | 8.00 |
|
| 0.04 | 1.93 | 98.03 |
imp: imputed genotypes; ref: reference genotypes.
Concordance rate in percentage between imputed and real genotypes of 136 target animals with a reference panel of 9596 purebred Hanwoo that were previously imputed from 50K to sequence.
| AAimp | ABimp | BBimp | |
|---|---|---|---|
|
| 97.11 | 2.67 | 0.22 |
|
| 1.11 | 94.79 | 4.10 |
|
| 0.03 | 1.07 | 98.90 |
imp: imputed genotypes; ref: reference genotypes.
Figure 5Smoothed concordance rate across 29 autosomes in Hanwoo cattle.
Figure 6Plot of imputation accuracy against distance of SNPs from nearest end of chromosome (Mb).
Chromosomal position of poorly imputed genomic regions in Hanwoo.
| CHR | Start (BP) | End (BP) | Length (BP) | No. of SNPs |
|---|---|---|---|---|
| 2 | 81,496,310 | 81,511,148 | 14,838 | 273 |
| 3 | 11,716,513 | 11,776,690 | 60,177 | 452 |
| 4 | 105,573,863 | 105,589,511 | 15,648 | 1422 |
| 4 | 112,830,880 | 113,341,416 | 510,536 | 13,917 |
| 5 | 92,617,024 | 92,629,684 | 12,660 | 236 |
| 6 | 55,737,025 | 55,758,674 | 21,649 | 508 |
| 7 | 10,758,277 | 11,039,089 | 280,812 | 4584 |
| 7 | 95,736,166 | 95,742,201 | 6035 | 163 |
| 8 | 29,160,922 | 29,183,317 | 22,395 | 559 |
| 9 | 10,593,065 | 10,596,144 | 3079 | 59 |
| 9 | 104,358,861 | 104,380,205 | 21,344 | 671 |
| 10 | 459,493 | 521,352 | 61,859 | 846 |
| 10 | 22,908,467 | 23,406,397 | 497,930 | 2334 |
| 10 | 100,209,890 | 101,216,542 | 1,006,652 | 2146 |
| 11 | 40,903,028 | 40,905,284 | 2256 | 72 |
| 12 | 70,443,588 | 70,455,007 | 11,419 | 713 |
| 12 | 71,728,153 | 71,735,710 | 7557 | 719 |
| 14 | 82,178,440 | 82,180,992 | 2552 | 60 |
| 15 | 45,854,158 | 46,106,343 | 252,185 | 3506 |
| 15 | 78,379,540 | 79,226,472 | 846,932 | 398 |
| 16 | 372,932 | 418,633 | 45,701 | 1473 |
| 16 | 5,775,482 | 5,778,367 | 2885 | 53 |
| 17 | 14,049,911 | 14,159,496 | 109,585 | 4297 |
| 17 | 37,810,468 | 39,363,461 | 1,552,993 | 4880 |
| 18 | 62,688,950 | 62,690,650 | 1700 | 5 |
| 19 | 927,489 | 945,870 | 18,381 | 537 |
| 20 | 49,746,665 | 49,760,276 | 13,611 | 432 |
| 22 | 897,334 | 922,574 | 25,240 | 738 |
| 23 | 15,964,918 | 15,976,090 | 11,172 | 138 |
| 23 | 25,875,394 | 26,511,965 | 636,571 | 2223 |
| 29 | 51,092,637 | 51,093,495 | 858 | 23 |