| Literature DB >> 24065982 |
Matthew C McClure1, Tad S Sonstegard, George R Wiggans, Alison L Van Eenennaam, Kristina L Weber, Cecilia T Penedo, Donagh P Berry, John Flynn, Jose F Garcia, Adriana S Carmo, Luciana C A Regitano, Milla Albuquerque, Marcos V G B Silva, Marco A Machado, Mike Coffey, Kirsty Moore, Marie-Yvonne Boscher, Lucie Genestout, Raffaele Mazza, Jeremy F Taylor, Robert D Schnabel, Barry Simpson, Elisa Marques, John C McEwan, Andrew Cromie, Luiz L Coutinho, Larry A Kuehn, John W Keele, Emily K Piper, Jim Cook, Robert Williams, Curtis P Van Tassell.
Abstract
To assist cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistancies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes, by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset.Entities:
Keywords: SNP; STR; imputation; microsatellite; parentage verification
Year: 2013 PMID: 24065982 PMCID: PMC3776237 DOI: 10.3389/fgene.2013.00176
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Sample breed counts.
| Abondance | 165 | 7 | – | 2.04 |
| Angus | 359 | 288 | 16 | 4.44 |
| Ankole-Watusi | – | – | 15 | 0.00 |
| Aubrac | 234 | 5 | – | 2.90 |
| Ayshire | 71 | 510 | – | 0.88 |
| Bazadaise | 53 | 27 | – | 0.66 |
| Beefmaster | 17 | 17 | – | 0.21 |
| Belgian Blue | 169 | 39 | 12 | 2.09 |
| Belmont Red | – | 40 | – | 0.00 |
| Blonde D'Aquitaine | 201 | 24 | – | 2.49 |
| 29 | 106 | – | 0.36 | |
| Brahman | 358 | 31 | – | 4.43 |
| Brangus | 8 | – | – | 0.10 |
| Braunvieh | 16 | 1 | – | 0.20 |
| Bretonne Pie Noire | 16 | 11 | – | 0.20 |
| Brown Swiss | 33 | 75 | – | 0.41 |
| Brune Des Alpes | 109 | – | – | 1.35 |
| Charolais | 1092 | 340 | 14 | 13.52 |
| Chiangus | – | 19 | – | 0.00 |
| Devon | – | – | 16 | 0.00 |
| Dexter | – | – | 15 | 0.00 |
| Friesian | 35 | 140 | – | 0.43 |
| Gasconne | 142 | – | – | 1.76 |
| Gelbvieh | 24 | 16 | – | 0.30 |
| Gir | 125 | 114 | – | 1.55 |
| Guernsey | 18 | 94 | – | 0.22 |
| Hereford | 251 | 589 | – | 3.11 |
| Holstein | 528 | 2103 | 5 | 6.54 |
| Jersey | 48 | 48 | – | 0.59 |
| Kerry | – | 1 | – | 0.00 |
| Lagunair | – | 5 | – | 0.00 |
| Limousin | 1599 | 557 | – | 19.80 |
| Maine-Anjou | – | 19 | 16 | 0.00 |
| Montbeliarde | 251 | 6 | – | 3.11 |
| Murray Grey | – | 22 | – | 0.00 |
| Ndama | – | 24 | – | 0.00 |
| Nelore | 124 | 1739 | – | 1.54 |
| Normande | 243 | 13 | – | 3.01 |
| Norwegian Red | – | 17 | – | 0.00 |
| Parthenaise | 218 | 73 | – | 2.70 |
| Pie Rouge Des Plaines | 116 | 44 | – | 1.44 |
| Piedmontese | 24 | – | – | 0.30 |
| Red Angus | 46 | 9 | – | 0.57 |
| Romagnola | – | 24 | – | 0.00 |
| Rouge Flamande | 41 | – | – | 0.51 |
| Salers | 234 | 24 | – | 2.90 |
| Santa Gertrudis | – | 97 | – | 0.00 |
| Sheko | – | 18 | – | 0.00 |
| Shorthorn | 17 | 170 | – | 0.21 |
| Simmental | 521 | 217 | – | 6.45 |
| Swedish Red | 2 | 3 | – | 0.02 |
| Tarentaise | 155 | 12 | – | 1.92 |
| Texas Longhorn | – | – | 13 | 0.00 |
| Tropical Composite | 336 | – | 4.16 | |
| Vosgienne | 49 | 4 | – | 0.61 |
| Unknown taurine | – | 880 | – | 0.00 |
| Total | 8077 | 8622 | 122 | 100.00 |
Microsatellite (MS) imputation haplotype information.
| 1 | 10 | 7035 | 963 | 70 | 264,604 | 551 | 50.8 | 91.9 | 2.42 | 19 | 7.53 | 27 | |
| 2 | 12 | 7069 | 962 | 50 | 143,822 | 1375 | 42.5 | 87.4 | 2.13 | 12 | 4.24 | 17 | |
| 3 | 15 | 7006 | 855 | 110 | 401,960 | 1158 | 38.9 | 87.7 | 1.78 | 11 | 4.18 | 21 | |
| 5 | 11 | 6875 | 962 | 80 | 377,892 | 582 | 46.7 | 92.6 | 2.34 | 14 | 6.04 | 27 | |
| 9 | 12 | 7055 | 960 | 110 | 429,755 | 1569 | 35.2 | 82.6 | 1.74 | 10 | 3.62 | 22 | |
| 15 | 13 | 6971 | 965 | 40 | 95,748 | 461 | 46.6 | 90.9 | 2.99 | 23 | 6.43 | 36 | |
| 16 | 21 | 2902 | 302 | 80 | 299,752 | 403 | 26.8 | 57.1 | 1.80 | 10 | 3.13 | 16 | |
| 18 | 18 | 7053 | 966 | 40 | 140,730 | 1071 | 39.6 | 85.6 | 1.94 | 18 | 5.40 | 18 | |
| 19 | 14 | 3850 | 101 | 80 | 267,330 | 491 | 30.1 | 67.7 | 1.46 | 6 | 3.15 | 10 | |
| 20 | 12 | 6808 | 943 | 90 | 263,494 | 619 | 41.5 | 79.5 | 2.22 | 14 | 7.51 | 35 | |
| 21 | 26 | 7056 | 966 | 50 | 145,453 | 605 | 42.4 | 86.5 | 2.31 | 17 | 5.98 | 33 | |
| 23 | 11 | 1463 | 940 | 80 | 269,309 | 525 | 42.1 | 88.6 | 4.52 | 12 | 3.43 | 11 | |
| Ave | 15 | 5929 | 824 | 73 | 258,321 | 784 | 40 | 83 | 2.30 | 14 | 5.05 | 23 | |
Count of unique MS alleles observed in reference population.
Count of Bos taurus reference animals with genotype for given MS.
Count of Bos indicus reference animals with genotype for given MS.
Number of SNPs included in haplotype centered on MS.
Ba se pair size of the SNP haplotype.
Number of unique SNP haplotypes in the reference population.
Percent of haplotypes in the reference population that met the haplotype identification selection criteria.
Percent of total BT reference population haplotypes that were tallied.
Average number of breeds in the reference population having a MS-SNP haplotype where the SNP haplotype is linked to 1 MS 100% of the time or > 1 MS and matches 1 MS allele > 90% of the time.
Figure 1Count of MS-SNP haplotypes present in at least one breed across the whole reference population.
Microsatellite genotype concordance accuracies for animals with microsatellite alleles imputed from GGP-LD SNP.
| Angus | BT | 20 | 80.73 | 100.00 | 58.33 |
| 100 | 80.73 | 100.00 | 58.33 | ||
| BT + BI | 20 | 80.73 | 100.00 | 58.33 | |
| 100 | 80.73 | 100.00 | 58.33 | ||
| Belgian Blue | BT | 20 | 72.92 | 91.67 | 50.00 |
| 100 | 73.61 | 91.67 | 50.00 | ||
| BT + BI | 20 | 73.61 | 91.67 | 58.33 | |
| 100 | 72.92 | 91.67 | 58.33 | ||
| Charolais | BT | 20 | 73.81 | 91.67 | 58.33 |
| 100 | 75.00 | 91.67 | 58.33 | ||
| BT + BI | 20 | 76.79 | 100.00 | 58.33 | |
| 100 | 76.19 | 91.67 | 58.33 | ||
| Devon | BT | 20 | 65.63 | 83.33 | 41.67 |
| 100 | 66.67 | 83.33 | 41.67 | ||
| BT + BI | 20 | 68.23 | 83.33 | 50.00 | |
| 100 | 69.79 | 83.33 | 50.00 | ||
| Dexter | BT | 20 | 61.11 | 83.33 | 41.67 |
| 100 | 60.56 | 83.33 | 41.67 | ||
| BT + BI | 20 | 60.00 | 83.33 | 41.67 | |
| 100 | 58.89 | 83.33 | 41.67 | ||
| Holstein | BT | 20 | 81.67 | 100.00 | 41.67 |
| 100 | 81.67 | 100.00 | 41.67 | ||
| BT + BI | 20 | 78.33 | 100.00 | 25.00 | |
| 100 | 78.33 | 100.00 | 25.00 | ||
| Maine-Anjou | BT | 20 | 66.15 | 91.67 | 41.67 |
| 100 | 66.15 | 91.67 | 33.33 | ||
| BT + BI | 20 | 68.75 | 91.67 | 41.67 | |
| 100 | 67.71 | 91.67 | 41.67 | ||
| Texas Longhorn | BT | 20 | 43.59 | 66.67 | 25.00 |
| 100 | 42.95 | 66.67 | 25.00 | ||
| BT + BI | 20 | 53.85 | 75.00 | 33.33 | |
| 100 | 53.85 | 75.00 | 33.33 | ||
| Ankole-Watusi | BT | 20 | 32.22 | 58.33 | 8.33 |
| 100 | 28.33 | 41.67 | 8.33 | ||
| BT + BI | 20 | 57.22 | 91.67 | 41.67 | |
| 100 | 52.78 | 83.33 | 33.33 |
The min SNP set was used for GGP imputations.
BT, Bos taurus breeds; BT + BI, Bos taurus and Bos indicus breeds.
Number of BEAGLE iterations.
Mendelian inheritance accuracy by microsatellite and imputation strategy.
| 777 | 786 | 95.50 | 95.88 | 96.65 | 96.91 | 96.95 | 96.69 | 96.52 | 552 | 859 | 98.91 | 98.84 | ||
| 1283 | 1293 | 96.34 | 96.34 | 96.49 | 96.65 | 96.37 | 96.52 | 96.58 | 3129 | 3446 | 98.98 | 98.69 | ||
| 1263 | 1273 | 95.80 | 95.57 | 92.32 | 96.28 | 96.28 | 95.57 | 3099 | 3416 | 98.52 | 98.59 | |||
| 1226 | 1237 | 96.41 | 96.08 | 96.82 | 96.90 | 96.93 | 96.77 | 96.70 | 3037 | 3352 | 98.52 | 98.57 | ||
| 1274 | 1285 | 95.84 | 96.15 | 96.70 | 96.86 | 96.26 | 95.95 | 96.39 | 3115 | 3432 | 98.81 | 98.63 | ||
| 1113 | 1121 | 96.59 | 96.41 | 96.41 | 96.32 | 96.34 | 96.61 | 96.50 | 1913 | 1923 | 98.01 | 97.97 | ||
| 1254 | 1263 | 97.13 | 97.37 | 96.81 | 96.97 | 96.83 | 96.91 | 97.08 | 3055 | 3322 | 98.13 | 98.07 | ||
| 1270 | 1281 | 96.85 | 96.77 | 96.30 | 96.85 | 96.61 | 96.76 | 3083 | 3400 | 98.28 | 97.35 | |||
| 1281 | 1292 | 96.17 | 96.25 | 95.86 | 96.64 | 96.41 | 96.67 | 96.43 | 3127 | 3444 | 98.27 | 98.17 | ||
| 1269 | 1280 | 90.39 | 89.99 | 90.31 | 90.39 | 90.63 | 90.55 | 91.04 | 3094 | 3411 | 96.22 | 96.22 | ||
| 1267 | 1278 | 94.71 | 94.48 | 93.69 | 93.84 | 94.84 | 94.60 | 94.48 | 3093 | 3408 | 98.03 | 98.06 | ||
| 1082 | 1084 | 89.74 | 89.28 | 89.00 | 88.72 | 89.48 | 88.93 | 89.83 | 1676 | 1734 | 93.38 | 92.16 | ||
| Average | 1197 | 1206 | 95.14 | 95.08 | 95.80 | 95.27 | 95.26 | 95.39 | 95.32 | 95.32 | 2664 | 2929 | 97.84 | 97.61 |
BT, Bos taurus breeds, BT + BI, Bos taurus and Bos indicus breeds.
min, minimum SNP haplotype, 1 Mb, full SNP haplotype.
Number of BEAGLE iterations.
Count of validation animals with reported parent microsatellite allele.
Average % accuracy between individual's imputed microsatellite allele and reported parental allele; bold, highest imputed microsatellite accuracy.
Count of reference animals with MS alleles and with reported parent microsatellite allele.
Average % BT accuracy between individual's reported microsatellite allele and reported parental allele.
BEAGLE running time for reference and validation populations.
| 1 | 0:01:04 | 0:03:54 | 0:23:04 | 0:01:15 | 0:04:45 | 0:03:22 | 0:15:05 | 0:00:15 | 0:01:04 | 0:00:17 | 0:01:16 | 0:38:03 | 0:29:02 | |
| 2 | 0:02:49 | 0:09:58 | 4:57:47 | 0:03:01 | 0:10:23 | 0:07:03 | 0:33:25 | 0:00:29 | 0:02:10 | 0:00:33 | 0:02:30 | 2:07:16 | 1:37:04 | |
| 3 | 0:03:08 | 0:10:47 | 0:21:30 | 0:03:49 | 0:13:09 | 0:19:04 | 1:24:35 | 0:01:11 | 0:05:20 | 0:01:27 | 0:06:55 | 0:45:42 | 0:33:10 | |
| 5 | 0:01:24 | 0:05:14 | 0:05:27 | 0:01:30 | 0:05:57 | 0:04:36 | 0:18:22 | 0:00:19 | 0:01:25 | 0:00:22 | 0:01:42 | 0:07:12 | 0:06:15 | |
| 9 | 0:04:22 | 0:16:37 | 0:20:07 | 0:04:57 | 0:18:20 | 0:17:57 | 1:16:59 | 0:01:13 | 0:05:34 | 0:02:01 | 0:09:46 | 0:29:53 | 0:23:06 | |
| 15 | 0:01:22 | 0:05:00 | 5:03:09 | 0:01:06 | 0:04:02 | 0:03:27 | 0:15:57 | 0:00:16 | 0:01:12 | 0:00:22 | 0:01:43 | 1:50:45 | 1:29:21 | |
| 16 | 0:02:13 | 0:06:55 | 0:13:49 | 0:02:21 | 0:07:32 | 0:07:36 | 0:27:19 | 0:00:09 | 0:00:40 | 0:00:11 | 0:00:47 | 0:08:31 | 0:06:59 | |
| 18 | 0:01:51 | 0:06:46 | 3:17:29 | 0:02:04 | 0:07:07 | 0:07:24 | 0:27:45 | 0:00:13 | 0:00:59 | 0:00:16 | 0:01:07 | 1:24:19 | 1:04:29 | |
| 19 | 0:01:31 | 0:05:04 | 0:19:39 | 0:01:38 | 0:05:33 | 0:04:45 | 0:20:29 | 0:00:21 | 0:01:41 | 0:00:25 | 0:01:55 | 0:14:01 | 0:12:49 | |
| 20 | 0:01:38 | 0:06:20 | 0:46:27 | 0:01:52 | 0:07:18 | 0:05:17 | 0:22:56 | 0:00:18 | 0:01:26 | 0:00:23 | 0:01:49 | 0:48:23 | 0:35:20 | |
| 21 | 0:01:15 | 0:04:06 | 0:51:59 | 0:01:17 | 0:03:56 | 0:03:51 | 0:11:58 | 0:00:11 | 0:00:45 | 0:00:13 | 0:00:49 | 0:40:33 | 0:31:13 | |
| 23 | 0:00:52 | 0:03:01 | 0:22:31 | 0:01:00 | 0:03:26 | 0:02:04 | 0:07:11 | 0:00:03 | 0:00:10 | 0:00:04 | 0:00:14 | 0:09:05 | 0:05:08 | |
| Average | 0:01:57 | 0:06:59 | 1:25:15 | 0:02:09 | 0:07:37 | 0:07:12 | 0:30:10 | 0:00:25 | 0:01:52 | 0:00:33 | 0:02:33 | 0:46:59 | 0:36:10 | |
BT, Bos taurus breeds; BT + BI, Bos taurus and Bos indicus breeds.
min, minimum SNP haplotype, 1 Mb, full SNP haplotype.
Number of BEAGLE iterations.
CPU running time for reference population.
Average Mendelian inheritance accuracy by microsatellite and breed across all imputation strategies.
| Angus | 58 | 100.00 | 97.78 | 99.74 | 96.43 | 96.80 | 97.67 | 98.28 | 98.15 | 98.28 | 98.25 | 91.43 | 87.04 | 95.44 |
| Bos taurus crossbred | 38 | 99.21 | 96.88 | 95.67 | 73.56 | 94.64 | – | 99.31 | 96.88 | 98.81 | 96.78 | 99.40 | – | 95.11 |
| Beefmaster | 4 | – | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | – | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| Belgian Blue | 3 | – | 100.00 | 71.43 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 97.40 |
| Belmont Red | 6 | 73.81 | 90.48 | 83.33 | 83.33 | 78.57 | – | 100.00 | 95.24 | 100.00 | 90.48 | 100.00 | – | 89.52 |
| Brahman | 4 | 100.00 | 100.00 | 100.00 | 100.00 | 50.00 | – | 100.00 | 75.00 | 100.00 | 100.00 | 100.00 | – | 89.29 |
| Braunvieh | 1 | – | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | – | 100.00 | 100.00 | 100.00 | 14.29 | 100.00 | 91.43 |
| Charolais | 112 | 100.00 | 95.79 | 97.19 | 100.00 | 98.47 | 96.78 | 97.58 | 100.00 | 92.35 | 100.00 | 97.92 | 96.49 | 97.71 |
| Chiangus | 1 | – | 100.00 | 100.00 | 85.71 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 98.70 |
| Freisan | 16 | 80.95 | 95.54 | 98.21 | 83.04 | 100.00 | 100.00 | 99.11 | 92.86 | 93.75 | 92.86 | 100.00 | 75.82 | 92.68 |
| Gelbvieh | 1 | 100.00 | 14.29 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 14.29 | 85.71 |
| Hereford | 473 | 97.85 | 95.65 | 95.68 | 97.80 | 96.71 | 97.61 | 96.32 | 95.22 | 96.53 | 83.75 | 95.86 | 90.76 | 94.98 |
| Holstein | 61 | 85.71 | 99.77 | 99.05 | 99.30 | 98.13 | 99.77 | 100.00 | 100.00 | 98.36 | 98.59 | 87.76 | 96.92 | 96.95 |
| Jersey | 12 | 100.00 | 95.24 | 100.00 | 51.19 | 81.43 | 100.00 | 100.00 | 100.00 | 100.00 | 92.21 | 97.40 | 98.57 | 93.00 |
| Limousin | 106 | 94.56 | 98.64 | 94.74 | 97.64 | 99.46 | 100.00 | 98.02 | 99.45 | 95.33 | 98.64 | 95.78 | 93.09 | 97.11 |
| Nelore | 14 | 100.00 | 100.00 | 100.00 | 100.00 | 85.71 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | – | 98.70 |
| Red Angus | 2 | – | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 92.86 | 100.00 | 14.29 | 91.56 |
| Simmental | 74 | 94.49 | 98.65 | 97.02 | 99.21 | 99.32 | 100.00 | 97.01 | 94.03 | 95.74 | 78.91 | 97.06 | 79.29 | 94.23 |
| Unknown taurine | 326 | 93.99 | 97.14 | 93.37 | 97.78 | 93.90 | 91.49 | 96.57 | 97.36 | 96.93 | 95.74 | 91.22 | 85.23 | 94.23 |
| Overall | 1312 | 94.33 | 93.46 | 96.08 | 92.89 | 93.32 | 98.96 | 98.95 | 97.06 | 98.21 | 95.74 | 93.06 | 82.12 | 94.41 |
Average Mendelian inheritance accuracy for different imputation methods.
| Bt | All ( | 0 | 66.54 | 67.39 | 72.42 | 67.39 | 66.46 | 68.32 | 68.16 | 85.25 | 86.66 |
| ≤1 | 90.55 | 90.40 | 91.25 | 91.17 | 91.32 | 91.09 | 90.70 | 95.89 | 96.31 | ||
| ≤2 | 95.58 | 95.20 | 96.05 | 96.05 | 96.05 | 96.05 | 96.13 | 98.24 | 98.31 | ||
| Sire/dam not ref | 0 | 66.56 | 67.55 | 72.21 | 67.55 | 66.56 | 68.55 | 68.64 | |||
| ≤1 | 90.35 | 90.35 | 90.93 | 90.93 | 91.10 | 90.77 | 90.52 | ||||
| ≤2 | 95.59 | 95.17 | 96.09 | 96.09 | 96.09 | 96.01 | 96.09 | ||||
| Not reference breed | 0 | 59.24 | 58.97 | 71.47 | 61.14 | 59.78 | 61.96 | 62.23 | |||
| ≤1 | 85.05 | 84.51 | 88.32 | 87.77 | 87.77 | 88.04 | 87.23 | ||||
| ≤2 | 93.75 | 91.58 | 94.84 | 94.84 | 94.84 | 95.38 | 95.11 | ||||
| BT × BI | All ( | 0 | 81.82 | 81.82 | |||||||
| ≤1 | 90.91 | 90.91 | |||||||||
| ≤2 | 90.91 | 100.00 | |||||||||
Average accuracy for the validation populations using imputed microsatellite alleles and their parents' reported alleles.
Average accuracy for the reference population and their parents using reported microsatellite alleles.
BT, Bos taurus breeds; BT + BI, Bos taurus and Bos indicus breeds.
min, minimum SNP haplotype; 1 Mb, full SNP haplotype.
Number of BEAGLE iterations.
Subset of individuals in the validation population whose parents have reported microsatellite genotypes.
Total number of imputed microsatellite alleles with Mendelian inheritance conflicts.
The animal's parents were not part of the reference population.
The animal's breed is not represented in the reference population.