| Literature DB >> 22912645 |
Matthew McClure1, Tad Sonstegard, George Wiggans, Curtis P Van Tassell.
Abstract
Microsatellite (MS) markers have recently been used for parental verification and are still the international standard despite higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP)-based assays. Despite domestic and international interest from producers and research communities, no viable means currently exist to verify parentage for an individual unless all familial connections were analyzed using the same DNA marker type (MS or SNP). A simple and cost-effective method was devised to impute MS alleles from SNP haplotypes within breeds. For some MS, imputation results may allow inference across breeds. A total of 347 dairy cattle representing four dairy breeds (Brown Swiss, Guernsey, Holstein, and Jersey) were used to generate reference haplotypes. This approach has been verified (>98% accurate) for imputing the International Society of Animal Genetics recommended panel of 12 MS for cattle parentage verification across a validation set of 1,307 dairy animals. Implementation of this method will allow producers and breed associations to transition to SNP-based parentage verification utilizing MS genotypes from historical data on parents where SNP genotypes are missing. This approach may be applicable to additional cattle breeds and other species that wish to migrate from MS- to SNP-based parental verification.Entities:
Keywords: ISAG; SNP haplotype; across breed imputation; microsatellite imputation; parentage verification
Year: 2012 PMID: 22912645 PMCID: PMC3418578 DOI: 10.3389/fgene.2012.00140
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Animal and genotype count.
| Breed | HDSNP | Microsatellite | |||||
|---|---|---|---|---|---|---|---|
| 9 | 10 | 11 | 12 | Sire | Dam | ||
| BS | 71 | 2 | 0 | 2 | 29 | 31 | 0 |
| GU | 60 | 0 | 5 | 5 | 6 | 2 | 0 |
| HO | 1110 | 6 | 49 | 118 | 77 | 395 | 0 |
| JE | 60 | 3 | 8 | 15 | 22 | 57 | 26 |
| Total | 1301 | 11 | 62 | 140 | 134 | 485 | 26 |
.
.
.
.
.
Microsatellite information.
| Marker | Chr | UMD3.1 position | Structure | Sequence | Heterozygosity | MS | SNP | Hap | Reference | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Center | Start | End | |||||||||
| 23 | 39,294,185 | 38,794,063 | 39,794,307 | simple | (TG)n | 79 | 7 | 32 | 34 | Bishop et al. ( | |
| 1 | 132,498,052 | 131,997,971 | 132,998,132 | simple | (GT)n | 60 | 5 | 32 | 19 | Barendse et al. ( | |
| 2 | 127,591,894 | 127,091,834 | 128,091,954 | simple | (CA)n | 69 | 7 | 14 | 31 | Sunden et al. ( | |
| 5 | 56,657,927 | 56,157,827 | 57,158,026 | simple | (AC)n | 59 | 8 | 28 | 40 | Toldo et al. ( | |
| 9 | 10,858,186 | 10,358,122 | 11,358,249 | simple | (CA)n | 71 | 8 | 24 | 53 | Steffen et al. ( | |
| 19 | 56,648,498 | 56,148,462 | 57,148,534 | compound | (GT)nAC(GT)6 | 59 | 7 | 20 | 25 | Toldo et al. ( | |
| 3 | 33,010,922 | 32,510,842 | 33,511,002 | simple | (AC)n | 10 | 38 | 29 | Vaiman et al. ( | ||
| 15 | 24,165,259 | 23,665,145 | 24,665,372 | simple | (CA)n | 12 | 10 | 24 | Georges and Massey ( | ||
| 21 | 57,640,936 | 57,140,875 | 58,140,996 | compound | (AC)n(AT)n | 67 | 15 | 30 | 51 | Georges and Massey ( | |
| 20 | 22,207,036 | 21,706,989 | 22,707,083 | simple | (TG)n | 67 | 5 | 33 | 28 | Georges and Massey ( | |
| 18 | 65,406,736 | 64,906,704 | 65,906,767 | simple | (TG)n | 81 | 13 | 34 | 74 | Georges and Massey ( | |
| 16 | 25,789,455 | 25,289,387 | 26,289,522 | compound | (TG)6CG(TG)4(TA)n | 78 | 15 | 21 | 16 | Georges and Massey ( | |
.
.
.
.
.
Example of allele counts per SNP haplotype for BM1824 that fits the haplotype identification criteria.
| Haplotype | Brown Swiss | Guernsey | Holstein | Jersey | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 178 | 180 | 182 | 188 | 190 | 178 | 180 | 182 | 188 | 190 | 178 | 180 | 182 | 188 | 190 | 178 | 180 | 182 | 188 | 190 | |
| BABBAABBAAAABBBAABABABBBABBBAAAB | 26 | 5 | 32 | 1 | ||||||||||||||||
| BAABAABBAAAABBBAABABABBBABBBAAAB | 32 | |||||||||||||||||||
| BBBBAABBAAAABBBAABABABBBABBBAAAB | 21 | |||||||||||||||||||
| BABBAABBAAAABBBAABABABBBAABBBBAB | 4 | |||||||||||||||||||
| BABBAABBAAAABBBAABABABBBABBBAABA | 3 | |||||||||||||||||||
| BAABBAABBAAABBBAABABABABABBBAAAB | 2 | |||||||||||||||||||
| BAABABABBAAABBAABAABABBBABAAAABA | 65 | 1 | ||||||||||||||||||
| BAABABABBAAABBAABAABABBBABBBAAAB | 50 | |||||||||||||||||||
| AABBAABBBAAABBAABAABABBBABBBAAAB | 58 | 2 | ||||||||||||||||||
| AAAABAAABBABBAAABAABABABABAAAAAB | 3 | 14 | ||||||||||||||||||
| AAAABAAABBABBAAABAABABABABAAAABA | 8 | 8 | 69 | 1 | 3 | |||||||||||||||
| AAAABAAABBABBAAABAABABABABAAAAAA | 14 | |||||||||||||||||||
| BAABBAABBBABBBAABBABABABABABAAAA | 1 | |||||||||||||||||||
| AAAABAAABBABBAAABAABABABABABAAAA | 3 | |||||||||||||||||||
| BABBAABBAABAABBBBBBABAAABABBBBAB | 1 | |||||||||||||||||||
| BAAABABBAABAABBBBBBABAAABABBBBAB | 113 | |||||||||||||||||||
| BAABAABBAABAABBBBBBABAAABABBBBAB | 1 | 71 | ||||||||||||||||||
| AABBAABBAABAABBBBBBABAAABABBBBAB | 3 | 4 | ||||||||||||||||||
| AAAABABBAABAABBBBBBABAAABABBBBAB | 19 | 1 | 8 | 9 | 3 | 11 | ||||||||||||||
.
.
Percent of 1,301 individuals with haplotypes imputed to microsatellite (MS) alleles.
| MS | Alleles | 0 Allele | 1 Allele | 2 Allele | |||
|---|---|---|---|---|---|---|---|
| Within | Across | Within | Across | Within | Across | ||
| 7 | 0.020 | 0.012 | 0.980 | 0.988 | 0.834 | 0.858 | |
| 5 | 0.001 | 0.001 | 0.999 | 0.999 | 0.983 | 0.988 | |
| 7 | 0.004 | 0.001 | 0.996 | 0.999 | 0.960 | 0.985 | |
| 8 | 0.010 | 0.003 | 0.99 | 0.997 | 0.940 | 0.949 | |
| 8 | 0.037 | 0.018 | 0.963 | 0.982 | 0.902 | 0.932 | |
| 7 | 0.005 | 0.005 | 0.995 | 0.995 | 0.947 | 0.957 | |
| 10 | 0.012 | 0.002 | 0.988 | 0.998 | 0.947 | 0.961 | |
| 7 | 0.006 | – | 0.994 | 1.000 | 0.961 | 0.988 | |
| 15 | 0.044 | 0.017 | 0.956 | 0.982 | 0.891 | 0.904 | |
| 5 | 0.015 | 0.009 | 0.985 | 0.991 | 0.944 | 0.960 | |
| 13 | 0.033 | 0.028 | 0.967 | 0.972 | 0.889 | 0.892 | |
| 15 | 0.002 | – | 0.998 | 1.000 | 0.954 | 0.971 | |
| Average | 8.9 | 0.016 | 0.008 | 0.984 | 0.992 | 0.929 | 0.945 |
.
.
.
.
Microsatellite (MS) validations.
| Microsatellite | Concordance | Parentage verification | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Within | Across | Reported MS | Imputed MS | |||||||||||
| Overall | Not reported | Reported | ||||||||||||
| Within | Across | Within | Across | Within | Across | |||||||||
| 0.993 | 0.977 | 133 | 1.000 | 37 | 0.972 | 0.979 | 144 | 0.991 | 0.991 | 107 | 0.921 | 0.947 | 38 | |
| 1.000 | 0.994 | 337 | 0.995 | 189 | 1.000 | 1.000 | 475 | 1.000 | 1.000 | 289 | 1.000 | 1.000 | 188 | |
| 0.991 | 0.980 | 344 | 1.000 | 195 | 0.998 | 0.998 | 478 | 1.000 | 1.000 | 284 | 0.995 | 0.995 | 194 | |
| 1.000 | 0.988 | 343 | 1.000 | 194 | 0.996 | 0.987 | 479 | 0.997 | 0.986 | 285 | 0.995 | 0.990 | 195 | |
| 0.988 | 0.955 | 325 | 1.000 | 154 | 0.991 | 0.979 | 470 | 0.989 | 0.975 | 283 | 0.995 | 0.984 | 187 | |
| 0.997 | 0.991 | 340 | 1.000 | 149 | 1.000 | 1.000 | 479 | 1.000 | 1.000 | 288 | 1.000 | 1.000 | 192 | |
| 0.991 | 0.988 | 343 | 0.990 | 194 | 1.000 | 1.000 | 479 | 1.000 | 1.000 | 285 | 1.000 | 1.000 | 194 | |
| 0.979 | 0.954 | 340 | 1.000 | 195 | 0.996 | 0.994 | 479 | 0.997 | 1.000 | 284 | 0.995 | 0.985 | 196 | |
| 0.997 | 0.959 | 319 | 1.000 | 195 | 0.996 | 0.979 | 471 | 0.993 | 0.986 | 282 | 1.000 | 0.958 | 189 | |
| 0.997 | 0.988 | 345 | 1.000 | 194 | 1.000 | 1.000 | 478 | 1.000 | 1.000 | 284 | 1.000 | 1.000 | 194 | |
| 1.000 | 0.979 | 337 | 1.000 | 189 | 0.993 | 0.989 | 475 | 0.990 | 0.990 | 286 | 1.000 | 0.989 | 190 | |
| 1.000 | 0.970 | 297 | 0.987 | 153 | 0.991 | 0.993 | 431 | 1.000 | 1.000 | 279 | 0.974 | 0.980 | 152 | |
| Average | 0.994 | 0.977 | 317 | 0.998 | 170 | 0.994 | 0.992 | 445 | 0.996 | 0.994 | 270 | 0.989 | 0.986 | 176 |
.
.
.
.
.
.
.
.
.
Figure 1Example of the compounding effect of MS allele miscalls. While the sire in the second generation was misgenotyped for the parental allele it was not caught because the error still follows Mendelian inheritance patterns. This will cause a problem in the third generation because if the parental 121 allele is called correctly it will fail the parentage verification, if incorrectly called again as a 119 allele this error will further propagate in future generations.