| Literature DB >> 35464354 |
Andrea M Luttman1,2, Misa Komine3, Tuddow Thaiwong4, Tyler Carpenter1,5, Susan L Ewart6, Matti Kiupel3,4, Ingeborg M Langohr3,7, Patrick J Venta1,8.
Abstract
Tetranucleotide and pentanucleotide short tandem repeat (hereafter termed tetraSTR and pentaSTR) polymorphisms have properties that make them desirable for DNA profiling and paternity testing. However, certain species, such as the horse, have far fewer tetraSTRs than other species and for this reason dinucleotide STRs (diSTRs) have become the standard for DNA profiling in horses, despite being less desirable for technical reasons. During our testing of a series of candidate genes as potentially underlying a heritable condition characterized by megaesophagus in the Friesian horse breed, we found that good tetraSTRs do exist in horses but, as expected, at a much lower frequency than in other species, e.g., dogs and humans. Using a series of efficient methods developed in our laboratory for the production of multiplexed tetraSTRs in other species, we identified a set of tetra- and pentaSTRs that we developed into a 17-plex panel for the horse, plus a sex-identifying marker near the amelogenin gene. These markers were tested in 128 horses representing 16 breeds as well as crossbred horses, and we found that these markers have useful genetic variability. Average observed heterozygosities (Ho) ranged from 0.53 to 0.89 for the individual markers (0.66 average Ho for all markers), and 0.62-0.82 for expected heterozygosity (He) within breeds (0.72 average He for all markers). The probability of identity (PI) within breeds for which 10 or more samples were available was at least 1.1 x 10-11, and the PI among siblings (PIsib) was 1.5 x 10-5. Stutter was ≤ 11% (average stutter for all markers combined was 6.9%) compared to the more than 30% typically seen with diSTRs. We predict that it will be possible to develop accurate allelic ladders for this multiplex panel that will make cross-laboratory comparisons easier and will also improve DNA profiling accuracy. Although we were only able to exclude candidate genes for Friesian horse megaesophagus with no unexcluded genes that are possibly causative at this point in time, the study helped us to refine the methods used to develop better tetraSTR multiplexed panels for species such as the horse that have a low frequency of tetraSTRs.Entities:
Keywords: 17-plex; DNA profiling; Friesian; equine; horse; paternity testing; tetranucleotide short tandem repeat
Year: 2022 PMID: 35464354 PMCID: PMC9021955 DOI: 10.3389/fvets.2022.861623
Source DB: PubMed Journal: Front Vet Sci ISSN: 2297-1769
Marker primers, observed data, and mutation rate predictions.
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| amel(YX) | F: CCAGGATGAGGTGGTAGCTTTTATA | [GA]12 | NA | HEX | 0.7 | 113;134 | 113-150 | 134 | 11 | ND | NA |
| R: gtATGTGAACAATTGCATATTGACTTAATCT | 0.7 | ||||||||||
| ECA28.003.6 | F: aGGTAGCATAACCCCTACTGAGATAA | [AGAT]13 | 468 | HEX | 0.3 | 179 | 159-187 | 171 | 8 | 10.3 | 0.18 |
| R: gtttcttGGGTTCCACATGTCAAAACAAA | 0.3 | ||||||||||
| ECA15.001.6 | F: TGCTTGGTGTACAGGCCTCAG | [TCTA]4[TCTG][TCTA]11 | 602 | HEX | 1.3 | 245 | 228-257 | 242 | 8 | 9.1 | 0.33 |
| R: gtttcttTTTCTGAGAGAAAGCTGAAAGTATG | 1.3 | ||||||||||
| ECA07.065.4 | F: aGAACAATGAGCAGGGAGTACAA | [ATAG]13 | 477 | HEX | 1.0 | 291 | 267-305 | 283 | 8 | 7.9 | 0.19 |
| R: gAGCAAGACTTGAAGAGGAATGGA | 1.0 | ||||||||||
| ECA05.065.8 | F: GGACTTTCAAAACTCACCCAAA | [GTTTT]11 | 504 | HEX | 1.5 | 347 | 312-361 | 336 | 8 | 1.7 | 0.22 |
| R: GATACAAAGTCCATGATCAAAACAAA | 1.5 | ||||||||||
| ECA12.004.8 | F: AAGGAGCAAGTTCAGGCAAA | [ATCT] | 633 | HEX | 5.0 | 431 | 410-435 | 420 | 9 | 3.2 | 0.37 |
| R: gtttcttTCATCCCTTGTACGCCTCTAA | 5.0 | ||||||||||
| ECA24.028.9 | F: TTTCAGGTTCTCGTTACTCAGATAGAA | [TCTCT]11 | 495 | NED | 1.0 | 120 | 80-137 | 113 | 9 | 6.2 | 0.21 |
| R: ATATTCTTGTAGGTAGGGTTT | 1.0 | ||||||||||
| ECA22.012.1 | F: CTCATGGTCTTTTAATTTTGAGTTATAC | [AGAT]12 | 510 | NED | 0.5 | 169 | 142-172 | 160 | 7 | 2.5 | 0.23 |
| R: gtttcttGCAACACATGTAACTGACCCAAA | 0.5 | ||||||||||
| ECA21.027.5 | F: TCCAAGGACCTTCTTCCAAA | [GATA]2[GTTA][GATA] | 608 | NED | 1.5 | 221 | 193-236 | 215 | 6 | 7.3 | 0.34 |
| R: gtttcttAAATTAGTGAATTTGGAGAAAACAA | 1.5 | ||||||||||
| ECA25.011.9 | F: TCTGAGAGGTGATGGCAAAA | [ATCT]3[ATCC][ATCT]11 [ATCC]4 | 602 | NED | 1.0 | 248 | 235-267 | 250 | 8 | 10.7 | 0.33 |
| R: gtttcttTTCATTGTGTACAGTGTGGTATCAA | 1.0 | ||||||||||
| ECA27.020.1 | F: TTGAATTGCCATGATTAGGAA | [AGAT]12 | 582 | NED | 2.5 | 307 | 285-326 | 306 | 7 | 7.3 | 0.31 |
| R: gTGAATTTGGGCTGAGATTGAA | 2.5 | ||||||||||
| ECA14.076.8 | F: CCTACTTAGTCCCCCTTCCTGAAA | [CATC]2[TATC][CATT] [TATC][CATC]4[CAAC] [CATC][CAAC][CATC]10 | 663 | NED | 1.5 | 354 | 346-362 | 356 | 4 | 8.0 | 0.42 |
| R: gtttcttCAGGACAGAGGTTAAGTCACAAATAA | 1.5 | ||||||||||
| ECA04.104.2 | F: AGGAGTGGCAGTTGGTTGTGG | [CTAT] | 931 | NED | 2.5 | 428 | 411-436 | 422 | 7 | 6.6 | 0.92 |
| R: gtttcttCACCCATCAATGCACAAATCTGCAGAAAA | 2.5 | ||||||||||
| ECA04.035.5 | F: CCCACATGACAAAAGCACAA | [TAAAA]9 | 405 | 6-FAM | 0.4 | 127 | 97-131 | 114 | 7 | 1.2 | 0.13 |
| R: gTTCTCTCTGAGTCCAGATGCAA | 0.4 | ||||||||||
| ECA02.105.5 | F: AGCTCTTGAGCCCTCTTTGTAA | [AGAA] | 647 | 6-FAM | 1.0 | 198 | 172-207 | 191 | 12 | 6.6 | 0.39 |
| R: gtttcttTTCCTTCATGCTGTTCCTGTAA | 1.0 | ||||||||||
| ECA03.025.9 | F: AGTAACATTTGGGTCATCTGAAA | [AGAT]5 | 542 | 6-FAM | 1.0 | 254 | 238-261 | 252 | 7 | 7.5 | 0.26 |
| R: gCGCAGCTCCTCATACTGAAA | 1.0 | ||||||||||
| ECA01.102.4 | F: TTTGGAGATGTTGGAAGTTAAGG | [ATCT]7 | 671 | 6-FAM | 1.0 | 296 | 261-307 | 297 | 9 | 9.8 | 0.43 |
| R: gtttcttAAGGGGAGAGGATGGAGCAAGAAAGAA | 1.0 | ||||||||||
| ECA20.003.8 | F: AAATAAGATGAATAGACAGGCCCTAA | [TATC]18[TGTC] | 672 | 6-FAM | 4.8 | 409 | 354-417 | 381 | 17 | 11.2 | 0.43 |
| R: gtttcttTCCGACTCATCCTACAGCAA | 4.8 |
The table is sorted by primer label (HEX, NED, and 6-FAM) and then by the amplicon size predicted from the reference genome (Ref size).
Markers are named by the chromosome upon which they reside (Eca, Equus caballus), followed by the megabase position for the repeat from the horse 2007 (EquCAb2) build.
Lower case letters within the primer sequences indicate nucleotides that are not found in the horse reference genome; these nucleotides are primarily “pigtails” to suppress split peaks (.
The motif is indicated by the sequence found in TRF track in the UCSC Genome Browser. The TRF track follows the nomenclature recommendations of the DNA Commission of the International Society of Forensic Genetics better than that of RepeatMasker (.
The reference genome size (ref size) includes the additional non-templated adenine that is attached via the terminal transferase activity of the Taq polymerase (.
The concentration is the final value used in our implementation of the multiplex panel.
The stutter is given in the percent of the height of the main peak. The number for each marker is the average stutter across all measurements.
mut rate, This is the predicted mutation rate based upon the relationship of the SW score with the human mutation rate, assuming that the horse mutation rate will be similar (Venta, unpublished results). The mutation rate for Eca04a (ECA04.104.2) is likely to be too high because much of the repeat region is imperfect and it is very long (see Discussion).
SWS, Smith Waterman score, as obtained from the UCSC Genome Browser for equCab2.
wt avg, weighted average in base pairs, as is the range.
Genetic information for markers in the 17-plex panel.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| amel (XY) | 0.56 | 0.68 | NA | NA | NA | NA | NA | 2 |
| Eca28 | 0.80 | 0.79 | 0.71 | 0.579 | 0.401 | ND | ND | 4 |
| Eca15 | 0.75 | 0.80 | 0.74 | 0.442 | 0.281 | 0.06 | 0.36 | 4 |
| Eca07 | 0.64 | 0.75 | 0.71 | 0.639 | 0.461 | 0.12 | 0.42 | 4 |
| Eca05 | 0.59 | 0.73 | 0.72 | 0.675 | 0.498 | 0.16 | 0.46 | 5 |
| Eca12 | 0.56 | 0.69 | 0.75 | 0.726 | 0.561 | 0.18 | 0.47 | 4 |
| Eca24 | 0.79 | 0.74 | 0.72 | 0.631 | 0.452 | 0.12 | 0.42 | 5 |
| Eca22 | 0.60 | 0.74 | 0.73 | 0.591 | 0.411 | 0.14 | 0.45 | 4 |
| Eca21 | 0.53 | 0.62 | 0.74 | 0.768 | 0.595 | 0.27 | 0.56 | 4 |
| Eca25 | 0.55 | 0.62 | 0.74 | 0.791 | 0.650 | 0.31 | 0.56 | 4 |
| Eca27 | 0.60 | 0.65 | 0.74 | 0.764 | 0.608 | 0.24 | 0.51 | 4 |
| Eca14 | 0.63 | 0.63 | 0.61 | 0.771 | 0.607 | 0.23 | 0.51 | 4 |
| Eca04a | 0.68 | 0.73 | 0.75 | 0.668 | 0.488 | 0.13 | 0.44 | 4 |
| Eca04b | 0.69 | 0.74 | 0.66 | 0.664 | 0.488 | 0.14 | 0.44 | 5 |
| Eca02 | 0.69 | 0.77 | 0.75 | 0.589 | 0.412 | 0.12 | 0.41 | 4 |
| Eca03 | 0.66 | 0.75 | 0.73 | 0.657 | 0.481 | 0.13 | 0.42 | 4 |
| Eca01 | 0.63 | 0.69 | 0.75 | 0.662 | 0.475 | 0.14 | 0.44 | 4 |
| Eca20 | 0.89 | 0.86 | 0.75 | 0.426 | 0.270 | 0.03 | 0.32 | 4 |
| Avg | 0.66 | 0.72 | 0.72 | 0.650 | 0.479 | 0.158 | 0.449 | |
| combined | 9.11E-9 | 1.11E-5 | 2.0E-14 | 2.3E-06 |
The entire horse panel was used in this analysis (combined values are lower for individual breeds; see the text). Sort order is the same as in .
Figure 1Histograms of allele frequencies for markers across 124 horses with the 17-plex panel plus the amel marker. Allele numbers are on the X-axis and the allele count is given on the Y-axis. Asterisks indicate the alleles that were Sanger sequenced (Genbank MN646783, MN684168, MN684204). The markers are displayed in the order of the color of the fluorescent label used (FAM, NED, PET), and size order, as is the case in Table 1.