| Literature DB >> 30105047 |
Felipe Avila1, James R Mickelson2, Robert J Schaefer1, Molly E McCue1.
Abstract
Selective breeding for athletic performance in various disciplines has resulted in population stratification within the American Quarter Horse (QH) breed. The goals of this study were to utilize high density genotype data to: (1) identify genomic regions undergoing positive selection within and among QH subpopulations; (2) investigate haplotype structure within each QH subpopulation; and (3) identify candidate genes within genomic regions of interest (ROI), as well as biological pathways, predicted to play a role in elite performance in each group. For that, 65K SNP genotyping data on 143 elite individuals from 6 QH subpopulations (cutting, halter, racing, reining, western pleasure, and working cow) were imputed to 2M SNPs. Signatures of selection were identified using FST-based (di ) and haplotype-based (hapFLK) analyses, accompanied by identification of local haplotype structure and sharing within subpopulations (hapQTL). Regions undergoing positive selection were identified on all 31 autosomes, and ROI on 2 chromosomes were identified by all 3 methods combined. Genes within each ROI were retrieved and used to identify pathways and genes that might contribute to performance in each subpopulation. These included, among others, candidate genes associated with skeletal muscle development, metabolism, and central nervous system development. This work improves our understanding of equine breed development, and provides breeders with a better understanding of how selective breeding impacts the performance of QH populations.Entities:
Keywords: SNP; ancestral haplotypes; genotyping; horse; imputation; selection signatures
Year: 2018 PMID: 30105047 PMCID: PMC6060370 DOI: 10.3389/fgene.2018.00249
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Primary characteristics of the six QH performance groups evaluated in this study, as described by the American Quarter Horse Association.
| Cutting | Horse and rider must move quietly into a herd of cattle, cut one cow from the herd, drive it to the center of the arena and “hold” it away from the herd. The horse is scored on its ability to keep the cow from returning to the herd, cow sense, attentiveness, and courage. |
| Halter | Horses are led before judges so that lameness and quality of movement can be evaluated. Horses are judged on conformation including: balance, structural correctness, breed, sex characteristics and degree of muscling. |
| Racing | Horses race against one another at distances between 220 and 870 yards. The classic distance is 440 yards (1/4 mile). |
| Reining | The horse is judged on movement under saddle, mastery of a prescribed maneuver and attitude as it is guided through a specific pattern. The horse is required to perform stops, spins, rollbacks, lead changes and circles at a lope. |
| Western pleasure | Horses are evaluated on quality of movement under saddle at the walk, jog, and lope, while staying quiet, calm, and traveling on a loose rein. |
| Working cow (Reined cow horse) | Combines reining ability and cow sense. The competition consists of two parts: prescribed reined work and actual cow work. Judging is based on good manners, smoothness, cow sense and ease of reining. |
Originally provided in Petersen et al. (.
Figure 1Genome-wide d values for the 6 QH subpopulations. Each d value is plotted on the y axis and each autosome is shown on the x axis in alternating colors. Each point represents a 10 kb window. The red horizontal line represents the top 0.1% of the empirical distribution of dfor each QH subpopulation.
ROI identified by d analysis in the 6 QH subpopulations.
| Cutting | 60 | 134 |
| Halter | 47 | 90 |
| Racing | 77 | 171 |
| Reining | 52 | 81 |
| Western pleasure | 39 | 81 |
| Working cow | 57 | 78 |
| Total | 346 | 635 |
Totals do not consider overlap between populations or analysis methods.
ROI identified by d analysis that are shared by two or more QH subpopulations.
| 1 | 13,685,246 | X | X | ||||
| 1 | 119,564,500 | X | X | ||||
| 1 | 119,676,493 | X | X | ||||
| 1 | 161,715,511 | X | X | ||||
| 2 | 28,112,330 | X | X | X | |||
| 2 | 63,922,661 | X | X | ||||
| 2 | 93,844,075 | X | X | ||||
| 2 | 93,904,657 | X | X | ||||
| 2 | 95,577,244 | X | X | ||||
| 2 | 95,962,979 | X | X | ||||
| 2 | 96,425,441 | X | X | ||||
| 3 | 12,705,501 | X | X | ||||
| 3 | 68,445,127 | X | X | ||||
| 4 | 18,376,592 | X | X | ||||
| 4 | 19,475,682 | X | X | ||||
| 4 | 35,285,735 | X | X | ||||
| 4 | 106,747,371 | X | X | X | |||
| 5 | 65,517,082 | X | X | ||||
| 6 | 25,394,821 | X | X | ||||
| 6 | 26,913,211 | X | X | ||||
| 6 | 30,595,130 | X | X | ||||
| 6 | 30,637,559 | X | X | ||||
| 6 | 30,686,504 | X | X | ||||
| 6 | 30,705,641 | X | X | ||||
| 6 | 30,724,394 | X | X | ||||
| 6 | 30,753,624 | X | X | ||||
| 6 | 31,086,772 | X | X | ||||
| 6 | 31,334,561 | X | X | ||||
| 6 | 81,474,997 | X | X | X | X | ||
| 6 | 81,534,461 | X | X | X | |||
| 7 | 5,529,473 | X | X | ||||
| 7 | 65,654,968 | X | X | ||||
| 9 | 69,646,156 | X | X | ||||
| 9 | 70,735,787 | X | X | ||||
| 9 | 70,814,115 | X | X | ||||
| 10 | 23,574,776 | X | X | X | |||
| 10 | 23,956,861 | X | X | X | |||
| 10 | 24,014,382 | X | X | ||||
| 10 | 66,622,713 | X | X | X | |||
| 10 | 71,364,970 | X | X | ||||
| 10 | 83,744,000 | X | X | ||||
| 11 | 3,863,250 | X | X | ||||
| 11 | 35,553,211 | X | X | ||||
| 11 | 36,006,028 | X | X | ||||
| 11 | 38,335,749 | X | X | ||||
| 16 | 24,163,505 | X | X | ||||
| 17 | 15,344,171 | X | X | ||||
| 18 | 19,864,538 | X | X | ||||
| 18 | 50,904,645 | X | X | ||||
| 20 | 12,885,496 | X | X | ||||
| 20 | 25,186,056 | X | X | ||||
| 20 | 35,886,068 | X | X | ||||
| 25 | 6,263,862 | X | X | ||||
| 25 | 6,734,228 | X | X | ||||
| 25 | 14,862,570 | X | X | ||||
| 28 | 14,573,111 | X | X |
Figure 2Haplotype-based hapFLK results for all 6 QH subpopulations. Chromosome number and statistical significance ([–log10] p-values) are plotted in the x and y axes, respectively. The genome-wide significance threshold corresponding to P < 0.0001 is shown as a horizontal red line.
ROI identified by hapFLK and their overlap with ROI identified by d and hapQTL.
| 1 | 40,502,663-40,641,889 | 0.000128973 | hapQTL | – | hapQTL | – | – | ||
| 7 | 5,501,423-5,596,628 | 0.000068200 | Both | – | – | – | – | ||
| 9 | 70,643,295-70,822,536 | 0.0000146 | – | Both | – | Both | – | – | |
| 15 | 23,550,251-23,566,268 | 0.000127614 | – | – | – | – | – | hapQTL | – |
| 21 | 29,967,251-29,989,856 | 0.00006170 | hapQTL | hapQTL | hapQTL | hapQTL | – | – | |
| 21 | 31,684,806-31,731,726 | 0.00004790 | – | – | hapQTL | – | – | – |
Chromosomal location, genomic coordinates, lowest p-value and genes contained within each hapFLK ROI (left). For each QH subpopulation (right), “d.
Figure 3Genome-wide hapQTL values for the 6 QH subpopulations. Bayes factor values for each SNP are plotted on the y axis and each autosome is shown on the x axis in alternating colors.
Figure 4Circos plot showing genome-wide significant d values (red layer), hapQTL values (orange layer), and hapFLK values (blue layer) across all 31 autosomes for the reining QH subpopulation.
Genes undergoing selection in 2 or more subpopulations.
| X | X | |||||
| X | X | X | ||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | X | ||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | |||||
| X | X | X | X | |||
| X | X | |||||
| X | X | |||||
| X | X | X | ||||
| X | X | X | ||||
| X | X | |||||
| X | X | |||||
| X | X |
Two genes (7SK and U6) were not counted because they have copies in several genomic loci.