| Literature DB >> 19503617 |
Jingjing Gu1, Nick Orr, Stephen D Park, Lisa M Katz, Galina Sulimova, David E MacHugh, Emmeline W Hill.
Abstract
Thoroughbred horses have been selected for exceptional racing performance resulting in system-wide structural and functional adaptations contributing to elite athletic phenotypes. Because selection has been recent and intense in a closed population that stems from a small number of founder animals Thoroughbreds represent a unique population within which to identify genomic contributions to exercise-related traits. Employing a population genetics-based hitchhiking mapping approach we performed a genome scan using 394 autosomal and X chromosome microsatellite loci and identified positively selected loci in the extreme tail-ends of the empirical distributions for (1) deviations from expected heterozygosity (Ewens-Watterson test) in Thoroughbred (n = 112) and (2) global differentiation among four geographically diverse horse populations (F(ST)). We found positively selected genomic regions in Thoroughbred enriched for phosphoinositide-mediated signalling (3.2-fold enrichment; P<0.01), insulin receptor signalling (5.0-fold enrichment; P<0.01) and lipid transport (2.2-fold enrichment; P<0.05) genes. We found a significant overrepresentation of sarcoglycan complex (11.1-fold enrichment; P<0.05) and focal adhesion pathway (1.9-fold enrichment; P<0.01) genes highlighting the role for muscle strength and integrity in the Thoroughbred athletic phenotype. We report for the first time candidate athletic-performance genes within regions targeted by selection in Thoroughbred horses that are principally responsible for fatty acid oxidation, increased insulin sensitivity and muscle strength: ACSS1 (acyl-CoA synthetase short-chain family member 1), ACTA1 (actin, alpha 1, skeletal muscle), ACTN2 (actinin, alpha 2), ADHFE1 (alcohol dehydrogenase, iron containing, 1), MTFR1 (mitochondrial fission regulator 1), PDK4 (pyruvate dehydrogenase kinase, isozyme 4) and TNC (tenascin C). Understanding the genetic basis for exercise adaptation will be crucial for the identification of genes within the complex molecular networks underlying obesity and its consequential pathologies, such as type 2 diabetes. Therefore, we propose Thoroughbred as a novel in vivo large animal model for understanding molecular protection against metabolic disease.Entities:
Mesh:
Year: 2009 PMID: 19503617 PMCID: PMC2685479 DOI: 10.1371/journal.pone.0005767
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Empirical distribution for Dh/sd.
The empirical distribution for Dh/sd scores (horizontal axis) from the Ewens-Watterson test. The vertical axis indicates the number of loci with the corresponding Dh/sd score. The positions in the distribution of the regions that have been subject to selection in Thoroughbred are indicated by the gene symbols for the strongest candidate genes. Genes in the negative tail-end of the distribution may have been subject to positive selection. Genes in the positive end of the distribution may have been subject to balancing selection.
Figure 2F ST versus heterozygosity.
F ST (vertical axis) versus heterozygosity (horizontal axis) plots for Thoroughbred, Akhal-Teke, Connemara and Tuva at 394 genome-wide microsatellite loci.
Population genetic diversity.
| Population | Geographic origin |
|
| Ave. no. alleles sampled/locus | SEM |
| Thoroughbred | United Kingdom and Ireland | 112 | 0.536 | 4.6 | 0.092 |
| Akhal-Teke | Turkmenistan | 18 | 0.587 | 4.0 | 0.077 |
| Connemara | Ireland | 17 | 0.630 | 4.5 | 0.086 |
| Tuva | Republic of Tuva | 17 | 0.672 | 5.0 | 0.099 |
Horse population origins, number of individuals genotyped (n), average observed heterozygosity (Ho), mean number of alleles sampled per locus and standard error of mean (SEM).
Figure 3Chromosome-wide F ST versus Thoroughbred heterozygosity.
F ST (solid line) versus Thoroughbred heterozygosity (dashed line) plots across chromosomes for the nine highest F ST regions with significant (P<0.05) deviations from expected heterozygosity (Dh/sd) in Thoroughbred. Loci defining selected genomic regions are highlighted. Left vertical axis: F ST; Right vertical axis: Thoroughbred heterozygosity; Horizontal axis: chromosome position (Mb).
Figure 4Empirical distribution for F ST.
The empirical distribution for global differentiation, F ST (horizontal axis). The vertical axis indicates the number of loci with the corresponding F ST. The positions of the regions that have been subject to positive selection in Thoroughbred are located in the tail-end of the distribution and are indicated by the gene symbols for the strongest candidate genes.
Overrepresented gene ontologies (Dh/sd).
| GO Biological Process | GO Identifier | Number of genes |
| Fold enrichment |
| Phosphoinositide-mediated signalling | GO:0048015 | 13 | 0.000 | 3.2 |
| Insulin receptor signalling pathway | GO:0008286 | 7 | 0.002 | 5.0 |
| G-protein signalling, coupled to IP3 second messenger (phospholipase C activating) | GO:0007200 | 10 | 0.003 | 3.4 |
| Spermatogenesis | GO:0007283 | 18 | 0.005 | 2.1 |
| Skeletal development | GO:0001501 | 18 | 0.007 | 2.0 |
| Porphyrin biosynthetic process | GO:0006779 | 4 | 0.035 | 5.5 |
| Peptidyl-histidine phosphorylation | GO:0018106 | 3 | 0.035 | 9.7 |
| Response to hydrogen peroxide | GO:0042542 | 4 | 0.039 | 5.2 |
| Lipid transport | GO:0006869 | 9 | 0.049 | 2.2 |
| Muscle development | GO:0007517 | 12 | <0.1 | 1.8 |
| Response to reactive oxygen species | GO:0000302 | 4 | <0.1 | 3.8 |
| Cellular lipid metabolic process | GO:0044255 | 32 | <0.1 | 1.3 |
|
| ||||
| Mitochondrion | GO:0005739 | 56 | 0 | 1.5 |
| Sarcoglycan complex | GO:0016012 | 3 | 0.027 | 11.1 |
|
| ||||
| Diacylglycerol kinase activity | GO:0004143 | 5 | 0 | 9.1 |
| Pyruvate dehydrogenase (acetyl-transferring) kinase activity | GO:0004740 | 3 | 0.008 | 20.4 |
| Oxidoreductase activity, acting on peroxide as acceptor | GO:0016684 | 5 | <0.1 | 3.2 |
| Electron donor activity | GO:0009053 | 2 | <0.1 | 27.3 |
| Phospholipid transporter activity | GO:0005548 | 4 | <0.1 | 3.8 |
Gene ontology biological processes, cellular components and molecular functions that were significantly overrepresented among genes within selected regions and may be relevant to Thoroughbred selection. The complete list is given in Table S2. Selected regions were defined by deviation from expected heterozygosity (Ewen-Watterson test) in Thoroughbred and included the four high F ST regions that had highly significant (P<0.01) reductions in Thoroughbred heterozygosity. The table shows the number of genes in each functional group, P-value for significance of over-representation and fold-enrichment.
Overrepresented gene ontologies (F ST).
| GO Biological Process | GO Identifier | Number of genes |
| Fold enrichment |
| Spermatogenesis | GO:0007283 | 13 | 0.007 | 2.5 |
| Cellular calcium ion homeostasis | GO:0006874 | 9 | 0.008 | 3.1 |
| Porphyrin biosynthetic process | GO:0006779 | 4 | 0.009 | 8.8 |
| Sexual reproduction | GO:0019953 | 16 | 0.001 | 2.0 |
| Skeletal development | GO:0001501 | 12 | 0.022 | 2.1 |
| Anatomical structure formation | GO:0048646 | 10 | 0.029 | 2.3 |
| Hydrogen peroxide catabolic process | GO:0042744 | 3 | 0.032 | 10.4 |
| Regulation of osteoblast differentiation | GO:0045667 | 3 | 0.037 | 9.6 |
| Hydrogen peroxide metabolic process | GO:0042743 | 3 | 0.043 | 8.9 |
| Mating | GO:0007618 | 3 | 0.043 | 8.9 |
| Regulation of catalytic activity | GO:0050790 | 19 | 0.047 | 1.6 |
| Insulin receptor signaling pathway | GO:0008286 | 4 | <0.01 | 4.6 |
| Muscle development | GO:0007517 | 9 | <0.01 | 2.1 |
| Response to oxidative stress | GO:0006979 | 6 | <0.01 | 2.6 |
| Actin filament organization | GO:0007015 | 4 | <0.01 | 3.9 |
| Detection of temperature stimulus | GO:0016048 | 2 | <0.01 | 20.8 |
|
| ||||
| Mitochondrion | GO:0005739 | 34 | 0.025 | 1.5 |
| Cortical actin cytoskeleton | GO:0030864 | 3 | <0.01 | 6.7 |
|
| ||||
| G-protein-coupled receptor binding | GO:0001664 | 7 | 0.008 | 4.0 |
| Oxidoreductase activity, acting on peroxide as acceptor | GO:0016684 | 5 | 0.014 | 5.3 |
| Pyruvate dehydrogenase (acetyl-transferring) kinase activity | GO:0004740 | 2 | <0.01 | 22.3 |
|
| ||||
| Focal adhesion | hsa04510 | 10 | <0.01 | 1.8 |
Gene ontology biological processes, cellular components, molecular functions and KEGG pathways that were significantly over-represented among genes within selected regions and may be relevant to Thoroughbred selection. The complete list is given in Table S3. Selected regions were defined by the nine high F ST regions that had significant (P<0.05) reductions in Thoroughbred heterozygosity. The table shows the number of genes in each functional group, P-value for significance of over-representation and fold-enrichment.
Candidate genes in selected regions (Dh/sd).
| Locus | Chr | Location EqCab2.0 (Mb) | Region delimited (Mb) |
|
|
|
| S.D. | Dh/sd |
|
|
| No. genes in region | Candidate genes |
|
| 9 | 18.9 | 16.9–20.3 | 128 | 9 | 0.437 | 0.820 | 0.041 | −9.436 | 0.000 | 0.258 | 0.061 | 16 |
|
|
| 25 | 25.7 | 15.7–28.1 | 166 | 6 | 0.254 | 0.715 | 0.068 | −6.775 | 0.000 | 0.352 | 0.020 | 67 |
|
|
| 4 | 38.6 | 34.4–40.1 | 216 | 5 | 0.133 | 0.656 | 0.085 | −6.117 | 0.000 | 0.450 | 0.005 | 32 |
|
|
| 18 | 59.1 | 54.2–63.5 | 178 | 8 | 0.563 | 0.790 | 0.047 | −4.872 | 0.001 | 0.118 | NS | 33 |
|
|
| 3 | 36.3 | 31.6–41.2 | 210 | 6 | 0.384 | 0.717 | 0.070 | −4.752 | 0.002 | 0.078 | NS | 53 |
|
|
| 22 | 40 | 37.6–46.0 | 190 | 5 | 0.273 | 0.657 | 0.086 | −4.444 | 0.004 | 0.135 | NS | 46 |
|
|
| 27 | 20.8 | 20.1–21.5 | 216 | 6 | 0.393 | 0.713 | 0.074 | −4.331 | 0.007 | 0.148 | NS | 12 |
|
|
| 28 | 43.1 | 42.1–42.7 | 202 | 8 | 0.591 | 0.789 | 0.047 | −4.244 | 0.003 | 0.098 | NS | 6 |
|
|
| 21 | 5.7 | 3.4–10.8 | 194 | 5 | 0.290 | 0.652 | 0.091 | −3.956 | 0.005 | 0.160 | NS | 35 |
|
|
| X | 19.4 | 18.6–21.5 | 118 | 5 | 0.324 | 0.665 | 0.088 | −3.890 | 0.006 | 0.020 | NS | 9 |
|
|
| 11 | 9.1 | 4.1–10.5 | 202 | 6 | 0.426 | 0.714 | 0.075 | −3.848 | 0.007 | 0.170 | NS | 77 |
|
|
| 8 | 64.2 | 57.5–64.3 | 142 | 6 | 0.441 | 0.716 | 0.075 | −3.686 | 0.010 | 0.130 | NS | 5 |
|
|
| 30 | 22.5 | 18.8–23.4 | 202 | 4 | 0.150 | 0.566 | 0.114 | −3.652 | 0.003 | 0.112 | NS | 16 |
|
|
| 19 | 45.2 | 40.9–45.8 | 216 | 6 | 0.463 | 0.715 | 0.069 | −3.646 | 0.007 | 0.039 | NS | 24 |
|
|
| 17 | 20.7 | 10.2–23.8 | 202 | 4 | 0.184 | 0.572 | 0.108 | −3.602 | 0.007 | 0.380 | 0.018 | 79 |
|
|
| 17 | 23.8 | 20.7–41.4 | 212 | 7 | 0.567 | 0.759 | 0.054 | −3.560 | 0.007 | 0.185 | NS | 46 |
|
|
| 4 | 34.4 | 32.3–38.6 | 180 | 7 | 0.552 | 0.759 | 0.058 | −3.541 | 0.007 | 0.253 | 0.066 | 21 |
|
|
| 3 | 8.7 | 7.9–9.8 | 122 | 4 | 0.745 | 0.579 | 0.108 | 1.54 | 0.007 | 0.027 | NS | 29 |
|
Candidate genes located within regions that have been subject to positive selection in Thoroughbred ranked by Dh/sd score; the locus with the strongest evidence for selection is listed first. Loci were localised to chromosomes and their positions were determined by in silico ePCR (Table S1). Each selected region was defined by loci flanking the central locus. The number of Thoroughbred chromosomes genotyped (n), number of observed alleles (ko), observed heterozygosity (Ho), expected heterozygosity (He), standard deviation (SD), deviation from expected heterozygosity (Dh/sd) and the associated statistical significance values (P) are shown. F ST values are also shown to indicate inter-population differentiation at each locus. Three loci were also found among the highest F ST regions listed in Table 5 (TKY316, TKY222 and NVHEQ079). The number of genes (with Homo sapiens functional annotation) located within each defined region is given. Loci NVHEQ079 and UM176 are located adjacent on ECA17 and their defined regions overlap, resulting in five common candidate genes listed for both regions.
Candidate genes in selected regions (F ST).
| Locus | Chr | Location EqCab2.0 (Mb) | Region delimited (Mb) |
|
| Heterozygosity | Dh/sd |
| No. genes in region | Candidate Genes | |||
| TB | AH | CON | TU | ||||||||||
|
| 21 | 46.7 | 44.5–48.3 | 0.613 | 0.000 | 0.032 | 0.677 | 0.426 | 0.583 | −1.034 | NS | 4 |
|
|
| 1 | 71.2 | 66.5–76.4 | 0.452 | 0.003 | 0.274 | 0.542 | 0.535 | 0.616 | −1.163 | NS | 39 |
|
|
| 4 | 38.6 | 34.4–40.1 | 0.450 | 0.005 | 0.133 | 0.667 | 0.635 | 0.799 | −6.117 | 0.000 | 32 |
|
|
| 22 | 0.2 | 0–11.0 | 0.423 | 0.008 | 0.000 | 0.063 | 0.520 | 0.121 | - | - | 49 |
|
|
| 5 | 12.2 | 7.8–21.8 | 0.404 | 0.010 | 0.153 | 0.633 | 0.500 | 0.514 | −2.151 | 0.042 | 73 |
|
|
| 11 | 36.8 | 21.9–39.7 | 0.392 | 0.013 | 0.030 | 0.606 | 0.616 | 0.544 | −2.916 | 0.002 | 208 |
|
|
| 10 | 55.8 | 55–57.8 | 0.388 | 0.015 | 0.243 | 0.556 | 0.750 | 0.634 | −1.437 | NS | 14 |
|
|
| 17 | 20.7 | 10.2–23.8 | 0.380 | 0.018 | 0.185 | 0.752 | 0.613 | 0.714 | −3.602 | 0.007 | 79 |
|
|
| 25 | 25.7 | 15.7–28.1 | 0.352 | 0.020 | 0.254 | 0.644 | 0.528 | 0.736 | −6.775 | 0.000 | 67 |
|
|
| 8 | 2.6 | 0–14.4 | 0.319 | 0.023 | 0.245 | 0.574 | 0.627 | 0.595 | −1.360 | NS | 133 |
|
|
| 27 | 17.2 | 14.6–20.1 | 0.316 | 0.025 | 0.390 | 0.684 | 0.679 | 0.801 | −1.654 | NS | 11 |
|
|
| 6 | 16.0 | 14.7–17.3 | 0.315 | 0.028 | 0.078 | 0.533 | 0.554 | 0.792 | −2.571 | 0.014 | 13 |
|
|
| 4 | 71.1 | 59.5–98.5 | 0.312 | 0.031 | 0.184 | 0.478 | 0.722 | 0.744 | −1.788 | NS | - | region too large (39 Mb) (includes |
|
| 17 | 41.4 | 23.8–59.9 | 0.310 | 0.033 | 0.083 | 0.495 | 0.203 | 0.705 | −2.569 | 0.013 | - | region too large (36.1 Mb) |
|
| 18 | 31.1 | 23.1–31.9 | 0.309 | 0.036 | 0.064 | 0.671 | 0.443 | 0.688 | −2.534 | 0.010 | 10 |
|
|
| 27 | 29.6 | 27.3–35.3 | 0.309 | 0.038 | 0.244 | 0.540 | 0.631 | 0.720 | −2.863 | 0.017 | 17 |
|
|
| 4 | 4.5 | 3.1–7.2 | 0.305 | 0.041 | 0.413 | 0.578 | 0.786 | 0.869 | −0.272 | NS | 25 |
|
Candidate genes located within regions that have been subject to positive selection in Thoroughbred ranked by F ST; the locus with the strongest evidence for selection is listed first. Heterozygosity values for Thoroughbred (TB), Akhal-Teke (AH), Connemara (CON) and Tuva (TU) are shown. Loci with lowest heterozygosity in Thoroughbred were included. Loci were localised to chromosomes and their positions were determined by in silico ePCR (Table S1). Each selected region was defined by loci flanking the central locus. Dh/sd scores and associated P-values for Thoroughbred are given to indicate within population deviation from expected heterozygosity. Four loci (TKY222, TKY033, NVHEQ079 and TKY316) had highly significant (P<0.01) Dh/sd scores while five (AHT024, UMNe197, COR032, TKY303 and VHL150) met a less stringent criteria (P<0.05). One locus was monomorphic in Thoroughbred and seven loci did not have significant deviations (NS) from expected heterozygosity in the Ewens-Watterson test. The number of genes (with Homo sapiens functional annotation) located within each defined region is given. Two regions (defined by HTG009 and COR032) were considered too large (39 Mb and 36 Mb) to reasonably search for candidate genes, however LEP (Leptin), associated with obesity in mouse [137], is located at 83.4 Mb on ECA4. COR032 is located adjacent to NVHEQ079 and UM176 on ECA17 together defining a 48 Mb region that may contain a number of strongly selected genes.