| Literature DB >> 31698676 |
Elaine Norton1, Nichol Schultz1, Ray Geor2, Dianne McFarlane3, James Mickelson4, Molly McCue1.
Abstract
Equine metabolic syndrome (EMS) is a complex trait for which few genetic studies have been published. Our study objectives were to perform within breed genome-wide association analyses (GWA) to identify associated loci in two high-risk breeds, coupled with meta-analysis to identify shared and unique loci between breeds. GWA for 12 EMS traits identified 303 and 142 associated genomic regions in 264 Welsh ponies and 286 Morgan horses, respectively. Meta-analysis demonstrated that 65 GWA regions were shared across breeds. Region boundaries were defined based on a fixed-size or the breakdown of linkage disequilibrium, and prioritized if they were: shared between breeds or across traits (high priority), identified in a single GWA cohort (medium priority), or shared across traits with no SNPs reaching genome-wide significance (low priority), resulting in 56 high, 26 medium, and seven low priority regions including 1853 candidate genes in the Welsh ponies; and 39 high, eight medium, and nine low priority regions including 1167 candidate genes in the Morgans. The prioritized regions contained protein-coding genes which were functionally enriched for pathways associated with inflammation, glucose metabolism, or lipid metabolism. These data demonstrate that EMS is a polygenic trait with breed-specific risk alleles as well as those shared across breeds.Entities:
Keywords: EMS; fat metabolism; genetic risk factors; genetics; genome-wide association analysis; horses; insulin dysregulation
Mesh:
Substances:
Year: 2019 PMID: 31698676 PMCID: PMC6895807 DOI: 10.3390/genes10110893
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure A1Principal components analysis for the Welsh ponies showing clustering between sections.
Figure 1Flow chart of the prioritization of the regions identified on GWA.
Figure 2Manhattan plots of the genome-wide association results for insulin concentration post oral sugar test in (A) Morgan horses and (B) the section A and B Welsh ponies. The equine chromosomes (ECA) are plotted on the x-axis and the −log of the p-value is plotted on the y-axis. The blue line indicates the suggestive threshold (1.0 × 10−05) and the red line represents the genome-wide significant threshold (7.61 × 10−08 in the Morgan horses and 5.98 × 10−08 the Welsh ponies). In the section A, B, C and D Welsh ponies (not shown) and A and B Welsh ponies (B), the same region on ECA10 exceeds the suggestive threshold, which was also identified as an ROI in the Morgan horses (A). GWA meta-analysis identified this region as shared across both breeds.
Meta-analysis across breeds and across EMS metabolic traits.
| Trait | Chr | Min_SNP | Max_SNP | Sugg_SNPs | Sign_SNPs | FE |
|---|---|---|---|---|---|---|
|
| 5 | 44104129 | 45081679 | 39 | 4 | - |
| 15 | 5887873 | 6225014 | 21 | 2 | - | |
| 24 | 28804043 | 29076914 | 3 | 2 | - | |
|
| 10 | 71620835 | 72425049 | 19 | 4 | X |
| 28 | 38307699 | 38344594 | 2 | 1 | - | |
|
| 4 | 18053357 | 18550035 | 20 | 1 | - |
| 8 | 9289661 | 9312611 | 2 | 1 | - | |
|
| 3 | 55982921 | 56558742 | 57 | 39 | X |
| 4 | 27802674 | 28514796 | 18 | 4 | X | |
| 15 | 79697363 | 79717603 | 3 | 3 | - | |
| 28 | 34861664 | 34868420 | 2 | 2 | - | |
|
| 1 | 183532379 | 184178932 | 21 | 15 | X |
| 17 | 13355958 | 14014858 | 23 | 1 | X | |
| 24 | 20975408 | NA | 1 | 1 | - | |
| 30 | 20148173 | 20205201 | 10 | 3 | X | |
|
| 2 | 16725632 | 17531903 | 25 | 19 | X |
| 4 | 37105938 | 37523046 | 6 | 2 | X | |
| 6 | 31582345 | 31708194 | 17 | 1 | X | |
| 6 | 67097628 | 68036518 | 16 | 1 | - | |
| 18 | 41399862 | 41533081 | 9 | 1 | - | |
| 18 | 60138400 | 60241267 | 10 | 2 | - | |
| 20 | 3447045 | 3609674 | 10 | 4 | X | |
|
| 7 | 65731012 | 65804974 | 6 | 3 | X |
| 10 | 871456 | NA | 1 | 1 | - | |
| 19 | 48839140 | 49627683 | 44 | 22 | X | |
| 24 | 28551544 | 28744981 | 17 | 6 | - | |
|
| 1 | 69730886 | 70257187 | 4 | 1 | - |
| 1 | 82755708 | 82879246 | 10 | 1 | - | |
| 3 | 41684754 | NA | 1 | 1 | - | |
| 3 | 101236287 | 101618645 | 42 | 24 | X | |
| 5 | 28822515 | 29342972 | 12 | 3 | X | |
| 10 | 78846710 | NA | 1 | 1 | - | |
|
| 1 | 88009187 | NA | 1 | 1 | - |
| 3 | 58464229 | NA | 1 | 1 | - | |
| 4 | 51903203 | 53474757 | 64 | 40 | - | |
| 6 | 63614756 | 63814984 | 20 | 10 | - | |
| 9 | 22745020 | NA | 1 | 1 | - | |
| 9 | 33549797 | 34165892 | 31 | 1 | - | |
| 11 | 18987272 | 19176693 | 10 | 8 | - | |
| 14 | 63778931 | 63876998 | 7 | 2 | X | |
| 19 | 1134701 | 1139669 | 2 | 2 | - | |
| 19 | 32230245 | 33643392 | 55 | 2 | - | |
| 20 | 39797561 | 40162785 | 7 | 4 | X | |
| 20 | 59659997 | 60403627 | 11 | 4 | X | |
| 21 | 20193411 | 21256032 | 18 | 11 | X | |
| 24 | 33852631 | 34812035 | 36 | 23 | X | |
|
| 1 | 121484057 | 121775873 | 47 | 19 | - |
|
| 131512239 | 131621826 | 3 | 3 | X | |
| 4 | 84181768 | 85275183 | 29 | 11 | X | |
| 11 | 18987272 | 19176693 | 10 | 9 | X | |
| 17 | 32120145 | 32544617 | 23 | 4 | X | |
| 19 | 28934939 | NA | 1 | 1 | - | |
| 20 | 63560971 | 63691145 | 10 | 6 | - | |
| 22 | 40135963 | 40167502 | 4 | 4 | X | |
|
| 1 | 49077969 | NA | 1 | 1 | - |
| 2 | 36104151 | 36108219 | 6 | 6 | - | |
| 4 | 17765473 | 18991639 | 11 | 3 | - | |
| 12 | 29378128 | 30296509 | 19 | 11 | X | |
| 14 | 88430222 | 89591967 | 20 | 5 | - | |
| 18 | 31679672 | 33134556 | 51 | 26 | X | |
| 19 | 28057756 | 28417335 | 5 | 2 | - | |
| 19 | 57605404 | 58429206 | 36 | 20 | X | |
| 22 | 3565315 | 4307679 | 62 | 38 | X | |
| 23 | 12226548 | 12763020 | 35 | 24 | X | |
| 28 | 9446507 | 9643240 | 13 | 5 | X |
To be considered an MA-ROI, at least one SNP had to exceed the threshold for genome-wide significance (1.6 × 10−07) on meta-analysis. Provided is the base pair position of the lowest (Min_SNP) and highest (Max_SNP) SNP, as well as the number of SNPs per region which exceeded the suggestive (Sugg_SNPs) and genome-wide significance (Sign_SNPs) threshold. Regions shared across two traits in the meta-analysis are represented by the corresponding highlighted chromosomes (Chr). Regions which were statistically significant using a standard fixed effects models (FE) are indicated by an X.
LD-bound high priority GWA regions for the Welsh pony cohorts.
| Trait | Chr | Min | Max | Protein | RNA | Total |
|---|---|---|---|---|---|---|
|
| 5 | 35409104 | 44806458 | 267 | 38 | 306 |
| 8 | 69350844 | 75906595 | 32 | 23 | 55 | |
| 15 | 5748377 | 6612684 | 0 | 1 | 1 | |
| 18 | 78720858 | 79634082 | 2 | 4 | 6 | |
| 24 | 28451012 | 29887250 | 2 | 4 | 6 | |
|
| 1 | 176773704 | 176873704 | 0 | 1 | 1 |
| 8 | 73173455 | 73699198 | 1 | 1 | 2 | |
| 10 | 71967783 | 72438937 | 3 | 0 | 3 | |
| 28 | 39322188 | 39488807 | 8 | 1 | 9 | |
|
| 15 | 83728178 | 83828178 | 2 | 0 | 2 |
|
| 28 | 34271949 | 35138699 | 9 | 0 | 9 |
|
| 1 | 171861236 | 178270042 | 25 | 24 | 49 |
| 18 | 60060215 | 61349045 | 7 | 6 | 13 | |
|
| 5 | 39751797 | 50431769 | 207 | 32 | 239 |
| 6 | 488137 | 4012580 | 15 | 10 | 25 | |
| 7 | 65678376 | 68117086 | 1 | 2 | 3 | |
| 10 | 692055 | 1068890 | 0 | 5 | 5 | |
| 21 | 22940681 | 23516697 | 1 | 0 | 1 | |
|
| 19 | 1005718 | 1105718 | 2 | 0 | 2 |
| 28 | 32909542 | 35703535 | 65 | 11 | 76 | |
|
| 1 | 42944403 | 45232767 | 5 | 4 | 9 |
| 1 | 69558737 | 70960589 | 7 | 16 | 23 | |
| 10 | 55060512 | 56255134 | 1 | 1 | 2 | |
| 10 | 78795710 | 80306613 | 20 | 5 | 25 | |
| 20 | 60381850 | 60481850 | 0 | 0 | 0 | |
|
| 4 | 67130904 | 69873296 | 8 | 8 | 16 |
| 4 | 77298241 | 81186565 | 24 | 15 | 40 | |
| 4 | 83144842 | 83244842 | 1 | 0 | 1 | |
| 7 | 93176991 | 93628686 | 0 | 1 | 1 | |
| 9 | 32632235 | 37587269 | 10 | 8 | 18 | |
| 11 | 18342117 | 19876247 | 55 | 4 | 60 | |
| 14 | 63702522 | 63847210 | 0 | 2 | 2 | |
| 20 | 40244007 | 41210876 | 3 | 11 | 14 | |
| 20 | 60723014 | 61735694 | 0 | 2 | 2 | |
| 21 | 5280993 | 6396786 | 2 | 6 | 8 | |
| 21 | 19515280 | 25046226 | 22 | 27 | 49 | |
| 24 | 31843480 | 36758215 | 47 | 10 | 57 | |
|
| 1 | 132184772 | 133716124 | 9 | 7 | 16 |
| 4 | 68425678 | 69636837 | 5 | 1 | 6 | |
| 4 | 70026254 | 81648125 | 49 | 45 | 95 | |
| 4 | 82570011 | 86366835 | 49 | 25 | 75 | |
| 7 | 93191676 | 93628672 | 0 | 1 | 1 | |
| 11 | 15414337 | 16451463 | 24 | 1 | 25 | |
| 11 | 18613895 | 19317536 | 26 | 0 | 26 | |
| 18 | 79527484 | 81467661 | 13 | 12 | 25 | |
| 19 | 31204596 | 31799125 | 0 | 0 | 0 | |
| 20 | 29486630 | 30976763 | 54 | 8 | 62 | |
| 20 | 59464566 | 61015217 | 1 | 2 | 3 | |
| 20 | 64722427 | 65336095 | 1 | 3 | 4 | |
| 21 | 20611963 | 22057711 | 3 | 4 | 7 | |
| 22 | 41032889 | 41066045 | 0 | 0 | 0 | |
| 25 | 19435041 | 19535041 | 4 | 0 | 4 | |
|
| 1 | 49391032 | 49491032 | 0 | 1 | 1 |
| 2 | 35880861 | 36665473 | 8 | 6 | 14 | |
| 19 | 57082025 | 62825378 | 42 | 16 | 59 | |
| 28 | 9990892 | 10844823 | 4 | 0 | 4 | |
|
| 1146 | 415 | 1567 |
Final region boundaries were based on LD and are indicated by the lowest base pair position (Min) and the highest base pair position (Max). The total number of genes includes all protein-coding genes, pseudogenes, and RNA genes identified for region based on EquCab3. Shared regions across prioritized traits are indicated by highlighted chromosomes.
LD-bound high priority GWA regions for the Morgan horses.
| Trait | Chr | Min | Max | Protein | RNA | Total |
|---|---|---|---|---|---|---|
|
| 24 | 21134897 | 21184897 | 1 | 0 | 1 |
|
| 4 | 28373202 | 28423202 | 0 | 0 | 0 |
| 6 | 32751552 | 34029749 | 12 | 10 | 22 | |
| 10 | 71666607 | 73534053 | 6 | 5 | 12 | |
|
| 4 | 17239374 | 19043831 | 6 | 5 | 11 |
| 8 | 11193683 | 12404572 | 8 | 9 | 17 | |
|
| 3 | 55746338 | 58085997 | 15 | 6 | 21 |
| 4 | 26695616 | 29116058 | 5 | 4 | 9 | |
|
| 1 | 184859013 | 187238015 | 24 | 17 | 41 |
| 17 | 12653835 | 14464765 | 4 | 2 | 6 | |
| 24 | 20287835 | 20973401 | 15 | 1 | 16 | |
| 30 | 20915473 | 21380977 | 0 | 0 | 0 | |
|
| 2 | 16362904 | 18105119 | 21 | 21 | 42 |
| 4 | 34723398 | 39321960 | 36 | 10 | 47 | |
| 6 | 32486287 | 32841880 | 3 | 4 | 7 | |
| 6 | 64297403 | 71493047 | 168 | 22 | 191 | |
| 18 | 41448414 | 41498414 | 0 | 1 | 1 | |
| 20 | 3649052 | 4325872 | 8 | 3 | 11 | |
|
| 4 | 51590680 | 52810437 | 4 | 5 | 9 |
| 19 | 51286493 | 53959028 | 7 | 14 | 21 | |
| 24 | 25564765 | 29384679 | 7 | 14 | 21 | |
| 1 | 82700933 | 84269783 | 18 | 5 | 24 | |
| 3 | 42674448 | 44422013 | 2 | 7 | 10 | |
| 3 | 102944842 | 103801021 | 2 | 4 | 6 | |
| 5 | 25378878 | 27689002 | 12 | 16 | 28 | |
|
| 1 | 82097718 | 83618523 | 14 | 6 | 20 |
| 4 | 52024470 | 54237747 | 8 | 12 | 20 | |
| 6 | 60410647 | 70570773 | 144 | 26 | 172 | |
| 19 | 661978 | 1345372 | 4 | 1 | 6 | |
| 19 | 32962795 | 37391949 | 53 | 19 | 73 | |
|
| 1 | 120644115 | 124691346 | 36 | 20 | 56 |
| 17 | 31806060 | 33720086 | 3 | 3 | 7 | |
|
| 4 | 17301415 | 19812653 | 8 | 7 | 16 |
| 12 | 32885278 | 34800986 | 29 | 16 | 45 | |
| 14 | 87916190 | 91602875 | 32 | 26 | 58 | |
| 18 | 30095266 | 35177011 | 23 | 13 | 36 | |
| 19 | 30133826 | 30183826 | 2 | 0 | 2 | |
| 22 | 2843476 | 5225020 | 13 | 10 | 23 | |
| 23 | 7656404 | 12984095 | 11 | 23 | 34 | |
|
| 764 | 367 | 1142 |
Final region boundaries were based on LD and are indicated by the lowest base pair position (Min) and the highest base pair position (Max). The total number of genes includes all protein-coding genes, pseudogenes, and RNA genes identified for region based on EquCab3. Shared regions across prioritized traits are indicated by highlighted chromosomes.
Repeatability across results for the Bayesian sparse linear mixed model (BSLMM).
| 10 M Iterations | 10 M Iterations | 20 M Iterations | 30 M Iterations | |||||
|---|---|---|---|---|---|---|---|---|
| CHR | Sugg | Sign | Sugg | Sign | Sugg | Sign | Sugg | Sign |
|
| 2 | 0 | NA | NA | NA | NA | NA | NA |
|
| 1 | 0 | NA | NA | 1 | 0 | 4 | 0 |
|
| 38 | 27 | 5 | 1 | 32 | 2 | 5 | 1 |
|
| NA | NA | NA | NA | 1 | NA | ||
|
| 54 | 4 | 25 | 2 | 3 | 0 | 5 | 2 |
|
| 68 | 4 | 8 | 1 | 11 | 2 | 11 | 2 |
|
| 14 | 0 | 10 | 0 | 5 | 0 | 8 | 0 |
|
| 6 | 0 | NA | NA | NA | NA | NA | NA |
|
| NA | NA | 2 | 0 | NA | NA | NA | NA |
|
| NA | NA | NA | NA | 1 | 0 | NA | NA |
|
| 6 | 1 | 5 | 1 | 5 | 2 | 12 | 1 |
|
| NA | NA | NA | NA | 2 | 0 | NA | NA |
|
| NA | NA | 1 | 0 | NA | NA | NA | NA |
|
| NA | NA | NA | NA | NA | NA | 3 | 0 |
|
| NA | NA | NA | NA | NA | NA | 1 | 0 |
|
| NA | NA | NA | NA | 4 | 0 | NA | NA |
|
| NA | NA | 11 | 0 | NA | NA | NA | NA |
|
| NA | NA | 4 | 1 | 16 | 0 | NA | NA |
Repeatability across results for the Bayesian sparse linear mixed model (BSLMM) using the max gamma values from 10 million (M) iterations with seeds 1–10, 10 M iterations with seeds 11–20, 20 M iterations with seeds 1–10, and 30 M iterations with seeds 1–10 for adiponectin concentrations in the Morgan horses. Regions which are highlighted in yellow indicate those which would have been identified as a region of interest (contained a minimum of five SNPs exceeding the suggestive threshold, with at least one SNP exceeding the threshold for genome wide significance). Abbreviations: Sugg (total number of SNPs which exceeded the suggested threshold for genome-wide significance), Sign (total number of SNPs which exceed the threshold for genome-wide significance).