| Literature DB >> 28379989 |
Lauren A Baker1, Brian Kirkpatrick2, Guilherme J M Rosa2, Daniel Gianola2, Bruno Valente2,3, Julia P Sumner4, Wendy Baltzer5, Zhengling Hao1, Emily E Binversie1, Nicola Volstad1, Alexander Piazza1, Susannah J Sample1, Peter Muir1.
Abstract
Anterior cruciate ligament (ACL) rupture is a common condition that can be devastating and life changing, particularly in young adults. A non-contact mechanism is typical. Second ACL ruptures through rupture of the contralateral ACL or rupture of a graft repair is also common. Risk of rupture is increased in females. ACL rupture is also common in dogs. Disease prevalence exceeds 5% in several dog breeds, ~100 fold higher than human beings. We provide insight into the genetic etiology of ACL rupture by genome-wide association study (GWAS) in a high-risk breed using 98 case and 139 control Labrador Retrievers. We identified 129 single nucleotide polymorphisms (SNPs) within 99 risk loci. Associated loci (P<5E-04) explained approximately half of phenotypic variance in the ACL rupture trait. Two of these loci were located in uncharacterized or non-coding regions of the genome. A chromosome 24 locus containing nine genes with diverse functions met genome-wide significance (P = 3.63E-0.6). GWAS pathways were enriched for c-type lectins, a gene set that includes aggrecan, a gene set encoding antimicrobial proteins, and a gene set encoding membrane transport proteins with a variety of physiological functions. Genotypic risk estimated for each dog based on the risk contributed by each GWAS locus showed clear separation of ACL rupture cases and controls. Power analysis of the GWAS data set estimated that ~172 loci explain the genetic contribution to ACL rupture in the Labrador Retriever. Heritability was estimated at 0.48. We conclude ACL rupture is a moderately heritable highly polygenic complex trait. Our results implicate c-type lectin pathways in ACL homeostasis.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28379989 PMCID: PMC5381864 DOI: 10.1371/journal.pone.0173810
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The Labrador Retriever is a dog breed with a relatively low amount of inbreeding.
Whiskers represent the standard error of the mean for each analysis method (n = 237 dogs).
Fig 2Linear mixed model GWAS corrects for population structure and identifies 99 ACL associated loci explaining a large proportion of phenotypic variance.
For each linear mixed model (LMM), the QQ plots show no evidence of population stratification relative to the expected distribution. Permutation testing with each model determined genome-wide significance at (a) P<3.63E-7 for GCTA (Genome-wide Complex Trait Analysis) [33], λ = 0.987 (b) P<6.097E-7 for GEMMA (Genome-wide Efficient Mixed Model Association) [34], λ = 0.994 and (c) P<4.01E-7 for PUMA (Penalized Unified Multiple-locus Association) [35], λ = 1.012. The plots represent analysis of 118,992 SNPs from 98 cases and 139 phenotype-negative controls. (d) With GCTA, 36 loci have P<5E-4, with the most significant locus located in CFA 24, which did not meet genome-wide significance defined by minimum p-values from permutation testing. (e) With GEMMA, 47 loci have P<5E-4, with the locus on CFA 24 meeting genome-wide significance defined by minimum p-values from permutation testing. (f) With PUMA, 65 loci were significant at P<5E-4 and the locus on CFA 24 exceeded genome-wide significance defined by minimum p-values from permutation testing. The single SNP that met genome-wide significance lies within the gene PPP1R16B.
ACL rupture associated loci identified by GWAS in the Labrador Retriever, a dog breed with a high disease prevalence.
| SNP | Chr | Position | LMM | Risk allele | f(A) | f(U) | OR | Region start-end | Size (kb) | Genes | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| BICF2G630500368 | 24 | 30241088 | 2.76E-07 | 1,2,3 | G | 0.83 | 0.66 | 2.56 | 30241088–30245795 | 5 | |
| BICF2P1121006 | 18 | 54279578 | 1.11E-05 | 1,2,3 | A | 0.63 | 0.42 | 2.28 | No LD | Many (40 genes) | |
| BICF2S2356299 | 27 | 30557856 | 2.21E-05 | 2,3 | A | 0.43 | 0.27 | 2.03 | No LD | ||
| BICF2P483191 | 29 | 21601273 | 2.31E-05 | 1,2,3 | C | 0.73 | 0.51 | 2.54 | No LD | C29H8orf34, | |
| BICF2P50610 | 11 | 32270617 | 2.75E-05 | 3 | A | 0.29 | 0.19 | 1.7 | 31939564–32270617 | 331 | C11H9orf123, |
| BICF2P890246 | 9 | 53427907 | 3.23E-05 | 1,2 | A | 0.16 | 0.36 | 2.99 | 53427907–53432248 | 4 | Many (20 genes) |
| BICF2S23324965 | 6 | 14077648 | 3.36E-05 | 3 | G | 0.68 | 0.60 | 1.42 | 14077648–14092057 | 14 | |
| BICF2P544126 | 24 | 29772193 | 4.09E-05 | 3 | G | 0.94 | 0.87 | 2.28 | 29772193–29794411 | 22 | |
| BICF2P526639 | 27 | 39217437 | 4.12E-05 | 2,3 | G | 0.23 | 0.12 | 2.18 | 39211186–39217437 | 6 | |
| BICF2P1462185 | 20 | 15053718 | 4.89E-05 | 3 | A | 0.85 | 0.74 | 1.90 | 14838270–15053718 | 215 | |
| BICF2P1208798 | 9 | 12671217 | 5.49E-05 | 1,2 | G | 0.56 | 0.36 | 2.27 | No LD | Many (20 genes) | |
| BICF2G630175389 | 4 | 84260906 | 5.87E-05 | 1,2 | A | 0.83 | 0.68 | 2.28 | No LD | ||
| BICF2S24415473 | 3 | 86974042 | 7.07E-05 | 1,2 | G | 0.40 | 0.26 | 1.97 | 86948527–86974042 | 26 | |
| BICF2G630412697 | 30 | 3126573 | 7.22E-05 | 1,3 | G | 0.96 | 0.86 | 4.23 | No LD | ||
| BICF2P498515 | 6 | 75848537 | 7.89E-05 | 1,2,3 | A | 0.16 | 0.06 | 3.11 | No LD | ||
| BICF2P792911 | 26 | 22894961 | 8.55–05 | 1,2,3 | G | 0.44 | 0.27 | 2.14 | No LD | ||
| BICF2G630810143 | 6 | 11130832 | 9.46E-05 | 3 | A | 0.44 | 0.32 | 1.72 | 11130832–11177149 | 46 | |
| BICF2P564273 | 3 | 55250188 | 1.07E-04 | 1,2,3 | A | 0.70 | 0.52 | 2.16 | No LD | ||
| TIGRP2P297337 | 22 | 58201452 | 1.08E-04 | 1,2,3 | A | 0.44 | 0.27 | 2.2 | No LD | ||
| BICF2G630658881 | 21 | 7582214 | 1.09E-04 | 1,2,3 | G | 0.49 | 0.32 | 2.12 | 7581714–8383209 | 800 |
Note: OR (odds ratio) calculated from PLINK [32]. LMM Linear mixed model 1 –GCTA [33], 2 –GEMMA [34], 3 –PUMA [35]. F(A) and F(U) represent the frequency of the risk allele in case and control dogs, respectively. Data represent the twenty most significant loci of 99 associations with canine ACL rupture. SNP position and genomic regions are based on canFam2. Genes lists were derived from the SNP locus or LD block with 500kb flanking regions after liftover to canFam 3.1.
Fig 3Phenotype variance was explained to a large degree by the associated genomic loci.
Loci identified by linear mixed model (LMM) analysis were broadly defined as SNPs with r2>0.5 within 5Mb of the peak SNP. (a) For GCTA, 36 loci in 72.7Mb of the genome explained 48.09% of the phenotypic variance. (b) For GEMMA, 47 loci in 82.7Mb of the genome explained 55.88% of the phenotypic variance. (c) For PUMA, 65 loci in 86.58Mb of the genome explained 50.28% of the phenotypic variance in the ACL rupture trait. Whiskers represent the standard error of the mean.
Genetic risk scoring in ACL rupture case and control Labrador Retriever dogs using GWAS associated SNPs from linear mixed model analysis.
| LMM | Number of Risk alleles | Significance | |
|---|---|---|---|
| Control | Case | ||
| GCTA | 24 (16, 40) | 35 (24, 43) | |
| GEMMA | 33 (22, 43) | 45 (29, 55) | |
| PUMA | 62 (37, 84) | 77 (56, 99) | |
Note: Data represent median (range) for allele counting (cGRS) [51]. LMM Linear mixed models used were GCTA [33], GEMMA [34], PUMA [35]. The Mann-Whitney U test was used to determine significance.
Fig 4Genetic risk scoring [42] using GWAS associated loci from linear mixed model analysis with GEMMA segregates ACL rupture disease risk in case and control Labrador Retriever dogs.
(a) Distribution of the number of ACL rupture risk loci in case and control groups of Labrador Retriever dogs. The number of risk alleles in cases and controls is significantly different (P<2.2E-16) (b) ACL rupture odds ratios of weighted genetic risk scores (wGRS) relative to the first quartile. Vertical bars represent the 95% confidence intervals. * Odds ratio is significantly different from the reference first quartile.
Receiver Operating Characteristic (ROC) analysis of genetic risk scoring in ACL rupture case and control Labrador Retriever dogs using GWAS associated SNPs from linear mixed model analysis.
| LMM | cGRS | wGRS | ||
|---|---|---|---|---|
| AUC | 95% confidence interval | AUC | 95% confidence interval | |
| GCTA | 0.9487 | 0.9191–0.9725 | 0.9464 | 0.9183–0.9694 |
| GEMMA | 0.9634 | 0.9369–0.9824 | 0.9601 | 0.933–0.9801 |
| PUMA | 0.8842 | 0.8356–0.9158 | 0.8909 | 0.8458–0.9263 |
Note: LMM Linear mixed models used were GCTA [33], GEMMA [34], PUMA [35]. AUC Area under the ROC curve. Simple risk alleles count method (cGRS) and weighted method (wGRS) [42].
*Significantly different from GCTA and GEMMA (P<0.005 for cGRS and P<0.05 for wGRS).
Functional annotation clustering of genes in regions associated with anterior cruciate ligament rupture identified by GWAS in the Labrador Retriever.
| Cluster Term | Gene | Location start-end (CanFam 3.1) |
|---|---|---|
| chr27:35938182–35964350 | ||