| Literature DB >> 31323019 |
Lea I Mikkola1,2,3, Saila Holopainen1,2,3,4, Anu K Lappalainen4, Tiina Pessa-Morikawa1, Thomas J P Augustine1, Meharji Arumilli1,2,3, Marjo K Hytönen1,2,3, Osmo Hakosalo1,2,3, Hannes Lohi1,2,3, Antti Iivanainen1.
Abstract
Canine hip dysplasia is a common, non-congenital, complex and hereditary disorder. It can inflict severe pain via secondary osteoarthritis and lead to euthanasia. An analogous disorder exists in humans. The genetic background of hip dysplasia in both species has remained ambiguous despite rigorous studies. We aimed to investigate the genetic causes of this disorder in one of the high-risk breeds, the German Shepherd. We performed genetic analyses with carefully phenotyped case-control cohorts comprising 525 German Shepherds. In our genome-wide association studies we identified four suggestive loci on chromosomes 1 and 9. Targeted resequencing of the two loci on chromosome 9 from 24 affected and 24 control German Shepherds revealed deletions of variable sizes in a putative enhancer element of the NOG gene. NOG encodes for noggin, a well-described bone morphogenetic protein inhibitor affecting multiple developmental processes, including joint development. The deletion was associated with the healthy controls and mildly dysplastic dogs suggesting a protective role against canine hip dysplasia. Two enhancer variants displayed a decreased activity in a dual luciferase reporter assay. Our study identifies novel loci and candidate genes for canine hip dysplasia, with potential regulatory variants in the NOG gene. Further research is warranted to elucidate how the identified variants affect the expression of noggin in canine hips, and what the potential effects of the other identified loci are.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31323019 PMCID: PMC6668854 DOI: 10.1371/journal.pgen.1008197
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Genome-wide association for canine hip dysplasia: FASTA and QTSCORE, N(cases) = 132, N(controls) = 160.
For the upper two figure segments the red horizontal lines are the thresholds for Bonferroni correction for significance. The blue horizontal line is the threshold for significance for independent tests. In the undermost segment, where the permuted P-values are shown, the single red line represents the threshold for genome-wide significance level of 0.05.
Fig 2Genome-wide association for canine hip dysplasia: First meta-analysis, N(cases) = 132, N(controls) = 277.
For the upper two figure segments the red horizontal line is the threshold for Bonferroni correction for significance. The blue horizontal line is the threshold for significance for independent tests. In the undermost segment, where the permuted P-values are shown, the single red line represents the threshold for genome-wide significance level of 0.05.
Fig 3Genome-wide association for canine hip dysplasia: Second meta-analysis, N(cases) = 247, N(controls) = 277.
For the upper figure segment the red horizontal line is the threshold for Bonferroni correction for significance. The blue horizontal line is the threshold for significance for independent tests. In the undermost segment, where the permuted P-values are shown, the single red line represents the threshold for genome-wide significance level of 0.05.
Adjusted P-values, empirical genome-wide significance levels and odds ratios for the most significant SNPs from the original genome-wide association analyses.
| SNP name (CanFam3.1) | Alleles (Risk / Non-risk) | Locus | GWAS (N = 292) | ||||
|---|---|---|---|---|---|---|---|
| FASTA: P-value after inflation factor correction (lambda = 1.02) | QTSCORE: P-value after inflation factor correction (lambda = 0.76) | Genome-wide significance (24159 permutations, lambda = 0.76) | Genotypic odds ratios with 95% CI (ORAa; ORAA) | Allelic odds ratios (ORA) | |||
| TIGRP2P126044 | C/T | Chr9: 30993502 | 5.8x10-5 | 7.7x10-5 | 0.82 | 0.32 [0.17–0.61]; 0.18 [0.09–0.36] | 0.41 [0.29–0.57] |
| BICF2S23027935 | A/G | Chr9: 31300189 | 1.2x10-5 | 1.9x10-5 | 0.36 | 0.29 [0.15–0.55]; 0.17 [0.08–0.34] | 0.40 [0.29–0.56] |
| BICF2P742007 | C/T | Chr9: 31387114 | 7.8x10-6 | 1.4x10-5 | 0.29 | 0.61 [0.34–1.09]; 0.18 [0.09–0.35] | 0.38 [0.27–0.53] |
| BICF2G630834826 | A/G | Chr9: 31477907 | 3.8x10-5 | 5.5x10-5 | 0.71 | 0.60 [0.34–1.06]; 0.22 [0.11–0.42] | 0.43 [0.31–0.60] |
| BICF2G630835002 | C/T | Chr9: 31772306 | 4.1x10-5 | 6.6x10-5 | 0.77 | 0.61 [0.34–1.09]; 0.20 [0.10–0.39] | 0.42 [0.30–0.59] |
| BICF2G630835188 | G/T | Chr9: 32166146 | 4.6x10-5 | 8.1x10-5 | 0.83 | 0.42 [0.23–0.77]; 0.24 [0.13–0.46] | 0.45 [0.32–0.63] |
| BICF2G630835223 | G/A | Chr9: 32271830 | 5.2x10-5 | 7.2x10-5 | 0.79 | 0.57 [0.32–1.01]; 0.22 [0.11–0.42] | 0.43 [0.31–0.60] |
| BICF2P283506 | T/C | Chr9: 32382532 | 5.5x10-5 | 8.0x10-5 | 0.83 | 0.56 [0.32–0.99]; 0.23 [0.12–0.44] | 0.44 [0.32–0.61] |
| BICF2G630837209 | G/A | Chr9: 36543581 | 4.2x10-5 | 5.9x10-5 | 0.73 | 2.55 [1.32–4.94]; 4.90 [2.44–9.85] | 2.28 [1.63–3.19] |
| BICF2G630837233 | G/T | Chr9: 36565413 | 4.2x10-5 | 7.5x10-5 | 0.80 | 2.55 [1.32–4.94]; 4.77 [2.38–9.57] | 2.25 [1.61–3.15] |
| BICF2G630837240 | A/G | Chr9: 36579921 | 2.9x10-5 | 5.5x10-5 | 0.71 | 2.32 [1.19–4.52]; 4.58 [2.29–9.15] | 2.32 [1.60–3.16] |
ORAa, odds ratio for genotype Aa relative to the genotype aa; ORAA, odds ratio for genotype AA relative to the genotype aa; ORA, odds ratio for allele A relative to the allele a. Only the SNPs that showed association with the disorder after both statistical methods (FASTA and QTSCORE) with P-value threshold of 1.0x10-4 were included.
*Complementary strand to what is indicated in Table 2.
Adjusted P-values, empirical genome-wide significance levels and odds ratios for the top SNPs after the meta-analysis in moderate-to-severe cases compared with controls.
| SNP name (CanFam3.1) | Alleles (Risk / Non-risk) | Locus | GWAS meta-analysis (N = 409) | ||||
|---|---|---|---|---|---|---|---|
| FASTA: P-value after inflation factor correction (lambda = 1.01) | QTSCORE: P-value after inflation factor correction (lambda = 0.72) | Genome-wide significance (26323 permutations, lambda = 0.72) | Genotypic odds ratios with 95% CI (ORAa; ORAA) | Allelic odds ratios (ORA) | |||
| BICF2S23027935 | A/G | Chr9: 31300189 | 1.5x10-5 | 3.1x10-5 | 0.55 | 0.59 [0.36–0.96]; 0.19 [0.10–0.36] | 0.43 [0.32–0.58] |
| BICF2P742007 | G/A | Chr9: 31387114 | 1.2x10-5 | 3.1x10-5 | 0.54 | 0.49 [0.30–0.81]; 0.17 [0.09–0.31] | 0.39 [0.29–0.53] |
| BICF2G630837240 | A/G | Chr9: 36579921 | 1.7x10-5 | 4.1x10-5 | 0.64 | 2.00 [1.07–3.73]; 4.78 [2.51–9.09] | 2.28 [1.68–3.10] |
| BICF2G630837405 | A/G | Chr9: 36837067 | 3.0x10-6 | 3.4x10-6 | 0.090 | 0.37 [0.22–0.61]; 0.21 [0.12–0.38] | 0.44 [0.33–0.59] |
| BICF2P272135 | G/C | Chr9: 36842872 | 7.6x10-5 | 1.0x10-5 | 0.24 | 0.36 [0.22–0.60]; 0.20 [0.11–0.37] | 0.43 [0.32–0.58] |
| TIGRP2P126345 | G/A | Chr9: 36886621 | 3.1x10-6 | 2.2x10-6 | 0.060 | 0.35 [0.21–0.58]; 0.22 [0.12–0.40] | 0.43 [0.32–0.58] |
*The strand of SNP BICF2P742007 was flipped in the pre-meta-analysis data merge and therefore the alleles are complementary to those in Table 1. All the SNPs in the table showed association with the disorder after both statistical methods (FASTA and QTSCORE) at the P-value threshold of 1.0x10-4.
Adjusted P-values, empirical genome-wide significance levels and odds ratios for the top SNPs after the meta-analysis of mild to severe cases compared with controls.
| SNP name (CanFam3.1) | Alleles (Risk / Non-risk) | Locus | GWAS meta-analysis (N = 524) | |||
|---|---|---|---|---|---|---|
| QTSCORE: P-value after inflation factor correction (lambda = 1.43) | Genome-wide significance (87158 permutations, lambda = 1.43) | Genotypic odds ratios with 95% CI (ORAa; ORAA) | Allelic odds ratios (ORA) | |||
| BICF2S23248027 | G/C | Chr1: 45161186 | 1.3x10-5 | 0.31 | 0.62 [0.37–1.04]; 0.27 [0.16–0.47] | 0.52 [0.41–0.68] |
| BICF2P468585 | C/A | Chr1: 45382633 | 7.5x10-6 | 0.20 | 0.61 [0.36–1.02]; 0.26 [0.16–0.47] | 0.51 [0.40–0.65] |
| BICF2P1037296 | A/C | Chr1: 46268586 | 4.4x10-5 | 0.67 | 0.56 [0.35–0.90]; 0.29 [0.17–0.49] | 0.54 [0.42–0.69] |
| BICF2P357728 | G/A | Chr1: 46279297 | 3.2x10-5 | 0.56 | 0.55 [0.34–0.89]; 0.28 [0.17–0.47] | 0.53 [0.41–0.68] |
| BICF2S23329752 | G/A | Chr1: 87382164 | 6.4x10-5 | 0.79 | 0.32 [0.15–0.66]; 0.21 [0.10–0.43] | 0.52 [0.40–0.68] |
| BICF2S23660342 | G/A | Chr1: 87446951 | 6.2x10-5 | 0.78 | 0.32 [0.15–0.68]; 0.20 [0.09–0.42] | 0.52 [0.40–0.68] |
| BICF2P1129598 | C/A | Chr1: 87749401 | 5.8x10-5 | 0.76 | 0.34 [0.15–0.76]; 0.20 [0.09–0.44] | 0.51 [0.38–0.68] |
| BICF2S23027935 | A/G | Chr9: 31300189 | 5.8x10-5 | 0.76 | 0.74 [0.48–1.11]; 0.30 [0.18–0.49] | 0.54 [0.41–0.68] |
| BICF2P742007 | G/A | Chr9: 31387114 | 6.4x10-5 | 0.79 | 0.67 [0.42–1.02]; 0.32 [0.19–0.50] | 0.53 [0.41–0.68] |
Only the SNPs that showed association with the disorder at least at the P-value threshold of 1.0x10-4 were included.
Protein coding genes near the top SNPs from the genome-wide association analyses.
| SNP ID | Location | Protein coding genes within ± 500 kb from the SNP | Relation to the closest gene(s) |
|---|---|---|---|
| BICF2S23248027 | Chr1: 45161186 | Intronic: | |
| BICF2P468585 | Chr1: 45382633 | SCAF8, | Intergenic: |
| BICF2P1037296 | Chr1: 46268586 | Intergenic: | |
| BICF2P357728 | Chr1: 46279297 | Intergenic: | |
| BICF2S23329752 | Chr1: 87382164 | Intronic: | |
| BICF2S23660342 | Chr1: 87446951 | Intronic: | |
| BICF2P1129598 | Chr1: 87749401 | Intronic: | |
| BICF2S23027935 | Chr9: 31300189 | Intronic: | |
| BICF2P742007 | Chr9: 31387114 | Intergenic: | |
| BICF2G630837240 | Chr9: 36579921 | Intergenic: |
Genes closest to the SNPs are in bold.
Allelic and genotypic odds ratios between phenotype comparisons for the top markers on chromosomes 1 and 9.
| Comparison groups | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Controls (N = 278); mild cases (N = 79) | Mild cases (N = 79); moderate-to-severe cases (N = 168) | |||||||||
| Chr | SNP | Allele (Risk / Non-risk) | P | OR allelic | P | ORAa; ORAA [CI 95%] | P | OR allelic | P | ORAa; ORAA [CI 95%] |
| 1 | BICF2S23248027 | G/C | 0.01 | 0.64 [0.45–0.92] | 0.04 | 0.76 [0.36–1.59]; 0.42 [0.19–0.93] | 0.15 | 0.76 [0.52–1.11] | 0.31 | 0.75 [0.36–1.55]; 0.54 [0.24–1.22] |
| BICF2P468585 | C/A | 0.02 | 0.66 [0.46–0.95] | 0.07 | 0.74 [0.35–1.56]; 0.44 [0.20–0.96] | 0.07 | 0.70 [0.48–1.02] | 0.14 | 0.76 [0.37–1.57]; 0.47 [0.21–1.06] | |
| BICF2P1037296 | A/C | 0.08 | 0.72 [0.50–1.03] | 0.20 | 0.76 [0.38–1.53]; 0.53 [0.25–1.12] | 0.03 | 0.65 [0.44–0.95] | 0.08 | 0.66 [0.33–1.32]; 0.41 [0.19–0.90] | |
| BICF2P357728 | G/A | 0.08 | 0.72 [0.50–1.03] | 0.19 | 0.74 [0.37–1.49]; 0.51 [0.24–1.08] | 0.03 | 0.65 [0.44–0.95] | 0.07 | 0.67 [0.34–1.33]; 0.41 [0.19–0.90] | |
| BICF2S23329752 | G/A | 0.02 | 0.62 [0.42–0.92] | 0.01 | 0.33 [0.13–0.85]; 0.26 [0.10–0.66] | 0.25 | 0.79 [0.53–1.18] | 0.47 | 0.99 [0.42–2.31]; 0.71 [0.31–1.63] | |
| BICF2S23660342 | G/A | 0.01 | 0.61 [0.41–0.90] | 0.01 | 0.34 [0.13–0.91]; 0.26 [0.09–0.66] | 0.28 | 0.80 [0.53–1.20] | 0.56 | 0.88 [0.37–2.11]; 0.68 [0.29–1.61] | |
| BICF2P1129598 | C/A | 0.03 | 0.63 [0.42–0.95] | 0.05 | 0.42 [0.14–1.23]; 0.30 [0.11–0.85] | 0.16 | 0.74 [0.49–1.12] | 0.39 | 0.75 [0.29–1.94]; 0.57 [0.22–1.45] | |
| 9 | BICF2S23027935 | A/G | 0.26 | 0.82 [0.58–1.17] | 0.23 | 1.19 [0.62–2.27]; 0.72 [0.35–1.47] | ||||
| BICF2P742007 | G/A | 0.74 | 0.94 [0.66–1.35] | 0.35 | 1.50 [0.71–3.16]; 1.05 [0.48–2.28] | |||||
| BICF2G630837240 | A/G | 0.72 | 1.07 [0.75–1.52] | 0.81 | 1.22 [0.67–2.22]; 1.13 [0.55–2.32] | |||||
The threshold for significance to correct for multiple testing was determined as 0.05/8 = 6.25x10-3, where the denominator comes from four independent loci multiplied with two phenotypic tests. The significant P-values and corresponding odds ratios are in bold font.
Cross-tabulated frequencies and counts of the most common genotypes of the two top SNPs on chromosome 9 (BICF2S23027935, BICF2P742007) with the hip phenotypes.
| Genotype | Hip phenotype | |||
|---|---|---|---|---|
| Control | Mild | Moderate-to-severe | All cases and controls together | |
| GA | 0.40 (100/253) | 0.32 (21/66) | 0.13 (21/159) | 0.30 (142/478) |
| RR | 0.42 (106/253) | 0.52 (34/66) | 0.48 (76/159) | 0.45 (216/478) |
| AG | 0.19 (47/253) | 0.17 (11/66) | 0.39 (62/159) | 0.25 (120/478) |
| 1.00 (253/253) | 1.00 (66/66) | 1.00 (159/159) | 1.00 (478/478) | |
Variants that segregated between cases and controls categorized by the associated gene and the predicted functional effect.
| Gene name | Functional effect | Comments | ||||||
|---|---|---|---|---|---|---|---|---|
| downstream | nonsynonymous | exonic;splicing | ncRNA_exonic | splicing | upstream | UTR3 | ||
| 0 | 0 | 0 | 0 | 0 | 4 | rs852180586 (E->K, tolerated), no or low H3K27Ac signal in the corresponding human genomic region in chr.17 | ||
| 0 | 0 | 0 | -2 | -12 | 0 | No or low H3K27Ac signal in corresponding human genomic region on chr.17, gap in the human/mouse/dog alignment | ||
| 0 | 0 | 0 | 0 | 0 | 0 | No or low H3K27Ac signal in corresponding human genomic region on chr.17 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | rs24532262 (S->W, SIFT 0.04 deleterious), not in the mature protein | ||
| 0 | 0 | 0 | 7 | 0 | No or low H3K27Ac signal in corresponding human genomic region on chr.17 and chr.11; missense variant (A216P) outside of any known protein domain on ENSCAFP00000037414.1 | |||
| 0 | 0 | 0 | 0 | 0 | Gap in the human /dog alignment, no functional elements in the corresponding region in mouse chr 11; | |||
| -14 | 0 | 0 | 0 | 0 | -5 | Strong H3K27Ac and a nearby DNaseI signal in corresponding human region on chr. 17 | ||
| -19 | 2 | 0 | 7 | 0 | rs852464446, splicing mutation (33059666) close to SD in intron 1–2 in both transcripts; rs850635127, splicing mutation (33006594) close to SA in intron 3–4 in transcript ENSCAFT00000027763.4 and nonsynonymous variant of transcript ENSCAFT00000043743.1 (P->L, tolerated, low confidence) | |||
| -21 | 0 | 0 | 0 | -19 | 3 | No or low H3K27Ac signal in corresponding human genomic region on chr.17. Gaps in the human/mouse/dog alignment | ||
| 0 | -13 | 0 | 0 | 0 | 4 | No or low H3K27Ac signal in corresponding human genomic region on chr.17 | ||
| 0 | 0 | 2 | 0 | Downstream variants: no or low H3K27Ac signal in corresponding human genomic region on chr.17; rs850849831 (H->R, tolerated), rs851033426 (M->T, tolerated), rs850562371 (N->S, tolerated), rs851547031 (W->L, tolerated), rs850850258 (Q->E, tolerated); splicing mutations in the introns | ||||
| 3 | 0 | 0 | 0 | 8 | 0 | rs852311430, no overlap with Ensembl Regulatory features or with Ensembl Motif features | ||
The numbers indicate the difference between the numbers of cases and controls carrying variants of a particular functional effect regardless of the position within the indicated gene. Completely or nearly completely segregating variants are in bold. Negative numbers indicate that the variants were more prevalent among the controls. The various variants are commented in the last column. SD = splice donor. SA = splice acceptor. Single letter IUPAC amino acid coding is used. Functional consequences of coding variants have been estimated with SIFT (sift.jcvi.org) and provided by the Ensembl genome browser (www.ensembl.org). The H3K27Ac chromatin marks are from the ENCODE project (www.encodeproject.org) and provided by UCSC Genome Browser (genome.ucsc.edu).
Fig 4Human NOG enhancer region compared to the related region in the dog genome.
At the top of the image there is a sequence comparison between human and dog. hs = human genomic (GRCh38.p7) region at chr17:56592775–56592815. cf = canine genomic (CanFam3.1) region at chr9:31453829–31453899. d1-d4 = the corresponding sequences of Dog1-Dog4. Below the sequence comparison is first the canine genomic region including the deletion (blue triangle), gap region marked with Ns, and the complete coding sequence for NOG from a Beagle (GenBank: AB544074.1). At the bottom of the image are the corresponding human genomic regions with a TF binding site (orange box) and TFs that can bind to this site as predicted by the ENCODE ChIP-seq experiments. Green boxes are H3K4Me1 histone mark (commonly associated with enhancers) peaks, and the blue box is a H3K4Me3 histone mark (commonly associated with active promoters) peak from the ENCODE data.
Number of upstream NOG deletion genotypes in cases and controls.
| Genotype | Phenotype group | ||||
|---|---|---|---|---|---|
| Mild cases | Moderate-to-severe cases | All cases together | Controls | All cases and controls together | |
| 0.29 (23/79) | 0.60 (98/164) | 0.50 (121/243) | 0.37 (101/273) | 0.43 (222/516) | |
| 0.57 (45/79) | 0.35 (57/164) | 0.42 (102/243) | 0.47 (127/273) | 0.44 (229/516) | |
| 0.14 (11/79) | 0.05 (9/164) | 0.08 (20/243) | 0.16 (45/273) | 0.13 (65/516) | |
| 1.00 (79/79) | 1.00 (164/164) | 1.00 (243/243) | 1.00 (273/273) | 1.00 (516/516) | |
The odds ratios based on logistic regression modeling of the CHD phenotype using the NOG deletion allele and the GA top SNP combination genotype as independent variables.
| Subset | OR [95% confidence interval] | ||
|---|---|---|---|
| Deletion in both alleles | Deletion in one allele | GA SNP genotype | |
| Controls (N = 272); | 0.74 [0.36–1.51] | 0.81 [0.55–1.19] | 0.41 [0.25–0.68] |
| Controls (N = 272); | 0.43 [0.17–1.03] | 0.56 [0.36–0.87] | 0.36 [0.19–0.64] |
| Mild cases (N = 79); | 0.24 [0.07–0.78] | 0.31 [0.17–0.58] | 0.74 [0.33–1.69] |
Comparison of full (with NOG deletion) and reduced (without NOG deletion) generalized linear models on three subsets of dogs.
| Subset | Model | Residual Df | Residual Deviance | Df | Deviance (P) | AUC | Z (P) |
|---|---|---|---|---|---|---|---|
| Controls (N = 272); all cases (N = 243) | Full | 511 | 686.76 | 2 | 1.37 (0.50) | 0.61 | 1.31 (9.5x10-2) |
| Reduced | 513 | 688.13 | 0.60 | ||||
| Controls (N = 272); moderate-to-severe cases (N = 164) | Full | 432 | 538.63 | 2 | 7.88 (1.9x10-2) | 0.66 | 2.71 (3.3x10-3) |
| Reduced | 434 | 546.51 | 0.62 | ||||
| Mild cases (N = 79); moderate-to-severe cases (N = 164) | Full | 239 | 284.68 | 2 | 15.09 (5.3x10-4) | 0.66 | 3.59 (1.6x10-4) |
| Reduced | 241 | 299.77 | 0.57 |
Df = degrees of freedom, P = probability value, Z = Z test statistic, AUC = area under the receiver operating characteristic curve
Fig 5The differential effects of three NOG regulatory alleles in a dual luciferase reporter assay.
Relative luminisence (NanoLuc/firefly luminisence expressed as median value ± standard deviation, three biological replicates with four technical replicates each per every construct). Left: HEK293 cells transfected with 50 ng plasmid DNA + 50 ng carrier DNA. Right: U-2 OS cells transfected with 10 ng plasmid DNA + 10 ng carrier DNA. pNL: empty control vector. A: construct A, B: construct B, C: construct C. *: P<0.05 relative to construct A. The difference between relative luminescence of the control plasmid and each of the constructs was always statistically significant.