| Literature DB >> 26588071 |
Maja L Arendt1,2, Malin Melin1, Noriko Tonomura3,4, Michele Koltookian3, Celine Courtay-Cahen5, Netty Flindall5, Joyce Bass5, Kim Boerkamp6, Katherine Megquir1,3,4, Lisa Youell5, Sue Murphy5, Colleen McCarthy3, Cheryl London7, Gerard R Rutteman6,8, Mike Starkey5, Kerstin Lindblad-Toh1,3.
Abstract
Canine mast cell tumours (CMCT) are one of the most common skin tumours in dogs with a major impact on canine health. Certain breeds have a higher risk of developing mast cell tumours, suggesting that underlying predisposing germ-line genetic factors play a role in the development of this disease. The genetic risk factors are largely unknown, although somatic mutations in the oncogene C-KIT have been detected in a proportion of CMCT, making CMCT a comparative model for mastocytosis in humans where C-KIT mutations are frequent. We have performed a genome wide association study in golden retrievers from two continents and identified separate regions in the genome associated with risk of CMCT in the two populations. Sequence capture of associated regions and subsequent fine mapping in a larger cohort of dogs identified a SNP associated with development of CMCT in the GNAI2 gene (p = 2.2x10-16), introducing an alternative splice form of this gene resulting in a truncated protein. In addition, disease associated haplotypes harbouring the hyaluronidase genes HYAL1, HYAL2 and HYAL3 on cfa20 and HYAL4, SPAM1 and HYALP1 on cfa14 were identified as separate risk factors in European and US golden retrievers, respectively, suggesting that turnover of hyaluronan plays an important role in the development of CMCT.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26588071 PMCID: PMC4654484 DOI: 10.1371/journal.pgen.1005647
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Multiple dimensional scaling plot visualising the genetic similarity between the individuals within the population using the two first principal components as calculated in PLINK.
a) American and European data combined b) American data c) European data.
Fig 2Manhattan plot showing the–log 10 p-values in relation to the chromosomes.
a) American data b) European data c) American and European data combined. Q-Q plots showing the expected p-value in relation to the observed p-value for each GWAS analysis. Shaded area indicates 95% confidence interval. Stippled line marks nominal significance threshold.
Fig 3a) Close up view of the most associated region from the American GWAS analysis. b) Minor allele frequency of the associated region. c) Further close up of the associated peak showing the LD structure of the SNPs in the area relative to the most associated SNP. d) Close up view of the genes located in the area of the associated haplotype.
Red arrow indicates the location of the most associated SNP.
Top-ranking GWAS SNPs from European and United States data.
Table showing the 5 most associated GWAS SNPs from the European and American analysis.
| SNP | Chr | Position | Allele |
|
|
|
| OR | AFa | AFu | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| European (Netherlands and United Kingdom) | BICF2S22934685 | 20 | 42,547,825 | C/T | 0.72 | 1.4×10−6 | 0.0042 | 0.034 | 6.2 | 0.91 | 0.64 |
| BICF2P1444805 | 20 | 42,957,449 | A/G | 0.90 | 2.6×10−6 | 0.0067 | 0.078 | 6.5 | 0.94 | 0.70 | |
| BICF2P299292 | 20 | 48,377,580 | A/C | 0.45 | 1.2×10−6 | 2.2×10−5 | 0.085 | 4.0 | 0.65 | 0.31 | |
| BICF2P301921 | 20 | 48,599,799 | A/C | 0.58 | 4.3×10−7 | 2.2×10−5 | 0.022 | 4.1 | 0.65 | 0.31 | |
| BICF2P623297 | 20 | 49,201,505 | G/A | 0.54 | 8.1×10−7 | 2.0×10−5 | 0.037 | 4.2 | 0.63 | 0.29 | |
| United States | BICF2P867665 | 14 | 14,714,009 | G/T | 3.2×10−7 | 0.27 | 1.2×10−5 | 0.030 | 5.2 | 0.86 | 0.53 |
| BICF2G630521572 | 14 | 14,670,361 | T/C | 1.3×10−6 | 0.09 | 1.5×10−6 | 0.099 | 4.2 | 0.77 | 0.44 | |
| BICF2G630521696 | 14 | 14,756,089 | G/A | 1.3×10−6 | 0.09 | 1.5×10−6 | 0.099 | 4.2 | 0.77 | 0.44 | |
| TIGRP2P186605 | 14 | 14,727,905 | G/A | 2.6×10−6 | 0.09 | 2.1×10−6 | 0.17 | 4.0 | 0.76 | 0.44 | |
| BICF2G630521678 | 14 | 14,740,313 | A/G | 2.6×10−6 | 0.09 | 2.1×10−6 | 0.17 | 4.0 | 0.76 | 0.44 |
P US = p-values for the American analysis, P EU = p-values for the European analysis, P-comb = p-value for the combined European and American analysis, Pperm 10,000 = permutated p-value 10,000 permutations, OR = odds ratio, AFa = allele frequency of risk allele in affected individuals in respective populations, AFu = allele frequency of risk allele in control individuals in respective populations.
Fig 4a) Close up view of the most associated region from the European GWAS analysis. b) Minor allele frequency of the associated region. c) Further close up of the associated region showing the LD structure of the SNPs in the 48 MB locus relative to the most associated SNP in the locus. d) Further close up of the associated peak showing the LD structure of the SNPs in the 42 MB locus relative to the most associated SNP in the locus. e) Close up view of the genes located on the associated haplotype in the 42MB region.
Red arrow indicates the location of the most associated SNP in the region.
Fig 5PCR products confirming the alternative splice site in the individuals carrying the A allele on CFA20: 42080147.
PCR was performed using primers specific for the alternative splicing, excluding exon 3.