| Literature DB >> 33793571 |
Benoît Hédan1, Édouard Cadieu1, Maud Rimbault1, Amaury Vaysse1, Caroline Dufaure de Citres2, Patrick Devauchelle3, Nadine Botherel1, Jérôme Abadie4, Pascale Quignon1, Thomas Derrien1, Catherine André1.
Abstract
Histiocytic sarcoma (HS) is a rare but aggressive cancer in both humans and dogs. The spontaneous canine model, which has clinical, epidemiological, and histological similarities with human HS and specific breed predispositions, provides a unique opportunity to unravel the genetic basis of this cancer. In this study, we aimed to identify germline risk factors associated with the development of HS in canine-predisposed breeds. We used a methodology that combined several genome-wide association studies in a multi-breed and multi-cancer approach as well as targeted next-generation sequencing, and imputation We combined several dog breeds (Bernese mountain dogs, Rottweilers, flat-coated retrievers, and golden retrievers), and three hematopoietic cancers (HS, lymphoma, and mast cell tumor). Results showed that we not only refined the previously identified HS risk CDKN2A locus, but also identified new loci on canine chromosomes 2, 5, 14, and 20. Capture and targeted sequencing of specific loci suggested the existence of regulatory variants in non-coding regions and methylation mechanisms linked to risk haplotypes, which lead to strong cancer predisposition in specific dog breeds. We also showed that these canine cancer predisposing loci appeared to be due to the additive effect of several risk haplotypes involved in other hematopoietic cancers such as lymphoma or mast cell tumors as well. This illustrates the pleiotropic nature of these canine cancer loci as observed in human oncology, thereby reinforcing the interest of predisposed dog breeds to study cancer initiation and progression.Entities:
Year: 2021 PMID: 33793571 PMCID: PMC8016107 DOI: 10.1371/journal.pgen.1009395
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Characteristics of the genome-wide association studies (GWAS) analyses performed in this study.
* Dogs genotyped on the Affymetrix Axiom Canine Genotyping Array were also available in the Illumina 173K SNV Canine HD array format.
| GWAS Name | Paragrah | cancers | breeds | SNP arrays format | Number of cases after QC | Number of controls after QC |
|---|---|---|---|---|---|---|
| GWAS_1_HS_BMD | §2.1 | HS | BMD | Illumina 173k SNV Canine HD array | 172 | 128 |
| GWAS_2_HS+lymphoma_BMD | §2.2a | HS and lymphoma | BMD | Illumina 173k SNV Canine HD array | 252 | 128 |
| GWAS_3_HS+lymphoma_BMD+golden_retriever | §2.2a | HS and lymphoma | BMD, golden retriever | Illumina 173k SNV Canine HD array | 293 | 300 |
| GWAS_4_HS+MCT_BMD | §2.2b | HS and mast cell tumor | BMD | Illumina 173k SNV Canine HD array | 216 | 128 |
| GWAS_5_HS+MCT_BMD+golden_retriever | §2.2b | HS and mast cell tumor | BMD, golden retriever | Illumina 173k SNV Canine HD array | 285 | 202 |
| GWAS_6_HS_BMD_with_imputed_SNV | §2.3 | HS | BMD | Affymetrix Axiom Canine Genotyping array 1.1M SV (n = 113)* | 403 | 347 |
| GWAS_7_HS_BMD+FCR_with_imputed_SNV | §2.3 | HS | BMD, FCR | Affymetrix Axiom Canine Genotyping array 1.1M SV (n = 134)* | 416 | 362 |
| GWAS_8_HS_BMD+FCR+Rottweiler_with_imputed_SNV | §2.3 | HS | BMD, FCR,Rottweiler | Affymetrix Axiom Canine Genotyping array 1.1M SV (n = 134)* | 453 | 385 |
Fig 1Results of genome-wide association studies (GWAS) on Bernese mountain dogs (BMD) with 172 histiocytic sarcoma (HS) cases and 128 controls (GWAS_1_HS_BMD).
A) Quantile-quantile plot displaying a genomic inflation λ of 1.000005, indicating no residual inflation. B) Manhattan plot displaying the statistical results from the GWAS. This analysis pointed out two loci (arrows) on chromosome 11 (CFA11:41161441, pcorrected = 3.11 × 10−7) and on chromosome 20 (CFA20:30922308, pcorrected = 3.73 × 10−5).
Fig 2Close up view of the CFA5 locus.
A) Manhattan plot of the CFA5 20–40 Mb region highlighting the best p-values obtained in the three genome-wide association studies (GWAS): Bernese mountain dog (BMD) GWAS for histiocytic sarcoma (HS) with 172 cases vs. 128 controls (GWAS_1_HS_BMD); BMD GWAS for HS and lymphoma with 252 cases vs. 128 controls (GWAS_2_HS+lymphoma_BMD); meta-analysis combining the BMD GWAS of HS and lymphoma (252 cases vs. 128 controls) and golden retriever GWAS for lymphoma (41 cases vs. 172 controls) from Tonomura et al. [16] (GWAS_3_HS+lymphoma_BMD+golden_retriever). The R2 in cases from the top single nucleotide variation (SNV) is depicted to show the linkage-disequilibrium (LD) structure. B) Regions delimitated by the SNPs in LD with the best GWAS SNVs (R2 > 0.6) in cases; minimal region between the three GWAS (CFA5:28309815–34321500) is delimitated by red lines. C) Close up view of the genes (with available symbols) located in this minimal region (28–34 Mb) of CFA5.
Fig 3Close up view of the CFA20 locus.
A) Manhattan plot of the CFA20 20–45 Mb region highlighting the best p-values obtained in the three genome-wide association study (GWAS): Bernese Mountain dogs (BMD) GWAS for histiocytic sarcoma (HS) with 172 cases vs. 128 controls (GWAS_1_HS_BMD); BMD GWAS for HS and mast cell tumor with 216 cases vs. 128 controls (GWAS_4_HS+MCT_BMD); meta-analysis combining BMD GWAS for HS and mast cell tumor with European golden retriever (69 cases vs. 74 controls) from Arendt et al. [15] (GWAS_5_HS+MCT_BMD+golden_retriever). The R2 in cases from top single nucleotide variation (SNVs) show the linkage disequilibrium (LD) structure. B) Regions delimitated by SNVs in LD with the best GWAS SNVs (R2 > 0.6) in cases; minimal region between the three GWAS (CFA20:31036863–32778949) is delimitated by red lines. C) Close up view of the genes (with available symbols) located in this minimal region (31–33 Mb) of CFA20.
Significant loci identified by genome-wide association studies (GWAS) after imputation on high-density single nucleotide variation (SNV) array.
Number of associated SNVs with the best SNV and the corresponding corrected p-value are presented for each locus and each GWAS.
| GWAS | ||||
|---|---|---|---|---|
| Chromosome | BMD | BMD+FCR | BMD+FCR+Rott | |
| 2 | Number of associated SNV | 1 | 4 | 7 |
| Localisation of the best SNV | 29716535 | 29716535 | 29716535 | |
| Region delimited by significant associated SNVs | 29716535 | 29653137–29978776 | 29507029–34223001 | |
| Region delimited by significant associated SNVs and SNVs in LD in cases (R2≥0.6) | 29154373–30121047 | 29154373–30121047 | 29385904–35075340 | |
| 5 | Number of associated SNV | 292 | 320 | 322 |
| Localisation of the best SNV | 30496048 | 33823740 | 33823740 | |
| Region delimited by significant associated SNVs | 25628485–34513401 | 25522718–34477045 | 25566642–34477045 | |
| Region delimited by significant associated SNVs and SNVs in LD in cases (R2≥0.6) | 25402068–37781406 | 25402068–34513401 | 25517580–34513401 | |
| 11 | Number of associated SNV | 1145 | 984 | 930 |
| Localisation of the best SNV | 41215628 | 41252822 | 41252822 | |
| Region delimited by significant associated SNVs | 29934486–52418087 | 29978631–52418087 | 29978631–52418087 | |
| Region delimited by significant associated SNVs and SNVs in LD in cases (R2≥0.6) | 29047449–52471659 | 29315057–52471659 | 29341449–52418087 | |
| 14 | Number of associated SNV | 292 | 250 | 246 |
| Localisation of the best SNVs | 6567456 | 6567456 | 6566022 | |
| 10231328 | 10665001 | 11021670 | ||
| Region delimited by significant associated SNVs | 561549–11111293 | 561549–11111293 | 561549–11111293 | |
| Region delimited by significant associated SNVs and SNVs in LD in cases (R2≥0.6) | 475090–11638599 | 475090–11379670 | 475090–11379670 | |
Fig 4Genome-wide association studies (GWAS) of Bernese mountain dogs (BMD) and other predisposed breeds on histiocytic sarcoma (HS) with the imputation of single nucleotide variations (SNV) on a higher density SNV array.
A–B) BMD GWAS results based on 403 cases and 347 controls (GWAS_6_HS_BMD_with_imputed_SNV). A) Quantile-quantile plot displaying a genomic inflation λ of 1.000023, indicating no residual inflation. B) Manhattan plot displaying the statistical results from the GWAS. This analysis shows four loci (arrows) on chromosome 2 (best SNV at CFA2:29716535, pcorrected = 3.25 × 10−4), chromosome 5 (best SNV at Chr5:30496048, pcorrected = 8.22 × 10−6), chromosome 11 (best SNV at Chr11:41215628, pcorrected = 1.45 × 10−13), and chromosome 14 (CFA14:6567456, pcorrected = 4.04 × 10−6). C–D. GWAS results for HS combining BMDs (403 cases vs. 347 controls) and flat-coated retrievers (FCRs; 13 cases vs. 15 controls; GWAS_7_HS_BMD+FCR_with_imputed_SNV). C) Quantile-quantile plot displaying a genomic inflation λ of 1.000018, indicating no residual inflation. D) Manhattan plot displaying the statistical results from the GWAS. This analysis shows four loci (arrows) on chromosome 2 (best SNV at CFA2:29716535, pcorrected = 6.02 × 10−5), chromosome 5 (best SNV at CFA5:33823740, pcorrected = 2.52 × 10−6), chromosome 11 (best SNV at CFA11:41252822, pcorrected = 1.49 × 10−13), and chromosome 14 (CFA14:6567456, pcorrected = 1.37 × 10−6). E–F) GWAS results for HS combining BMDs (403 cases vs. 347 controls), FCRs (13 cases vs. 15 controls), and Rottweilers (37 cases vs. 23 controls; GWAS_8_HS_BMD+FCR+Rottweiler_with_imputed_SNV). E) Quantile-quantile plot displaying a genomic inflation λ of 1.000013, indicating no residual inflation. F) Manhattan plot displaying the statistical results from the GWAS. This analysis shows four loci (arrows) on chromosome 2 (best SNV at CFA2:29716535, pcorrected = 3.58 × 10−5), chromosome 5 (best SNV at CFA5:33823740, pcorrected = 2.4 × 10−6), chromosome 11 (best SNV at CFA11:41252822, pcorrected = 2.04 × 10−14), and chromosome 14 (best SNV at CFA14:6566022, pcorrected = 1.09 × 10−6). G) Close up view of the CFA11 locus highlighting the best p-values obtained in the three GWAS: BMDs GWAS (GWAS_6_HS_BMD_with_imputed_SNV), BMDs plus FCRs GWAS (GWAS_7_HS_BMD+FCR_with_imputed_SNV), and BMDs plus Rottweilers and FCRs GWAS (GWAS_8_HS_BMD+FCR+Rottweiler_with_imputed_SNV). R2 in cases from top SNV is depicted to show the linkage disequilibrium (LD) structure. H) Regions delimitated by SNVs in LD with the best GWAS SNVs (R2 > 0.6) in cases, minimal region between the three GWAS (CFA11: 38435917–41701130) is delimitated by red lines. I) Close up view of the genes (with available symbols) located in this minimal region (38–42 Mb).
Association analysis of haplotypes of CFA11, CFA5, and CFA14 loci with the phenotype in predisposed breeds.
The CFA11:41 Mb haplotype was determined by the genotype of the following SNVs: 41144469, 41161357, 41161441*, 41163558, 41166847, 41176819*, 41185205, 41192676, 41196587*, 41200012, 41204074, 41215628, 41217026*, 41218376, and 41252822. The CFA11:44 Mb haplotype was determined by the genotype of the following SNVs: 43263273, 43272660, 43290639, 43317981, 43690075*, 44070751, 44097553*, 44098601*, 44099662, 44121253, 44126921, 44129025, 44135605, 44150645*, 44152497, 44364615, 44366678, and 44367106*. The CFA5:33 Mb haplotype was determined by the genotype of the following SNVs: 33650581, 33788841*, 33801240*, 33807500*, 33823740*, 33839057, 33872506*, 33879286, 33883533, 33883991, 33897576, 33900059, 33906211, 33916864, 34213158, 34217528, 34220300, 34225552, 34225906, 34227643, 34231901*, 34232173, 34234461, 34239783*, 34240989, 34246822, and 34321500. The CFA14:11 Mb haplotype was determined by the genotype of the following SNVs: 11021670, 11023871, 11026224, 11028349, 11030215, 11033259, 11041572, 11067136, 11070626, 11077346, 11083217, 11092391, 11094795*, and 11108670. At risk haplotype in the breeds are represented in bold. CI: confidence interval (Woolf Method). * SNVs from the 173K SNV Canine HD.
| Haplotype frequencies in the 3 breeds | Odds Ratio | ||||||
|---|---|---|---|---|---|---|---|
| haplotype | affected | unaffected | Odds Ratio | confidence interval (Woolf Method) | pval | ||
| Locus: CFA11:41Mb | BMD | 0,79 | 0,55 | 3.03 | [2.42–3.79] | 7.03x10-23 | |
| {A/G}GCCCGGGGAAGTGC | 0,21 | 0,45 | 0.32 | [0.26–0.4] | 2.56x10-23 | ||
| others | 0,00 | 0,00 | 1.73 | [0.32–9.47 | 0.69 | ||
| Rottweiler | 0,76 | 0,54 | 2.61 | [1.19–5.73] | 0.0175 | ||
| AGCCCGGGGAAGTGC | 0,22 | 0,41 | 0.39 | [0.17–0.87] | 0.024 | ||
| others | 0,03 | 0,04 | 0.61 | [0.08–4.49] | 0.63 | ||
| FCR | GTTTAAAAAGCCCAT | 0,54 | 0,57 | 0.89 | [0.31–2.56] | 0.83 | |
| 0,38 | 0,2 | 2.5 | [0.76–8.25] | 0.12 | |||
| {A/G}GCCCGGGGAAGTAC | 0,08 | 0,23 | 0.27 | [0.05–1.44] | 0.15 | ||
| Locus: CFA11:44Mb | BMD | 0,65 | 0,44 | 2.41 | [1.96–2.97] | 7.27x10-17 | |
| TCAGTTATTGTAATCGTC | 0,12 | 0,25 | 0.41 | [0.31–0.54] | 4.91x10-11 | ||
| TCAGTTATTGCGGTCGTC | 0,09 | 0,12 | 0.68 | [0.49–0.95] | 0.026 | ||
| TCAGCAATTGCGGTCGTC | 0,09 | 0,11 | 0.81 | [0.58–1.13] | 0.23 | ||
| others | 0,05 | 0,08 | 0.62 | [0.4–0.95] | 0.03 | ||
| Rottweiler | 0,82 | 0,57 | 3.61 | [1.57–8.33] | 0.003 | ||
| ATGACTATTGTAATCGTC | 0,07 | 0,17 | 0.34 | [0.1–1.11] | 0.078 | ||
| ATGATAATAGCGGCAACT | 0,05 | 0,15 | 0.32 | [0.09–1.16] | 0.1 | ||
| others | 0,05 | 0,11 | 0.47 | [0.12–1.85] | 0.30 | ||
| FCR | 0,35 | 0,17 | 2.65 | [0.76–9.29] | 0.137 | ||
| TCAGCAGGAACGGCAATC | 0,54 | 0,70 | 0.5 | [0.17–1.5] | 0.21 | ||
| others | 0,12 | 0,13 | 0.85 | [0.17–4.2] | 1 | ||
| 1,00 | 1,00 | ||||||
| all breeds | 0,66 | 0,44 | 2.45 | [2.01–2.99] | 3.2x10-19 | ||
| others | 0,34 | 0,56 | 0.41 | [0.34–0.5] | |||
| Locus: CFA5:33Mb | BMD | 0,75 | 0,59 | 2.05 | [1.65–2.55] | 1.01x10-10 | |
| ACACCCGGGTTGACAGATTAACGACCT | 0,15 | 0,25 | 0.53 | [0.41–0.69] | 1.87x10-6 | ||
| ATTTCTGGGTTGACAGATTAACGACCT | 0,08 | 0,13 | 0.59 | [0.42–0.82] | 0.0016 | ||
| others | 0,02 | 0,03 | 0.7 | [0.37–1.32] | 0.69 | ||
| Rottweiler | 0,69 | 0,63 | 1.3 | [0.6–2.82] | 0.55 | ||
| CCACCCACACAAGTGTCCCGATAGATT | 0,12 | 0,17 | 0.66 | [0.23–1.85] | 0.66 | ||
| others | 0,19 | 0,20 | 0.96 | [0.38–2.44] | 0.96 | ||
| FCR | 0,69 | 0,33 | 4.5 | [1.46–13.89] | 0.015 | ||
| CCTTCTACACAAGTAGACCGGTAACCT | 0,04 | 0,23 | 0.13 | [0.01–1.14] | 0.056 | ||
| CTTTCTGGGTTGACGTATTGACAGATT | 0,12 | 0,20 | 0.52 | [0.12–2.33] | 0.48 | ||
| others | 0,15 | 0,23 | 0.6 | [0.15–2.34] | 0.51 | ||
| all breeds | 0,75 | 0,59 | 2.05 | [1.67–2.52] | 6.58X10-12 | ||
| others | 0,25 | 0,41 | 0.49 | [0.4–0.6] | |||
| Locus: CFA14:11Mb | BMD | 0,74 | 0,61 | 1.79 | [1.44–2.23] | 1.86x10-7 | |
| ACCAACGCACGCAT | 0,17 | 0,19 | 0.85 | [0.65–1.1] | 0.25 | ||
| GCCAACGTATAAGT | 0,05 | 0,15 | 0.34 | [0.23–0.49] | 2.65x10-9 | ||
| ATAGGAACCCGCAT | 0,04 | 0,04 | 0.8 | [0.48–1.34] | 0.43 | ||
| others | 0,01 | 0,01 | 0.64 | [0.22–1.85] | 0.43 | ||
| Rottweiler | 0,57 | 0,43 | 1.77 | [0.85–3.69] | 0.14 | ||
| ATAGGAACCCGCAT | 0,35 | 0,41 | 0.77 | [0.36–1.64] | 0.56 | ||
| GCCAACGTATAAGT | 0,07 | 0,11 | 0.59 | [0.16–2.16] | 0.59 | ||
| others | 0,01 | 0,04 | 0.2 | [0.02–1.98] | 0.15 | ||
| FCR | 0,46 | 0,20 | 3.43 | [1.05–11.17] | 0.036 | ||
| GCCAACGTATAAGT | 0,23 | 0,53 | 0.26 | [0.08–0.83] | 0.02 | ||
| ATAGGAACCCGCAC | 0,23 | 0,23 | 0.99 | [0.29–3.44] | 0.98 | ||
| others | 0,08 | 0,03 | 2.42 | [0.21–28.34] | 0.59 | ||
| all breeds | GCCAACGTATAAGT | 0,06 | 0,16 | 0.34 | [0.24–0.47] | 9.08x10-11 | |
| others | 0,94 | 0,84 | 2.92 | [2.09–4.08] | |||
Association of CFA5, CFA11, and CFA14 with the phenotype in predisposed breeds.
The number of risk alleles was determined with the genotype of the following SNVs: CFA5:30496048, CFA11:41252822, and CFA14:11021670. CI: confidence interval (Woolf Method).
| frequencies of risk alleles in the 3 breeds | Odds Ratio | |||||
|---|---|---|---|---|---|---|
| number of risk alleles | affected | unaffected | Odds Ratio | confidence interval (Woolf Method) | pval | |
| BMD | ≥5 risk alleles | 0,72 | 0,31 | 5.67 | [4.14–7.76] | 1.44x10-29 |
| 4 risk alleles | 0,23 | 0,4 | 0.46 | [0.34–0.63] | 1.779x10-6 | |
| ≤ 3 risk alleles | 0,05 | 0,29 | 0.12 | [0.07–0.2] | 1.34x10—20 | |
| Rottweiler | ≥5 risk alleles | 0,74 | 0,43 | 3.51 | [1.17–10.53] | 0.03 |
| 4 risk alleles | 0,18 | 0,26 | 0.66 | [0.19–2.29] | 0.53 | |
| ≤3 risk alleles | 0,08 | 0,3 | 0.2 | [0.05–0.88] | 0.034 | |
| FCR | 4 risk alleles | 0,38 | 0 | NA | NA | 0.013 |
| 3 risk alleles | 0,54 | 0,53 | 1.02 | [0.23–4.52] | 1 | |
| ≤2risk alleles | 0,08 | 0,47 | 0.1 | [0.01–0.98] | 0.037 | |
| all breeds | ≥5 risk alleles | 0,70 | 0,31 | 5.27 | [3.92–7.08] | 1.52X10-30 |
| 4 risk alleles | 0,23 | 0,37 | 0.51 | [0.38–0.69] | 1.45x10-5 | |
| ≤3 risk alleles | 0,07 | 0,32 | 0.15 | [0.1–0.23] | 8.1X10-22 | |