| Literature DB >> 30940804 |
Jocelyn Plassais1, Jaemin Kim1, Brian W Davis1,2, Danielle M Karyadi1,3, Andrew N Hogan1, Alex C Harris1, Brennan Decker1,4, Heidi G Parker1, Elaine A Ostrander5.
Abstract
Domestic dog breeds are characterized by an unrivaled diversity of morphologic traits and breed-associated behaviors resulting from human selective pressures. To identify the genetic underpinnings of such traits, we analyze 722 canine whole genome sequences (WGS), documenting over 91 million single nucleotide and small indels, creating a large catalog of genomic variation for a companion animal species. We undertake both selective sweep analyses and genome wide association studies (GWAS) inclusive of over 144 modern breeds, 54 wild canids and a hundred village dogs. Our results identify variants of strong impact associated with 16 phenotypes, including body weight variation which, when combined with existing data, explain greater than 90% of body size variation in dogs. We thus demonstrate that GWAS and selection scans performed with WGS are powerful complementary methods for expanding the utility of companion animal systems for the study of mammalian growth and biology.Entities:
Mesh:
Year: 2019 PMID: 30940804 PMCID: PMC6445083 DOI: 10.1038/s41467-019-09373-w
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Summary of phenotypes used to perform GWAS using the WGS catalog
| Phenotype | Number dogs | Number cases/controls | Number variants | Best | Bonferroni threshold: (−log 10 (0.05/Nb.variants)) | Nb. Variants passing Bonferroni threshold |
|---|---|---|---|---|---|---|
| Canids catalog | 722 | — | 76854926 | — | — | — |
| Kinship | 268 | — | 14489676 | — | — | — |
| Aggressiveness | 63 | — | 13191759 | 1.27E-07 | 0 | |
| Boldness | 65 | 29/36 | 13545961 | 1.01E-10 | 8.42 | 21 |
| Bulky | 257 | 21/239 | 14437654 | 3.79E-57 | 8.43 | 1797 |
| Drop ears | 214 | 113/101 | 14426181 | 7.63E-24 | 8.46 | 1100 |
| Furnishing | 257 | 59/198 | 14387349 | 1.06E-68 | 8.46 | 976 |
| Hairless | 268 | 6/262 | 14489548 | 3.16E-67 | 8.46 | 3908 |
| Height | 255 | — | 14416697 | 6.35E-27 | 8.46 | 1074 |
| Large ears | 213 | 31/182 | 14457478 | 4.91E-41 | 8.46 | 1242 |
| Lengh of fur | 215 | 89/126 | 14352965 | 4.71E-20 | 8.46 | 43 |
| Life span | 242 | — | 14670938 | 1.77E-09 | 8.46 | 4 |
| Long legs | 102 | 22/80 | 13732336 | 6.24E-14 | 8.44 | 569 |
| Muscled | 244 | 52/192 | 14809625 | 2.69E-15 | 8.46 | 1175 |
| Tail curl | 173 | — | 14637750 | 4.47E-11 | 8.47 | 461 |
| Weight | 255 | — | 14416697 | 4.04E-23 | 8.46 | 938 |
| White chest | 195 | 100/195 | 14847812 | 3.75E-19 | 8.47 | 50 |
| White head | 179 | 57/122 | 14386917 | 5.99E-29 | 8.46 | 94 |
Summary of significant associations identified by multiple GWAS using a maximum of 268 modern breed genomes
| Position/region | Gene/locus | Function | Associated phenotype in this study | |
|---|---|---|---|---|
| chr1:42085782-42573240 | 2.31E-11 |
| Major mediator of estrogen action for bone mass/osteoporosis[ |
|
| chr3:55954929-56065637 | 1.62E-10 |
| Human, pig and cattle loan weight QTL[ |
|
| chr3:91269525 | 4.04E-23 |
| Transcription factor - body size[ | Height, weight[ |
| chr4:39182836 | 8.93E-11 |
| Glycoprotein hormone - body size | Height[ |
| chr4:66902902-67093815 | 7.51E-14 |
| Growth hormone - body size | Height[ |
| chr7:26603745-28240043 | 4.47E-11 |
| Gene involved in bones morphology[ |
|
| chr7:43724293-43890274 | 1.26E-10 |
| Transcription factor - body size | Height[ |
| chr9:27659585 | 3.03E-40 |
| Enzyme associated with metabolic syndrome[ |
|
| chr10:8070103 | 7.63E-24 |
| Multi-traits locus in dogs |
|
| chr10:8351907-8488300 | 1.17E-26 |
| Transcription factor - body size | Height[ |
| chr11:14030600 | 9.53E-15 |
| Body mass QTL in pig/body mass index QTL in human[ |
|
| chr11:18621251-18855024 | 2.39E-09 |
| Associated with mechanical function in cartilage[ |
|
| chr12:33803314-35061155 | 4.91E-41 |
| Neurological genes (cognition/potassium channel in cochlea)[ |
|
| chr13:8610419 | 1.06E-68 |
| Fur length and furnishing gene | Furnishing[ |
| chr15:41221438 | 6.35E-27 |
| Growth hormone - body size | Height, weight[ |
| chr17:37651314 | 3.16E-67 |
| Hairless in dogs | Hairless[ |
| chr18:20447435 | 5.95E-16 | Chondrodysplasia | Height[ | |
| chr19:38303408 | 2.17E-13 |
| Meat QTL in cattle[ |
|
| chr20:21786368-21869849 | 5.99E-29 |
| Coat color | White chest, white head[ |
| chr20:26692625 | 1.21E-13 |
| Human adiposity locus[ |
|
| chr24:31856245 | 1.27E-07 |
| Associated with psychotic illness in human patients[ |
|
| chr26:12796099-13004170 | 2.08E-11 |
| Body size locus |
|
| chr29:23802662 | 1.95E-10 |
| Intramuscular fat deposition of beef cattle[ |
|
| chr32:4476417 | 4.71E-20 |
| Length of fur | Length of fur[ |
| chr34:20097018-212633271 | 1.26E-11 | Growth hormone - body size | Weight[ | |
| chrX:82919525 | 3.79E-57 |
| Enzyme associated with body mass | Weight, bulky breed[ |
| chrX:82310627-86057014 | 3.38E-13 |
| Body mass locus | Height, weight[ |
| chrX:101732248-103320770 | 2.69E-15 |
| Body mass locus | Height, weight, muscled breeds[ |
Bold indicates identification in this study, and asterisk denote previously unreported mutations
Region or exact positions are defined by variants passing the Bonferroni correction threshold (8.46)
Fig. 1GWAS results for morphological traits in dogs using the canine 722 genome catalog. Manhattan plots showing statistical significance (−log10 scale) for the 30,000 most associated biallelic variants for each canine autosome, and all variants for the X chromosome (X-axis). a Validation of this WGS-GWAS approach using known examples in dogs: presence or absence of moustache and eyebrows, length of fur, and height as a multigenic trait. b Associations identified using body mass including the bulky phenotype and life span. The red line represents the Bonferroni corrected significance threshold (−log10(P) ≃8.46) and variants passing this threshold are colored in red. Candidate genes identified in this study are in bold
Fig. 2Identification of LCORL mutation in large breeds and comparison with human. a Comparison of genomic sequences between human and the two canine alleles. A single nucleotide insertion is observed in large breeds (>41 kg). b Conservation of the two main LCORL proteins and their predicted functional domain using SIM[68] and LALNVIEW[69]. c Schematic representations of LCORL proteins, highlighting the effect of the canine mutation (STOP codon after amino acid 1221 leads to a loss of 610 aa). The common part shared by all forms is colored in yellow. Source data are provided as a Source Data file
Previously unreported candidate variants identified using the WGS canids catalog
| Associated phenotype | Gene/lncRNA | Definition/function | Locus/position | Best candidate variation(s) | |
|---|---|---|---|---|---|
| Height (long legs) |
| Estrogen receptor 1 | 2.31E-11 | chr1:42085782-42573240 | Intronic SNPs |
| Height, weight |
| Ligand-dependent nuclear receptor corepressor like | 4.04E-23 | chr3:91269525 | Indel in the last exon - lead to a STOP codon |
| Drop ears |
| Mutated lncRNA 29 kb downstream the last exon of | 7.63E-24 | chr10:8070103 | Exonic SNP in one lincRNA |
| Weight |
| Zinc finger protein 608 | 9.53E-15 | chr11:13906259-14081398 | SNPs 200 kb downstream the last exon |
| Weight, bulky |
| R3H domain containing 1 | 2.17E-13 | chr19:38303408 | CpG island - promoter |
| Weight |
| ADAM metallopeptidase with thrombospondin type 1 motif 9 | 1.21E-13 | chr20:26692625 | exonic variant |
| Bulky breed |
| Hepatocyte nuclear factor 4 gamma | 1.95E-10 | chr29:23802662 | 3’UTR SNP |
aMutated transcript
Allele frequencies at 14 markers explain 95% of weight variation in dog population
| Gene/locus | Mean SBW of D/D dogs (in kg) | Position | Modern dog breeds population | Small breeds (<10 kg) | Medium breeds (10 < SBW < 41 kg) | Large breeds (>41 kg) | Wild canid | Village dogs | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Af | beta | Variance | Af | Af | Af | Af | Af | |||
|
| chr3:91269525 | 0.17 | 11.15 | 0.159 | 0.00 | 0.01 | 0.68 | 0.00 | 0.10 | |
|
| chr15:41221438 | 0.44 | −7.72 | 0.131 | 0.85 | 0.35 | 0.18 | 0.00 | 0.19 | |
|
| chr10:8351907 | 0.21 | −8.92 | 0.116 | 0.87 | 0.03 | 0.05 | 0.06 | 0.08 | |
|
| chrX:82919525 | 0.08 | 12.50 | 0.100 | 0.00 | 0.03 | 0.55 | 0.00 | 0.00 | |
|
| chr11:13945821 | 0.02 | 27.33 | 0.118 | 0.00 | 0.00 | 0.16 | 0.00 | 0.00 | |
|
| chr20:26661051 | 0.02 | 21.46 | 0.096 | 0.00 | 0.00 | 0.21 | 0.00 | 0.00 | |
|
| chr19:38303408 | 0.01 | 14.55 | 0.019 | 0.00 | 0.00 | 0.18 | 0.00 | 0.00 | |
|
| chr7:43782633 | 0.24 | −5.95 | 0.057 | 0.57 | 0.16 | 0.05 | 0.01 | 0.03 | |
|
| chr4:67040898 | 0.23 | −4.53 | 0.032 | 0.68 | 0.19 | 0.04 | 0.00 | 0.07 | |
|
| chrX:102292529 | 0.38 | 4.63 | 0.045 | 0.19 | 0.34 | 0.96 | 0.17 | 0.59 | |
|
| chr4:39182836 | 0.18 | −5.29 | 0.036 | 0.47 | 0.10 | 0.11 | 0.00 | 0.05 | |
|
| chr29:23802662 | 0.01 | 9.97 | 0.009 | 0.00 | 0.00 | 0.20 | 0.00 | 0.01 | |
|
| chr3:41849479 | 0.04 | −7.70 | 0.018 | 0.13 | 0.01 | 0.00 | 0.00 | 0.01 | |
|
| chr4:67040939 | 0.01 | −10.17 | 0.013 | 0.06 | 0.00 | 0.00 | 0.00 | 0.00 | |
| Af: allele frequency for the derived allele | Total | 0.95 | ||||||||
Beta: estimated SNP effect (regression coefficient)
Derived allele is the allele absent (or present with a low frequency) into the wild canid population. The genetic variance attributable to each variant was estimated as V = 2Af(1 − Af) × beta2/variance(SBW)
Fig. 3Body mass and longevity analyses using 746 dogs genotyped on 170k SNP markers. a Manhattan plot of the multivariate GWAS for standard breed weight (SBW) and life span corrected by sex, using 746 dogs genotyped on Illumina HD SNP array[11]. The −log10 P values for each SNP are plotted on the y-axis versus each canine autosome and the X-chromosome on the x-axis. The red line represents the Bonferroni corrected significance threshold (−log10(P) = 6.48) and SNPs passing this threshold are colored in red. b Negative correlation between SBW and longevity. In blue, large breed outliers: Anatolian Shepherd Dogs (52.2 kg; 13 years) and Tibetan Mastiff (70.3 kg; 13.5 years) c SBW and longevity (y-axis) of each breed (without outliers) are plotted by genotype at each marker (x-axis). The homozygous D/D alleles have generally a stronger effect on the distribution of SBWs (or longevity) for a given genotype/marker combination (the median and first and third quartiles are indicated by the box-plots). Statistics for each genotype/marker combination are summarized in (d). P values estimated by Mann–Whitney–Wilcoxon tests (*P < 0.05; **P < 0.01; ***P < 0.001). SBWs and longevity of genotype classes are reported as mean ± SD. Source data are provided as a Source Data file
Fig. 4ESR1 and the long leg phenotype in dogs. a Manhattan plots showing statistical significance (−log10 scale) for the 30,000 most associated biallelic variants for each canine autosome, and all variants for the X chromosome (X-axis) for the long-leg phenotype observed in Sighthounds, Great Dane, and Great Pyrenees. We distinguish four peaks: one peak pinpointing ESR1 gene on chromosome 1, one locus on CFA9 without any candidate genes in the interval, and IGF1 (CFA15) and IRS4 (CFAX) previously associated with height variation in dogs. Images to the left are Great Dane (top) and Greyhound (bottom). b UCSC genome browser showing the ESR1 locus in dog (top) and human (bottom). Vertical bars correspond to the most associated variants identified with the 722 genomes (in red), and the 855 dogs genotyped on 170k SNP array (in brown), and horizontal bars represent the homozygous haplotype observed. The bottom panel represents the human orthologous locus with tracks corresponding to the H3K4me1 and H3K27ac chromatin signals annotated by the ENCODE project[55]. c Expression level of ESR1 in a panel of 20 breeds, showing high expression in the Sighthounds, Irish Wolfhound and Whippet, in comparison to six different breeds with average leg length. Y-axis represents the relative normalized expression. d XP-CLR plot on ESR1 locus comparing Sighthounds (long legs breeds) with normal-sized legs breeds. We detected a significant selection signature located on ESR1 locus (in grey). Horizontal lines represent the empirical top 1% of genomic regions. Source data are provided as a Source Data file
Fig. 5Ear morphology in dogs. a Manhattan plots showing one significant signal on the CFA10 for the drops ears phenotype and another one on chromosome 12 for the large and round ears. b Characteristic breeds representing four different ear shapes observed in dogs: Normal (1,3), large and round (2,4), prick (1,2) or drop (3,4). c UCSC genome browser showing the position on the canine genome (Canfam3.1) of the mutated lincRNA (in red) associated with the drop ears. d Combination of alleles at both loci create four phenotypes. Plus (+) and minus signs (−) indicate the presence or absence of variant (non-ancestral) genotype
Summary of XP-CLR and XP-EHH analyses between domestic dog breeds
| Analysis | Gene | XP-CLR | XP-EHH ( |
|---|---|---|---|
| Long legs (Sighthounds) |
| 109.0b | 1.16 (0.005) |
| Bulky | 7.3c | 0.94 (0.012) | |
| 22.5b | 1.02 (0.0054) | ||
|
| NS | 1.02 (0.0056) | |
|
| 4.7c | NS | |
|
| 3.3c | NS | |
|
| NS | NS | |
| CFAX-locus 1 | 197.9b | 1.66b | |
| CFAX-locus 2 | 110.1b | 2.18b | |
| Height/weight (small breeds) |
| 7.19c | 1.99 (0.000039) |
|
| 55.6b | 1.17 (0.0025) | |
|
| 5.0c | 0.84 (0.04) | |
|
| 173.7b | NS | |
|
| 169.3b | NS | |
| Height/weight (large breeds) |
| 14.6b | 1.91 (0.0002) |
| 21.3b | 1.37 (0.0025) | ||
| Drop ears |
| 224.3b | 1.49 (0.00025) |
| Large ears |
| 452.6b | 1.78 (0.00012) |
X chromosome was separated from autosomes to define the empirical top 1% of regions for both XP-CLR and XP-EHH. P values were not assigned for genes in the X chromosome (ACSL4, CFAX1, CFAX2)
NS non significant
aRank-based empirical P value
bSignificant under 1% threshold
cSignificant under 5% threshold