| Literature DB >> 26061395 |
Jennifer L Johnson1, Helena Wittgenstein2, Sharon E Mitchell3, Katie E Hyma3, Svetlana V Temnykh2, Anastasiya V Kharlamova4, Rimma G Gulevich4, Anastasiya V Vladimirova4, Hiu Wa Flora Fong1, Gregory M Acland2, Lyudmila N Trut4, Anna V Kukekova1.
Abstract
The silver fox (Vulpes vulpes) offers a novel model for studying the genetics of social behavior and animal domestication. Selection of foxes, separately, for tame and for aggressive behavior has yielded two strains with markedly different, genetically determined, behavioral phenotypes. Tame strain foxes are eager to establish human contact while foxes from the aggressive strain are aggressive and difficult to handle. These strains have been maintained as separate outbred lines for over 40 generations but their genetic structure has not been previously investigated. We applied a genotyping-by-sequencing (GBS) approach to provide insights into the genetic composition of these fox populations. Sequence analysis of EcoT22I genomic libraries of tame and aggressive foxes identified 48,294 high quality SNPs. Population structure analysis revealed genetic divergence between the two strains and more diversity in the aggressive strain than in the tame one. Significant differences in allele frequency between the strains were identified for 68 SNPs. Three of these SNPs were located on fox chromosome 14 within an interval of a previously identified behavioral QTL, further supporting the importance of this region for behavior. The GBS SNP data confirmed that significant genetic diversity has been preserved in both fox populations despite many years of selective breeding. Analysis of SNP allele frequencies in the two populations identified several regions of genetic divergence between the tame and aggressive foxes, some of which may represent targets of selection for behavior. The GBS protocol used in this study significantly expanded genomic resources for the fox, and can be adapted for SNP discovery and genotyping in other canid species.Entities:
Mesh:
Year: 2015 PMID: 26061395 PMCID: PMC4465646 DOI: 10.1371/journal.pone.0127013
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Principal component analysis.
Principal component analysis of SNP data for 20 tame and 20 aggressive foxes. 8,437 SNPs with genotypes available for all individuals were used in this analysis. Aggressive individuals are represented by red dots, tame individuals are represented by green triangles. PC1 is plotted on the x-axis, PC2 is plotted on the y-axis.
Fig 2Estimation of population structure using STRUCTURE.
Cluster analysis of fox genotypes was performed at four values of K (2, 3, 4, and 5) without population information. The numbers of assumed clusters are indicated on the y-axis. The population origin of individuals is indicated on x-axis. On each graph the individuals are listed in the order obtained at K = 3. Each individual is represented by a bar that is segmented into colors based on the assignment into inferred clusters given the assumption of K populations. The length of the colored segment is the estimated proportion of the individual’s genome belonging to that cluster. The analysis was run in 8 replicates for each K, the replicate with the highest likelihood is shown. The genetic structure analysis clearly differentiated the tame population from the aggressive one and did not reveal significant population stratification within the tame population at every K tested. In contrast, the population stratification within the aggressive population became apparent at K = 3.
Fig 3Estimation of linkage disequilibrium (r2) in tame and aggressive fox populations.
Distributions of r values between pairs of SNPs separated by different distances are compared between tame (green) and aggressive (red) populations. SNP pairs were divided into 14 sets (bins) using the estimated distances between SNPs in the fox genome (S2 Table). Each bin is represented by a doubled bar (green and red) on the graph. The range of distances between SNPs in each bin is indicated on the x-axis. The width of the bar represents the relative number of SNP pairs in that bin for that population after a log transformation (wider bars have more pairs of SNPs). Exact numbers of SNP pairs in each bin are presented in S2 Table. The y-axis indicates r values for pairs of SNPs. The yellow diamonds correspond to the mean r for all SNPs in that bin in the population. The white circles correspond to the median values. The thin black line within each bar represents r values in that bin in the population in the interval from the 25th to 75th percentile. The horizontal line corresponds r = 0.2.
SNP minor allele frequency and heterozygosity in tame and aggressive populations.
| Population | Number of SNPs | Mean MAF | Median MAF | SD MAF | Fixed in the population | Mean Expected Heterozygosity | Mean Observed SNP Heterozygosity | Median Observed SNP Heterozygosity | SD SNP Heterozygosity |
|---|---|---|---|---|---|---|---|---|---|
| Tame | 48,294 | 0.148 | 0.088 | 0.147 | 8,014 | 0.22 | 0.16 | 0.10 | 0.16 |
| Aggressive | 48,294 | 0.163 | 0.115 | 0.147 | 5,514 | 0.24 | 0.17 | 0.11 | 0.16 |
Fig 4Allele frequency differences of genome-wide and VVU14 SNPs.
The black dots within the gray bar on the top of each figure panel indicate regions with FST≥0.5. Dots of different colors are the Bonferroni corrected significance of the allele frequency differences between the tame and aggressive populations calculated by PLINK2. The y-axis indicates-log10 (Bonferroni corrected p-value) for SNP allele frequency differences. The black horizontal line corresponds to a significance cutoff of (-log10 = 6). Genomic position in fox, as extrapolated from syntenic regions in the dog genome, is plotted on the x-axis. 4A. Differences in allele frequency of SNPs genome wide. Colors indicate the different fox chromosomes. The x-axis indicates the cumulative estimated positions in the fox genome in megabases. 4B. Differences in allele frequency of SNPs on VVU14. The horizontal colored bars on the x-axis correspond to syntenic dog chromosomes. 4C. Multi SNPs region on VVU14 with significant allele frequency differences between the two populations. CFA3:42,000,000–46,000,000 corresponding to the cluster on VVU14 with multiple SNPs that are highly significant. The genes in the region are indicated just above the x-axis. The gene NR2F2 (COUP-TFII) is the only gene located in the significant region.
SNPs with significant allele frequency differences between tame and aggressive populations.
| SNP | CFA | Location on CFA | VVU | Inferred location on VVU | Differences of SNP minor allele frequencies between two populations | Bonferroni corrected P-value | Distance from previous significant SNP | Clusters |
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
| S1_812259209 | 2 | 51,452,767 | 2 | 138,634,343 | 0.80 | 3.76E-08 | ||
| S1_1648737836 | 36 | 1,743,329 | 3 | 1,743,329 | 0.79 | 1.08E-07 | ||
| S1_1673084403 | 36 | 26,089,896 | 3 | 26,089,896 | 0.77 | 1.74E-07 | 24,346,567 | |
| S1_1964707411 | 6 | 54,892,000 | 3 | 127,827,426 | 0.75 | 2.05E-07 | 101,737,530 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| S1_1531784156 | 32 | 23,626,827 | 4 | 42,200,559 | 0.88 | 2.20E-07 | ||
| S1_1794484892 | 4 | 61,861,562 | 4 | 127,688,948 | 0.86 | 1.16E-08 | 85,488,389 | |
| S1_1712828536 | 38 | 4,119,843 | 5 | 105,433,843 | 0.82 | 4.94E-09 | ||
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| S2_148636413 | 8 | 67,661,781 | 6 | 129,101,715 | 0.75 | 8.95E-07 | 35,047,593 | |
| S1_540429371 | 16 | 13,131,693 | 7 | 13,131,693 | 0.7750 | 9.22E-08 | ||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| S1_1246055552 | 27 | 38,802,107 | 8 | 38,802,107 |
| 3.76E-08 | 2,415,857 | |
| S1_1252424198 | 27 | 45,170,753 | 8 | 45,170,753 |
| 1.97E-07 | 6,368,646 | |
| S1_648198405 | 17 | 61,267,781 | 8 | 48,897,988 |
| 8.80E-07 | 3,727,235 | |
| S1_644111987 | 17 | 57,181,363 | 8 | 52,984,406 |
| 2.20E-08 | 4,086,418 | |
| S1_641943154 | 17 | 55,012,530 | 8 | 55,153,239 |
| 2.08E-08 | 2,168,833 | |
| S1_625871222 | 17 | 38,940,598 | 8 | 71,225,171 |
| 7.10E-10 | 16,071,932 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| S1_237429054 | 11 | 45,418,622 | 12 | 28,970,475 | 0.8111 | 5.07E-08 | ||
| S1_201401022 | 11 | 9,390,590 | 12 | 64,998,507 | 0.7500 | 3.64E-07 | 36,028,032 | |
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| S1_1384636870 | 3 | 48,479,065 | 14 | 96,177,844 | 0.8250 | 3.20E-09 | 3,846,414 | |
| S1_1272088483 | 28 | 18,958,228 | 15 | 99,068,409 | 0.7921 | 8.52E-08 | 2,890,565 |
SNPs with differences in allele frequency between two populations at p <10-7 (Bonferroni correction of PLINK2 [32] association test). The absolute differences of SNP minor allele frequencies between the tame and aggressive populations are listed. These SNPs are presented as dots above the significance line (–log10 >6) on Fig 4. SNPs are listed in the order of the fox map. Multi SNP clusters are in bold and numbered in the rightmost column. Multi SNP clusters are regions where SNPs that met the threshold are within 2 Mb to the next significant SNP. Genes that are in or near (50,000 bp) the multi SNP clusters are listed in Table 3. Dog chromosome—CFA; Fox chromosome—VVU.
Genes located inside or within 50,000 bp from start and end of the multi SNP clusters in the dog genome.
| Cluster | Genes |
|---|---|
| 1 |
|
| 2 | none |
| 3 |
|
| 4 |
|
| 5 |
|
| 6 |
|
| 7 |
|
| 8 |
|
| 9 |
|
| 10 |
|
| 11 |
|
Cluster numbers refer to the numbering from Table 2.