Literature DB >> 28549194

Disentangling Timing of Admixture, Patterns of Introgression, and Phenotypic Indicators in a Hybridizing Wolf Population.

Marco Galaverni1,2, Romolo Caniglia1, Luca Pagani3,4, Elena Fabbri1, Alessio Boattini5, Ettore Randi1,6.   

Abstract

Hybridization is a natural or anthropogenic process that can deeply affect the genetic make-up of populations, possibly decreasing individual fitness but sometimes favoring local adaptations. The population of Italian wolves (Canis lupus), after protracted demographic declines and isolation, is currently expanding in anthropic areas, with documented cases of hybridization with stray domestic dogs. However, identifying admixture patterns in deeply introgressed populations is far from trivial. In this study, we used a panel of 170,000 SNPs analyzed with multivariate, Bayesian and local ancestry reconstruction methods to identify hybrids, estimate their ancestry proportions and timing since admixture. Moreover, we carried out preliminary genotype-phenotype association analyses to identify the genetic bases of three phenotypic traits (black coat, white claws, and spur on the hind legs) putative indicators of hybridization. Results showed no sharp subdivisions between nonadmixed wolves and hybrids, indicating that recurrent hybridization and deep introgression might have started mostly at the beginning of the population reexpansion. In hybrids, we identified a number of genomic regions with excess of ancestry in one of the parental populations, and regions with excess or resistance to introgression compared with neutral expectations. The three morphological traits showed significant genotype-phenotype associations, with a single genomic region for black coats and white claws, and with multiple genomic regions for the spur. In all cases the associated haplotypes were likely derived from dogs. In conclusion, we show that the use of multiple genome-wide ancestry reconstructions allows clarifying the admixture dynamics even in highly introgressed populations, and supports their conservation management.
© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Canis lupus; admixture history; genotype–phenotype association; hybridization; introgression

Mesh:

Year:  2017        PMID: 28549194      PMCID: PMC5850710          DOI: 10.1093/molbev/msx169

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

Hybridization is a biological process that, through the cross-breeding between individuals from distinct but closely related taxa, or between discrete entities that exchange genes, can deeply affect their genetic make-up, long-term survival and evolution (Parham et al. 2013; Saetre 2013; Gompert and Buerkle 2016). Natural hybridization is no longer viewed as a sporadic and undesirable evolutionary dead-end, but rather as a relatively frequent and potentially creative process (Mallet 2008; Larsen et al. 2010; Bailey et al. 2013; Stern 2013; Rius and Darling 2014). Natural hybrid zones represent hot-spots of genetic diversity, where natural selection can expose novel gene assemblages to the adaptive process, possibly leading to hybrid speciation (Lavrenchenko and Bulatova 2016). Episodic hybridization may introduce genetic variation into isolated populations, contrasting the deleterious consequences of small effective size and inbreeding (genetic rescue; Brennan et al. 2015), and cases of adaptive introgression have been recently documented in diverse animal and plant species (Grossen et al. 2014; Hovick and Whitney 2014; Fulgione et al. 2016). Conversely, anthropogenic hybridization, caused by habitat changes and by the spread of alien or domestic species, is mostly viewed as a risk factor in conservation biology, especially for the deleterious consequences of gene introgression into wild populations through backcrossing (Allendorf et al. 2001; Laikre et al. 2010). Introgression may spread maladaptive variants causing fitness declines and outbreeding depression, swamp genetic diversity, or disrupt epistatic equilibria, and local adaptations, driving local populations, or entire species to extinction (genomic extinctions; Rhymer and Simberloff 1996; Todesco et al. 2016). However, the long-term evolutionary consequences of genetic admixtures remain largely unpredictable and can sometimes be beneficial (Coulson et al. 2011; Hedrick et al. 2016). Hybridized populations may have complex histories. Hybrids can be generated either by discrete events (e.g., from single migrations waves) or by recurrent gene flow and admixture (e.g., in stable areas of sympatry; Allendorf et al. 2001; Currat et al. 2008; Moorjani et al. 2011; Muhlfeld et al. 2014). Whereas modeling the former case is relatively simple, since the relative proportions of ancestral components will be fairly homogeneous across all the hybrid individuals (Rhymer and Simberloff 1996; Rieseberg et al. 1999; Rosenberg et al. 2010), identifying the patterns and timing of recurrent hybridization events is far more complex (Currat and Excoffier 2011; Twyford and Ennos 2012) and is conditioned by the availability of efficient ancestry-informative markers (AIMs). Moreover, hybridization is expected to affect the phenotypes of admixed individuals, depending on the dominance and quantitative nature of the considered traits (Mallet 2008; Patterson et al. 2010). Therefore, the study of hybridization and introgression in natural populations also provides the opportunity to identify loci associated with phenotypic features, adaptation and selection, which might underlie the genetic bases of fitness in hybrid individuals (Arnold and Martin 2010; Gompert and Buerkle 2012). Canids (genus Canis) represent interesting examples of hybridizing taxa, sharing related and recently divergent genomes. In Eurasia, wolves (C. lupus) can hybridize in the wild with the closely related golden jackals (C. aureus; Freedman et al. 2014; Moura et al. 2014). In North America, historical wolf x coyote (C. latrans) hybridization originated complex patterns of admixed taxa, such as the red wolf (C. rufus), although their origin and taxonomic status is highly debated (Fredrickson and Hedrick 2006; Koblmuller et al. 2009; VonHoldt et al. 2011; Gese et al. 2015; Rutledge et al. 2015; VonHoldt et al. 2016a, 2016b). Wolves, coyotes and golden jackals can successfully reproduce also with domestic dogs (C. l. familiaris), principally in human-dominated regions where the widespread diffusion of stray dogs threatens the gene pool of several wild canid populations, constituting a serious conservation concern (Stronen and Paquet 2013; Monzón et al. 2014; Randi et al. 2014; Galov et al. 2015). A particular case-study is offered by the Italian wolf population, which is sharply genetically differentiated from any other wolf population due to protracted isolation south of the Alps and to demographic declines that led it to shrink down to <100 individuals in the 1970s (Zimen and Boitani 1975; Lucchini et al. 2004; Pilot et al. 2014; Randi et al. 2014). Thanks to the increased availability of prey and legal protection, Italian wolves are now recovering (Galaverni et al. 2016), but they are still threatened by heavy poaching (Caniglia et al. 2010; Imbert et al. 2016) and hybridization with stray dogs, as documented by genetic evidences (Lucchini et al. 2004; Caniglia et al. 2013; Randi et al. 2014). To date, except for a few studies where SNP arrays were applied, but not to assess hybridization patterns in this population (Vonholdt et al. 2011; Stronen and Paquet 2013; Pilot et al. 2014), Italian wolves have been genotyped using small panels of microsatellites (STRs) and uniparental markers (mitochondrial DNA and Y-chromosome sequences), which allow the detection of hybridization only up to 2–3 generations in the past (Vähä and Primmer 2006; Randi 2008), but not to understand the temporal patterns of older hybridization events. In particular, it is unknown whether the Italian wolves prevalently hybridized during or before the bottleneck of the 1970s, or if they recurrently admixed during the reexpansion along the Apennine ridge in the last 40 years. To overcome these limitations, in this study we genotyped 118 wolves, 31 village dogs, and 72 putative genetic and/or phenotypical wolf x dog hybrids, in addition to 456 publicly available dog genotypes, at a panel of 170,000 genome-wide single nucleotide polymorphisms (SNPs) and we exploited the additional information inferred from their haplotype blocks, aiming to identify admixed genotypes older than the first few generations of backcrossing and to estimate their timing of admixture (Lindblad-Toh et al. 2005; Vonholdt et al. 2010; Patterson et al. 2012). Furthermore, we applied local ancestry reconstruction methods to identify regions of the genomes of hybrids that were significantly deviating from random inheritance patterns, likely indicating selective forces locally acting on such regions. Moreover, wolf x dog hybridization may be reflected at the phenotypic level. The causative mutations of some phenotypic variants are well known, such as the one for the black coat color (Candille et al. 2007). This trait is coded by a 3-bp deletion at the β-defensin gene CDB103 that was likely introduced into North American wolves by ancient hybridization with dogs (Anderson et al. 2009). Other traits have also been suggested as potential indicators of hybridization, such as white claws (compared with the usual black or dark-grey wild phenotype) or the spur on the hind legs (a relict fifth finger, also known as dewclaw or preaxial polydactyly; Deng et al. 2015), but their genetic determinants are unknown or uncertain (Ciucci et al. 2003; Caniglia et al. 2013; Randi et al. 2014). Therefore, we used local genome ancestry (Gompert and Buerkle 2012) and genotype–phenotype association procedures (Gorlova et al. 2011) to identify putative dog-derived causal mutations associated with phenotypic variants that might be introgressed into the Italian wolves.

Results

Data Filtering, Sample and Marker Selection

Following genotyping and quality cleaning steps performed in SVS, both per sample and per locus, we retained 213 unrelated samples with high call rate (96.4%) that were successfully genotyped at 156,132 autosomal SNPs (90%, hereafter referred to as the “156k data set”): these samples included 114 wolves, 68 putative hybrids (identified based on previous STR assignments and/or anomalous phenotypic traits; Randi et al. 2014) and 31 village dogs, which were added to 456 dog samples from 30 breeds publicly available from the LUPA project data set, created for the genetic mapping of a number of canine diseases (Lequarré et al. 2011; Vaysse et al. 2011). A subset of 25,061 SNPs (14.5%) was obtained after LD pruning performed in Plink (Purcell et al. 2007) at threshold r2 = 0.1 (the 25 k data set). A smaller set of 19,732 SNPs (11.4%) was obtained after discarding all sites with any missing data (19 k data set).

Assignment and Admixture Analyses

The first component of an exploratory PCA, performed in SVS using the 156 k data set, explained most of the genetic variability (72%; fig. 1) and clearly differentiated dogs from wolves. The putative genetic hybrids were scattered along the first axis between wolves and dogs, much closer to the former except one intermediate sample. Conversely, the putative hybrids identified only by atypical phenotypes were mostly included within the wolf cluster. The second axis, which explained a 10-time smaller portion of the genetic diversity, differentiated the most divergent dog breed, the Dobermanns, followed by Border Terriers and Weimaraners along the additional axes (supplementary fig. S1, Supplementary Material online).
. 1

PC1 versus PC2 results from an exploratory principal component analysis (PCA) computed in SVS on the 156 k SNP data set and including dogs (yellow dots), putatively nonadmixed wolves (blue squares), black wolves (black squares), genetically admixed wolves based on STR data (pink squares) and wolves with atypical phenotypes (gray squares). The two axes are not to scale, in order to better distinguish individuals along PC2. The shaded rectangle includes a zoom on the wolf/hybrid cluster. The dog cluster in the bottom-right corner of the figure is composed of Dobermann individuals.

PC1 versus PC2 results from an exploratory principal component analysis (PCA) computed in SVS on the 156 k SNP data set and including dogs (yellow dots), putatively nonadmixed wolves (blue squares), black wolves (black squares), genetically admixed wolves based on STR data (pink squares) and wolves with atypical phenotypes (gray squares). The two axes are not to scale, in order to better distinguish individuals along PC2. The shaded rectangle includes a zoom on the wolf/hybrid cluster. The dog cluster in the bottom-right corner of the figure is composed of Dobermann individuals. Results from Admixture (Alexander et al. 2009), run with the 25k data set to reassign each sample to its population of origin, were concordant with the PCA analysis (fig. 2). At increasing values of K, the cross validation error steadily decreased up to K = 34 (supplementary fig. S2, Supplementary Material online). However, the first main decrease in CV error was observed at K = 2, that clearly separated wolves from dogs. Thus, we retained K = 2 to identify the proportions of admixture in wolf x dog hybrids. All dogs (DOG, n = 487) had an individual assignment value qw < 0.15 (fig. 2). Similarly, based on the distribution of individual values in the reference wolves (fig. 2), we selected a conservative threshold qw ≥ 0.999 to assign genotypes to the reference Italian wolf cluster (WIT, n = 90), and all those with an intermediate assignment value as possible hybrids (HYB, n = 92) in the downstream analyses, independently of phenotypic information. Since at K > 10 some level of substructuring was observed within wolves, we used K = 10 to identify the prevalent dog components in the admixed individuals, which turned out to correspond to German Shepherds (fig. 2). At K = 34, corresponding to the optimal number of genetic clusters (supplementary fig. S2, Supplementary Material online), all breeds clearly separated from one another (fig. 2), and also from wolves, which were split into five main groups (centred around the Northern Apennines, Eastern Coast—Adriatic, Western Coast—Maremma, Central Apennines, Southern Apennines) reflecting a rough geographical substructure along the Apennines. ThreePOP results for the f3 test (Pickrell and Pritchard 2012) confirmed a highly significant admixture between wolves and dogs in the HYB samples (z-score = −72.064). To better estimate the actual admixture proportions in the analyzed hybrids (Falush et al. 2016), we applied a PCA-based admixture deconvolution approach implemented in PCAdmix (Brisbin et al. 2012), which identified 1.2–51.6% dog blocks in the genome of the hybrid individuals. These proportions highly correlated with those estimated in Admixture at K = 2 (R2 = 0.969; P < 0.01), but highlighted larger dog components (supplementary fig. S3, Supplementary Material online).
. 2.

Admixture results from the 25 k SNP data set at K = 2, K = 10, and K = 34. K = 2 clearly separates wolves from dogs, whereas at K = 10 it is possible to identify the prevalent dog component in the admixed wolves, namely from German Shepherds (in blue). At K = 34 all the dog breeds clearly separates from one another and wolves are split into five main groups, reflecting a rough geographical structure of the population along the Apennines (Northern Apennines in yellow, Eastern Coast—Adriatic in blue, Western Coast—Maremma in dark gray, Central Apennines in pink and Southern Apennines in light brown). The intermediate hybrid identified in the PCA is the third individual of the HIT subset. DIT = village dogs (31), BeT = Belgian Tervuren (12), Bgl = Beagle (10), BMD = Bernese Mountain Dog (12), BoC = Border Collie (16), BoT = Border Terrier (25), BrS = Brittany Spaniel (12), CoS = Cocker Spaniel (14), Dac = Dachshund (12), Dob = Doberman Pinscher (25), EBD = English Bulldog (13), Elk = Elkhound (12), ESt = English Setter (12), Eur = Eurasian (12), Finnish Spitz FSp (12), GoS = Gordon Setter (25), Gry = Golden Retriever (11), GRe = Greyhound (14), GSh = German Shepherd (12), GSl = Greenland Sledge Dog (12), IrW = Irish Wolfhound (11), JRT = Jack Russell Terrier (12), LRe = Labrador Retriever (14), NFd = Newfoundland (25), NSD = Nova Scotia Duck Tolling Retriever (23), Rtw = Rottweiler (12), Sci = Schipperke (25), ShP = Shar-Pei (11), StP = Standard Poodle (12), TYo = Yorkshire Terrier (12), Wei = Weimaraner (26), PIT = wolves with atypical phenotypes (10), BIT = black wolves (14), HIT = genetically admixed wolves based on STR data (45), WIT = putatively nonadmixed Italian wolves (118).

Admixture results from the 25 k SNP data set at K = 2, K = 10, and K = 34. K = 2 clearly separates wolves from dogs, whereas at K = 10 it is possible to identify the prevalent dog component in the admixed wolves, namely from German Shepherds (in blue). At K = 34 all the dog breeds clearly separates from one another and wolves are split into five main groups, reflecting a rough geographical structure of the population along the Apennines (Northern Apennines in yellow, Eastern Coast—Adriatic in blue, Western Coast—Maremma in dark gray, Central Apennines in pink and Southern Apennines in light brown). The intermediate hybrid identified in the PCA is the third individual of the HIT subset. DIT = village dogs (31), BeT = Belgian Tervuren (12), Bgl = Beagle (10), BMD = Bernese Mountain Dog (12), BoC = Border Collie (16), BoT = Border Terrier (25), BrS = Brittany Spaniel (12), CoS = Cocker Spaniel (14), Dac = Dachshund (12), Dob = Doberman Pinscher (25), EBD = English Bulldog (13), Elk = Elkhound (12), ESt = English Setter (12), Eur = Eurasian (12), Finnish Spitz FSp (12), GoS = Gordon Setter (25), Gry = Golden Retriever (11), GRe = Greyhound (14), GSh = German Shepherd (12), GSl = Greenland Sledge Dog (12), IrW = Irish Wolfhound (11), JRT = Jack Russell Terrier (12), LRe = Labrador Retriever (14), NFd = Newfoundland (25), NSD = Nova Scotia Duck Tolling Retriever (23), Rtw = Rottweiler (12), Sci = Schipperke (25), ShP = Shar-Pei (11), StP = Standard Poodle (12), TYo = Yorkshire Terrier (12), Wei = Weimaraner (26), PIT = wolves with atypical phenotypes (10), BIT = black wolves (14), HIT = genetically admixed wolves based on STR data (45), WIT = putatively nonadmixed Italian wolves (118).

Timing of Admixture

The software Alder (Loh et al. 2013), by exploiting information derived from the haplotype structure, based on recombination rates and the extent of LD decay among neighboring loci, identified significant admixture in each generation cohort or pool of cohorts of putative hybrids (generations G1, G2, G3–4, and G5–9 from the present, all with P values < 1.4*10−6; supplementary table S1, Supplementary Material online), although it reported inconsistent decay rates (meaning that one of the parental populations could have been not fully sampled; Loh et al. 2013; supplementary fig. S4, Supplementary Material online). Hybridization was estimated to have occurred 3.5–5.0 generations before sampling and, assuming a wolf generation time of 3 years (Skoglund et al. 2011), this corresponded to four partially overlapping intervals (one per cohort) in which admixture possibly originated, centerd around years 1987, 1995, 1999, and 2000 (fig. 3).
. 3.

Top: timing since the admixture event for each admixed individual, deduced from the empirical distribution of the number of chromosomal switches inferred from PCAdmix in relation to the individual assignment values (proportion of wolf blocks). The expected distributions at increasing generations since admixture are indicated by the colored lines. Bottom: temporal distribution of the admixture events deduced from PCAdmix (vertical bars) compared with the time intervals reconstructed by Alder (horizontal bars) from four generation cohorts or pools of cohorts, with G1 being the last sampled generation (years 2013–2015) and so on, considering a wolf generation time of 3 years (Skoglund et al. 2011). Square, triangle and diamond symbols represent mean values, and vertical sticks represent confidence intervals.

Top: timing since the admixture event for each admixed individual, deduced from the empirical distribution of the number of chromosomal switches inferred from PCAdmix in relation to the individual assignment values (proportion of wolf blocks). The expected distributions at increasing generations since admixture are indicated by the colored lines. Bottom: temporal distribution of the admixture events deduced from PCAdmix (vertical bars) compared with the time intervals reconstructed by Alder (horizontal bars) from four generation cohorts or pools of cohorts, with G1 being the last sampled generation (years 2013–2015) and so on, considering a wolf generation time of 3 years (Skoglund et al. 2011). Square, triangle and diamond symbols represent mean values, and vertical sticks represent confidence intervals. PCAdmix results, that we also used to estimate individual admixture times, indicated that admixture mostly occurred 3–4 generations before sampling, with the oldest events tracing back up to 19 generations before sampling (supplementary fig. S5, Supplementary Material online). Converted into years, this indicated that the hybridization events likely started in the 1940s, followed in the late 1970s, but peaked in the late 1990s (fig. 3), highly concordant with the results from Alder. Although our methods could be expected to be more efficient in tracing more recent hybridization events, the estimated timing of admixture through PCAdmix appeared robust, since estimates from different generation cohorts were highly concordant (supplementary fig. S5, Supplementary Material online). From a geographical point of view, the patterns of admixture showed an initial South-to-North trend, followed by more complex dynamics (fig. 4, supplementary file S1, Supplementary Material online) likely reflecting the range expansion of the species along the Apennines (Fabbri et al. 2007).
. 4.

Map visualizing the geographical distribution of the wolf x dog admixture events in Italy through time, as reconstructed from PCAdmix results. Locations are plotted where the admixed individuals have been sampled and cannot fully reflect the potential movements from where the first parental hybrids were sampled. An animated version of the map is available in supplementary file S1, Supplementary Material online.

Map visualizing the geographical distribution of the wolf x dog admixture events in Italy through time, as reconstructed from PCAdmix results. Locations are plotted where the admixed individuals have been sampled and cannot fully reflect the potential movements from where the first parental hybrids were sampled. An animated version of the map is available in supplementary file S1, Supplementary Material online.

Deviations from Admixture Neutrality

In order to look at deviations from neutrality in the inheritance patterns of parental haplotypes in hybrids, all blocks analyzed by PCAdmix in the hybrids were ranked according to their relative proportion of “dog” or “wolf” assignment, and labeled as admixture outliers if falling into the top or bottom 1% of the genome-wide frequency distribution. In this way, we identified 73 regions with high frequency of dog alleles in hybrids (mostly mapping to chromosomes chr3, chr10, chr28, and chr32) and 106 blocks with high frequency of wolf alleles (mainly on chr7, chr8, and chr12), likely indicating outlier regions for excess of ancestry from one of the parental populations (fig. 5). The “dog-like” outlier regions included 179 annotated protein-coding genes, significantly enriched for Gene Ontology (GO) categories related to transmembrane transport, and a number of Human Phenotype (HP) categories, the most significant of which were linked to nasal and ear morphology (supplementary file S2, Supplementary Material online). The “wolf-like” outlier regions included 235 protein-coding genes, significantly enriched for immunity-related, metabolic and enzymatic biological processes (BP), and for HP categories related to abnormal ossification (supplementary file S2, Supplementary Material online).
. 5.

Distribution of the introgressed regions along the genomes of admixed individuals. Top: PCAdmix outlier regions. Blue bars indicate regions with excess of wolf-derived alleles, yellow bars indicate excess of dog-derived alleles. Center: BGC (Bayesian Genomic Cline analysis; Gompert and Buerkle 2012) alpha parameter outlier SNPs. Values higher than 0 indicate excess of dog alleles, values lower than 0 indicate excess of wolf alleles. Bottom: outlier SNPs for the BGC beta parameter. Values higher than 0 indicate resistance to introgression, lower than 0 an excess of introgression, compared to random expectations. In both cases, BGC significant outliers are indicated by blue crosses (top or bottom 1% of the empirical distribution of values) and by red dots (95% credibility intervals of 10,000 iterations not including 0).

Distribution of the introgressed regions along the genomes of admixed individuals. Top: PCAdmix outlier regions. Blue bars indicate regions with excess of wolf-derived alleles, yellow bars indicate excess of dog-derived alleles. Center: BGC (Bayesian Genomic Cline analysis; Gompert and Buerkle 2012) alpha parameter outlier SNPs. Values higher than 0 indicate excess of dog alleles, values lower than 0 indicate excess of wolf alleles. Bottom: outlier SNPs for the BGC beta parameter. Values higher than 0 indicate resistance to introgression, lower than 0 an excess of introgression, compared to random expectations. In both cases, BGC significant outliers are indicated by blue crosses (top or bottom 1% of the empirical distribution of values) and by red dots (95% credibility intervals of 10,000 iterations not including 0). Local deviations from neutrality were also investigate through a Bayesian Genomic Cline analysis in BGC (Gompert and Buerkle 2012), which identifies regions with excess of ancestry in one of the parental populations compared with neutral expectations (α parameter), as well as those with excess of or resistance to introgression (ẞ parameter). BGC results for the α parameter indicated that 187 SNPs, distributed throughout most of the chromosomes, had an excess of dog ancestry (significantly positive values of α) and 132 SNPs an excess of wolf ancestry (α < 0), with overall higher absolute values in the former (fig. 5). A significant excess of introgression was observed only in six SNPs on chr19 and chr24 (ẞ < 0), whereas nine SNPs on chr17, chr21, chr27, and chr35 showed values of ẞ > 0, indicating a resistance to introgression or steeper genetic clines (fig. 5). The 50-kb regions surrounding the SNPs with excess of dog ancestry contained 210 protein-coding genes that were mostly enriched for a GO biological process related to inflammatory response and a HP category linked to short stature (supplementary file S2, Supplementary Material online). Conversely, the regions with excess of wolf ancestry contained 156 coding genes, enriched for membrane and protein-related cellular components (CC) and for skin-related HP categories (supplementary file S2, Supplementary Material online). The 50-kb regions surrounding the outlier SNPs for the ẞ parameter included only eight genes with excess of introgression, enriched for GO categories related to telomere processing, and 18 genes with resistance to introgression, enriched for GO categories mainly linked to neural cells functions and glucose transport (supplementary file S2, Supplementary Material online). No SNP was significant for both parameters (supplementary fig. S2, Supplementary Material online), although a region located on chr24 around 46–47 Mb included both SNPs that were outlier for ẞ < 0, and SNPs either significantly positive or negative for α, indicating a possible excess of introgression in this region for both wolf and dog alleles. We found only a partial overlap between the outlier SNPs identified from the BGC α parameter and the blocks of ancestry-excess identified from PCAdmix, namely a region on chr25 for dog excess, and a region on chr27 for wolf excess (supplementary file S2, Supplementary Material online).

Ancestry-Informative Marker Selection

To verify their power to identify wolves, dogs and their hybrids, we selected three panels of the most ancestry-informative markers (192, 96 and 48 SNPs; supplementary file S2, Supplementary Material online) from the 25 k data set (FST = 0.16, HO-dog = 0.2312, HO-wolf = 0.1921) and used them to perform additional PCA analyses. Results were highly concordant and well differentiated dogs and wolves (192 top SNPs: FST = 0.58, HO-dog = 0.3051, HO-wolf = 0.0075; Top 96 SNPs: FST = 0.60, HO-dog = 0.2957, HO-wolf = 0.0087; Top 48 SNPs: FST = 0.61, HO-dog = 0.2843, HO-wolf = 0.0095). Most of the admixed individuals clustered in an intermediate position between the parental groups, although there was still no clear subdivision between some of the hybrids and the wolves (fig. 6). The assignment values from Admixture run at K = 2 on the 192, 96, and 48 SNPs were highly concordant with the 25k data set (R2 ≥ 0.87 in all cases) and all performed better than a set of 39 STRs (which had FST = 0.18, HO-dog = 0.5765, HO-wolf = 0.4582; supplementary fig. S9, Supplementary Material online) commonly used to identify wolf x dog hybrids in Europe (Randi et al. 2014). In particular, when fixing an operative individual assignment threshold of qw = 0.95 for the identification of hybrids (as currently used in the ongoing European LIFE projects for the management of wolf x dog hybrids in Italy), all the top-differentiating SNP sets were able to correctly recognize >85% of the hybrids identified with the 25 k data set. Conversely, although based on different Bayesian algorithms, assignment results from Structure on the 39 STRs identified only 37% of the hybrids detected with the 25 k SNP data set.
. 6.

Principal component analysis (PCA) computed in SVS on the 25 k SNP data set and on the 192, 96, 48 SNPs with the highest wolf-dog FST values. For each data set, PC1 versus PC2 are indicated (axes are not to scale), and the overall FST distribution for the 25 k data set is represented in the histogram. Yellow dots indicate dogs, blue dots nonadmixed wolves, light blue dots admixed wolves (Admixtureqw < 0.999). The power of the top 48 SNPs is comparable to that of the 25 k data set, indicating that they can be used as reliable ancestry-informative-markers (AIMs), although no clear subdivision could be traced between some of the admixed and the nonadmixed wolves.

Principal component analysis (PCA) computed in SVS on the 25 k SNP data set and on the 192, 96, 48 SNPs with the highest wolf-dog FST values. For each data set, PC1 versus PC2 are indicated (axes are not to scale), and the overall FST distribution for the 25 k data set is represented in the histogram. Yellow dots indicate dogs, blue dots nonadmixed wolves, light blue dots admixed wolves (Admixtureqw < 0.999). The power of the top 48 SNPs is comparable to that of the 25 k data set, indicating that they can be used as reliable ancestry-informative-markers (AIMs), although no clear subdivision could be traced between some of the admixed and the nonadmixed wolves.

Genotype–Phenotype Associations

To identify the genetic bases of three atypical phenotypic traits described in the literature as possible indicators of hybridization (Ciucci et al. 2003; Anderson et al. 2009), we performed an exploratory genotype–phenotype association test. We contrasted the genotypes of nine cases showing black coat (BC) phenotypes to 102 wild-type controls. The presence of one or more white claws (WC) was described in 16 cases, whereas 98 controls had all claws with the typical dark grey color. The presence of the spur (SP) on the hind legs was identified in five cases, compared with 108 controls where the spur was absent. The genotype–phenotype analysis revealed a main peak associated to the black coat color, with 90 SNPs above the significance threshold after Bonferroni correction (fig. 7), in an interval of about 12 Mb at the end of chr16 that included the β-defensin gene CBD103, known to be responsible for such trait in North American and Italian wolves (Anderson et al. 2009; Caniglia et al. 2013; Randi et al. 2014). The alleles of the three most significant SNPs (chr16:60391793, chr16:61370693, chr16:61718721; table 1) were perfectly associated to the phenotypes of all the cases, plus another four samples for which the presence of a black coat was not known, but that carried the KB deletion (supplementary table S2, Supplementary Material online). To evaluate whether the regions containing the SNPs associated with the atypical phenotypes had a wolf or dog ancestry, we proceeded in two ways: first, we verified their assignment in PCAdmix for all the cases; second, we concatenated all the significantly associated SNPs and reconstructed their haplotype networks. The network reconstructed from the 90 SNPs significantly associated to the BC showed a single haplotype largely represented in wild-type wolves, whereas the six different haplotypes found in individuals with BC were split into two main groups, both rooted on dog-derived nodes (fig. 8). Concordantly, the significant SNPs fell into 41 PCAdmix blocks where all the BC cases had at least one haplotype of dog ancestry.
. 7.

Genotype–phenotype association results for three phenotypic traits considered as potential indicators of hybridization: black coat (top), white claws (center) and spur on the hind legs (also known as dewclaw; bottom panel). Significance thresholds for the –log10 Chi-Squared P values are indicated, corresponding to a Bonferroni-corrected probability of 0.01. (Photo credits, from top to bottom: Renato Fabbri, Willy Reggioni, and Luigi Molinari).

Table 1.

Summary Statistics of the SNPs Most-Significantly Associated with Three Atypical Phenotypic Traits.

TraitTop SNPs
Black coat16:60391793(A/G)16:61370693(A/G)16:61718721(A/G)
 Cases (n = 9)0.4440.5000.500
 Controls (n = 102)0.0050.0050.005
 Chi-squared –10 log P21.213321.556821.5568
 Chi-squared Bonferroni P8.53*10−173.86*10−173.86*10−17
White claws20:27360052(A/G)20:31224886(A/T)20:37232759(A/G)
 Cases (n = 16)0.1880.1880.188
 Controls (n = 98)0.0000.0000.000
 Chi-squared –10 log P9.09319.09319.0931
 Chi-squared Bonferroni P0.000110.000110.00011
Spur11:24498328(A/G)11:27397738(A/C)11:35263745(A/C)
 Cases (n = 5)0.4000.3000.400
 Controls (n = 113)0.0000.0000.000
 Chi-squared –10 log P20.174515.273520.1745
 Chi-squared Bonferroni P9.33*10−167.43*10−119.33*10−16

Note.—For each SNP (indicated as chr:position), the allele associated to the atypical phenotype is showed in italics, together with its frequency in “cases” and in “controls” and the association values detected by SVS.

. 8.

Haplotype networks reconstructed from the SNPs significantly associated to the considered phenotypic traits. Top-left: black coat. Top-right: white claws. Bottom: spur (only referred to chr11). Node dimensions are proportional to the number of times each haplotype has been observed. Haplotypes found in wolves are indicated in blue, those found in hybrids (Admixtureqw < 0.999) in light blue, those found in dogs in yellow. Dashed ovals around nodes identify haplotypes reconstructed from SNPs significantly associated to each phenotypic trait.

Summary Statistics of the SNPs Most-Significantly Associated with Three Atypical Phenotypic Traits. Note.—For each SNP (indicated as chr:position), the allele associated to the atypical phenotype is showed in italics, together with its frequency in “cases” and in “controls” and the association values detected by SVS. Genotype–phenotype association results for three phenotypic traits considered as potential indicators of hybridization: black coat (top), white claws (center) and spur on the hind legs (also known as dewclaw; bottom panel). Significance thresholds for the –log10 Chi-Squared P values are indicated, corresponding to a Bonferroni-corrected probability of 0.01. (Photo credits, from top to bottom: Renato Fabbri, Willy Reggioni, and Luigi Molinari). Haplotype networks reconstructed from the SNPs significantly associated to the considered phenotypic traits. Top-left: black coat. Top-right: white claws. Bottom: spur (only referred to chr11). Node dimensions are proportional to the number of times each haplotype has been observed. Haplotypes found in wolves are indicated in blue, those found in hybrids (Admixtureqw < 0.999) in light blue, those found in dogs in yellow. Dashed ovals around nodes identify haplotypes reconstructed from SNPs significantly associated to each phenotypic trait. The genotype–phenotype association results for the presence of WC revealed a main, large peak on chr20, with 74 significantly associated SNPs (fig. 7). These SNPs spanned an interval of about 20 Mb, which included 286 annotated genes, some of which known to be related to pigmentation (supplementary file S2, Supplementary Material online). However, none of the significant SNPs was associated to all the 16 cases (table 1), but the 22 most significant SNPs on chr20 (between chr20:27360052 and chr20:37651111) clearly explained six of them (supplementary table S2, Supplementary Material online). Another four cases were associated to a solitary SNP on chr16 (chr16:50237955), although with a lower genome-wide significance (fig. 7). The haplotypes reconstructed from the significant SNPs on chr20 formed a network centered around the haplotype most commonly observed in wild-type wolves (fig. 8). Two haplotypes were associated to the WC phenotype and formed a distinct branch rooted on a dog node, which included two additional admixed individuals for which no phenotypic information was available. In PCAdmix, the blocks that included the significant SNPs on chr20 showed that all the six explained cases had at least one dog-derived haplotype, as for the blocks of the other four cases possibly explained by the single significant SNP on chr16. Several regions were associated to the presence of the spur, with a main peak of 145 significant SNPs on chr11, and other peaks on a number of other chromosomes, mainly chr7, chr12 and chr28 (fig. 7). In particular, the three most significant SNPs lied on chr11 (chr11:24498328, chr11:35263745, chr11:41596702; fig. 7) and their alleles were perfectly associated to the phenotypes of up to four of the five cases (table 1; supplementary table S2, Supplementary Material online). The 50-kb regions surrounding the significant SNPs included 467 genes (supplementary file S2, Supplementary Material online), enriched for the BP category GO:0048856 (anatomical structure development) and a number of HP categories related to phalangeal development, syndactyly and polydactyly, represented by 11 genes (supplementary file S2, Supplementary Material online). The haplotypes reconstructed from the significant SNPs in the main peak (chr11) formed a network where the four haplotypes found in the cases were rooted between the main wolf node and a wider panel of dog branches (fig. 8), not allowing a plain discrimination of their origin. However, PCAdmix clearly assigned as dog-derived all the blocks that included the SNPs significantly associated to the spur.

Discussion

Anthropogenic hybridization is considered a global threat to biodiversity, especially in human-dominated contexts where the growing diffusion of domestic species might increase the risks of hybridization and introgression in the gene pool of wild parental populations (Allendorf et al. 2001). Our genome-wide assessment of hybridization in the Italian wolf population, as highlighted by both multivariate and assignment procedures, showed that the wolf x dog putative hybrids we analyzed (selected by their low STR assignment or atypical phenotypic traits; Randi et al. 2014) ranged from complete wolf assignments to c. 20% dog-derived genomes. Only a few cases (8% of the hybrids) had a higher dog content and likely fell within the first three generations from the admixture. This pattern indicates that recurrent admixture events occurred in the past, but their legacy has been mostly diluted through time, as expected in a selectively neutral perspective where the retained dog ancestry should be <1% after seven generations of backcrossing with wolves. The estimated timing of the hybridization events suggests that most of the hybrids do not trace back to the last bottleneck of the Italian wolf population, but rather to its following reexpansion phase. In fact, even if the first cases were dated in the 1940s and in the 1970s (but not in between), the main peak occurred in the late 1990s, when the population was expanding in most of its current range (Lucchini et al. 2002; Fabbri et al. 2007). However, the frequency of hybridization events seems to decline in the following decade, possibly due to the increasing availability of potential wolf mates and to the stronger social structuring and stability of packs, which can lower the need for dog contributions to reproduction (parallel to what described by Rutledge et al. 2010 between eastern wolves and coyotes). Of course, given that 170 k SNPs offer only a moderately resolved snapshot on the whole Canis genome, we cannot exclude that the legacy of more ancient hybridization events could have been left in the whole population. PCAdmix is indeed agnostic on the number of admixture events that may have occurred as it simply identified the “dog” blocks within the genomes of hybrids. However, when applying Alder or similar dating methods, these models are constrained to pick up either the major admixture event (if a main one occurred), the midpoint (in case of continuous admixture events) or the latest event (if these were punctuated). The future comparison of multiple complete genomes from the Italian and other European populations will be needed to shed more light on this possibility. From a geographical point of view, results show that the first hybridization events might have occurred in the main population refugia in central and southern Italy, followed by more frequent events in the northern Apennines and finally in human-dominated landscapes along the Tyrrhenian and Adriatic coasts, likely surfing the main population expansion wave around the 1990s. In particular, the Maremma region (western Tuscany) confirmed to be a local hotspot of hybridization, as previously described for the wolf (Caniglia et al. 2013) and other mammals, such as the wildcat (Mattucci et al. 2013) and the roe deer (Mucci et al. 2012). However, mapping the individual sampling locations cannot properly take into account the movements and dispersals of individuals through the generations following the hybridization event. The prevalent dog contribution to admixture appears to come from German Shepherds, which interestingly represented the most common breed in Italy in the last decades (Ente Nazionale della Cinofilia Italiana, http://www.enci.it, last accessed March 1, 2017) and is also consistent with the presumably higher probability of successful mating of wolves with wolf-sized dogs compared with other breeds. Ancestry mapping enabled us to identify several regions with significant excess of dog or wolf alleles and with steeper or milder genetic clines in the hybrids, which are discussed into details in supplementary text S1, Supplementary Material online. The phenotype–genotype association tests on the black coat, used as a control trait, confirmed the ability of our data set to correctly identify the genomic region hosting the causative mutation for such phenotype, although the exclusion of dogs from the data set due to missing phenotypic information might have reduced the power of the test. Interestingly, both the local genome ancestry analysis and the haplotype networks revealed the dog derivation of this region in all the analyzed cases, confirming previous hypotheses derived from North American wolves (Anderson et al. 2009). However, the extension of dog-derived haplotype blocks and their rooting within the network suggest the occurrence of at least two separate events. The most ancient one included seven individuals sampled in the Northern Apennines, only one detected as a hybrid by Admixture and the other six completely assigned to the wolf cluster (qw = 1.00), indicating a genetic legacy not detectable anymore at the genome-wide level. Conversely, the most recent event traced back 3–8 generations and the corresponding haplotypes were mostly detected in hybrids from the Maremma region (0.841 < qw < 0.978), where a high frequency of black-coated canids was recently documented (Caniglia et al. 2013). Also for the white claws we identified a single highly associated peak on chr20, though relatively large (c. 20 Mb) and not explaining all the cases. This region hosted several genes related to pigmentation, such as MITF (microphthalmia-associated transcription factor), EOGT (EGF domain-specific O-linked N-acetylglucosamine transferase), PRICKLE2 (prickle homolog 2), WNT5A (wingless-type MMTV integration site family member 5A), and GNAI2 (guanine nucleotide binding protein, alpha inhibiting activity). However, a series of successive highly significant SNPs located between 36.92 and 37.65 Mb was found just 70 kb upstream of WTN5A. This gene belongs to the WNT family, whose members encode for secreted glycoproteins with signaling functions (Bachmann et al. 2005) and was described to be targeted by a miRNA (mir-202) highly expressed in the epidermal cells of white-colored alpacas (Tian et al. 2012). However, finding the causal mutations for white-coloring alleles in the claws of hybrid wolves needs further investigations and the role of adjacent genes could also concur to the expression of such trait. In particular, MITF (which is located right at the beginning of the peak) is regulated by the same miRNA of WNT5A (Tian et al. 2012) and is responsible for the major white spotting coloration in Boxer dogs (Karlsson et al. 2007), whereas PRICLKE2 is involved in the planar cell polarity signaling pathway, which controls the differentiation of follicles and thus affects the patterning of the skin and of its keratin annexes (Chen and Chuong 2012). Again, the reconstructed haplotype network and the local ancestry analysis revealed that in all the explained cases the region hosting the significantly associated SNPs was clearly dog-derived, confirming previous suggestions based on morphological bases (Ciucci et al. 2003). Coherently, all the cases explained by the associated variants were detected as possible hybrids (0.781 < qw < 0.993) and their estimated time of admixture traced back 4–9 generations. A more complex pattern emerged from the analyses of the genetic variants associated with the spur, showing a number of peaks on several chromosomes, none of them able to explain all the cases per se. However, this is not surprising since a number of different genes are known to play a role in the development of polydactyly, both in humans (Biesecker 2011) and in dogs, where it occurs more commonly in large breeds, such as Saint Bernard and Bernese (Galis et al. 2001). The regions hosting the significantly associated SNPs included several of such genes. In particular PITX1 (paired-like homeodomain 1) on chr11 and BMP7 (bone morphogenetic protein 7) on chr24 are known to be implicated in the development of polydactyly in a number of species (Galis et al. 2001; Marcil 2003; Klopocki et al. 2012), similar to LMBR1 (limb development membrane protein 1) on chr16, that Park et al. (2008) demonstrated to be responsible for such trait in a Korean dog breed. However, genes included in other significantly associated regions, such as FGF14 (fibroblast growth factor 14) on chr22 and WNT8B (wingless-type MMTV integration site family member 8A) on chr28 could also play a significant role in the complex development of canine polydactyly (Towle and Breur 2004). Again, although the Network results are not conclusive, from PCAdmix the main associated region appears to be dog-derived in the analyzed individuals, as confirmed by their low assignment values (0.781 < qw < 0.862) and compatible with admixture events occurred from 5 to 9 generations in the past. Therefore, the three atypical phenotypic traits we focused on seem to have genetic bases and their presence in wild-living canids is likely a signal of introgression from the dog to the wolf gene pool, although several caveats should be considered. First, not all the cases of white claws are explained by the associated genomic variants; therefore their presence cannot be considered an absolute signal of hybridization, but it could be due to environmental factors (e.g., nutrition, diseases, carcass degradation, or loss of the external keratin shell). Second, several individuals carrying a black coat could not be identified as hybrids even by genome-wide assignment procedures carried out with thousands of markers. This indicates that the hybridization event originating such gene flow occurred several generations in the past and most of the dog-derived alleles in such individuals got lost, resulting in almost pure wolf genomes. Therefore, any classification of hybrids based on sole phenotypic traits for management purposes (e.g., for hybrid removal) would be highly hazardous. Rather, we think that the likely dog ancestry of these traits should be used as a phenotypic marker for a simpler identification of potential hybrids, but a final assessment of their status should be based on careful genetic investigations. To this purpose, the reduced panels of AIMs we identified turned out to be highly concordant with the 25 k SNP data set and appeared to perform better than previously used microsatellites in the identification of hybrids, although such a direct comparison should be treated with caution given the different programs applied to analyze these two types of markers. Therefore, a small panel of highly informative SNPs could be applied to more extensive monitoring plans in areas of supposed or documented hybridization through microfluidic or quantitative PCR techniques (vonHoldt et al. 2013). Such approaches would allow the cost-effective analysis of dozens of samples and markers at a time (e.g., 48 samples × 48 SNPs), even starting from noninvasively collected materials such as faeces (Kraus et al. 2015; Norman and Spong 2015). However, any future management practice for the removal of hybrids should be undertaken considering the serious possibility to incur in both type I (removing nonadmixed wolves erroneously identified as hybrids) and type II errors (not removing hybrid individuals falsely identified as wolves; Allendorf et al. 2001; Randi et al. 2014). Therefore, these problems should be carefully evaluated especially in a population that is undergoing a recent reexpansion but is still threatened by strong poaching and accidental killings, which can weaken the pack structure and further promote hybridization (Rutledge et al. 2012). Moreover, hybrids originated several generations in the past can act as good ecological surrogates of nonadmixed wild-living wolves (Caniglia et al. 2013; Bassi et al. 2017), as documented also for wolf-coyote crosses (Benson et al. 2012), and should not be necessarily removed (VonHoldt et al. 2016a, 2016b; Wayne and Shaffer 2016). Consequently, management actions should be primarily aimed at reducing the high number of free-ranging dogs within the current wolf distribution, and particularly in their current expansion range (the Alps), as indicated by the latest action plan for the conservation of the wolf in Italy (Boitani et al., in prep.). Secondary, attention should be focused on recent hybrids or hybrid packs, in order to prevent the diffusion of large dog-derived genomic regions and a lower acceptance by local communities (M. Apollonio, personal communication). In conclusion, the application of multiple genome-wide ancestry reconstruction methods allows to clarify the patterns and dynamics of admixture even in highly introgressed populations, map the genetic bases of phenotypical indicators of hybridization, identify optimal ancestry-informative markers and support appropriate management practices.

Materials and Methods

Sample Selection, DNA Extraction, and SNP Genotyping

We genotyped DNA extracted from blood or muscular tissue samples from 118 wolves, 31 village dogs, and 72 putative wolf x dog hybrids. Wolves were sampled from the whole species’ distribution in Italy from 1992 to 2015. Putative hybrids were previously identified based on genetic evidences (Randi and Lucchini 2002; Caniglia et al. 2013; Randi et al. 2014) and/or on atypical phenotypes such as black coat color, white claws, or spur on the hind legs (Ciucci et al. 2003). Wolves and hybrids were sampled from carcasses and individuals live-trapped for scientific purposes or rescuing operations, under the Italian Ministry of the Environment’s permit. Dog blood samples were collected in local shelters by veterinary personnel, always respecting animal welfare procedures. No animal was hurt nor sacrificed for the purposes of this study. Genomic DNA was extracted using a DNeasy Tissue Kit (Qiagen Inc., Hilden, Germany) according to the manufacturer’s instructions. We quantified double-stranded DNA concentrations by the PicoGreen assay on a Quant-iT fluorometer (TermoFisher Scientific, Vantaa, Finland) and visually evaluated DNA quality by 1% agarose gel electrophoresis, selecting only samples with a minimum of 50 ng DNA (average DNA concentration: 60.63 ± 3.22 ng/µl) and showing no signs of degradation. We used the CanineHD BeadChip microarray (Illumina, Inc., San Diego, California, USA) to genotype the DNA samples at c. 170 k SNPs, following the Infinium HD Ultra Assay protocol and calling genotypes with GenomeStudio (http://www.illumina.com/documents/products/datasheets/datasheet_genomestudio_software.pdf, last accessed March 1, 2017). We then added 456 dogs from 30 breeds that were genotyped with the same microarray in the LUPA project and that were publicly available (Lequarré et al. 2011; Vaysse et al. 2011). Although the use of the CanineHD BeadChip, mostly developed from known dog variants, could introduce a limited ascertainment bias against wolves (VonHoldt et al. 2011), our haplotype-based analyses, although being based on dog recombination maps, are designed to minimize such an effect when detecting dog genomic segments that have introgressed into the wolf population.

Data Filtering

All genotypes were imported into the SNP&Variant Suite 8.0.1 (SVS, Golden Helix Inc., Bozeman, MT) and were checked for the possible presence of pairs of individuals with an identity-by-descent score higher than 0.5 (closely related individuals). Genotypes were filtered to ensure high call rates (> 95%) per sample and per locus (hereafter: quality-pruned data set), after discarding SNPs mapping on chromosomes X and Y. We used Plink 1.07 (Purcell et al. 2007) to further filter out loci in linkage disequilibrium (LD) at threshold r2 = 0.1 (LD-pruned data set), using the –dog option in order to manage the correct number of chromosomes. We then developed an ad hoc Unix pipeline (available upon request) to integrate and automate the subsequent analysis steps.

Assignment and Admixture Tests

We carried out an exploratory principal component analysis (PCA; Novembre and Stephens 2008) for the first five components in SVS to visualize the distribution of samples in the genetic space, using the additive genetic model (Price et al. 2006) on the quality-pruned data set. We then used Admixture 1.23 (Alexander et al. 2009) on the 25 k LD-pruned data set to reassign each sample to its population of origin, assuming K values from 1 to 40. The most likely number of clusters was identified based on the lowest cross-validation error (Alexander et al. 2009) and the results were plotted in R 3.0.2 (www.r-project.org, last accessed March 1, 2017). Introgression fractions assessed with Admixture (K = 2) were then used to select the reference dogs, reference wolves and admixed individuals for all the subsequent analyses (see results). Individual assignment values of hybrids were verified running again the software using the “supervised approach”, which allows to fix the reference populations (Alexander and Lange 2011), and confirmed by PCAdmix (Brisbin et al. 2012; see below), which is more appropriate in evaluating the actual admixture proportions (Falush et al. 2016). We used the f3 test in ThreePop (TreeMix package 1.12; Pickrell and Pritchard 2012) to formally assess the occurrence of admixture in the Italian wolf population, using blocks of 500 SNPs each and considering Z-score values < −3 to be indicative of admixture (Pickrell and Pritchard 2012).

Estimating the Timing of Admixture

We reconstructed chromosomal haplotypes from the quality-pruned data set in Shapeit 2.837 (Delaneau et al. 2012) with standard parameters and based on dog recombination maps derived from Muñoz-Fuentes et al. (2015), which were the best proxy in absence of any wolf-specific map. All coordinates were referred to the canFam2 dog genome, namely the genomic build that was used to originally design the CanineHD BeadChip. We then estimated the average timing of the admixture events using Alder 1.03 (Loh et al. 2013), which exploits information derived from the haplotype structure and the extent of LD decay among neighboring loci. The putative hybrids were grouped into generation cohorts based on their sampling dates. When in a given cohort the number of samples was <4 we pooled adjacent generations, then the software was run independently on each cohort or pool of cohorts. When the admixture was significant (at P values < 0.01), we retained the estimated number of generations since the admixture event. The average date of admixture for each cohort was then calculated as Y = sY – (nG * g), where Y is the inferred year of admixture, sY is the average sampling year of the cohort, and nG is the number of generations from the admixture event calculated by the software, assuming a generation time g = 3 years (Skoglund et al. 2011). Moreover, we estimated individual admixture times with the PCA-based admixture deconvolution approach implemented in PCAdmix 1.0 (Brisbin et al. 2012), which assigns the most likely ancestry of haplotype blocks identified in hybrid individuals to their parental populations. PCAdmix was run with blocks of 20 consecutive, nonoverlapping SNPs. The average genome-wide proportion of blocks assigned to each reference population was calculated for each sample and compared with the assignment proportions estimated by Admixture. The number of generations since the admixture for each individual was then estimated based on the number of switches from dog to wolf ancestry blocks (or viceversa) using the formula developed by Johnson et al. (2011), modified according to the dog genome length, conditional on the proportion of admixture estimated from PCAdmix. The number of generations from admixture was converted into years since sampling using the same value of 3 years per generation. Summary plots across all samples were then compared with those obtained from Alder. Finally, results for single admixed individuals were visualized in a map to identify possible geographic patterns of admixture through time. According to the neutral theory, the proportions of local ancestry in the hybrid samples should stochastically fluctuate around the average genome-wide proportion of dog and wolf ancestral components. However, selective pressures acting on each locus or linkage group can result in some genetic regions being introgressed more frequently than others in the admixed genotypes. We exploited recently developed Bayesian statistical approaches (Alexander et al. 2009; Brisbin et al. 2012; Gompert and Buerkle 2012) to identify genomic regions that are significantly more or less frequent than expected by chance in the hybrid individuals identified from Admixture. First, haplotype blocks identified by PCAdmix were ranked according to their relative proportion of “dog” or “wolf” assignment in the hybrids. Blocks falling into the top and bottom 1% of the genome-wide distribution were labeled as admixture outliers and adjacent top-ranking blocks were joined into single segments. These regions were then compared with those identified in the hybrids with the Bayesian Genomic Cline analysis in BGC (Gompert and Buerkle 2012), which retrieves SNPs with excess of ancestry in one of the parental populations (the reference wolves and the reference dogs identified from Admixture) compared with random expectations (α parameter), as well as those with excess of or resistance to introgression (ẞ parameter). The software was run after filtering for LD and removing all sites with missing genotypes for 10,000 iterations, sampled every fifth one, using the ICARrho model based upon dog recombination maps derived from Muñoz-Fuentes et al. (2015), in absence of any wolf-specific recombination map. Results were gathered with the estpost utility (Gompert and Buerkle 2012) and the adequate number of burn-in iterations were discarded after visual inspection of the log likelihood values. SNPs were retained as significant when the 95% credible intervals of their median α or ẞ values did not include zero and their median value was included in the top or bottom 1% of the genome-wide distribution (Trier et al. 2014). Starting from the LD-pruned data set, we selected three panels of the 192, 96, and 48 most ancestry-informative markers (AIMs), based on their values of FST between dogs and nonadmixed wolves (with Admixtureqw ≥ 0.999, see Results) calculated in SVS. These three panels of SNPs were used to rerun the PCA and Admixture analyses and to verify their power to identify wolves, dogs and their hybrids. Finally, their performance was compared with that of 39 microsatellite markers previously used to investigate wolf x dog admixture in Italian and European wolves (Randi et al. 2014), based on their Structureqw at K = 2 (run with admixture model, independent allele frequencies, 40 k burn-ins and 400 k MCMC for four iterations).

Genotype–Phenotype Association Testing and Identification of Trait Ancestry

All individuals showing the spur (SP) and white claws (WC) were ranked as “cases” in an exploratory genotype–phenotype association study, using as “controls” all the wild-type wolves and putative hybrids for which phenotypic information was available. Dogs had to be excluded since no phenotypic information was available for such traits. We performed the association test in SVS (Gorlova et al. 2011) using the “basic allele model”, in absence of any hypotheses on the dominance of these traits. We then applied a Bonferroni correction to identify significantly associated SNPs (at nominal P < 0.01) and we plotted the values of the –log10 Chi-Squared P for all chromosomes using the GenomeBrowse tool in SVS. To evaluate whether the mutations associated with the atypical phenotypes had wolf or dog ancestry, the significantly associated SNPs were concatenated and their haplotypes were reconstructed for each chromosomal region with at least ten SNPs in Phase (Stephens and Donnelly 2003) via DnaSP (Librado and Rozas 2009), including the village dog samples for comparison. We then used these haplotypes to build median-joining networks in Network 5.0 (Bandelt et al. 1999). Finally, we used PCAdmix to further evaluate the ancestry of these atypical traits by verifying the assignment of the haplotype blocks that included the significant SNPs to the wolf or dog clusters. The black coat (BC) phenotype, whose genetic basis and likely dog origin are well known (Candille et al. 2007; Anderson et al. 2009), was used for validation in all these analyses, with known black individuals used as cases and wild-type ones as controls.

Gene Search, Gene Ontology Enrichment, and Functional Networks

Coordinates of the outlier blocks from PCAdmix and the outlier SNPs from BGC, as well as the significant SNPs from the association tests, were converted into the canFam3.1 assembly using the liftover tool (https://genome.ucsc.edu/cgi-bin/hgLiftOver) in order to exploit the more complete gene annotation available for such build. Information for the genes mapping within 50 kb of distance from the significant SNPs (which is approximately half of the distance between two nearby SNPs of the 25 k data set) was retrieved in Ensembl Biomart (http://www.ensembl.org/biomart/martview/, last accessed March 1, 2017) from the Ensembl Genes 84 annotation database. The genes obtained from each analysis were then independently checked for possible enrichment towards specific Gene Ontology (GO) and Human Phenotypes (HP) categories in G-profiler (Reimand et al. 2016), only retaining categories significant at P < 0.05 after Benjamini–Hockberg correction. Possible functional relationships between the identified genes were searched in string v.10 (Szklarczyk et al. 2015), including all the available types of evidences.

Supplementary Material

Supplementary data are available at Molecular Biology and Evolution online. Click here for additional data file.
  83 in total

1.  Forensic DNA against wildlife poaching: identification of a serial wolf killing in Italy.

Authors:  Romolo Caniglia; Elena Fabbri; Claudia Greco; Marco Galaverni; Ettore Randi
Journal:  Forensic Sci Int Genet       Date:  2009-11-26       Impact factor: 4.882

2.  Pitx1 and Pitx2 are required for development of hindlimb buds.

Authors:  Alexandre Marcil; Emilie Dumontier; Michel Chamberland; Sally A Camper; Jacques Drouin
Journal:  Development       Date:  2003-01       Impact factor: 6.868

3.  [The role of hybrid zones in speciation: a case study on chromosome races of the house mouse Mus domesticus and common shrew Sorex araneus].

Authors:  L A Lavrenchenko; N Sh Bulatova
Journal:  Zh Obshch Biol       Date:  2015 Jul-Aug       Impact factor: 0.465

4.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

5.  Hybrid zones and the genetic architecture of a barrier to gene flow between two sunflower species.

Authors:  L H Rieseberg; J Whitton; K Gardner
Journal:  Genetics       Date:  1999-06       Impact factor: 4.562

6.  Deletions in PITX1 cause a spectrum of lower-limb malformations including mirror-image polydactyly.

Authors:  Eva Klopocki; Christian Kähler; Nicola Foulds; Hitesh Shah; Benjamin Joseph; Hermann Vogel; Sabine Lüttgen; Rainer Bald; Regina Besoke; Karsten Held; Stefan Mundlos; Ingo Kurth
Journal:  Eur J Hum Genet       Date:  2012-01-18       Impact factor: 4.246

7.  Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping.

Authors:  Amaury Vaysse; Abhirami Ratnakumar; Thomas Derrien; Erik Axelsson; Gerli Rosengren Pielberg; Snaevar Sigurdsson; Tove Fall; Eija H Seppälä; Mark S T Hansen; Cindy T Lawley; Elinor K Karlsson; Danika Bannasch; Carles Vilà; Hannes Lohi; Francis Galibert; Merete Fredholm; Jens Häggström; Ake Hedhammar; Catherine André; Kerstin Lindblad-Toh; Christophe Hitte; Matthew T Webster
Journal:  PLoS Genet       Date:  2011-10-13       Impact factor: 5.917

8.  First evidence of hybridization between golden jackal (Canis aureus) and domestic dog (Canis familiaris) as revealed by genetic markers.

Authors:  Ana Galov; Elena Fabbri; Romolo Caniglia; Haidi Arbanasić; Silvana Lapalombella; Tihomir Florijančić; Ivica Bošković; Marco Galaverni; Ettore Randi
Journal:  R Soc Open Sci       Date:  2015-12-02       Impact factor: 2.963

9.  Introgression from domestic goat generated variation at the major histocompatibility complex of Alpine ibex.

Authors:  Christine Grossen; Lukas Keller; Iris Biebach; Daniel Croll
Journal:  PLoS Genet       Date:  2014-06-19       Impact factor: 5.917

Review 10.  Hybridization and extinction.

Authors:  Marco Todesco; Mariana A Pascual; Gregory L Owens; Katherine L Ostevik; Brook T Moyers; Sariel Hübner; Sylvia M Heredia; Min A Hahn; Celine Caseys; Dan G Bock; Loren H Rieseberg
Journal:  Evol Appl       Date:  2016-02-22       Impact factor: 5.183

View more
  13 in total

1.  Applying genomic data in wildlife monitoring: Development guidelines for genotyping degraded samples with reduced single nucleotide polymorphism panels.

Authors:  Alina von Thaden; Carsten Nowak; Annika Tiesmeyer; Tobias E Reiners; Paulo C Alves; Leslie A Lyons; Federica Mattucci; Ettore Randi; Margherita Cragnolini; José Galián; Zsolt Hegyeli; Andrew C Kitchener; Clotilde Lambinet; José M Lucas; Thomas Mölich; Luana Ramos; Vinciane Schockert; Berardino Cocchiararo
Journal:  Mol Ecol Resour       Date:  2020-01-30       Impact factor: 7.090

Review 2.  Systematic Review on Local Ancestor Inference From a Mathematical and Algorithmic Perspective.

Authors:  Jie Wu; Yangxiu Liu; Yiqiang Zhao
Journal:  Front Genet       Date:  2021-05-24       Impact factor: 4.599

3.  Genomic and morphological data shed light on the complexities of shared ancestry between closely related duck species.

Authors:  Joshua I Brown; Flor Hernández; Andrew Engilis; Blanca E Hernández-Baños; Dan Collins; Philip Lavretsky
Journal:  Sci Rep       Date:  2022-06-17       Impact factor: 4.996

4.  Widespread, long-term admixture between grey wolves and domestic dogs across Eurasia and its implications for the conservation status of hybrids.

Authors:  Małgorzata Pilot; Claudia Greco; Bridgett M vonHoldt; Ettore Randi; Włodzimierz Jędrzejewski; Vadim E Sidorovich; Maciej K Konopiński; Elaine A Ostrander; Robert K Wayne
Journal:  Evol Appl       Date:  2018-03-08       Impact factor: 5.183

5.  Combining Bayesian genetic clustering and ecological niche modeling: Insights into wolf intraspecific genetic structure.

Authors:  Pietro Milanesi; Romolo Caniglia; Elena Fabbri; Felice Puopolo; Marco Galaverni; Rolf Holderegger
Journal:  Ecol Evol       Date:  2018-10-30       Impact factor: 2.912

6.  Two decades of non-invasive genetic monitoring of the grey wolves recolonizing the Alps support very limited dog introgression.

Authors:  Christophe Dufresnes; Nadège Remollino; Céline Stoffel; Ralph Manz; Jean-Marc Weber; Luca Fumagalli
Journal:  Sci Rep       Date:  2019-01-16       Impact factor: 4.379

7.  The relevance of gene flow with wild relatives in understanding the domestication process.

Authors:  Alejandra Moreno-Letelier; Jonás A Aguirre-Liguori; Daniel Piñero; Alejandra Vázquez-Lobo; Luis E Eguiarte
Journal:  R Soc Open Sci       Date:  2020-04-15       Impact factor: 2.963

8.  Increased genetic marker density reveals high levels of admixture between red deer and introduced Japanese sika in Kintyre, Scotland.

Authors:  S Eryn McFarlane; Darren C Hunter; Helen V Senn; Stephanie L Smith; Rebecca Holland; Jisca Huisman; Josephine M Pemberton
Journal:  Evol Appl       Date:  2019-11-06       Impact factor: 5.183

9.  Improving Selection Detection with Population Branch Statistic on Admixed Populations.

Authors:  Burak Yelmen; Davide Marnetto; Ludovica Molinaro; Rodrigo Flores; Mayukh Mondal; Luca Pagani
Journal:  Genome Biol Evol       Date:  2021-04-05       Impact factor: 3.416

10.  Wolf outside, dog inside? The genomic make-up of the Czechoslovakian Wolfdog.

Authors:  Romolo Caniglia; Elena Fabbri; Pavel Hulva; Barbora Černá Bolfíková; Milena Jindřichová; Astrid Vik Stronen; Ihor Dykyy; Alessio Camatta; Paolo Carnier; Ettore Randi; Marco Galaverni
Journal:  BMC Genomics       Date:  2018-07-13       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.