| Literature DB >> 35264717 |
Astrid Vik Stronen1,2,3, Federica Mattucci4, Romolo Caniglia4, Elena Fabbri4, Marco Galaverni5, Berardino Cocchiararo6,7, Carsten Nowak6,7, Raquel Godinho8,9, Aritz Ruiz-González10, Josip Kusak11, Tomaž Skrbinšek12, Ettore Randi13, Albena Vlasseva14, Nadia Mucci4.
Abstract
Intra- and inter-specific gene flow are natural evolutionary processes. However, human-induced hybridization is a global conservation concern across taxa, and the development of discriminant genetic markers to differentiate among gene flow processes is essential. Wolves (Canis lupus) are affected by hybridization, particularly in southern Europe, where ongoing recolonization of historic ranges is augmenting gene flow among divergent populations. Our aim was to provide diagnostic canid markers focused on the long-divergent Iberian, Italian and Dinaric wolf populations, based on existing genomic resources. We used 158 canid samples to select a panel of highly informative single nucleotide polymorphisms (SNPs) to (i) distinguish wolves in the three regions from domestic dogs (C. l. familiaris) and golden jackals (C. aureus), and (ii) identify their first two hybrid generations. The resulting 192 SNPs correctly identified the five canid groups, all simulated first-generation (F1) hybrids (0.482 ≤ Qi ≤ 0.512 between their respective parental groups) and all first backcross (BC1) individuals (0.723 ≤ Qi ≤ 0.827 to parental groups). An assay design and test with invasive and non-invasive canid samples performed successfully for 178 SNPs. By separating natural population admixture from inter-specific hybridization, our reduced panel can help advance evolutionary research, monitoring, and timely conservation management.Entities:
Mesh:
Year: 2022 PMID: 35264717 PMCID: PMC8907317 DOI: 10.1038/s41598-022-08132-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Explorative multivariate analyses performed with Adegenet 2.1.3[57] on the 158 canid individuals typed at 192 SNPs. (a) Bayesian Information Criterion (BIC) run with successive numbers (from 1 to 10) of K-means clusters. The function “table.value” was used to graphically visualize the corresponding best clustering of the individuals. (b) Plot showing that the analysed individuals clustered in their five original sample groups which corresponded to the five inferred genetic clusters (J: Jackals, purple bar; D: domestic dogs, green bar; WDIN: Dinaric wolves, yellow bar, WIB: Iberian wolves, blue bar; WIT: Italian wolves, red bar). (c) Discriminant analysis of principal component (DAPC) scatterplot showing the genetic distribution among dogs (left side of PC-I) and wild canids (right side of PC-I, which explained 66% of the total genetic variability. PC-II (explaining 19% of the total genetic variability) shows the distinctions among the Italian wolves WIT (top of the graphic) and the other wolf (WIB: Iberian wolves, WDIN: Dinaric wolves) and jackal populations. See also the DA and PCA eigenvalue histogram inserted on the lower and upper right side, respectively. (d) Assignment plot based on DAPC: all individuals were assigned to their a-priori genetic clusters defined by the K-means analyses with individual membership probabilities > 0.999.
Figure 2Results from the Bayesian assignment procedures. (a) Estimated posterior probability LnP(K) and corresponding standard deviations of the K clusters from 1 to 10. (b) Bar plots for the 158 individual qi-values obtained through assignment with the 192-SNP genotypes. Each individual is represented by a vertical line partitioned into coloured segments, whose length is proportional to the individual coefficients of membership (qi) in the jackal, dog and wolf clusters inferred by Bayesian assignment analyses performed in Structure2.3.4[60], assuming K from 2 to 5, no prior information (option Usepopinfo not activated) and choosing the “Admixture” and “Independent Allele Frequency” models. J: Jackals; D: dogs; WDIN: Dinaric wolves, WIB: Iberian wolves; WIT: Italian wolves. Bar plots were obtained concatenating data from the five independent runs using Clumpp 1.1.1[62] and graphically displayed using Distruct 1.1[63].
Average membership proportions Qi of the five canid groups in their original sampling group (in bold) with 90% confidence intervals (CI), estimated from the Bayesian assignment analyses performed in Structure 2.3.4[60], assuming K = 5 clusters and using the “Admixture” and “Independent allele frequencies” models as parameter settings.
| Populations | N | |||||
|---|---|---|---|---|---|---|
| Jackals | 21 | 0.000 (0.000–0.001) | 0.000 (0.000–0.001) | 0.000 (0.000–0.001) | 0.000 (0.000–0.001) | |
| Domestic dogs | 52 | 0.001 (0.000–0.009) | 0.001 (0.000–0.004) | 0.001 (0.000–0.005) | 0.001 (0.000–0.004) | |
| Dinaric wolves | 30 | 0.001 (0.000–0.004) | 0.000 (0.000–0.002) | 0.003 (0.000–0.012) | 0.002 (0.000–0.008) | |
| Iberian wolves | 25 | 0.001 (0.000–0.003) | 0.000 (0.000–0.001) | 0.001 (0.000–0.004) | 0.000 (0.000–0.001) | |
| Italian wolves | 30 | 0.000 (0.000–0.002) | 0.000 (0.000–0.001) | 0.000 (0.000–0.001) | 0.000 (0.000–0.001) |
Average Qi were obtained concatenating the data from the five independent runs using Clumpp 1.1.1[62].
Figure 3Bar plots for the 158 individual qi-values obtained through Structure 2.3.4[60] assignment for the canid reference populations with the 192-SNP genotypes, assuming K = 5 clusters: jackals (J), dogs (D), Dinaric wolves (WDIN), Iberian wolves (WIB), Italian wolves (WIT), and 10 first-generation (F1) hybrid and first-generation backcross (BC1) genotypes simulated for each pairwise combination of canid groups. Bar plots were obtained concatenating data from the five independent runs using Clumpp 1.1.1[62] and graphically displayed using Distruct 1.1[63].
Average membership proportions Qi in their original sampling group (in bold) with 90% confidence intervals (CI) for the five canid groups and the first two generations of simulated hybrids, estimated from the Bayesian assignment analyses performed in Structure 2.3.4[60], assuming K = 5 clusters and using the “Admixture” and “Independent allele frequencies” models as parameter settings.
| Populations | N | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Parentals | Jackals | 21 | 0.000 | (0.000–0.002) | 0.001 | (0.000–0.003) | 0.001 | (0.000–0.005) | 0.000 | (0.000–0.002) | ||
| Domestic dogs | 52 | 0.002 | (0.000–0.014) | 0.002 | (0.000–0.013) | 0.002 | (0.000–0.015) | 0.002 | (0.000–0.011) | |||
| Dinaric wolves | 30 | 0.001 | (0.000–0.008) | 0.001 | (0.000–0.005) | 0.006 | (0.000–0.030) | 0.003 | (0.000–0.014) | |||
| Iberian wolves | 25 | 0.001 | (0.000–0.006) | 0.000 | (0.000–0.003) | 0.002 | (0.000–0.013) | 0.001 | (0.000–0.005) | |||
| Italian wolves | 30 | 0.000 | (0.000–0.003) | 0.000 | (0.000–0.003) | 0.001 | (0.000–0.004) | 0.001 | (0.000–0.004) | |||
| Simulated F1 | D-WIT | 10 | 0.003 | (0.000–0.017) | 0.004 | (0.000–0.025) | 0.006 | (0.000–0.028) | ||||
| J-WIT | 10 | 0.000 | (0.000–0.003) | 0.002 | (0.000–0.011) | 0.002 | (0.000–0.012) | |||||
| WIT-WDIN | 10 | 0.002 | (0.000–0.009) | 0.001 | (0.000–0.004) | 0.005 | (0.000–0.031) | |||||
| WIT-WIB | 10 | 0.001 | (0.000–0.006) | 0.000 | (0.000–0.003) | 0.006 | (0.000–0.039) | |||||
| D-WDIN | 10 | 0.003 | (0.000–0.018) | 0.009 | (0.000–0.054) | 0.004 | (0.000–0.027) | |||||
| J-WDIN | 10 | 0.000 | (0.000–0.002) | 0.003 | (0.000–0.020) | 0.001 | (0.000–0.006) | |||||
| WDIN-WIB | 10 | 0.001 | (0.000–0.008) | 0.000 | (0.000–0.003) | 0.001 | (0.000–0.008) | |||||
| D-WIB | 10 | 0.007 | (0.000–0.037) | 0.010 | (0.000–0.058) | 0.004 | (0.000–0.023) | |||||
| J-WIB | 10 | 0.000 | (0.000–0.002) | 0.003 | (0.000–0.015) | 0.001 | (0.000–0.004) | |||||
| D-J | 10 | 0.003 | (0.000–0.021) | 0.004 | (0.000–0.025) | 0.002 | (0.000–0.010) | |||||
| Simulated BC1 | D-WIT-WIT | 10 | 0.002 | (0.000–0.012) | 0.004 | (0.000–0.029) | 0.004 | (0.000–0.027) | ||||
| J-WIT-WIT | 10 | 0.001 | (0.000–0.003) | 0.002 | (0.000–0.010) | 0.002 | (0.000–0.010) | |||||
| J-WIT-J | 10 | 0.000 | (0.000–0.002) | 0.002 | (0.000–0.011) | 0.002 | (0.000–0.012) | |||||
| WIT-WDIN | 10 | 0.002 | (0.000–0.010) | 0.001 | (0.000–0.003) | 0.005 | (0.000–0.035) | |||||
| WIT-WDIN | 10 | 0.001 | (0.000–0.007) | 0.000 | (0.000–0.003) | 0.004 | (0.000–0.025) | |||||
| WIT-WIB-WIT | 10 | 0.001 | (0.000–0.008) | 0.000 | (0.000–0.003) | 0.013 | (0.000–0.081) | |||||
| WIT-WIB-WIB | 10 | 0.001 | (0.000–0.008) | 0.000 | (0.000–0.003) | 0.014 | (0.000–0.070) | |||||
| D-WDIN-WDIN | 10 | 0.003 | (0.000–0.017) | 0.008 | (0.000–0.051) | 0.005 | (0.000–0.027) | |||||
| J-WDIN-WDIN | 10 | 0.000 | (0.000–0.003) | 0.004 | (0.000–0.026) | 0.001 | (0.000–0.010) | |||||
| J-WDIN-J | 10 | 0.000 | (0.000–0.003) | 0.003 | (0.000–0.017) | 0.001 | (0.000–0.006) | |||||
| WDIN-WIB-WDIN | 10 | 0.001 | (0.000–0.009) | 0.000 | (0.000–0.004) | 0.002 | (0.000–0.013) | |||||
| WDIN-WIB-WIB | 10 | 0.001 | (0.000–0.008) | 0.000 | (0.000–0.003) | 0.004 | (0.000–0.027) | |||||
| D-WIB-WIB | 10 | 0.002 | (0.000–0.014) | 0.005 | (0.000–0.030) | 0.004 | (0.000–0.026) | |||||
| J-WIB-WIB | 10 | 0.000 | (0.000–0.002) | 0.002 | (0.000–0.012) | 0.001 | (0.000–0.004) | |||||
| J-WIB-J | 10 | 0.000 | (0.000–0.002) | 0.002 | (0.000–0.014) | 0.001 | (0.000–0.004) | |||||
| D-J-J | 10 | 0.002 | (0.000–0.013) | 0.002 | (0.000–0.012) | 0.001 | (0.000–0.008) | |||||
Data comprise the 192-SNP genotypes of the parental populations (jackals, dogs, Dinaric, Iberian and Italian wolves), their first-generation hybrids (F1) and the first backcross (BC1) of simulated admixed genotypes. J = jackal, D = dog, W = wolf, IT = Italy, DIN = Dinaric, IB = Iberia. Average Qi were obtained concatenating the data from the five independent runs using Clumpp 1.1.1[62].
Figure 4Multivariate discriminant analyses (PC performed with the “dudi.pca” function) (a) in the Adegenet 2.3.4[57] package and (b) with a Bayesian assignment approach in Structure 2.3.4[60] (assuming K = 5 and the “Admixture” and “Independent allele frequencies” models) obtained analysing the 192-SNP genotypes of the five reference populations (J: Jackals; D: Dogs; WDIN: Dinaric; WIB: Iberian; WIT: Italian wolves) together with 33 additional canid genotypes categorized as non-admixed (W, grey dots) or admixed (Hy, black dots) in earlier studies (n = 10 Italian and n = 9 Dinaric wolves, n = 4 Italian wolf x dog (HyIT), n = 4 Dinaric wolf x dog (HyDIN), n = 4 Iberian wolf x dog (HyIB), and n = 2 jackal x dog individuals (HyJ-D)). Bar plots were obtained concatenating data from the five independent runs using Clumpp 1.1.1[62] and graphically displayed using Distruct 1.1[63].
Genetic variability at 192 SNPs in jackals, dogs, and wolves from the Dinaric, Iberian and Italian populations.
| Populations (N) | GL (%) | PL (%) | ML (%) | AO | AE | AR | NP | PIC | HO | uHE | PID | PIDsibs | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jackals (21) | 99.581 | 8.334 | 91.666 | 1.083 (0.020) | 1.039 (0.012) | 1.076 | 2 | 0.019 | 0.024 (0.007) | 0.023 (0.007) | − 0.035 | 0.240 | 1.343 × 10–04 | 1.039 × 10–02 |
| Domestic dogs (52) | 99.712 | 88.545 | 11.455 | 1.885 (0.023) | 1.481 (0.022) | 1.859 | 31 | 0.239 | 0.264 (0.010) | 0.296 (0.011) | 0.110 | < 0.001 *** | 1.005 × 10–50 | 4.279 × 10–26 |
| Dinaric wolves (30) | 99.861 | 46.352 | 53.648 | 1.464 (0.036) | 1.156 (0.020) | 1.383 | 0 | 0.081 | 0.098 (0.012) | 0.099 (0.011) | 0.010 | 0.348 | 2.776 × 10–18 | 3.828 × 10–09 |
| Iberian wolves (25) | 99.793 | 35.946 | 64.054 | 1.359 (0.035) | 1.114 (0.017) | 1.322 | 0 | 0.063 | 0.065 (0.009) | 0.076 (0.010) | 0.143 | < 0.001 *** | 2.002 × 10–13 | 3.855 × 10–07 |
| Italian wolves (30) | 99.913 | 32.818 | 67.182 | 1.328 (0.034) | 1.092 (0.017) | 1.245 | 0 | 0.047 | 0.058 (0.010) | 0.057 (0.009) | − 0.003 | 0.439 | 2.428 × 10–10 | 1.247 × 10–05 |
Values in parentheses denote standard errors. N = sample size; GL = percentage of mean proportion of successfully genotyped loci; PL = proportion of polymorphic loci; ML = proportion of monomorphic loci; AO = number of observed alleles; AE = number of expected alleles; AR = allelic richness; NP = number of private alleles; PIC = polymorphic information content; HO = observed heterozygosity; uHE = unbiased expected heterozygosity; FIS = estimates of departure from Hardy–Weinberg equilibrium; P = probability of obtaining, by chance, FIS values higher than those observed after 10,000 random permutations (*P < 0.05, **P < 0.01, ***P < 0.001); PID = probability of identity; PIDsibs = probability of identity among full sibs.
Matrix of pairwise FST values (below the diagonal) and the corresponding estimates of historical gene flow NM (above the diagonal) among the sampled populations computed using 192-SNP genotypes.
| Populations | Jackals | Domestic dogs | Dinaric wolves | Iberian wolves | Italian wolves |
|---|---|---|---|---|---|
| Jackals | − | 0.119 | 0.091 | 0.070 | 0.033 |
| Domestic dogs | 0.677 | − | 0.141 | 0.136 | 0.110 |
| Dinaric wolves | 0.734 | 0.640 | − | 0.257 | 0.087 |
| Iberian wolves | 0.781 | 0.648 | 0.493 | − | 0.058 |
| Italian wolves | 0.884 | 0.695 | 0.741 | 0.813 | − |
All FST values were highly statistically significant (P < 0.0001).
Figure 5Flowchart describing the SNP selection process for design of the reduced marker panel.