Literature DB >> 28616185

Asymmetry matters: A genomic assessment of directional biases in gene flow between hybridizing spruces.

Guillaume de Lafontaine1,2, Jean Bousquet1.   

Abstract

Assessing directional bias in interspecific gene flow might be important in determining the evolutionary trajectory of closely related species pairs. Using a set of 300 single nucleotide polymorphisms (SNPs) having variable propensity to cross species boundary, we evaluated the genomic extent and direction of interspecific gene flow in a progenitor-derivative spruce species pair (black spruce and red spruce). A higher rate of gene flow was found from black spruce toward red spruce purebreds than vice versa. This asymmetry could reflect the historical gene flow between the two taxa at the time of species inception and during postglacial colonization. A clear asymmetry in introgression was depicted by a greater gene flow between red spruce and hybrids than between black spruce and hybrids. While backcrossing toward red spruce was invariably high across the genome, the actual species boundary is between hybrids and black spruce where gene flow is impeded at those genomic regions impermeable to introgression. Associations between hybrid index and climatic variables (total annual precipitation and mean annual temperature) were tested, as these might indicate a role for exogenous selection in maintaining the species boundary. While an apparent association was found between the hybrid index and precipitation, it collapsed when considered in light of the directional bias in interspecific gene flow. Hence, considering asymmetrical patterns of introgression allowed us to falsify an apparent role for exogenous selection. Although this was not formerly tested here, we suggest that this pattern could result from asymmetrical endogenous selection, a contention that deserves further investigations.

Entities:  

Keywords:  Picea mariana; Picea rubens; introgression; reproductive barriers; single nucleotide polymorphisms

Year:  2017        PMID: 28616185      PMCID: PMC5468134          DOI: 10.1002/ece3.2682

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


Introduction

Natural interspecific hybridization and introgressive hybridization (i.e., introgression) are ubiquitous in plants and have played a prominent role in their evolution (Anderson, 1949; Arnold, 1997). Interspecific gene flow is often asymmetrical in plants and might result in directional patterns of introgression (Arnold, Tang, Knapp, & Martin, 2010; Bacilieri, Ducousso, Petit, & Kremer, 1996; De La Torre, Ingvarsson, & Aitken, 2015; Guichoux et al., 2013; Lowry, Modliszewski, Wright, Wu, & Willis, 2008; Petit, Bodénès, Ducousso, Roussel, & Kremer, 2003; Starr, Gadek, Yoder, Flatz, & Smith, 2013; Tiffin, Olson, & Moyle, 2001). Causal factors that have been invoked for such asymmetrical interspecific gene flow include differences in phenology, fertilization success, and offspring viability (Baack, Melo, Rieseberg, & Ortiz‐Barrientos, 2015; Carney, Cruzan, & Arnold, 1994; Field, Ayre, Whelan, & Young, 2011; Keim, Paige, Whitham, & Lark, 1989; Rieseberg & Blackman, 2010; Starr et al., 2013; Sweigart & Willis, 2003), differences in selective advantages under certain environmental conditions (De La Torre et al., 2015; Lexer, Fay, Joseph, Nica, & Heinze, 2005; Martin, Bouck, & Arnold, 2006), differences in species abundance at the time of mating (Burgess, Morgan, Deverno, & Husband, 2005; Lepais et al., 2009), or contrasted population dynamics related to range shifts and expansions (Currat, Ruedi, Petit, & Excoffier, 2008; Guichoux et al., 2013). Assessing directional biases in interspecific gene flow and their driving processes might be important in determining the evolutionary trajectory of closely related species pairs (Fitzpatrick et al., 2010; Petit et al., 2003; Rhymer & Simberloff, 1996). The notion that interspecific gene flow and reproductive isolation are characteristics of genome regions, and not entire genomes (Key, 1968; Wu, 2001), has recently gained much interest with the advent of genomic methods to analyze large numbers of loci (Lexer & Widmer, 2008; Strasburg et al., 2012). Indeed, recent population genomic studies exploring locus‐level differences between closely related species have revealed that heterogeneous patterns of introgression across the genome are commonplace in the natural populations of various taxa (Harrison & Larson, 2014; Kane et al., 2009; Luttikhuizen, Drent, Peijnenburg, van der Veer, & Johannesson, 2012; Minder & Widmer, 2008; Nolte, Gompert, & Buerkle, 2009; Rieseberg, Whitton, & Gardner, 1999) including trees (Hamilton, Lexer, & Aitken, 2013a; de Lafontaine, Prunier, Gérardi, & Bousquet, 2015; Lexer et al., 2010; Martinsen, Whitham, Turek, & Keim, 2001; Scotti‐Saintagne et al., 2004). Specifically, all these studies support the view that some key genomic regions remain virtually impermeable to interspecific gene flow while most of the genome can be exchanged somewhat freely between species. So far, most studies that have investigated asymmetrical patterns of interspecific gene flow have relied on a modest number of presumably neutral markers, under the assumption that these should reflect the species' entire genome, thus conceived as a cohesive unit. However, just as the overall level of introgression is heterogeneous across the genome, the level of asymmetry in interspecific gene flow might also be variable across the genome (Fitzpatrick et al., 2010). We might specifically expect that the directional bias in introgression should be stronger at less permeable genomic regions, that is, at these key loci where gene flow is actually impeded between species (Currat et al., 2008). Accordingly, asymmetrical introgression between European oak species (Quercus robur and Quercus petraea) was reported when analyzing a subset of 60 interspecific outlier loci (from an F ST‐outlier analysis) but not when using a presumably selectively neutral subset of 166 nonoutlier markers (Guichoux et al., 2013). Black spruce (Picea mariana [Mill] B.S.P.) and red spruce (Picea rubens Sarg.) are two closely related North American conifers thought to have a fairly recent speciation history (Jaramillo‐Correa & Bousquet, 2003; Perron, Perry, Andalo, & Bousquet, 2000). Red spruce likely originated from the glaciation‐induced isolation of a preexisting black spruce population during the Pleistocene (i.e., allopatric speciation). The two taxa would have then come into secondary contact during the Late Holocene (Lindbladh, Jacobson, & Schauffler, 2003; Perron et al., 2000). The current natural range of black spruce spans the transcontinental boreal forest of North America whereas red spruce has a narrow distribution in the montane forests along the Appalachian Mountains of the United States, the Acadian forest of the Maritime Provinces of Canada, and the mixed forest of southern Québec where the climate is typically more humid and temperate. The two taxa co‐occur in sympatry over a large geographic area in northeastern North America where their ranges overlap (Figure 1). Frequent occurrences of hybrid trees have been reported within the contact zone along the St. Lawrence River valley in Southern Québec, testifying of the extensive gene flow between the two spruce taxa (de Lafontaine et al., 2015; Morgenstern & Farrar, 1964; Perron & Bousquet, 1997).
Figure 1

Sampling of 33 populations across the contact zone between Picea mariana and Picea rubens in eastern North America. Ranges of P. mariana and P. rubens are shown in green and red, respectively, with the sympatric area in brown. Pie charts indicate mean genomic proportion of P. mariana (black) and P. rubens (red) ancestry in each sampled population based on 300 SNPs using the Bayesian clustering algorithm implemented in STRUCTURE (K = 2)

Sampling of 33 populations across the contact zone between Picea mariana and Picea rubens in eastern North America. Ranges of P. mariana and P. rubens are shown in green and red, respectively, with the sympatric area in brown. Pie charts indicate mean genomic proportion of P. mariana (black) and P. rubens (red) ancestry in each sampled population based on 300 SNPs using the Bayesian clustering algorithm implemented in STRUCTURE (K = 2) It is generally recognized that the maintenance of species integrity in the face of interspecific gene flow can be driven by exogenous or endogenous selective pressures (i.e., ecological selection versus selection against heterozygotes or Bateson–Dobzhansky–Muller incompatibilities; Arnold, 1997; Barton & Hewitt, 1985; Moore, 1977; Nosil, 2012). Attempts at disentangling the roles of environment‐dependent and environment‐independent factors in driving patterns of introgression between black spruce and red spruce have given mixed results. On the one hand, transplant experiments have shown that red spruce is more shade‐tolerant but less cold and drought‐tolerant than black spruce, and that black spruce has higher allocation to stem wood, while red spruce has higher allocation to roots (Major, Barsi, Mosseler, Campbell, & Rajora, 2003; Major, Mosseler, Barsi, Campbell, & Rajora, 2003a,b; Manley & Ledig, 1979). These results suggest that the two taxa are ecologically distinct, each exhibiting physiological traits that favor different ecological niches. Red spruce has a greater competitive advantage in warm and mesic conditions while black spruce outperforms in cold and dry environments. Hence, these observations suggest a role for exogenous selection whereby interspecific gene flow is impeded as a result of different selective advantages under contrasted levels of ecological (i.e., moisture and temperature) conditions. On the other hand, controlled crosses suggested that endogenous selective pressures might also be important drivers of hybridization and introgression. Indeed, first‐generation hybrids had lower fitness, as illustrated by higher rates of empty and aborted seeds, lower germination rates, lower CO2 uptake, and lower juvenile survival (Major et al., 2003b, 2005; Manley & Ledig, 1979). No record of reproductive phenological barrier between these closely related species is reported in the literature. Heterogeneous patterns of interspecific gene flow have recently been uncovered across the genome of hybridizing black spruce and red spruce (de Lafontaine et al., 2015). In this previous study, 300 single nucleotide polymorphism (SNP) loci spanning the 12 chromosomes of black spruce were grouped into three classes of markers on the basis of their increasing capacity to cross the boundary between the two spruce taxa (impermeable, neutral, and highly permeable marker subsets, respectively) indicating variable states of introgression along the genome. The main objective of the present study was to assess whether interspecific gene flow between P. mariana and P. rubens is asymmetrical, and if so, whether the level of asymmetry is heterogeneous across the genome. We expected that the directional bias in introgression will vary across the three classes of markers as a result of the differential outcome of historical, demographic, and natural selection processes on the three marker subsets (Guichoux et al., 2013). As a complementary objective, we explored the putative role of ecological (exogenous) selective pressures in maintaining the hybrid zone and the integrity of species in the face of introgression. Because there is no possibility to replicate samples of the studied hybrid zone in different settings (as in Lexer et al., 2010), the geographic context of the hybrid zone between black spruce and red spruce does not lend itself easily to a direct assessment of the relative importance of exogenous versus endogenous (genetic) selective processes (see de Lafontaine et al., 2015). Hence, while formally disentangling the role of environmental and genetic factors in the maintenance of this hybrid zone remains beyond the scope of this study, we did hypothesize that uncovering directional biases in introgression could perhaps provide further useful insights regarding the relative importance of ecological selective pressures. Specifically, given that these two taxa are associated with contrasted ecological niches (Major, Barsi, et al., 2003; Major et al., 2003a,b; Manley & Ledig, 1979), we expected that variation in environmental conditions should reflect the extent and direction of introgression if ecological selection is an important determinant of the genetic boundary between species.

Materials and Methods

A total of 385 adult individual trees from 33 populations spanning the allopatric and sympatric zones of P. mariana and P. rubens (Figure 1 and Table S1) were successfully genotyped at 300 SNP loci representing 279 genes spread over the 12 chromosomes of black spruce (de Lafontaine et al., 2015; Dryad repository DOI:10.5061/dryad.9kb02). The intersection of at least two of three population genomic approaches (namely F ST‐outlier analysis, geographic, and genomic clines) was used to classify the sampled loci according to their capacity to introgression. Of the 300 SNPs, 23 were classified as impermeable to introgression, 238 were assumed to be neutral, and 39 were highly permeable to interspecific gene flow (de Lafontaine et al., 2015). Here, we assessed the direction of interspecific gene flow estimated from each of the three marker subsets and to evaluate whether variation in environmental conditions reflects the direction of introgression. Detailed methods regarding SNP discovery, DNA extraction, and genotyping can be found elsewhere (de Lafontaine et al., 2015).

Detection of admixed individuals

The proportion of black spruce and red spruce genome admixture for each sampled individual (hybrid index; q‐value) was estimated based on all SNPs, using the Bayesian clustering algorithm implemented in STRUCTURE 2.3.3 (Pritchard, Stephens, & Donnelly, 2000). Five independent runs, each with 1,000,000 Markov chain Monte Carlo (MCMC) iterations after 50,000 burn‐in periods, were performed for K (number of groups) set to 2, which corresponded to the optimal partitioning (de Lafontaine et al., 2015) and to the actual number of taxa. Expected levels of admixture are 0 and 1 for purebreds, 0.5 for first‐generation (F1) hybrids, and 0.25 and 0.75 for backcrossed individuals. Hence, theoretically, threshold values of 0.125 and 0.875 should be optimal for distinguishing between purebreds and first‐generation backcrossed individuals. We used simulations to assess the efficiency (proportion of individuals in a category that are correctly identified) and accuracy (proportion on an identified category that truly belongs to that category) of the STRUCTURE analysis to tease apart purebreds from admixed individuals (Vähä & Primmer, 2006). Simulated P. mariana, P. rubens, first‐generation hybrids (F1) and backcrosses with each parental (F1 × P. mariana and F1 × P. rubens) were obtained using HYBRIDLAB 1.0 (Nielsen, Bach, & Kotlick, 2006). We used the multilocus genotypes (300 SNPs) of individuals from the allopatric population group of each taxon as reference parental genotypes (n = 92 and 98 individuals for P. mariana and P. rubens, respectively). Simulated P. mariana, P. rubens, F1, F1 × P. mariana, and F1 × P. rubens (n = 1,000 simulated individuals per category) were analyzed in STRUCTURE with five runs of 500,000 MCMC iterations after 10,000 burn‐in periods.

Estimation of asymmetrical gene flow

The program MIGRATE‐N was originally designed and is still most typically used to estimate effective population sizes and past migration rates among n populations (i.e., movement of individuals within a metapopulation) (Beerli & Felsentein, 1999, 2001; Beerli & Palczewski, 2010). The analysis uses maximum‐likelihood or Bayesian inference to jointly estimate the parameters M (M = m/μ; where m and μ are the migration and mutation rates, respectively) describing the mutation‐scaled asymmetrical migration rate, and Θ (Θ = 4N eμ) describing the mutation‐scaled effective population size in a coalescent framework by which alleles are traced back in time to a single ancestral copy. Importantly, Nm in MIGRATE‐N has the same meaning as Nm in the context of F ST. As seen in any other population genetics method, m is seen as a backward immigration rate, which, in other words, is the probability that a randomly chosen individual (i.e., a multilocus genotype) in a given generation came from a population (i.e., a gene pool) different from the one in which it is currently observed in the preceding generation. In recent years, instead of assessing spatial movement of individuals among populations, an increasing number of studies have successfully used the MIGRATE‐N approach to test various scenarios of asymmetrical gene flow between closely related sympatric species and their hybrids based on STRUCTURE assignments of individuals (e.g., Andrew, Ostevik, Ebert, & Rieseberg, 2012; Field et al., 2011; Ortego, Gugger, & Sork, 2016; Scascitelli et al., 2010; Starr et al., 2013). Following this novel approach, we estimated the magnitude and direction of long‐term gene flow between the two spruce taxa and their hybrids using the maximum‐likelihood coalescent‐based approach implemented in MIGRATE‐N 3.6 (Beerli & Felsenstein, 1999, 2001). The 195 individuals from the sympatric zone (Figure 1) were assigned to one of three genotypic classes on the basis of their STRUCTURE q‐values: pure P. mariana (q‐values < 0.125), pure P. rubens (q‐values > 0.875), or hybrids (q‐values between 0.125 and 0.875). For each marker subset (impermeable, neutral, and highly permeable, n = 23, 238, and 39 SNPs, respectively), we estimated Θ for each genotypic class and M between all pairwise classes. For each marker subset, we conducted 10 replicate runs and changed the random number seed, the starting values of Θ and M, and search parameters among runs. For the first run, initial values of Θ and M were estimated from F ST, while for the subsequent runs, we iteratively used the mean maximum‐likelihood estimates (MLE) of Θ and M over the previous runs as starting parameters, as suggested by Beerli (2009). Search parameters included a first set of seven replicate runs with 10 short chains, each visiting 10,000 genealogies and using 500 trees to improve the driving values for the next chain as well as three long chains, each visiting 100,000 samplings with a burn‐in of 5,000 trees. To insure convergence, the following three runs were longer (×5) and consisted of 10 short chains of 50,000 trees, followed by three long chains of 500,000 steps, with the first 10,000 trees discarded as burn‐in. This search strategy was deemed satisfactory because the results (Θ and M parameter estimates) were largely congruent among runs (Beerli, 2009). However, the 95% confidence intervals generated by any individual run of MIGRATE‐N were too narrow, a caveat already reported on the basis of simulations (Abdo, Crandall, & Joyce, 2004). Hence, we used the variation across the MLE obtained from our 10 replicate runs to compute the mean and standard deviation of Θ and M parameter estimates. Differences among these parameters were tested by analyses of variance (ANOVAs) followed by post hoc analyses using Tukey's HSD (honest significant difference) test for multiple comparisons. False discovery rate (FDR; Benjamini & Yekutieli, 2001) control was performed to adjust p‐values for all multiple hypotheses tests (α = 0.05), and all reported p‐values were FDR‐adjusted.

Climatic data analysis

The latitude, longitude, and elevation of sampling locations were input into BIOSIM 10 (Régnière, 1996) which interpolates daily climatic conditions using a simulation model that integrates climatic data recorded between 1970 and 2000 at the four closest weather stations of each sampling location within a range of 50 km. Nine climatic variables were initially assessed: the annual mean temperature (°C), the annual mean of the daily minimal temperature, the annual mean of the daily maximal temperature, the annual number of degree‐days above 0°C, above 5°C and above 10°C, the number of frost days, the total annual amount of precipitation (in mm of water including rain and snow), and the annual amount of snow (in mm of water). All climatic variables were highly correlated with either mean annual temperature or total annual amount of precipitation (as previously reported in Prunier, Gérardi, Laroche, Beaulieu, & Bousquet, 2012); hence, only these two latter climatic factors were retained for further analysis (Table S1). In order to assess whether variation in environmental conditions reflects the direction of introgression, significant differences in the mean of temperature and precipitation variables among geographic/genotypic classes (i.e., allopatric P. mariana, sympatric P. mariana, hybrids, sympatric P. rubens, allopatric P. rubens) were tested by analysis of variance (ANOVA) followed by post hoc analysis using Tukey's HSD (honest significant difference) test for multiple comparisons along with FDR control. Furthermore, the relationship between the hybrid index (q‐value) and each climatic variable within the sympatric zone was tested with linear least‐squares regression.

Results

Simulations were used to assess the efficiency (proportion of individuals in a category that are correctly identified) and accuracy (proportion on an identified category that truly belongs to that category) of the STRUCTURE analysis to distinguish purebreds from admixed individuals (i.e., hybrids sensu lato) using the theoretical threshold hybrid indexes (q‐values of 0.125 and 0.875; see Materials and methods). STRUCTURE assignments of simulated individuals based on all 300 SNPs resulted in almost complete lack of overlap among the five categories assayed (simulated P. mariana, F1 × P. mariana, F1, F1 × P. rubens, and P. rubens; Figure 2a). Hence, simulation results indicated high accuracy and efficiency of the STRUCTURE analysis (both values above 99.8%) to tease apart purebreds from first‐generation backcrosses using the theoretical thresholds (Figure 2a).
Figure 2

Frequency distribution of hybrid indexes (STRUCTURE q‐values) of multilocus genotypes (n = 300 SNP loci) of (a) simulated purebreds, first‐generation backcrosses, and first‐generation (F1) hybrid individuals (n = 1,000 simulated individuals per category) and (b) adult spruce tree individuals (n = 385 individuals) sampled across the allopatric and sympatric zones of Picea mariana and Picea rubens. Sampled spruce tree individuals were assigned to one of three genotypic classes on the basis of their STRUCTURE q‐values: pure P. mariana (q‐values < 0.125), pure P. rubens (q‐values > 0.875), or admixed individuals (i.e., hybrids sensu lato; q‐values between 0.125 and 0.875)

Frequency distribution of hybrid indexes (STRUCTURE q‐values) of multilocus genotypes (n = 300 SNP loci) of (a) simulated purebreds, first‐generation backcrosses, and first‐generation (F1) hybrid individuals (n = 1,000 simulated individuals per category) and (b) adult spruce tree individuals (n = 385 individuals) sampled across the allopatric and sympatric zones of Picea mariana and Picea rubens. Sampled spruce tree individuals were assigned to one of three genotypic classes on the basis of their STRUCTURE q‐values: pure P. mariana (q‐values < 0.125), pure P. rubens (q‐values > 0.875), or admixed individuals (i.e., hybrids sensu lato; q‐values between 0.125 and 0.875) Based on these thresholds, assignment scores of the actual 385 sampled spruce trees indicated that 21% were admixed individuals (Figure 2b). Within the sympatric zone, the proportion of hybrids increased to 38%. While these admixed individuals belong to a wide range of hybrid generations, the distribution of the hybrid indexes was clearly a skewed (Figure 2b). Specifically, 74% of admixed individuals were backcrossed toward red spruce. The remaining hybrids were distributed at low frequency across the various levels of admixed ancestries, including recent‐generation hybrids (11%) and backcrosses toward black spruce (11%), along with advanced‐generation hybrids falling in between the simulated categories (4%). In practice, the individuals assigned as purebreds by STRUCTURE analysis (79% of total sample) should indeed include mostly pure P. mariana or P. rubens individuals, as well as some late‐generation backcrosses toward the parental taxa. The coalescent‐based approach implemented in MIGRATE‐N was used to jointly estimate the effective population size as well as the magnitude and direction of gene flow between the two spruce taxa and their hybrids within the sympatric zone. Results of this analysis are summarized in Figure 3 where means and standard deviations of Θ and M parameter estimates (see Section 2) are reported while MLE along with their 95% confidence intervals for each MIGRATE‐N run are available in Table S2. For all three sets of markers, the mutation‐scaled effective population size (Θ) was significantly higher in black spruce, intermediate in hybrids, and lower in red spruce (all p‐values <.0001), a result consistent with differences in geographic ranges (Figure 1). The mutation‐scaled migration rates (M) between P. mariana and P. rubens significantly increased in both directions (M ) among the marker subsets according to their increasing capacity to cross the boundary between the two spruce taxa (i.e., negligible gene flow at impermeable loci, intermediate levels at presumably neutral loci, and high gene flow at highly permeable loci; p‐value <.0001). There was a higher rate of unidirectional gene flow from black spruce to red spruce than the opposite (M  > M ). This asymmetry was significant with the impermeable and neutral markers subset (p‐values = .002 and.0006, respectively) but not with highly permeable markers (p‐value = .15). Bidirectional gene flow was high between red spruce and hybrids (M ) and did not vary significantly among the three marker subsets (p‐values = .07). By contrast, bidirectional gene flow between black spruce and hybrids (M ) increased among the marker subsets with negligible gene flow at impermeable loci, intermediate levels at presumably neutral loci, and high gene flow at highly permeable loci (p‐value < .0001). Hence, there was a clear asymmetry in introgression with greater overall (bidirectional) gene flow between red spruce and hybrids than between black spruce and hybrids (M  > M ; p‐value < .0001). However, no significant unidirectional gene flow asymmetry was found between purebreds and hybrids (i.e., M  = M ; M  = M ; all p‐values ≥.24), except between black spruce and hybrids at neutral markers (M  > M ; p‐value = .002). In summary, at impermeable markers, there was high gene flow between red spruce and hybrids but negligible gene flow between black spruce and red spruce as well as between black spruce and hybrids (M  > M  = M ; p‐value < .0001). A similar scenario was found at presumably neutral markers where there was high gene flow between red spruce and hybrids but intermediate gene flow between black spruce and red spruce as well as between black spruce and hybrids (M  > M  = M ; p‐value < .0001). However, at highly permeable markers, the level of gene flow was uniformly high among all three genotypic classes (M  = M  = M ; p‐value = .20).
Figure 3

Mean maximum‐likelihood estimates (standard deviation in parentheses) of the mutation‐scaled effective population size (Θ) and the mutation‐scaled migration rate (M) among Picea marina (B), Picea rubens (R), and hybrids (H) (represented by dark gray, red, and yellow circles, respectively) in the sympatric zone. Parameter estimates were computed over 10 replicate MIGRATE‐N runs for (a) a set of markers impermeable to interspecific gene flow (n = 23 SNP loci), (b) a set of neutral markers (n = 238 SNP loci), and (c) a set of markers highly permeable to interspecific gene flow (n = 39 SNP loci). Circle size and thickness of arrows proportional to parameter estimates

Mean maximum‐likelihood estimates (standard deviation in parentheses) of the mutation‐scaled effective population size (Θ) and the mutation‐scaled migration rate (M) among Picea marina (B), Picea rubens (R), and hybrids (H) (represented by dark gray, red, and yellow circles, respectively) in the sympatric zone. Parameter estimates were computed over 10 replicate MIGRATE‐N runs for (a) a set of markers impermeable to interspecific gene flow (n = 23 SNP loci), (b) a set of neutral markers (n = 238 SNP loci), and (c) a set of markers highly permeable to interspecific gene flow (n = 39 SNP loci). Circle size and thickness of arrows proportional to parameter estimates Significant differences in both temperature and precipitation variables were found among the various geographic and genotypic classes (both p‐values < .0001; Figure 4a,b). Mean annual temperature was significantly lower in the black spruce allopatric zone, intermediate in the sympatric zone, and higher in the allopatric red spruce zone (Figure 4a). The trend was roughly similar for total annual precipitation. However, within the sympatric zone, on average, red spruce individuals received more precipitation than both hybrids and black spruce individuals; that is, at a level not significantly different from that of the red spruce allopatric zone (Figure 4b). A closer look at the relationship between hybrid index and each climatic variable within the sympatric zone corroborates these results. While the hybrid index did not vary according to mean annual temperature (p‐value = .21; Figure 4c), there was a significant increase in P. rubens ancestry with increasing amount of total annual precipitation, although the explanatory power of this linear model was quite low (p‐value = 0.0001, r 2 = 0.07; Figure 4d).
Figure 4

Variation in mean annual temperature and total annual precipitation among geographic and genotypic classes (a,b) and scatter plots showing the relationship between hybrid index (STRUCTURE q‐value; 0 = Picea mariana, 1 = Picea rubens) and each climatic variable in the sympatric zone (c,d). Boxplots indicate lower quartile, median, and upper quartile. Whiskers' length is 1.5× interquartile range. Significant differences in mean values are indicated by different lowercase letters

Variation in mean annual temperature and total annual precipitation among geographic and genotypic classes (a,b) and scatter plots showing the relationship between hybrid index (STRUCTURE q‐value; 0 = Picea mariana, 1 = Picea rubens) and each climatic variable in the sympatric zone (c,d). Boxplots indicate lower quartile, median, and upper quartile. Whiskers' length is 1.5× interquartile range. Significant differences in mean values are indicated by different lowercase letters

Discussion

Uncovering the patterns of gene flow between closely related taxa is important for understanding the processes that maintain distinct species. Studies reporting morphological identifications of red spruce and black spruce and molecular assessments of the extent of introgression between these taxa have been quite controversial (Bobola et al., 1996; Gordon, 1976; Manley, 1972; Perron & Bousquet, 1997). Yet, in the present study, simulations indicated that the STRUCTURE clustering analysis used with a set of 300 SNP markers allowed to tease apart purebreds from first‐generation backcrosses using the theoretical thresholds with high accuracy and efficiency (Vähä & Primmer, 2006). Using this approach, the proportion of admixed individuals (i.e., hybrids sensu lato) reached 38% within the sympatric zone, supporting previous morphological and genetics studies reporting extensive interspecific gene flow between black spruce and red spruce (Morgenstern & Farrar, 1964; Perron & Bousquet, 1997). However, the level of introgression between these two spruce taxa is not homogeneous across the genome and, more specifically, gene flow varied among the 300 markers assayed in the present study (de Lafontaine et al., 2015). This property was further explored here in order to assess whether the level of asymmetry in interspecific gene flow between P. mariana and P. rubens is also heterogeneous across the genome. Within the sympatric zone, there was a higher rate of unidirectional gene flow from black spruce to red spruce purebreds than the opposite (M  > M ). Such asymmetry, based solely on data from purebred individuals, could reflect the current different population sizes of the two taxa. According to Burgess et al. (2005), higher gene flow should be expected from the more abundant species (black spruce) to the rare species (red spruce). However, under this density‐dependant scenario, we should also expect that first‐generation hybrids will be more likely to mate with the more abundant species (Lepais et al., 2009), which, in this instance, would lead to increased directional introgression toward black spruce. Because we did not observe such a pattern and also because MIGRATE‐N delivers long‐term migration estimates (Beerli, 2009), we conclude that the migration rate between pure types should reflect the general trend in historical gene flow between the two taxa at the time of species inception as well as during postglacial colonization (Currat et al., 2008; Guichoux et al., 2013). First, red spruce originated from a glaciation‐induced isolation of a preexisting black spruce population most likely during the Pleistocene (Perron et al., 2000). As a consequence, the genetic diversity of red spruce at the DNA level is essentially a subset of that of black spruce (Jaramillo‐Correa & Bousquet, 2003; Perron et al., 2000). Hence, given the progenitor‐derivative nature of this species pair, the direction of gene flow during the speciation was necessarily higher from black spruce (progenitor) toward red spruce (derivative) than vice versa. Second, based on pollen paleorecords of eastern North America, Lindbladh et al. (2003) suggested the replacement of a black spruce forest by a red spruce‐dominated forest during the Late Holocene (since ca. 1,500 cal year BP), when red spruce massively invaded the area. According to the theoretical neutral model of introgression during range expansions (Currat et al., 2008), introgression occurs almost exclusively from the resident to the invading species. Our genetic data corroborate this scenario for the two spruce species whereby the genome of the resident taxa (black spruce) infiltrated the genomic background of the colonizing taxa (red spruce) more than the opposite. Another prediction of the neutral model of introgression is that the asymmetrical pattern of introgression from resident to invasive taxa should be strongest at markers experiencing reduced gene flow (Currat et al., 2008; Guichoux et al., 2013). Correspondingly, the asymmetry of gene flow between black spruce and red spruce was found at both impermeable and presumably neutral markers, but not at highly permeable markers, where gene flow appeared virtually unimpeded in both directions. The most striking pattern is a clear asymmetry in introgression depicted by a greater overall (bidirectional) gene flow between red spruce and hybrids than between black spruce and hybrids (M  > M ). As expected, this asymmetry was heterogeneous across the genome, but this heterogeneity was entirely due to the genomic variability of the gene flow between black spruce and hybrids. Indeed, while gene flow between red spruce and hybrids (i.e., backcrossing toward P. rubens) remained invariably high even for impermeable loci subset, there was an increase of total gene flow between black spruce and hybrids (i.e., backcrossing toward P. mariana) according to the permeability of the marker subsets (i.e., negligible gene flow at impermeable loci, intermediate levels at presumably neutral loci, and high gene flow at highly permeable loci). At highly permeable loci, the asymmetrical pattern of introgression was no longer detected, that is, at these loci, backcrossing was equally high toward both parental species. We have argued above that the directional gene flow observed between the two parental species (black spruce toward red spruce) might reflect the known long‐term demographic history of the two spruce taxa (i.e., at the time of species inception and during postglacial colonization). Yet, it has been suggested that both historical and contemporary processes can act in concert to influence and maintain patterns of introgression (Abbott et al., 2013; Hamilton & Aitken, 2013). The higher level of backcrossing toward P. rubens than toward P. mariana might reflect more contemporary evolutionary processes including exogenous or endogenous selection. Indeed, in natural plant populations, it is commonly assumed that selection remains the primary mechanism implicated in determining contemporary patterns of hybridization and introgression (Fitzpatrick et al., 2010; Hamilton & Aitken, 2013; Hamilton et al., 2013a; Lexer et al., 2005; Martin et al., 2006; Scascitelli et al., 2010). Because red spruce and black spruce are associated with contrasted ecological niches (Major, Barsi, et al., 2003; Major et al., 2003a,b; Manley & Ledig, 1979) and as there is no evidence of reproductive phenological barrier, we expected that variation in environmental conditions should reflect the extent and direction of introgression if exogenous selection is a key determinant for the maintenance of the boundary between species. Overall, both temperature and precipitation variables (mean annual temperature and total annual amount of precipitation) had higher values for red spruce than for black spruce, with intermediate values for hybrids. These results are largely congruent with transplant experiments suggesting that red spruce has a greater competitive advantage in warm and mesic conditions while black spruce outperforms in cold and dry environments (Major, Barsi, et al., 2003; Major et al., 2003a,b; Manley & Ledig, 1979). On this basis, it could be tempting to argue that exogenous selection might have played a role in the maintenance of species integrity, with parental species locally adapted to their native ecological niche, and hybrids having higher fitness than parents within the transitional climate (bounded hybrid superiority model). Indeed, this model explained the maintenance of the hybrid zone and species integrity between hybridizing spruce species pairs from western North America (i.e., Picea sitchensis × Picea glauca, Hamilton et al., 2013a; Hamilton, Lexer, & Aitken, 2013b; Hamilton & Aitken, 2013; and Picea engelmannii × P. glauca, De La Torre, Wang, Jacquish, & Aitken, 2014; De La Torre et al., 2015). However, a closer look at the sympatric zone of P. mariana and P. rubens, where contemporary interspecific gene flow actually occurs, tells a different story. While an association between genetic ancestry and environmental variables should be expected if exogenous selection is a key process, temperature did not vary significantly among the three genotypic classes (P. mariana purebreds, hybrids, and P. rubens purebreds) within the sympatric zone. Hence, unlike the pattern found for hybridizing spruce species pairs of western North America (De La Torre et al., 2014, 2015; Hamilton & Aitken, 2013; Hamilton et al., 2013a,b), there was no significant relationship between hybrid index and temperature within the sympatric zone of black spruce and red spruce. By contrast, such a relationship was found between the hybrid index and precipitation, albeit with very low explanatory power. Precipitation was higher for P. rubens purebreds than for hybrids and P. mariana purebreds. Interestingly, this apparent association collapses when considered in light of the directional bias in interspecific gene flow. Indeed, within the sympatric zone, a difference in total annual amount of precipitation was only observed between red spruce and hybrids; yet, this cannot stand out as a strong selective filter because the gene flow was invariably high between these two genotypic classes. Furthermore, while an actual barrier to interspecific gene flow was strictly found between P. mariana and hybrids, the amount of precipitation did not differ between these. While formally disentangling the role of environmental and genetic factors in the maintenance of the hybrid zone between black spruce and red spruce was beyond the scope of this study, uncovering directional biases in interspecific gene flow did provide some useful insights. Specifically, it allowed us to falsify an apparent association between hybrid index and precipitation we initially detected before considering asymmetrical patterns of introgression. Although we cannot rigorously discard the possibility of exogenous selection based solely on our exploratory analysis, we found no evidence that two of the most important climatic factors, namely temperature and precipitation, played a key role in the maintenance of species boundary between black spruce and red spruce. This is in sharp contrast with other hybridizing spruce species pairs from western North America for which temperature and precipitation determined the distribution and structure of hybrid zones and where climatic differences were greater than in our study (De La Torre et al., 2014, 2015; Hamilton & Aitken, 2013; Hamilton et al., 2013a,b). However, it must be acknowledged that there could be other exogenous selective pressures that were not assessed in the present study, such that a different ecological factor is more important than precipitation or temperature for preventing black spruce backcrosses. Hypothetically, assuming that (1) there is no reproductive phenological barrier, (2) selection (rather than purely neutral processes) is the primary mechanism implicated in determining contemporary patterns of introgression, and (3) exogenous selection is not a key determinant for maintaining the spruce hybrid zone, endogenous selective pressures (such as Bateson–Dobzhansky–Muller incompatibilities) could perhaps represent another asymmetrical mechanism preventing ongoing backcrossing toward black spruce at impermeable genomic regions. These genomic incompatibilities would be asymmetrical as backcrossing toward red spruce remains virtually unimpeded. It is well established that such asymmetrical incompatibilities constitute a possible outcome of the Bateson–Dobzhansky–Muller models (e.g., when unidirectionally inherited markers are involved in the incompatibility) and represent Darwin's corollary to Haldane's rule (Turelli & Moyle, 2007; Welch, 2004; Wu & Beckenbach, 1983). In a previous study, we uncovered the semipermeable nature of the barriers to gene flow between black spruce and red spruce by identifying loci representing impermeable, neutral, and highly permeable genomic regions (de Lafontaine et al., 2015). With the present study, we moved forward our understanding about species delimitation of these progenitor‐derivative taxa by showing that the barriers to gene flow are asymmetrical. First, we suggested that the higher rate of gene flow from black spruce to red spruce than the opposite likely reflects the known long‐term demographic history of the two spruce taxa. Second, while backcrossing toward red spruce was invariably high across the genome, the actual species boundary is between hybrids and black spruce, where gene flow is impeded at those genomic regions impermeable to introgression. Hence, in this progenitor‐derivative species pair, the barrier to interspecific gene flow is asymmetrical and prevents introgression of the derivative species (red spruce) back into the genomic background of the progenitor species (black spruce). Uncovering directional biases in interspecific gene flow provided some insights about the relative importance of the selective pressures maintaining the hybrid zone and the integrity of closely related species. Specifically, considering asymmetrical patterns of introgression allowed us to falsify an apparent association between hybrid index and precipitation. Although our study did not completely rule out the role of exogenous selection in maintaining the species boundary between red spruce and black spruce, it does however advise caution when inferring exogenous selection pressure from a given ecological variable based on environment–genetic associations alone.

Conflict of Interest

None declared. Click here for additional data file.
  51 in total

1.  Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach.

Authors:  P Beerli; J Felsenstein
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-03       Impact factor: 11.205

2.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

3.  Hybrid populations selectively filter gene introgression between species.

Authors:  G D Martinsen; T G Whitham; R J Turek; P Keim
Journal:  Evolution       Date:  2001-07       Impact factor: 3.694

4.  Asymmetrical crossing barriers in angiosperms.

Authors:  P Tiffin; M S Olson; L C Moyle
Journal:  Proc Biol Sci       Date:  2001-04-22       Impact factor: 5.349

5.  Maximum-likelihood estimation of migration rates and effective population numbers in two populations using a coalescent approach.

Authors:  P Beerli; J Felsenstein
Journal:  Genetics       Date:  1999-06       Impact factor: 4.562

6.  Patterns of nucleotide diversity in two species of Mimulus are affected by mating system and asymmetric introgression.

Authors:  Andrea L Sweigart; John H Willis
Journal:  Evolution       Date:  2003-11       Impact factor: 3.694

7.  Evaluating the performance of likelihood methods for detecting population structure and migration.

Authors:  Zaid Abdo; Keith A Crandall; Paul Joyce
Journal:  Mol Ecol       Date:  2004-04       Impact factor: 6.185

8.  Hybrid zones and the genetic architecture of a barrier to gene flow between two sunflower species.

Authors:  L H Rieseberg; J Whitton; K Gardner
Journal:  Genetics       Date:  1999-06       Impact factor: 4.562

9.  Evidence from sequence-tagged-site markers of a recent progenitor-derivative species pair in conifers.

Authors:  M Perron; D J Perry; C Andalo; J Bousquet
Journal:  Proc Natl Acad Sci U S A       Date:  2000-10-10       Impact factor: 11.205

10.  Light-energy processing and freezing-tolerance traits in red spruce and black spruce: species and seed-source variation.

Authors:  John E Major; Debby C Barsi; Alex Mosseler; Moira Campbell; Om P Rajora
Journal:  Tree Physiol       Date:  2003-07       Impact factor: 4.196

View more
  2 in total

1.  Discrepancies between genetic and ecological divergence patterns suggest a complex biogeographic history in a Neotropical genus.

Authors:  Giorgio Binelli; William Montaigne; Daniel Sabatier; Caroline Scotti-Saintagne; Ivan Scotti
Journal:  Ecol Evol       Date:  2020-04-01       Impact factor: 2.912

2.  Adaptive evolution in a conifer hybrid zone is driven by a mosaic of recently introgressed and background genetic variants.

Authors:  Mitra Menon; Justin C Bagley; Gerald F M Page; Amy V Whipple; Anna W Schoettle; Christopher J Still; Christian Wehenkel; Kristen M Waring; Lluvia Flores-Renteria; Samuel A Cushman; Andrew J Eckert
Journal:  Commun Biol       Date:  2021-02-05
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.