| Literature DB >> 29259842 |
Julien M Haran1,2, Jean-Pierre Rossi3, Juan Pajares4, Luis Bonifacio5, Pedro Naves5, Alain Roques1, Géraldine Roux1.
Abstract
The use of multiple sampling areas in landscape genetic analysis has been recognized as a useful way of generalizing the patterns of environmental effects on organism gene flow. It reduces the variability in inference which can be substantially affected by the scale of the study area and its geographic location. However, empirical landscape genetic studies rarely consider multiple sampling areas due to the sampling effort required. In this study, we explored the effects of environmental features on the gene flow of a flying long-horned beetle (Monochamus galloprovincialis) using a landscape genetics approach. To account for the unknown scale of gene flow and the multiple local confounding effects of evolutionary history and landscape changes on inference, we developed a way of resampling study areas on multiple scales and in multiple locations (sliding windows) in a single large-scale sampling design. Landscape analyses were conducted in 3*104 study areas ranging in scale from 220 to 1,000 km and spread over 132 locations on the Iberian Peninsula. The resampling approach made it possible to identify the features affecting the gene flow of this species but also showed high variability in inference among the scales and the locations tested, independent of the variation in environmental features. This method provides an opportunity to explore the effects of environmental features on organism gene flow on the whole and reach conclusions about general landscape effects on their dispersal, while limiting the sampling effort to a reasonable level.Entities:
Keywords: Gene flow; Iberian peninsula; Insect dispersal; Landscape genetics; Monochamus galloprovincialis
Year: 2017 PMID: 29259842 PMCID: PMC5733902 DOI: 10.7717/peerj.4135
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Genetic clustering of 992 individuals of Monochamus galloprovincialis sampled at 132 locations.
(A) Assignment of individuals to clusters based on a STRUCTURE analysis for K = 2. (B) Assignment of demes to clusters for k = 2, displayed in geographic context (Iberian Peninsula, the size of the pies refers to the size of the demes). (C) PCA of individuals on first and second axes, colors and ellipses refer to demes.
Figure 2Empirical semi-variogram of genotypes of Monochamus galloprovincialis.
The variogram was fitted with an exponential model to highlight the first plateau. Data points are shown with a spatial lag distance of 50 km.
Figure 3Percentage of areas with supported IBR hypotheses for Mantel tests (A) and mean partial Mantel r (B) of areas with support for the IBR hypotheses (p < 0.05) with increasing scales (whole dataset).
E, Elevation; T, Mean minimum temperatures; Pr and Pc, pine densities as a resistant feature and as a corridor respectively; IBD, Isolation by distance.
Figure 4Distribution of supported IBR hypotheses through Mantel tests for the environmental features tested.
(IBD: Euclidian distances; T, mean minimum temperatures; E, elevation; Pr, high pine densities as barriers; Pc, high pine densities as corridors). Gray maps (A) refer to the distribution of environmental features associated with resistance models (from white to black: low to high resistance values). Colored maps refer to interpolations of supported IBR hypotheses on the whole dataset (B) and within the Eastern Iberian cluster only (C). From blue to red: low to high frequency of study areas with supported resistance models.
Figure 5Spatial heterogeneity (mean standard deviation, SD) of environmental features in areas with supported and non-supported resistance hypotheses through Mantel tests with increasing scales.
Mean SD: mean standard deviation. T, mean minimum temperatures (A); E, elevation (B); Pr, high pine densities as barriers (C); Pc, high pine densities as corridors (D). sig: significant, non-sig: non-significant.
Commonality coefficients of both unique and common effects for the three sampling areas with the highest variance explained.
Code pop: population code for the center of the sampling area. Scale: diameter of sampling area (km). N: number of individuals in sampling area. Coef.: percentage of variance explained by environmental features (IBR hypotheses). % Total: percentage of the contribution of environmental features to the total variance explained.
| Code pop | 85 | 130 | 131 | |||
|---|---|---|---|---|---|---|
| Scale | 620 | 540 | 520 | |||
| 225 | 254 | 244 | ||||
| IBR hypotheses | Coef. | % Total | Coef. | % Total | Coef. | % Total |
| 0,008 | 3,408 | 0,001 | 0,351 | 0,002 | 0,807 | |
| 0,050 | 20,775 | 0,070 | 32,651 | 0,059 | 28,108 | |
| 0,004 | 1,806 | 0,008 | 3,678 | 0,008 | 3,811 | |
| 0,085 | 35,235 | 0,047 | 21,817 | 0,046 | 22,214 | |
| 0,136 | 56,426 | 0,117 | 54,314 | 0,115 | 54,953 | |
| −0,003 | −1,255 | −0,001 | −0,243 | −0,001 | −0,430 | |
| 0,001 | 0,426 | 0,007 | 3,236 | 0,010 | 4,893 | |
| −0,002 | −0,897 | 0,031 | 14,375 | 0,020 | 9,721 | |
| −0,013 | −5,398 | 0,018 | 8,406 | 0,014 | 6,540 | |
| 0,024 | 10,049 | 0,008 | 3,579 | 0,011 | 5,333 | |
| 3rd-order | ||||||
| 0,014 | 5,724 | −0,005 | −2,311 | −0,001 | −0,474 | |
| −0,023 | −9,712 | −0,069 | −32,007 | −0,047 | −22,356 | |
| 0,003 | 1,313 | 0,026 | 12,118 | 0,030 | 14,509 | |
| −0,009 | −3,795 | 0,006 | 2,961 | 0,010 | 4,950 | |
| −0,034 | −14,106 | −0,049 | −22,926 | −0,068 | −32,581 | |
| Sum | 0,240 | 100 | 0,216 | 100 | 0,209 | 100 |
Notes.
elevation model (high elevations = resistance to dispersal)
Pine density model (high pine density = corridors to dispersal
reversed pine density model (high pine density = resistance to dispersal)
temperature model (low minimum annual temperatures = resistance to dispersal)
| # Simplified version of the script used in this study. Provides an overview of the general method employed. |
| #———————————————————————— |
| # create and plot background matrix with artificial barrier in middle |
| m <- matrix(1, nrow=10, ncol=10) ; m |
| m[,5] <- 4 |
| library(raster) |
| r <- raster(m) |
| plot(r) |
| # create and plot transition matrix |
| library(gdistance) |
| t <- transition(r, transitionFunction=mean, 4, symm=TRUE, intervalBreaks=3) |
| plot(raster(t)) |
| # create and plot sampling points and associated genetic data. |
| # (x coordinates, y coordinates, genetic data for 3 loci) |
| matG2 <- matrix(c(0.21, 0.22, 0.82, 0.23, 0.81, 0.83, 0.81, 0.21, 0.50, 0.51, 0.23, 0.83, 0, 0, 2, 0, 1, 1, 1, 2, 1, 1, |
| 1, 0, 2, 1, 0, 1, 0, 0), ncol=5) |
| xcoord<- matG2[, 1] ; ycoord <- matG2[, 2] |
| P<-cbind(xcoord,ycoord) |
| points(P) |
| # construction of moving windows (sampling areas) |
| library(“ade4”) ; library(“vegan”) |
| # Define the extent of sampling areas and the interval wanted |
| Min <- 0.7 # Minimum radius of areas wanted |
| Max <- 0.9 # Maximum radius of areas wanted |
| Step <- 0.1 # interval wanted |
| # Loops to test correlations in sampling area on multiple scales and in multiple locations |
| resultsfinal <- cbind(1,1,1,1,1) |
| colnames(resultsfinal) <- c(“xcoord”,“Ycoord”, “Radius”, “MantelR”, “Pval”) |
| for(Radius in seq(Min, Max, by = Step)){ |
| results = NULL |
| for(i in 1:length(xcoord)){ |
| Xcircle <- (xcoord [i] + Radius*cos(seq(0,2*pi,length.out=100))) |
| Ycircle <- (ycoord [i] + Radius*sin(seq(0,2*pi,length.out=100))) |
| polygon(Xcircle, Ycircle) |
| # extract individual data in each sampling are constructed |
| expr <- point.in.polygon(xcoord,ycoord,Xcircle,Ycircle) |
| xcoord[expr==1] |
| ycoord[expr==1] |
| coordPoly <- cbind (xcoord[expr==1],ycoord[expr==1]) |
| # sort data and compute matrix of basic pairwise euclidian distances (not used further in this example) |
| CoordOrder<- coordPoly[order(coordPoly[,1],decreasing=FALSE),] |
| locOrder<-data.frame(CoordOrder) |
| DisGeoEucl<-dist(locOrder, method = “euclidean”, diag = TRUE, upper = TRUE) |
| # compute corresponding matrix of genetic distances |
| listcoord = (1:6)[expr==1] |
| Genet = NULL |
| for(h in listcoord){ |
| tmp <- matG2[(matG2[, 1]==xcoord[h])and(matG2 [, 2]== ycoord[h]), ] |
| Genet = rbind(Genet,tmp) |
| } |
| GenetOrder<- Genet[order(Genet[,1],decreasing=FALSE),] |
| GenetOrderSanscoord <- GenetOrder[,-c(1,2)] |
| MatdistGenet<- vegdist(GenetOrderSanscoord, method=“bray”, binary=FALSE, diag=FALSE, upper=TRUE, na.rm = TRUE) |
| MatdistGenet <- as.dist(MatdistGenet) |
| # Compute matrix landscape “resistance” distances based on raster |
| spatiallocX <- locOrder[,1] |
| spatiallocY <- locOrder[,2] |
| SpaLoc <- SpatialPoints(cbind(spatiallocX, spatiallocY)) |
| Resdis<- commuteDistance(t, SpaLoc) |
| Resdis<-as.dist(Resdis, diag = TRUE, upper=TRUE) |
| # simple mantels test between genetic and landscape “resistance” distances |
| MantelpRes <- mantel.rtest(MatdistGenet, Resdis, nrepet = 99) |
| results <- rbind (results, cbind (xcoord [i], ycoord [i],Radius, MantelpRes[2], MantelpRes[4])) |
| } |
| resultsfinal <- rbind(resultsfinal,results) |
| } |
| # display result file with for each individual: x and y coordinates, radius of sampling area, mantel output and associated p-value |
| Resultsfinal |