| Literature DB >> 29491947 |
Cynthia Riginos1, Eric D Crandall2, Libby Liggins3, Pim Bongaerts4, Eric A Treml5.
Abstract
Population genomic approaches are making rapid inroads in the study of non-model organisms, including marine taxa. To date, these marine studies have predominantly focused on rudimentary metrics describing the spatial and environmental context of their study region (e.g., geographical distance, average sea surface temperature, average salinity). We contend that a more nuanced and considered approach to quantifying seascape dynamics and patterns can strengthen population genomic investigations and help identify spatial, temporal, and environmental factors associated with differing selective regimes or demographic histories. Nevertheless, approaches for quantifying marine landscapes are complicated. Characteristic features of the marine environment, including pelagic living in flowing water (experienced by most marine taxa at some point in their life cycle), require a well-designed spatial-temporal sampling strategy and analysis. Many genetic summary statistics used to describe populations may be inappropriate for marine species with large population sizes, large species ranges, stochastic recruitment, and asymmetrical gene flow. Finally, statistical approaches for testing associations between seascapes and population genomic patterns are still maturing with no single approach able to capture all relevant considerations. None of these issues are completely unique to marine systems and therefore similar issues and solutions will be shared for many organisms regardless of habitat. Here, we outline goals and spatial approaches for landscape genomics with an emphasis on marine systems and review the growing empirical literature on seascape genomics. We review established tools and approaches and highlight promising new strategies to overcome select issues including a strategy to spatially optimize sampling. Despite the many challenges, we argue that marine systems may be especially well suited for identifying candidate genomic regions under environmentally mediated selection and that seascape genomic approaches are especially useful for identifying robust locus-by-environment associations.Entities:
Keywords: adaptation; genetic–environment association; landscape; oceanography; population genomics; remote sensing; seascape genetics.
Year: 2016 PMID: 29491947 PMCID: PMC5804261 DOI: 10.1093/cz/zow067
Source DB: PubMed Journal: Curr Zool ISSN: 1674-5507 Impact factor: 2.624
Figure 1.Key concepts relevant to the properties of spatial and environmental variables used in seascape genomic analyses. These properties should be considered during the project design as they will influence which variables and what representative values of variables may be used. Moreover, these properties will help determine what methods are appropriate for analysis. The figure displays examples of the concepts in geographic space and their manifestations in analytical space. Points in the geographic space depict the location of sampling, and dashed lines represent a transect (Anisotropy, Example 1 only).
Figure 2.Spatial patterns in environmental variables of Atlantic European coastal waters. Eight select coastal seascape variables are shown including mean sea surface temperature (SST), standard deviation of sea surface temperature (sdSST), mean thermal stress frequency (TSF), mean sea surface salinity (SSS), mean net primary productivity (NPP), standard deviation of net primary productivity (sdNPP), bathymetry (BATH), and Pleistocene habitat suitability (PLEIS). In addition, the values for principal components 1 and 2 describing the eight coastal variables are also shown. PC1 and PC2 account for 47.1% and 17.9% of the variance among variables, respectively.
Figure 3.Biplot indicating PCA-based loadings of European seascape variables. PCA results showing environmental variables (vectors) plotted onto PC1 and PC2 from 10,000 randomly selected points in the seascape.
Descriptive statistics for eight select seascape variables for the northeast Atlantic region
| Moran’s | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Layer | Abbrev | Min | Max | Mean | Standard deviation | Units | 25 | 50 | 100 | 200 | 500 | Data source | |
| Mean sea surface temperature | SST | 2.197 | 19.859 | 10.600 | 2.916 | °C | 0.66 | 0.60 | 0.50 | 0.39 | 0.26 | NOAA | |
| Standard deviation of sea surface temperature | sdSST | 1.201 | 8.211 | 3.587 | 1.595 | °C | 0.69 | 0.63 | 0.58 | 0.47 | NOAA | ||
| Mean thermal stress frequency | TSF | 0.00 | 22.00 | 1.10 | 0.84 | freqency | 0.45 | 0.39 | 0.30 | 0.23 | 0.15 | CoRTAD | |
| Mean sea surface salinity | SSS | 2.108 | 36.524 | 29.928 | 10.607 | unitless | 0.64 | 0.57 | 0.51 | 0.37 | World Ocean Atlas 2013 v2 | ||
| Mean net primary productivity | NPP | 478 | 12,788 | 2,063 | 1697 | C m−2 day−1 | 0.62 | 0.52 | 0.44 | 0.35 | Ocean Productivity web | ||
| Standard deviation of net primary productivity | sdNPP | 276 | 15,945 | 3,521 | 3,080 | C m−2 day−2 | 0.67 | 0.59 | 0.52 | 0.39 | Ocean Productivity web | ||
| Bathymetry | BATH | −5,029 | 839 | −266 | 654 | Metres | 0.45 | 0.30 | 0.17 | ETOPO1 | |||
| Habitat exposure during Pleistocene low sea level stands | PLEIS | 0.000 | 1.000 | 0.398 | 0.297 | unitless | 0.35 | Derived from ETOPO1 | |||||
aMoran’s I is a measure of spatial autocorrelation and can range from −1 (complete negative spatial autocorrelation) to +1 (complete positive spatial autocorrelation). Values were estimated from 10,000 random points and values above 0.70 (high spatial autocorrelation) are in bold.
bMean frequency of thermal stress anomalies ≥ 1 °C over the previous 52 weeks.
cSSS: g/kg seawater; PLEIS: proportion.
Methodological approaches from empirical seascape genomic studies: elements of experimental design including spatial parameters, analyses, and genetic loci
| Organism | Location | Spatial parameters | Spatial predictor(s) | Genetic response variable(s) | Analytical approach | Type and numb of loci | Unit genotyped | Number of Pops. | Number of Individuals | Outlier tests | Number of outlier locie | Reference |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Continuous seascape variables with genomic data | ||||||||||||
| cod | North Atlantic—mostly Europe | LAT, LONG, temperature at spawning time, salinity at spawning time | Location-specific values of each of the spatial parameters in turn | Allele frequencies at individual outlier loci | SAM | tSNPs, SNPs (86,12) | Ind | 18 | 708 | BayeScan | 7 | |
| cod | Greenland & Iceland | LAT, LONG, distance to nearest coastline, annual maximum, mean and range for annual bottom spring temperature (maximum, mean, and range), surface spring temperature (maximum, mean, and range), and annual mean bottom salinity; initially 24 variables were examined and then reduced to this subset to reduce correlations among variables | Each of the spatial parameters in turn | Allele frequencies of all loci with covariance conditioned on neutral loci | BayEnv | tSNPs (935) | Ind | 20 | 847 | BayeScan; Hierarchical | 47 | |
| cod | Greenland | SST (mean, maximum and minimum), sea bottom temperature (mean, maximum and minimum), barotropic stream function | Least cost paths based on ocean climate suitability (created using species distribution modeling based on spatial parameters) between sampled population and spawning grounds | Posterior membership to Icelandic group | Linear model | tSNPs (81) | Ind | 6 | 872 | None | NA | |
| cod | Baltic and North Sea | SST, temperature at spawning depth, SSS, salinity at spawning depth, dissolved oxygen (both at surface and spawning depth) | Location-specific values of each of the spatial parameters in turn | Population allele frequencies | BayEnv | SNPs (8809) | Ind | 7 | 194 | BayeScan; Fdist | 326 | |
| anemonefishes | Red Sea + Oman | Three barriers (arising from circulation or upwelling), OWD, environmental variables: day time SST, SSS, chlorophyll, color dissolved organic matter, particulate organic carbon | Combinations of various putative barriers, overwater distance, first PC of combined environmental variables | Distance matrix based on population pairwise | Linear models with model support assessed by AIC | ddRAD (4559) | Ind | 11 | 144 | LFMM | 98 | |
| The first PC of the combined environmental variables | Individual genotypes | Latent factor mixed effects model | ||||||||||
| hake | Europe—mostly Mediterranean | SST, SSS, OWD | Location-specific values of each of the spatial parameters in turn | Allele frequencies at outlier loci with covariance conditioned on neutral loci | BayEnv | tSNPs (381) | Ind | 19 | 850 | BayeScan; Hierarchical | 17 | |
| herring | European North Atlantic | SST, mean spawning period surface temperature, SSS, mean spawning period surface salinity, OWD, LAT, LONG | Location-specific values of each of the spatial parameters in turn (excluding OWD), with a covariance matrix estimated from non-outlier loci | Population allele frequencies | BayEnv | tSNPs (281) | Ind | 18 | 607 | BayEnv; BayeScan; Hierarchical | 10 | |
| Distance matrices based on (1) SST, and (2) SSE with OWD as the additional distance matrix | Distance matrix based on population pairwise | Partial Mantel | ||||||||||
| sole | European North Atlantic | SST, sea bottom temperature, SSS, mixed layer depth, maximum density gradient, chlorophyll concentration, surface productive layer, OWD, LAT, LONG | Environmental parameters conditioned on LAT and LONG and MEMs based on PcoA of distance matrix of shortest OWDs | Genotypes with analyses performed separately for neutral and outliers | partial RDA | tSNPs (476) | Ind | 16 | 650 | Fdist; Bayescan | 13 | Diopere chap 5, 2014 |
| stickleback | Baltic and North Sea | SST, SSS | Location specific values of each of the spatial parameters in turn | Allele frequencies at outlier loci with covariance conditioned on neutral loci | BayEnv | RADseq (9404) | Pools | 10 | 360 | BayeScan; Fst quantile | 94 | ( |
| Continuous seascape variables with targeted microsatellites | ||||||||||||
| herring | Baltic and North Sea | Temperature at spawning and in April, salinity at spawning and in April, BPD, OWD, distance to the mouth of the Baltic, fishing pressure, LAT, LONG | Location-specific values of each temperature and salinity parameters in turn using both BPD and OWD as the additional distance matrices | Distance matrix based on population pairwise | Mantel and partial Mantel | gl Msats (51,17) | Ind | 15 | 694 | BayeScan; Fdist | 1 | |
| Location-specific values of each temperature and salinity parameters along with distance to the mouth of the Baltic, LAT, LONG | By locus allele frequencies (all loci) | MatSAM | ||||||||||
| Combinations of location specific values of each temperature and salinity parameters along with distance to the mouth of the Baltic, fishing pressure | Population heterozygosity and allelic richness partitioned for all loci, non-outliers, and the single outlier locus | GLMs | ||||||||||
| sticklebacks | Baltic and North Sea | SST, SSS, BPD, OWD, LAT, LONG | SST and SSS in turn | Population allele frequencies for outlier loci | SAM | gl Msats (20, 20) | Ind | 38 | 973 | BayeScan; Fdist | 9 | |
| SST and SSS in turn with OWD as the additional distance matrix | Distance matrices based on population pairwise FST for outlier loci (individually and as a group) | Partial Mantel tests | ||||||||||
| Combinations of SST, SSS, LAT, LONG | Population allele frequencies separately for genic and non genic loci | GESTE with Bayesian model selection | ||||||||||
| turbot | Europe (not Med) | Habitat variables: SST, sea bottom temperature, SSS, sea bottom salinity, oxygen concentration, primary production; Hydrodynamic variables: bottom shear stress, depth of pycnocline, density-based stratification index; LAT, LONG | All habitat and hydrodynamic variables, a matrix representing sampling year, and distance-based MEMs using a truncated distance matrix (PCNM approach) | Genotypes, with analyses performed separately for neutral and outliers and for different geographical partitions | RDA and partial RDA with dbMEM/PCNM to describe geographic distance and spatial correlations | gl Msats (12, 6) | Ind | 290 | 999 (466 for some analyses) | BayeScan; Fdist | 3 | |
| Nominal seascape variable(s), with genomic data | ||||||||||||
| anchovy | Europe (Atlantic, Mediterranean) | Open water versus coastal; nested sampling design with 2 habitats in 2 locations | Habitat | Joint allele frequency spectrum between pairs of populations | Custom model | RADseq (5628) | Ind | 4 | 128 | 431 | {LeMoan: 2016be} | |
| litorinid snail | Europe (Sweden, UK, Spain) | habitat: wave versus crab; nested sampling design with 2 habitat sites in 3 locations | Habitat | Proportion of shared outlier loci | Identifying sets of loci that are outliers in multiple habitat comparisons | RNAseq (6790) | pools | 6 | ∼480 | Fst quantile | NA | |
| Habitat | Allele frequencies of SNPs from outlier contigs | Determining whether allele frequency changes between habitats are consistent across locations | ||||||||||
| litorinid snail | Sweden—3 island locations <10 km distant | habitat: wave versus crab; nested sampling design with 2 habitat sites in 3 locations | Habitat | Proportion of shared outlier loci | Identifying sets of loci that are outliers in multiple habitat comparisons | RADseq (4187) | Ind | 6 | 130 | Fdist | 4-636 (depending on comparison) | |
| Habitat | Individual genotype (presence/absence of alleles) | Binomial GLM | ||||||||||
| seagrass | Wadden Sea, North Sea (50 kms) | tidal position: tidal flat (exposed to air twice per day) or tidal creek (permanently submerged) | OWD | Distance matrix based on population pairwise | Mantel | gl Msats (14, 11) | Ind | 6 | 485 | DetSel; Fdist | 3 | |
| OWD and habitat category | Distance matrix based on population pairwise | Partial Mantel | ||||||||||
| Marker type and habitat | Distance matrix based on population pairwise | 2-way ANOVA | ||||||||||
aLAT = latitude; LONG = longitude, SST = mean annual surface temperature; SSS = mean annual surface salinity, OWD = overwater distance, BPD = biophysical distance. Many studies simply report “temperature” or “salinity” in which case we assume that SST and SSS, respectively, are implied.
bBayEnv (Günther and Coop 2013); GESTE (Foll and Gaggiotti 2006); GLM is a general linear model; SAM/ matSAM (Joost et al. 2008).
ctSNPs: Transcriptome-derived SNPs; gl Msats: gene linked microsatellites. In parentheses is the number of loci included in analyses (post filtering); if study includes both gene linked loci and non-gene linked loci then 2 numbers are reported, respectively.
dBayEnv is the method of Coop et al. (2010); BayeScan is the method of Foll and Gaggiotti (2008); DetSel is from the method of Vitalis et al. (2003); Fdist is the method of Beaumont and Nicols (1996) regardless of which program was used for the analysis; LFMM is based on the method of Frichot et al. (2013) ; δαδι is the method of Gutenkunst et al. (2009).
eAs defined by authors.
Seascape properties and sample data sources useful for marine population genomic investigations
| Parameters | Source | URL |
|---|---|---|
| Latitude & Longitude | World Geodetic System 1984 (used for the Global Positioning System, GPS, satellite navigation systems) | |
| Latitude & Longitude | ESRI ArcGIS Basemaps | |
| Shoreline representation | GSHHG—A Global Self-consistent, Hierarchical, High-resolution Geography Database | |
| Bathymetry and topography | ETOPO1 integrated topography and bathymetry product | |
| Seafloor topography | Scripps Institute, UC San Diego | |
| Global distribution of coral reefs | UNEP WCMC | |
| Ocean currents | HYCOM | |
| Ocean temperature | The Group for High Resolution Sea Surface Temperature (GHRSST) | |
| Ocean temperature and derivatives | NASA Earth Observations (NEO) | |
| Ocean temperature | World Ocean Atlas 2013 version 2 | |
| Ocean temperature and derivatives | The Coral Reef Temperature Anomaly Database (CoRTAD) | |
| Sea surface salinity | NASA Earth Observations (NEO) | |
| Sea surface salinity | World Ocean Atlas 2013 version 2 | |
| Surface photosynthetically active radiation (PAR) | NASA MODIS satellite | |
| Chlorophyll a | NASA Earth Observations (NEO) | |
| Future climate scenarios | NCAR’s GIS Program Climate Change Scenarios GIS data portal | |
| Human population density | NASA Socioecomonic Data and Applications Center (SEDAC) |
Note: Depending on a study’s objectives, appropriate measures might include means, minimums, maximums, or variance of some parameters.
Figure 4.Hotspots of spatial gradients in seascape variables. Regions of strong environmental gradients or “hotspots” were identified using a moving window analysis to quantify the level of variation in the surrounding seascape. The hotspot regions could be targeted for genetic sampling to increase the likelihood of quantifying genetic environmental associations. (A) Hotspots for PC1 would be useful for exploratory analyses of many spatial variables simultaneously; gradients for (B) SSS and (C) NPP could be targeted to confirm predicted relationships between environmental variables and select loci.
Covariance matrix for 8 select seascape variables for the Northeast Atlantic region
| Layer | SST | sdSST | TSF | SSS | NPP | sdNPP | BATH | PLEIS |
|---|---|---|---|---|---|---|---|---|
| SST | 0.307 | −0.098 | −0.074 | 0.113 | −0.098 | −0.094 | −0.104 | −0.010 |
| sdSST | −0.098 | 0.188 | 0.044 | −0.139 | 0.136 | 0.133 | 0.052 | −0.036 |
| TSF | −0.074 | 0.044 | 0.191 | −0.058 | 0.050 | 0.050 | 0.025 | 0.000 |
| SSS | 0.113 | −0.139 | −0.058 | 0.175 | −0.137 | −0.139 | −0.035 | 0.012 |
| NPP | −0.098 | 0.136 | 0.050 | −0.137 | 0.182 | 0.170 | 0.047 | −0.044 |
| sdNPP | −0.094 | 0.133 | 0.050 | −0.139 | 0.170 | 0.182 | 0.044 | −0.031 |
| BATH | −0.104 | 0.052 | 0.025 | −0.035 | 0.047 | 0.044 | 0.203 | 0.044 |
| PLEIS | −0.010 | −0.036 | 0.000 | 0.012 | −0.044 | −0.031 | 0.044 | 0.203 |
Correlation matrix for 8 select seascape variables for the Northeast Atlantic region
| Layer | SST | sdSST | TSF | SSS | NPP | sdNPP | BATH | PLEIS |
|---|---|---|---|---|---|---|---|---|
| SST | 1 | −0.41 | −0.31 | 0.49 | −0.41 | −0.40 | −0.42 | −0.04 |
| sdSST | −0.41 | 1 | 0.23 | − | 0.27 | −0.18 | ||
| TSF | −0.31 | 0.23 | 1 | −0.32 | 0.27 | 0.27 | 0.13 | 0.00 |
| SSS | 0.49 | − | −0.32 | 1 | − | − | −0.18 | 0.07 |
| NPP | −0.41 | 0.27 | − | 1 | 0.24 | −0.23 | ||
| sdNPP | −0.40 | 0.27 | − | 1 | 0.23 | −0.16 | ||
| BATH | −0.42 | 0.27 | 0.13 | −0.18 | 0.24 | 0.23 | 1 | 0.22 |
| PLEIS | −0.04 | −0.18 | 0.00 | 0.07 | −0.23 | −0.16 | 0.22 | 1 |