| Literature DB >> 35318661 |
Priscila M Salloum1, Anna W Santure1, Shane D Lavery1,2, Pierre de Villemereuil3.
Abstract
Genetic adaptation to future environmental conditions is crucial to help species persist as the climate changes. Genome scans are powerful tools to understand adaptive landscapes, enabling us to correlate genetic diversity with environmental gradients while disentangling neutral from adaptive variation. However, low gene flow can lead to both local adaptation and highly structured populations, and is a major confounding factor for genome scans, resulting in an inflated number of candidate loci. Here, we compared candidate locus detection in a marine mollusc (Onithochiton neglectus), taking advantage of a natural geographical contrast in the levels of genetic structure between its populations. O. neglectus is endemic to New Zealand and distributed throughout an environmental gradient from the subtropical north to the subantarctic south. Due to a brooding developmental mode, populations tend to be locally isolated. However, adult hitchhiking on rafting kelp increases connectivity among southern populations. We applied two genome scans for outliers (Bayescan and PCAdapt) and two genotype-environment association (GEA) tests (BayeScEnv and RDA). To limit issues with false positives, we combined results using the geometric mean of q-values and performed association tests with random environmental variables. This novel approach is a compromise between stringent and relaxed approaches widely used before, and allowed us to classify candidate loci as low confidence or high confidence. Genome scans for outliers detected a large number of significant outliers in strong and moderately structured populations. No high-confidence GEA loci were detected in the context of strong population structure. However, 86 high-confidence loci were associated predominantly with latitudinally varying abiotic factors in the less structured southern populations. This suggests that the degree of connectivity driven by kelp rafting over the southern scale may be insufficient to counteract local adaptation in this species. Our study supports the expectation that genome scans may be prone to errors in highly structured populations. Nonetheless, it also empirically demonstrates that careful statistical controls enable the identification of candidate loci that invite more detailed investigations. Ultimately, genome scans are valuable tools to help guide further research aiming to determine the potential of non-model species to adapt to future environments.Entities:
Keywords: candidate loci; environmental adaptation; gene flow; genome scans for outliers; genotype-environment associations; outlier loci; population genetic structure; q-values geometric mean
Mesh:
Year: 2022 PMID: 35318661 PMCID: PMC9311215 DOI: 10.1111/1365-2656.13692
Source DB: PubMed Journal: J Anim Ecol ISSN: 0021-8790 Impact factor: 5.606
FIGURE 1Geographical and genetic relationships among sampled populations of Onithochiton neglectus. (a) Geographical location of populations asnd clades. Populations included in the NI dataset are indicated with *, and in the SI dataset with +; (b) relative migration network (estimated using divMigrate). Each circle represents a population (colour‐coded), with the intensity of arrows proportional to migration rate. Rates smaller than 0.05 are not plotted; (c) NZ‐wide PCA for all populations over all loci; (d) NZ‐wide PCA for remaining loci (after removing combined ‘outlier’ markers); (e) NZ‐wide PCA for combined ‘outlier’ markers (combined q‐value between PCAdapt and Bayescan smaller than 0.05). Population key: see Table 1
Sampling size per population and their contributions to the associated New Zealand wide (NZ‐wide), upper North Island (NI), South Island (SI) and south and Subantarctic Islands (southern) datasets
| Population | Population abbreviation | Individuals | Clade | Dataset |
|---|---|---|---|---|
| Russell | RU | 14 | North | NZ‐wide, NI |
| Auckland | TI | 12 | North | NZ‐wide, NI |
| Coromandel | NC | 10 | North | NZ‐wide, NI |
| East Cape | EC | 9 | North | NZ‐wide, NI |
| Wellington | WE | 18 | Central | NZ‐wide |
| Cape Palliser | CP | 10 | Central | NZ‐wide |
| Christchurch | CR | 6 | South | NZ‐wide |
| Dunedin | DU | 20 | South | NZ‐wide, southern, SI |
| Akatore | AK | 9 | South | NZ‐wide, southern, SI |
| Curio Bay | CU | 5 | South | NZ‐wide, southern, SI |
| Stewart Is. | ST | 8 | South | NZ‐wide, southern, SI |
| Auckland Is. | AU | 10 | South | NZ‐wide, southern |
| Campbell Is. | CA | 10 | South | NZ‐wide, southern |
| Antipodes Is. | AN | 7 | South | NZ‐wide, southern |
| Bounty Is. | BO | 10 | South | NZ‐wide, southern |
| Chatham Is. | CH | 9 | South | NZ‐wide, southern |
Population excluded from southern and SI datasets due to high missing data.
Two additional individuals were sampled but did not meet genomic quality control and were excluded.
Overall genetic diversity and population differentiation (among all sampled populations and using all loci in each specific dataset). No. pops, number of populations included in the dataset; no. Inds, number of individuals included in the dataset; ho, observed heterozygosity; he, expected heterozygosity; Fst, F ST based on gene diversity (gene diversity among samples/overall gene diversity); Fis, F IS following Nei, 1987 (= 1 − Ho/Hs)
| Dataset | No. pops | No. Inds | Ho | He | Fst | Fis |
|---|---|---|---|---|---|---|
| NZ‐wide | 16 | 167 | 0.057 | 0.062 | 0.792 | 0.083 |
| NI‐specific | 4 | 45 | 0.198 | 0.206 | 0.308 | 0.041 |
| SI‐specific | 4 | 43 | 0.255 | 0.277 | 0.048 | 0.079 |
| Southern specific | 9 | 88 | 0.218 | 0.232 | 0.165 | 0.060 |
FIGURE 2Proportion of ‘outlier’ markers found with each method in the NZ‐wide, NI‐, SI‐ and southern‐specific datasets. Combination represents the number of ‘outlier’ markers after combining the q‐values of PCAdapt and Bayescan using the geometric mean; intersection represents the number of ‘outlier’ markers found in common between PCAdapt and Bayescan
Number of GEA loci for each spatial scale and statistical treatment. ‘Real variables’ refer to the total number of associations with environmental variables; ‘5% random variables’ is the number of associations with at least 5% of the random variables (note that these loci are not necessarily also associated with environmental variables); ‘combined q‐value’ is the number of significant GEA loci after combining the q‐values of RDA and BayeScEnv using the geometric mean; ‘Final High‐confidence loci’ are GEA loci with combined q‐value <0.05 and not associated with >5% of the random variables. The total number of loci in each dataset is shown within parentheses
| Environmental dataset | Statistical treatment | NZ‐wide (10,987) | NI (12,012) | SI (7,476) | Southern (13,003) |
|---|---|---|---|---|---|
| Real variables | RDA | 0 | 0 | 0 | 708 |
| BayeScEnv | 3,093 | 15 | 10 | 135 | |
| Combined | 0 | 0 | 0 | 171 | |
| 5% random variables | RDA | 0 | 0 | 0 | 693 |
| BayeScEnv | 722 | 9 | 19 | 139 | |
| Combined | 0 | 2 | 0 | 90 | |
|
|
|
|
|
|
|