| Literature DB >> 36009846 |
David Dolci1,2, Lorenzo Peruzzi1,2,3.
Abstract
Correlative ecological niche modelling (ENM) is a method widely used to study the geographic distribution of species. In recent decades, it has become a leading approach for evaluating the most likely impacts of changing climate. When used to predict future distributions, ENM applications involve transferring models calibrated with modern environmental data to future conditions, usually derived from Global Climate Models (GCMs). The number of algorithms and software packages available to estimate distributions is quite high. To experimentally assess the effectiveness of correlative ENM temporal projection, we evaluated the transferability of models produced using 12 different algorithms on historical and modern data. In particular, we compared predictions generated using historical data and projected to the modern climate (simulating a "future" condition) with predictions generated using modern distribution and climate data. The models produced with the 12 ENM algorithms were evaluated in geographic (range size and coherence of predictions) and environmental space (Schoener's D index). None of the algorithms shows an overall superior capability to correctly predict future distributions. On the contrary, a few algorithms revealed an inadequate predictive ability. Finally, we provide hints that can be used as guideline to plan further studies based on the adopted general workflow, useful for all studies involving future projections.Entities:
Keywords: ENM; SDM; algorithms; distribution; endemic species; plant species; projection
Year: 2022 PMID: 36009846 PMCID: PMC9405103 DOI: 10.3390/biology11081219
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Previous studies focused on analysis of ENM transferability.
| Reference | Algorithms | Species/Data | Transferability Test |
|---|---|---|---|
| Randin et al. [ | GLM, GAM | 54 species with more than 30 occurrences from vegetation plots | Evaluation metrics; Kulczynski’s coefficient |
| Wenger and Olden [ | GLMM, ANN, R | Evaluation metrics combined with resampling methods | |
| Roberts and Hamann [ | RF | Modern ecosystem types | Validation based on palaeoecological records |
| Veloz et al. [ | BRT, MARS, MARS-COM, GAM, GLM | Fossil-pollen data | Tests of niche equivalency (D) and niche similarity (I) |
| Duque-Lazo et al. [ | ANN, BRT, CART, FDA, GAM, GLM, MaxEnt, MARS, RF, SRE | Presence-absence data for | Evaluation metrics; transferability index |
| Qiao et al. [ | BIOCLIM, ENFA, CONVEXHULL, MVE, GLM, GAM, BRT, GARP, Maxent, KDE, MA | 16 virtual species distributed across mainland Eurasia | Sensitivity, specificity and TSS plus volume ratio of estimated niches |
Figure 1Schematization of the workflow used to generate the two sets of environmental layers. Climate time series from CHELSAcruts were used to generate 35 layers. Then, two sets of variables were organized for two groups of experiments carried out in parallel. The first set was composed of only 3 variables (annual mean temperature, annual precipitation and annual potential evapotranspiration), and the second set was composed of all 35 variables and then converted in 3 summary layers by applying a PCA ordination. Each of the two sets of variables was tailored to fit the known distribution of each species. The procedure was applied to both historical and a modern set of environmental layers.
Figure 2General scheme of the adopted modelling workflow.
Summary of the different procedures used in the present work. Commonly adopted parameters were selected in all procedures.
| Procedure | Type and Algorithm | Data Input | Package (Version) | Background/Pseudoabsence Cells |
|---|---|---|---|---|
| BIOCLIM | Single (BIOCLIM) | Presence only | Dismo (1.1-4) | 1000 background cells |
| Domain | Single (Domain) | Presence only | Dismo (1.1-4) | 1000 background cells |
| GLM | Single (GLM) | Presence-pseudoabsence | SSDM (0.2.8) | 1000 pseudoabsence cells |
| GAM | Single (GAM) | Presence-pseudoabsence | SSDM (0.2.8) | 1000 pseudoabsence cells |
| MARS | Single (MARS) | Presence-pseudoabsence | SSDM (0.2.8) | 1000 pseudoabsence cells |
| FDA | Single (FDA) | Presence-pseudoabsence | sdm (1.0-89) | Pseudoabsences = presence cells |
| CTA | Single (CTA) | Presence-pseudoabsence | sdm (1.0-89) | Pseudoabsences = presence cells |
| RF | Single (RF) | Presence-pseudoabsence | SSDM (0.2.8) | Pseudoabsences = presence cells |
| SVM | Single (SVM) | Presence-pseudoabsence | SSDM (0.2.8) | Pseudoabsences = presence cells |
| Maxent | Single (Maxent) | Presence background | Dismo (1.1-4) | 10,000 background cells |
| Biomod2 | Ensemble (GLM, GBM, GAM, MARS, Maxent, RF, CTA, ANN, and FDA) | Presence-pseudoabsence | biomod2 (3.4.11) | Pseudoabsences = presence cells × 10 |
| KDE | Single (KDE) | Presence only | Hypervolume (2.0.12) | 1000 background cells |
Figure 3General scheme of the comparison of potential distributions.
Results of pairwise Mann-Whitney tests for the values of distance (three variables). A total of 17 cases of statistically significant differences were highlighted. × = statistically significant difference.
| Bioclim | Biomod2 | CTA | Domain | FDA | GAM | GLM | KDE | MARS | Maxent | RF | SVM | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||||
|
| - | |||||||||||
|
| - | - | ||||||||||
|
| - | - | - | |||||||||
|
| - | - | - | - | ||||||||
|
| - | - | - | - | × | |||||||
|
| - | - | - | - | - | × | ||||||
|
| - | - | - | - | × | - | × | |||||
|
| × | × | × | × | × | × | × | - | ||||
|
| - | - | - | - | - | × | - | × | × | |||
|
| - | - | - | - | - | - | - | - | × | - | ||
|
| - | - | - | - | - | - | - | × | × | - | - |
Figure 4Distribution of the single results for each of the 25 species and procedure (3 variables) (A). Mean values for each procedure (B).
Results of pairwise Mann-Whitney tests for the summary values of distance (first three axes of PCA [35 variables]). A total of 10 cases of statistically significant differences were highlighted. × = statistically significant difference.
| Bioclim | Biomod2 | CTA | Domain | FDA | GAM | GLM | KDE | MARS | Maxent | RF | SVM | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||||
|
| - | |||||||||||
|
| - | - | ||||||||||
|
| - | - | - | |||||||||
|
| - | - | - | - | ||||||||
|
| - | - | - | - | - | |||||||
|
| - | - | - | - | - | - | ||||||
|
| - | - | - | - | × | - | - | |||||
|
| × | × | × | × | × | × | × | - | ||||
|
| - | - | - | - | - | - | - | - | × | |||
|
| - | - | - | - | - | - | - | - | × | - | ||
|
| - | - | - | - | - | - | - | - | - | - | - |
Figure 5Distribution of the single results for each of the 25 species and each procedure (first three axes of PCA [35 variables]) (A). Mean values for each procedure (B).