| Literature DB >> 30755826 |
Marlon E Cobos1, A Townsend Peterson1, Narayani Barve1,2, Luis Osorio-Olvera1,3,4.
Abstract
BACKGROUND: Ecological niche modeling is a set of analytical tools with applications in diverse disciplines, yet creating these models rigorously is now a challenging task. The calibration phase of these models is critical, but despite recent attempts at providing tools for performing this step, adequate detail is still missing. Here, we present the kuenm R package, a new set of tools for performing detailed development of ecological niche models using the platform Maxent in a reproducible way.Entities:
Keywords: Extrapolation risks; Model calibration; Model projections; Model selection; Species distribution models
Year: 2019 PMID: 30755826 PMCID: PMC6368831 DOI: 10.7717/peerj.6281
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Schematic description of the ecological niche modeling process, and steps that can be performed using the kuenm package.
Color bars under each step of the ecological niche modeling process reflect an approximate range of times that may be needed for execution.
Description of the main functions implemented in the kuenm R package.
Additional details can be found in the main text of this manuscript and the package tutorial.
| Functions | Description |
|---|---|
| kuenm_start | Generates an R Markdown file that serves as a guide to perform the main processes implemented in kuenm. This file contains a brief description of each process and chunks of code that will help beginner users in performing each of the analyses. This file can be saved in distinct formats (e.g., HTML, DOCX, and PDF) to record all the code to be used and other user comments, making the research more sharable and reproducible. |
| kuenm_cal | Creates Maxent candidate models. These models are created with multiple combinations of regularization multipliers, feature classes, and sets of environmental predictors. For each combination, it creates one Maxent model with the full set of occurrences, and another with training occurrence data only. Inputs are names of files and folders present in the working directory. Outputs include a folder containing all of the models and a file with Java codes for running candidate models (batch in Windows or bash in Unix), these files are written in the working directory and not stored in memory to avoid RAM limitations. |
| kuenm_ceval | Completes the process of calibration by evaluating candidate model performance and selecting the best ones, based on significance (partial ROC; |
| kuenm_mod | Takes the result of model evaluation and creates final models with the parameter sets selected as best. Model projections are allowed, and are called by defining the folder in which subdirectories with transfer environmental data are located; these transfers are performed automatically. Inputs are names of files and folders present in working directory. Three options of extrapolation are facilitated using this function when transfers are performed (free extrapolation, extrapolation and clamping, and no extrapolation; see |
| kuenm_feval | Evaluates final models based on partial ROC statistics and omission rates as assessed with independent occurrence data. Models created with the best parameter settings can be evaluated if independent data are available, to assess and evaluate their quality. Inputs are names of files and folders in the working directory; the output of this evaluation (a table with the results) is written directly to the directory. |
| kuenm_mmop | Calculates the mobility-oriented parity (MOP; |
| kuenm_omrat | Calculates omission rates of single models based on single or multiple threshold values ( |
| kuenm_proc | Calculates statistical significance of single models based on the partial ROC and a threshold value ( |
| kuenm_mop | Calculates the MOP metric for comparisons of environmental conditions between a calibration area and a single area or scenario to which models will be transferred. Inputs and outputs are objects stored in memory; output includes a map resulting from this analysis. |
Figure 2Directory structure and data for starting (A) and when finished (B) using kuenm R package functions.
Roman numerals represent data needed and generated by the package: using the start function (I), creating candidate models (II), evaluating candidate models (III), preparing projection layers (IV), generating final models and its transfers (V), evaluating final models with independent data (VI), and analyzing extrapolation risks in projection areas or scenarios (VII).
Model performance under optimal parameters (*) and default parameters (−), regarding regularization multiplier (RM), feature classes (FC), and sets of predictors (Pred. Sets), for the models of the example species.
Delta AICc of models with default settings are relative to the selected models. Bold numbers indicate final models that met the statistical significance and omission rate criteria during evaluation with independent data.
| RM | FC | Pred. Sets | partial ROC | Omission rate 5% | AICc | Delta AICc | Weight AICc | Number of parameters |
|---|---|---|---|---|---|---|---|---|
| Tick | ||||||||
| *0.10 | lqp | Set 3 | 3346.46 | 0.00 | 0.95 | 14.00 | ||
| −1.00 | lqph | Set 1 | 0.08 | 3385.65 | 39.19 | 0.00 | 41.00 | |
| −1.00 | lqph | Set 2 | 0.08 | 3358.27 | 11.81 | 0.00 | 29.00 | |
| −1.00 | lqph | Set 3 | 0.09 | 3348.13 | 1.67 | 0.00 | 22.00 | |
| Toad | ||||||||
| *0.70 | p | Set 3 | 1508.23 | 0.00 | 0.34 | 3.00 | ||
| *0.10 | pq | Set 3 | 1508.39 | 0.16 | 0.98 | 9.00 | ||
| *3.00 | lqt | Set 3 | 0.00 | 1509.89 | 1.66 | 0.11 | 3.00 | |
| *4.00 | lh | Set 3 | 1510.08 | 1.86 | 0.08 | 3.00 | ||
| −1.00 | lqh | Set 1 | 0.29 | 0.25 | 1531.90 | 23.67 | 0.00 | 17.00 |
| −1.00 | lqh | Set 2 | 0.29 | 0.25 | 1524.25 | 16.03 | 0.00 | 14.00 |
| −1.00 | lqh | Set 3 | 0.16 | 0.19 | 1530.01 | 21.78 | 0.00 | 14.00 |
Figure 3Omission rates and AICc values for all, non-significant, and selected “best” candidate models for the tick (A) and the toad (B).
Models were selected based on statistical significance, omission rates, and AICc criteria.
Figure 4Geographic summary of the results of the analyses performed for the two example species.
(A–B) Logistic output of the final models that met the selection criteria, transferred to projection areas in current and (C–F) future periods (models were produced allowing extrapolation and clamping). (G–J) Extrapolation risk in future projections (MOP results).