| Literature DB >> 28168023 |
Imelda Somodi1, Nikolett Lepesi2, Zoltán Botta-Dukát1.
Abstract
It has long been a concern that performance measures of species distribution models react to attributes of the modeled entity arising from the input data structure rather than to model performance. Thus, the study of Allouche et al. (Journal of Applied Ecology, 43, 1223, 2006) identifying the true skill statistics (TSS) as being independent of prevalence had a great impact. However, empirical experience questioned the validity of the statement. We searched for technical reasons behind these observations. We explored possible sources of prevalence dependence in TSS including sampling constraints and species characteristics, which influence the calculation of TSS. We also examined whether the widespread solution of using the maximum of TSS for comparison among species introduces a prevalence effect. We found that the design of Allouche et al. (Journal of Applied Ecology, 43, 1223, 2006) was flawed, but TSS is indeed independent of prevalence if model predictions are binary and under the strict set of assumptions methodological studies usually apply. However, if we take realistic sources of prevalence dependence, effects appear even in binary calculations. Furthermore, in the widespread approach of using maximum TSS for continuous predictions, the use of the maximum alone induces prevalence dependence for small, but realistic samples. Thus, prevalence differences need to be taken into account when model comparisons are carried out based on discrimination capacity. The sources we identified can serve as a checklist to safely control comparisons, so that true discrimination capacity is compared as opposed to artefacts arising from data structure, species characteristics, or the calculation of the comparison measure (here TSS).Entities:
Keywords: Cohen's kappa; model performance; predictive models; sample size; species distribution models
Year: 2017 PMID: 28168023 PMCID: PMC5288248 DOI: 10.1002/ece3.2654
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Confusion matrix of matches and mismatches of predictions and observations
| Observation | Prediction | ||
|---|---|---|---|
| 1 | 0 | Σ | |
| 1 | True positives (TP) | False negatives (FN) | No. positive observations ( |
| 0 | False positives (FP) | True negatives (TN) |
|
| Σ | Number of positive predictions ( |
| Total number of observations ( |
Figure 1Subcases of beta distribution with parameters defined in Table 6. The sampling of probability values for presence “observations” is carried out according to these curves in our simulations. The individual predicted probability values appear in our simulated predictions with such densities. Lines represent: a) quadratic, b) linear, c) square root, d) 1/16th power curve
Is there prevalence dependence in TSS? Answers for cases examined in our study
| Species occupy suitable sites only, and model goodness changes. | Species occupy unsuitable sites also, and model goodness is fixed (for our analysis). Binary predictions considered only. Source of species' distribution difference: | |||
|---|---|---|---|---|
| Binary predictions | Continuous predictions | Missed presence | Fallacious absence | Fallacious presence |
| No | Yes for small sample size, No for large sample size | Yes | Yes | Yes, except if the rate of fallacious presences is proportional to the number of unsuitable sites |
Confusion matrix of matches and mismatches of predictions and observations assuming different rates of false‐negative and false‐positive errors, e 1 and e 2
| Observation | Prediction | ||
|---|---|---|---|
| 1 | 0 | Σ | |
| 1 | TP = (1 − | FN = |
|
| 0 | FP = | TN = (1 − |
|
| Σ |
|
|
|
Contingency table when the model is assumed to be perfect, but there are missed presences in the observations. “e” denotes the rate of missed presences
| Observation | Prediction | ||
|---|---|---|---|
| 1 | 0 | Σ | |
| 1 | TP = (1 − | FN = 0 |
|
| 0 | FP = | TN = ( |
|
| Σ |
|
|
|
Contingency table when the model is assumed to be perfect, but there are fallacious absences in the observations
| Observation | Prediction | ||
|---|---|---|---|
| 1 | 0 | Σ | |
| 1 | TP = | FN = 0 |
|
| 0 | FP = | TN = ( |
|
| Σ |
|
|
|
Contingency table when the model is assumed to be perfect, but there are fallacious presences and their amount is proportional to the number of unsuitable sites in the observations
| Observation | Prediction | ||
|---|---|---|---|
| 1 | 0 | Σ | |
| 1 | TP = | FN = |
|
| 0 | FP = 0 | TN = |
|
| Σ |
|
|
|
The f 0 and f 1 functions used in our simulations are specific cases of the beta distribution if α = 1 or β = 1. The table shows the corresponding other parameter of the beta distribution producing the probability function of selecting a certain probability value for presence observations. Selections for absence observations follow the opposite trend. The rbeta function in R generates random numbers with such distributions (Appendix S1)
| Curve type |
|
|
|---|---|---|
| Quadratic | α = 3, β = 1 | β = 3, α = 1 |
| Linear | α = 2, β = 1 | β = 2, α = 1 |
| Square root | α = 1.5, β = 1 | β = 1.5, α = 1 |
| 16th root | α = 17/16, β = 1 | β = 17/16, α = 1 |
Figure 2Demonstration of the dependence of the maximum value of TSS on prevalence. The ratio of presences and absences in the observations (prevalence) was varied from 0.05 to 0.95 in increments of 0.05. Average maximum values from 1,000 simulations are shown for four model scenarios (a)–(d). For details, see Fig 1
Figure 3Decrease of prevalence effect on maximum TSS with increasing sample size. Sample size equals to (a) 100, (b) 1,000, (c) 10,000. The pattern is similar for all model goodness scenarios; here, sampling according to the square root function (medium model quality) is used as an example