| Literature DB >> 26757155 |
Peter M Rose1, Mark J Kennard1, David B Moffatt2, Fran Sheldon1, Gavin L Butler3.
Abstract
Species distribution models are widely used for stream bioassessment, estimating changes in habitat suitability and identifying conservation priorities. We tested the accuracy of three modelling strategies (single species ensemble, multi-species response and community classification models) to predict fish assemblages at reference stream segments in coastal subtropical Australia. We aimed to evaluate each modelling strategy for consistency of predictor variable selection; determine which strategy is most suitable for stream bioassessment using fish indicators; and appraise which strategies best match other stream management applications. Five models, one single species ensemble, two multi-species response and two community classification models, were calibrated using fish species presence-absence data from 103 reference sites. Models were evaluated for generality and transferability through space and time using four external reference site datasets. Elevation and catchment slope were consistently identified as key correlates of fish assemblage composition among models. The community classification models had high omission error rates and contributed fewer taxa to the 'expected' component of the taxonomic completeness (O/E50) index than the other strategies. This potentially decreases the model sensitivity for site impact assessment. The ensemble model accurately and precisely modelled O/E50 for the training data, but produced biased predictions for the external datasets. The multi-species response models afforded relatively high accuracy and precision coupled with low bias across external datasets and had lower taxa omission rates than the community classification models. They inherently included rare, but predictable species while excluding species that were poorly modelled among all strategies. We suggest that the multi-species response modelling strategy is most suited to bioassessment using freshwater fish assemblages in our study area. At the species level, the ensemble model exhibited high sensitivity without reductions in specificity, relative to the other models. We suggest that this strategy is well suited to other non-bioassessment stream management applications, e.g., identifying priority areas for species conservation.Entities:
Mesh:
Year: 2016 PMID: 26757155 PMCID: PMC4710458 DOI: 10.1371/journal.pone.0146728
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Reference site locations for each dataset used, and major river systems in the study area.
The river disturbance index (RDI–see [47] for details of its derivation) provides context for the ‘least disturbed’ reference sites; low RDI values indicate low levels of human pressures in the upstream catchment. Note that the season dataset is represented by all ‘training’ sites in the SEQ section of the study area.
Metrics used to evaluate model performance at species and assemblage levels.
| Code | Metric | Description |
|---|---|---|
| AUC | Area under curve of the receiving operator characteristics plot | Ranges from 0.5 to 1; Higher values indicate a better fit |
| Se | Sensitivity | Correct prediction of presence (ranges from 0 to 1; 1 indicates perfect prediction) |
| Sp | Specificity | Correct prediction of absence (ranges from 0 to 1; 1 indicates perfect prediction) |
| K | Cohen's Kappa | Generally ranges from 0 to 1; higher values indicate a better fit |
| CCR | Correct Classification Rate | Proportion of sites correctly classified as either present or absent |
| Mean O/E | Mean of O/E index | Indicates model accuracy |
| Bandwidth | 90th percentile minus 10th percentile of O/E index | Estimate of model precision; used to develop 'bands' of impairment |
| r2 | Pearson r-squared regression coefficient of O vs E | Model goodness of fit |
| SD | Standard deviation of O/E index | Estimate of model precision |
| Slope | Slope of the linear regression of O vs E | Indicative of model bias (e.g. slope <1 indicates underestimation of richness at diverse sites) |
| Intercept | Y-intercept of linear regression of O vs E | Indicative of model bias |
| BC | Bray-Curtis dissimilarity index | Dissimilarity between forecast and observed community (i.e. lower is better) |
| SD BC | Standard deviation of the BC index | Estimate of BC model precision |
Ranking of GIS-based predictor variable importance for predicting fish assemblage composition for each modelling strategy (1 indicates the most important variable).
| Modelling strategy | ||||||
|---|---|---|---|---|---|---|
| Predictor | ENS | DFA | RF | MARS | MANN | Average |
| Mean segment elevation | 1 | 6 | 1 | 1 | 1 | 2.0 |
| Catchment average slope | 3 | 3 | 3 | 2 | 6 | 3.4 |
| Distance to outlet (the sea) | 6 | 7 | 2 | - | 3 | 4.5 |
| Mean annual runoff | 2 | 5 | 9 | - | 2 | 4.5 |
| Catchment shape (elongation ratio) | 5 | 4 | 7 | - | 4 | 5.0 |
| Maximum upstream elevation | 4 | - | 4 | - | 12 | 6.7 |
| Stream and sub-catchment average annual rainfall | 7 | - | 8 | - | 5 | 6.7 |
| Stream and sub-catchment hottest month mean temperature | 12 | 1 | 6 | - | 9 | 7.0 |
| Average slope of downstream flow path | 9 | - | 10 | - | 7 | 8.7 |
| Catchment relief ratio | 8 | - | 5 | - | 13 | 8.7 |
| Catchment percentage unconsolidated rocks | 16 | 2 | 14 | - | 10 | 10.5 |
| Catchment percentage igneous rocks | 10 | - | 13 | - | 14 | 12.3 |
| Modelled annual terrestrial mean net primary productivity | 17 | - | 12 | - | 8 | 12.3 |
| Coefficient of variation of monthly totals of accumulated soil water surplus | 11 | - | 11 | - | 17 | 13.0 |
| Stream and valley percentage siliciclastic/undifferentiated sedimentary rocks | 13 | - | 15 | - | 11 | 13.0 |
| Stream and valley percentage unconsolidated rocks | 15 | - | 16 | - | 15 | 15.3 |
| Catchment percentage metamorphic rocks | 14 | - | 17 | - | 19 | 16.7 |
| Stream and valley percentage metamorphic rocks | 18 | - | 19 | - | 18 | 18.3 |
| Stream and valley percentage mixed sedimentary and igneous rocks | 20 | - | 20 | - | 16 | 18.7 |
| Catchment percentage mixed sedimentary and igneous rocks | 19 | - | 18 | - | 20 | 19.0 |
See S2 Table for predictor variable descriptions. ENS–Single species ensemble model; DFA–RIVPACS community model using a discriminant function classifier; RF–RIVPACS model using a random forest classifier; MARS–Multi-species response multivariate adaptive regression splines model; MANN–Multi-species response artificial neural network model.
AUC averaged among species using (i) all predictor variables and (ii) GIS-based predictors alone, for three datasets.
| Model | Variables | Training (n = 103) | Space (n = 25) | Season (n = 79) | Average |
|---|---|---|---|---|---|
| ENS | All variables | 0.99 | 0.81 | 0.86 | 0.89 |
| GIS only | 0.99 | 0.82 | 0.85 | 0.89 | |
| DFA | All variables | 0.82 | 0.77 | 0.80 | 0.80 |
| GIS only | 0.81 | 0.76 | 0.79 | 0.79 | |
| RF | All variables | 0.86 | 0.81 | 0.78 | 0.82 |
| GIS only | 0.86 | 0.80 | 0.79 | 0.82 | |
| MARS | All variables | 0.85 | 0.84 | 0.83 | 0.84 |
| GIS only | 0.85 | 0.78 | 0.83 | 0.82 | |
| MANN | All variables | 0.86 | 0.77 | 0.74 | 0.79 |
| GIS only | 0.86 | 0.76 | 0.80 | 0.81 |
No significant differences (at p<0.05) were detected between (i) and (ii) for any model/dataset combination, assessed by one-way ANOVAs. ENS–Single species ensemble model; DFA–RIVPACS community model using a discriminant function classifier; RF–RIVPACS model using a random forest classifier; MARS–Multi-species response multivariate adaptive regression splines model; MANN–Multi-species response artificial neural network model.
Assemblage level model evaluation metrics typically used to assess model quality for stream bioassessment for each model and dataset.
| Dataset | Model | Mean O/E50 | SD O/E50 | Band width O/E50 | r2 O/E50 | Slope O/E50 | Intercept O/E50 | Mean BC | SD BC |
|---|---|---|---|---|---|---|---|---|---|
| Training | ENS | 1.08 | 1.11 | -0.14 | |||||
| (n = 103) | DFA | 1.03 | 0.29 | 0.71 | 0.61 | 1.06 | -0.09 | 0.42 | 0.10 |
| RF | 1.05 | 0.32 | 0.61 | 0.74 | 1.15 | -0.35 | 0.42 | 0.10 | |
| MARS | 0.25 | 0.57 | 0.73 | 0.98 | 0.38 | 0.10 | |||
| MANN | 1.06 | 0.24 | 0.48 | 0.87 | 0.17 | 0.38 | 0.13 | ||
| Space | ENS | 0.91 | 0.33 | 0.72 | 0.51 | 0.64 | 0.85 | 0.15 | |
| (n = 25) | DFA | 0.29 | 0.56 | 0.78 | 0.18 | 0.46 | 0.13 | ||
| RF | 1.02 | 0.30 | 1.13 | -0.21 | 0.47 | ||||
| MARS | 0.92 | 0.51 | 0.77 | 0.90 | 0.42 | 0.12 | |||
| MANN | 0.92 | 0.31 | 0.62 | 0.60 | 0.72 | 0.52 | 0.45 | 0.15 | |
| Season | ENS | 0.91 | 0.89 | 0.06 | 0.52 | ||||
| (n = 79) | DFA | 1.01 | 0.27 | 0.66 | 0.60 | 1.09 | -0.30 | 0.52 | 0.09 |
| RF | 1.03 | 0.29 | 0.62 | 0.68 | 1.26 | -0.81 | 0.52 | 0.09 | |
| MARS | 0.97 | 0.25 | 0.61 | 0.69 | 0.09 | ||||
| MANN | 0.26 | 0.65 | 0.75 | 0.96 | 0.13 | 0.52 | 0.10 | ||
| Time | ENS | 0.83 | 0.71 | 0.13 | |||||
| (n = 23; 331 samples) | DFA | 1.18 | 0.34 | 1.06 | 0.42 | 0.78 | 1.28 | 0.44 | |
| RF | 1.16 | 0.38 | 0.95 | 0.43 | 0.85 | 0.44 | 0.11 | ||
| MARS | 0.34 | 0.86 | 0.40 | 0.66 | 1.25 | 0.42 | 0.11 | ||
| MANN | 1.04 | 0.30 | 0.71 | 0.52 | 0.84 | 0.78 | 0.42 | 0.11 | |
| Method | ENS | 0.76 | 0.50 | 0.66 | 0.51 | 0.12 | |||
| (n = 33) | DFA | 0.97 | 0.30 | 0.58 | 0.42 | 0.78 | 0.62 | 0.45 | 0.11 |
| RF | 0.29 | 0.67 | 0.49 | 0.55 | 0.49 | ||||
| MARS | 0.93 | 0.27 | 0.60 | 0.43 | 0.77 | 0.57 | 0.43 | 0.11 | |
| MANN | 0.86 | 0.28 | 0.57 | 0.71 | 0.47 | 0.11 |
Bold and italicised text indicates the ‘best’ metric value for each dataset/model combination. See Table 1 for metric codes and descriptions. ENS–Single species ensemble model; DFA–RIVPACS community model using a discriminant function classifier; RF–RIVPACS model using a random forest classifier; MARS–Multi-species response multivariate adaptive regression splines model; MANN–Multi-species response artificial neural network model.
Fig 2Projected species distributions (at a cut-off threshold of 0.5) for (a) Hypseleotris klunzingeri and (b) Melanotaenia duboulayi.
Green stream segments are predicted presences; grey segments are predicted absences. The circles are sites that were sampled in autumn/winter 2013 (i.e. the training and space datasets; n = 128). Red circles are observed presences, open circles are observed absences. ENS–Single species ensemble model; DFA–RIVPACS community model using a discriminant function classifier; RF–RIVPACS model using a random forest classifier; MANN–Multi-species response artificial neural network model; MARS–Multi-species response multivariate adaptive regression splines model.
AUC for each species, averaged among datasets.
Asterisks denotes rare species (defined as <10% prevalence in the training dataset).
| Species | Observed prevalence | ENS | DFA | RF | MARS | MANN | Average |
|---|---|---|---|---|---|---|---|
| 9% | 0.89 | 0.75 | 0.75 | 0.84 | 0.85 | 0.82 | |
| 15% | 0.78 | 0.66 | 0.72 | 0.63 | 0.61 | 0.68 | |
| 95% | 0.83 | 0.84 | 0.86 | 0.77 | 0.79 | 0.82 | |
| 13% | 0.80 | 0.81 | 0.85 | 0.83 | 0.78 | 0.81 | |
| 7% | 0.95 | 0.79 | 0.78 | 0.85 | 0.93 | 0.86 | |
| 6% | 0.95 | 0.82 | 0.90 | 0.95 | 0.96 | 0.91 | |
| 43% | 0.97 | 0.92 | 0.93 | 0.93 | 0.91 | 0.93 | |
| 29% | 0.83 | 0.71 | 0.76 | 0.82 | 0.77 | 0.78 | |
| 35% | 0.94 | 0.95 | 0.95 | 0.95 | 0.94 | 0.95 | |
| 54% | 0.84 | 0.75 | 0.80 | 0.79 | 0.76 | 0.79 | |
| 18% | 0.90 | 0.77 | 0.71 | 0.73 | 0.79 | 0.78 | |
| 8% | 0.73 | 0.69 | 0.73 | 0.69 | 0.73 | 0.71 | |
| 67% | 0.78 | 0.73 | 0.77 | 0.74 | 0.76 | 0.76 | |
| 12% | 0.95 | 0.60 | 0.64 | 0.79 | 0.75 | 0.74 | |
| 9% | 0.83 | 0.85 | 0.75 | 0.83 | 0.74 | 0.80 | |
| 11% | 0.92 | 0.85 | 0.77 | 0.81 | 0.70 | 0.81 | |
| 3% | 0.96 | 0.93 | 0.95 | 0.95 | 0.95 | 0.95 | |
| 18% | 0.77 | 0.68 | 0.75 | 0.69 | 0.69 | 0.72 | |
| 26% | 0.83 | 0.74 | 0.75 | 0.70 | 0.79 | 0.76 | |
| 17% | 0.81 | 0.71 | 0.79 | 0.78 | 0.72 | 0.76 | |
| 14% | 0.83 | 0.69 | 0.70 | 0.75 | 0.71 | 0.74 | |
| 68% | 0.90 | 0.73 | 0.84 | 0.83 | 0.86 | 0.83 | |
| 15% | 0.98 | 0.93 | 0.94 | 0.90 | 0.97 | 0.94 | |
| 55% | 0.83 | 0.73 | 0.75 | 0.77 | 0.73 | 0.76 | |
| 5% | 0.76 | 0.79 | 0.72 | 0.81 | 0.72 | 0.76 | |
| Number of ‘good’ predictions (AUC>0.8) | 19 | 9 | 8 | 13 | 8 | 11 |
ENS–Single species ensemble model; DFA–RIVPACS community model using a discriminant function classifier; RF–RIVPACS model using a random forest classifier; MARS–Multi-species response multivariate adaptive regression splines model; MANN–Multi-species response artificial neural network model.
Mean species level evaluation metrics for each model and dataset.
| Dataset (no. sites) | Model | AUC | K | Se | Sp | CCR |
|---|---|---|---|---|---|---|
| Training (n = 103) | ENS | |||||
| DFA | 0.81B | 0.16B | 0.28B | 0.87 | 0.86B | |
| RF | 0.86B | 0.17B | 0.28B | 0.88 | 0.87B | |
| MARS | 0.85B | 0.26B | 0.37B | 0.87 | 0.87B | |
| MANN | 0.86B | 0.37B | 0.43B | 0.91 | 0.89B | |
| Space (n = 25) | ENS | 0.87 | ||||
| DFA | 0.76 | 0.14 | 0.29 | 0.86 | 0.87 | |
| RF | 0.80 | 0.14 | 0.29 | 0.86 | 0.87 | |
| MARS | 0.78 | 0.18 | 0.31 | 0.87 | ||
| MANN | 0.76 | 0.20 | 0.33 | 0.88 | 0.87 | |
| Season (n = 79) | ENS | 0.88 | ||||
| DFA | 0.79 | 0.14B | 0.27B | 0.86 | 0.86 | |
| RF | 0.79 | 0.14B | 0.27B | 0.87 | 0.86 | |
| MARS | 0.83 | 0.24B | 0.36AB | 0.86 | 0.87 | |
| MANN | 0.80 | 0.25B | 0.35AB | 0.87 | ||
| Time (n = 23; 331 samples) | ENS | 0.81 | 0.83 | |||
| DFA | 0.73 | 0.12B | 0.28B | 0.85 | 0.86 | |
| RF | 0.74 | 0.13B | 0.29B | |||
| MARS | 0.77 | 0.13B | 0.33AB | 0.82 | 0.83 | |
| MANN | 0.78 | 0.19AB | 0.37AB | 0.84 | 0.85 | |
| Method (n = 33) | ENS | 0.84 | 0.84 | |||
| DFA | 0.79 | 0.18 | 0.33 | 0.87 | 0.86 | |
| RF | 0.77 | 0.15 | 0.31 | 0.86 | 0.86 | |
| MARS | 0.78 | 0.20 | 0.36 | |||
| MANN | 0.74 | 0.17 | 0.37 | 0.84 | 0.84 | |
| Average | ENS | |||||
| DFA | 0.78 | 0.15 | 0.29 | 0.86 | 0.86 | |
| RF | 0.79 | 0.15 | 0.29 | 0.86 | 0.86 | |
| MARS | 0.80 | 0.20 | 0.35 | 0.86 | 0.86 | |
| MANN | 0.79 | 0.24 | 0.37 | 0.87 | 0.87 |
Significant differences in mean evaluation metrics among models are denoted by different letters (assessed by one-way ANOVAs and Tukey’s HSD tests). Bold values indicate the highest mean evaluation metric for each model/dataset. See Table 2 for metric codes and descriptions. ENS–Single species ensemble model; DFA–RIVPACS community model using a discriminant function classifier; RF–RIVPACS model using a random forest classifier; MARS–Multi-species response multivariate adaptive regression splines model; MANN–Multi-species response artificial neural network model.
Desirable model evaluation properties for several common stream bioassessment applications of SDMs.
| Model Application | Desirable evaluation properties | Notes/references |
|---|---|---|
| Bioassessment (reporting at the catchment scale) | Balanced Sp and Se, mean O/E close to unity, low BC, High r2, low SD, low bias, low omission rate for assemblages with few taxa. | Balance between type I error (incorrectly diagnosing an impaired stream as ‘reference’) and type II error (incorrectly diagnosing a reference stream as impaired). Low availability of modelled taxa can lead to coarse estimates of ecological condition [ |
| Bioassessment (regulatory/compliance, at the site scale) | High Sp, mean O/E close to unity, low BC, High r2, low SD, low bias. | Certainty of species loss is required to be confident that acceptable limits have been breached |
| Biodiversity mapping | Balanced Sp and Se, high r2, low SD, low bias, O/E0 close to unity | Requires a good regression fit of O vs. E |
| Species conservation and reserve design | High Sp | High commission errors may lead to protection of habitats where target species may not actually inhabit (leading to potentially wasted limited resources) (e.g. [ |
| Population discovery and range extension (survey gap analysis) | High Se | Commission errors are acceptable (e.g. accurate model, incomplete data such as species difficult to sample efficiently) |
| Climate change | Balanced Sp and Se, or high Se | [ |
| Restocking and translocation suitability; habitat restoration | High Se | Focus on low omission error because species absences may be due to impacts and local extinctions (usually the impetus for restocking/restoration) |
| Predicting site susceptibility to invasive species | High Se | Omission errors are less acceptable because of the costs associated with incorrectly identifying unsuitable habitat. |
Notes are sourced from [2], [16], and [83]. Refer to Table 1 for a description of the model evaluation property acronyms.