| Literature DB >> 32699354 |
Chongliang Zhang1, Yong Chen2, Binduo Xu1, Ying Xue1, Yiping Ren3,4,5.
Abstract
Species distribution models (SDMs) have been increasingly used to predict the geographic distribution of a wide range of organisms; however, relatively fewer research efforts have concentrated on rare species despite their critical roles in biological conservation. The present study tested whether community data may improve modelling rare species by sharing information among common and rare ones. We chose six SDMs that treat community data in different ways, including two traditional single-species models (random forest and artificial neural network) and four joint species distribution models that incorporate species associations implicitly (multivariate random forest and multi-response artificial neural network) or explicitly (hierarchical modelling of species communities and generalized joint attribute model). In addition, we evaluated two approaches of data arrangement, species filtering and conditional prediction, to enhance the selected models. The model predictions were tested using cross validation based on empirical data collected from marine fisheries surveys, and the effects of community data were evaluated by comparing models for six selected rare species. The results demonstrated that the community data improved the predictions of rare species' distributions to certain extent but might also be unhelpful in some cases. The rare species could be appropriately predicted in terms of occurrence, whereas their abundance tended to be underestimated by most models. Species filtering and conditional predictions substantially benefited the predictive performances of multiple- and single-species models, respectively. We conclude that both the modelling algorithms and community data need to be carefully selected in order to deliver improvement in modelling rare species. The study highlights the opportunity and challenges to improve prediction of rare species' distribution by making the most of community data.Entities:
Mesh:
Year: 2020 PMID: 32699354 PMCID: PMC7376031 DOI: 10.1038/s41598-020-69157-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Predictive performances of models on the distribution of Japanese seahorse (Hippocampus mohnikei). The prediction of occurrence was evaluated by the area under the curve of receiver operating characteristic (auc) and Cohen’s coefficient (κ), and prediction of abundance was evaluated by partial relative bias of non-zero data (PRB) and root mean square error (RMSE). The dash line in the last plot denotes a baseline of RMSE derived from all-zero predictions.
A summary of model predictive performances for target rare species.
| Measures | Models | Sp1 | Sp2 | Sp3 | Sp4 | Sp5 | Sp6 |
|---|---|---|---|---|---|---|---|
| AUC | RF | 0.875 | 0.644 | 0.949 | 0.959 | 0.800 | 0.911 |
| ANN | 0.711 | 0.572 | 0.628 | 0.634 | 0.582 | 0.633 | |
| MRF | 0.893 | 0.618 | 0.956 | 0.962 | 0.784 | 0.926 | |
| MANN | 0.722 | 0.576 | 0.929 | 0.935 | 0.751 | 0.929 | |
| HMSC | 0.802 | 0.670 | 0.941 | 0.901 | 0.765 | 0.908 | |
| GJAM | 0.724 | 0.640 | 0.932 | 0.860 | 0.688 | 0.896 | |
| κ | RF | 0.208 | − 0.046 | 0.486 | 0.562 | 0.289 | 0.616 |
| ANN | 0.134 | 0.030 | 0.358 | 0.353 | 0.102 | 0.320 | |
| MRF | 0.243 | 0.163 | 0.601 | 0.528 | 0.205 | 0.644 | |
| MANN | 0.088 | 0.030 | 0.486 | 0.532 | 0.243 | 0.634 | |
| HMSC | 0.041 | − 0.034 | 0.493 | 0.248 | 0.126 | 0.541 | |
| GJAM | 0.046 | − 0.034 | 0.408 | 0.142 | 0.069 | 0.497 | |
| RMSE | RF | 0.299 | 0.419 | 0.377 | 0.652 | 0.569 | 0.588 |
| ANN | 0.652 | 0.797 | 0.533 | 1.259 | 1.402 | 1.195 | |
| MRF | 0.300 | 0.416 | 0.414 | 0.648 | 0.577 | 0.559 | |
| MANN | 0.337 | 0.453 | 0.389 | 0.729 | 0.612 | 0.587 | |
| HMSC | 0.297 | 0.414 | 0.384 | 0.732 | 0.569 | 0.611 | |
| GJAM | 0.300 | 0.409 | 0.393 | 0.760 | 0.582 | 0.628 | |
| Zero | 0.300 | 0.397 | 0.418 | 0.838 | 0.601 | 0.776 |
Each cell denotes the average values of the performance measures for a combination of species and models, respectively. Large values of AUC and κ represented high predictive accuracy of species occurrence and small values of RMSE represent high predictive accuracy of species abundance. The row of “Zero” denotes a baseline of RMSE when all predicted values are zeros.
Figure 2The influences of species filtering on the predictive performance of JSDMs. The levels in the X-axis denoted different thresholds of species correlation for selecting ancillary species (LV1 denoted a large set of species selected and LV3 denoted a small set. Three species are illustrated as examples and the full results are shown in Supplementary Information).
Figure 3The effects of conditional prediction on improving predictive performances. The ΔRMSE indicates the decreases of RMSE in conditional models compared to that of single-species RF and ANN, respectively. RF-OBS and ANN-OBS denote the predictions conditioning on real observations (survey data), and others are conditioning on the prediction of JSDMs (Three species are illustrated as examples and the full results are shown in Supplementary Information).
A summary of predictive models used in this study.
| Categories | Models | Full names | How to address species associations | R packages | References |
|---|---|---|---|---|---|
| SSDM | RF | Random forest | None | randomForest (v4.6-14) | Breiman[ |
| ANN | Artificial neural network | None | nnet (v7.3-12) | Basheer and Hajmeer[ | |
| Machine-learning JSDM | MRF | Multivariate random forest | Implicitly incorporated from compositional similarity | MultivariateRandomForest (v1.1.5) | Segal and Xiao[ |
| MANN | Multiresponse artificial neural network | Implicitly incorporated from neuron connections | nnet (v7.3-12) | Olden[ | |
| Regression-based JSDM | HMSC | Hierarchical Modelling of Species Communities | Explicitly incorporated with latent variables | HMSC (v2.2-0)a | Ovaskainen et al.[ |
| GJAM | Generalized Joint Attribute Model | Explicitly incorporated with a covariance matrix | gjam (v2.2.6) | Clark et al.[ |
aR codes of HMSC are available on Github (https://github.com/guiblanchet/HMSC), and others are available on CRAN.