| Literature DB >> 28097060 |
Chunrong Mi1, Falk Huettmann2, Yumin Guo1, Xuesong Han1, Lijia Wen1.
Abstract
Species distribution models (SDMs) have become an essential tool in ecology, biogeography, evolution and, more recently, in conservation biology. How to generalize species distributions in large undersampled areas, especially with few samples, is a fundamental issue of SDMs. In order to explore this issue, we used the best available presence records for the Hooded Crane (Grus monacha, n = 33), White-naped Crane (Grus vipio, n = 40), and Black-necked Crane (Grus nigricollis, n = 75) in China as three case studies, employing four powerful and commonly used machine learning algorithms to map the breeding distributions of the three species: TreeNet (Stochastic Gradient Boosting, Boosted Regression Tree Model), Random Forest, CART (Classification and Regression Tree) and Maxent (Maximum Entropy Models). In addition, we developed an ensemble forecast by averaging predicted probability of the above four models results. Commonly used model performance metrics (Area under ROC (AUC) and true skill statistic (TSS)) were employed to evaluate model accuracy. The latest satellite tracking data and compiled literature data were used as two independent testing datasets to confront model predictions. We found Random Forest demonstrated the best performance for the most assessment method, provided a better model fit to the testing data, and achieved better species range maps for each crane species in undersampled areas. Random Forest has been generally available for more than 20 years and has been known to perform extremely well in ecological predictions. However, while increasingly on the rise, its potential is still widely underused in conservation, (spatial) ecological applications and for inference. Our results show that it informs ecological and biogeographical theories as well as being suitable for conservation applications, specifically when the study area is undersampled. This method helps to save model-selection time and effort, and allows robust and rapid assessments and decisions for efficient conservation.Entities:
Keywords: Black-necked crane (Grus nigricollis); Generality (transferability); Hooded crane (Grus monacha); Random Forest; Rare species; Species distribution models (SDMs); Undersampled areas; White-naped crane (Grus vipio)
Year: 2017 PMID: 28097060 PMCID: PMC5237372 DOI: 10.7717/peerj.2849
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Study areas for three species cranes.
Environmental GIS layers used to predict breeding distributions of three cranes.
| Environmental layers | Description | Source | Website |
|---|---|---|---|
| Bio_1 | Annual mean temperature (°C) | WorldClim |
|
| Bio_2 | Monthly mean (max temp–min temp) (°C) | WorldClim |
|
| Bio_3 | Isothermality (BIO2/BIO7) (*100 °C) | WorldClim |
|
| Bio_4 | Temperature seasonality (standard deviation *100 °C) | WorldClim |
|
| Bio_5 | Max temperature of warmest month (°C) | WorldClim |
|
| Bio_6 | Min temperature of coldest month (°C) | WorldClim |
|
| Bio_7 | Annual temperature range (BIO5-BIO6) (°C) | WorldClim |
|
| Bio_12 | Annual precipitation (mm) | WorldClim |
|
| Bio_13 | Precipitation of wettest month (mm) | WorldClim |
|
| Bio_14 | Precipitation of driest month (mm) | WorldClim |
|
| Bio_15 | Precipitation seasonality (mm) | WorldClim |
|
| Altitude | Altitude (m) | WorldClim |
|
| Aspect | Aspect (°) | Derived from Altitude |
|
| Slope | Slope | Derived from Altitude |
|
| Landcover | Land cover | ESA |
|
| Disroad | Distance to roads (m) | Road layer from Natural Earth |
|
| Disrard | Distance to railways (m) | Railroad layer from Natural Earth |
|
| Disriver | Distance to rivers (m) | River layer from Natural Earth |
|
| Dislake | Distance to lakes (m) | Lake layer from Natural Earth |
|
| Discoastline | Distance to coastline (m) | Coastline layer from Natural Earth |
|
| Dissettle | Distance to settlements (m) | Settle layer from Natural Earth |
|
Figure 2Detailed study areas showing the presence of and testing data used for the three cranes.
(A) Hooded Cranes, (B) White-naped Cranes, (C) Black-necked Cranes.
AUC and TSS values for four machine learning models and their Ensemble model with three crane species based on literature testing data.
| Accuracy metric (samples) | Species distribution model | ||||
|---|---|---|---|---|---|
| TreeNet | Random Forest | CART | Maxent | Ensemble | |
| Hooded Crane ( | |||||
| AUC | 0.504 | 0.500 | 0.558 | 0.558 | |
| TSS | 0.000 | 0.000 | 0.137 | 0.117 | |
| White-naped Crane ( | |||||
| AUC | 0.605 | 0.564 | 0.712 | ||
| TSS | 0.210 | 0.128 | 0.424 | 0.508 | |
| Black-necked Crane ( | |||||
| AUC | 0.528 | 0.830 | 0.672 | 0.805 | |
| TSS | 0.055 | 0.660 | 0.345 | 0.611 | |
Figure 3Violin plots of the Relative Index of Occurrence (RIO) for four SDMs and Ensemble model for Hooded Cranes and White-naped Cranes based on satellite tracking data.
(A) Violin plots of Hooded Cranes, (B) violin plots of White-naped Cranes.
Figure 4Violin plots of Relative Index of Occurrence (RIO) values for four SDMs and Ensemble model for three cranes based on calibration data from Threatened Birds of Asia.
(A) Violin plots for Hooded Cranes, (B) violin plots for White-naped Cranes, (C) violin plots for Black-necked Cranes.
Figure 5Prediction maps for Hooded Cranes and zoomed-in maps showing the four models (TreeNet, Random Forest, CART and Maxent) and Ensemble model in detail.
(A–E) Prediction map for Hooded Cranes, (F–J) zoomed-in map for Hooded Cranes.
Figure 6Prediction maps for White-naped Cranes and zoomed-in maps showing the four models (TreeNet, Random Forest, CART and Maxent) and Ensemble model in detail.
(A–E) Prediction map for White-naped Cranes, (F–J) zoomed-in map for White-naped Cranes.
Figure 7Prediction maps for Black-necked Cranes and zoomed-in maps showing the four models (TreeNet, Random Forest, CART and Maxent) and Ensemble model in detail.
(A–E) Prediction map for Black-necked Cranes, (F–J) zoomed-in map for Black-necked Cranes.
Figure 8Violin plots of Relative Index of Occurrence (RIO) values for four SDMs and Ensemble model for three cranes based on calibration data from Threatened Birds of Asia.
(A) Violin plots for Hooded Cranes, (B) violin plots for White-naped Cranes, (C) violin plots for Black-necked Cranes.