| Literature DB >> 26053548 |
Chloe Bellamy1, John Altringham1.
Abstract
Conservation increasingly operates at the landscape scale. For this to be effective, we need landscape scale information on species distributions and the environmental factors that underpin them. Species records are becoming increasingly available via data centres and online portals, but they are often patchy and biased. We demonstrate how such data can yield useful habitat suitability models, using bat roost records as an example. We analysed the effects of environmental variables at eight spatial scales (500 m - 6 km) on roost selection by eight bat species (Pipistrellus pipistrellus, P. pygmaeus, Nyctalus noctula, Myotis mystacinus, M. brandtii, M. nattereri, M. daubentonii, and Plecotus auritus) using the presence-only modelling software MaxEnt. Modelling was carried out on a selection of 418 data centre roost records from the Lake District National Park, UK. Target group pseudoabsences were selected to reduce the impact of sampling bias. Multi-scale models, combining variables measured at their best performing spatial scales, were used to predict roosting habitat suitability, yielding models with useful predictive abilities. Small areas of deciduous woodland consistently increased roosting habitat suitability, but other habitat associations varied between species and scales. Pipistrellus were positively related to built environments at small scales, and depended on large-scale woodland availability. The other, more specialist, species were highly sensitive to human-altered landscapes, avoiding even small rural towns. The strength of many relationships at large scales suggests that bats are sensitive to habitat modifications far from the roost itself. The fine resolution, large extent maps will aid targeted decision-making by conservationists and planners. We have made available an ArcGIS toolbox that automates the production of multi-scale variables, to facilitate the application of our methods to other taxa and locations. Habitat suitability modelling has the potential to become a standard tool for supporting landscape-scale decision-making as relevant data and open source, user-friendly, and peer-reviewed software become widely available.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26053548 PMCID: PMC4460044 DOI: 10.1371/journal.pone.0128440
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Map of the species’ roost records used from the Lake District National Park, NW England.
Sample size: P. pipistrellus—129; P. pygmaeus—80; N. noctula—10; M. brandtii / mystacinus—24; M. daubentonii—51; M. nattereri—23; P. auritus—102. NB. Drawn at this scale, overlap between roosts masks many species’ roost locations. See Fig 5 for separate species’ roost maps. Crown database right 2010. An Ordnance Survey/EDINA supplied service.
Fig 5Habitat suitability maps made using each species’ pruned set of variables.
HSI = Habitat Suitability Index. The Lake District National Park boundary is marked in white. Species roost locations are coloured by test subsets where appropriate. Crown database right 2010. An Ordnance Survey/EDINA supplied service.
The fifteen habitat variables used for analysis.
All layers were produced at 8 different spatial scales except the two distance variables.
| GIS data layer | Description | Original data source |
|---|---|---|
| Distance to inland water (m) | Euclidean distance to nearest inland water feature | OS MasterMap Topography Layer |
| Distance to woodland edge (m) | Euclidean distance to nearest woodland edge | OS MasterMap Topography Layer |
| Majority aspect (categorical: flat, N, NE, E, SE, S, SW, W, NW) | Majority aspect at multiple scales | OS Land-Form PROFILE DTM |
| Mean altitude (m a.s.l.) | Mean altitude at multiple scales | OS Land-Form PROFILE DTM |
| Mean slope (°) | Mean slope at multiple scales | OS Land-Form PROFILE DTM |
| Cover of inland water (%) | Percentage cover of inland water at multiple scales | OS MasterMap Topography Layer |
| Cover of deciduous wood (%) | Percentage cover of deciduous wood at multiple scales | OS MasterMap Topography Layer |
| Cover of coniferous wood (%) | Percentage cover of coniferous wood at multiple scales | OS MasterMap Topography Layer |
| Cover of mixed wood (%) | Percentage cover of mixed wood at multiple scales | National Inventory of Woodland & Trees & OS MasterMap Topography Layer |
| Cover of buildings (%) | Percentage cover of buildings at multiple scales | OS MasterMap Topography Layer |
| Cover of manmade surfaces (%) | Percentage cover of manmade surfaces and structures at multiple scales | OS MasterMap Topography Layer |
| Cover of ancient wood (%) | Percentage cover of (non-replanted) ancient wood at multiple scales | Ancient Woodland Inventory (Provisional) for England |
| Habitat richness (%) | Proportion of 13 habitat types present at multiple scales | OS MasterMap Topography Layer |
| Maximum woodland patch (km2) | Size of the largest woodland patch within or intersecting the different sized scale windows | OS MasterMap Topography Layer |
| Woodland edge density (km/km2) | Length of woodland edge per unit area at multiple scales | OS MasterMap Topography Layer |
Fig 2Distance to water and woodland edge response curves.
These graphs show probability of a species’ roost (p) at a location based on these distances and are based on the results of univariate models to prevent any other interacting or collinear variables affecting the relationships found. Variables which were found to have poor predictive power for a species (AUC ≤ 0.5 or test gain < 0.01) are not shown.
Fig 3Variable performance.
These graphs show the strength of association (as test AUC) between each species’ presence and individual environmental variables at different spatial scales. The average predictive power of the distance variables is shown as a dashed line: these were independent of scale. Environmental variables with a predictive power ≤ 0.5 are no better than random. Only variables retained in pruned models are shown. NB. The scale range is not linear for improved clarity at small scale.
Mean model performance.
Measured using random 5-fold cross validation (P. pipistrellus, P. pygmaeus, M. daubentonii and P. auritus), or jackknife validation (N. noctula, M. brandtii/mystacinus and M. nattereri) tests.
| Species | Train set | Test set | Train AUC | Test AUC | Test gain | Test omission rates |
|---|---|---|---|---|---|---|
|
| 103.2 | 25.8 | 0.740±0.02 | 0.713±0.05 | 0.285±0.17 |
|
|
| 63.2 | 15.8 | 0.835±0.01 | 0.802±0.05 | 0.620±0.20 |
|
|
| 9 | 1 | 0.889±0.02 | 0.865±0.12 | 0.848±0.78 | 0.000 |
|
| 23 | 1 | 0.897±0.01 | 0.829±0.05 | 0.687±0.64 | 0.000 |
|
| 40.8 | 10.2 | 0.910±0.01 | 0.888±0.05 | 1.143±0.37 |
|
|
| 22 | 1 | 0.858±0.01 | 0.801±0.25 | 0.562±1.23 | 0.000 |
|
| 81.6 | 20.4 | 0.804±0.02 | 0.769±0.05 | 0.344±0.45 |
|
Train set = average number of training data; Test set = average number of test data; Test omission rates = average proportion of test data which fell outside of the suitable area. Omission rates which are significantly lower than expected by chance alone are in bold type. Asterisks signify level of significance (*** p<0.001), all values ± S.D.
Mean model performance from spatially constrained 5-fold cross validation.
rSAC lag = largest distance within which data pairs retained significant, positive spatial autocorrelation.
| Spp. | rSAC lag (km) | Mean (km) | Min (km) | rSAC pairs (%) | Train set | Test set | Train AUC | Test AUC | Test gain | Test omission |
|---|---|---|---|---|---|---|---|---|---|---|
|
| 8 | 29.1 ±16.7 | 0.08 | 7.6 | 103.2 | 25.8 | 0.746 ±0.03 | 0.650 ±0.12 | 0.013 ±0.41 | 0.168* |
|
| 7 | 24.0 ±15.3 | 0.10 | 8.1 | 64.8 | 16.2 | 0.839 ±0.02 | 0.714 ±0.11 | 0.088 ±0.60 | 0.197* |
|
| 16 | 27.5 ±14.9 | 0.12 | 29.7 | 81.6 | 20.4 | 0.807 ±0.03 | 0.718 ±0.14 | 0.082 ±0.76 | 0.264* |
Mean = the mean distance between all spatially aggregated training and test data pairs; Min = the minimum distance between spatially constrained training and test data pairs; rSAC pairs = the percentage of all spatially constrained training and test data pairs that still fell within the rSAC lag; Train set = average number of training data; Test set = average number of test data; omission rates = average proportion of test data which fell outside of the suitable area; omission rates which are significantly lower than expected by chance alone are in bold type. Asterisks signify level of significance (* p<0.05); all mean values ± S.D.
Fig 4Representative MaxEnt response curves.
These graphs show the probability of a species’ presence at a location for a range of parameters. These graphs are based on univariate models to prevent interacting or collinear variables from affecting the relationships modelled. Variables found to have poor predictive power for a species (AUC ≤ 0.5 or test gain < 0.01) are not shown.