| Literature DB >> 28640819 |
Keith B Aubry1, Catherine M Raley1, Kevin S McKelvey2.
Abstract
The availability of spatially referenced environmental data and species occurrence records in online databases enable practitioners to easily generate species distribution models (SDMs) for a broad array of taxa. Such databases often include occurrence records of unknown reliability, yet little information is available on the influence of data quality on SDMs generated for rare, elusive, and cryptic species that are prone to misidentification in the field. We investigated this question for the fisher (Pekania pennanti), a forest carnivore of conservation concern in the Pacific States that is often confused with the more common Pacific marten (Martes caurina). Fisher occurrence records supported by physical evidence (verifiable records) were available from a limited area, whereas occurrence records of unknown quality (unscreened records) were available from throughout the fisher's historical range. We reserved 20% of the verifiable records to use as a test sample for both models and generated SDMs with each dataset using Maxent. The verifiable model performed substantially better than the unscreened model based on multiple metrics including AUCtest values (0.78 and 0.62, respectively), evaluation of training and test gains, and statistical tests of how well each model predicted test localities. In addition, the verifiable model was consistent with our knowledge of the fisher's habitat relations and potential distribution, whereas the unscreened model indicated a much broader area of high-quality habitat (indices > 0.5) that included large expanses of high-elevation habitat that fishers do not occupy. Because Pacific martens remain relatively common in upper elevation habitats in the Cascade Range and Sierra Nevada, the SDM based on unscreened records likely reflects primarily a conflation of marten and fisher habitat. Consequently, accurate identifications are far more important than the spatial extent of occurrence records for generating reliable SDMs for the fisher in this region. We strongly recommend that practitioners avoid using anecdotal occurrence records to build SDMs but, if such data are used, the validity of resulting models should be tested with verifiable occurrence records.Entities:
Mesh:
Year: 2017 PMID: 28640819 PMCID: PMC5480872 DOI: 10.1371/journal.pone.0179152
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fisher occurrence records from the Pacific States used to evaluate the effects of data quality on the performance and reliability of Maxent species distribution models.
| Data Set | Region | Year | References | |
|---|---|---|---|---|
| Verifiable records | Western Washington | 1990–1997 | 0 | [ |
| Western Oregon | 1994–2001 | 29 | [ | |
| California | 1989–1994 | 96 | [ | |
| Unscreened records | Western Washington | 1955–1992 | 94 | [ |
| Western Oregon | 1954–1993 | 145 | [ | |
| California | 1960–1987 | 150 | [ |
aA series of standardized remote-camera surveys involving > 17,000 sample nights were conducted throughout western Washington by various researchers from 1990 to 1997; no fishers were detected.
bWe excluded 22 unscreened fisher occurrence records from Washington compiled by Aubry and Houston [33] because they had spatial precision > 5.5 km or were located in different ecoregions [34] outside our analysis area, and included 1 additional record from 1992 that was not reported by Aubry and Houston [33].
cWe excluded 11 unscreened fisher occurrence records in Oregon compiled by Aubry and Lewis [16] because they were located in different ecoregions [34] outside our analysis area.
Fig 1Maps depicting (A) the geographic extent of our analysis area in the Pacific States and the locations of verifiable and unscreened fisher occurrence records used in Maxent modeling, and (B) the elevational gradient and major physiographic regions that occur within our analysis area.
Environmental covariates considered for generating species distribution models for the fisher in the Pacific States using Maxent.
The 6 covariates selected for Maxent modeling are shaded gray.
| Category | Covariate | Reference | Description |
|---|---|---|---|
| Climatic | Winter snowfall (SNOW) | [ | Average precipitation (mm) as snow during winter (Dec-Feb) from 1971 to 2000. Derived from 4 x 4-km PRISM data and downscaled to 250 x 250-m pixels. |
| Topographic | Topographic position index (TPI) | [ | Estimate of terrain ruggedness, based on differences in elevation between a cell and the cells in the surrounding neighborhood. Negative values indicate valleys or canyon bottoms, and positive values indicate ridges or hilltops. Calculated from a 250 x 250-m digital elevation model. |
| Vegetative | Forest biomass (BIOMASS) | [ | Aboveground live forest biomass (mg/ha). Used at the original resolution of 250 x 250-m pixels. |
| Vegetation class (VEGCLASS) | [ | Forest vegetation classes based on canopy cover, basal area of hardwoods, and quadratic mean diameter of dominant and codominant trees. Derived from 30 x 30-m raster data resampled to 250 x 250-m pixels using a majority algorithm. | |
| Stand age (AGE) | [ | Basal area weighted stand age (year). Derived from 30 x 30-m raster data resampled to 250 x 250-m pixels using bilinear interpolation. | |
| Canopy cover (CANCOV) | [ | Canopy cover of all live trees (%). Derived from 30 x 30-m raster data resampled to 250 x 250-m pixels using bilinear interpolation. | |
| Diameter diversity index (DDI) | [ | Diameter diversity index of forest stands based on tree densities in different DBH classes. Derived from 30 x 30-m raster data resampled to 250 x 250-m pixels using bilinear interpolation. | |
| Quadratic mean diameter (QMD) | [ | Quadratic mean diameter of all dominant and codominant trees (cm). Derived from 30 x 30-m raster data resampled to 250 x 250-m pixels using bilinear interpolation. |
Fig 2Results from jackknife tests of regularized training gain (upper graph) and test gain (lower graph) generated by running the final verifiable fisher distribution model in Maxent with an independent test sample.
Fig 3Results from jackknife tests of regularized training gain (upper graph) and test gain (lower graph) generated by running the final unscreened fisher model in Maxent with an independent test sample.
Fig 4Species distribution maps for the fisher in the Pacific States created using logistic values (relative habitat-quality indices) generated in Maxent based on (A) the final verifiable fisher model, and (B) the final unscreened fisher model.
Fig 5Mean values for continuous covariates included in the final verifiable and unscreened fisher distribution models by relative habitat quality classes.
Elevation (ELEV) was not included as a covariate in the modeling process, but is presented here to help elucidate the contribution of SNOW to the final models.