| Literature DB >> 23951296 |
Eric L Berlow1, Roland A Knapp, Steven M Ostoja, Richard J Williams, Heather McKenny, John R Matchett, Qinghua Guo, Gary M Fellers, Patrick Kleeman, Matthew L Brooks, Lucas Joppa.
Abstract
A central challenge of conservation biology is using limited data to predict rare species occurrence and identify conservation areas that play a disproportionate role in regional persistence. Where species occupy discrete patches in a landscape, such predictions require data about environmental quality of individual patches and the connectivity among high quality patches. We present a novel extension to species occupancy modeling that blends traditional predictions of individual patch environmental quality with network analysis to estimate connectivity characteristics using limited survey data. We demonstrate this approach using environmental and geospatial attributes to predict observed occupancy patterns of the Yosemite toad (Anaxyrus (= Bufo) canorus) across >2,500 meadows in Yosemite National Park (USA). A. canorus, a Federal Proposed Species, breeds in shallow water associated with meadows. Our generalized linear model (GLM) accurately predicted ~84% of true presence-absence data on a subset of data withheld for testing. The predicted environmental quality of each meadow was iteratively 'boosted' by the quality of neighbors within dispersal distance. We used this park-wide meadow connectivity network to estimate the relative influence of an individual Meadow's 'environmental quality' versus its 'network quality' to predict: a) clusters of high quality breeding meadows potentially linked by dispersal, b) breeding meadows with high environmental quality that are isolated from other such meadows, c) breeding meadows with lower environmental quality where long-term persistence may critically depend on the network neighborhood, and d) breeding meadows with the biggest impact on park-wide breeding patterns. Combined with targeted data on dispersal, genetics, disease, and other potential stressors, these results can guide designation of core conservation areas for A. canorus in Yosemite National Park.Entities:
Mesh:
Year: 2013 PMID: 23951296 PMCID: PMC3741202 DOI: 10.1371/journal.pone.0072200
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Explanatory variables used in the General Linear Models and General Additive Models chosen from the full set because they were not highly correlated (r < 0.9).
|
|
|
|---|---|
|
| Elevation of the meadow centroid |
|
| Maximum terrain slope along the shortest path to the nearest meadow |
|
| Shannon Diversity index of mapped vegetation classes within each meadow polygon. This measure combines the number of vegetation polygons of each type and their aerial extent. |
|
| Inter-annual average (1986-2006) Tasseled Cap Wetness, averaged within each year over all 30m LANDSAT pixels within the meadow. |
|
| Inter-annual standard deviation (1986-2006) of the Tasseled Cap Wetness, averaged within each year over all 30m LANDSAT pixels within the meadow. |
|
| Inter-annual average (1986-2006) of the spatial standard deviation in Tasseled Cap Wetness across LANDSAT pixels within the meadow. |
|
| Inter-annual average (1986-2006) Normalized Difference Vegetation Index, averaged within each year over all 30m LANDSAT pixels within the meadow. |
|
| Inter-annual standard deviation (1986-2006) of the Normalized Difference Vegetation Index, averaged within each year over all 30m LANDSAT pixels within the meadow. |
|
| Inter-annual average (1986-2006) of the spatial standard deviation in Normalized Difference Vegetation Index across LANDSAT pixels within the meadow. |
|
| Inter-annual average (2002-2007) proportion of days in the water year that the meadow had >50% snow covered area (estimated from daily MODIS). |
|
| Inter-annual variability (2002-2007) in snow-covered days, measured as the standard deviation among water years in the proportion of days with >50% snow covered area (estimated from daily MODIS). |
|
| Inter-annual average (2002-2007) of the first date after April 1st that the meadow had <25% snow covered area (estimated from daily MODIS) |
|
| Inter-annual standard deviation (2002-2007) of the first data after April 1st that the meadow had <25% snow covered area (estimated from daily MODIS) |
|
| Inter-annual average (1980-1997) of the mean monthly precipitation for the meadow estimated from Daymet. |
|
| Inter-annual standard deviation (1980-1997) of the mean monthly precipitation for the meadow estimated from Daymet. |
The different models varied in details of variable selection, coefficients, and functional forms, but the rank-estimates of breeding probabilities predicted by all models were highly correlated (Spearman’s correlation > 0.95 for all comparisons). Our focus in this study is on prediction of breeding occupancy (and the management consequences of those predictions), rather than interpretation of model coefficients. See the Methods for more details on how these variables were derived.
AUC scores, Maximum Discrimination Threshold (MDT) probabilities (the model probability threshold that maximizes both true presences and true absences), and the true presence and true absence rate for the three General Linear Models, three General Additive Models, and the Network-Boosted GLM.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
|
| AUC | . | 0.88 | 0.86 |
| MDT | . | 0.16 | 0.13 | ||
| %True Presence | . | 81 | 80 | ||
| %True Absence | . | 80 | 82 | ||
|
|
| AUC | . | 0.87 | 0.86 |
| MDT | . | 0.16 | 0.12 | ||
| %True Presence | . | 79 | 80 | ||
| %True Absence | . | 80 | 79 | ||
|
|
| AUC | . | 0.88 | 0.86 |
| MDT | . | 0.56 | 0.53 | ||
| %True Presence | . | 80 | 84 | ||
| %True Absence | . | 80 | 84 | ||
|
|
| AUC | . | 0.88 | 0.86 |
| MDT | . | 0.16 | 0.12 | ||
| %True Presence | . | 81 | 80 | ||
| %True Absence | . | 80 | 82 | ||
|
|
| AUC | . | 0.91 | 0.87 |
| MDT | . | 0.18 | 0.11 | ||
| %True Presence | . | 84 | 80 | ||
| %True Absence | . | 84 | 79 | ||
|
|
| AUC | . | 0.90 | 0.84 |
| MDT | . | 0.16 | 0.11 | ||
| %True Presence | . | 81 | 80 | ||
| %True Absence | . | 81 | 80 | ||
|
|
| AUC | 0.864 | . | . |
| MDT | 0.88 | . | . | ||
| %True Presence | 78 | . | . | ||
| %True Absence | 78 | . | . |
The latter includes contributions from nearby meadows (and their neighbors, neighbors of neighbors, etc) to explore the potential influence of dispersal. All the others models only include variables intrinsic to the meadow and were used to predict observed breeding based on meadow covariates (See Methods for more model descriptions).
Figure 1Performance of the “Environmental Model.”
a) The distribution of probabilities for the three classes of data for the Environmental Model (see Methods: “100subsampleGLM”). Data are combined from all surveys from 1996–2010. b) ROC plot for the training and test data. The red line shows the fit of the model to the training data, while the blue shows the fit relative to the test data. The curves represent those drawn from the average probabilities across 100 models.
Figure 2Disentangling intrinsic meadow ‘environmental quality’ from its spatial ‘network quality’ with respect to predicted breeding.
a) Normalized Network Improved breeding probabilities ((Network Boosted probability – Environmental Model probability) / (1 – Environmental Model probability)) for each meadow as a function of the mean Environmental Model probability. Symbol colors represent field survey results (Breeding detected (red), Breeding not detected (blue), and Meadow not visited (black). b-c) Total Influence (G) of meadow deletion (the decrease in park-wide breeding probability summed across all meadows after deleting that meadow from the network) as a function of the Environmental Model and the Network-Boosted GLM probabilities. Horizontal dotted lines indicate the 90th percentile of meadow deletion impact, and solid red symbols are the known breeding meadows within that 90th percentile. Vertical dotted lines in all panels represent the threshold probability that maximizes both true positives and true negatives predicted by the model.
Figure 3Watershed scale patterns of predicted breeding.
a) Watersheds with high mean predicted probability of breeding (Environmental Model) tend to have low coefficient of variation (CV) in breeding probabilities among meadows within the watershed. b) Watersheds with high mean breeding probability across also show high normalized network improvement in the predicted breeding probabilities due to spatial clustering of ‘intrinsically good’ meadows. ‘Normalized network improvement’ is the proportional increase in Environmental Model predicted probability relative to the maximum improvement possible.
Figure 4Observed and predicted distribution of meadows and watersheds with breeding in Yosemite National Park.
The watersheds (Cal 2.2.2 watershed planning units) are colored by the percent of meadows within the basin that are predicted by the Network Boosted GLM model to have breeding. The symbols are centroids for all 2,558 contiguous meadows in the park. Pink and/or red symbols indicate a meadow with observed breeding at least once between 1992–2010. Blue symbols were visited at least once with no detected. Small black dots are meadows that have not been visited. Solid read symbols are meadows with breeding that are in 90th percentile of impact on park-wide breeding probability if deleted. Note that meadows with high deletion impact are not in the watersheds with highest proportion of ‘good’ meadows.