| Literature DB >> 25384371 |
Jason T May1, Larry R Brown, Andrew C Rehn, Ian R Waite, Peter R Ode, Raphael D Mazor, Kenneth C Schiff.
Abstract
We used boosted regression trees (BRT) to model stream biological condition as measured by benthic macroinvertebrate taxonomic completeness, the ratio of observed to expected (O/E) taxa. Models were developed with and without exclusion of rare taxa at a site. BRT models are robust, requiring few assumptions compared with traditional modeling techniques such as multiple linear regression. The BRT models were constructed to provide baseline support to stressor delineation by identifying natural physiographic and human land use gradients affecting stream biological condition statewide and for eight ecological regions within the state, as part of the development of numerical biological objectives for California's wadeable streams. Regions were defined on the basis of ecological, hydrologic, and jurisdictional factors and roughly corresponded with ecoregions. Physiographic and land use variables were derived from geographic information system coverages. The model for the entire state (n = 1,386) identified a composite measure of anthropogenic disturbance (the sum of urban, agricultural, and unmanaged roadside vegetation land cover) within the local watershed as the most important variable, explaining 56% of the variance in O/E values. Models for individual regions explained between 51 and 84% of the variance in O/E values. Measures of human disturbance were important in the three coastal regions. In the South Coast and Coastal Chaparral, local watershed measures of urbanization were the most important variables related to biological condition, while in the North Coast the composite measure of human disturbance at the watershed scale was most important. In the two mountain regions, natural gradients were most important, including slope, precipitation, and temperature. The remaining three regions had relatively small sample sizes (n ≤ 75 sites) and had models that gave mixed results. Understanding the spatial scale at which land use and land cover affect taxonomic completeness is imperative for sound management. Our results suggest that invertebrate taxonomic completeness is affected by human disturbance at the statewide and regional levels, with some differences among regions in the importance of natural gradients and types of human disturbance. The construction and application of models similar to the ones presented here could be useful in the planning and prioritization of actions for protection and conservation of biodiversity in California streams.Entities:
Mesh:
Year: 2014 PMID: 25384371 PMCID: PMC4226928 DOI: 10.1007/s10661-014-4086-x
Source DB: PubMed Journal: Environ Monit Assess ISSN: 0167-6369 Impact factor: 2.513
Fig. 1Map of California with Perennial Stream Assessment (PSA) regions
Model variable codes, units, scale, and description. Scale refers to spatial area of analysis (point, at the collection; 1 km, watershed area within 1 km of site; 5 km, watershed area within 5 km of site; WS, upstream watershed area)
| Scale | ||||||
|---|---|---|---|---|---|---|
| Variable code | Unit | Point | 1 km | 5 km | WS | Description |
| Response variables-invertebrate metrics | ||||||
| O/E | Ratio | X | Ratio of number of observed taxa at a site to the expected taxa based on modeled reference sites, all taxa | |||
| O/E50 | Ratio | X | Ratio of number of observed taxa at a site to the expected taxa based on modeled reference sites, common taxa with a probability of capture at 50 % sites | |||
| Explanatory variables | ||||||
| Temp | °C | X | 30-year (1971–2000) average max temperature | |||
| PPT | mm | X | 30-year (1971–2000) average annual precipitation | |||
| Slope | % | X | Gradient at reach level | |||
| Elevation | m | X | Elevation at site | |||
| DamDist | m | Distance to nearest upstream dam in catchment | ||||
| Area | km2 | X | Watershed area | |||
| Ag | % | X | X | X | Percent agricultural lands (row crop, pasture) | |
| AgUrb21 | % | X | X | X | Percent developed land (urban, row crop, pasture, nlcd class 21) | |
| Burns20052009 | # | X | Number of burns between 2005 and 2009 | |||
| CalPipe100kPer | % | X | Percent canals or pipes at the 100-k scale | |||
| CanalPipeDist100k | m | X | Total length of FTYPE equal to Canal Ditch and Pipeline in NHD+ | |||
| CaO | % | X | Percent calcite mineral content | |||
| CODE_21 | % | X | X | X | Percentage of urban/recreational grass (nlcd 21) | |
| CondQrm | μS/cm | X | Predicted electric conductivity (Olson and Hawkins 2012) | |||
| DamCount | # | X | X | X | Number of dams | |
| DamDensArea | dams/km2 | X | X | X | Density of dams, by area | |
| DamStorage | km2 | X | Total dam storage | |||
| GravelMines | # | X | X | X | Count of gravel mines | |
| GravelMinesDens | gravel mines/km2 | X | X | X | Density of gravel mines | |
| Grazing | % | X | X | X | Percent of area allotted to grazing on USFS and BLM lands | |
| IMPERVMEAN | % | X | X | X | Impervious surfaces from NLCD | |
| MAFLOWU | cfs | X | Cumulative annual discharge in NHD+, unit area method | |||
| MgO | % | X | Percent magnesium oxide mineral content | |||
| Mines | # | X | X | X | Count of mines | |
| MinesDens | mines/km2 | X | X | X | Density of mines | |
| Ngeo | % | X | Percent nitrogen geology | |||
| NewAg | % | X | X | Percent new ag (nlcd class 36, 46, or 56) | ||
| NewUrb | % | X | X | Percent new urban (nlcd class 32, 42, 52, or 62) | ||
| PctSed | % | X | Percent sedimentary geology | |||
| Pop | # people/km2 | X | X | X | Population density in 2000 | |
| rDDENSC12 | roads/km2 | X | X | X | Total density of paved roads (nlcd class 1 and 2) | |
| rDDENSC123 | roads/km2 | X | X | X | Total density of paved and unpaved roads (nlcd classes 1, 2, and 3) | |
| STREAMORDER | # | X | Strahler stream order | |||
| URBAN | % | X | X | X | Percentage of polygon designated as urban (nlcd class 22, 23, 24) | |
Median values of selected environmental variables for the state and by region. The minimum and maximum values are shown in parenthesis. For detailed variable descriptions, see Table 1 for variable codes: Area, AGURB21, Elevation, CondQrm, DamDensArea, IMPERVMEAN, MinesDens, and PPT, respectively
| Statewide ( | Lahontan ( | Central Valley ( | Coastal Chaparral ( | Modoc ( | Interior Chaparral ( | North Coast ( | South Coast ( | Sierras ( | |
|---|---|---|---|---|---|---|---|---|---|
| Watershed area (km2) | 43.2 | 89.1 | 103.9 | 320.6 | 468.4 | 112.8 | 28.7 | 48.8 | 39.8 |
| (0.12–40,509) | (1.1–2,307.1) | (5.8–31,185.2) | (1.79–8,722) | (1.81–8,812.4) | (2.94–575.9) | (1.1–40,509) | (0.12–4,118) | (1.2–4,565) | |
| Watershed percent developed land (%) | 2.03 | 0 | 30.13 | 5.45 | 0.46 | 2.06 | 2.16 | 5.75 | 0.06 |
| (0–100) | (0–45.27) | (0.34–100) | (0–80.78) | (0–41.98) | (0–63.73) | (0–83.93) | (0–100) | (0–34.38) | |
| Elevation at sampling site (m) | 638 | 2,037 | 28 | 102 | 1,361 | 379 | 535 | 459 | 1,433 |
| (0–3,107) | (1,258–3,107) | (3–141) | (0–1,269) | (0–2,205) | (66–940) | (3–1,753) | (0–2,246) | (512–2,997) | |
| Percent canals or pipes at the 100 k scale (%) | 0 | 0 | 9.28 | 0 | 0 | 0 | 0 | 0 | 0 |
| (0–68.65) | (0–17.24) | (0–68.65) | (0–47.39) | (0–17.99) | (0–54.64) | (0–19.02) | (0–60.89) | (0–54.31) | |
| Predicted conductivity (μS/cm) | 177 | 53 | 313 | 310 | 106 | 228 | 119 | 382 | 71 |
| (20–879) | (24–130) | (54–642) | (121–862) | (48–394) | (59–377) | (48–255) | (53–879) | (20–207) | |
| Watershed dam density (number of dams/watershed area) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| (0–0.009) | 0 | (0–0.008) | (0–0.009) | (0–0.001) | 0 | (0–0.001) | (0–0.002) | (0–0.001) | |
| Watershed impervious area (%) | 0.150 | 0.104 | 1.700 | 0.167 | 0.092 | 0.179 | 0.050 | 0.367 | 0.112 |
| (0–57.55) | (0–11.10) | (0.07–51.81) | (0–32.93) | (0–12.89) | (0–18.56) | (0–32.96) | (0–57.55) | (0–10.17) | |
| Watershed mine density (number of mines/watershed area) | 0.018 | 0 | 0.042 | 0.024 | 0.004 | 0.049 | 0.015 | 0.026 | 0.009 |
| (0–10.25) | (0–0.62) | (0–1.69) | (0–1.38) | (0–1.50) | (0–2.48) | (0–0.75) | (0–0.99) | (0–10.25) | |
| Annual precipitation (mm) | 715 | 790 | 562 | 629 | 519 | 802 | 1,426 | 516 | 1,136 |
| (86–3,038) | (119–1,477) | (289–703) | (275–1,677) | (86–1,725) | (384–1,631) | (409–2,724) | (271–1,184) | (168–3,038) |
Fig. 2Boxplots of observed over expected (O/E) and (O/E50) values for statewide and for each PSA region. The top, filled box plots represent O/E and the bottom, open box plots represent O/E50 values
Boosted regression tree results by perennial stream assessment region for O/E and O/E50. The variables shown are those included in the final model with their relative importance shown in parentheses. The ΔAIC value represents the difference in AIC between the model shown and the next best model
| Response variable |
| ΔAIC | Variables in order of influence (left to right) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Statewide ( | |||||||||
| O/E | 0.50 | 27 | AgUrb21_1km (56) | CondQrm (15) | SLOPE (11) | PPT (9) | Temp (9) | ||
| O/E50 | 0.57 | 85 | AgUrb21_1km (45) | SLOPE (19) | PPT (14) | CondQrm (12) | Temp (10) | ||
| Coastal group | |||||||||
| Coastal Chaparral ( | |||||||||
| O/E | 0.77 | 10 | Pop_5km (31) | PPT (15) | Ag_WS (14) | CondQrm (13) | Elevation (12) | MgO (9) | MinesDens_WS (6) |
| O/E50 | 0.84 | 48 | Pop_5km (33) | PPT (18) | Ag_WS (16) | CondQrm (12) | Elevation (11) | MgO (10) | |
| North Coast ( | |||||||||
| O/E | 0.63 | 34 | AgUrb21_WS (21) | Elevation (19) | Ngeo (19) | SLOPE (14) | MAFLOWU (14) | PPT (13) | |
| O/E50 | 0.76 | 24 | AgUrb21_WS (30) | SLOPE (25) | MAFLOWU (18) | Ngeo (17) | PPT (10) | ||
| South Coast ( | |||||||||
| O/E | 0.68 | 83 | UrBAN_1km (45) | MAFLOWU (16) | SLOPE (14) | CondQrm (13) | Temp (12) | ||
| O/E50 | 0.59 | 21 | UrBAN_1km (35) | SLOPE (16) | Ngeo (14) | Temp (13) | CondQrm (12) | MAFLOWU (10) | |
| Mountain group | |||||||||
| Lahontan ( | |||||||||
| O/E | 0.65 | 6 | SLOPE (33) | PPT (20) | MAFLOWU (16) | IMPERVMEAN_WS (16) | Temp (15) | ||
| O/E50 | 0.75 | 9 | SLOPE (27) | PPT (26) | Temp (21) | IMPERVMEAN_WS (14) | MAFLOWU (12) | ||
| Sierras ( | |||||||||
| O/E | 0.51 | 13 | SLOPE (36) | PPT (22) | Temp (20) | rDDENSC12_WS (12) | NewUrb_5km (10) | ||
| O/E50 | 0.60 | 30 | PPT (42) | SLOPE (24) | rDDENSC12_WS (12) | GrAZING_WS (12) | Temp (10) | ||
| Miscellaneous group | |||||||||
| Central Valley ( | |||||||||
| O/E | 0.64 | 4 | CondQrm (64) | SLOPE (25) | AgUrb21_1km (6) | Temp (5) | |||
| O/E50 | 0.44 | 4 | CaO (26) | SLOPE (24) | Temp (22) | CondQrm (14) | PctSed_ws (7) | Ag_WS (7) | |
| Interior Chaparral ( | |||||||||
| O/E | 0.65 | 26 | SLOPE (35) | CondQrm (29) | IMPERVMEAN_WS (27) | Ag_WS (9) | |||
| O/E50 | 0.71 | 5 | PPT (39) | CondQrm (35) | IMPERVMEAN_1km (26) | ||||
| Modoc ( | |||||||||
| O/E | 0.64 | 7 | CondQrm (26) | Ag_WS (21) | Temp (20) | IMPERVMEAN_WS (18) | SLOPE (15) | ||
| O/E50 | 0.66 | 3 | CondQrm (29) | IMPERVMEAN_WS (26) | Ag_WS (23) | Temp (22) | |||
Fig. 3Partial dependency plots for the final statewide model (n = 1,386) of the response form of O/E (y-axis = fitted function of O/E) based on the effect of individual explanatory variables with the response of all other variables removed. Variables are shown in order of model importance: a percent developed land at 1 km scale (AGURB21_1km); b predicted conductivity (CondQrm); c slope; d average precipitation (PPT); and e average temperature (Temp). The top tick marks of each plot indicate deciles of the predictor variable
Fig. 4Partial dependency plots for the final Coastal Chaparral model (n = 216) of the response form of O/E50 (y-axis = fitted function of O/E50) based on the effect of individual explanatory variables with the response of all other variables removed. Variables are shown in order of model importance: a population density in 2000 at 5 km scale (Pop_5km); b average precipitation (PPT); c agricultural land use at watershed scale (Ag_WS); d predicted conductivity (CondQrm); e average temperature (Temp); and f Percent magnesium oxide mineral content (MgO). The top tick marks of each plot indicate deciles of the predictor variable
Fig. 5Partial dependency plots for the final Sierra region (n = 203) of the response form of O/E (y-axis = fitted function of O/E50) based on the effect of individual explanatory variables with the response of all other variables removed. Variables are shown in order of model importance: a slope; b average precipitation (PPT); c average temperature (Temp); d total density of paved roads at watershed scale (rDDENSC12_WS); and e percent new urban at 5 km scale (NewUrb_5km). The top tick marks of each plot indicate deciles of the predictor variable
Fig. 6Regional regressions and plots of observed O/E values as a function of O/E values predicted using the statewide model. The dashed line represents the 1:1 line
Geographic information systems sources and references
| Spatial dataset | Data source | Source data format | Processing format | Resolution | Reference |
|---|---|---|---|---|---|
| Hydrography | National Hydrography Dataset (NHD) | Vector | Vector | 1:24,000 | U.S. Geological Survey, National Hydrography Dataset, Digital data, accessed, January 2010 at |
| Land Cover 1992 | National Land Cover Dataset 1992 (NLCD) | Raster | Vector | 30 m | U.S. Geological Survey, National Land Cover Dataset, Digital data, accessed, January 2010 at |
| Land Cover 2001 | NCLD 2001 | Raster | Vector | 30 m | U.S. Geological Survey, National Land Cover Dataset, Digital data, accessed, January 2010 at |
| Elevation | National Elevation Dataset (NED) | Raster | Raster | 10 m | U.S. Geological Survey, National Land Cover Dataset, Digital data, accessed, January 2010 at |
| Slope | NED | Raster | Raster | 10 m | U.S. Geological Survey, National Land Cover Dataset, Digital data, accessed, January 2010 at |
| Roads | U.S. Census Bureau Tiger | Vector | Vector | 1:100,000 | U.S. Census Bureau, TIGER line data, Digital data, accessed, January 2010 at |
| Population density | U.S. Census Bureau | Vector | Raster | 30 m | U.S. Census Bureau, Census 2000, Digital data, accessed, January 2010 at |
| Precipitation and temperature | Oregon State University PRISM | Raster | Raster | 30 arc-seconds | Parameter-elevation relationships on Independent Slopes Model Group, Oregon State University, Precipitation and Temperature data for the USA, digital data, accessed, January 2010 at |
| Dams | National inventory of dams | Vector | Vector | Various | U.S. Army Corps of Engineers, National Inventory of Dams, digital data, not publicly available |