| Literature DB >> 30747915 |
Ryan A McManamay1, Christopher R DeRolph1.
Abstract
Stream classifications are important for understanding stream ecosystem diversity while also serving as tools for aquatic conservation and management. With current rates of land and riverscape modification within the United States (US), a comprehensive inventory and evaluation of naturally occurring stream habitats is needed, as this provides a physical template upon which stream biodiversity is organized and maintained. To adequately represent the heterogeneity of stream ecosystems, such a classification needs to be spatially extensive where multiple stream habitat components are represented at the highest resolution possible. Herein, we present a multi-layered empirically-driven stream classification system for the conterminous US, constructed from over 2.6 million stream reaches within the NHDPlus V2 stream network. The classification is based on emergent natural variation in six habitat layers meaningful at the stream-reach resolution: size, gradient, hydrology, temperature, network bifurcation, and valley confinement. To support flexibility of use, we provide multiple alternative approaches to developing classes and report uncertainty in classes assigned to stream reaches. The stream classification and underlying data provide valuable resources for stream conservation and research.Entities:
Year: 2019 PMID: 30747915 PMCID: PMC6371895 DOI: 10.1038/sdata.2019.17
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Datasets used in developing the US stream classification system.
| Dataset | Theme | Description | Spatial Resolution | Temporal Resolution |
|---|---|---|---|---|
| Pred = Predictor; Dist = Disturbance; Hyd = Hydrology; Temp = Temperature; Bif = Network Bifurcation; ValC = Valley Confinement aNHDPlus V2: | ||||
| NHDPlus V2a | Multiple | Hydrography of streamflow lines, waterbodies, and associated attributes summarized for local catchments at 1:100 k resolution | Stream Reaches, Catchments | NA |
| StreamCatb | Pred; Dist | Large suite of variables summarized in for NHDPlus V2 local catchments and upstream networks | Stream Reaches, Catchments | Decadal; Discrete Annual |
| WorldClimc | Pred | Bioclimatic variables summarized at seasonal periods from 30-yr normals (1970–2000) | 1 km2 | Decadal averages |
| USGS NWISd | Hyd; Temp | Measurements of discharge (daily) and water temperature (daily and field grap samples) from ~1950 to 2017 | Point | Daily; Grab Samples |
| StreamNete | Temp | Daily water temperature data from deployable loggers (~1995–2000) | Point | Daily |
| Hill | Temp | July–August averages of daily water temperature (1999–2008) | Point | Seasonal |
| Deweber and Wagner[ | Temp | July–August averages of daily water temperature (2002 to 2010) | Point | Seasonal |
| Wieferich | Bif | Common identifiers of reaches belonging to the same hydrologically meaningful unit. Identifiers are used to correct unmeaningful junctions, primarily resulting from intersections of reaches with quadrangle map boundaries | Stream Reaches | NA |
| NED 30-mf | ValC | National Elevation Dataset (30 m) | 30 m grid | NA |
| NRSAg | ValC | US Environmental Protection Agency (EPA) National Rivers and Streams Assessment (NRSA). Provides river width in habitat assessments of sites. | Point | NA |
| Downing | ValC | Assemblage of river width information from published literature | Point | NA |
| NARWidth v0.1h | ValC | Landsat‐derived dataset of river widths in North America for >2.4 × 105 km of rivers wider than 30 m | River Segments | NA |
| PRISM 30 y Normalsi | ValC | Gridded spatial datasets of average historical climate conditions, in this case, precipitation | 800 m grid | 30-year mean |
| NFHP[ | Dist | National Fish Habitat Partnership assessment of habitat degradation within NHDPlus V1 catchments | Stream Catchments | NA |
Predictor variables and their sources assembled for random forest models.
| Variable Name | Description | Source |
|---|---|---|
| For information on sources, please see footnotes of | ||
| WsAreaSqKm | Watershed area (square km) at NHDPlus stream segment outlet, i.e., at the most downstream location of the vector line segment | StreamCat |
| ElevWs | Mean watershed elevation (m) | StreamCat |
| StreamOrde | Modified Strahler Stream Order | NHDPlusV2 |
| ArbolateSu | Arbolate Sum - Kilometers of stream upstream of the bottom of the NHDFlowline feature | NHDPlusV2 |
| slopeAvgWs∗ | Mean upstream channel slopes | NHDPlusV2 |
| elevWsRng | Watershed range in elevation | NHDPlusV2 |
| PrecipWs | PRISM climate data - 30-year normal mean precipitation (mm): Annual period: 1981-2010 within the watershed | StreamCat |
| TminWs∗ | PRISM climate data - 30-year normal minimum temperature (C°): Annual period: 1981-2010 within the watershed | StreamCat |
| TmeanWs | PRISM climate data - 30-year normal mean temperature (C°): Annual period: 1981-2010 within the watershed | StreamCat |
| TmaxWs | PRISM climate data - 30-year normal maximum temperature (C°): Annual period: 1981-2010 within the watershed | StreamCat |
| PrecipVC | Mean annual precipitation in area upstream of the bottom of flowline in millimeters ∗ 100 for the 1971 to 2000 time period | NHDPlusV2 |
| TempVC | Mean annual temperature in area upstream of the bottom of flowline in degrees centigrade ∗ 100 for the 1971 to 2000 time period | NHDPlusV2 |
| bcm10AvgWs | Mean temperature of warmest quarter | WorldClim |
| PET0001 | Potential Evapotranspiration (mm) in catchment | NHDPlusV2 |
| PctWaterWs | % of watershed area classified as as lithology type: water | StreamCat |
| PctDecid2011Ws | % of watershed area classified as deciduous forest land cover (NLCD 2011 class 41) | StreamCat |
| PctConif2011Ws | % of watershed area classified as evergreen forest land cover (NLCD 2011 class 42) | StreamCat |
| PctMxFst2011Ws | % of watershed area classified as mixed deciduous/evergreen forest land cover (NLCD 2011 class 43) | StreamCat |
| PctOw2011Ws | % of watershed area classified as open water land cover (NLCD 2011 class 11) | StreamCat |
| PctIce2011Ws | % of watershed area classified as ice/snow land cover (NLCD 2011 class 12) | StreamCat |
| PctHbWet2011Ws | % of watershed area classified as herbaceous wetland land cover (NLCD 2011 class 95) | StreamCat |
| PctWdWet2011Ws | % of watershed area classified as woody wetland land cover (NLCD 2011 class 90) | StreamCat |
| PctShrb2011Ws | % of watershed area classified as shrub/scrub land cover (NLCD 2011 class 52) | StreamCat |
| PctGrs2011Ws | % of watershed area classified as grassland/herbaceous land cover (NLCD 2011 class 71) | StreamCat |
| WtDepWs | Mean seasonal water table depth (cm) of soils (STATSGO) within watershed | StreamCat |
| OmWs | Mean organic matter content (% by weight) of soils (STATSGO) within watershed | StreamCat |
| PermWs | Mean permeability (cm/hour) of soils (STATSGO) within watershed | StreamCat |
| RckDepWs | Mean depth (cm) to bedrock of soils (STATSGO) within watershed | StreamCat |
| ClayWs | Mean % clay content of soils (STATSGO) within watershed | StreamCat |
| SandWs | Mean % sand content of soils (STATSGO) within watershed | StreamCat |
| PctCarbResidWs | % of watershed area classified as as lithology type: carbonate residual material | StreamCat |
| PctNonCarbResidWs | % of watershed area classified as as lithology type: non-carbonate residual material | StreamCat |
| PctAlkIntruVolWs | % of watershed area classified as as lithology type: alkaline intrusive volcanic rock | StreamCat |
| PctSilicicWs | % of watershed area classified as as lithology type: silicic residual material | StreamCat |
| PctExtruVolWs | % of watershed area classified as as lithology type: extrusive volcanic rock | StreamCat |
| PctColluvSedWs | % of watershed area classified as as lithology type: colluvial sediment | StreamCat |
| PctGlacTilClayWs | % of watershed area classified as as lithology type: glacial till, clayey | StreamCat |
| PctGlacTilLoamWs | % of watershed area classified as as lithology type: glacial till, loamy | StreamCat |
| PctGlacTilCrsWs | % of watershed area classified as as lithology type: glacial till, coarse-textured | StreamCat |
| PctGlacLakeCrsWs | % of watershed area classified as as lithology type: glacial outwash and glacial lake sediment, coarse-textured | StreamCat |
| PctGlacLakeFineWs | % of watershed area classified as as lithology type: glacial lake sediment, fine-textured | StreamCat |
| PctHydricWs | % of watershed area classified as as lithology type: hydric, peat and muck | StreamCat |
| PctEolCrsWs | % of watershed area classified as as lithology type: eolian sediment, coarse-textured (sand dunes) | StreamCat |
| PctEolFineWs | % of watershed area classified as as lithology type: eolian sediment, fine-textured (glacial loess) | StreamCat |
| PctSalLakeWs | % of watershed area classified as as lithology type: saline like sediment | StreamCat |
| PctAlluvCoastWs | % of watershed area classified as as lithology type: alluvium and fine-textured coastal zone sediment | StreamCat |
| PctCoastCrsWs | % of watershed area classified as as lithology type: coastal zone sediment, coarse-textured | StreamCat |
| KffactWs | The Kf factor is used in the Universal Soil Loss Equation (USLE) and represents a relative index of susceptibility of bare, cultivated soil to particle detachment and transport by rainfall within watershed | StreamCat |
| HydrlCondWs | Mean lithological hydraulic conductivity (micrometers per second) content in surface or near surface geology within watershed | StreamCat |
| CompStrgthWs | Mean lithological uniaxial compressive strength (megaPascals) content in surface or near surface geology within watershed | StreamCat |
| RunoffWs | Mean runoff (mm) within watershed | StreamCat |
| BFIWs | Base flow is the component of streamflow that can be attributed to ground-water discharge into streams. The BFI is the ratio of base flow to total flow, expressed as a percentage, within watershed | StreamCat |
| Q0001A | Mean Annual Flow (cfs) estimated from Enhanced Runoff Method (EROM), applies to monthly averages below | NHDPlusV2 |
| Q0001Ajan | Mean January flow (cfs) using EROM | NHDPlusV2 |
| Q0001Afeb | Mean Febuary flow (cfs) using EROM | NHDPlusV2 |
| Q0001Amar | Mean March flow (cfs) using EROM | NHDPlusV2 |
| Q0001Aapr | Mean April flow (cfs) using EROM | NHDPlusV2 |
| Q0001Amay | Mean May flow (cfs) using EROM | NHDPlusV2 |
| Q0001Ajun | Mean June flow (cfs) using EROM | NHDPlusV2 |
| Q0001Ajul | Mean July flow (cfs) using EROM | NHDPlusV2 |
| Q0001Aaug | Mean August flow (cfs) using EROM | NHDPlusV2 |
| Q0001Asep | Mean September flow (cfs) using EROM | NHDPlusV2 |
| Q0001Aoct | Mean October flow (cfs) using EROM | NHDPlusV2 |
| Q0001Anov | Mean November flow (cfs) using EROM | NHDPlusV2 |
| Q0001Adec | Mean December flow (cfs) using EROM | NHDPlusV2 |
| Qwsa | Q0001A divided by WsAreaSqKm | This study |
Figure 1Conceptual diagram of various scenarios of stream network bifurcation and divergence.
For each scenario, reaches are labeled according to their Strahler stream order. Bifurcation (Bif.) classes, divergence (Div.) classes, and the number of upstream and downstream reaches are noted. Naturally-occurring (i.e. meaningful) stream junctions are distinguished from non-meaningful stream reach junctions arising from quadrangle map boundaries. Scenarios include (a) a common, simplified stream junction, (b) a more complex junction with more than 2 upstream contributing reaches, (c) a situation of stream divergence, and (d) non-meaningful stream junctions arising from map boundaries. In the case of (d), reaches immediately occurring downstream of non-meaningful junctions are assigned to the same class as their upstream neighboring reach.
Datasets provided by the US Stream Classification System.
| Dataset | Classification Variables | Continuous/Nominal Variables |
|---|---|---|
| Size and Gradient | Size class; Gradient class | Strahler Stream Order, Mean Annual Discharge; Stream-reach slope |
| Hydrology | Hydrologic classes (Bayesian Gaussian Mixed Model and Ward’s Agglomerative Hydrologic classes) | Probabilities of class assignment from random forests |
| Temperature | Temperature Classes (Maheu | Probabilities of class assignment from random forests; Predicted July–August water temperature |
| Bifurcation network | Bifurcation Classes, Divergence Classes | Upstream Reach Count; Downstream reach count; Upstream orders; ecological unit identifier; flagged reaches (for unmeaningful junctions) |
| Valley Confinement | Valley confinement classes | Stream reach length (RL); catchment area; river width area (RWA); valley bottom area (VBA); valley bottom length (VBL); VBL:RL ratio; VBA:RWA ratio |
Figure 2Thresholds for determining partitions between size classes.
Box plots (upper and lower quantiles) of discharge according to stream order. Class breaks represent average values between corresponding medians for each stream order.
Thresholds used to partition classes based on univariate continuous data.
| Size | Range (m3s−1) | Gradient | Range (Rise/Run) | Avg Summer Temperature | Range (°C) |
|---|---|---|---|---|---|
| Headwater (HW) | 0–0.057 | Very Low (VL) | <0.001 | Very Cold (VC) | <10 |
| Small Creek (SC) | 0.057–0.283 | Low (L) | 0.001–0.005 | Cold (CD) | 10–15 |
| Large Creek (LC) | 0.283–1.133 | Moderate (M) | 0.005–0.02 | Cold-Cool (CC) | 15–18 |
| Small River (SR) | 1.133–5.663 | Moderate High (MH) | 0.02–0.04 | Cool (CL) | 18–21 |
| Medium River (MR) | 5.663–22.65 | High (H) | 0.04–0.1 | Cool-Warm (CW) | 21–24 |
| Mainstem (MS) | 22.65–70.79 | Steep (S) | >0.1 | Warm (W) | >24 |
| Large River (LR) | 70.79–283.2 | …. | …. | …. | …. |
| Great River (GR) | >283.2 | …. | …. | …. | …. |
Figure 3Size and gradient stream classes of the conterminous US.
(a) Eight size classes based on discharge values mapped to stream reaches. (b) Size gradient values mapped to stream reaches.
Gaussian mixed model hydrologic class names and their codes.
| Mixture Model Classes | Abbr. Code | Num. Code |
|---|---|---|
| Names and their geographic and hydrologic descriptions are provided by McManamay | ||
| Intermittent Flashy 1 | IF1 | 1 |
| Late Timing Runoff | LTR | 2 |
| Perennial Runoff 1 | PR1 | 3 |
| Perennial Runoff 2 | PR2 | 4 |
| Super Stable Groundwater | SSGW | 5 |
| Stable High Baseflow | SHBF | 6 |
| Intermittent Flashy SW | IFSW | 7 |
| Snowmelt 2 | SNM2 | 8 |
| Perennial Flashy | PF | 9 |
| Intermittent Flashy 2 | IF2 | 10 |
| Western Coastal Runoff | WCR | 11 |
| Stable High Runoff | SHR | 12 |
| Harsh Intermittent | HI | 13 |
| Snowmelt 1 | SNM1 | 14 |
| Glacial High Runoff | GHR | 15 |
Nested hierarchy of hydrologic classes developed using Ward’s agglomerative method.
| 2 clusters | 4 clusters | 8 clusters | 14 clusters | 30 clusters |
|---|---|---|---|---|
| Clustering was based on Principal Components reducing the dimensionality of hydrologic statistics summarized for discharge among 2512 US Geological Survey stream gages. | ||||
| 1. Low Baseflow (LBF) | 1. Perennial (P) | 1. Perennial Runoff (PR) | 1. Perennial Runoff 1 (PR1) | 1. Perennial Runoff 1 N (PR1N) |
| 4. Perennial Runoff 1 S (PR1S) | ||||
| 3. Perennial Runoff 2 (PR2) | 3. Perennial Runoff 2 W (PR2W) | |||
| 5. Perennial Runoff 2 E (PR2E) | ||||
| 4. Perennial Flashy (PF) | 5. Perennial Flashy (PF) | 7. Perennial Flashy 1 (PF1) | ||
| 11. Perennial Flashy 2 (PF2) | ||||
| 30. Perennial Flashy 3 (PF3) | ||||
| 8. Western Runoff (WR) | 14. Western Runoff (WR) | 29. Western Runoff 1 (WR1) | ||
| 28. Western Runoff 2 (WR2) | ||||
| 4. Intermittent (I) | 5. Intermittent (I) | 7. Unpredictable Intermittent (UI) | 19. Unpredictable Intermittent 1 (UI1) | |
| 9. Unpredictable Intermittent 2 (UI2) | ||||
| 8. Late Timing Intermittent (LTI) | 10. Late Timing Intermittent 1 (LTI1) | |||
| 22. Late Timing Intermittent 2 (LTI2) | ||||
| 10. Intermittent Flashy 1 (IF1) | 14. Intermittent Flashy 1 (IF1) | |||
| 12. Intermittent Flashy 2 (IF2) | 17. Intermittent Flashy 2 A (IF2A) | |||
| 21. Intermittent Flashy 2B (IF2B) | ||||
| 7. Intermittent SW (ISW) | 13. Intermittent SW (ISW) | 24. Intermittent SW A (IFSWA) | ||
| 26. Intermittent SW B (IFSWB) | ||||
| 18. Intermittent SW C (IFSWC) | ||||
| 23. Intermittent SW D (IFSWD) | ||||
| 2. High Baseflow (HBF) | 2. Snowmelt (SNM) | 2. Snowmelt 1 (SNM1) | 2. Snowmelt 1 (SNM1) | 2. Snowmelt 1 A (SNM1A) |
| 25. Snowmelt 1B (SNM1B) | ||||
| 6. Snowmelt 2 (SNM2) | 11. Snowmelt 2 (SNM2) | 16. Snowmelt 2 A (SNM2A) | ||
| 20. Snowmelt 2B (SNM2B) | ||||
| 15. Glacial Snowmelt (GSNM) | ||||
| 3. Stable Baseflow (SBF) | 3. Stable Baseflow (SBF) | 4. Stable High Baseflow (SHBF) | 6. Stable High Baseflow (SHBF) | |
| 27. Western Stable High Baseflow (WSHBF) | ||||
| 6. Super Stable GW 1 (SSGW1) | 8. Super Stable GW 1 (SSGW1) | |||
| 9. Super Stable GW 2 (SSGW2) | 12. Super Stable GW 2A (SSGW2A) | |||
| 13. Super Stable GW 2B (SSGW2B) | ||||
Accuracies, cross-validation error rates, and propabilities for random forest models predicting hydrologic and temperature classes.
| Cluster approach | Theme | Classes (clusters) | N obs | OOB error rate (%) | Classification accuracy (%) | Median Prob. |
|---|---|---|---|---|---|---|
| OOB = out-of-bag error rate, i.e. cross validation error rate. Median Prob. = Median probability of predominant class assignment to all reaches compared to expected probabilities (in parentheses). | ||||||
| Gaussian Mixture Model Bayesian | HYDR | 15 | 2512 | 23.2 | 76.8 | 0.43 (0.07) |
| Ward’s agglomerative | HYDR | 2 | 2512 | 4.94 | 95.1 | 0.91 (0.50) |
| Ward’s agglomerative | HYDR | 4 | 2512 | 9.12 | 90.9 | 0.73 (0.25) |
| Ward’s agglomerative | HYDR | 8 | 2512 | 17.0 | 83.0 | 0.58 (0.13) |
| Ward’s agglomerative | HYDR | 14 | 2512 | 24.6 | 75.4 | 0.46 (0.07) |
| Ward’s agglomerative | HYDR | 30 | 2512 | 33.8 | 66.2 | 0.34 (0.03) |
| Maheu | TEMP | 6 | 135 | 28.2 | 71.9 | 0.45 (0.17) |
Figure 4Importance of different predictors used in random forest models.
Random forests were used to predict (a) hydrologic classes, (b) temperature classes or average July—August temperature, or (c) river width. Normalized importance refers to node impurity values for the Gini index (classification) or mean-squared error (regression) that are scaled from 0 to 1 using (max – xi)/(max (x) – min(x)). Normalized importance was averaged across all random forest models for hydrologic classes and temperature. Error bars represent 1 SE. Note: only 1 random forest model was developed for river width.
Figure 5Maps of hydrologic classes assigned to stream reaches in the conterminous US.
Different clustering approaches used developed hydrologic classes at stream gauges (points) were mapped to stream reaches including: (a) Fifteen gaussian mixture model classes, and several Ward’s agglomerative clustering solutions for (b) two, (c) four, (d) eight, (e) fourteen, and (f) thirty classes. Acronyms for classes are described in Tables 5 and 6.
Figure 6Temperature classes within stream reaches of the conterminous US.
Temperature classes were mapped to stream reaches using (a) Maheu et al.[3] thermal regime classes developed for stream gages (points), and (b) average July–August temperature values taken from multiple datasets (points). Inset in panel b shows the level of detail within continuous values of July–August temperature (°C) underlying the classification.
Figure 7Thresholds for determining partitions between temperature classes.
Goodness-of-fit and tabular accuracy for different numbers of temperature clusters using Jenks method.
Figure 8Network Bifurcation and valley confinement of stream reaches of the conterminous US.
(a) Information used to develop network bifurcation classes in streams included the number of upstream reaches contributing to each reach and divergences in channels. Mapping bifurcation classes was impractical as there are 348 different types. (b) Valley confinement classes mapped to stream reaches. Inset provides example of valley bottoms underlaying streams of varying widths.