| Literature DB >> 31741781 |
Jared E Siegel1,2, Carol J Volk2.
Abstract
Spatial and temporal patterns in stream temperature are primary factors determining species composition, diversity and productivity in stream ecosystems. The availability of spatially and temporally continuous estimates of stream temperature would improve the ability of biologists to fully explore the effects of stream temperature on biota. Most statistical stream temperature modeling techniques are limited in their ability to account for the influence of variables changing across spatial and temporal gradients. We identified and described important interactions between climate and spatial variables that approximate mechanistic controls on spatiotemporal patterns in stream temperature. With identified relationships we formed models to generate reach-scale basin-wide spatially and temporally continuous predictions of daily mean stream temperature in four Columbia River tributaries watersheds of the Pacific Northwest, USA. Models were validated with a testing dataset composed of completely distinct sites and measurements from different years. While some patterns in residuals remained, testing dataset predictions of selected models demonstrated high accuracy and precision (averaged RMSE for each watershed ranged from 0.85-1.54 °C) and was only 17% higher on average than training dataset prediction error. Aggregating daily predictions to monthly predictions of mean stream temperature reduced prediction error by an average of 23%. The accuracy of predictions was largely consistent across diverse climate years, demonstrating the ability of the models to capture the influences of interannual climatic variability and extend predictions to timeframes with limited temperature logger data. Results suggest that the inclusion of a range of interactions between spatial and climatic variables can approximate dynamic mechanistic controls on stream temperatures. ©2019 Siegel and Volk.Entities:
Keywords: Autocorrelation; Climate change; GAM; Interaction; Salmon; Spatiotemporal; Statistical model; Stream temperature
Year: 2019 PMID: 31741781 PMCID: PMC6857678 DOI: 10.7717/peerj.7892
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Map of study basins and environmental data collection sites.
Maps of study watersheds showing the stream logger sites used for model training and model testing (validation) datasets for the Wenatchee/Chiwawa (A), the M.F. John Day (B), and the Tucannon (C). The location of stream gages and climatic stations where data for environmental covariates were collected are also shown. Due to limitations in the spatial distribution of site coverage in the M.F. John Day and the Tucannon, the modeled stream networks were restricted to the upper basin and mainstem channel respectively.
Figure 2Model fit and validation years for each study basin.
Representation of study methodology: Seasonal models (spring and fall) were fit across all years of data in training datasets (2012–2017). Models were subsequently used to predict daily data in the testing datasets (1997–2011) in the Wenatchee, Chiwawa, M.F. John Day, and Tucannon river basins.
Dataset characteristics table.
Characteristics of model fitting and testing datasets shown for entire datasets and by month. The average temporal representation of sites varied across rivers and datasets. Effective coverage is the average percent of days with logger coverage for all sites within the respective time series (data days/[sites*years*365]). Data days by month is from all sites across all years. While data in fitting datasets were relatively evenly distributed across the year, data in validation datasets disproportionately cover summer months, particularly in the M.F. John Day and the Tucannon.
| Training datasets | Testing datasets | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Basin | Wenatchee | Chiwawa | M.F.J.D. | Tucannon | Wenatchee | Chiwawa | M.F.J.D. | Tucannon | |
| year range | 2012–2017 | 2012–2017 | 2012–2017 | 2012–2017 | 2003–2011 | 2003–2011 | 1997–2011 | 2001–2011 | |
| sites | 40 | 11 | 45 | 41 | 36 | 3 | 179 | 21 | |
| sites per year (avg.) | 28 | 8 | 22 | 34 | 20 | 3 | 34 | 13 | |
| effective coverage (%) | 52% | 56% | 33% | 67% | 42% | 63% | 16% | 55% | |
| data days | 45,629 | 13,479 | 32,407 | 59,953 | 32,929 | 4,171 | 63,561 | 25,072 | |
| data days per year (avg.) | 7,605 | 2,247 | 5,401 | 9,992 | 3,659 | 463 | 4,889 | 2,279 | |
| data days by month | |||||||||
| Jan. | 4,051 | 1,209 | 2,493 | 5,052 | 1,862 | 232 | 434 | 514 | |
| Feb. | 3,585 | 1,080 | 2,237 | 4,609 | 1,686 | 202 | 392 | 477 | |
| Mar. | 4,115 | 1,209 | 2,506 | 5,028 | 1,972 | 235 | 365 | 527 | |
| Apr. | 4,031 | 1,170 | 2,473 | 4,803 | 1,978 | 237 | 713 | 531 | |
| May | 4,158 | 1,219 | 2,587 | 4,962 | 2,103 | 238 | 2,325 | 2,886 | |
| Jun. | 3,988 | 1,135 | 2,777 | 4,757 | 2,294 | 242 | 8,115 | 3,590 | |
| Jul. | 3,640 | 1,109 | 3,170 | 4,997 | 3,779 | 513 | 14,367 | 4,128 | |
| Aug. | 3,747 | 1,176 | 3,369 | 5,517 | 5,544 | 806 | 15,078 | 4,278 | |
| Sep. | 3,214 | 937 | 3,021 | 5,376 | 4,928 | 729 | 13,364 | 3,973 | |
| Oct. | 3,866 | 1,100 | 2,907 | 5,361 | 3,020 | 349 | 5,897 | 3,250 | |
| Nov. | 3,575 | 1,050 | 2,412 | 4,681 | 1,901 | 190 | 1,986 | 462 | |
| Dec. | 3,659 | 1,085 | 2,455 | 4,810 | 1,862 | 198 | 525 | 456 | |
Table showing the variables considered for stream temperature models.
Temporal (A) and spatial (B) variables considered as covariates in model selection with description of variable calculation, spatial and temporal characteristics, and rationale for inclusion.
| Variable | Description | Spatial | Temporal | Rationale |
|---|---|---|---|---|
| Day of year (days) | NA | 1 day means | Accounts for seasonal changes in length of days and solar angle | |
| Average air temperature from 3 day period before predicted day (°C) | From one point at headwaters | 5 day means | The influence of air temperature on stream temperature accumulates over time | |
| Average air temperature from 5 day period before predicted day (°C) | From one point at headwaters | 3 day means | The influence of air temperature on stream temperature accumulates over time | |
| Difference between utilized averaged temperature variable (T3 | From one point at headwaters | 1 day means | Air temperature effects temperature in real time | |
| Snowpack depth | From one point at headwaters | 1 day means | More snowpack contributes colder water to streamflow | |
| April 1st snowpack depth (cm) | From one point at headwaters | 1 day means | Magnitude of late snowpack has prolonged effect on stream temperature into the summer (delayed discharge, riparian growth) | |
| Flow at USGS gage (m3/s) | From one point near mouth | 1 day means | Higher discharge creates more insulation against atmospheric influences. Seasonally different relationship (cooling in summer, warming in winter) | |
| Average elevation of catchment area (m) | Summarized by catchment area of the stream reach | NA | Catchments with higher terrain will have cooler streams even if the site is at a lower elevation | |
| Difference between | Summarized by catchment area of the stream reach | NA | Higher elevation sites experience cooler air temperatures and are closer to cooler headwaters | |
| Catchment area of site (km2) | Summarized by catchment area of the stream reach | NA | Sites further from headwaters have more time to be effected by atmospheric temperatures | |
| Estimated Base flow index (mean low flow ÷ mean annual discharge) | Summarized by catchment area of the stream reach | NA | Areas with higher groundwater influence will have mitigated stream temperatures (lower highs and higher lows). Developed by | |
| Percentage of catchment covered by lakes | Summarized by catchment area of the stream reach | NA | Lakes slow down water leading to increased atmospheric warming | |
| Slope of stream reach | Summarized by stream reach | NA | Steeper streams move cooler water downstream faster | |
| Forest cover percentage of reach contributing area ( | Summarized by reach contributing area and catchment area of the stream reach | NA | Forested areas provide more stream shading and retention of moisture/snowpack | |
Table showing the variables and interactions utilized for each model and fitting/testing statistics.
Model selection table showing retained covariate (Cov.) and interactions (Int.) in GAM and linear models for the Wenatchee, Chiwawa, M.F. John Day and Tucannon watersheds (A) and fitting and model prediction statistics for selected models (B). Retained variables are represented by grey boxes. Glaciers and lakes are minimally present or not present in the M.F. John Day and the Tucannon watersheds and thus were not considered as a covariates. Knots (K) were limited in GAM model fitting to reduce risk of overfitting. Selected model AIC values, as well as model fit and testing dataset prediction metrics, are provided including the root mean squared error (RMSE) and Nash-Stutclife Coeficient (NSC). Statistics are shown for the model fit (Train), cross validation (C.V.), and testing dataset predictions (Test).
| Cov. | Int. | K | Wenatchee | Chiwawa | M.F.J.D | Tucannon | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Spring | Fall | Spring | Fall | Spring | Fall | Spring | Fall | |||||||||||
| GAM | Lin. | GAM | Lin. | GAM | Lin. | GAM | Lin. | GAM | Lin. | GAM | Lin. | GAM | Lin. | GAM | Lin. | |||
| 5 | ||||||||||||||||||
| 5 | ||||||||||||||||||
| 4 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | ||||||||||||||||||
| 3 | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
| Terms | 14 | 27 | 12 | 24 | 12 | 22 | 11 | 22 | 13 | 21 | 12 | 19 | 11 | 15 | 8 | 13 | ||
| Δ | 0 | 1426 | 0 | 954 | 0 | 334 | 0 | 297 | 0 | 4256 | 0 | 884 | 0 | 4391 | 0 | 3337 | ||
| 0.95 | 1.12 | 0.97 | 1.15 | 0.62 | 0.63 | 0.78 | 0.81 | 1.11 | 1.42 | 1.27 | 1.46 | 0.85 | 0.91 | 1.08 | 1.15 | |||
| 1.09 | 1.06 | 1.35 | 1.35 | 0.67 | 1.17 | 0.84 | 1.10 | 1.30 | 1.44 | 1.54 | 1.53 | 0.87 | 0.92 | 1.09 | 1.16 | |||
| 1.18 | 1.30 | 1.32 | 1.44 | 0.70 | 0.78 | 0.93 | 0.89 | 1.52 | 1.81 | 1.56 | 1.80 | 0.88 | 1.01 | 1.00 | 1.01 | |||
| 0.96 | 0.95 | 0.95 | 0.95 | 0.98 | 0.97 | 0.95 | 0.96 | 0.96 | 0.95 | 0.94 | 0.94 | 0.97 | 0.96 | 0.95 | 0.95 | |||
| 0.94 | 0.93 | 0.93 | 0.91 | 0.98 | 0.97 | 0.95 | 0.96 | 0.90 | 0.85 | 0.89 | 0.85 | 0.97 | 0.96 | 0.95 | 0.95 | |||
Figure 3Example of interaction effect on stream temperature (flow and air temperature).
Conditional surface plots showing the modeled effects of averaged air temperature variables (T5a and T3a) interacting with flow (F) on stream temperature (Tw) for the best spring (A–D) and fall models (E–F) from each of the study watersheds. This effect was not retained in the M.F. John Day and the Tucannon fall models. Relationships are presented holding all other variables in models at median values.
Figure 4Model validation prediction accuracy results.
Graphs showing the accuracy of the best validation predictions (linear or GAM) for combined fall and spring models versus measured daily (A–D) and monthly (E–H) measurements of stream temperature. Testing dataset prediction statistics are shown for each watershed including the root mean squared error (RMSE Test), the mean absolute error (MAE), and the Nash-Sutcliffe Coefficient (NSC Test).
Figure 5Prediction accuracy by year.
Graphs showing daily testing dataset model predictions versus measured stream temperature by year in the Wenatchee (A–I), Chiwawa (J–R), M.F. John Day (S–JJ) and Tucannon (HH–RR) watersheds. The root mean squared error for the testing datasets predictions by year (RMSE Test) are also shown.
Monthly prediction statistics table.
Testing dataset prediction error (RMSE) and goodness of fit measures (NSC) for selected GAM and linear models shown for aggregated monthly predictions for the Wenatchee, Chiwawa, M.F. John Day and Tucannon watersheds respectively.
| Wenatchee | Chiwawa | M.F.J.D. | Tucannon | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RMSE Test | NSC Test | RMSE Test | NSC Test | RMSE Test | NSC Test | RMSE Test | NSC Test | |||||||||
| Month | GAM | Lin | GAM | Lin | GAM | Lin | GAM | Lin | GAM | Lin | GAM | Lin | GAM | Lin | GAM | Lin |
| January | 0.81 | 1.01 | 0.35 | 0.00 | 0.15 | 0.21 | 0.75 | 0.52 | 0.58 | 0.71 | −0.09 | −0.66 | 0.51 | 0.69 | 0.63 | 0.32 |
| February | 0.58 | 0.69 | 0.50 | 0.29 | 0.35 | 0.34 | 0.42 | 0.47 | 0.58 | 0.90 | −0.11 | −1.65 | 0.46 | 0.51 | 0.78 | 0.72 |
| March | 0.68 | 0.65 | 0.55 | 0.58 | 0.57 | 0.31 | −0.47 | 0.57 | 0.79 | 0.92 | 0.19 | −0.09 | 0.57 | 0.68 | 0.75 | 0.65 |
| April | 0.77 | 0.80 | 0.66 | 0.64 | 0.45 | 0.38 | 0.00 | 0.29 | 1.19 | 1.31 | 0.50 | 0.40 | 0.90 | 0.89 | 0.80 | 0.80 |
| May | 0.89 | 0.94 | 0.76 | 0.73 | 0.25 | 0.28 | 0.94 | 0.93 | 1.31 | 1.46 | 0.61 | 0.52 | 0.75 | 0.74 | 0.91 | 0.92 |
| June | 1.16 | 1.20 | 0.80 | 0.78 | 0.58 | 0.72 | 0.94 | 0.90 | 1.28 | 1.56 | 0.78 | 0.68 | 0.60 | 0.64 | 0.96 | 0.96 |
| July | 1.28 | 1.46 | 0.78 | 0.71 | 0.70 | 0.91 | 0.90 | 0.83 | 1.36 | 1.73 | 0.74 | 0.57 | 0.68 | 0.70 | 0.96 | 0.96 |
| August | 1.40 | 1.47 | 0.69 | 0.66 | 0.87 | 0.78 | 0.74 | 0.79 | 1.38 | 1.71 | 0.64 | 0.44 | 0.73 | 0.72 | 0.95 | 0.96 |
| September | 0.90 | 1.00 | 0.77 | 0.72 | 0.46 | 0.41 | 0.76 | 0.81 | 1.09 | 1.45 | 0.70 | 0.47 | 0.60 | 0.58 | 0.95 | 0.96 |
| October | 0.82 | 0.92 | 0.77 | 0.71 | 0.44 | 0.43 | 0.86 | 0.87 | 1.15 | 1.31 | 0.63 | 0.52 | 0.77 | 0.78 | 0.88 | 0.87 |
| November | 0.99 | 1.37 | 0.78 | 0.57 | 0.87 | 0.70 | −6.09 | −3.52 | 1.28 | 1.46 | 0.08 | −0.19 | 0.71 | 0.95 | 0.85 | 0.74 |
| December | 1.00 | 1.19 | 0.59 | 0.42 | 1.02 | 0.38 | −8.62 | −0.32 | 1.50 | 1.59 | −3.02 | −3.56 | 0.91 | 1.13 | −0.33 | −1.03 |
| Year | 1.03 | 1.15 | 0.95 | 0.94 | 0.62 | 0.60 | 0.98 | 0.98 | 1.26 | 1.56 | 0.92 | 0.88 | 0.70 | 0.72 | 0.98 | 0.98 |
Figure 6The ability of models to distinguish between years by month.
Monthly aggregated testing dataset model predictions from selected GAM models shown versus measured temperatures by month for the two sites with the Wenatchee with the most continuous time series of data (located in Nason Creek [red circles] and Peshastin Creek [blue triangles]). Note, the range of axes vary by month to better show variability between predictions and measurements.
Figure 7Example of accuracy and precision for average sites.
Measured stream temperature, predicted values from selected GAM models, and residual error for the entire time series of single sites selected from the Wenatchee River training dataset (A, Site WC503432-000155, Peshastin Creek) and testing dataset (B, Site #219, Nason Creek). These sites were chosen for display due to having near median values of RMSE out of all sites in the respective datasets and largely continuous timeseries. Horizonal dashed lines presented at −2 and 2 °C to provide guidance on size of residuals.
Sensitivity analysis for the number of sites and years in fitting datasets.
Table showing the error of testing dataset predictions (RMSE Test) in model sensitivity analyses varying the number of years (A) and number of sites (B) utilized in the training datasets independently. RMSE Test was calculated from 100 iterations of randomly chosen sites/years out of all available for each sensitivity scenario (or all combinations if there were fewer than 100). The percent increase in prediction error from models fit with all data are shown in parenthesis. Model selection was not repeated for each scenario, and thus the same model formulas for each watershed was used for all scenario iterations. Note, sites have varying temporal coverage out of the entire time series of the fitting dataset and thus the removal of different sites/years represents different quantities of data. Percentage given for each watershed represents the average effective coverage percentage for each site in the fitting datasets (data days/[sites*years*365]).
| Wenatchee (52%) | Chiwawa (56%) | M.F. John Day (33%) | Tucannon (67%) | |||||
|---|---|---|---|---|---|---|---|---|
| GAM | Linear | GAM | Linear | GAM | Linear | GAM | Linear | |
| 6 (All) | 1.26 | 1.37 | 0.85 | 0.85 | 1.55 | 1.80 | 0.95 | 0.98 |
| 5 | 1.28 (2%) | 1.38 (1%) | 0.88 (3%) | 0.85 (1%) | 1.68 (8%) | 1.88 (4%) | 0.96 (1%) | 0.99 (1%) |
| 4 | 2.13 (70%) | 1.40 (2%) | 1.33 (55%) | 0.88 (4%) | 1.94 (10%) | 1.97 (10%) | 0.99 (4%) | 1.01 (4%) |
| 3 | 2.53 (101%) | 1.68 (22%) | 2.71 (217%) | 1.03 (22%) | 2.33 (51%) | 2.07 (15%) | 1.23 (30%) | 1.09 (11%) |
| All | 1.26 | 1.37 | 1.55 | 1.80 | 0.95 | 0.98 | ||
| 35 | 1.32 (5%) | 1.39 (1%) | 1.68 (9%) | 1.83 (2%) | 0.95 (0%) | 0.98 (0%) | ||
| 30 | 1.42 (14%) | 1.41 (3%) | 1.79 (16%) | 1.86 (3%) | 0.95 (0%) | 0.98 (0%) | ||
| 25 | 1.63 (30%) | 1.45 (6%) | 1.94 (25%) | 1.88 (5%) | 0.96 (2%) | 1.00 (2%) | ||
| 20 | 1.88 (50%) | 1.54 (12%) | 2.48 (60%) | 1.97 (10%) | 0.97 (2%) | 1.01 (3%) | ||
| 15 | 2.58 (106%) | 1.72 (25%) | 3.14 (103%) | 2.11 (17%) | 1.06 (12%) | 1.07 (9%) | ||
| 10 | 6.30 (401%) | 2.81 (104%) | 9.47 (511%) | 3.07 (70%) | 1.33 (40%) | 1.26 (30%) | ||